key: cord-0765761-nqawyd0f authors: Pokhrel, S.; Lee, L.; Kraemer, B.R.; Samardzic, K.; Mochly-Rosen, D. title: Increased elastase sensitivity and decreased intramolecular interactions in the more transmissible SARS-CoV-2 variants’ spike protein: Analysis of the new UK and SA SARS-CoV-2 variants date: 2021-01-20 journal: bioRxiv DOI: 10.1101/2021.01.19.427355 sha: 15845e0b52a8392a3c20e8bd492957695ff1ff56 doc_id: 765761 cord_uid: nqawyd0f Two SARS-CoV-2 variants showing increased transmissibility relative to the Wuhan virus have recently been identified. Although neither variant causes more severe illness or increased risk of death, the faster spread of the virus is a major threat. Using computational tools, we found that the new SARS-CoV-2 variants may acquire an increased transmissibility by increasing the propensity of its spike protein to expose the receptor binding domain. This information leads to the identification of potential treatments to avert the imminent threat of these more transmittable SARS-CoV-2 variants. Teaser The more infective SARS-CoV-2 variants may expose its Achilles Heel – an opportunity to reduce their spreading. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the novel coronavirus that has resulted in the coronavirus disease 2019 (COVID- 19) pandemic, continues to mutate while spreading throughout the world. A critical protein on the surface of the virus is the spike protein, that mediates the entry of the virus into the host cells (1). When a mutation or set of mutations provide an advantage over the previous variants, the new variant becomes the dominant variant to spread. This is what occurred when the aspartate 614 in the spike protein of the initial (Wuhan) SARS-CoV-2 virus, mutated to a glycine (Asn614Gly); within a month, the new variant became the most dominant in Europe (2) and now globally (3) . The Asn614Gly variant is associated with higher viral loads, but not with a more severe COVID-19 symptoms in infected individuals (4) . Importantly, the Asn614Gly mutation does not affect the ability of antibodies to neutralize that variant (5) . Nevertheless, as the virus continues to mutate at a very high rate, [for example, each amino acid in the 1273 amino acid of the viral spike protein has mutated on average 4 times since the protein was first sequenced, about a year ago (6) ], the need to continue active surveillance for the emergence of new variants and examination of means to slow down the spread of the new variants is of utmost importance. A novel SARS-CoV-2 variant, known as B.1.1.7 or 20B/501Y.V1, emerged in the United Kingdom at the end of September 2020 and became the dominant variant within a month (7) . This variant increases viral transmissibility between 40-70% (8) . As of January 16 2021, 501Y.V1 has been reported in more than 50 countries (9) . However, this new variant does not appear to increase the severity of COVID-19 (10) . The variant has concomitant 3 deletions and 10 amino acid changes in the 1273 amino acid spike protein (Table 1a) , compared with the initial SARS-CoV-2 index virus identified in Wuhan, China; only one (Asn501Tyr or N501Y) is in the angiotensin-converting enzyme 2 (ACE2) receptor-binding domain (RBD; amino acids 331-524) (11) . Since this Asn501Tyr mutation in the spike's RBD was observed alone as early as April 2020 in Brazil and later in Australia without reports of increased transmissibility, (12) , it is unlikely that this single substitution is sufficient to explain the new phenotype of B. (2)], the Asp80Ala, Lys417Asn, Glu484Lys and Ala701Val. It appears that the combination of these substitutions results in increased infectivity without increasing COVID-19 severity (13). Here we focused on mutations in the SARS-CoV-2 viral surface spike protein to examine how the new variant became more infective. We used computational methods, including Molecular Operating Environment (MOE) analysis (14) and software to predict the outcome of substitutions on the protein structure, to examine the features acquired by the new variants that enable them to increase the rate of infection and spreading without increasing the severity of COVID-19, the pathology resulting from the infection. The goal of this study is to determine whether the acquisition of the more infective phenotype (resulting in a more efficient and rapid viral transmission) exposes new vulnerabilities to target these variants with current drugs. The SARS-CoV-2 spike protein is a trimer protein on the surface of the virus. This protein mediates the binding to the human ACE2 receptor on the nasal mucosa (15) . Each monomer in the trimeric spike protein is found in two confirmations: a closed conformation, in which the ACE2 RBD is folded in and is inaccessible, and the open conformation -induced by proteolysis of the spike protein, in which the RBD becomes exposed (16) . When examining the location of the various mutations associated with increased transmissibility of 501Y.V1 and 501Y.V2, we found that these mutations are not located in a particular domain in the spike protein, nor are they all exposed in the close and or the open conformation ( Fig. 1a-c) . As neither variant with increased transmissibility causes more severe COVID-19 disease (3, 10) , and the mutations do not cluster to a particular site in the spike protein, we next examined the possibility that the acquired features in the new variants may lie in increased susceptibility of the spike protein to activation due to introduction of additional protease-activating sites. We hypothesized that the proteases responsible for this gain-of-function should be relatively unique to the initial site of SARS-CoV-2 entry, the nasal mucosa, as viral effects on internal organs do not seem to become more severe. At least seven ubiquitously expressed proteases have been suggested to act on the spike protein: TMPRSS2, furin, PC1, matriptase, trypsin, cathepsin L and/or cathepsin B (17, 18) . In contrast to the above proteases, the neutrophil elastase is enriched in the nose of humans compared to other tissues (19) and neutrophils, a source for this elastase, are recruited to the nose in SARS-CoV-2 infection (20) . Indeed, the now worldwide dominant Asp614Gly variant, introduced a new elastase proteolysis site in the spike protein (2, 21) . Using ExPASy (22), we identified Thr716Ile and Ser982Ala in 501Y.V1 as potential new elastase cleavage sites (Fig. 2a) . However, whereas Thr716 is on the surface of the spike protein, Ser982 is barely exposed in the open (active) conformation and is buried in the closed conformation (Fig. 1a) . We also identified new potential elastase cleavage sites in 501Y.V2: two of the substitutions are fixed in all variants of the new South African strain (Asp80Ala and Ala701Val) and the other two are not (Asp215Gly and Arg246Ile; Fig. 2b ). Note that although proteolysis-induced activation was first mapped to the S1/S2 boundary in the spike protein (23), protease-induced cuts at other sites may increase the propensity of the spike protein to expose the RBD. We next examined a complementary possible mechanism for the new mutations to compete for the older viral variants, namely substitutions that may decrease intramolecular interactions in the spike protein, thus rendering it more likely to be in the open/active conformation. The open confirmation in this trimeric protein exposes the RBD in a monomer, thus enabling it to bind to ACE2 and enter the body. Indeed, previous work suggested that the Asp614Gly substitution is more prone to be in the open conformation (24) . Using MOE, we found that Asn501Tyr substitution in both 501Y.V1 in 501Y.V2 may increase the propensity of the spike protein to be in the 'open' or active conformation (Fig. 3a) . The backbones of Asn501 residues in the three monomers are about 14 Å from each other in the closed trimeric structure. Therefore, a substitution with a bulky tyrosine residue (~7 Å, each) may introduce clashes and cause a loosening of the contact sites between the monomers in the closed conformation (Fig. 3a) . Note, however, MOE calculations suggest that 501Tyr substitution does not significantly destabilize the closed conformation (ΔStability 0.4369 kcal/mol) as compared with the index Wuhan 501N variant. Another intramolecular interaction in 501Y.V1 that can affect transition from closed to open conformer is mediated by serine 982. Ser982 in one monomer forms a hydrogen bond with a Thr547 in an adjacent monomer in the closed conformation (Fig. 3b) . The substitution serine to alanine in 501Y.V1 abolishes this interaction with Thr547 in the neighboring spike monomer, thus likely favoring the open conformation. In addition to the Asn501Tyr, the South African variant, 501Y.V2, has another amino acid substitution that may favor open conformation of spike protein: Lys417Asn (Table 1b ). In the closed conformation, Lys417 present in one monomer makes a proton-π interaction with Tyr369 present in the adjacent monomer (Fig. 3c) . Asparagine substitution at this position may disrupt this interaction with Tyr369, potentially favoring the transition to open conformation. Note also that Thr716Ile, Ser982Ala, and Asp1118His are also conserved in viruses isolated from bat, civet and pangolin, suggesting that these amino acid substitutions may be important for some function of the spike protein (Fig. 4) . Similarly, Asp80Ala and Ala701Val amino acid that are mutated in 501Y.V2 are conserved in these other species (Fig. 4) . Whether these mutations are associated with more efficient transmissibility in various species remain to be determined. The Ala501Tyr mutation common to both 501Y.V1 and 501Y.V2 variants and Glu484Lys and Lys417Asn of the 501Y.V2 variant is among the amino acids shown to interact with ACE2 (25) . It is therefore possible that these mutations increase the binding of the active conformer of the spike protein to ACE2 or to other associated receptors to mediate more effective transmissibility. Although the affinity calculations in MOE suggest that these mutations have minimal negative impact on the affinity of RBD with ACE2 (ΔAffinity ; Table 1c ), recent yeast display assay suggests that co-occurrence of these mutations increases the affinity of RBD to ACE2 by 2 and 10 folds for 501Y.V1 and 501Y.V2, respectively (26) . However, we suggest that mutations in RBD may also affect the closed conformation's stability, thus increasing the probability of binding to ACE2; the monomers wrap around one another through RBD in the closed trimeric structure and mutations in RBD that abolish these critical inter-monomeric interactions could favor the transition to open conformation. Therefore, the effect of these mutations, alone or in combination, on RBD-ACE2 affinity should be tested using intact viral particles. The emergence of new virus variants triggers the concern that previous exposure to the virus, antibody therapeutics and current vaccines against SARS-CoV-2 may not be effective. This public health concern needs to be addressed as soon as possible. Many neutralizing antibodies were mapped to the RBD, and four major classes of neutralizing sites have been identified in this domain (27, 28) . As one of the mutations in SARS-CoV-2 501Y.V1 and three mutations in 501Y.V2 are in the RBD, these mutations may decrease the neutralizing activities of antibodies (13) . However, neutralization studies refute this idea; both sera of convalescent patients and immunized subjects appear effective (29) (30) (31) . Notably, mutation E484K in the 501Y.V2 lineage reduces serum binding and neutralization in circulating SARS-CoV-2 isolates (32) . Furthermore, although point mutations in the spike protein may affect the therapeutic benefit of one or a combination of two monoclonal antibodies (33) , the benefit of polyclonal antibodies, such as those generated in infected individuals or when using either passive or active vaccines is less likely to be negatively impacted. Our analysis indicates that many of these mutations may also increase the exposure of the RBD of the spike protein. We therefore suggest that neutralization by antibodies may increasemore antigenic determinants per each viral particle may be exposed for the antibodies to bind to. Indeed, sera from hamsters infected with Asp614Gly virus exhibit slightly higher neutralization titers against the new dominant Gly614 variant than against the index variant Asp614 (13) . Furthermore, each viral particle has as many as 80 spike proteins on each virus (34) , only some of which need to bind ACE2, to induce viral entry into cells. If more spike proteins are activated and expose the antigenic RBD domain, neutralizing antibodies have an increased probability to bind to viral particles and agglutinate several particles together. The resulting agglutinated particles are less likely to have a successful fusion with the cell membrane and infection may be aborted. Increased infection efficiency due to increased propensity of the spike protein activation may also increase the vulnerability of the usually hidden spike protein core to the neutralizing antibodies. Proteomic analyses identified five proteins with increased expression in nasopharyngeal swab of subjects with high polymerase chain reaction titer for SARS-CoV-2 (20) . Among these five proteins is neutrophil elastase, whose levels increased 3-fold in SARS-CoV-2 infected vs. noninfected individuals. Furthermore, there is a 10-fold increase in neutrophil number in swabs from infected individuals. Elastase was also found to be critical for accelerating viral entry as well as inducing hypertension, thrombosis and vasculitis (20, 35) . As there are several approved elastase inhibitors, including Sivelestat (which is approved for acute respiratory syndrome), Alvestat (α1-antitrypsin) and Roseltide, the use of these inhibitors, perhaps as intranasal spray or drops, should be considered. Our study uses computational tools to examine the potential impact of the mutations found in SARS-CoV-2 501Y.V1 and 501Y.V2 on the viral infectivity has several limitations. We focused only on two aspects -sensitivity to protease and increased open conformer of the spike protein. Other mechanisms may increase viral transmissibility and were not examined here. For example, if proteolytic events contribute to viral inactivation at the mucosa, a decline in proteolysis susceptibility, especially to proteases present in the nose, may also contribute to increased infectivity. We also did not examine the possibility that the increased transmissibility of the new variants may be due to better survival of the virus in respiratory droplets, the potential increased affinity of the viral spike protein for receptors other than ACE2, nor the impact of mutations in other viral genes. Finally, our analysis tested mainly the potential impact of one mutation at a time. As our analysis is in silico, experiments to test the hypotheses raised by this study should be conducted next, to determine if these SARS-CoV-2 variants with more efficient and rapid transmission capabilities are also more sensitive to existing therapies and suggest new therapeutic targets to slow down and arrest the COVID-19 pandemic. The amino acid sequence of spike protein from the reference EPI_ISL_402124 (Wuhan) sequence was uploaded to ExPASy (22) (https://web.expasy.org/peptide_cutter/) and neutrophil elastase was selected as the protease. We examined also potential proteolysis site for the seven other proteases suggested to cleave SARS-CoV-2 spike proteins (17) . No changes in cleavage sites for both 501Y.V1 and 501Y.V2 were observed for furin, Cathepsin L/B, and PC1. However, a loss of a cleavage site was observed for matriptase in the 501Y.V1, due to alanine of serine 982 mutation. For TMPRSS2 and trypsin, which cleave at single arginine or lysine residues, a proteolysis site was gained at position 484, and a loss of proteolysis site was observed at both positions 246 and 417 in the 501Y.V2. PDB ID: 7A98 was prepared using the QuickPrep functionality at the default settings, to optimize the H-bond network and perform energy minimization on the system in MOE. The ΔAffinity calculations were performed using 7A98.A (S-protein monomer) and 7A98.D (ACE-2) chains with 7A98.D defined as ligand. Residues K417, E484 and N501 in S-protein (7A98.A) were selected and residue scan application was performed. The difference in affinity (ΔAffinity (kcal/mol)) between mutated residue and the wild-type were calculated as per MOE's definition. QuickPrep functionality was used as above to prepare PDB ID: 6ZB5 in MOE. 6ZB5.A, 6ZB5.B and 6ZB5.C chains were used to perform the calculations. N501 residue in S-protein in all three chains were selected and residue scan application was performed for the observed variants S, Y, T and R that are tolerated at this position with site-limit set to 3. The change in stability (ΔStability; kcal/mol) between the variants and the reference wild-type sequence were calculated as per MOE's definition. The amino acid sequence of spike protein from the index EPI_ISL_402124 (Wuhan) sequence was uploaded to PROVEAN (37) (http://provean.jcvi.org/index.php) and SIFT (38) (https://sift.bii.a-star.edu.sg). Every variant observed in 501Y.V1 (UK variant) and 501Y.V2 (South Africa variant) was also uploaded to compare against the reference sequence. Each variant was either predicted to be 'deleterious' or 'neutral' in PROVEAN or 'deleterious' or 'benign' in SIFT. MEGA X software (39) and the ClustalW alignment function was used to align the amino acid sequences of the human index, bat, civet, pangolin, 501Y.V1 and 501Y.V2 SARS-CoV-2 spike proteins. comparison to the index Wuhan human, bat, civet and pangolin spike protein sequences. Sequence alignments generated using ME 3 software. 4 5 1Y.V1 in 1Y.V2 in EGA X Structural Basis for the Recognition of SARS-CoV-2 by Full-Length Human ACE2 One Year Update on the COVID-19 Pandemic: Where Are We Now? World Health Organization Evaluating the Effects of SARS-CoV-2 Spike Mutation D614G on D614G Spike Mutation Increases SARS CoV-2 Susceptibility to Neutralization Natural Variants in SARS-CoV-2 S Protein Pinpoint Structural and Functional Hotspots; Implications for Prophylaxis Strategies. bioRxiv Transmission of SARS-CoV-2 Lineage B.1.1.7 in England: Insights from Linking Epidemiological and Genetic Data. medRxiv World Health Organization How Worrying Are the UK, South Africa, and Brazil Coronavirus Variants? BBC News Online Preliminary Genomic Characterisation of an Emergent SARS-CoV-2 Lineage in the UK Defined by a Novel Set of Spike Mutations Emergence of a New SARS-CoV-2 Variant in the UK Emergence and Rapid Spread of a New Severe Acute Respiratory Syndrome-Related Coronavirus 2 (SARS-CoV-2) Lineage with Multiple Spike Mutations in South Africa. medRxiv Tissue Distribution of ACE2 Protein, the Functional Receptor for SARS Coronavirus. A First Step in Understanding SARS Pathogenesis A Thermostable, Closed SARS-CoV-2 Spike Protein Trimer The Structural Basis of Accelerated Host Cell Entry by SARS-CoV-2 Nasal Mucus Proteomic Changes Reflect Altered Immune Responses and Epithelial Permeability in Patients with Allergic Rhinitis Proteins Associated with Neutrophil Degranulation Are Upregulated in Nasopharyngeal Swabs from SARS-CoV-2 Patients Genomic Characterisation and Epidemiology of 2019 Novel Coronavirus: Implications for Virus Origins and Receptor Binding Protein Identification and Analysis Tools on the Expasy Server The Sequence at Spike S1/S2 Site Enables Cleavage by Furin and Phospho-Regulation in SARS-Cov2 but Not in SARS-Cov1 or MERS-CoV Spike Mutation D614G Alters SARS-CoV-2 Fitness Receptor Binding and Priming of the Spike Protein of SARS-CoV-2 for Membrane Fusion SARS-CoV-2 RBD in Vitro Evolution Follows Contagious Mutation Spread, yet Generates an Able Infection Inhibitor. bioRxiv Potent Neutralizing Antibodies against Multiple Epitopes on SARS-CoV-2 Spike SARS-CoV-2 Neutralizing Antibody Structures Inform Therapeutic Strategies Cryo-Em Structure of the N501Y SARS-CoV-2 Spike Protein in Complex with a Potent Neutralizing Antibody. bioRxiv Neutralization of N501Y Mutant SARS-CoV-2 by BNT162b2 Vaccine-Elicited Sera. bioRxiv Could New Covid Variants Undermine Vaccines? Labs Scamble to Find Out Comprehensive Mapping of Mutations to the SARS-CoV-2 Receptor-Binding Domain That Affect Recognition by Polyclonal Human Serum Antibodies. bioRxiv Escape from Neutralizing Antibodies by SARS-CoV-2 Spike Protein Variants Structures and Distributions of SARS-CoV-2 Spike Proteins on Intact Virions Neutrophil Extracellular Traps and by-Products Play a Key Role in COVID-19: Pathogenesis, Risk Factors, and Therapy Free Fatty Acid Binding Pocket in the Locked Structure of SARS-CoV-2 Spike Protein Provean Web Server: A Tool to Predict the Functional Effect of Amino Acid Substitutions and Indels Sift Missense Predictions for Genomes Molecular Evolutionary Genetics Analysis across Computing Platforms Sift: Predicting Amino Acid Changes That Affect Protein Function Potential elastase sites in the spike protein of the new SARS-CoV-2 variants. Potential new elastase sites in (A) the more infective UK variant, 501Y.V1 and (B) the more infective South Africa variant, 501Y.V2. Images generated using BioRender. Blue boxes indicate fixed mutations and gray boxes -non General : We are grateful to the many investigators throughout the world that continue to provide SARS-CoV-2 sequences to public databases. Author contributions: S.P, L.L and B.R.K provided data analysis, visualization and draft writing. K.S contributed to visualization and manuscript preparation. D.M-R conceived the project and supervised the analysis and writing. The authors have no competing interests to declare. Tables Table 1: V1, (B) The N501Y and D614G mutations, which occur in both the Y501.V1 and Y501.V2 variants, and (C) Mutations unique for the 501Y.V2. Each mutation is shown in red in one of the monomers in the 3D structure of the spike protein trimer. Shown is the closed conformation (light gray) and the open (dark gray) conformation.