key: cord-0932138-55qnwqg7 authors: Gurung, Arun Bahadur; Ali, Mohammad Ajmal; Lee, Joongku; Farah, Mohammad Abul; Al-Anazi, Khalid Mashay; Al-Hemaid, Fahad; Sami, Hiba title: Structural and functional insights into the major mutations of SARS-CoV-2 Spike RBD and its interaction with human ACE2 receptor date: 2021-12-20 journal: J King Saud Univ Sci DOI: 10.1016/j.jksus.2021.101773 sha: dff7355527d5b23a3eb79faa482a2fa8cf14f519 doc_id: 932138 cord_uid: 55qnwqg7 Coronavirus Disease 2019 (COVID-19) caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has rapidly spread around the world jeopardizing the global economy and health. The rapid proliferation and infectivity of the virus can be attributed to many accumulating mutations in the spike protein leading to continuous generation of variants. The spike protein is a glycoprotein that recognizes and binds to cell surface receptor known as angiotensin-converting enzyme 2 (ACE2) leading to the fusion of the viral and host cell membranes and entry into the host cells. These circulating variants in the population have greatly impacted the virulence, transmissibility, and immunological evasion of the host. The present study is aimed at understanding the impact of the major mutations (L452R, T478K and N501Y) in the receptor-binding domain (RBD) of spike protein and their consequences on the binding affinity to human ACE2 through protein-protein docking and molecular dynamics simulation approaches. Protein-protein docking and Molecular mechanics with generalised Born and surface area solvation (MM/GBSA) binding free energy analysis reveal that the spike mutants-L452R, T478K and N501Y have a higher binding affinity to human ACE2 as compared to the native spike protein. The increase in the number of interface residues, interface area and intermolecular forces such as hydrogen bonds, salt bridges and non-bonded contacts corroborated with the increase in the binding affinity of the spike mutants to ACE2. Further, 75 ns all-atom molecular dynamics simulation investigations show variations in the geometric properties such as root mean square deviation (RMSD), radius of gyration (Rg), total solvent accessible surface area (SASA) and number of hydrogen bonds (NHBs) in the mutant spike:ACE2 complexes with respect to the native spike:ACE2 complex. Therefore, the findings of this study unravel plausible molecular mechanisms of increase in binding affinity of spike mutants (L452R, T478K and N501Y) to human ACE2 leading to higher virulence and infectivity of emerging SARS-CoV-2 variants. The study will further aid in designing novel therapeutics targeting the interface residues between spike protein and ACE2 receptor. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a member of the Coronaviridae family, the Betacoronavirus genus, and the Sarbecovirus subgenus, with a 29.9kb linear single-stranded positive-sense RNA genome (Lu et al., 2020; Wang et al., 2020) . Spike (S), envelope (E), membrane (M), and nucleocapsid (N) are among the four structural proteins encoded by SARS-CoV-2 genome, along with 16 non-structural proteins (Nsp1 to Nsp16) (Wang et al., 2020) . The spike glycoprotein is a homotrimer found on the coronavirus surface that aids in the recognition of the human host cell surface receptor angiotensin converting enzyme 2 (ACE2) (Mercurio et al., 2021) . This recognition is necessary for the fusion of the viral and host cellular membranes, which allows the viral nucleocapsid to be transferred into the host cells (Zhang et al., 2020) . The coronavirus disease 2019 , which is caused by SARS-CoV-2, has a massive impact on world health and the economy . The epicentre of the current pandemic was initially detected in Wuhan, Hubei province, China. Since then, the disease has spread rapidly around the world, affecting millions of individuals and causing unprecedented number of deaths (Wu et al., 2020) . Several studies have shown that SARS-CoV-2 is closely linked to bat SARS-like-CoVs, but the virus's origin and intermediate host species remain unknown (Sun et al., 2020; Zhou and Shi, 2021) . The viral genome undergoes numerous changes in the spike proteins in order to be able to leap species and infect a new mammalian host (Guruprasad, 2021) . 3 The spike protein is made up of two subunits, S1 and S2, which aid in cellular receptor Angiotensin-converting enzyme 2 (ACE2) affinity and membrane fusion, respectively (Satarker and Nampoothiri, 2020) . The receptor-binding domain (RBD) of the S1 unit may directly bind to the ACE2 receptor and is also the primary target of SARS-CoV-2 neutralizing antibodies (Ab) (Kadam et al., 2021; Souza et al., 2021) . S1 is thus thought to be a hotspot for mutations of clinical significance in terms of virulence, transmissibility, and immunological evasion of the host (Shang et al., 2020; Yi et al., 2020) . In SARS-CoV-2, the PRRA sequence motif between the S1 and S2 subunits serves as a furin cleavage site (Ou et al., 2020) . SARS-CoV-2 has evolved into numerous co-circulating variants since its discovery in Wuhan in 2019 (Jia and Gong, 2021) . The binding ability of the circulating variants' spike protein to human ACE2 has considerably enhanced, resulting in a considerable increase in its replication and transmission (Korber et al., 2020; Thomson et al., 2021) . In the spike, there are 3698 nucleic acid mutations (frequency: 0.968%) and 2746 amino acid mutations (frequency: 2.157%). Substitution mutations like D614G, N501Y, Y453F, N439K/R, P681H, K417N/T, and E484K, as well as deletion mutations like ΔH69/V70 and Δ242-244 are the most common in the spike protein . The D614G mutation enhanced viral proliferation and transmission as compared to wild-type viruses (Daniloski et al., 2021) . Due to the E484K substitution, the 501Y.V2 variant is more resistant to multiple monoclonal antibodies, convalescent plasma, and vaccine sera, whereas the N501Y substitution enhanced the affinity to human ACE2 and infectivity (Cele et al., 2021; McCormick et al., 2021; Noh et al., 2021 Other than N501Y, both Beta and Gamma variants harbours additional substitutions. The E484K mutation is present in the Beta variants, whereas the E484K and K417T mutations are present in the Gamma variants. In India's second COVID-19 wave, the newest significant variants, Delta and Kappa, were discovered to share two mutations: E484Q and L452R. Delta 4 variant additionally has a unique mutation, T478K in addition to the two mutations listed above (Khateeb et al., 2021) . SARS-CoV-2 may evade human immune response by continuous genomic evolution, such as substitution in the viral RBD and/or deletion and insertion in the viral spike's N-terminal domain loops, particularly in immunocompromised hosts . The best way to prevent SARS-CoV-2 infection is to be vaccinated. There has been more than eight COVID- antibodies in convalescent plasma and vaccinee sera, it is uncertain if these vaccines are still efficacious against SARS-CoV-2 mutants continually produced in the community (Chen et al., 2021; Wang et al., 2021) . In the present studies, we have modelled the mutant structures of Spike protein-L452R, T478K and N501Y (Khateeb et al., 2021) through in silico mutagenesis technique and evaluated their binding interactions with human ACE2 with respect to the native spike protein using protein-protein docking method. We have characterized the interface regions of native and mutant spike:ACE2 complexes. The stability of native and mutant spike models were investigated using all-atom molecular dynamics simulations. The crystal structure of the receptor-binding domain (RBD) of SARS-CoV-2 spike protein bound to the cell receptor ACE2 was retrieved from protein data bank (http://www.rcsb.org/) using accession ID: 6M0J. The structure is solved through the X-ray diffraction method at a resolution of 2.45 Å (Lan et al., 2020) . The structure complex was split into ACE2 assigned as chain A and spike RBD assigned as chain B using UCSF Chimera tool (Pettersen et al., 2004) . Three mutant models of Spike protein-L452R, T478K and N501Y were generated using the mutagenesis program of PyMOL Molecular Graphics System software version 2.3.3 Schrödinger, LLC. The rotamer was selected using the highest frequency of occurrence. The interaction of ACE2 with the three mutant models of Spike (L452R, T478K and N501Y) were studied using High Ambiguity Driven protein-protein DOCKing (HADDOCK) version 2.4 program which uses an information-driven flexible protein-protein docking approach (Van Zundert et al., 2016) . The binding sites of ACE2 and Spike mutant were defined based on the interaction profile of crystal structure of ACE2-spike (PDB ID: 6M0J). The different conformations of the protein-protein complexes were ranked based on the HADDOCK score which is computed using the following scoring function (equation 1). HADDOCK Score=0.2×Electrostatic energy + 1.0 ×Van der Waals energy + 1.0×Desolvation energy + 0.1×Restraints violation energy - The best structures of the conformers were downloaded in PDB format. Molecular Mechanics/Generalized Born Surface Area (MM/GBSA) program of the HawkDock web server (Weng et al., 2019) was used to predict binding free energies of the protein-protein complexes as well as per residue-free energy contributions. The interface residues, interface area and molecular interactions were evaluated using PDBSum program (Laskowski, 2009) and the hot spot residues in the interface region were determined through alanine scanning mutagenesis program of DrugScorePPi webserver (Kruger and Gohlke, 2010). 6 The native spike:ACE2 and mutant spike:ACE2 complexes were subjected to 75 ns MD simulation studies using GROningen MAchine for Chemical Simulations (GROMACS) 2019.2 software package (Hess et al., 2008) . The topology files of the complexes were prepared using GROMOS96 43a1 force field. The complexes were prepared for MD simulation within a water-filled 3-D cube of 1 Å spacing using a three-point water-model (SPC216) with periodic boundary conditions. Newton's equations of motion were integrated using a leap-frog time integration algorithm. The complexes were neutralized by the addition of 0.15 M NaCl and energy minimization was performed using the steepest descent method. The temperature was set at 300 K and the complexes were subjected to equilibration under 100 ps in NVT (Number of particles, Volume and Temperature) ensemble and another 100 ps under NPT ensemble (Number of particles, Pressure and Temperature). The complexes were subjected to production MD run for 75 ns in NPT ensemble after heating and equilibration. The geometrical properties were computed using defined programs of GROMACS 2019.2 software. MD simulations data was plotted using Xmgrace plotting tool. We have selected three major mutations-L452R, T478K and N501Y that occur in the receptorbinding domain (RBD) of spike protein (Figure 1 ). The binding interactions between human ACE2 and these three mutant models of Spike (L452R, T478K and N501Y) were studied using protein-protein docking method. The binding affinity between the protein partners were ranked according to the HADDOCK score (Table 1) To explain the differences in the binding affinity of the native and mutant spike proteins with ACE2, their docked complexes were next subjected to interface statistics analysis using PDBSum (Table 3 ). The interface residues and molecular interactions between ACE2 and spike in native and mutant complexes are shown in Figure 2 . The complex between ACE2 and native Spike protein shows an interface area of 952:984 Å 2 , number of 19:18 interface residues, 1 salt bridge, 10 hydrogen bonds and 133 non-bonded contacts. In the ACE2 and L452R spike mutant model complex, there are 20:23 interface residues, an interface area of 1085:1059 Å 2 , 2 salt bridges, 11 hydrogen bonds and 177 non-bonded contacts. The complex between ACE2 and T478 Spike mutant shows 19:21 interface residues, 977:987 Å 2 area, 2 salt bridges, 10 hydrogen bonds and 156 non-bonded contacts. The interface between ACE2 and N501Y spike mutant has 19:19 residues, an interface area of 962:1020 Å 2 , 1 salt bridge, 13 hydrogen bonds and 143 non-bonded contacts. The increase in the binding affinity of L452R spike mutant to ACE2 can be attributed to an increase in the number of interface residues, interface area, salt bridges, hydrogen bonds and non-bonded contacts. The increased binding affinity of T478K spike mutant to ACE2 can be explained in terms of increase in the number of interface residues, interface area, number of salt bridges and number of non-bonded contacts. The increased binding affinity of N501Y spike mutant to ACE2 can be explained in terms of an increase in the number of interface residues, interface area, number of hydrogen bonds and number of nonbonded contacts. The total number of hotspot residues in the interface region of Native Spike:ACE2, L452R Spike:ACE2, T478K Spike:ACE2 and N501Y Spike:ACE2 were 5 (Asp30, Tyr41, Tyr83 in ACE2 and Tyr489, Tyr505 in native spike protein) (Suppl. Further, the Native Spike:ACE2, L452R Spike:ACE2, T478K Spike:ACE2 and N501Y Spike:ACE2 complexes were subjected to 75 ns MD simulations study in an aqueous solution. We made a comparative assessment of the geometric parameters such as RMSD, Rg, SASA and the number of hydrogen bonds between the mutant and native spike complexes (Table 4) . (Table 4 ). The increase in total SASA for mutant spike:ACE2 complexes causes alteration in the folding pattern of the protein-protein complex. On account of the increase in the SASA of the protein upon mutations, there is a decrease in the number of hydrogen bonds in the protein-protein complex ( Figure 6 ). The average number of hydrogen bonds in Native Spike:ACE2, L452R Spike:ACE2, T478K Spike:ACE2 and (Table 4 ). Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a member of the Coronaviridae family with a 29.9-kb linear single-stranded positive-sense RNA genome (Lu et al., 2020; Wang et al., 2020) . The virus uses the spike glycoprotein found on the coronavirus surface that aids in the recognition to ACE2 and the fusion of the viral and host cellular membranes (Zhang et al., 2020) . The coronavirus disease 2019 (COVID-19), which is caused by SARS-CoV-2, has caused a massive impact on world health and the economy . The spike protein is made up of two subunits, S1 and S2, which aid in cellular receptor Angiotensin-converting enzyme 2 (ACE2) affinity and membrane fusion, respectively (Satarker and Nampoothiri, 2020) . S1 is considered a hotspot for mutations of significant clinical significance in terms of virulence, transmissibility, and immunological evasion of the host (Shang et al., 2020; Yi et al., 2020) . SARS-CoV-2 has evolved into numerous cocirculating variants since its discovery in Wuhan in 2019 (Jia and Gong, 2021 (Korber et al., 2020; Thomson et al., 2021) . In the present study, we have investigated the binding interactions of native and three major mutant spike proteins-L452R, T478K and Y501Y to human ACE2 and provided plausible mechanisms of variations in the binding affinity using protein-protein docking and molecular dynamics simulation approaches. The HADDOCK docking scores and HawkDock MM/GBSA binding energy analysis show that the mutant spike proteins have a higher binding affinity to ACE2. Interface statistics show that the increase in the binding affinity of the mutant spike to ACE2 is correlated with the increase in the number of interface residues, interface area, hydrogen bonds, salt bridges and non-bonded contacts. The hotspot residues which cause a substantial increase in the binding free energy of at least 2.0 kcal/mol when mutated to alanine mutation (Thorn and Bogan, 2001) were computed for the native as well as mutant spike:ACE2 complexes. Molecular dynamics simulations studies reveal compelling changes in the geometric parameters such as RMSD, Rg, total SASA and number of hydrogen bonds. These data provide useful insights into the molecular aspects of the enhanced binding affinity of spike mutants to ACE2 receptor and may help in designing therapeutics targeting the interface region. While the functional impact of a single mutation alone on the spike protein has been investigated through our studies, it would be fascinating to explore the cumulative effects of all such mutations on the binding interactions of spike protein to ACE2 receptor. Further, deciphering the structure of mutant spike protein in complex with ACE2 through experimental techniques such as X-ray crystallography will provide detailed mechanistic insights into their binding interactions. The emerging SARS-CoV-2 variants have posed a threat to the world's healthcare systems due to their high transmissibility, virulence, infectivity, and their ability to neutralize the serum antibodies. This rapid infectivity of the virus can be attributed to the large accumulating mutations in the spike protein, a glycoprotein that helps the virus gain entry into the human cells via binding to cell surface ACE2 receptor. Using computational simulation approaches, we presented the probable mechanisms of increase in the binding affinity of spike mutants (L452R, T478K, and N501Y) to human ACE2 receptor. Our studies will help in developing novel therapeutics against emerging SARS-CoV-2 variants by blocking the interface region of Escape of SARS-CoV-2 501Y. V2 from neutralization by convalescent plasma Resistance of SARS-CoV-2 variants to neutralization by monoclonal and serum-derived polyclonal antibodies The Spike D614G mutation increases SARS-CoV-2 infection of multiple human cell types Human SARS CoV-2 spike protein mutations GRGMACS 4: Algorithms for highly efficient, load-balanced, and scalable molecular simulation Will Mutations in the Spike Protein of SARS-CoV-2 Lead to the Failure of COVID-19 Vaccines? SARS-CoV-2, the pandemic coronavirus: Molecular and structural insights Emerging SARS-CoV-2 variants of concern and potential intervention approaches Tracking changes in SARS-CoV-2 spike: evidence that D614G increases infectivity of the COVID-19 virus DrugScorePPI webserver: fast and accurate in silico alanine scanning for scoring protein--protein interactions Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 receptor PDBsum new things Recent progress on the mutations of SARS-CoV-2 spike protein and suggestions for prevention and controlling of the pandemic Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding The emerging plasticity of SARS-CoV-2. Science (80-. ) Protein structure analysis of the interactions between SARS-CoV-2 spike protein and the human ACE2 receptor: from conformational changes to novel neutralizing antibodies SARS-CoV-2 mutations, vaccines, and immunity: implication of variants of concern Characterization of spike glycoprotein of SARS-CoV-2 on virus entry and its immune cross-reactivity with SARS-CoV UCSF Chimera--a visualization system for exploratory research and analysis Structural proteins in severe acute respiratory syndrome coronavirus-2 Structural basis of receptor recognition by SARS-CoV-2 The human pandemic coronaviruses on the show: The spike glycoprotein as the main actor in the coronaviruses play COVID-19: epidemiology, evolution, and cross-disciplinary perspectives Circulating SARS-CoV-2 spike N439K variants maintain fitness while evading antibody-mediated immunity ASEdb: a database of alanine mutations and their effects on the free energy of binding in protein interactions The HADDOCK2. 2 web server: user-friendly integrative modeling of biomolecular complexes SARS-CoV-2: structure, biology, and structure-based therapeutics development Increased resistance of SARS-CoV-2 variant P. 1 to antibody neutralization HawkDock: a web server to predict and analyze the protein--protein complex based on computational docking and MM/GBSA others, 2020. A new coronavirus associated with human respiratory disease in China Key residues of the receptor binding motif in the spike protein of SARS-CoV-2 that interact with ACE2 and neutralizing antibodies Angiotensin-converting enzyme 2 (ACE2) as a SARS-CoV-2 receptor: molecular mechanisms and potential therapeutic target SARS-CoV-2 spillover events ACE2:Spike complex. The authors declare that they have no competing interests in this study. The authors would like to extend their sincere appreciation to the Researchers Supporting Project number (RSP-2021/306), King Saud University, Riyadh, Saudi Arabia. The authors declare that they have no competing interests in this study.