key: cord-0430812-4p1e0f5l authors: Sinha, Siddharth; Tam, Benjamin; Wang, San Ming title: Altered interaction between RBD and ACE2 receptor contributes towards the increased transmissibility of SARS CoV-2 delta, kappa, beta, and gamma strains with RBD double mutations date: 2021-08-31 journal: bioRxiv DOI: 10.1101/2021.08.30.458303 sha: b0589e0b09bde3724a080c0b18079775750fde98 doc_id: 430812 cord_uid: 4p1e0f5l The COVID-19 pandemics by SARS-CoV-2 causes catastrophic damage for global human health. The initial step of SARS-CoV-2 infection is the binding of the receptor-binding domain (RBD) in its spike protein to ACE2 receptor in host cell membrane. The evolving of SARS-CoV-2 constantly generates new mutations across its genome including RBD. Besides the well-known single mutation in RBD, the recent new mutation strains with RBD “double mutation” is causing new outbreaks globally, as represented by the delta strain containing RBD L452R/T478K. Although it is considered that the increased transmissibility of the double mutated strains could be attributed to the alteration of mutated RBD to ACE2 receptor, the molecular details remains to be unclear. Using the methods of molecular dynamics simulation, superimposed structural comparison, free binding energy estimation and antibody escaping, we investigated the relationship between ACE2 receptor and the RBD double mutant L452R/T478K (delta), L452R/E484Q (kappa) and E484K/N501Y (beta, gamma). The results demonstrated that each of the three RBD double mutants altered RBD structure, led to enhanced binding affinity of mutated RBD to ACE2 receptor, leading to increased transmissibility of SARS-CoV-2 to the host cells. SARS-CoV-2 pandemic has caused devastating consequences on global public health, with over 207 million people infected and over 4.3 million loss of life globally since the COVID-19 started (https://covid19.who.int accessed August 17, 2021). SARS-CoV-2 infects human cells through its spike (S) protein. In the process, the receptor-binding domain (RBD, residues 318-526) of the S protein (residues 1-1273)binds to the angiotensin-converting enzyme 2 (ACE2) receptor on host cell membrane to release its genome into the cell . Therefore, RBD is a determinant for SARS-CoV-2 infection into host cells. As RNA virus, the genome of SARS-CoV-2 is constantly evolving with new mutations generated across its genome including RBD. Since the first SARS-CoV-2 genome sequences reported in Jan 5, 2020, there have been 942 coding-changing mutations identified within the 193 positions of RBD, more than one mutation per day (942 mutations in 932 days), and 4.9 mutations per position on average (Supplementary table 1, http://covglue.cvr.gla.ac.uk/#/replacement, access July 31, 2021). While most of the mutations do not show pathogenic significance, the strains containing several RBD mutations, namely L452R (epsilon), T478K, E484K, E484Q, and N501Y, were selected ( Figure 1 ) and have caused multiple outbreaks much due to the increased transmissibility of SARS-CoV-2 contributed by these RBD mutations (Leung et al., 2021) , https://www.who.int/en/activities/tracking-SARS-CoV-2-variants/). Recently, several new SARS-CoV-2 strains with RBD "double mutations" are causing new challenge. The double mutations basically contain the same single mutations above, but they are more transmissible than the strains with corresponding single mutation. For example, the "delta" strain with RBD mutation L452R/T478K rapidly spreads to over 130 countries since it was identified in late 2020 (Torjesen, 2021) , https://www.cdc.gov/mmwr/volumes/70/wr/mm7031e2). L452R/E484Q (kalpa) and E484K/N501Y (beta, gamma). Thus, understanding the mechanism of increased transmissibility in RBD double mutations is urgently needed to develop strategies to control their spreading. While studies revealed how RBD single mutations increased SARS-CoV-2 transmissibility, it remains to know whether the SARS-CoV-2 with RBD double mutations adopted the same or similar manners as the SARS-CoV-2 with RBD single mutations or gains new features considering the more aggressive behavior of SARS-CoV-2 with RBD double mutations than SARS-CoV-2 with single RBD mutations. In this study, we investigated the relationship between ACE2 receptor and three major RBD double mutant L452R/T478K (delta), L452R/E484Q (kappa) and E484K/N501Y (beta, gamma). We used multiple methods to address the topic including molecular dynamics simulation (MDS), superimposed structural comparison, free binding energy estimation and antibody escaping. Molecular dynamics simulation measures conformational change of protein across a time period and uses the trajectories to describe the thermodynamics changes in RBD mutant structure (Karplus, 2002; Sinha & Wang, 2020) , superimposed structural comparison allows direct visualization of the altered RBD structure by the double mutated residues (Kufareva & Abagyan, 2012) , free binding energy change estimates the affinity Figure 1 . Locations of single mutated RBD and ACE2. The center shows the RBD-ACE2 complex, the surrounding shows the single mutations in RBD, with the left being the wild type and the right being the mutated residue. A) L452 and L452R; B) E484 and E484K; C) N501 and N501Y; D) T478 and T478K; E) E484 and E484Q. change caused by the double mutated residues between RBD and ACE2 receptor (Gumbart et al., 2013) , and mapping antibody binding site explains whether RBD structural change caused by mutated residues can result in escape of mutants from neutralizing antibodies (NAbs). The results from our study provided evidence that the three RBD double mutants altered RBD structure in the ways much different from these caused by the corresponding RBD single mutants, enhanced binding of mutated RBD to ACE2 receptor, leading to the high transmissibility of SARS-CoV-2 to the host cells. To investigate the effects of double mutations on RBD structure, we examined conformational changes of the RBD double mutant -ACE2 receptor complex in RBD double mutant L452R/T478K, L452R/E484Q, E484K/N501Y using MD simulations for 100 ns. We performed the study by using 5 different types of methods including simulations of RMSD (Root mean square deviation), RMSF (Root mean square fluctuation), Rg (radius of gyration), SASA (solvent accessible surface area) and MSD (mean square displacement) for each type of RBD double mutant, with the wild type RBD and 5 RBD single mutants as the controls. The results revealed substantial structural differences between the double mutants, wild type, and single mutants: RMSD: It demonstrates the change in the atomic coordinates between the wildtype and mutant structures (Dong et al., 2018) . The wild type RBD-ACE2 structure stabilized around ~0. ). x-axis shows the simulation lasted time (100 ns), y-axis shows the RMSD value for the structures. The results show substantial differences between single and double mutated RBD structure. It determines the resilience of residues to analyze the effects of substitution in native and mutant structures (Benson & Daggett, 2012) . The flexibility in polypeptide chain through RMSF showed that double mutant L452R/E484Q, L452R/T478K, and E484K/N501Y had high to medium resilience ~ 0.4 nm, ~0.15 nm, and ~0.25 nm, respectively, at residue position 360-380 and 470-490. In comparison, the single mutants had no greater flexibility in the backbone C-α atom than the native structure (Supplementary Figure 1) . The higher resilience in the double mutant L452R/E484Q and E484K/N501Y can be attributed to their differences in RMSD trajectory around residue position 360-380 and 470-490. Rg: It measures the compactness between the wild type and mutant structures (Daidone et al., 2003) . Rg value for the wild type RBD was ~3.10-3.15 nm. Double mutant L452R/T478K showed a greater Rg value ~3.2-3.25 nm for the entire trajectory than the wild type; L452R/E484Q had ~3.13-3.18 nm; E484K/N501Y had ~3.15-3.25 nm after 35 ns; L452R/E484Q and E484K/N501Y showed the change in Rg value ~80-100 ns and ~10-30 ns respectively. Their larger hydrodynamic radius implies the change in the shape of structure during protein folding and unfolding. T478K, E484K, and E484Q had higher Rg values than the wild type RBD but lower than the double mutants, except N501Y had higher value ~3.2-3.3 nm (Supplementary Figure 2) . SASA: SASA defines the solvent accessible surface area thereby measuring the relative expansion of the native and mutant structures (Zhang & Lazim, 2017) . The wild type RBD had a surface area of ~430 nm 2 , all single mutant L452R, T478K, E484K, E484Q and N501Y had decreased value of ~426 nm 2 , ~426 nm 2 , ~424 nm 2 , ~428 nm 2 and ~429 nm 2 , respectively. The double mutants also decreased values to these in single mutants but their patterns were different (Supplementary Figure 3) . MSD: MSD defines the mean square displacement of overall atoms from a set of initial positions between wild type RBD and RBD mutants (Yousefpour et al., 2015) . The wild type structure showed an average displacement value ~11.31. The double mutant L452R/T478K, L452R/E484Q and E484K/N501Y showed average displacement value ~10.88, ~7.33 and ~9.06, respectively, lower than the wild type but higher than these in the single mutants (Supplementary Figure 4) . To directly visualize the conformational changes in double mutated RBD, we superimposed structure between wildtype, double mutants and the corresponding single mutants. We observed that the conformational changes in each of the double mutant RBD were substantially different from these in The results show that the RBD conformational changes in double mutations were substantially different from single mutations. We used MM/GBSA method to estimate overall changes of free binding energy between RBD mutants and ACE2 receptor. The high binding energy of RBD to ACE2 receptor is largely contributed by several key residues including N501, L452 and T478, which had 4, 2 and 2 contact regions, respectively, whereas other residues such as E484 contributed less binding energy than N501 to ACE2 receptor (Figure 4 ). Comparing to the overall free binding energy of 212.5 kJ mol -1 between the wild type RBD and ACE2 receptor, all single and double RBD mutants showed increased free binding energy except N501Y decreased to -204.6 kJ mol -1 (Table 1A) . However, the changed levels between double mutants and their corresponding single mutants were at similar levels, implying double mutants didn't generate higher binding energy than the single mutants. We also compared the energy changes in the non-mutated residues in the RBD mutants. In the wild type RBD, a set of residues made high contributions to the overall energy between RBD and ACE2 receptor with the top 5 residues of F456, F486, Q493, T500, and Y505 (Table 1B) . However, the top 5 residues in each RBD double mutant were mostly changed from the wild type RBD and their corresponding single mutations: L452/T478K changed to Y505, F486, N501, Q493, and T500; L452R/E484Q to F486, Q493, Y505, Y489 and F456; and E484K/N501Y to Y501, Y505, Q493, T500 and F486. N501 in the wildtype RBD contributed only -7.58 kJ mol -1 (9 th ) in the 198 RBD residues (data not shown). However, N501 became a top residue in 2 of the 3 RBD double mutants and 4 of the 5 RBD single mutants, highlighting that N501Y in RBD double mutants and single RBD mutants played significant roles in enhancing the affinity between RBD mutants to ACE2 receptor by enhancing the binding energy (Table 1B, Figure 4 ). It has been proposed that higher transmissibility of SARS-CoV-2 new mutant strains can be attributed by the increased affinity of the mutated RBD to ACE2 receptor in the host cells, and the RBD double mutations can further increase the affinity and enhance the transmissibility, as represented by SARS-CoV-2 L452R/T478K (delta), L452R/E484Q (kappa), E484K/N501Y (beta and gamma) strains (Scudellari, 2021). Using combinational methods, our study provides structural-based evidence to demonstrate that RBD double mutant-changed RBD structure enhances the affinity of mutated RBD to ACE2 receptor, causing the increased transmissibility of the new SARS-CoV-2 mutant strains. It is interesting to note that many of the "hot spot" mutants predicted by previous simulation studies have rarely become highly transmissible strains (Gan et al., 2021; Ghorbani et al., 2020; Yi et al., 2020) . Instead, only limited numbers of mostly unpredicted single mutations were truly selected to cause outbreaks. More interesting fact is that the RBD double mutations in the newly emerging strains still inherited these mutations at the same positions with different combinations. This indicates that although MDS is powerful in detecting structural changes caused by mutation, solely based on physical relationship between RBD and ACE2 receptor in the randomly detected mutation pool is not a reliable approach to determine the "hotspot" mutation as the marks for new strain with increased transmissibility. No matter how strong the physical interaction between mutated RBD to ACE2 receptor, only these with best fitness within the mutation pool will be selected, which is determined by multiple selection factors not only by the relationship between RBD and ACE2 receptor. In addition, relying on the accumulated quantity of mutation sequences may also give misleading information for the increased transmissibility, as the number of sequences collected could be biased sampling, not necessarily reflect the actual abundance of the mutant strains in actual infected population. This is reflected by the RBD mutation sequence data that only 4 (N501Y, L452R, E484K, Y478K) were listed among the 7 top mutation sequences, and E484Q was listed at 26 th position (Supplementary table 1) . Therefore, focusing on the "real world" mutations either single, double and possible triple combinations at these fixed positions can provide more reliable evidence to monitor the newly selected strains and to use the physical evidence to explain the structural basis for these highly selected ones with high transmissibility (Khan et al., 2021). N501 is particularly interesting residue. It has been determined that N501 enhances RBD binding to ACE2 by maintaining RBD at open conformation (Teruel et al., 2021) , and this function can be enhanced by E484K (Nelson et al, 2021) . N501 is also a mutation hot-spot as reflected by the RBD single mutation N501Y in the alpha strain (Fiorentini et al., 2021) , and RBD double mutation E484K/N501Y in beta and gamma strains. We observed that each tested RBD single or double mutation increased the free binding energy between N501 and ACE2 receptor ( Figure 4B ), highlighting that N501 is a key player in enhancing wile type and mutated RBD binding to ACE2, not only for N501Y but also the mutated residues in other RBD locations likely through allosteric activity (Mugnai et al., 2020) . For example, E484K and E484Q substantially increased binding affinity of N501 and N501Y to ACE2 receptor (Luan et al., 2021; Nelson et al., 2021) . Similar situation was also present in RBD double mutants, and mutations outside RBD can also contribute to the increased affinity of RBD to ACE2 receptor, such as D614G (Korber et al., 2020).Therefore, not only the mutated RBD residues but also non-mutated RBD residues needs to be considered when addressing the increased transmissibility of new mutant strains. Antibody resistance is one of the crucial determinants of viral transmission. Many potentially neutralizing anti SARS-CoV-2 antibodies target RBD to block its binding to the ACE-2 receptor (Baum et al., 2020; Ju et al., 2020; Starr et al., 2021) . Consistent with experimental observations (Rogers et al., 2020) , our study showed that all three RBD double mutants can cause antibody escape. Breakthrough infection by delta strain has been observed in the vaccinated individuals (Bergwerk et al., 2021) . Our observation indicates that the altered antibody binding sites on RBD by RBD double mutation contributes to the phenomenon. It is interesting to note that the binding energy between single mutants (L452R, T478K, E484K, E484Q and N501Y) and double mutants (L452/E484Q, L452/T478K and E484K/N501Y) were at similar level, although both were higher than the wild type RBD. In conclusion, our study revealed that double mutated RBD in SARS-CoV-2 caused the changes of RBD structure, RBD binding energy to ACE2 receptor, and antibody binding sites on RBD. These changes contribute to the increased transmissibility of recent SARS-CoV-2 mutant strains with double mutated RBD. The structure of SARS-CoV-2 RBD -ACE2 receptor complex (PDB ID: 6M0J) was used as the wild type reference . The structure was optimized using CHARMM36m force field in GROMACS version 5.1.2. Three RBD double mutant L452R/T478K (delta), L452R/E484Q (Kappa) and E484K/N501Y (beta, gamma), and 5 RBD single mutants of L452R, T478K, E484Q, E484K, and N501Y included in the double mutants were used in the study. The structures with the mutated residues in each mutant were generated using UCSF Chimera following the default parameters (Pettersen et al., 2004) . GROMACS version 5.1.2, was used for molecular dynamics simulation (Berendsen et al., 1995) . CHARMM36m force field was chosen to model RBD -ACE2 receptor complex (Huang et al., 2017) . The complex was situated in the simulation box 2 nm away from the box edge. The system was solvated with tip3p water and neutralized with Na+ ions. Steep descent algorithm was applied to the system before 1 ns equilibration run at 298 K and 1 bar in the NPT ensemble using Berendsen thermostat and barostat. The system was set at 298 K and 1 bar in the NPT ensemble by using a Vrescale thermostat and Parrinello-Rahman barostat during the production run (Parrinello & Rahman, 1981) . Verlet velocity algorithm was employed with a time step of 2 fs. The Particle Mesh Ewald (PME) method was used to treat the long-range electrostatic interactions with the cut-off distance at 1.0 nm. Hydrogen bond was constrained at equilibrium lengths by the LINC algorithm and the trajectory frame of MD was saved every 30 ps (Hess, 2008) . RMSD, RMSF, Rg, SASA and MSD were used to analyze structural changes of the native RBD and its mutants. The 35-40ns simulation trajectories were utilized in each method. XMGRACE program was utilized to generate the corresponding plots (Turner, 2005) . Three independent RMSD MD simulations were performed for the wild type RBD-ACE2 for 100 ns, received RBD values at 0.186 nm, 0.173 nm, 0.179 nm, ACE2 values at 0.229 nm, 0.230 nm, 0.239 nm for run 1, run 2, run 3. The results confirmed that RBD -ACE2 structure remained stable throughout the simulation process. Mutant RBD structures extracted at 100 ns of MD simulations through UCSF CHIMERA were superimposed with the wild type RBD structures to identify the conformation changes using PYMOL. To determine whether RBD mutants escape the binding by neutralization antibody, structures between the mutant and wildtype RBD were structurally mapped with independent binding sites of antibody C121 on RBD. The RBD binding site for antibody C121, a neutralizing antibody known to (Onufriev et al., 2004) were assigned upon the proteins (Complex, RBD and ACE2) before steepest descent and conjugate gradient minimization. The free energy of binding, ∆ , is calculated based on the following equation: For each component, the total energy was estimated by using the following equation: The first three terms represent the standard MM energy terms: bonded (bond, angle, and dihedral), electrostatic and van der Waal interactions. and are the polar and the non-polar contribution of solvation free energies, calculated by the Generalised Born (GB) implicit solvent method and solvent accessible surface area (SASA). The last term absolute temperature, , is multiplied by entropy, . For the double mutant RBD and their containing single mutant RBD, simulation was performed at 100 ns and a single protein coordinate file was extracted every 10 ns. Overall free binding energy was averaged for each type of RBD. We are thankful for the Information and Communication Technology Office (ICTO), University of Macau for providing the High-Performance Computing Cluster (HPCC) resource and facilities for the study. SS, BT: method development, analysis, data acquisition and interpretation, draft manuscript; SMW: conception, design, data analysis and interpretation, draft and revise manuscript, and funding. None declared. Not required. Not required. All data relevant to the study are included in the article or uploaded as online supplemental information. Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 receptor Structural and Functional Basis of SARS-CoV-2 Entry by Using Human ACE2 Early transmissibility assessment of the N501Y mutant strains of SARS-CoV-2 in the United Kingdom Covid-19: Delta variant is now UK's most dominant strain and spreading through schools Molecular Dynamics Simulations of Biomolecules Classification of VUS and unclassified variants in BRCA1 BRCT repeats by molecular dynamics simulation Methods of protein structure comparison Efficient Determination of Protein-Protein Standard Binding Free Energies from First Principles Tracking Changes in SARS-CoV-2 Spike: Evidence that D614G Increases Infectivity of the COVID-19 Virus Antibody cocktail to SARS-CoV-2 spike protein prevents rapid mutational escape seen with individual antibodies Human neutralizing antibodies elicited by SARS-CoV-2 infection SARS-CoV-2 RBD antibodies that maximize breadth and resistance to escape Isolation of potent SARS-CoV-2 neutralizing antibodies and protection from disease in a small animal model Covid-19 Breakthrough Infections in Vaccinated Health Care Workers UCSF Chimera-A visualization system for exploratory research and analysis