key: cord-0143436-btxacvcz authors: Li, Jiacheng; Hou, Chengyu; Wang, Menghao; Liao, Chencheng; Guo, Shuai; Shi, Liping; Ma, Xiaoliang; Zhang, Hongchi; Jiang, Shenda; Zheng, Bing; Ye, Lin; Yang, Lin; He, Xiaodong title: Hydrophobic interaction determines docking affinity of SARS CoV 2 variants with antibodies date: 2021-02-28 journal: nan DOI: nan sha: 44b0e69c73fdbea080701380db7bedfe5a41ae12 doc_id: 143436 cord_uid: btxacvcz Preliminary epidemiologic, phylogenetic and clinical findings suggest that several novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants have increased transmissibility and decreased efficacy of several existing vaccines. Four mutations in the receptor-binding domain (RBD) of the spike protein that are reported to contribute to increased transmission. Understanding physical mechanism responsible for the affinity enhancement between the SARS-CoV-2 variants and ACE2 is the"urgent challenge"for developing blockers, vaccines and therapeutic antibodies against the coronavirus disease 2019 (COVID-19) pandemic. Based on a hydrophobic-interaction-based protein docking mechanism, this study reveals that the mutation N501Y obviously increased the hydrophobic attraction and decrease hydrophilic repulsion between the RBD and ACE2 that most likely caused the transmissibility increment of the variants. By analyzing the mutation-induced hydrophobic surface changes in the attraction and repulsion at the binding site of the complexes of the SARS-CoV-2 variants and antibodies, we found out that all the mutations of N501Y, E484K, K417N and L452R can selectively decrease or increase their binding affinity with some antibodies. The emergence of several variants of the novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) undoubtedly brings new challenges to the development of vaccines and therapeutic antibodies. A key question is whether existing COVID-19 vaccines can protect against infection or disease from these new SARS-CoV-2 variants. Due to the emergence of a new SARS-CoV-2 variant 501Y.V1 (B.1.1.7) in the United Kingdom, there was an unexpected increase in reported COVID-19 cases (1). In South Africa, a high transmission in the context of high population immunity are most likely caused by the emergence of another SARS-CoV-2 variant 501Y.V2 (B.1.351) (2) . Both the variants have a mutation (N501Y) in the receptor binding domain of the spike protein, which is reported to help increase transmission, and the increased transmission range is estimated to be between 40% and 70% (2) . The 6501Y.V2 variant also has two mutations in the spike protein (E484K and K417N) that give the antibody a potential immune escape opportunity (3) . The preliminary clinical trial results of the ChAdOx1 nCoV-19 vaccine showed 74% effectiveness in the United Kingdom, but only 22% in South Africa (4) . While another COVID-19 vaccine NVX-CoV2373 showed 89% effectiveness in the United Kingdom, but in South Africa, only 49%(5). All SARS-CoV-2 variants enter human cells by protein-protein docking to human angiotensin converting enzyme 2 (ACE2) on the host cell membrane via CoV spike (S) glycoproteins. Molecular structures of the S protein of SARS-CoV-2 have been observed at high resolution by using cryo-electron microscopy (cryo-EM) (6) (7) (8) . The complex structures of ACE2 bound to the SARS-CoV-2 S have also been experimentally determined (9) (10) (11) (12) . Surprisingly, a change from asparagine (N) to tyrosine (Y) in amino-acid position 501 of the RBD of SARS-CoV-2 S can obviously increase transmission, with estimates ranging between 40% and 70% for increased transmission. The region of the protein responsible for binding another molecule is known as the docking site (also sometimes called binding site) and is often a depression on the molecular surface. The key to SARS-CoV-2 infection is that the S protein can specifically bind to the ACE2 in a strong affinity manner. Specific binding of SARS-CoV-2 S and ACE2 forms the joint structure between the coronavirus and the host cell that enable the coronavirus enter the host cell (13) . In natural intracellular environment and extracellular medium, protein-protein docking are usually the contacts of high specificity established between two or more specific protein molecules, and erroneous protein-protein docking rarely occurs (14) . The chief characteristic of proteins that allows their diverse set of functions is their ability to dock with other proteins specifically and tightly. Proteinprotein docking is therefore considered one of the miracles of nature. The classic problem of protein-protein docking is the question of how a protein find its partner in its natural environment (15) . In particular, whether hydrogen bonds formation regulate proteinprotein docking remains a long-standing problem with poorly defined mechanisms (16) (17) (18) (19) (20) (21) . It is worth noting that hydrogen bonds formation is not a long-range physical force. Considering that only several hydrogen bonds between SARS-CoV-2 RBD and ACE2 can be identified in the complex (9) (10) (11) (12) 22) , the docking between the coronavirus and the host cell are most likely not dominated by hydrogen bond pairing between them. A recent theoretical study revolutionize the understanding of docking affinity of SARS-CoV-2 S with ACE2 and the physical mechanism for protein docking. Traditional theories hold that, protein-protein docking is guided by a variety of physical forces as follows: (i) hydrophobic effect, (ii) electrostatic forces, (iii) van der Waals forces, (iv) hydrogen bonding, (v) ionic bonding, (vi) entropy. While the new theoretical study emphasizes that only hydrophobic effect can play a decisive role for protein docking (16, 23) . In extracellular medium, hydrogen-bond competing is always present with water. Because, water molecules should be able to saturate the hydrogen bond formations of the hydrophilic surface of proteins before the protein docking (24) (25) (26) , due to water molecules having very strong polarity (26) (27) (28) . The interaction of protein surface with the surrounding water is often referred to as protein hydration shell (also sometimes called hydration layer) and is fundamental to structural stability of protein, because non-aqueous solvents in general denature proteins (29) . The hydration layer around a protein has been found to have dynamics distinct from the bulk water to a distance of 1 nm and water molecules slow down greatly when they encounter a protein (24) . Thus, hydrophilic side chains of proteins are normally hydrogen bonded with surrounding water molecules in aqueous environments, thereby preventing the surface hydrophilic side-chains of proteins from randomly hydrogen-bonding together (24, 25) (26) . This is the reason why proteins usually do not aggregate and crystallize in unsaturated aqueous solutions (30) . Thus, this problem may lie in our lack of understanding of how surface hydrophilic side-chains of a protein at the docking site can get rid of their hydrogen-bonded water molecules, then hydrogen-bond with other hydrophilic groups or approach to hydrophobic groups of a partner protein at early steps of the docking, given the lack of awareness of the importance of the shielding effect of water. The hydrophobic interaction between hydrophobic surficial areas of two proteins at the docking site can reintroduce entropy to the system via the breaking of their water cages which frees the ordered water molecules (31) . Thus, the hydrophobic collapse between hydrophobic surface areas of a protein and hydrophobic surface areas of another protein most likely triggers enthalpy-entropy compensation for the docking of the two proteins, enabling the hydrophilic groups at the docking site to discard hydrogen-bonded water molecules, and then hydrogen-bond with each other between the proteins or approach to hydrophobic groups of the partner protein. This means that the hydrophobic interaction among proteins should be responsible for the protein docking. The hydrophobic interaction between the SARS-CoV-2 S and ACE2 protein is found to be significantly greater than that between SARS-CoV S and ACE2 (32) . This explains why the affinity of SARS-CoV-2 RBD and ACE2 far exceeds that of SARS-CoV RBD and ACE2. Binding ability of a protein is mediated by the tertiary structure of the protein, which defines the docking site, and by the chemical properties of the surrounding amino acids' side chains (33) . The hydrophobicity of the protein surface is the main factor that stabilizes the protein-protein binding, thus hydrophobic interaction among proteins play an important role in determining the protein-protein binding affinity (23, 34, 35) . Hydrophilic side-chains are not completely hydrophilic. The hydrophilicity of hydrophilic side-chains is normally expressed by C=O or N-H2 groups at their ends, and the other portions of hydrophilic side-chains are hydrophobic, because the molecular structures of these portions are basically alkyl and benzene ring structures. It means that there are a large number of water molecules surround the hydrophobic surface areas of the RBD rather than hydrogen bonded with the RBD. The characteristic of these water molecules surrounding the hydrophobic surface areas is that their hydrogen bonding network is more ordered than free liquid water molecules, that is, their entropy is lower. At the docking site, a hydrophobic surface area of a protein facing to another hydrophobic surface area of the other protein play a role of increasing the binding affinity between of the two proteins. Whereas, a hydrophobic surface area of a protein facing to another hydrophilic surface area of the other protein play a role of decreasing the binding affinity between of the two proteins due to the repulsion. Thus, mutation of one amino acid residue at the docking site may can obviously change the hydrophobic interactive surface areas of the SARS-CoV-2 RBD at the docking site and may significantly change the hydrophobic interaction between of the SARS-CoV-2 S and ACE2, thereby greatly change the affinity between them. For example, the original hydrophilic asparagine (N) in amino-acid position 501 of the RBD was facing to a hydrophobic surface area one the ACE2 at the docking site. The change from asparagine (N) to tyrosine (Y) in amino-acid position 501 increase hydrophobic attraction area and decrease hydrophilic repulsion area between the two proteins, see Fig.1 . Therefore, a change from asparagine (N) to tyrosine (Y) in aminoacid position 501 of the RBD can change the original repulsing relationship between the RBD and ACE2 to an attractive relation between them in amino-acid position 501. By analyzing the mutation-induced surface hydrophobic or hydrophilic area changes at the binding site of the complex of the SARS-CoV-2 variants and ACE2, we found out that only the mutations N501Y obviously increase the binding affinity between the RBD to the ACE2, and the mutation E484K and L452R slightly increase the binding affinity between the RBD to the ACE2, see Fig.1 and Table 1 . It is worth noting that the change from Leucine (L) to arginine (R) in amino-acid position 452 due to the mutation L452R introduce a new arginine-glutamic acid (R-E) electrostatic attraction between the RBD and the ACE2 that may cause increment of the binding affinity. Use same method, we can evaluate the impact of these mutations in the RBD on protection capability of the existing antibodies. By analyzing the mutation-induced surface hydrophobic or hydrophilic area changes at the binding site of the complex of the SARS-CoV-2 variants and 9 existing antibodies, we found out that the mutation N501Y can selectively decrease the binding affinity with the antibodies CV30, and CC12.1, see the In this study, many experimentally determined native structures of proteins are used to study the mechanism triggering docking of SARS-CoV-2 variants to ACE2 and the antibodies. All the three-dimensional (3D) structure data of protein molecules are Affinity of RBD and ACE2 and antibodies can be characterized by calculating the size of the hydrophobic contact area in the complex structures. We used molecular 3D structure display software PyMOL to draw the hydrophobic and hydrophilic surface areas at the docking sites. We calculated the hydrophobic attraction surface areas and hydrophobic repulsion surface areas in the positions of mutations involved in the hydrophobic interaction among the RBD, ACE2 and antibodies in this study. Table 1 Sum of increment of hydrophobic attraction area and decrement of hydrophilic repulsion area among the variants, the ACE2 and the antibodies due to mutations (Å 2 ) Transmission of SARS-CoV-2 Lineage B.1.1.7 in England: Insights from linking epidemiological and genetic data. medrxiv Emergence and rapid spread of a new severe acute respiratory syndrome-related coronavirus 2 (SARS-CoV-2) lineage with multiple spike mutations in South Africa SARS-CoV-2 501Y.V2 escapes neutralization by South African COVID-19 donor plasma. biorxiv South Africa suspends use of AstraZeneca's COVID-19 vaccine after it fails to clearly stop virus variant Novavax vaccine delivers 89% efficacy against COVID-19 in U.K.-but is less potent in South Africa Controlling the SARS-CoV-2 Spike Glycoprotein Conformation Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 receptor Structural basis of receptor recognition by SARS-CoV-2 Structural and Functional Basis of SARS-CoV-2 Entry by Using Human ACE2 Structural basis for the recognition of SARS-CoV-2 by full-length human ACE2 Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2) Insights into Protein-Ligand Interactions: Mechanisms, Models, and Methods So Much More to Know & Regulation of protein-ligand binding affinity by hydrogen bond pairing Entropy-Enthalpy Compensation: Role and Ramifications in Biomolecular Ligand Recognition and Design Extent of enthalpy-entropy compensation in proteinligand interactions Comparative Assessment of Scoring Functions on an Updated Benchmark: 2. Evaluation Methods and General Results Further development and validation of empirical scoring functions for structure-based binding affinity prediction Summing up ligand binding interactions Evolution of the novel coronavirus from the ongoing Wuhan outbreak and modeling of its spike protein for risk of human transmission Optimized hydrophobic interactions and hydrogen bonding at the target-ligand interface leads the pathways of drug-designing Mapping hydration dynamics around a protein surface Universal Initial Thermodynamic Metastable state of Unfolded Proteins Water follows polar and nonpolar protein surface domains CHARMM: the biomolecular simulation program Structure relaxation via long trajectories made stable Protein structure and dynamics in nonaqueous solvents: insights from molecular dynamics simulation studies Introduction to protein crystallization A Hydrophobic-Interaction-Based Mechanism Triggers Docking between the SARS-CoV-2 Spike and Angiotensin-Converting Enzyme 2 Biotechnology Fundamentals Principles of protein-protein recognition A noncompeting pair of human neutralizing antibodies block COVID-19 virus binding to its receptor ACE2 Lin Yang is indebted to Daniel Wagner from the Weizmann Institute of Science and Liyong Tong from the University of Sydney for their support and guidance. Lin Yang is grateful for his research experience in the Weizmann Institute of Science for inspiration.The authors acknowledge the financial support from the National Natural Science