key: cord-0897928-96v185b2 authors: Luan, Junwen; Lu, Yue; Jin, Xiaolu; Zhang, Leiliang title: Spike protein recognition of mammalian ACE2 predicts the host range and an optimized ACE2 for SARS-CoV-2 infection date: 2020-03-19 journal: Biochem Biophys Res Commun DOI: 10.1016/j.bbrc.2020.03.047 sha: d0dafaa2700d9fa0289b620ce87992f9cf393ae1 doc_id: 897928 cord_uid: 96v185b2 SARS-CoV-2 causes the recent global COVID-19 public health emergency. ACE2 is the receptor for both SARS-CoV-2 and SARS-CoV. To predict the potential host range of SARS-CoV-2, we analyzed the key residues of ACE2 for recognizing S protein. We found that most of the selected mammals including pets (dog and cat), pangolin and Circetidae mammals remained the most of key residues for association with S protein from SARS-CoV and SARS-CoV-2. The interaction interface between cat/dog/pangolin/Chinese hamster ACE2 and SARS-CoV/SARS-CoV-2 S protein was simulated through homology modeling. We identified that N82 in ACE2 showed a closer contact with SARS-CoV-2 S protein than M82 in human ACE2. Our finding will provide important insights into the host range of SARS-CoV-2 and a new strategy to design an optimized ACE2 for SARS-CoV-2 infection. Corona Virus Disease 2019 (COVID- 19) , which was reported from Wuhan city, Hubei province of China, has caused over 78,000 human infections and more than 2700 deaths (as of February 25, 2020) [1, 2] . Severe Acute Respiratory Syndrome Corona Virus 2 (SARS-CoV-2) was identified as the pathogen of COVID- 19 [1e3] . After SARS-CoV and MERS-CoV, SARS-CoV-2 has become the third coronavirus that causes severe respiratory disease and human death [4, 5] . Belonging to the subgenus sarbecvirus of Coronaviridae, both SARS-CoV-2 and SARS-CoV are human SARS-related coronavirus (SARSr-CoV). Its genome is a single-stranded RNA composed of about 30 kb nucleotides. SARS-CoV-2 encodes at least four major structural proteins, namely spike protein (S), membrane protein (M), envelope protein (E), and nucleocapsid protein (N) [6] . S protein, which is a type I glycoprotein, protrudes from the surface of the virus and can contact the host cell earlier. S protein has attracted great attention because of its function in receptor binding. Angiotensin-converting enzyme 2 (ACE2) binds to the receptorbinding motif (RBM) in the receptor-binding domain (RBD) of SARS-CoV and functions as a receptor for SARS-CoV [7, 8] . ACE2 is widely distributed in heart, liver, testis, kidney, intestine and other tissues. It has the physiological functions of regulating heart and kidney function and controlling blood pressure [9] . Recently, it has been found that human ACE2 promoted the entry of SARS-CoV-2 into the cells [3, 10] . RBD domain of SARS-CoV-2 interacts with human ACE2. Thus, ACE2 is defined as the receptor for SARS-CoV-2. The specificity of the interaction between virus and receptor determines the host tropism and host range. The origin of SARS-CoV-2 is presumed to be bat [3] . However, the intermediate host is not clear, and some studies suggest that pangolin is involved in the evolution of SARS-CoV-2 [11, 12] . It is not clear which mammals are involved in the evolution of SARS-CoV-2 and which animals may be infected by SARS-CoV-2. By sequence alignment of key amino acids binding to RBD in ACE2, the interaction between RBD of SARS-CoV-2/SARS-CoV and mammalian ACE2 was predicted. Based on the potential interaction between S protein and mammalian ACE2, it was speculated that SARS-CoV-2/SARS-CoV preserved the ability to infect many mammals including cat, dog, pangolin and Chinese hamster. From the structure stimulation, we identified N82 in ACE2 show closer contact with F486 of SARS-CoV-2 S protein compared with M82 of ACE2. The S protein sequence of SARS-CoV-2 is YP_009724390.1, and the S protein sequence of SARS-CoV is NP_828851.1. RBM of SARS-CoV is from 424 to 494. RBM of SARS-CoV-2 is from 437 to 508. A total of 42 mammalian ACE2 protein sequences were selected from the wild animal protection lists of Hubei Province and Jiangxi Province, primates, bats, dog, and cat. These sequences are as follows: hACE2: Homo sapiens (BAB40370. Cavia porcellus (XP_023417808.1), CgACE2: Cricetulus griseus (A0A061HZ66). Based on the known key sites in SARS-CoV S protein interacting with human ACE2, we analyzed whether these sites were conserved on ACE2 from wild mammals and domestic pets. Phylogenetic and molecular evolutionary analyses of ACE2 were conducted using MEGA version X [13] . Phylogenetic tree was generated with Jones-Taylor-Thornton (JTT) evolutionary model using Maximum Likelihood method. The interaction interfaces of SARSr-CoV S and ACE2 from cat/ dog/pangolin/Chinese hamster were simulated by Chimera software Ver 1.14 [14] . The simulation were based on the structures of hACE2 with SARS-CoV S RBD (PDB: 2AJF) [7] and hACE2 with SARS-CoV-2 S RBD (PDB: 6LZG). Human ACE2 is the receptor for both SARS-CoV and SARS-CoV-2. According to the literature [15] , the key amino acids (AAs) in the S protein of SARS-CoV interacting with human ACE2 are Y442, L472, N479, D480, T487, Y491 [7, 15] . We compared the RBM in S protein of SARS-CoV-2 with that of SARS-CoV (Fig. 1A) . These amino acids corresponding to SARS-CoV-2 are L455, F486, Q493, S494, N501 and Y505 (Fig. 1A) . Although five of six key AAs in SARS-CoV-2 are changed compared with SARS-CoV, the overall structure of interfaces of ACE2-RBM are similar (Fig. 1B) . The key AAs in hACE2 for interacting with RBM are K31, E35, D38, M82 and K353 [7, 15] . Among them, K31 and K353 in hACE2 are most critical residues for RBM recognition. Because the overall structure of interfaces of ACE2-RBM in SARS-CoV-2 and SARS-CoV are similar, we analyzed the key RBD recognizing AAs of ACE2 protein from selected mammals, as shown in Table 1 . We predicted that the mammals whose ACE2 could bind to the S protein of SARS- Next, we constructed a phylogenetic tree for mammalian ACE2 proteins. The ACE2 protein sequences of 42 mammalian animals were compared by ClustalW method of MEGA-X software. Then the JTT model of maximum likelihood method was used to construct ACE2 phylogenetic tree. As shown in Fig. 2 , species that cannot bind to S protein are marked in red, and species that can bind to S protein are marked in green. No correlation between genetic distance and the interaction of ACE2/S was found. Some Pets including dog (Canis lupus familiaris) and cat (Felis catus) potentially recognize S protein, indicating the importance to monitor the pets for SARS-CoV-2 infection. We found that four members of Circetidae including Mesocricetus auratus, Phodopus campbelli, Ictidomys tridecemlineatus, and Cricetulus griseus remained the key residues for association with S protein from SARS-CoV and SARS-CoV-2, though two members of Muridae (Rattus norvegicus, Mus musculus) could not bind to S protein. This founding suggested that Circetidae mammals could be developed as SARSr-CoV small animal models. We noticed that ACE2 from dog, cat, pangolin and Chinese hamster potentially associated with S protein (Table 1 ). Next, we predicted the structure of cat/dog/pangolin/Chinese hamster ACE2 with SARSr-CoV RBD. The structure of protein complex between RBD region of S protein of SARS-CoV and human ACE2 has been resolved (PDB: 2AJF) [7] . Recently, the structure of SARS-CoV-2 RBD with human ACE2 was also determined [16, 17] . We used Chimera software to display homologous model, and obtained the interaction complex structure of RBD region of SARSr-CoV (SARS-CoV-2 and SARS-CoV) and cat/dog/pangolin/Chinese hamster ACE2. Overall, the RBM structures of S protein of SARS-CoV-2 and SARS-CoV are similar (Fig. 3) . Interaction interface of SARSr-CoV RBD and cat/dog/pangolin/Chinese hamster ACE2 confirmed the potential interaction between SARS-CoV-2 and cat/dog/pangolin/ Chinese hamster ACE2, indicating that these ACE2 could support SARS-CoV-2 entry. The AA in 82 position of human ACE2 is M82, while the corresponding AA in cat, dog, pangolin and Chinese hamster ACE2 is T82, T81, N82 and N82, respectively. The distance between F486 of SARS-CoV-2 S protein and the corresponding AA in ACE2 is 3.753Å for human (Fig. 1A) , 2.695Å for dog (Fig. 3A) , 3.753Å for cat (Fig. 3B ), 1.621Å for pangolin (Fig. 3C) , and 2.024Å for Chinese hamster (Fig. 3D) , respectively. We concluded that N82 in ACE2 showed closer contact with F486 of SARS-CoV-2 S protein than M82 of ACE2. The host tropism of zoonotic coronavirus is hybrid, and it is important to determine the natural host and host range of coronavirus. In the past two decades, SARS-CoV, MERS-CoV and SARS-CoV-2 have caused serious outbreaks of human infectious diseases. All the three human coronaviruses originated from bats, but the intermediate hosts were different. SARS-CoV is believed to come from the Paguma larvata [18] , and the intermediate host of MERS-CoV is Camelus dromedaries [19] . The new coronavirus SARS-CoV-2 has recently caused a serious epidemic in China, but its intermediate hosts are not clear. S protein of SARS-CoV-2 interacts with human ACE2, which promotes the entry of SARS-CoV-2, indicating that human ACE2 is the receptor of SARS-CoV-2 [3] . ACE2 contains at least five key amino acids critical for binding S protein of SARSr-CoV [15] . Based on these five amino acids, we analyzed the corresponding amino Table 1 Prediction of the RBD binding capacity of mammalian ACE2. AA position matched AA binding capacity 31 35 38 82 353 hACE2 Phylogenetic tree of mammalian ACE2 proteins. ACE2 sequences from a total of 42 mammals were analyzed by MEGA-X and the phylogenetic tree was constructed with JTT evolutionary model using Maximum Likelihood method. The red represents the species whose ACE2 cannot bind to S protein, and the green is the species whose ACE2 associate with S protein. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.) acids of different mammals to determine which mammalian ACE2 could interact with S protein of human SARSr-CoV. By analyzing the protein sequence of mammalian ACE2, we found that the ACE2 of Camelus dromedarius, Procyon lotor, Rhinolophus ferrumequinum, Rattus norvegicus, Mus musculus, Ornithorhynchus anatinus, Loxodonta africana, Erinaceus europaeus, Nyctereutes procyonoides, Suricata suricatta, Dipodomys ordii, and Cavia porcellus lose the capability to associate with S protein (Table 1) . These mammals could be ruled out from the potential host list for SARS-CoV-2. We found that S protein may bind to ACE2 from some wild mammals, which suggests that we should investigate whether these animals may be intermediate hosts for SARS-CoV-2. It has been reported that the RBM region in S protein of pangolin coronavirus is similar to that of S protein of SARS-CoV-2 [11, 12] , which may be involved in the recombination of SARS-CoV-2. We identified that N82 of pangolin ACE2 showed closer contact with RBD than human ACE2 (Fig. 3C) , indicating that pangolin ACE2 might show better affinity to SARS-CoV-2. This finding further supports the hypothesis that pangolin is involved in SARS-CoV-2 evolution. In current study, only a limited list of wild mammals is covered. In the future, we should select more mammals for study. Although no SARS-CoV-2 has been found in domestic cats and dogs, cat/dog ACE2 may bind to S protein of SARS-CoV-2. In the future, we should pay attention to monitoring whether domestic cats and dogs could be infected by SARS-CoV-2. Animal model is an important tool in the study of infectious diseases. ACE2 of mice cannot interact with SARS-CoV-2, so it cannot be used as animal model of SARS-CoV-2 directly. Some studies have generated mice transfected with human ACE2 as the models to study SARS-CoV [20] , and these mice can also be used as animal models for SARS-CoV-2 infection. Interestingly, we identified that N82 in ACE2 is closer to RBD than M82 (Fig. 3C and D) , indicating a novel strategy to design an optimized ACE2 for SARS-CoV-2 infection. We speculated that small peptide based on N82 of ACE2 might show higher affinity to SARS-CoV-2 RBD. We proposed that if M82 in human ACE2 was mutated to N82, the modified human ACE2 will enhance SARS-CoV-2 infection. In the future, those ideas will be tested in cell culture and animal model. We noticed that the ACE2 proteins from Circetidae (Mesocricetus auratus, Phodopus campbelli, Ictidomys tridecemlineatus, and Cricetulus griseus) are capable to recognize RBD. Mesocricetus auratus (golden hamster) and Cricetulus griseus (Chinese hamster) are experimental animals and our finding indicates the possibility to develop small animal models for SARS-CoV-2 infection using Chinese hamster and golden hamster. The authors declare that there are no conflicts of interest. A novel coronavirus from patients with pneumonia in China Structure simulation of SARSr-CoV RBD with ACE2 from dog, cat, pangolin and Chinese hamster. (A) Structural simulation of the protein complex of dog ACE2 and SARSr-CoV RBD. Dog ACE2, SARS-CoV-2 RBD, SARS-CoV RBD are in gold, blue, and green, respectively. (B) Structural simulation of the protein complex of cat ACE2 and SARSr-CoV RBD. Cat ACE2, SARS-CoV-2 RBD, and SARS-CoV RBD are in pulm RBD are in sandy brown, blue and green, respectively. (D) Structural simulation of the protein complex of Chinese hamster ACE2 and SARSr-CoV RBD. Chinese hamster ACE2, SARS-CoV-2 RBD, and SARS-CoV RBD are in dim gray, blue and green, respectively A new coronavirus associated with human respiratory disease in China Epidemiology and cause of severe acute respiratory syndrome (SARS) in Guangdong, People's Republic of China Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia Genome composition and divergence of the novel coronavirus (2019-nCoV) originating in China Structure of SARS coronavirus spike receptor-binding domain complexed with receptor Angiotensinconverting enzyme 2 is a functional receptor for the SARS coronavirus Circulating ACE2 in cardiovascular and kidney diseases Functional assessment of cell entry and receptor usage for SARS-CoV-2 and other lineage B betacoronavirus Evidence of recombination in coronaviruses implicating pangolin origins of nCoV-2019, bioRxiv Molecular evolutionary genetics analysis across computing platforms UCSF Chimera-a visualization system for exploratory research and analysis Receptor recognition by novel coronavirus from Wuhan: an analysis based on decade-long structural studies of SARS Structural basis for the recognition of the 2019-nCoV by human ACE2, bioRxiv Crystal structure of the 2019-nCoV spike receptor-binding domain bound with the ACE2 receptor, bioRxiv Molecular evolution analysis and geographic investigation of severe acute respiratory syndrome coronavirus-like virus in palm civets at an animal market and on farms Tropism and replication of Middle East respiratory syndrome coronavirus from dromedary camels in the human respiratory tract: an in-vitro and ex-vivo study Mice transgenic for human angiotensin-converting enzyme 2 provide a model for SARS coronavirus infection Transparency document related to this article can be found online at https://doi.org/10.1016/j.bbrc.2020.03.047.