key: cord-0844076-7avqykyr
authors: Behloul, Nouredine; Baha, Sarra; Shi, Ruihua; Meng, Jihong
title: Role of the GTNGTKR motif in the N-terminal receptor-binding domain of the SARS-CoV-2 spike protein
date: 2020-06-09
journal: Virus Res
DOI: 10.1016/j.virusres.2020.198058
sha: c94ef620e69c5b065486edee798854adb3c3d49b
doc_id: 844076
cord_uid: 7avqykyr

The 2019 novel coronavirus disease (COVID-19) that emerged in China has been declared a public health emergency of international concern by the World Health Organization, and the causative pathogen was named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). In this report, we analyzed the structural characteristics of the N-terminal domain of the S1 subunit (S1-NTD) of the SARS-CoV-2 spike protein in comparison to SARS-CoV in particular, and to other viruses presenting similar characteristics in general. Given the severity and the wide and rapid spread of the SARS-CoV-2 infection, it is very likely that the virus recognizes other receptors/co-receptors besides ACE2. The NTD of SARS-CoV-2 contains a receptor-binding motif different from that of SARS-CoV, with some insertions that could confer new receptor-binding abilities on the new coronavirus. In particular, motifs similar to the insertion 72GTNGTKR78 have been found in structural proteins of other viruses, and these motifs were located in putative regions involved in recognizing protein and sugar receptors, suggesting that similar binding abilities could be displayed by the SARS-CoV-2 S1-NTD. Moreover, concerning the origin of these NTD insertions, our findings point towards an evolutionary acquisition rather than the hypothesis of an engineered virus.
Keywords: COVID-19; SARS-CoV-2; Coronavirus; receptor-binding domain; receptor-binding motif

A novel coronavirus emerged in the human population in the city of Wuhan (China), causing a severe respiratory illness that the World Health Organization named 2019 novel coronavirus disease (COVID-19); the pathogen was named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) (Huang et al., 2020; Li et al., 2020). Since its emergence in December 2019, the viral infection has spread to many Chinese cities and several countries, and the World Health Organization declared it a public health emergency of international concern by the end of January 2020. WHO situation report 114 (May 13th, 2020) indicated that more than 4 million confirmed cases had been reported globally, with nearly 32% of the cases in the USA alone; the total deaths caused by the disease reached 287 399 cases, mainly in the Americas and Europe (37% and 55%, respectively) (https://www.who.int/docs/default-source/coronaviruse/situation-reports/20200513-covid-19sitrep-114.pdf?sfvrsn=17ebbbe_4). Coronaviruses belong to the family Coronaviridae in the order Nidovirales and can be classified into four genera: Alpha-coronavirus, Beta-coronavirus, Gamma-coronavirus, and Delta-coronavirus (Cui et al., 2019; Perlman and Netland, 2009). They are enveloped, positive-stranded RNA viruses with the largest genomes among all RNA viruses, ranging from 27 to 32 kb (Fehr and Perlman, 2015). After the release of its genome sequence, SARS-CoV-2 was classified as a Beta-coronavirus, closely related to the severe acute respiratory syndrome coronavirus (SARS-CoV) that emerged in 2002 (Ksiazek et al., 2003; Peiris et al., 2003).
The coronavirus spike (S) protein forms large protrusions from the virus surface (spikes), giving the viral particles the appearance of having crowns (hence the name coronavirus) (Cui et al., 2019; Perlman and Netland, 2009; Zumla et al., 2016). These spikes represent the first contact with the host and mediate virus entry into host cells; in addition, the S protein has been linked to host and tissue tropism (Du et al., 2009; Li, 2016). Structurally, the coronavirus spikes are clove-shaped trimers of the S protein, with the asymmetric unit containing a large ectodomain, a single-pass transmembrane anchor, and a short intracellular tail (Kirchdoerfer et al., 2016; Smith et al., 2016). The ectodomain consists of a receptor-binding subunit S1 and a membrane-fusion subunit S2, with the S1 subunit containing distinct N-terminal and C-terminal domains (S1-NTD and S1-CTD) (Beniac et al., 2006; Walls et al., 2016). One of the major complexities of coronaviruses is their receptor recognition pattern. To date, several receptors have been found to be recognized by different coronaviruses (Li, 2015; Li, 2016). These receptors include zinc peptidases such as angiotensin-converting enzyme 2 (ACE2) (Hofmann et al., 2005; Li et al., 2003) and aminopeptidase N (APN) (Delmas et al., 1993; Li et al., 2007); dipeptidyl peptidase 4 (DPP4) (Raj et al., 2013; Yang et al., 2014); carcinoembryonic antigen-related cell adhesion molecule 1 (CEACAM1) (Dveksler et al., 1991; Williams et al., 1991); and sialic acid-containing receptors (Schwegmann-Wessels and Herrler, 2006). In this report, we compared the composition and the structural features of the S protein receptor-binding domains between SARS-CoV-2 and related viruses (especially SARS-CoV), and extrapolated from the findings to shed some light on how the S protein of the new coronavirus, and especially its receptor-binding domains, might function.
A total of 1652 sequences of S proteins of SARS-CoV-2 isolates from 29 countries were retrieved. The majority of the sequences were from the USA (1451), followed by China (66) (Supplementary file 1). The sequence identity to the reference sequence (YP_009724390.1) varied from 99.7 to 100%, indicating a high degree of conservation. Nearly 40% of these sequences were identical to the reference sequence, while the other 60% presented mostly single mutations (one mutated position in each sequence). The mutated positions (77 in total) are reported in Supplementary file 1. Of these mutated positions, three were shared by several sequences: position 791, where Thr was mutated to Ile in 5 of the Taiwanese sequences; position 829, where Ala was mutated to Thr in 9 sequences from Thailand; and, more importantly, position 614, where Gly appeared instead of Asp in 923 sequences. It should be noted that all three positions are located in the S2 subunit.

The S protein of SARS-CoV-2 shares 97.41% amino acid similarity with the recently identified bat-CoV RaTG13 isolate, 80.32% with a bat SARS-like CoV and only 76.27% identity with the SARS-CoV GZ02 isolate. Moreover, compared to the S1 subunit, the S2 subunit of the S proteins was found to be more conserved in the four strains (Figure 1 and Table 1). Further, comparing 100-aa segments of the SARS-CoV-2 S protein to the other three coronaviruses (Figure 1a) indicated that the region spanning aa1-400 of the S protein was more similar to the new bat isolate (>90%) than to the SARS-CoV or the bat SARS-like strains. A more pronounced dissimilarity was noted in the regions spanning aa401-500 and aa601-700, which correspond to the C-terminal domain of the S1 subunit (S1-CTD).
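The 100-aa segment comparison described above can be illustrated with a minimal Python sketch. It assumes the two S proteins have already been aligned pairwise (gaps written as "-") and that windows are counted along the reference residues; the function name `window_identity` and the toy sequences are hypothetical and are not taken from the original analysis, which relied on MEGA-X and BLASTp.

```python
def window_identity(aligned_ref, aligned_other, window=100):
    """Percent identity per window of reference residues.

    Both inputs are rows of one pairwise alignment (equal length, '-' = gap).
    Returns a list of (start, end, percent_identity) tuples, with 1-based
    positions counted along the ungapped reference sequence.
    """
    if len(aligned_ref) != len(aligned_other):
        raise ValueError("aligned sequences must have the same length")

    results = []
    matches = 0   # identical residues in the current window
    count = 0     # reference residues seen in the current window
    start = 1     # reference position where the current window begins
    pos = 0       # current reference position

    for ref_res, other_res in zip(aligned_ref, aligned_other):
        if ref_res == "-":
            # Insertion in the other sequence: no reference position to score.
            continue
        pos += 1
        count += 1
        if ref_res == other_res:
            matches += 1
        if count == window:
            results.append((start, pos, 100.0 * matches / count))
            start = pos + 1
            matches = 0
            count = 0

    if count:
        # Trailing window shorter than `window`.
        results.append((start, pos, 100.0 * matches / count))
    return results


# Toy alignment (not real spike sequences), scored in windows of 3 residues.
for start, end, identity in window_identity("AB-CDEFG", "ABXCDQFG", window=3):
    print(f"aa{start}-{end}: {identity:.1f}% identity")
```

With real data, the two arguments would be the aligned SARS-CoV-2 reference and one related strain, and `window=100` would mirror the segment size used in the text. Gapped columns in the other sequence count as mismatches here, which is one possible convention among several.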
The S1-NTD of SARS-CoV-2 is highly similar to that of the newly isolated bat coronavirus RaTG13 (>98%) but shares only roughly 53% and 67% similarity with those of SARS-CoV and the bat SARS-like CoV, respectively (Table 1). Structurally, we used the Dali server to align the S1-NTD of SARS-CoV-2 with those of other coronaviruses from the different genera (Table 2). As expected, the S1-NTD of SARS-CoV-2 was more similar to those of the Beta-coronaviruses, especially SARS-CoV (the highest Z-score, the highest sequence identity and the lowest RMSD). However, all the NTDs were aligned with Z-scores ranging from 6 to 22.1 and RMSDs ranging from 1.1 to 4.2 Å. This similarity is due to the galectin-like topology of the NTDs' core structures, as previously documented (Li, 2012). The primary sequence alignment also revealed some insertions shared by SARS-CoV-2 and the bat coronavirus RaTG13 but not SARS-CoV, located at positions aa72-82, aa144-147, aa244-246 and aa255-257 of the SARS-CoV-2 S protein (Figure 2). To further investigate the structural role of these inserts, we searched the Protein Data Bank using the largest insert, 72GTNGTKR78, extended by 5 amino acids on both the N- and C-terminal sides, giving the 17-aa segment 67AIHVSGTNGTKRFDNPV83; we then analyzed the hits to see whether the aligned motifs were engaged in any identified structural function.

When we searched the Protein Data Bank using the GTNGTKR motif, the first hit was the structure of the Mengo virus VP1 protein (Figure 3a). The aligned segment was located on the VP1 GH loop, which, together with the VP3 C-terminal loop, forms a depression on the capsid that has been associated with receptor recognition and binding (Kim et al., 1990; Krishnaswamy and Rossmann, 1990).
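The construction of the 17-aa query from the 7-aa insert can be sketched as follows. This is only an illustration under stated assumptions: the helper `flanked_segment` is hypothetical, and the toy sequence pads the 17-aa segment quoted in the text with placeholder "X" residues so that the motif falls at positions 72-78; it is not the real spike sequence.

```python
def flanked_segment(sequence, start, end, flank=5):
    """Extend a motif by `flank` residues on each side.

    `start` and `end` are 1-based inclusive coordinates of the motif.
    Returns (segment, segment_start, segment_end), clipped to the sequence.
    """
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("motif coordinates fall outside the sequence")
    seg_start = max(1, start - flank)
    seg_end = min(len(sequence), end + flank)
    return sequence[seg_start - 1:seg_end], seg_start, seg_end


# Placeholder sequence: 66 'X' residues, the 17-aa segment from the text,
# then 10 more 'X' residues. Only positions 67-83 carry real residues.
toy_spike = "X" * 66 + "AIHVSGTNGTKRFDNPV" + "X" * 10

segment, seg_start, seg_end = flanked_segment(toy_spike, 72, 78, flank=5)
print(f"{seg_start}{segment}{seg_end}")  # prints 67AIHVSGTNGTKRFDNPV83
```

The resulting segment would then be submitted as a short query to a PDB sequence search; that search step is not reproduced here.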
Although the depression described in the Mengo virus capsid is absent from the S1-NTD of SARS-CoV-2, the target motif 72GTNGTKRFDN81 forms a similarly exposed loop, flanked on both sides by two neighboring loops: the loop containing the 255SSG257 motif (the identified insert 4) and the N-terminal loop (18LTT20 motif) (Figure 3b). Whether this arrangement could play the same role as the Mengo virus VP1 and VP3 loops, which would in turn allow SARS-CoV-2 to interact with the same receptor, needs further investigation. Moreover, the Mengo virus has been found to bind the murine cellular receptor vascular cell adhesion molecule 1 (VCAM-1) to enter and infect cells (Huber, 1994). This receptor molecule is restricted to endothelial cells and is upregulated upon cytokine stimulation (Hosokawa et al., 2006; Singh et al., 2005). Given the high cytokine levels induced by SARS-CoV-2 (Huang et al., 2020), and given that pre-existing heart disease (hypertension and coronary heart disease) is one of the major co-morbidities in fatal cases (Deng and Peng, 2020), it would be interesting to explore the possibility that SARS-CoV-2 binds the VCAM-1 receptor via its S1-NTD. The mouse hepatitis coronavirus (MHV) also binds a cell adhesion molecule, the murine carcinoembryonic antigen-related cell adhesion molecule 1a (mCEACAM1a), using its S1-NTD (Peng et al., 2011). Therefore, as a next step, we compared the MHV and SARS-CoV-2 S1-NTDs. We found that the receptor-binding region of the MHV S1-NTD also presents a motif, 168NTNGNK173, with some similarity to insert 1 (72GTNGTKR78) of the SARS-CoV-2 S1-NTD. However, when the S1-NTDs were compared in the quaternary structures of the S proteins, these motifs seemed to occupy opposite positions (Figure 3c and d). In addition, Peng et al. identified four receptor-binding motifs in the MHV S1-NTD (RBM1-4) (Peng et al., 2011).
By comparing the MHV-receptor interaction interface with the exposed amino acids on the receptor-binding surface of the SARS-CoV-2 S1-NTD, we found that the N-terminal aa15-21 segments adopt different conformations (Figure 3d); in MHV, this segment (RBM1) contains three residues critical for receptor-binding affinity (Peng et al., 2011). Therefore, it seems unlikely that SARS-CoV-2 would bind the same receptor. However, this observation should be treated with caution since it is based on the predicted model of the SARS-CoV-2 S1-NTD. Taken together, the presence of the GTNGTKR motif in the SARS-CoV-2 S1-NTD seems to be a potential evolutionary feature that SARS-CoV-2 acquired to allow its S1-NTD to bind to protein receptors. We believe that the above observations are worth investigating.

Another structure containing the analyzed motif was the tail spike protein 1 of the bacteriophage CBA120 (Podoviridae) (Chen et al., 2014). The aligned motif GTNGTK was located within the receptor-binding domain, in the inverting region connecting the subdomains D3 and D4. Interestingly, unlike other tail spike proteins, where the sugar-binding sites are located on the D3 subdomain (Muller et al., 2008; Steinbacher et al., 1996; Xiang et al., 2009), the D3-D4 inverting region of the CBA120 tail spike protein generates a hole that forms the sugar-binding site (Chen et al., 2014). Although the target motif was not directly involved in the interactions with the sugar, the binding site (hole) is formed in the opposite direction of the GTNGTK loop, and a quite similar orientation of the motif is observed in the SARS-CoV-2 S1-NTD (Figure 4). In addition, the counterpart of the sugar-binding pocket of the CBA120 tail spike protein could be one of the two pockets formed in the SARS-CoV-2 S1-NTD: the first is situated on the top part of the domain, and the other is located above the β-sandwich core in the opposite direction of the GTNGTKR loop (Figure 4c).
This latter pocket is also aligned with the sugar-binding site in the NTD of bovine coronavirus (BCoV) (Figure 5a and c). Peng et al. (2012) reported that the pocket above the β-sandwich core is the sugar-binding site in the BCoV NTD; through mutagenesis studies, they identified four residues critical for the NTD-receptor interaction (Y162, E182, W184, and H185), and the binding was stabilized by the loop 10-11 (146NDLNKL151) (Figure 5c). Interestingly, the corresponding pocket in the SARS-CoV-2 NTD also contains three amino acids (E154, F157, and Y160) with the same orientation as the four key residues identified in the BCoV NTD. Moreover, the positions of E154 and Y160 are strikingly similar to those of Y162 and E182 in the BCoV NTD (Figure 5d). In addition, a counterpart of the stabilizing loop 10-11 is also present in the SARS-CoV-2 NTD; although shorter, it seems to share the NxxN motif. These observations suggest that the SARS-CoV-2 NTD might recognize a sugar receptor as well, likely the same Neu5,9Ac2 that the BCoV NTD binds (Peng et al., 2012).

The TNGTRRF motif was also present in the infectious bronchitis coronavirus (IBV) spike protein. Despite the low primary sequence identity (11%), the pairwise structural alignment revealed that the SARS-CoV-2 S1-NTD shared a relatively high structural similarity with the S1-NTD of IBV, with a Dali Z-score of 8.6 and an RMSD of 3.1 Å over 159 aligned residues (Table 2). The aligned motif TNGTRRF was located within the S1-CTD of IBV, in the subdomain connecting S1 and S2 (Shang et al., 2018). Although no functional features have been described for the subdomains of the S1-NTDs and S1-CTDs in coronavirus spikes, the target motif was found protruding from the surface of the trimer (Figure 6a), suggesting that such a protrusion might interact with the surrounding environment.
The aligned fragment was also located at the C-terminus of the Mimivirus cyclophilin, but it was missing from the deposited structure (Thai et al., 2008). Further, the authors did not assign any structural function to the segment of interest. However, it should be noted that this is the first virus-encoded cyclophilin and that it lacks peptidyl-prolyl isomerase activity, an activity for which several viruses, such as human immunodeficiency virus type 1 and SARS-CoV, exploit the host cyclophilin (Chen et al., 2005; Sorin and Kalpana, 2006). Interestingly, the viral cyclophilin was located on the surface of mature Mimivirus virions, and given the absence of catalytic activity, the authors suggested that the protein may play a structural role, yet to be identified, in the Mimivirus life cycle (Thai et al., 2008). Since the Mimivirus can cause pneumonia in humans (La Scola et al., 2005; Saadi et al., 2013), and the exact position of the cyclophilin is yet to be determined, one may ask whether the 221NGTKRF226 motif could play a role in the virus pathogenesis that could be shared by SARS-CoV-2.

Besides viruses, the search for a functional GTNGTKR motif in the deposited protein structures revealed a similar motif in the folylpolyglutamate synthetase (FPGS) of Lactobacillus casei, an enzyme that catalyzes the MgATP-dependent glutamylation of folate coenzymes (Figure 6b). The aligned motif 42IHVTGTNG49 was located on the putative nucleotide-binding P loop (GTNGKGS), which resembles the consensus P-loop sequence found in many other adenylate and uridylate kinases (Smith and Rayment, 1996; Sun et al., 1998). Moreover, an Ω loop near the P-loop binding site, and especially its serine residue, was also suggested to play a role in the activity of the FPGS. Interestingly, a similarly shaped loop is also found in the SARS-CoV-2 S1-NTD adjacent to the GTNGTKR loop; it is formed by insert 4 (254SSSG257), which is also rich in serine residues (Figure 6c and d).
Given the severity and the widespread nature of the infection, it is reasonable to assume that SARS-CoV-2 has a more efficient way to penetrate and infect cells. In addition, based on the comparison of the SARS-CoV-2 and SARS-CoV receptor-binding domains, it seems that SARS-CoV-2 evolved in a way that allowed it to maintain and enhance the binding of the ACE2 receptor via its S1-CTD, while also acquiring a different S1-NTD that, according to our analysis, might bind other receptors (protein or sugar receptors). More precisely, the acquisition of the GTNGTKR motif, found at the active sites of structural and nonstructural proteins of other viruses and organisms, might allow SARS-CoV-2 to recognize other receptors/co-receptors besides ACE2. Moreover, our results suggest that the appearance of the GTNGTKR motif points more toward an evolutionary trait of SARS-CoV-2 than toward the hypothesis of an engineered virus. Under functional constraints, proteins tend to evolve in a way that lets their tertiary structures perform the needed functions regardless of the changes in their primary sequences (Goldstein, 2008; Siltberg-Liberles et al., 2011; Worth et al., 2009), and the two main factors driving the evolution of the S proteins are the need for better adaptation to the host receptors and the need to evade the host immune system to ensure better infectivity (Li, 2015; Li, 2016). Therefore, it is more plausible to assume that SARS-CoV-2 acquired the GTNGTKR motif during its evolutionary course under functional constraints. As for the exact mechanism of acquisition and the origin of this motif, we believe that further investigations are needed, not only in the context of the SARS-CoV-2 infection but also because the motif appears pertinent to the activity of viral proteins in general.

A total of 1652 complete SARS-CoV-2 S protein sequences available at the NCBI Virus portal were retrieved.
The sequences of the SARS-CoV GZ02 isolate (AY390556) and two bat isolates, a bat SARS-like coronavirus (MG772934) and the recently isolated RaTG13 bat coronavirus (MN996532), were also retrieved, and their S glycoproteins were compared to that of SARS-CoV-2 (RefSeq: YP_009724390.1). First, we performed a multiple alignment of the S proteins of the 1652 SARS-CoV-2 strains to see if any dissimilarities were present, and analyzed the occurrence of mutations in comparison to the reference sequence. Next, we compared the S glycoprotein of SARS-CoV-2 (RefSeq: YP_009724390.1) to those of the three related coronavirus strains mentioned above by: 1) aligning the full-length proteins of the four strains together; 2) aligning the full-length SARS-CoV-2 S protein to that of each of the related strains separately; 3) aligning portions (100-aa windows) of the SARS-CoV-2 S protein to that of each of the related strains separately. All sequence alignments were performed using the Muscle algorithm implemented in the MEGA-X software or the BLASTp suite of the U.S. National Library of Medicine. To search for motifs similar to the GTNGTKR motif in structures deposited in the Protein Data Bank, the BLASTp suite of the U.S. National Library of Medicine was used, with parameters adjusted to search for a short input sequence. Three cryo-EM structures of the SARS-CoV-2 spike protein (containing the S1-NTD) were retrieved from the Protein Data Bank (PDB ID: 6vyb, 6vxx, and 6vsb). Since all of these structures lack some fragments of interest (especially the GTNGTKR motif), the sequence of the S glycoprotein of SARS-CoV-2 (Reference ID: YP_009724390.1) was submitted to the I-TASSER (https://zhanglab.ccmb.med.umich.edu/I-TASSER/) and SWISS-MODEL (https://swissmodel.expasy.org/) servers for the prediction of complete 3D structure models (Waterhouse et al., 2018; Yang and Zhang, 2015).
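The tally of mutations relative to the reference sequence can be sketched in a few lines of Python. This is a minimal illustration, assuming the isolates are already rows of a multiple alignment that includes the reference; the function name `tally_substitutions` and the four-residue toy sequences are hypothetical, and gaps or ambiguous residues ("X") in an isolate are simply skipped, which is an assumption rather than the authors' stated rule.

```python
from collections import Counter


def tally_substitutions(aligned_ref, aligned_isolates):
    """Count substitutions relative to the reference across aligned isolates.

    Returns a Counter keyed by (position, reference_residue, isolate_residue),
    with 1-based positions counted along the ungapped reference.
    """
    counts = Counter()
    for isolate in aligned_isolates:
        if len(isolate) != len(aligned_ref):
            raise ValueError("all sequences must come from the same alignment")
        pos = 0
        for ref_res, iso_res in zip(aligned_ref, isolate):
            if ref_res == "-":
                continue  # column absent from the reference
            pos += 1
            # Skip gaps and ambiguous residues in the isolate (assumption).
            if iso_res != ref_res and iso_res not in "-X":
                counts[(pos, ref_res, iso_res)] += 1
    return counts


# Toy example (not real spike data): two isolates share one substitution.
reference = "MADT"
isolates = ["MADT", "MGDT", "MGDT", "MADI"]
for (pos, ref_res, iso_res), n in sorted(tally_substitutions(reference, isolates).items()):
    print(f"{ref_res}{pos}{iso_res}: {n} sequence(s)")
```

Positions whose count is greater than one correspond to mutations shared by several sequences, as reported for positions 614, 791 and 829.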
The quality of the predicted 3D structures was evaluated using the MolProbity server (http://molprobity.biochem.duke.edu) (Williams et al., 2018) and the best models were selected for the analysis. All structural alignments were performed using the Dali server (http://ekhidna2.biocenter.helsinki.fi/dali/) (Holm, 2020).

The authors have no competing interests to declare.

Figure 1. [...] regions identified in the similarity analysis were mapped on the SARS-CoV-2 S protein trimer predicted by the SWISS-MODEL server: region aa1-400 is depicted in blue, region aa401-500 is depicted in red, region aa501-600 is depicted in cyan, region aa601-700 is depicted in hot pink, and the C-terminal region starting from aa701 is depicted in yellow.

Figure 5. [...] 4H14). b) The SARS-CoV-2 S1-NTD in the same orientation as the BCoV S1-NTD in (a), with the 72GTNGTKR78 motif on the far side colored in red and the counterpart of the loop 10-11, with a conserved NxxN motif, in the front (also colored in red). c and d) Different orientations of the BCoV and SARS-CoV-2 S1-NTDs obtained by a 70° rotation about the x-axis of (a) and (b), respectively; the key residues for the interactions with sugar moieties in the BCoV S1-NTD and their possible counterparts in the SARS-CoV-2 S1-NTD are shown as red sticks in (c) and (d), respectively.

Figure 6. [...] 3JCL); chains A, B and C are colored in cyan, blue and magenta, respectively; the protruding 511TNGTRRF517 motif in the IBV trimer is colored in yellow in chains A and B. b) The folylpolyglutamate synthetase (FPGS) of Lactobacillus casei (PDB ID: 1FGS); the putative nucleotide-binding P loop, harboring the 42IHVTGTNG49 motif, is shown in red, the Ω loop is shown in yellow, and the ligand is shown as magenta spheres. c) A top view of the SARS-CoV-2 S1-NTD showing the adjacent 72GTNGTKR78 (insert 1) and 254SSSG257 (insert 4) loops in red and yellow, respectively [...]

Table 2. [...]
[...]CoV      23   9  100   21   18   10   11
SARS-CoV     63  14   21  100   20   12   13
MHV          23  12   18   20  100   11   14
Pd-CoV       11  28   10   12   11  100   16
IBV          11  11   11   13   14   16  100
hGALECTIN    12   9    7   10    8    9    3
SARS-CoV-2: severe acute respiratory syndrome coronavirus 2 (PDB ID: 6vyb); NL63 respiratory coronavirus; middle-east respiratory syndrome coronavirus (PDB ID: 5x5f); severe acute respiratory syndrome coronavirus (PDB ID: 5x58); MHV: mouse hepatitis coronavirus; porcine delta coronavirus (PDB ID: 6b7n); IBV: infectious bronchitis coronavirus (PDB ID: 6cv0); hGALECTIN: human galectin-3 (PDB ID: 1a3k).

Crystal structure of Escherichia coli phage HK620 tailspike: podoviral tailspike endoglycosidase modules are evolutionarily related
Architecture of the SARS coronavirus prefusion spike
Crystal structure of ORF210 from E. coli O157:H1 phage CBA120 (TSP1), a putative tailspike protein
Function of HAb18G/CD147 in invasion of host cells by severe acute respiratory syndrome coronavirus
Origin and evolution of pathogenic coronaviruses
Further characterization of aminopeptidase-N as a receptor for coronaviruses
Characteristics of and Public Health Responses to the Coronavirus Disease
The spike protein of SARS-CoV--a target for vaccine and therapeutic development
Cloning of the mouse hepatitis virus (MHV) receptor: expression in human and hamster cell lines confers susceptibility to MHV
Coronaviruses: an overview of their replication and pathogenesis
The structure of protein evolution and the evolution of protein structure
Human coronavirus NL63 employs the severe acute respiratory syndrome coronavirus receptor for cellular entry
Using Dali for Protein Structure Comparison
Cytokines differentially regulate ICAM-1 and VCAM-1 expression on human gingival fibroblasts
VCAM-1 is a receptor for encephalomyocarditis virus on murine vascular endothelial cells
Conformational variability of a picornavirus capsid: pH-dependent structural changes of Mengo virus related to its host receptor attachment site and disassembly
Pre-fusion structure of a human coronavirus spike protein
Structural refinement and analysis of Mengo virus
A novel coronavirus associated with severe acute respiratory syndrome
Mimivirus in pneumonia patients
Porcine aminopeptidase N is a functional receptor for the PEDV coronavirus
Evidence for a common evolutionary origin of coronavirus spike protein receptor-binding subunits
Receptor recognition mechanisms of coronaviruses: a decade of structural studies
Structure, Function, and Evolution of Coronavirus Spike Proteins
Angiotensin-converting enzyme 2 is a functional receptor for the SARS coronavirus
An intersubunit active site between supercoiled parallel beta helices in the trimeric tailspike endorhamnosidase of Shigella flexneri Phage Sf6
Coronavirus as a possible cause of severe acute respiratory syndrome
Crystal structure of mouse coronavirus receptor-binding domain complexed with its murine receptor
Crystal structure of bovine coronavirus spike protein lectin domain
Coronaviruses post-SARS: update on replication and pathogenesis
Dipeptidyl peptidase 4 is a functional receptor for the emerging human coronavirus-EMC
First isolation of Mimivirus in a patient with pneumonia
Sialic acids as receptor determinants for coronaviruses
Cryo-EM structure of infectious bronchitis coronavirus spike protein reveals structural and functional evolution of coronavirus spike proteins
The evolution of protein structures and structural ensembles under functional constraint
Cytokine stimulated vascular cell adhesion molecule-1 (VCAM-1)
Active site comparisons highlight structural similarities between myosin and other P-loop proteins
Crystal structure of phage P22 tailspike protein complexed with Salmonella sp. O-antigen receptors
Structural homologies with ATP- and folate-binding enzymes in the crystal structure of folylpolyglutamate synthetase
Structural, biochemical, and in vivo characterization of the first virally encoded cyclophilin from the Mimivirus
Cryo-electron microscopy structure of a coronavirus spike glycoprotein trimer
SWISS-MODEL: homology modelling of protein structures and complexes
MolProbity: More and better reference data for improved all-atom structure validation
Receptor for mouse hepatitis virus is a member of the carcinoembryonic antigen family of glycoproteins
Structural and functional constraints in the evolution of protein families
Crystallographic insights into the autocatalytic assembly mechanism of a bacteriophage tail spike
I-TASSER server: new development for protein structure and function predictions
Receptor usage and cell entry of bat coronavirus HKU4 provide insight into bat-to-human transmission of MERS coronavirus
Coronaviruses - drug discovery and therapeutic options