key: cord-0706195-2nw6ja5w authors: Dragelj, Jovan; Mroginski, Maria Andrea; Ebrahimi, Kourosh H. title: Hidden in Plain Sight: Natural Products of Commensal Microbiota as an Environmental Selection Pressure for the Rise of New Variants of SARS‐CoV‐2 date: 2021-07-26 journal: Chembiochem DOI: 10.1002/cbic.202100346 sha: adef5a9b5083acf1437b3368c7e474d60e1bc219 doc_id: 706195 cord_uid: 2nw6ja5w Since the emergence of SARS‐CoV‐2, little attention has been paid to the interplay between the interaction of virus and commensal microbiota. Here, we used molecular docking and dynamics simulations to study the interaction of some of the known metabolites and natural products (NPs) produced by commensal microbiota with the receptor binding domain (RBD) of the spike glycoprotein of SARS‐CoV‐2. The results predict that NPs of commensal microbiota such as bile acids and non‐ribosomal peptides (NRPs), of which some are siderophores, bind to the wild‐type RBD and interfere with its binding to the ACE2 receptor. N501Y mutation, which is present in many of the emerging variants of the virus, abolishes the predicted binding pocket of bile acids and NRPs. Based on these findings, available experimental data showing that bile acids reduce the binding affinity of wild‐type RBD to the ACE2 receptor, and the data suggesting that the respiratory tract microbiota affect viral infection we put forward the following proposal: mutations such as N501Y enable the RBD to bind to the ACE2 receptor more effectively in the presence of NPs produced by the respiratory tract bacteria thereby, increasing the infectivity rate of the virus. We hope our data stimulate future works to better understand the interactions of NPs produced by commensal microbiota with respiratory viruses like SARS‐CoV‐2. Since the emergence of SARS-CoV-2, little attention has been paid to the interplay between the interaction of virus and commensal microbiota. Here, we used molecular docking and dynamics simulations to study the interaction of some of the known metabolites and natural products (NPs) produced by commensal microbiota with the receptor binding domain (RBD) of the spike glycoprotein of SARS-CoV-2. The results predict that NPs of commensal microbiota such as bile acids and nonribosomal peptides (NRPs), of which some are siderophores, bind to the wild-type RBD and interfere with its binding to the ACE2 receptor. N501Y mutation, which is present in many of the emerging variants of the virus, abolishes the predicted binding pocket of bile acids and NRPs. Based on these findings, available experimental data showing that bile acids reduce the binding affinity of wild-type RBD to the ACE2 receptor, and the data suggesting that the respiratory tract microbiota affect viral infection we put forward the following proposal: mutations such as N501Y enable the RBD to bind to the ACE2 receptor more effectively in the presence of NPs produced by the respiratory tract bacteria thereby, increasing the infectivity rate of the virus. We hope our data stimulate future works to better understand the interactions of NPs produced by commensal microbiota with respiratory viruses like SARS-CoV-2. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has become a global emergency since early 2020. It is now clear that upon entry to the respiratory tract the virus uses its homotrimeric spike glycoprotein (S protein) to bind to its hostcell receptor angiotensin-converting enzyme 2 (ACE2) in the nasal epithelial cells, [1] the primary site of SARS-CoV-2 infection (Figure 1a ). The respiratory tract is covered by mucus, which consists of mucins and various bacteria (Figure 1a ). [2] This healthy mucus is the first barrier for respiratory viruses such as SARS-CoV-2. It is shown that bacteria residing in the respiratory tract affect influenza virus infection. [3] [4] [5] The microbial community of the mucus is proposed to be the gatekeeper to respiratory health. [6] It is shown that changes in the population and diversity of these microbiota are linked to respiratory viral infections [3] including SARS-CoV-2. [7] [8] [9] In-silico studies predict that the upper respiratory tract (URT) bacteria produce proteins with the ability to bind to the RBD reducing its capacity to cause severe infection, [7] and more recently it is suggested that the virus may have phage-like behaviour. [10] In this respect, the potential interactions of natural products (NPs) of respiratory tract bacteria with SARS-CoV-2 has remained unexplored. The primary site of the interaction of SARS-CoV-2 with the mucus and the host cell is its homotrimeric spike glycoprotein. (a) SARS-CoV-2 is a respiratory virus and upon entry to the respiratory system of a healthy individual it faces mucus, which consists of mucin and a healthy population of bacteria. (b) After pathing through the mucus, viral entry to epithelial cells requires the proteolytic activity of TMPRSS2 to induce conformational changes in the spike glycoprotein. Consequently, RBD is exposed and interacts with the ACE2 receptor. (c) Structure of RBD (PDB Code: 7C8D). Mutations observed in the three major emerging variants of SARS-CoV-2 are shown (cyan). The Asn501Tyr (N501Y) mutation is seen among many variants. The surface of RBD that faces and interacts with the ACE2 receptor is colored in blue. For entry to the host cells, the spike glycoprotein interacts with the transmembrane protease, serine 2 (TMPRSS2) on the surface of target cells (Figure 1b) . The proteolytic activity of TMPRSS2 induces a conformational change in the spike glycoprotein causing it to adopt an open state. Consequently, the receptor binding domain (RBD) of the spike glycoprotein is exposed and binds to the angiotensin-converting enzyme 2 (ACE2) allowing the virus to enter into the host cells (Figure 1b) [11] very similar to SARS-CoV-1. [12] In the case of SARS-CoV-2, neuropilin-1 (NRP1) is shown to be a host-cell factor helping viral entry and infection. [13] Emerging variants of the virus with higher mortality and infectivity rates are rapidly appearing around the globe. Some recent variants, which may resist antibody response, [14, 15] include B.1.1.7 (the UK variant), B.1.351 (the South African variant), and P.1 (the Brazilian variant). All these variants have gained amino acid changes in the spike glycoprotein including mutations in the RBD (Figure 1c ). The common mutation of the RBD among all these variants is the substitution of asparagine 501 by tyrosine (Asn501Tyr or N501Y) ( Figure 1c ). The emerging variants are shown to have a replicative advantage and higher transmissibility, [16, 17] whose molecular mechanisms have remained vague. Molecular dynamics simulations suggest that the N501Y mutation may increase [18, 19] or decrease [20] the binding affinity of RBD to the ACE2 receptor. On the other hand, studies using pseudotyped viruses revealed that N501Y mutation does not increase the infectivity rate of variants of SARS-CoV-2 as compared to the wild-type (WT) virus in cell-based assays. [21] More recent data using hamster models and cellbased assays show that the replication rates of the N501Y variants increase in the upper respiratory airway. [17] These contrasting findings suggested to us that additional factors in the respiratory tract might contribute to the difference between the lower infectivity rate of the wild-type virus and those of its emerging variants. Because the first barrier for viral entry to host cells is mucus (Figure 1a ), we hypothesized that metabolites or natural products (NPs) produced by bacteria residing in mucus can bind to the WT-RBD and interfere with its interaction with the ACE2 receptor and thus, viral entry to host cells. To test this hypothesis, we used molecular docking simulations taking advantage of the open source PyRx software (Supporting Methods), [22] which uses AutoDock Vina program. [23] We benchmarked our approach to estimate its accuracy. We predicted the binding pocket of a bisindolylmaleimide inhibitor into the catalytic domain of human protein kinase C (PKC) β II and compared the outcome with the reported structural data for the same inhibitor ( Figure 2 ). [24] The predicted binding pocket using PyRx software is identical to the observed binding pocket of the inhibitor in the X-ray crystal structure of PKCβ-II (Figure 2a) . In both the predicted and the observed binding pockets the binding mode of the ligand is dictated by hydrophobic interactions (Figure 2b) . The predicted Gibbs free energy (ΔG) of binding was À 8.3 (kCal/mol). Next, we aimed to estimate the error for the predicted ΔG. We performed molecular docking simulations using four predicted ligands of RBD and compared our ΔG values with those published using other molecular docking approaches (Supporting Methods). Using the predicted ΔG values obtain by our approach and those published previously, we estimated an error of � 0.55 (kCal/mol). Thus, we used a ΔG value of À 8.3 � 0.55 (kCal/mol) as a benchmark for identifying metabolites or NPs with a reasonable binding affinity to the RBD. The current knowledge of the full spectrum of metabolites and NPs produced by respiratory tract bacteria is limited. Thus, we looked at some of the most common metabolites and NPs produced by commensal microbiota including amino acids, bile acids, non-ribosomal peptides (NRPs), and vitamins (Supporting Table S1 ). [25] [26] [27] We predicted the binding pocket of each metabolite and NP (Supporting Figures S1-S4 ) with the smallest Gibbs free energy (ΔG), which we used to estimate the dissociation constants (K d ) (Supporting Methods). Among various molecules tested, tauro-α-muricholic acid, taurocholic acid, enterobactin, and tilivalline have a ΔG value close to the benchmarked value of À 8.3 � 0.55 (kCal/ml). Thus, these molecules have a reasonable binding affinity with calculated K d values of 0.45, 1.00, 0.74 and 0.82 mM, respectively (Supporting Table S1 ). These values are less than that of folic acid (vitamin B9) (K d = 2.1 mM), which is predicted to bind to the RBD [28] and might help in treating COVID-19 patients. [29, 30] As an example, the predicted binding pockets for tauro-α-muricholic acid ( Figure 3a ) and tilivalline (Figure 3b ) are shown. The binding mode of tauro-α-muricholic acid and tilivalline includes hydrogen bonds and hydrophobic interactions (Supporting Figure S5 ). Tauro-alpha-muricholic acid, taurocholic acid, enterobactin and tilivalline have some common features: they all have aromatic and aromatic hydroxyl groups. These functional groups are present in other ligands like terpenes and drugs, which are predicted to bind to the same binding pocket in the RBD. [31, 32] Additionally, these functional groups are present in other bile acids [33] and the drug Corilagin, [34] which are shown to interfere with the RBD binding to the ACE2 receptor in biochemical assays. The predicted binding pocket for NPs like bile acids and NRPs includes Asn501 (N501) (Figures 3a-b and Supporting Figures S1-S4) . Alignment of the predicted binding pocket with the structure of RBD-ACE2 complex revealed that bile acids like tauro-α-muricholic acid (Figure 3c ) and NRPs like tilivalline (Figure 3d ) directly interfere with the interactions between the amino acid residues of WT-RBD and those of the ACE2 receptor. These data strongly suggest that NPs produced by commensal microbiota interfere with the WT-RBD binding to the ACE2 receptor consistent with the reported biochemical data for bile acids. [33] Since the predicted binding pocket for most of the NPs includes the amino acid Asn501 (Figures 3a-b and Supporting Figures S1-S4) , which in many of the emerging variants of SARS-CoV-2 is replaced by a tyrosine (N501Y) (Figure 1c) , we hypothesized that this mutation could interfere with the binding of bile acids and NRPs to the RBD. Therefore, we tested the binding of metabolites and NPs (Supporting Table S1 ) to the RBD of a variant harboring N501Y mutation (N501Y-RBD). Molecular docking studies revealed that N501Y mutation abolishes the binding pocket of bile acids and NRPs observed in the WT-RBD (Supporting Table S1 and Figures 3ef ). In the case of tauro-α-muricholic acid and tilivalline, the predicted K d values are 2.22 and 2.45 mM, respectively. These values within the estimated error are significantly larger (about 5-fold) than those of tauro-α-muricholic acid and tilivalline binding to the WT-RBD. Additionally, in the N501Y-RBD the predicted binding pocket of tauro-α-muricholic acid (Figure 3g) , tilivalline (Figure 3h) , and other bile acids and NRPs (Supporting Figures S6-S9) shifted. In this new binding pocket, the interference of NPs with the binding of N501Y-RBD to the ACE2 receptor is minimized (Figures 3g-h) . The reason for the shift in binding site is potentially the bulky phenol group of Tyr501. The predicted binding pocket of bile acids and NRPs does not have a glycosylation site; however, there is a distal glycosylation site (N343) (Supporting Figure S10 ). [35] The N343 glycans are important for the flexibility of the RBD during its exposure but not its direct interaction with the ACE2 receptor. [36, 37] Consistently, using molecular docking studies we did not observe a difference between the binding pocket of NPs like taurocholic acid and tilivalline in the glycosylated RBD (N343) as compared with the not-glycosylated RBD (Supporting Figure S10 ). To further evaluate the results of molecular docking studies, we performed classical all atoms molecular dynamics (MD) simulations of glycosylated structures of WT-RBD and of the N501Y-RBD, both in complex with tilivalline. We first used MD simulations to evaluate the flexibility of tyrosine in the presence of tilivalline as a ligand. In our MD simulations, tyrosine did not show significant flexibility (Supporting Figure S11 ) supporting the proposal that its bulky phenol group interferes with the binding of bile acids and NRPs to the RBD. Next, we used MD simulations to study the conformational stability of tilivalline in the predicted binding site of glycosylated structures of WT-RBD and N501Y-RBD. The interaction energy between tillivaline and the RBD binding site averaged over the last 4 ns of the MDs simulations (Supporting Figure S12) , was predicted to be as low as À 10.26 kCal/mol for the N501Y-RBD and below À 17.08 kCal/mol for the WT-RBD. When bound to the N501Y-RBD, tillivaline mostly interacts with the amino acid residues of RBD (Y501 and Y505) in such a way that it is positioned at the edge of the interface with the ACE2 receptor as revealed by close inspection of the MD trajectories (Supporting Figure S13 ). This observation suggests that tillivaline will be easily displaced by the N501Y-RBD binding to the ACE2 receptor. Therefore, we conclude that MD simulations reproduce the trend predicted by a static docking approach: i.e. N501Y mutation interferes with the binding of bile acids and NRPs to the RBD. In summary, our in-silico studies revealed that NPs of commensal microbiota like bile acids and NRPs can bind to the WT-RBD of SARS-CoV-2 with a reasonable affinity and that N501Y mutation interferes with their binding to the RBD. Although we could not obtain experimental data in support of our in-silico studies, earlier biochemical studies have reported Table S1 ) and in this pocket the interference of (g) tauro-α-muricholic acid (orange) and (h) tilivalline with the N501Y-RBD binding to the ACE2 receptor is minimized as observed by aligning the predicted binding site with the solved structure of RBD-ACE2 receptor (PDB Code: 7NXC). that bile acids interfere with the interaction of WT-RBD with the ACE2 receptor. [33] Additionally, our knowledge of NPs produced by the respiratory tract microbiota is very limited and it is likely that they produce unknown NPs with higher binding affinity to the RBD. Hence, we postulate that NPs produced by the commensal microbiota, e.g. bile acids or NRPs, interact with the WT-RBD and these interactions reduce the binding affinity (or increase the K d ) of the spike glycoprotein of SARS-CoV-2 to the ACE2 receptor. In the emerging variants with mutations such as N501Y, bile acids or NRPs cannot effectively interact with the RBD and thus, do not interfere with viral entry to host cells. Alternatively, it is possible that N501Y or other mutations of the RBD like those present in the B.1.617.2 variant (delta variant, which was first detected in India) increase the interactions of RBD with the ACE2 receptor [38] and thus, improving its binding to the ACE2 receptor in the presence of microbial NPs (Figure 4a ). This proposal helps in understanding the molecular mechanism underlying the emergence and the higher infectivity rates of new variants of SARS-CoV-2. Accordingly, we put forward a hypothetical model for a role of the respiratory tract bacteria in the evolution of SARS-CoV-2 ( Figure 4b ). The selection pressure induced by the host continuously directs the evolution and adaptation of the virus. For the spike glycoprotein variants of SARS-CoV-2 at least two environmental section pressures exist: the antibodies produced by the immune system and the healthy bacteria in the mucus, which is the first barrier for viral entry to the host cells. Future studies are needed to test the validity of this hypothesis. Moreover, our findings suggest that a change in the population of the healthy bacteria of the respiratory tract, which may occur due to ageing and smoking, [39, 40] could contribute to the severity of the disease consistent with previous reports. [7] [8] [9] Our in-silico studies suggest that NPs produced by commensal microbiota may have therapeutic applications. We hope our findings stimulate future works to study and discover NPs produced by various bacteria living in the respiratory tract and to better understand their interactions with respiratory viruses like SARS-CoV-2. (a) NPs like bile acids and/or NRPs produced by healthy bacteria interfere with the WT-RBD binding to the ACE2 receptor. The N501Y mutation or other mutations interfere with the binding of NPs with the RBD and/or increase the interaction of RBD with the ACD2 receptor. Consequently, they enable the virus to scape NPs of healthy bacteria and thus, they increase the infectivity of the virus. (b) Upon viral entry to the human body, the spike glycoprotein of the virus interacts withNPs produced by bacteria in the mucus. These NPs act as an environmental selection pressure together with antibody response leading to the emergence of new variants of SARS-CoV-2 with higher infectivity and/or mortality rates. Accepted manuscript online The authors declare no conflict of interest.Keywords: bile acids · commensal microbiota · natural products · NRPs · SARS-CoV-2 In silico studies suggest that NPs like bile acids and non-ribosomal peptides (NRPs) produced by bacteria residing in the respiratory tract can interfere with the binding of the original strain of SARS-CoV-2 to the ACE2 receptor but not with the binding of the emerging variants having the N501Y mutation. Thus, it is hypothesized that NPs of respiratory tract bacteria are an environmental selection pressure for the rise of new variants of the virus with higher infectivity rate.