key: cord-1024771-oaa6ipcq authors: Oyedara, Omotayo O.; Agbedahunsi, Joseph M.; Adeyemi, Folasade M.; Juárez-Saldivar, Alfredo; Fadare, Olatomide A.; Adetunji, Charles O.; Rivera, Gildardo title: Computational screening of phytochemicals from three medicinal plants as inhibitors of transmembrane protease serine 2 implicated in SARS-CoV-2 infection date: 2021-09-29 journal: Phytomedicine Plus : International journal of phytotherapy and phytopharmacology DOI: 10.1016/j.phyplu.2021.100135 sha: 43e1561154b41a058134230c8b528e33e880a086 doc_id: 1024771 cord_uid: oaa6ipcq Background : SARS-CoV-2 infection or COVID-19 is a major global public health issue which requires urgent attention in term of drug development. Transmembrane Protease Serine 2 (TMPRSS2) is a good drug target against SARS-CoV-2 because of the role it plays during the viral entry into the cell. Virtual screening of phytochemicals as potential inhibitors of TMPRSS2 can lead to the discovery of drug candidates for the treatment of COVID-19. Purpose : The study was designed to screen 132 phytochemicals from three medicinal plants traditionally used as antivirals; Zingiber officinalis Roscoe (Zingiberaceae), Artemisia annua L. (Asteraceae), and Moringa oleifera Lam. (Moringaceae), as potential inhibitors of TMPRSS2 for the purpose of finding therapeutic options to treat COVID-19. Methods : Homology model of TMPRSS2 was built using the ProMod3 3.1.1 program of the SWISS-MODEL. Binding affinities and interaction between compounds and TMPRSS2 model was examined using molecular docking and molecular dynamics simulation. The drug-likeness and ADMET (absorption, distribution, metabolism, excretion, and toxicity) properties of potential inhibitors of TMPRSS2 were also assessed using admetSAR web tool. Results : Three compounds, namely, niazirin, quercetin, and moringyne from M. oleifera demonstrated better molecular interactions with binding affinities ranging from -7.1 to -8.0 kcal/mol compared to -7.0 kcal/mol obtained for camostat mesylate (a known TMPRSS2 inhibitor), which served as a control. All the three compounds exhibited good drug-like properties by not violating the Lipinski rule of 5. Niazirin and moringyne possessed good ADMET properties and were stable in their interactions with the TMPRSS2 based on the molecular dynamics simulation. However, the ADMET tool predicted the potential hepatotoxic and mutagenic effects of quercetin. Conclusion : This study demonstrated the potentials of niazirin, quercetin, and moringyne from M. oleifera, to inhibit the activities of human TMPRSS2, thus probably being good candidates for further development as new drugs for the treatment or management of COVID-19. SARS-CoV-2 is an envelope and a spiked positive-stranded ribonucleic acid (RNA) virus responsible for a severe acute respiratory syndrome (SARS) termed Coronavirus Disease 2019 , with some symptoms such as cough, sore throat, fever, pneumonia, alveolar damage, and inflammatory responses in the airways ultimately leading to severe respiratory failure. SARS-CoV-2 possesses a 30 kb genome encoding non-structural proteins (viral proteases, transcription, and replication proteins) and structural proteins (nucleocapsid, membrane, envelope, and spike). The spike is a glycosylated protein that plays a significant role during SARS-CoV-2 infection. It recognizes and binds to a receptor known as angiotensin-converting enzyme 2 (ACE-2) on the host cell surface prior to the virus entry into host cells via receptor mediated endocytosis . Furthermore, the successful internalization of SARS-CoV-2 into the human host cell is a function of the presence of a cellular protease Transmembrane Protease Serine 2 (TMPRSS2), which primes the glycosylated spike protein to allow the virus fuse with the host membrane (Hoffmann et al. 2020 ). The critical role TMPRSS2 plays during SARS-CoV-2 infection makes it a good target for drug development in the treatment of COVID-19 disease or amelioration of conditions of patients infected with the SARS-CoV-2 virus. Although several research efforts have been committed to the discovery of drugs to combat COVID-19, there has been no success in the development of approved antiviral drugs. The inhibitory activities of compounds and drugs against TMPRSS2 have been reported Plant-based natural products have been recognised as an excellent source of novel drugs. Different plant species have been converted into pharmacological drugs which have been applied for successful treatment of several human ailments such as malaria, hypertension, and cancer. Small molecules derived from natural products are not only readily abundant, but they have also been reported to be orally active according to the Lipinski rule of compound druggability (Benet et al. 2016) . The plants of interest in this study include Zingiber officinalis Roscoe (Zingiberaceae), Artemisia annua L. (Asteraceae), and Moringa oleifera Lam. (Moringaceae) that have a long history of traditional uses as antiviral. Z. officinalis is a spice with several pharmacological properties. Chang et al. (2013) reported that fresh ginger reduced human respiratory syncytial virus infection by more than 70 % in human respiratory tract cell lines HEp-2 and 549. A study on A. annua has also shown that it has potential as an antiviral agent in the control of flaviruses (Romero et al. 2006) , human cytomegalovirus, herpes simplex virus, and hepatitis virus (Efferth et al. 2008) . Recently, Nair et al. (2021) showed that A. annua extracts could inhibit SARS-CoV-2 replication in vitro. M. oleifera (also known as "drum stick or horse radish") is well distributed and cultivated in Asia, Africa, Latin America, the Caribbean, and the Pacific Island due to its numerous nutritional and medicinal importance. The antiviral potentials of M. oleifera against herpes simplex virus, hepatitis B virus, influenza virus, Epstein-Barr virus, and human immunodeficiency virus have been reported (Biswas et al. 2020) . A molecular docking study conducted by Chakotiya and Sharma (2020) showed that shoagol, zingerone, and zingiberene from Z. officinalis had higher binding affinities for TMPRSS2 compared to camostat mesylate, a standard TMPRSS2 inhibitor. Laksmiani et al. (2020) also studied the interactions between three phytochemicals from M. oleifera (apigenin, luteolin, and quercetin) and TMPRSS2 via molecular docking. However, in this current study, molecular docking was performed to analyze the interactions between several phytochemicals from the study plants and TMPRSS2. The stability of the resulting protein-ligand complexes was further assessed by molecular dynamics simulation. The development of a molecule or compound into a new clinically acceptable drug takes an average of 12 years and cost over $1 billion (Mohs & Greig, 2017) . However, the application of computational methods has shortened the period required to detect a compound that could be utilised as a potential clinical candidate (Lin et al. 2020) . Interactions between several compounds and protein targets can be studied in silico using computer programs to select lead compounds that could be developed into clinical drugs. In silico screening of compounds does not only aid the rapid discovery of compounds with therapeutic values but also reduces the cost of developing new drugs. Computational studies also offer the opportunity of determining the ADME (Absorption, Distribution, Metabolism, and Excretion) and "drug-likeness" of promising natural compounds, hence, helping in ruling out compounds that might not be effective during the different steps of drug development (Lombardo et al. 2017) . As researchers continue to hunt for arsenals to treat SARS-CoV-2 infection, the in silico evaluation of natural plant-derived compounds via molecular docking and molecular dynamics simulation can assist in the quick discovery of drugs to treat COVID-19. Therefore, in this study, compounds from three important medicinal plants (Z. officinalis, A. annua, and M. oleifera) were screened as potential inhibitors of TMPRSS2, an important protein implicated in the SARS-CoV-2 infection. The amino acid sequence of TMPRSS2 (Uniprot accession number O15393) comprising 492 amino acids was retrieved from Uniprot database (https://www.uniprot.org/), and then used to predict the protein 3D structure of TMPRSS2 by homology modelling. The amino acid sequence of TMPRSS2 was subjected to BLAST search in the SWISS-MODEL (https://swissmodel.expasy.org/), an automated protein structure homology-modelling server to obtain a template protein structure. The BLAST search returned the crystal structure of serine protease hepsin in complex with inhibitor (SMTL ID 5ce1.1) as the template protein structure for the homology-modelling. The template protein structure was used to build a homology model of TMPRSS2 using the ProMod3 3.1.1 program of the SWISS-MODEL online server. The model obtained was validated using PROCHECK (https://servicesn.mbi.ucla.edu/PROCHECK/), ERRAT (https://servicesn.mbi.ucla.edu/ERRAT/), and verify3D (https://servicesn.mbi.ucla.edu/Verify3D/). The computed atlas of surface topography of proteins (CASTp) server (http://sts.bioe.uic.edu/) was used to predict the binding pockets of the modelled TMPRSS2, and the distribution of actives sites residues (substrate binding: Asp435, Ser460, and Gly462; catalytic: His296, Asp345, and Ser441) previously reported (Idris et al. 2020) in the pockets. The SDF files of 2D chemical structures of the 132 phytochemicals were downloaded from the PubChem database (https://pubchem.ncbi.nlm.nih.gov/) and used for the molecular docking. Camostat mesylate, an approved serine inhibitor reported to have promising therapeutic effects against the Material Tables S1-S3. The molecular docking between phytochemicals and modelled 3D structure of transmembrane protease serine 2 (TMPRSS2) was performed using Open babel and Autodock vina wizard on PyRx virtual screening tool. Compound structures (ligand) were minimized and converted to pdbqt format. For this analysis the receptor was treated rigid. The vina search space grid centre selected was X: 13.1910 Y: -6.0318 Z: 15.7311, and was set around reported catalytic (His296, Asp345, and Ser441) and substrate binding (Asp435, Ser460, and Gly462) residues of TMPRSS2 (Idris et al. 2020). The binding affinity (kcal/mol) was obtained after docking. The visualization of the interaction between ligands (compounds) and the TMPRSS2 protein model was done using BIOVIA Discovery studio visualizer 2020 and PyMOL software. The binding affinity obtained for the camostat mesylate, a known inhibitor of TMPRSS2, was set as the cut-off to select compounds for further analysis. The drug-likeness and ADMET properties of compounds predicted to be potential and potent inhibitors of TMPRSS2 were evaluated using the admetSAR web tool (Cheng et al. 2012 ). The druglikeness was predicted based on the Lipinski rule of 5 (Benet et al. 2016) . Molecular dynamics simulation was performed using GROMACS version 2018.4 (Abraham et al. 2018 ). Ligand topologies were generated based on General Amber Force Field (GAFF) using ACPYPE-AnteChamber PYthon Parser interface (Sousa da Silva and Vranken, 2012), and protein topology was generated based on Amber03 using pdb2gmx. The unit cell was defined as a dodecahedron and filled with TIP3P (three-site transferrable intermolecular potential) water. To neutralize the system, six chloride ions (Cl -) were added. The energy minimization step was performed using the steepest descent minimization with a maximum force < 10.0 kJ/mol. Then, the system was equilibrated in two phases. Firstly, NVT ensembled with 300 K as the reference temperature, and secondly, NPT ensembled with 1 bar as the reference pressure. Both were 100 ps long with a position restraining force on the heavy atoms of the protein and ligand. Once the system was equilibrated, a 50 ns molecular dynamic simulation was run. Analysis of the trajectory was composed by root-mean-square deviation (RMSD) calculation of the heavy atoms of the ligand and an energy decomposition analysis, which was performed using gmx-MMPBSA (Modifed Molecular Mechanics-Poisson Boltzmann Surface Area) with 100 frames from the last 10 ns of each molecular dynamic simulation. The BLAST search of the amino acid sequence of TMPRSS2 obtained from the Uniprot database against the SWISS-MODEL library generated crystal structure of serine protease hepsin (SMTL ID 5ce1.1). The homology model for TMPRSS2 protein was then built using the crystal structure of serine protease hepsin as a template. The template has a sequence coverage of 71 % and resolution of 2.50 Å. It also shared sequence identity and similarity of 33.33 % and 38 %, respectively, with the query sequence (Table 1) . Based on the report of Xiang (2006) , the sequence similarity value obtained was suitable to generate a reliable model. The model obtained has a QMEAN (Qualitative Model Energy Analysis) value of -1.47 and global quality estimation score (GMQE) of 0.49 based on the result from ProMod3 3.1.1 in the SWISS-MODEL (Table 1) . For biomedical applications, protein models must be accurately predicted. The QMEAN value otherwise, known as "degree of nativeness" indicates, how comparable the model is to the experimental structures. A good model is expected to have a QMEAN value close to zero and not less than -4.0 (Benkert et al. 2011 ). In the study, the QMEAN score (-1.47) obtained for the predicted protein model confirmed its good quality. The GMQE result is also reliable because the value (0.49) obtained falls between 0 and 1, which is the benchmark, as reported by Biasini et al. (2014) . Based on the Ramachandran plot (Figure 1 ), the modelled TMPRSS2 has 99.6 % of its residues in the allowed region, 88.6 % in the most favoured region, and 11.0 % in the additional allowed region. This result also supported the good quality of the protein model. The modelled TMPRSS2 was further validated to be of good quality based on the overall quality factor of 95.56 % obtained from ERRAT2 plot analysis (Supplementary Figure S1 ). CASTp server predicts topographic characteristics such as cavities, channels, and surface pockets of proteins (Tian et al. 2018) . The result of CASTp can reveal the position of active sites such as the binding and catalytic sites of proteins. It can also reveal pockets in which ligand and amino acid residues interaction occur. The CASTp software predicted 75 pockets with the active site residues (His296, Asp345, Asp435, Ser460, and Gly462) with the exception of Ser441 distributed in 10 pockets (Supplementary Figure S2) . The area, volume, and residues around the active sites are presented in Table 2 . The Asp435 residue is very important in the TMPRSS2 for substrate binding, recognition, and orientation for the catalytic process, and its presence indicates the trypsin-like proteolytic activity of TMPRSS2 (Vivek-Ananth et al. 2020; Hempel et al. 2021) . Therefore, compounds that bind to the Asp435 were taken as the potential inhibitor of TMPRSS2. The Asp435 residue was present in 6 pockets designated 4A, 15B, 25D, 45F, 48G, and 50H. Meanwhile, the catalytic residue Asp345 was present in four pockets (20C, 32E, 51I, and 68J), and His296 in one pocket (32E). However, the ligand-protein interactions were observed to occur in the TMPRSS2 pocket designated 4A, with an area and volume size of 160.709SA and 80.898SA, respectively (Table 2) . The result of the molecular docking of 132 phytochemicals selected from three medicinal plants with the TMPRSS2 protein is presented in the Supplementary Tables S1, S2, and S3.The reference compound camostat mesylate had a binding affinity score of -7.0 kcal/mol. Hence, phytochemicals with binding affinity scores of ≤ -7.0 kcal/mol were selected for further analysis. Twenty (20) phytochemicals had binding affinity scores ≤ -7.0 kcal/mol out of 132 phytochemicals with binding affinity scores ranging from -7.0 to -8.7 kcal/mol. The bond interaction between the compounds and TMPRSS2 residues are presented in Table 3 . TMPRSS2 residues Asn398, Asp435, and Gly258 form a strong N−H···O hydrogen bond type interaction with the control camostat mesylate. (Figure 2a ). The formation of hydrogen bond between a ligand and target protein is important for a molecule's function . The presence of hydrogen bonds also encourages the formation of stronger and robust ligand-protein complexes (Majewski et al. 2019 ). However, the numerous van der Waals, covalent, π-alkyl, and electrostatic interactions observed between study compounds and TMPRSS2 can also help in stabilizing the ligand-protein complex. There was no interaction between TMPRSS2 and two compounds found in Z. officinalis (thujopsene and zingiberol) when the docking interaction output was imported into Discovery Studio for visualization. All the compounds with binding energy ≤ -7.0 kcal/mol except gamma elemene, beta-elemene, and aromandrene form hydrogen bonds with residues on TMPRSS2 (Table 4 and Supplementary Figure S3 -S14). However, only five compounds, namely quercetin, glucotropaelin, niazirin, and moringyne from M. oleifera, and scopolin from A. annua formed hydrogen bonds with the important substrate binding residue Asp435 (Figure 2bf ). Our findings established that these five compounds could serve as potential inhibitors of TMPRSS2. Scopolin formed a weak carbon hydrogen (C−H···O) bond of 3.46 Å in length (L) with the Asp435 residue (Figure 2b ). It also formed the same weak hydrogen bond interaction with residues Gly259 (L= 3.30 Å), Asp440 (L= l3.30 Å and 3.32 Å), and Thr387 (L= 3.65 Å). It interacted with residues Asp440 (L= 2.72 Å), Gly383 (L= 1.83 Å), and Asn398 (L= 1.98 Å) with a strong covalent hydrogen bond (O−H···O). Other interaction observed between scopolin and TMPRSS2 was a hydrophobic alkyl/π-alkyl bond involving residues Asn398 (L= 2.04 and 2.52 Å), and Gly385 (L= 2.41 Å). A weak hydrogen bond was observed between moringyne and two TMPRSS2 residues, including Ala434 (L= 3.48 Å) and Asp440 (L= 3.19 Å). It also binds to the Ala 466 residue via a π-alkyl bond of length 4.63 Å (Figure 2f ). Moringyne is a glycoside reported to relax bronchioles, and hence, it is used in the treatment of asthma (Mahajan et al. 2009 ). Due to this property, it could be very useful as an adjuvant for the treatment of COVID-19 infections, which produce severe breathing difficulties. According to Jeffrey (1997) , hydrogen bond with distances ranging between 2.2 -2.5 Å is classified as strong covalent hydrogen bond, while moderate and weak electrostatic hydrogen bond have distances ranging between 2.5 -3.2 Å and 3.2 -4.0 Å respectively. Therefore, based on interaction with the binding substrate Asp435 and hydrogen bonding, all the biologically active constituents derived from M. oleifera (quercetin, niazirin, and moringyne) could serve as potent inhibitor of TMPRSS2 among all the biological compounds tested. According to the Lipinski rule of five, a small compound that has the potential to become a drug must have certain properties. The compound must have molecular weight less than 500, the hydrogen bond donor (H-Bond donor), and hydrogen bond acceptor (H-Bond acceptor) must not be more than 5 and 10 respectively. The calculated Log P (CLog P), which is the octanol-water partition coefficient logP, must not be more than 5 (Benet et al. 2016) . All the compounds, including the control, did not violate any of the Lipinski rules of five (Table 5) , and as such, they possess drug-like features. Studies on the pharmacokinetics and toxicity of potential drug molecules are essential. The three phytochemicals (niazirin, quercetin, and moringyne) exhibited the highest potential to inhibit TMPRSS2 based on the molecular docking result, exhibiting different ADMET properties ( Table 6 ). The control compound camostat mesylate has good human intestinal absorption (HIA), blood brain barrier (BBB), and human oral bioavailability (HOB) properties. It is not carcinogenic, according to AMES mutagenesis prediction, and has acute oral toxicity of level III i.e. it has a lethal dose 50 (LD 50 ) that is less than 500 but greater than 5000 mg/kg (Ammar, 2017) . However, camostat mesylate is hepatotoxic according to the admetSAR server prediction. It was also found in this study that quercetin and moringyne can be absorbed in the intestine, and none of the compounds including niazirin can cross the blood brain barrier. Contrarily, accumulation of quercetin in rat brain tissues has been reported (Ishisaka et al. 2011 ). The admetSAR program predicted the compounds as non-carcinogenic and not orally bioavailable. However, several approaches such as absorption enhancer (Aungst, 2012) , chemical modification (Renukuntla et al. 2013) , and self-double-emulsifying drug delivery system (Wang et al. 2017 ) methods can be used to improve the oral bioavailability of the compounds. According to the AMES test, quercetin was predicted as a mutagen with a probability of 0.9000. It has level II acute oral toxicity. This implies that the LD 50 , which is the amount of compound or drug required to kill 50 % of the animals on which it is being tested, is greater than 50 but less than 500 mg/kg (Ammar, 2017) . Quercetin was also predicted as a hepatotoxic compound. Based on the AMES, acute oral toxicity, and hepatotoxicity test results of quercetin, there is a need to apply caution in its application as a potential TMPRSS2 inhibitor. This result corroborates the findings of Ammar (2017) . There is a dearth of information about the pharmacokinetics and toxicity of moringyne, and further research on this compound might lead to the development of an excellent lead compound for the treatment of SARS-CoV-2 infection. Figure 3a , the reference compound camostat mesylate in complex with TMPRSS2 showed a trajectory with a clear fluctuation from 5 to 30 ns, but after this time, the system stabilizes around 5 Å. In the case of quercetin, the RMSD never reached a stable state, and at 30 ns, the ligand moved away from the protein, therefore, causing a drastic reduction in the RMSD value. In contrast to niazirin and moringyne, lack of sugar moeity in the quercetin can affect its flexibility and stability because the sugar ring and the rotation of the glycosidic bond can generate several sites of interactions. The relative frequency of molecular interactions in the last 10 ns of simulation from camostat mesylate, niazirin, and moringyne trajectories suggests that Asp440, Ser436, and Gly385 have significant contributions to the ligand binding because the three compounds bind to these residues primarily by hydrogen bonding (Figure 3b ). From the RMSD result, camostat mesylate is a highly flexible compound, therefore interacts at a low relative frequency, except for its hydrogen bond interaction with Asp440, Ser436, and Gly385. On the other hand, Ile381, Ala400, and Val434 contributed to the stability of niazirin by hydrophobic interactions. The most stable compound, moringyne, forms two additional hydrogen bonds with Ala386 and Asn398, in accordance with the docking pose. Using computer-aided in silico analysis which include homology modelling, molecular docking, and molecular dynamics simulation, we identified three compounds from M. oleifera-niazirin, quercetin, and moringyne that can inhibit the activity of human TMPRSS2. The three compounds exhibited satisfactory ADMET properties, though the application of quercetin requires caution. Comparatively, the result of the molecular dynamics simulation suggested moringyne as the best phytochemical to be considered as a potential inhibitor of TMPRSS2. Further experimental studies on niazirin, quercetin, and moringyne are required to consider their application in the treatment and management of SARS-CoV-2 infection. The authors declare that there are no known conflicts of interest associated with this publication and there has been no significant financial support for this work that could have influenced its outcome. & the GROMACS development team Glucosinolates profile, volatile constituents, antimicrobial, and cytotoxic activities of Lobularia libyca In silico pharmacodynamics, toxicity profile and biological activities of the Saharan medicinal plant Limoniastrum feei Absorption enhancers: applications and advances BDDCS, the rule of 5 and drugability Toward the estimation of the absolute quality of individual protein structure models SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information Moringa oleifera Lam. and derived phytochemicals as promising antiviral agents: A review Phytoconstituents of zingiber officinale targeting host-viral protein interaction at entry point of sars-COV-2: A molecular docking study Fresh ginger (Zingiber officinale) has anti-viral activity against human respiratory syncytial virus in human respiratory tract cell lines admetSAR: a comprehensive source and free tool for assessment of chemical ADMET properties The antiviral activities of artemisinin and artesunate Accumulation of orally administered quercetin in brain tissue and its antioxidative effects in rats. Free Radic An introduction to hydrogen bonding Active compounds activity from the medicinal plants against SARS-CoV-2 using in silico assay A review on applications of computational methods in drug screening and design In Silico absorption, distribution, metabolism, excretion, and pharmacokinetics (ADME-PK): utility and best practices. An industry perspective from the international consortium for innovation through quality in pharmaceutical development Inhibitory effect of n-butanol fraction of Moringa oleifera Lam. seeds on ovalbumin-induced airway inflammation in a guinea pig model of asthma An investigation of structural stability in protein-ligand complexes reveals the balance between order and disorder Candidate drugs against SARS-CoV-2 and COVID-19 Drug discovery and development: Role of basic biological research Artemisia annua L. extracts inhibit the in vitro replication of SARS-CoV-2 and two of its variants Targeting SARS-CoV-2 spike protein of COVID-19 with naturally occurring phytochemicals: an in silico study for drug development Scopolin attenuates osteoporotic bone loss in ovariectomized Mice Scopolin prevents adipocyte differentiation in 3T3-L1 preadipocytes and weight gain in an ovariectomy-induced obese mouse Unravelling high-affinity binding compounds towards transmembrane protease serine 2 enzyme in treating SARS-CoV-2 infection using molecular modelling and docking studies Approaches for enhancing oral bioavailability of peptides and proteins Antiviral effect of artemisinin from Artemisia annua against a model member of the Flaviviridae family, the bovine viral diarrhoea virus (BVDV) ACPYPE -antechamber python parser interfacE CASTp 3.0: computed atlas of surface topography of proteins In silico identification of potential natural product inhibitors of human proteases key to SARS-CoV-2 infection Enhanced oral bioavailability of quercetin by a new non-aqueous self-double-emulsifying drug delivery system PRL-dock: Protein-ligand docking based on hydrogen bond matching and probabilistic relaxation labeling Advances in homology protein structure modeling In silico study on spice-derived antiviral phytochemicals against SARS-CoV-2 TMPRSS2 target