key: cord-0794084-du4fhd4o authors: Chandel, Vaishali; Sharma, Prem Prakash; Raj, Sibi; Choudhari, Ramesh; Rathi, Brijesh; Kumar, Dhruv title: Structure-based drug repurposing for targeting Nsp9 replicase and spike proteins of severe acute respiratory syndrome coronavirus 2 date: 2020-08-24 journal: Journal of biomolecular structure & dynamics DOI: 10.1080/07391102.2020.1811773 sha: fa0948725fdeaaee3e94a990d2b565f3d20dfc17 doc_id: 794084 cord_uid: du4fhd4o Drug re-purposing might be a fast and efficient way of drug development against the novel coronavirus disease 2019 caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). We applied a bioinformatics approach using molecular dynamics and docking to identify FDA-approved drugs that can be re-purposed to potentially inhibit the non-structural protein 9 (Nsp9) replicase and spike proteins in SARS-CoV-2. We performed virtual screening of FDA-approved compounds, including antiviral, anti-malarial, anti-parasitic, anti-fungal, anti-tuberculosis, and active phytochemicals against the Nsp9 replicase and spike proteins. Selected hit compounds were identified based on their highest binding energy and favorable absorption, distribution, metabolism and excretion (ADME) profile. Conivaptan, an arginine vasopressin antagonist drug exhibited the highest binding energy (-8.4 Kcal/mol) and maximum stability with the amino acid residues present at the active site of the Nsp9 replicase. Tegobuvir, a non-nucleoside inhibitor of the hepatitis C virus, also exhibited maximum stability along with the highest binding energy (-8.1 Kcal/mol) at the active site of the spike proteins. Molecular docking scores were further validated by molecular dynamics using Schrodinger, which supported the strong stability of ligands with the proteins at their active sites through water bridges, hydrophobic interactions, and H-bonding. Our findings suggest Conivaptan and Tegobuvir as potential therapeutic agents against SARS-CoV-2. Further in vitro and in vivo validation and evaluation are warranted to establish how these drug compounds target the Nsp9 replicase and spike proteins. The current coronavirus pandemic has caused 3,87,911 deaths across the globe and has infected around 6 million people as of 5 th June 2020. Initial studies on severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) stated that it was closely related to SARS-CoV. Coronaviruses (CoVs) are enveloped viruses from the family of Coronaviridae with a positive-sense single-stranded RNA genome (Fehr & Perlman, 2015) . The genome size of CoVs is relatively large, ranging from approximately 27 to 37 kilobases. The envelope of the virus contains a lipid bilayer with three structural proteins: membrane (M), envelope (E), and spike (S) (Boopathi et al., 2020) . The multiple copies of nucleocapsid protein present are associated with the positive sense single stranded RNA genome and are responsible for the formation of the nucleocapsid present inside the envelope (Bride et al., 2014) . Viral protection outside the host is provided by the lipid bilayer, nucleocapsid, and membrane proteins (Lenard, 2008) . The CoV infection is initiated by the attachment of the S glycoprotein to the complementary host receptor (Young & Alexandra, 2019) . The entry of viral particles and its attachment to host membrane is mediated through either direct fusion of the viral envelope or endocytosis using the host membrane (Huang et al., 2020) . The single positive-stranded RNA genome of COVs has the capacity to replicate their own genomic information as well as translate proteins in the cytoplasm of host (Nakagawa et al., 2016) . Polymerase is synthesized by the virus and used to subsequently synthesize the minus strand using the positive strand as a template (Elfiky, 2020) . This positive sense genomic RNAs generated through replication develops into the viral progeny. The genomic RNA is attached to the N glycoprotein. The M glycoprotein integrates into the endoplasmic reticulum (ER) membrane exactly as the S and hemagglutinin esterase (HE) proteins. The ER is the location for the translation of RNA and viral structural proteins. The M protein assists the protein-protein interactions that help in the assembly of viral particles, which is followed by its binding to the nucleocapsid. These viral particles are then released from the host cell via exocytosis (Mousavizadeh & Ghasemi, 2020 . The main protease domain (Mpro) is a conserved target in SARS-CoV-2, and thus provide the opportunity to design new inhibitors throughout the entire Coronaviridae subfamily (Mahanta et al., 2020) . Twothird region of the 5 0 end of the CoV genome consists of open reading frame I (ORFI), which encodes two large polypeptides involved in the replicase machinery: pp1a and pp1ab1. Two proteases encoded in the 5 0 region of ORF 1: 3 C-like protease (3CL or Nsp5) and papain-like protease (PLP), co-translationally cleave the two polypeptides into mature non-structural proteins (NSPs) (Lim et al., 2000; Kumar, Sharma, et al., 2020) . The 3CL protease, also referred to as Mpro due to its dominant role in the post-translational machinery of the replicase protein (Kanchan et al., 2003) . Both these proteins have a substrate-binding pocket where at P1, glutamine is the substrate and at the P2, either leucine or methionine are the source of substrates. This strong structural basis provides a loophole for the design of a wide spectrum of anti-CoV inhibitors. In general, there are no treatment options for corona based viral diseases that occur suddenly and spread at a higher frequency. The spike proteins (PDB ID-6LZG) form a crown shape on the surface of the novel virus and are of major research interest as little is known about how they attach, fuse and gain entry into the host cell (Walls et al., 2020) . There are mainly two subunits in the spike protein, namely S1 and S2. The S1portion has diverged sequences even with the same coronavirus species, whereas the S2 subunit is highly conserved. The S1 subunit has 2 domains N and C terminal domains. These domains mainly function as receptor-binding domains and bind to various proteins and sugar molecules. These spike proteins contain heptad repeats of hydrophobic domains that help in fusing into the host. The cell entry program is mediated by the spike proteins, mainly through binding to the ACE-2 receptor on the host surface and subsequently mediating the viral infection. The major role played by the spike proteins in the host entry and attachment illustrates the large possibilities of targeted and to find effective vaccines and anti-bodies to neutralize the viral infection (Li, 2016; Robson, 2020) . The Nsp9 (PDB ID-6W4B) replicase is a non-structural protein encoded by ORF1a, which has no eminent function, but is related to viral RNA synthesis (Dene et al., 2020) . This protein contains a single folded beta-barrel, which is unique, unlike the single domain proteins. This fold is related to the OB-fold having an extended C-terminal in the subdomains of both SARS-CoV-2 and 3 C-like protease that belongs to the serine protease superfamily. The crystal structure of Nsp9 replicase emphasizes it as a dimeric protein. Nsp9 replicase specifically binds to the RNA, further interacting with the nsp8 protein and activating it, which is essential for its function (Sutton et al., 2004) . As Nsp9 replicase plays a major role in viral replication, it can be a unique target for the discovery of novel drugs against this protein, enabling the inhibition of the viral progression. As many vaccines and drugs are undergoing clinical trials globally, drug repurposing has been one of the effective approaches taken by the scientists across the globe to bring out an effective medicine for the eradication of the novel coronavirus. Anti-viral drugs such as chloroquine and hydroxychloroquine, used to treat malaria and arthritis respectively, were approved in the USA to treat SARS-CoV-2 patients (Touret & Lamballerie, 2020) . Some of the other drugs such as Remdesivir, Actemra, and Galidesivir are currently undergoing clinical trials, but there has been no vaccine or therapeutic drug currently approved by the FDA for the prevention or treatment of SARS-CoV-2 (Hendaus, 2019; Chaudhuri et al., 2018; Das et al., 2020) . Therefore, structure-based drug repurposing through targeting the Nsp9 replicase and spike proteins of SARS-CoV-2 can provide successful therapeutic strategies against this virus. Our study used a computational approach towards structurebased drug repurposing of different anti-viral, anti-malarial, anti-parasitic, anti-fungal, anti-tuberculosis and active phytochemicals against major target proteins such as Nsp-9 replicase and the spike protein of SARS-CoV-2. The atomic coordinates of the protein crystal structures of Nsp9 replicase (PDB ID-6W4B) and the spike protein (PDB ID-6LZG) were downloaded from the RCSB-PDB (protein data bank) database. Prior to docking or analysis, the solvation parameters, charge assignment, fragmental volumes, and protein optimization were checked using Autodock Tool 4 (ADT) Kumar, Chandel, et al., 2020; O'Boyle et al., 2011) . The 3 D SDF structures of all the compounds were downloaded from the PubChem database . The 2 D ligand structures of the compounds were designed using Chemdraw. The optimization of the ligands was done using Avogadro and the data converted into the PDB file format using Open Babel software. Molecular screening of the compounds was performed using PyRx virtual screening tool-python prescription 0.8 software and Autodock wizard as the engine for molecular docking (Dallakyan & Olson, 2015; Khan et al., 2020; Pagadala et al., 2017; Seeliger & Groot, 2010) . The ligands were minimized to their stable form. During the period of docking, the protein was considered to be rigid and the ligands were considered to be flexible. Auto Grid engine in PyRx was used to generate the configuration file for the grid parameters. The application was also used to identify/predict the amino acids in the active site of the protein that interact with the ligands. A result of positional root-mean-square deviation (RMSD) less than 1.0A˚was considered ideal for finding the favorable binding. The ligand with the highest binding energy (most negative) was considered as the ligand with maximum binding affinity. Pymol version 2.3.4 and ADT were used for visual analysis of the docking site and the results were validated using Autodock-Vina (Seeliger & Groot, 2010) . ADME analysis ADME analysis of the selected ligands obtained from PubChem was done on the basis of canonical SMILES using The structure is shown in ribbon representation, coloured from the N-terminus to the C-terminus with colours changing from blue through green and yellow to red. (B) Spike protein (PDB ID-6LZG) of SARS-CoV-2 shows ribbon structure representation, coloured from the N-terminus to the C-terminus with colours changing from red through yellow and green to blue Ribbon structure. Swiss-ADME programme (Daina et al., 2017) . The ADME properties of the chosen compounds were calculated. The major ADME associated parameters such as Lipinski's rule of five, drug likeliness, pharmacokinetic properties, the solubility of the drug, were considered. The values of the observed properties are presented in Tables 1 and 2 . The complete study was performed on different modules of Schrodinger suite 2020-1 trial version. Both complexes were prepared prior to MD simulation in the protein preparation wizard and Prime module of Schrodinger suite to remove defects such as missing hydrogen atoms, incorrect bond order assignments, charge states, orientations of various groups and missing side chains suite (Schr€ odinger, 2020, 2016). Removal of steric clashes and strained bonds/angles were done by performing a restrained energy minimization, allowing movement in heavy atoms up to 0.3 Å. Extensive 100 ns MD simulation was carried out for both complexes through Desmond, D. E. Shaw Research, New York, NY, 2015 (Schr€ odinger, 2020) to access the binding stability of query molecule with respect to nelfinavir in the complex. Both complex systems were solvated in TIP3P water model and 0.15 M NaCl to mimic a physiological ionic concentration. The full system energy minimization step was done for 100 ps. The MD simulation was run for 100 ns at 300 K temperature, standard pressure (1.01325 bar), within an orthorhombic box with buffer dimensions 10 Â 10 Â 10 Å3 and NPT ensemble. The energy (kcal/mol) was recorded at intervals of 1.2 ps. The protein-ligand complex system was neutralized by balancing the net charge of the system by adding Na þ or Cl-counter ions. The Nose-Hoover chain and Martyna-Tobias-Klein dynamic algorithm was used maintain the temperature of all the systems at 300 K and pressure 1.01325 bar, respectively. Our study focused on drug repurposing against the structural proteins Nsp9 replicase (PDB ID-6W4B) and the spike protein (PDB ID-6LZG) (Figure 1 ) of SARS-CoV-2 in combination as potential therapeutic targets for the treatment of coronavirus. In this study, we applied a computational approach of structure-based drug repurposing to identify specific therapeutic agents against SARS-CoV-2. We created a database of 2000 FDA approved compounds, including antiviral, anti-malarial, anti-parasitic, anti-fungal, anti-tuberculosis and active phytochemicals from FDA and Indian Medicinal Plants, Phytochemistry and Therapeutic database ( Figure 2 ). These compounds were screened using a virtual screening tool PyRx, based on which 15 hits were selected depending on their best binding energy. Further, molecular docking was performed for hits against Nsp9 replicase and the spike protein (Tables 1 and 2) . Molecular docking is a computational approach that aims to identify non-covalent binding between (ligand/ inhibitor) and protein (receptor). Docking predicts the mode of interaction between a receptor and the ligand for an established binding site. Binding energy suggests the affinity and strength of a specific ligand to which a compound binds and interacts at the active site pocket of a target protein. To understand the effect of active antiviral, antimalarial, anti-parasitic, anti-fungal, anti-tuberculosis, anti-bacterial and active phytochemical compounds on SARS-CoV-2 molecular docking of 15 active compounds against each target selected after screening from PyRx, was performed. Further, based on their binding energy and best ADME properties, the top three compounds were selected (Tables 3 and 4) . The three selected compounds (Conivaptan, Telmisartan, and Phaitanthrin D) showed the best docking scores and were found to be best molecules against the target site of the Nsp9 replicase. Out of these, Conivaptan exhibited the best binding energy (-8.4 Kcal/mol) with Nsp9 replicase, interacting with the CYS74, LEU107, LEU113, ALA108, LEU5, ASN34, LEU98, ASN96, LEU98, PHE41, THR36, ALA9, LEU104, VAL8, ALA108, ASN99 and SER6 amino acid residues at the active site ( Figure 3A) . Moreover, Conivaptan showed strong interaction with Nsp9 replicase at the active site through Hbond with VAL42 amino acid ( Figure 5A ). Conivaptan was the first of this class of FDA-approved arginine vasopressin antagonists for the management of hypervolemic and euvolemic hyponatremia (Ghali et al., 2009) . However, the most common side effects of Conivaptan include allergic reactions, fluid or electrolyte problems, signs of high or low blood pressure, headache and throat pain (Ghali et al., 2009 ). Telmisartan exhibited (-8.1 Kcal/mol) binding affinity with Nsp9 replicase interacting with the ARG100, LEU98, PHE9, MET102, PHE41, ASN34, THR36, LEU113, LEU107, ALA108, VAL8, PRO7, LEU104, PHE76, LEU5, GLU4, SER6, CYS74 and PHE91 amino acid residues ( Figure 3B ). Telmisartan, an antagonist of angiotensin II receptor is highly selective for angiotensin II receptors type 1. It is a useful therapeutic choice in the management of patients suffering from hypertension. (Miura et al., 2011) . Phaitanthrin D showed (-7.9 Kcal/ mol) binding energy with Nsp9 replicase and interacted with the 6W4B. PHE76, CYS74, LEU89, LEU104, LEU107, GLY105, MET102, SER6, VAL8, PRO7, ALA108 and LEU113 amino acid residues ( Figure 3C ). Phaitanthrin D is natural alkaloid found that exhibits potent anti-tubercular activity against MDR-TB (Kamal et al., 2015) . The molecular docking analysis in our study showed the inhibition potential of the top three compounds against Nsp9 replicase ranked by binding energy and best ADME properties as being: Conivaptan > Telmisartan > Phaitanthrin D. Docking results of spike protein of SARS-CoV-2 with another three compounds (Tegobuvir, Bromocriptine and Baicalin) showed best binding energy and were found to be best molecules at the target site of the protein. Out of the 15 compounds, Tegobuvir exhibited the binding energy (-8.1 Kcal/mol) interacting with the PRO337, ALA344, ASN343, PHE342, PHE347, PHE338, GLY339, GLU340 and VAL341 amino acid residues of spike protein ( Figure 4A ). Tegobuvir showed strong interaction with spike protein at the active site through H-bond with the ARG355 and ARG466 amino acids ( Figure 5B ). Tegobuvir is a non-nucleoside inhibitor of hepatitis C virus (HCV) RNA replication with proven antiviral activity in the patients suffering from chronic genotype 1 HCV infection. Tegobuvir is an analog of imidazopyridine class inhibitors that selectively targets HCV (Vliegen et al., 2015) . However, the most common side effects of Tegobuvir includes cough, dizziness, fatigue and dry mouth (Vliegen et al., 2015) . Bromocriptine functions as a serotonin modulator and postsynaptic dopamine receptor clinically used to treat Parkinson's disease (Kato et al., 2016) . Bromocriptine has also shown antiviral activity against dengue virus replication (Kato et al., 2016) . Bromocriptine exhibited (-7.7 Kcal/ mol) binding affinity with the 6LZG, ASN450, LEU452, ILE468, TYR351, ALA352, SER349, LYS356, GLU340, ASN354, VAL341, THR345, ARG346, PHE347, ALA348 and SER349 amino acid residues of spike protein ( Figure 4B ). Baicalin exhibited (-7.6 Kcal/mol) binding affinity interacting with the 6LZG, ASN450, ARG346, ALA344, PHE342, GLU340, VAL341, SER399, ASN354, TRP353, ARG466, ILE468, ALA352, PHE400 and PHE347 amino acid residues of spike protein ( Figure 4C ). Baicalin is a flavonoid derived from Scutellaria baicalensis. Baicalin also exhibits a potent inhibitory effect against viruses such as anti-influenza virus and against chikungunya virus (Chu et al., 2015) . The molecular docking analysis ranked the three compounds based on their binding energy and ADME properties as follows: Tegobuvir > Bromocriptine > Baicalin. In addition to the three best compounds, Conivaptan (-7.4 Kcal/mol), Phaitanthrin D (-7.2 Kcal/mol) and Telmisartan (-7.2 Kcal/mol) also exhibited good docking scores against the spike protein suggesting that these compounds could potentially target both Nsp9 replicase as well as the spike protein. Molecular dynamics study of Conivaptan with Nsp9 replicase ( Figure 6 ) and Tegobuvir with spike protein (Figure 7) for 100 ns showed strong interactions and stability between proteins and their ligands at the active domain of proteins interacting through water bridges, hydrophobic interactions, and H-bonds. The molecular dynamic study strongly validated the molecular docking data of protein ligand interaction. The major criteria for the evaluation of the likeliness of the drug is "Lipinski's rule of five" suggesting that if a specific ligand with a certain pharmacological and biological activity has chemical and physical properties that would make it as a chosen option for orally active drug for humans (Brito, 2011) . Lipinski's rule describes the molecular properties that are crucial for pharmacokinetics of a drug in the human body for example; absorption, distribution, metabolism, and excretion (ADME) (Brito, 2011) . If three or more of Lipinski's rule of five are violated, the rule of drug likeliness is discarded and the drug is not considered further for use in treatment. ADME studies of the selected 15 compounds showed that out of 15, all virtual hits were successful at passing the test filters. In view of the current outbreak of the novel coronavirus and rising death toll scenario, novel drug discovery is a challenge constrained by time. Drug repurposing could greatly aid the development of therapeutic drugs for the effective management of COVID-19. Structure-based drug design approaches have developed into valuable drug discovery tools, owing to their synergy and versatility. Here, we described structurebased drug repurposing of a collection of FDA-approved antiviral, anti-malarial, anti-parasitic, anti-fungal, anti-tuberculosis and active phytochemical compounds against Nsp9 replicase and spike protein of SARS-CoV-2. Several molecules were identified as potent inhibitors of Nsp9 replicase (Conivaptan, Telmisartan and Phaitanthrin D) and spike protein (Tegobuvir, Bromocriptine, and Baicalin) of SARS-CoV-2. Interestingly, Conivaptan, Phaitanthrin D and Telmisartan showed good binding affinity with both Nsp9 replicase and the spike protein, suggesting the potential of utilizing these compounds for the inhibition of multiple molecular targets in SARS-CoV-2. We propose that these compounds might be applicable in the management of COVID-19 and should be investigated as potential leads in the drug development against SARS-CoV-2. Further in vitro and in vivo validation of these compounds is warranted. Novel 2019 coronavirus structure, mechanism of action, antiviral drug promises and rule out against its treatment The coronavirus Nucleocapsid is a multifunctional protein Pharmacokinetic study with computational tools in the medicinal chemistry course In-silico interactions of active phytochemicals with c-Myc EGFR and ERBB2 oncoproteins Innovation and trends in the development and approval of antiviral medicines: 1987-2017 and beyond Role of Baicalin in anti-influenza virus A as a potent inducer of IFN-gamma SwissADME: a A free web tool to evaluate pharmacokinetics, drug-likeness and medicinal chemistry friendliness of small molecules Small-molecule library screening by docking with PyRx An investigation into the identification of potential inhibitors of SARS-CoV-2 main protease using molecular docking study Crystal structure of the SARS-CoV-2 non-structural protein 9 SARS-CoV-2 RNA dependent RNA polymerase (RdRp) targeting: aAn in silico perspective Coronaviruses: An overview of their replication and pathogenesis Conivaptan and its role in the treatment of hyponatremia. Drug Design Remdesivir in the treatment of coronavirus disease 2019 (COVID-19): a A simplified summary Clinical features of patients infected with 2019 novel coronavirus in Wuhan. Lancet Synthesis and biological evaluation of phaitanthrin congeners as antimycobacterial agents Coronavirus main proteinase (3CLpro) structure: Basis for design of anti-SARS drugs Novel antiviral activity of bromocriptine against dengue virus replication Trageting SARS-CoV-2: a A systemic drug repurposing approach to identify promising inhibitors against 3C-protease and 2'-O-ribose methyltransferase Discovery of new hydroxyethylamine analogs against 3CLpro protein target of SARS-CoV-2: Molecular docking, molecular dynamics simulation and structure-activity relationship studies In silico identification of potent FDA approved drugs against coronavirus COVID-19 main protease: A drug repurposing approach Viral membranes. Encyclopedia of vVirology Structure, function, and evolution of coronavirus spike proteins Identification of a novel cleavage activity of the first papain-like proteinase domain encoded by open reading frame 1a of the coronavirus avian infectious bronchitis virus and characterization of the cleavage products Potential anti-viral aFctivity of approved repurposed drug against main protease of SARS-CoV-2: aAn in silico based approach Angiotensin II type 1 receptor blockers: cClass effects vs. Molecular effects IMPAAT: A curated database of Indian Medicinal Plants AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility Genotype and phenotype of COVID-19: Their roles in pathogenesis Viral and cellular mRNA translation in Coronavirus-infected cells Open Babel: An open chemical toolbox Software for molecular docking: A review Understanding the Molecular Mechanism (s) of SARS-CoV2 Infection and Propagation in Human to Discover Potential Preventive and Therapeutic Approach COVID-19 Coronavirus spike protein analysis for synthetic vaccines, a peptidomimetic antagonist, and therapeutic drugs, and analysis of a proposed Achilles' heel conserved region to minimize probability of escape mutations and drug resistance Maestro-desmond interoperability tools Schr€ odinger Release 2020-1: Protein Preparation Wizard; Epik, Schr€ odinger, LLC Ligand docking and binding site analysis with PyMOL and Autodock/vina The nsp9 Replicase Protein of SARS-coronavirus Of chloroquine and COVID-19 In vitro combinations containing Tegobuvir are highly efficient in curing cells from HCV replicon and in delaying/preventing the development of drug resistance Structure, function, and antigenicity of the SARS-CoV-2 spike glycoprotein Structure of MERS-CoV spike glycoprotein in complex with sialoside attachment receptors. nNature sStructural & mMolecular bBiology Authors acknowledge Department of Science and Technology-Science and Engineering Research Board (DST-SERB), Government of India for partial financial support. No potential conflict of interest was reported by the authors. Board (DST-SERB) funded research grant (ECR/2016/001489), Govt. of India.