key: cord-0819830-0unhpouo authors: Chandel, Vaishali; Tripathi, Garima; Nayar, Seema A.; Rathi, Brijesh; Kumar, Abhijeet; Kumar, Dhruv title: In silico identification and validation of triarylchromones as potential inhibitor against main protease of severe acute respiratory syndrome coronavirus 2 date: 2021-05-03 journal: Journal of biomolecular structure & dynamics DOI: 10.1080/07391102.2021.1918255 sha: 0c2eb80f9474e3f1d2500f7b59552da2baaa5604 doc_id: 819830 cord_uid: 0unhpouo The ongoing pandemic due to severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) caused COVID-19 has emerged as a severe threat to the life of human kind. The identification and designing of appropriate and reliable drug molecule for the treatment of COVID-19 patients is the pressing need of the present time. Among different drug targets, the main protease of SARS-CoV-2 is being considered as most effective target. In addition to the drug repurposing, different compounds of natural as well as synthetic origins are being investigated for their efficacy against different drug targets of SARS-CoV-2 virus. In that context, the chromone based natural flavonols have also exhibited significant antiviral properties against different targets of SARS-CoV-2. The in silico studies presented here discloses the efficacy of triarylchromones (TAC) as potential inhibitor against main protease of SARS-CoV-2. The molecular docking and ADMET study performed using 14 arylchromones which could easily be accessed through simple synthetic protocols, revealed best binding affinities in case of TAC-3 (–11.2 kcal/mol), TAC-4 (–10.5 kcal/mol), TAC-6 (–11.2 kcal/mol), TAC-7 (–10.0 kcal/mol). Additional validation studies including molecular dynamics simulation and binding energy calculation using MMGBSA for protein ligand complex for 100 ns revealed the best binding interaction of TAC-3, TAC-4, TAC-6, TAC-7 against main protease of SARS-CoV-2. Moreover, the in vitro and preclinical validation of identified compounds will help us to understand the molecular mechanisms of regulation of TACs against SARS-CoV-2. Communicated by Ramaswamy H. Sarma The ongoing pandemic due to severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has emerged as one the most contagious disease of last several decades (Rohit et al., 2020; Zheng, 2020; . Although various structural as well as biological aspects of this virus has already been explored but (Portelli et al., 2020; V'kovski et al., 2020) but several information regarding the specific drug targets, mechanism of action are yet to be established to discover an appropriate drug therapy. SARS-CoV-2 is an enveloped positive-sense RNA viruses, is one of the member viruses of Coronaviridae family and Coronavirinae subfamily (Astuti & Ysrafil, 2020; Yang et al., 2020) . As per the current data provided by the World Health Organization (WHO), more than 54 millions confirmed cases and approx. 1.3 million deaths have been reported so far across the world since its first report from Wuhan in Dec. 2019, China and respiratory related infection and complications have emerged as one of the major cause of the death (Vincet & Taccone, 2020 ; http:// covid19who.int/). Apart from the lower respiratory tract infections, other organs such as kidney, liver, heart have also found to be greatly affected due to the infections due to SARS-CoV-2 virus (Gavriatopoulou et al., 2020; . In order to trounce the existing health crisis, different strategies such as drug repurposing (Guy et al., 2020; Harrison, 2020; Serafin et al., 2020) , discovery of new vaccines (Krammer, 2020; Tregoning et al., 2020 ; https://www. nature.com/articles/d41586-020-03248-7; https://www.pfizer. com/news.press-release/press-release-detail/pfizer-and-biontechannounce-vaccine-candidate-against) and convalescent plasma therapy (Focosi et al., 2020) are being employed. In that direction, immense efforts are also being made towards the discovery of effective therapeutic agents which could successfully bind with the different drug targets such as the spike (S) protein, enzymes such as proteases, viral RNA etc. and inhibit different steps of the infection (Akaji & Konno, 2020; Coelho et al., 2020; Preeti et al., 2020; Rane et al., 2020; Zhao et al., 2021; . For examples the spike (S) protein of SARS-CoV-2 interacts with different receptors such as Table 1 . List of arylchromones as potential drug candidate against main protease of SARS-CoV-2 S. no. Diarylchromone (DAC) S. no. Triarylchromone (TAC) angiotensin-converting enzyme 2 (ACE-2), TM protease serine 2 (TMPRSS-2) etc. present on the host cell and facilitates the viral entry (Ali & Vijayan, 2020; . Similarly, proteases such as Mpro or 3CL-protease are also being explored as excellent drug targets due to their involvement in enhancing viral loads and the growth of virus Zhang et al., 2020) . Despite of the growing understanding about the potential drug targets and fast-track research towards the development of the therapeutic agent, the discovery of appropriate and widely applicable drug therapy is still awaited. Some of the vaccines such as mRNA-1273 and BNT162b2 have exhibited promising results so far (Jackson et al., 2020; Walsh et al., 2020) . Except m-RNA based some of these vaccine candidates, no other effective and robust drug therapy could be discovered. Since the surfacing of the first case in China, the efficacy of several natural products as well as FDA-approved drugs of diverse verities has rigorously been investigated. For example, the effectiveness of drugs such as chloroquines, lopinavir, remdisivir, dexamethasone, ivermectin etc. have been investigated so far against COVID-19 but none of these could be established as an effective drug against SARS-CoV-2 (Cao & Yang, 2020; Chaccour et al., 2020; Ferner & Aronson, 2020; Lammers et al., 2020; Wang et al., 2020) . Similarly, the efficacy of natural products and phytochemicals against COVID-19 and other viruses has also been explored (Aucoin & Cooley, 2020; DAC-3 -6.6 4. DAC-4 -8.4 5. DAC-5 -8.0 6. DAC-6 -7.7 7. DAC-7 -6.8 8. TAC-1 -8.9 9. TAC-2 -8.5 10. TAC-3 -11.2 11. TAC-4 -10.5 12. TAC-5 -8.7 13. TAC-6 -11.2 14. TAC-7 -10.0 Kumar et al., 2020; Straughn & Kakar, 2020; Verma et al., 2020) . In particular, the therapeutic values of chromone as core scaffolds are well established in the literature (Gaspar et al., 2014; Keri et al., 2014) . Interestingly chromone scaffold containing phytochemicals such as quercetin, fisetin, etc. which are part of our dietary supplements, have also exhibited their efficacy against different drug targets of SARS-CoV-2 (Derosa et al., 2020; Seri et al., 2020) . In view of the immense therapeutic importance of these chromone containing heterocycles, especially as anti-viral agent, we envisaged that such therapeutic activity could be enhanced by functionalizing the chromone scaffold with appropriate functional groups and moieties. Flavoxate, Pranlukast, Disodium cromoglycate (DSCG) etc. are examples of synthetically produced drugs with chromone scaffold which have displayed excellent efficacy in the treatment of urinary and asthmatic disorders respectively (Gaspar et al., 2014; Reis et al., 2017) . Therefore, the aim of the present in silico study was to investigate the efficacy of newly designed di-and triaryl chromones (Table 1 ) which could easily be accessed following literature known reaction protocols such as metal-catalyzed cross-coupling and C-H activation reactions (Choi et al., 2016; Kim et al., 2012; Rao & Kumar, 2014) , against the SARS-CoV-2 main protease (6LU7) as potential drug target. 6LU7 is the major protease is the major protein in SARS-CoV-2 that has been structured and repositioned in the PDB database recently and is widely accessible around the globe. As shown in Figure 1A , it represents the ribbon structure representation and (B) surface representation of COVID-19 main protease (6LU7) with interacting chain A and B. It is important to underline that compared to the natural product; the synthetic analogues allow the large scale synthesis in relatively short span of time. In addition to that, the pharmacological activities of these could easily be modulated through modification in the substituent. The SARS-CoV-2 virus main protease (PDB ID-6LU7) structure was obtained from the RCSB PDB (protein data bank) database (http://www.rcsb.org). Prior to docking or analysis, the solvation parameters, charge assignment, and fragmental volumes to the SARS-CoV-2 main protease was done using the Autodock Tool 4 (ADT) (Morris et al., 2009; Sumit et al., 2020; Vivek et al., 2020) . Further optimization of the protein molecule was done using Autodock Tool for the molecular docking . The selected compounds in 3D SDF format were screened in order to select the best hits. The 2D ligand structures were designed using Chemsketch program ( Figure 2 ; Table 2 ). Avogadro program was used to optimize the ligands. Open Babel was used to convert the ligands into the PDB format. The ligands were first optimized and converted to PDBQT format in order to further simplify the process using the PyRx virtual screening tool-python prescription 0.8. PyRx software was used for the purpose of molecular screening of all the library of compounds by autodock wizard as the engine for molecular docking (Morris et al., 2008) . The ligands were considered to be flexible during the docking period and the protein was considered to be rigid. Auto Grid engine in PyRx was used for the generation of configuration file for the grid parameters. The PyRx application was also used to predict/understand the amino acids residues present in the active site of the protein that interact with the ligands. The results less than 1.0A˚in positional root-mean-square deviation (RMSD) were clustered together and considered ideal for identifying the favourable binding. The most negative (highest binding energy) was considered as the best candidate with maximum binding energy Rashmi et al., 2020) . Pymol version 2.3.4 was used for the visual analysis of the docking site and the validation of results were done using Autodock-Vina (Rauf et al., 2015) . The ADME properties (absorption, distribution, metabolism, excretion, and toxicity) of the selected compound were calculated using online Swiss ADME program (Diana et al., 2007) . The major parameters for ADMET associated properties includes Lipinski's rule of five (H bond acceptor, H bond donor, molecular weight, water/octanol partition coefficient), pharmacokinetic properties, the solubility of the drug, and drug likeliness were considered. The values of the observe properties are presented in Table 3 . Selected complexes were prepared prior to MD simulation in the protein preparation wizard and Prime module of Schr€ odinger suite Schr€ ondinger, 2020; Schrondinger, 2016; Schrondinger, 2020-1) . Removal of steric clashes and strained bonds/angles were done by performing a restrained energy minimization, allowing movement in heavy atoms up to 0.3 Å. Extensive 100 ns MD simulation was carried out for both complexes through Desmond (D. E. Shaw Research, New York, NY, 2015) to access the binding stability of compounds with their respective protein complex. The standard fixed-charged force fields were used to optimize the complex. These complexes were solvated in TIP3P water model and 0.15 M NaCl to mimic a physiological ionic concentration. The full system energy minimization step was done for 100ps. The MD simulation was run at constant 300 K temperature, standard pressure (1.01325 bar), within an orthorhombic box with buffer dimensions 10 Â 10 Â 10 Å 3 and NPT ensemble. The energy (kcal/mol) was recorded at interval of 1.2 ps. The protein-ligand complex system was neutralized by balancing the net charge of the system by adding Na þ or Clcounter ions. The Nose-Hoover chain and Martyna-Tobias-Klein dynamic algorithm was used maintain the temperature of all the systems at 300 K and pressure 1.01325 bar, respectively. The arylchromones (1-14, Table 1 ) selected for our purposes could easily be prepared using literature known synthetic protocols mainly under the metal-catalyzed cross-coupling reaction protocols. The potential drug likeness and efficacy of these functionalized chromones were investigated using in silico studies including molecular docking, ADME analysis and molecular dynamics simulation studies which have separately been described in the following sections. Molecular docking study reveals the strength and binding energy of a specific ligand by which a compound interacts with and binds to the active site pocket of a target protein. A compound with lesser binding energy is considered as a possible drug candidate. In order to understand the possible effect of the compounds against main protease 6LU7, molecular docking study of 14 active compounds was performed (Table 2) . Docking results of main protease 6LU7 with selected 4 compounds (TAC-3, TAC-4, TAC-6, TAC-7) revealed favourable binding energy and were observed to be as the best molecules at the major target site of the protein. The chemical structure of the selected 4 active compounds is shown in Figure 2 . The molecular docking study and visualisation of main protease 6LU7 binding TAC-3, TAC-4, TAC-6, and TAC-7 are shown in Figures 3-6 , respectively. Out of the four compounds, TAC-6 exhibited the best docking score (binding energy) i.e. À11.2 Kcal/mol with SARS-CoV-2 Main Protease 6LU7. The 2D schematic representation of protein-ligand interaction is represented by LIGPLOT representing the major amino acid residues of compound 13 binding on the active site pocket of the protein 6LU7 ( Figure 5 ). TAC-3 exhibited (-11.1 kcal/mol) binding energy with main protease 6LU7. TYR118, HIS41, LEU27, HIS163, CYS117, PHE140 are the amino acid residues participating in the interaction at the binding pocket of COVID-19 ( Figure 3) . The 2D schematic representation of protein-ligand interaction is represented by LIGPLOT representing the major amino acid residues of TAC-3 binding on the active site pocket of the protein 6LU7. TAC-4 exhibited (-10.5 kcal/mol) binding energy with main protease 6LU7. ARG131, PHE134, THR292, ILE200, and PRO132 are the amino acid residues participating in the interaction at the binding pocket of COVID-19 (Figure 4) . The 2D schematic representation of protein-ligand interaction is represented by LIGPLOT representing the major amino acid residues of compound 10 binding on the active site pocket of the protein 6LU7. TAC-7 exhibited (-10.0 Kcal/mol) binding energy with main protease 6LU7. PHE291, GLN127, ARG4, and LYS5 are the amino acid residues participating in the interaction at the binding pocket of COVID-19 ( Figure 6 ). The 2D schematic representation of protein-ligand interaction is represented by LIGPLOT representing the major amino acid residues of TAC-4 binding on the active site pocket of the protein 6LU7The molecular docking study in our study showed the inhibitory potential of 4 compounds, ranked by affinity (DG); TAC-6 > TAC-3 > TAC-4 > TAC-7. Next, to get an insight about the drug-likeness properties of the lead compounds, all the 14 compounds were screened for ADME properties using swiss ADME programme and the results are shown in Table 3 . The major criteria to understand the drug likeness properties of a particular compound involves Lipinski's rule of five (ROF) and if a specific lead compound with a certain pharmacological and biological activity has chemical and physical properties would make it a likely orally active drug in humans. Lipinski's rule of five suggests the molecular properties which are critical in order to understand the drug's pharmacokinetics in the human body for example absorption, distribution, metabolism, and excretion (ADME). An ideal drug following Lipinski's rule of five criteria are (i) molecular mass of a compound less than 500 Daltons, (ii) no more than 5 hydrogen bond donors, (iii) no more than 10 hydrogen bond acceptors, (iv) an octanol-water partition coefficient log P not greater than 5. Three or more than 3 violations of the Lipinski's rule do not fit into the criteria of drug likeliness and ideally it is not generally considered for further drug discovery. However, it is very important to mention that Lipinski's rule of five is not applicable to certain class of natural products and drugs which are substrates of biological transporters such as antibodies and proteins and antibodies and are successfully FDA approved and widely distributed in the market. Although in our study, all the four triarylchromones (TAC 3, 4, 6 and 7) are not following Lipinski's rule of five. However, the selected compounds docked against main protease 6LU7 have been validated using molecular dynamics simulation studies and the results revealed that the selected lead compounds were identified as the most suitable target against SARS-CoV-2 main protease 6LU7. Therefore, this preliminary screening process of potential compounds would facilitate in providing the fast in silico analysis towards development of therapeutic agent against SARS-CoV-2 (COVID-19). All four complexes (complex TAC-3, TAC-4, TAC-6, and TAC-7) were simulated for 100 ns and the final structure exhibited good stereochemical geometry of the residues as analyzed by Ramachandran map (Figure 7A-D) . The proteins in all four complexes were stabled with only 1-3 residues in outlier region. Both complex TAC-3 and complex TAC-7 have only one residue (Asp33) in outlier region while complex TAC-4 and complex TAC-6 have 3 (Asp33, Asn84, Tyr154) and 2 (Asp33, Asn84), respectively. It was found that Asp33 is the most common residue lie in outlier region in all four complexes. These number of residues in favoured, additional allowed, generously allowed and outlier region along with their percentage given in Table 4 . The Ramachandran plot showed good steriochemical geometry of residues of protein for all four complexes complex. The binding free energy for these four docked complexes were also calculated by MMGBSA as given in Table 5 where compound in complex TAC-3 has lowest binding free energy of À78 kcal/mol among all compounds. This generalized Born model and solvent accessibility method of force field optimization to elicit free energies from structural information circumventing the computational complexity of free energy simulations. Molecular dynamics simulation study of all four complexes for 100 ns was done to analyze the stability of docked ligand within the binding pocket where ligand interacted with active domain of proteins through water bridges, hydrophobic interactions, salt bridge and H-bonds. The Ca RMSD is not deviated so much in all four complexes (Figure 8) . It suggests the structural stability of protein in all four complexes but ligand RMSD plot showed least deviation in ligand in complex TAC-3 as compare to other three complex TAC-4, 6, and 7 ( Figure 9A-D) . Ligand RMSD indicates how stable the ligand is with respect to the protein and its binding pocket. The plot ( Figure 9A-D) show the RMSD of a ligand when the protein-ligand complex is first aligned on the protein backbone of the reference and then the RMSD of the ligand is measured. The similarly, ligand root mean square fluctuation (RMSF) is also least fluctuating for ligand within complex TAC-3 among all complexes ( Figure 10A-D) . It indicates conformational stability of compound in complex TAC-3. The RMSF is useful for characterizing local changes along the protein chain. TAC-3 showed less fluctuations than others, indicates that the ligand is stable in TAC-3 complex along the protein chain during simulation. The molecular dynamic study strongly validated the molecular docking data of protein ligand interaction for complex TAC-3 as compare to other complexes. It is very clear after trajectory analysis that ligand in complex TAC-3 is very stable than others. Figure 11A shows protein RMSF of ligand-protein complex interacting with binding site residues of protein where Thr25, Thr26, His41, Met49, Leu50, Phe140, Leu141, Asn142, Gly143, Ser144, Cys145, His163, His164, Met165, Glu166, Leu167, Pro168, His172, and Ala191residues are interacting to compound where all binding site residues have RMSF <2Å except Leu50 shown by green lines matching the residue index, while the pink and blue bands indicate protein secondary structures helices and b-strands, respectively. The type of interaction between compound and protein can be seen in Figure 11B where 3 kinds of interaction between ligand and protein such as H-bond (green), water bridge (blue), and hydrophobic interaction (grey) can be seen. There was no ionic interaction in complex. The bottom panel of Figure 11C shows the total number of contacts, the protein makes with the ligand over the course of the trajectory. While the bottom panel shows which residues interact with the ligand in each trajectory frame. Some residues make more than one specific contact with the ligand, which is represented by a darker shade of orange, according to the scale to the right of the plot. Even binding free energy after MD simulation was further calculated and found that ligand in TAC-3 has lowest binding energy over others (Table 4) . Moreover, TAC-4 ( Figure 12A -C), TAC-6 Figure 13 . Complex TAC-6: (A) RMSF plot of residue number and C-alpha of spike protein at 100 ns simulation. It predicts the fluctuations of the C-alpha atoms; residues are shown in three letter code with their respective number in green colour belong to binding site residues interacting to compound shown in green line. (B) The histogram of protein-ligand contact over the course of the trajectory (C) A timeline representation of the interactions and contacts (salt bridge, hydrophobic, ionic, water bridges) with compound. The top panel shows the total number of specific contacts the protein-ligand complex in each trajectory frame. The bottom panel shows binding site residues interact with the ligand in each trajectory frame. ( Figure 13A -C) and TAC-7 ( Figure 14A -C) also showed strong stability at the active site of the protein (main protease of SARS-CoV-2). The identification of potent/effective molecules against infectious diseases including COVID-19 are always appreciable. As we already know by now, COVID-19 has hugely impacted the socio-economic status of the several countries across the world. In this situation, the identification of an effective and reliable drug molecule against COVID-19 targeting through main protease of SARS-CoV-2 will help us to develop potential therapeutic approach which could save millions of people globally. In this in silico study, we have identified triarylchromones (TAC) derivatives as potent inhibitors against main protease of SARS-CoV-2 which can be further tested against the SARS-CoV-2 at in vitro and in vivo model for the development of therapeutic approach for COVID-19. Design and evaluation of anti-SARS-coronavirus agents based on molecular interactions with the viral protease Dynamics of the ACE2-SARS-CoV-2/SARS-CoV spike protein interface reveal unique mechanisms Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2): An overview of viral structure and host response The effect of quercetin on the prevention or treatment of COVID-19 and other respiratory tract infections in humans: A rapid review A trial of lopinavir-ritonavir in adults hospitalized with severe Covid-19 Nebulized ivermectin for COVID-19 and other respiratory diseases, a proof of concept, dose-ranging study in rats In silico identification of potent FDA approved drugs against coronavirus covid-19 main protease: A drug repurposing approach In-silico interactions of active phytochemicals with c-Myc EGFR and ERBB2 oncoproteins Unraveling innate substrate control in site-selective palladium-catalyzed C-H heterocycle functionalization SwissADME: A free web tool to evaluate pharmacokinetics, drug-likeness and medicinal chemistry friendliness of small molecules Drugs against coronavirus COVID-19 main protease: A drug repurposing approach Chloroquine and hydroxychloroquine in covid-19 Convalescent plasma therapy for COVID-19: State of the art Chromone: A valid scaffold in medicinal chemistry Organ-specific manifestations of COVID-19 infection Rapid repurposing of drugs for COVID-19 Coronavirus puts drug repurposing on the fast track Clinical features of patients infected with 2019 novel coronavirus in Structural and functional properties of SARS-CoV-2 spike protein: Potential antivirus drug development for COVID-19 An mRNA vaccine against SARS-CoV-2 -Preliminary report Palladium-catalyzed oxidative arylation of chromones via a double C-H activation: An expedient approach to flavones SARS-CoV-2 vaccines in development Phytochemicals as potential curative agents against viral infection: A review Dexamethasone nanomedicines for COVID-19 AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility Using Autodock for receptor ligand docking Exploring the structural distribution of genetic variation in SARS-CoV-2 with the COVID-3D online resource Targeting SARS-CoV-2 spike protein of COVID-19 with naturally occurring phytochemicals: An in silico study for drug development Targeting virus-host interaction by novel pyrimidine derivative: An in silico approach towards discovery of potential drug against COVID-19 Pd-catalyzed atom-economic couplings of triarylbismuth reagents with 2-bromo-and 2,6-dibromochromones and synthesis of medicinally important fisetin Development of potential proteasome inhibitors against Mycobacterium tuberculosis Ligand docking and binding site analysis with pymol and autodock/vina Chromone as a privileged scaffold in drug discovery: Recent advances Deciphering the SSR incidences across viral members of Coronaviridae family Schr€ odinger Release 2020-1: Protein Preparation Wizard; Epik, Schr€ odinger, LLC Drug repositioning is an alternative for the treatment of coronavirus COVID-19 Flavonoids with inhibitory activity against SARS-CoV-2 3CLpro Withaferin A: A potential therapeutic agent against COVID-19 infection Discovery of new hydroxyethylamine analogs against 3CLpro protein target of SARS-CoV-2: Molecular docking, molecular dynamics simulation, and structure-activity relationship studies Vaccines for COVID-19 Coronavirus biology and replication: Implications for SARS-CoV-2 Structure-based drug repurposing for targeting Nsp9 replicase and spike proteins of severe acute respiratory syndrome coronavirus 2 Anti-SARS-CoV natural products with the potential to inhibit SARS-CoV-2 (COVID-19) Understanding pathways to death in patients with COVID-19 Novel 3,4-diarylpyrazole as prospective anti-cancerous agents Safety and immunogenicity of two RNA-based COVID-19 vaccine candidates Remdesivir in adults with severe COVID-19: A randomised, double-blind, placebo-controlled, multicentre trial Analysis of therapeutic targets for SARS-CoV-2 and discovery of potential drugs by computational methods SARS-CoV-2: Characteristics and current advances in research Crystal structure of SARS-CoV-2 main protease provides a basis for design of improved a-ketoamide inhibitors Targeting RNA G-quadruplex in SARS-CoV-2: A promising therapeutic target for COVID-19? SARS-CoV-2: An emerging coronavirus that causes a global threat A novel coronavirus from patients with pneumonia in China Identification of SARS-CoV-2 3CL protease inhibitors by a quantitative high-throughput screening We thank our lab members and collaborators for carefully reading the manuscript and contributing valuable inputs for improving the manuscript. The authors declare no conflict of interest