key: cord-0894835-dursgimt
authors: Pitsillou, Eleni; Liang, Julia; Yu Meng Huang, Helen; Hung, Andrew; Karagiannis, Tom C.
title: In silico investigation to identify potential small molecule inhibitors of the RNA-dependent RNA polymerase (RdRp) nidovirus RdRp-associated nucleotidyltransferase domain
date: 2021-09-16
journal: Chem Phys Lett
DOI: 10.1016/j.cplett.2021.138889
sha: bba23473dc903cdecd2cb7cf76457cc436a3d510
doc_id: 894835
cord_uid: dursgimt

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) RNA-dependent RNA polymerase (RdRp) is a promising target for antiviral drugs. In this study, a chemical library (n = 300) was screened against the nidovirus RdRp-associated nucleotidyltransferase (NiRAN) domain. Blind docking was performed using a selection of 30 compounds and nine ligands were chosen based on their docking scores, safety profile, and availability. Using cluster analysis on a 10 microsecond molecular dynamics simulation trajectory (from D.E. Shaw Research), the compounds were docked to the different conformations. On the basis of our modelling studies, oleuropein was identified as a potential lead compound.

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) belongs to the betacoronavirus genus and is the infectious agent that causes coronavirus disease 2019 (COVID-19) [1]. To date, there are seven coronaviruses that can infect humans and they are divided into two groups [1]. The common human coronaviruses (hCoVs) generally cause a mild to moderate respiratory infection and include hCoV-229E, hCoV-NL63, hCoV-OC43, and hCoV-HKU1 [1]. Additionally, three hCoVs have been reported to cause severe disease and they include severe acute respiratory syndrome coronavirus (SARS-CoV), Middle East respiratory syndrome coronavirus (MERS-CoV), and SARS-CoV-2 [1].
These three coronaviruses have the capacity to cause lower respiratory infections, which can result in acute lung injury (ALI), acute respiratory distress syndrome (ARDS), and multiorgan failure [2]. The long-term consequences of infection continue to be investigated. The high transmissibility of SARS-CoV-2 and its emerging variants has resulted in strict public health measures being implemented, and a significant amount of attention has been placed on developing vaccines and investigating potential antiviral drugs.

Betacoronaviruses are enveloped viruses that consist of a positive-sense, single-stranded RNA genome [1]. The SARS-CoV-2 genome consists of a replicase complex that is formed by the open reading frames, ORF1a and ORF1b [3]. These two ORFs encode the polyproteins pp1a and pp1ab, which are cleaved to produce the non-structural proteins (nsp1-16) [3]. The SARS-CoV-2 genome also comprises structural and accessory genes. The four major structural proteins are the spike protein (S), nucleocapsid protein (N), envelope protein (E), and membrane glycoprotein (M) [3]. The receptor binding domain of the spike protein attaches to the host cell receptor angiotensin-converting enzyme 2 (ACE2) and this interaction mediates SARS-CoV-2 infection [4].

Non-structural protein 12 (nsp12), which is also known as RNA-dependent RNA polymerase (RdRp), is a crucial component of the replication-transcription complex and catalyses the synthesis of RNA from RNA templates [5]. The RdRp interacts with proteins such as nsp7, nsp8, nsp9 and nsp13 to facilitate virus replication and transcription [5]. The structure of the SARS-CoV-2 RdRp has been determined and comprises a right-hand RdRp domain, a nidovirus RdRp-associated nucleotidyltransferase domain (NiRAN), an interface domain and an N-terminal β-hairpin [6]. The RdRp domain consists of the fingers, palm and thumb subdomains [6].
The SARS-CoV-2 RdRp has also been identified as an ideal target for antiviral drugs. Remdesivir and favipiravir are examples of prodrugs that are being tested for their ability to inhibit the RdRp, as they act as nucleoside analogues and are incorporated into the growing RNA chain [7]. This results in the termination of RNA synthesis. Remdesivir was the first drug to be approved by the U.S. Food and Drug Administration (FDA), despite the conflicting findings from the Solidarity Trial conducted by the World Health Organization [7].

The NiRAN domain has also been of interest and it is conserved in the Nidovirales [8]. This domain was first discovered in the RdRp of the equine arteritis virus (EAV) and it was hypothesised to have RNA ligase activity, nucleotidyltransferase activity, and protein priming function [8]. The potential kinase or phosphotransferase activity of the NiRAN domain is also being investigated and more recently, its interaction with nsp9 has been explored [9]. In addition to developing drugs that inhibit the catalytic activity of the RdRp through covalently binding to the RNA template, the NiRAN domain could be a potential target site for therapeutic agents [10].

Drug repurposing will continue to play an integral role in combating infectious diseases and a number of studies have utilised computational methods to identify potential lead compounds from existing drugs [11]. In this study, molecular modelling tools were used to screen a library of 300 compounds against the NiRAN domain of the SARS-CoV-2 RdRp. The library consisted of pharmacological compounds and natural compounds with antiviral, antioxidant, and anti-inflammatory properties. As mentioned above, the NiRAN domain has nucleotidylating activity, and adenosine diphosphate (ADP), uridine-5′-triphosphate (UTP), and guanosine-5′-triphosphate (GTP) were used as the control ligands. Based on the results, the library was narrowed down to 30 compounds.
The potential lead ligands were subsequently identified through performing blind docking on the RdRp structures and molecular docking on several conformations of the RdRp from a 10 μs trajectory [12].

The cryo-electron microscopy (cryo-EM) structures of the SARS-CoV-2 RdRp were obtained from the RCSB Protein Data Bank (PDB ID: 6M71 and 6XEZ) [6, 13, 14]. The RdRp chain was isolated from the replication-transcription complex, the waters and ligands were removed, and the relevant ions were retained. This included the zinc (Zn2+) ions in both structures. Adenosine diphosphate was the ligand present in the NiRAN domain of the 6XEZ cryo-EM structure and this was used as a control [13]. Likewise, UTP and GTP were used as control compounds.

The chemical structures of ADP, UTP, GTP, and 300 ligands were obtained from the National Center for Biotechnology Information (NCBI) PubChem Database [15]. If unavailable, the chemical structures were obtained from the ChEMBL Database [16]. The library of 300 ligands consisted of 220 phenolic compounds and 13 fatty acids from OliveNet™ [17]. A number of compounds (n = 63) with antimicrobial and anti-inflammatory properties were also utilised.

The cryo-EM structures of the RdRp protein were imported into Maestro and were prepared using the Protein Preparation Wizard of the Schrödinger Suite (version 2020-4) [18]. Similarly, the compounds were imported into Maestro and were prepared using the LigPrep tool. The default settings were utilised and the optimised potentials for liquid simulations (OPLS3e) force field was selected [19]. Using the Glide Receptor Grid Generation protocol, a receptor grid of 20 × 20 × 20 Å was generated around the conserved residues of the NiRAN domain: K73, E83, R116, L119, T120, T123, T206, D208, N209, Y217, D218, G220, D221, and S236 [20]. The 300 ligands and controls were initially screened using the Glide Ligand Docking protocol.
The Glide standard precision (SP) mode was selected for this process. The SP mode allows for compounds to be docked in a timely manner and is more accurate than the high throughput screening option. The promising ligand poses were refined using the OPLS3e force field and this was followed by post-docking minimisation. Based on the results, 30 compounds were examined further.

The ligands were docked to the NiRAN domain using the quantum-mechanics-polarised ligand docking (QPLD) protocol of the Schrödinger Suite for improved docking accuracy [21]. The compounds were initially docked using Glide, and energy calculations were then performed on the resulting protein-ligand complexes using ab initio quantum mechanics (QM) methods. The ligands were re-docked using the charges that were predicted by the QSite software and the poses were ranked in the final stage. The extra precision (XP) mode was chosen for the initial docking and redocking steps, and the QM level was set to accurate for the Jaguar component [22-24]. The GlideScore (kcal/mol) was recorded and the protein-ligand interactions were visualised using the Ligand Interaction Diagram tool.

The P2Rank software package is a template-free tool that predicts ligand-binding sites based on machine learning, and the SARS-CoV-2 RdRp cryo-EM structures were analysed using this program [25].

Blind docking was also performed with the selection of 30 compounds. The goal of blind docking was to investigate whether the ligands would preferentially bind to the NiRAN domain or to any other site in the protein, which may potentially include an allosteric binding site. The structures of the proteins and compounds were imported into PyRx and they were prepared as macromolecules and ligands, respectively [26]. The protein was set as rigid, while all torsions of the ligands were activated. The receptor grid was generated around the entire protein and the exhaustiveness was increased to 2048.
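To illustrate what a receptor grid "around the entire protein" means in practice, the sketch below computes an axis-aligned search box (centre and edge lengths) that encloses every atom of a structure. It is a minimal illustration only, not the procedure used in PyRx: the function name `blind_docking_box` and the `padding` margin are assumptions introduced here, and the coordinates in the usage note are invented.

```python
from typing import Iterable, List, Tuple

Point = Tuple[float, float, float]


def blind_docking_box(coords: Iterable[Point],
                      padding: float = 5.0) -> Tuple[Point, Point]:
    """Return (centre, size) of an axis-aligned box enclosing all atoms.

    `padding` is a hypothetical margin (in Å) added on every side of the
    protein; it is not a value reported in the study.
    """
    pts: List[Point] = list(coords)
    if not pts:
        raise ValueError("no coordinates supplied")
    # Extremes of the atomic coordinates along x, y and z.
    mins = [min(p[i] for p in pts) for i in range(3)]
    maxs = [max(p[i] for p in pts) for i in range(3)]
    # Box centre is the midpoint of the extremes on each axis.
    centre = tuple((lo + hi) / 2.0 for lo, hi in zip(mins, maxs))
    # Edge length is the coordinate span plus the margin on both sides.
    size = tuple((hi - lo) + 2.0 * padding for lo, hi in zip(mins, maxs))
    return centre, size
```

For example, `blind_docking_box([(0.0, 0.0, 0.0), (10.0, 20.0, 30.0)])` gives a centre of `(5.0, 10.0, 15.0)` and a size of `(20.0, 30.0, 40.0)`. A box of this kind is far larger than the 20 × 20 × 20 Å grid used for site-directed docking, which is one reason a higher exhaustiveness setting is typically chosen for blind docking.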
AutoDock Vina was used to perform the blind docking calculations and the jobs were run on Galileo, which is a cloud computing service (Hypernet Labs), and the Spartan High Performance Computing (HPC) system [27-29].

A 10 µs molecular dynamics (MD) simulation trajectory of the SARS-CoV-2 nsp7-nsp8-nsp12 RNA polymerase complex (PDB ID: 6M71) was obtained from the D.E. Shaw Research group and analysed using the Gromacs 2018.2 software package with plug-ins for Visual Molecular Dynamics 1.9.3 [12, 30-32]. Nsp12 was isolated from the protein complex and analysed using the root mean square deviation (RMSD) and root mean square fluctuation (RMSF) analysis tools included in Gromacs 2018.2. The Gromacs clustering tool gmx cluster was utilised to calculate clusters of similar structures based on the RMSD of the protein. The gromos clustering algorithm, as described by Daura et al. [33], was applied. Cluster analysis was performed on the partially disordered N-terminal region of the protein (residues 30-120) for the entire trajectory, where the time interval between frames was 1.2 ns. Using an RMSD cut-off of 0.3 nm to define two structures as neighbours, 15 clusters were obtained. Representative protein structures for the top six clusters were extracted from the trajectory based on the median frame of each group for molecular docking of compounds to the NiRAN domain of RdRp.

The Protein Structure Alignment tool in Maestro was used to align the NiRAN domain of the conformations that were representative of each cluster, using the cluster 1 structure as the reference. The NiRAN domain of the 6XEZ cryo-EM structure was also aligned to the conformation corresponding to cluster 1 for comparison. The RMSD values of the aligned amino acids were recorded.

In a study that was performed on the RdRp of the EAV, the nucleotidylation activity of the NiRAN domain was observed when UTP and GTP were present as substrates [8]. In a recent paper by Slanina et al.
it was demonstrated that the coronavirus NiRAN domains could transfer nucleoside monophosphates to nsp9 and that there was relatively low specificity for a particular NTP substrate [9]. Residues K73, R116, T123, D126, D218 and F219 of the SARS-CoV-2 RdRp have previously been predicted to be essential for the enzymatic activity of the NiRAN domain, and multiple sequence alignment of coronavirus RdRp sequences has revealed that there are a number of conserved residues (Fig. 1) [20].

Using the Glide Ligand Docking protocol, 300 compounds were screened against the NiRAN domain of the SARS-CoV-2 RdRp (PDB ID: 6M71) (Table S1). Lucidumoside C was the flavonoid compound that had the weakest binding affinity and the GlideScore was −0.4 kcal/mol. Delphinidin was predicted to be the strongest binding ligand and the GlideScore was found to be −7.2 kcal/mol. In addition to the library of 300 ligands, ADP, UTP and GTP were used as the control compounds. The GlideScores of these compounds were −6.3, −6.3, and −6.0 kcal/mol, respectively.

Based on this initial screen, 30 compounds with a broad range of binding affinities were selected for further analysis. This allowed for comparison of a range of ligands to ensure that there was reasonable agreement in the ranking according to binding affinities. The commercial availability, approval by the FDA, and known side effects of these compounds were also taken into consideration. They included protease inhibitors, antibiotics, kinase inhibitors, nucleoside analogues, dietary compounds, and compounds with antioxidant and anti-inflammatory properties. For improved docking accuracy and further refinement, the 30 ligands and control compounds were subsequently docked to the NiRAN domain using the QPLD protocol. The GlideScores ranged from −3.6 to −10.8 kcal/mol and the chemical structures of these ligands are provided in Table 1.
In order to evaluate whether the controls and selected compounds would preferentially bind to the active site of the NiRAN domain, blind docking was performed on the cryo-EM structure of the SARS-CoV-2 RdRp. For the 6M71 structure, 20 ligands had poses within the NiRAN domain (Table S2).

In a study by Dwivedy et al., the NiRAN domain was found to assume a kinase-like fold and it is thought that this region may have pseudokinase or phosphotransferase activity [20]. The motif search also predicted the presence of kinase-like motifs and, to explore this further, they docked broad specificity kinase inhibitors to the active site of the NiRAN domain [20]. Sunitinib and sorafenib were predicted to interact with aspartate residues, while SU6656 formed a hydrogen bond with K73 [20]. Using an ADP-Glo Kinase assay kit, Dwivedy et al. were also able to provide evidence that the SARS-CoV-2 RdRp had intrinsic kinase/phosphotransferase-like activity and that the kinase inhibitors significantly reduced its kinase-like activity [20]. Sunitinib, ibrutinib, zanubrutinib, sorafenib and acalabrutinib were the kinase inhibitors examined in the current study and, when examining the protein-ligand interactions, it was apparent that the ligands also formed hydrogen bonds with negatively charged aspartate residues in the NiRAN domain (Table S3). It is important to note that kinase inhibitors may provide clinical benefit by exerting dual antiviral and anti-inflammatory effects, and the potential side-effects should also be taken into consideration [34].

Furthermore, the antiretroviral protease inhibitors that are used to treat patients with human immunodeficiency virus/acquired immunodeficiency syndrome (HIV/AIDS) have been of interest [35]. Lopinavir, for example, is an inhibitor of the SARS-CoV main protease (Mpro) and in vitro studies have shown that this drug has inhibitory activity against SARS-CoV, SARS-CoV-2, and MERS-CoV [36].
Lopinavir is commonly used in combination with ritonavir, and these inhibitors have been tested in patients admitted to hospital with COVID-19 [37]. Broad-spectrum antibiotics have also played a role in the drug repurposing process, as they may be used for the treatment of co-infections, and their mechanisms of action require further elucidation [38].

Several dietary compounds were also among the 30 selected ligands, with rutin, hellicoside, oleuropein, and cyanidin-3-O-glucoside being phenolic compounds from the OliveNet™ database [17]. Curcumin, which is the major constituent of turmeric, as well as the catechins (epicatechin gallate and epigallocatechin gallate) are also classified as polyphenols [39]. Hypericin is classified as an anthraquinone derivative and is found in St. John's Wort [40]. Over one-third of new molecular entities that are approved by the FDA are natural products and their derivatives, and numerous studies have focused on screening large libraries of phytochemicals against coronavirus proteins to identify potential lead compounds [41]. Various natural compounds have been examined for their ability to target the spike protein, Mpro, papain-like protease (PLpro), and RdRp, and further research is required to validate their antiviral effects and pharmacokinetic properties. Curcumin, piperine, demethoxycurcumin, glycyrrhizic acid, rutin, nicotiflorin, epigallocatechin-3-gallate, and theaflavin are natural compounds that have been identified as potential antiviral drugs against the SARS-CoV-2 RdRp based on in silico analysis [42-44].

Due to the NiRAN domain being a flexible region of the RdRp, cluster analysis was performed on a 10 μs MD simulation trajectory of the RdRp protein complex that was made available by the D.E. Shaw Research group.
Ensemble docking uses MD simulations to generate conformations of the protein for docking calculations, aiming to reproduce the selection of ligands for specific protein conformations that form more thermodynamically favourable protein/ligand complexes [45]. Thus, the aim was to select a subset of conformations where a representative protein structure for each cluster could be used for further docking.

The average RMSD of the nsp12 protein backbone was 0.47 nm over the duration of the trajectory. There was a slight fluctuation in backbone RMSD at approximately 3 µs before stabilising after 4.2 µs (Fig. 2A). RMSF analysis (Fig. 2B) indicated that this may be attributed to flexibility in the partially disordered residues 30-120 encompassing the N-terminal region of nsp12. The most prominent peaks in this region are at residues D61 and D107, located on the outer loops of the protein, with RMSF values of 0.90 nm. It is noted that due to the highly flexible nature of the N-terminal residues, the structure of this region was previously unable to be resolved in SARS-CoV nsp12 [46]. As this was the most flexible region of the protein in proximity to the proposed binding site, this region was selected for cluster analysis.

Cluster analysis was performed to identify the most prevalent conformations in the trajectory for further screening of compounds. Cut-off values were varied between 0.1 and 0.5 nm in increments of 0.1 nm, with clustering analysis performed for each of these values. Based on the distribution of structures captured by each group, a cut-off distance of 0.3 nm for the N-terminal region of the protein was selected. The 8334 frames of the trajectory were divided into 15 clusters. Representative structures from the six most prevalent clusters were used as starting structures for docking with lead compounds. The majority (59.2%) of frames were assigned to cluster 1, followed by 20.7% to cluster 2.
The remaining clusters captured 6.9% (cluster 3), 5.5% (cluster 4), 4.6% (cluster 5), and 1.4% (cluster 6) of frames. Clusters 7 to 15 each captured less than 0.5% of frames, and were thus excluded from analysis.

Over the 10 µs trajectory of the SARS-CoV-2 nsp12, this N-terminal region is initially partially disordered, becoming folded into a stable ordered structure resembling the N-lobe fold of protein kinases, in agreement with the structure of the same complex determined in the presence of a reducing agent [6, 12]. From the heatmap shown in Fig. 2C, the N-terminal residues at the beginning of the trajectory are in conformations consistent with clusters 4, 2, and 6 until approximately 3 µs. From this time point, the protein becomes stable in conformations corresponding to cluster 1, which becomes the most common structure for the remainder of the trajectory. Conformations assigned to cluster 5 emerge at approximately 5 µs, while conformations corresponding to cluster 3 occur 8 µs into the trajectory. It is inferred from this analysis that clusters 1, 5, and 3 may represent conformations of the stable ordered N-terminal region of nsp12. However, it is acknowledged that further analysis will be required to characterise this. For the purpose of the present manuscript, representative structures for each cluster were utilised for molecular docking.

[Caption fragment] ... bridge: italics, π-π interaction: regular font, π-π cation: regular font and underline, hydrogen bonds and salt bridges: bold font and italics, salt bridge and π-π cation: regular font, underline, and italics). The GlideScores (kcal/mol) are provided. Polar residues are coloured blue, positively charged residues are coloured purple, negatively charged residues are coloured red, and hydrophobic residues are coloured green.

The selected 30 compounds and control ligands were docked to the NiRAN domain of the representative structure for cluster 1 (Table S4).
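The neighbour-counting idea behind gromos clustering, as applied to the trajectory above, can be sketched in a few lines. This is a simplified illustration operating on a precomputed pairwise RMSD matrix, not the gmx cluster implementation: the function name `gromos_cluster` and the toy matrix in the usage note are assumptions made here, and details such as tie-breaking and whether the cut-off comparison is inclusive may differ from Gromacs.

```python
from typing import List, Sequence

# Consistency check on the frame count: a 10 µs trajectory sampled every
# 1.2 ns (1200 ps), counting the initial frame at t = 0.
N_FRAMES = 10_000_000 // 1_200 + 1  # 8334, as reported in the text


def gromos_cluster(rmsd: Sequence[Sequence[float]],
                   cutoff: float) -> List[List[int]]:
    """Cluster frames from a symmetric pairwise RMSD matrix.

    Repeatedly pick the frame with the most neighbours within `cutoff`,
    assign it and its neighbours to a new cluster, and remove them from
    the pool. Each returned list starts with the cluster centre.
    """
    remaining = set(range(len(rmsd)))
    clusters: List[List[int]] = []
    while remaining:
        pool = sorted(remaining)
        best_centre, best_members = pool[0], []
        for i in pool:
            # A frame counts as its own neighbour because rmsd[i][i] == 0.
            members = [j for j in pool if rmsd[i][j] <= cutoff]
            if len(members) > len(best_members):
                best_centre, best_members = i, members
        # Put the centre first, followed by its other members.
        cluster = [best_centre] + [j for j in best_members if j != best_centre]
        clusters.append(cluster)
        remaining -= set(best_members)
    return clusters
```

With a toy matrix in which frames 0 to 2 lie within 0.1 nm of one another, frames 3 and 4 lie within 0.2 nm of each other, and the two groups are 0.5 nm apart, a cut-off of 0.3 nm yields `[[0, 1, 2], [3, 4]]`. Because clusters are extracted in order of neighbour count, the first cluster is the most populated one, which matches the way cluster 1 dominates the reported populations.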
Interestingly, the phenolic compounds rutin, oleuropein, and hellicoside from the OliveNet™ database were the top three ligands with the strongest binding affinities. The GlideScores of these ligands were −10.9, −10.0, and −9.9 kcal/mol, respectively. Rutin, oleuropein, and hellicoside predominantly formed hydrogen bonds with the residues of the NiRAN domain, and hellicoside also formed a π-π cation with the amino acid R116. The control compounds ADP, UTP, and GTP had GlideScores between −7.4 and −9.0 kcal/mol (Fig. 3).

Blind docking was performed on the representative structure for cluster 1 using the control compounds and selection of 30 ligands (Table S5). Several poses of ADP (n = 3), UTP (n = 6), and GTP (n = 7) were predicted to be positioned in the NiRAN domain (Fig. 4). Conversely, the ligands sunitinib, tobramycin, hellicoside, rutin, SRT1720, and hypericin were predicted to bind away from this region and had no poses within the NiRAN site.

In addition to the control compounds, nine ligands with a range of binding affinities were selected and were docked to the conformations of the NiRAN domain that were assigned to clusters 2 to 6 for comparison (Table S6). They were indinavir, ritonavir, nelfinavir, sulfasalazine, lopinavir, hypericin, oleuropein, cefotaxime, and sunitinib. There were differences in the GlideScores for each cluster, as well as in the intermolecular bonds that were formed between the ligands and the protein residues. The amino acids that participated in hydrogen bond interactions (maximum distance of 2.8 Å) with each ligand are described in Table 2. When examining the intermolecular bonds that were present between the ligands and the protein structures for each cluster, it was evident that several residues formed part of the NiRAN domain and N-terminal β-hairpin structure.
To compare this region in the representative conformations assigned to the clusters, protein structure alignment was performed using the cluster 1 structure as the reference. The RMSD value was 2.315 Å for cluster 2, 1.970 Å for cluster 3, 2.313 Å for cluster 4, 1.903 Å for cluster 5, and 2.117 Å for cluster 6. The RMSD values of the amino acids in the NiRAN domain and nearby β-hairpin were also evaluated. Greater RMSD values were observed for the residues in the conformations corresponding to clusters 2, 4 and 6. The larger RMSD values were mainly associated with residues K50 to Y69, and K103 to P112. The conformations corresponding to clusters 3 and 5 were found to be more similar to cluster 1. As mentioned above, clusters 1, 3 and 5 were prominent towards the end of the 10 μs MD simulation trajectory and may represent conformations of the stable ordered N-terminal region of the RdRp. Moreover, differences were observed in the RMSD values for several residues that formed intermolecular bonds with the ligands and this was more noticeable in the conformations corresponding to clusters 2, 4 and 6 (Table S7). The conformational changes that occur in this region over the course of the trajectory and the flexibility of some of the residues may consequently be contributing to the differences seen in the binding affinities of the compounds and the intermolecular bonds that are formed.

[Table 2: Hydrogen bond interactions that were formed between the ligands and each conformation representative of the clusters identified from the 10 μs MD simulation trajectory. Columns: Cluster 1 to Cluster 6; first row: indinavir.]

Oleuropein was found to consistently bind strongly to the conformations corresponding to each cluster and was selected as a potential lead compound. The GlideScore for the cluster 1 structure was −10.0 kcal/mol and oleuropein predominantly formed hydrogen bonds with the protein residues including N209, D208, T206, V204, D221, and N52.
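The per-residue RMSD comparison described above can be illustrated with a short sketch. It assumes that the two conformations have already been superimposed (for example by a structure alignment tool) and that each residue's atoms are listed in the same order in both structures; the function name `per_residue_rmsd`, the dictionary layout, and the coordinates in the usage note are hypothetical and do not come from the study.

```python
import math
from typing import Dict, List, Tuple

Coord = Tuple[float, float, float]


def per_residue_rmsd(ref: Dict[str, List[Coord]],
                     other: Dict[str, List[Coord]]) -> Dict[str, float]:
    """Per-residue RMSD between two pre-aligned structures.

    Keys are residue labels (e.g. "K50"); values are atom coordinates.
    The result is in the same length unit as the input (e.g. Å).
    """
    result: Dict[str, float] = {}
    for res, ref_atoms in ref.items():
        atoms = other.get(res)
        # Skip residues that cannot be matched atom-for-atom.
        if not ref_atoms or atoms is None or len(atoms) != len(ref_atoms):
            continue
        # Sum of squared atomic displacements for this residue.
        sq = sum(math.dist(a, b) ** 2 for a, b in zip(ref_atoms, atoms))
        result[res] = math.sqrt(sq / len(ref_atoms))
    return result
```

As a usage example with invented coordinates, a residue whose two atoms are each displaced by 2 Å along z has a per-residue RMSD of 2.0 Å, while an unchanged residue has 0.0 Å. Sorting the returned dictionary by value is one simple way to highlight the stretches of residues that differ most between two cluster representatives.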
Because residues are missing in the NiRAN domain of the 6M71 structure that was originally obtained from the RCSB PDB, oleuropein and the control compounds were also docked to the RdRp chain of the cryo-EM replication-transcription complex that was determined by Chen et al. (PDB ID: 6XEZ) [13]. When comparing the NiRAN domain of the 6XEZ cryo-EM structure to the conformation that was representative of cluster 1, the RMSD was found to be 1.945 Å and the RMSD values of the amino acids can be found in the Supplementary Information (Table S8). Oleuropein had a GlideScore of −8.1 kcal/mol and hydrogen bonds were present with residues R116, N52, N209, Y217, D218, and K73. Most notably, the hydroxyl groups of oleuropein were predominantly involved in hydrogen bonding. The GlideScores of GTP, ADP, and UTP were −8.1, −7.1, and −6.9 kcal/mol, respectively. The protein-ligand interactions of oleuropein and the control compounds can be seen in Fig. 5. The molecular docking results revealed that ADP formed hydrogen bonds with N209 and K50, salt bridges with K73, K50 and R116, as well as a π-π cation with R116. The ADP that was present in the cryo-EM structure formed a π-π interaction with H75, and salt bridges with K73, R116, and K50 (Fig. 1).

Blind docking revealed that oleuropein had eight poses within the NiRAN domain, while ADP had seven poses in this region (Fig. 5, Table S9). Guanosine-5′-triphosphate and UTP had 11 poses and 10 poses positioned in the NiRAN domain, respectively. The 6M71 and 6XEZ cryo-EM structures that were obtained from the RCSB PDB, and the conformation of 6M71 that was representative of cluster 1 from the 10 μs MD simulation trajectory, were also examined using the P2Rank server. In addition to the NiRAN domain being identified as a potential ligand binding site, the results revealed that there were several other pockets that may be potential allosteric sites and this included the nsp12-nsp8 interface region (Table S10).
Oleuropein is the most prominent phenolic compound in Olea europaea and belongs to the secoiridoid subclass [47]. Studies have shown that oleuropein exhibits antiviral activity in vitro against respiratory syncytial virus and para-influenza type 3 virus [48]. The pharmacokinetic profile of oleuropein in humans needs to be investigated further and its use as a potential prophylactic and therapeutic agent has been discussed in the literature [47]. Oleuropein has been screened against SARS-CoV-2 protein targets, namely the spike protein and cysteine proteases, using in silico and in vitro methods [49, 50]. In general, polyphenols are being investigated for their antiviral activity against SARS-CoV-2 using a combination of molecular modelling and classical experimental methods. A number of studies have previously examined the role of the hydroxyl groups in the antioxidant activity of polyphenols, and structure-activity relationship studies should also be performed to explore the function of hydroxyl groups in the antiviral activity of these compounds.

Overall, molecular docking was used to screen 300 ligands against the NiRAN domain of the SARS-CoV-2 RdRp. A selection of 30 compounds was then further investigated by high stringency blind docking before a final selection of nine potential lead compounds. These were docked to different conformations of the NiRAN domain identified through cluster analysis of a 10 μs MD simulation trajectory. By careful consideration of all analyses, oleuropein was identified as a lead compound. Given that this compound is relatively well known and well studied, its potential antiviral effects can be readily investigated in vitro and in vivo.

Author contributions statement
TCK and AH conceptualized the aims and methodology and were involved in supervision. EP performed data analysis, data curation, and was involved in production of the first draft of the manuscript.
JL was involved in data analysis and curation and was involved in production of the first draft of the manuscript. HYMH performed formal data analysis and validation. All authors contributed to editing and reviewing the manuscript.

The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: Epigenomic Medicine Program (TCK) is supported financially by McCord Research (Iowa, USA), which has a financial interest in dietary compounds described in this work. However, there is no conflict of interest with respect to the inhibition of the SARS-CoV-2 RNA-dependent RNA polymerase. The remaining co-authors also have no conflicts of interest.

From SARS and MERS to COVID-19: a brief summary and comparison of severe acute respiratory infections caused by three highly pathogenic human coronaviruses
COVID-19 pathophysiology: A review
Genomic characterization of a novel SARS-CoV-2
Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 receptor
Cryo-EM structure of an extended SARS-CoV-2 replication and transcription complex reveals an intermediate state in cap synthesis
Structure of the RNA-dependent RNA polymerase from COVID-19 virus
Structural basis for inhibition of the RNA-dependent RNA polymerase from SARS-CoV-2 by remdesivir
Discovery of an essential nucleotidylating activity associated with a newly delineated conserved domain in the RNA polymerase-containing protein of all nidoviruses
Coronavirus replication-transcription complex: Vital and selective NMPylation of a conserved site in nsp9 by the NiRAN-RdRp subunit
Therapeutic strategies against COVID-19 and structural characterization of SARS-CoV-2: A review
Drug repurposing using computational methods to identify therapeutic options for COVID-19
Molecular dynamics simulations related to SARS-CoV-2
Structural basis for helicase-polymerase coupling in the SARS-CoV-2 replication-transcription complex
The protein data bank
PubChem 2019 update: improved access to chemical data
ChEMBL: towards direct deposition of bioassay data
OliveNet™: a comprehensive library of compounds from Olea europaea
Protein and ligand preparation: parameters, protocols, and influence on virtual screening enrichments
OPLS3: A force field providing broad coverage of drug-like small molecules and proteins
Characterization of the NiRAN domain from RNA-dependent RNA polymerase provides insights into a potential therapeutic target against SARS-CoV-2. bioRxiv
Importance of accurate charges in molecular docking: quantum mechanical/molecular mechanical (QM/MM) approach
Extra precision glide: docking and scoring incorporating a model of hydrophobic enclosure for protein−ligand complexes
Jaguar: A high-performance quantum chemistry software program with strengths in life and materials sciences
A mixed quantum mechanics/molecular mechanics (QM/MM) method for large-scale modeling of chemistry in protein environments
P2Rank: machine learning based tool for rapid and accurate prediction of ligand binding sites from protein structure
Small-molecule library screening by docking with PyRx
AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading
Spartan performance and flexibility; An hpc-cloud chimera
GROMACS: A message-passing parallel molecular dynamics implementation
GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers
VMD: visual molecular dynamics
Peptide folding: When simulation meets experiment
Repurposing of kinase inhibitors for treatment of COVID-19
Targeting the coronavirus SARS-CoV-2: computational insights into the mechanism of action of the protease inhibitors lopinavir, ritonavir and nelfinavir
Remdesivir, lopinavir, emetine, and homoharringtonine inhibit SARS-CoV-2 replication in vitro
Lopinavir-ritonavir in patients admitted to hospital with COVID-19 (RECOVERY): a randomised, controlled, open-label, platform trial
COVID-19 pneumonia and the appropriate use of antibiotics
The upshot of Polyphenolic compounds on immunity amid COVID-19 pandemic and other emerging communicable diseases: An appraisal
Pharmacokinetics, safety, and antiviral effects of hypericin, a derivative of St. John's wort plant, in patients with chronic hepatitis C virus infection
An analysis of FDA-approved drugs: natural products and their derivatives
Alkaloids and flavonoids from African phytochemicals as potential inhibitors of SARS-Cov-2 RNA-dependent RNA polymerase: an in silico perspective
In silico ADMET and molecular docking study on searching potential inhibitors from limonoids and triterpenoids for COVID-19
Plant-derived natural polyphenols as potential antiviral drugs against SARS-CoV-2 via RNA-dependent RNA polymerase (RdRp) inhibition: an in-silico analysis
Ensemble docking in drug discovery: how many protein configurations from molecular dynamics simulations are needed to reproduce known ligand binding?
Structure of the SARS-CoV nsp12 polymerase bound to nsp7 and nsp8 co-factors
Oleuropein, a bioactive compound from Olea europaea L., as a potential preventive and therapeutic agent in non-communicable diseases
In vitro evaluation of secoiridoid glucosides from the fruits of Ligustrum lucidum as antiviral agents
Targeting the SARS-CoV-2 spike glycoprotein prefusion conformation: virtual screening and molecular dynamics simulations applied to the identification of potential fusion inhibitors
Identification of small molecule inhibitors of the deubiquitinating activity of the SARS-CoV-2 papain-like protease: in silico molecular docking studies and in vitro enzymatic activity assay. 2020

We would like to acknowledge intellectual and financial support by McCord Research (Iowa, USA). JL is supported by an Australian Government Research Training Program Scholarship.
We are indebted to Alfonso Perez Escudero and the team at Crowdfight COVID-19 for enabling access to supercomputing facilities, and to Matthew Gasperetti and the team at Hypernet Labs; Galileo, for enabling cloud computing for this project. We thank the National Computing Infrastructure (NCI), and the Pawsey Supercomputing Centre in Australia (funded by the Australian Government). Further, we thank the Spartan High Performance Computing service (University of Melbourne), and the Partnership for Advanced Computing in Europe (PRACE) for awarding the access to Piz Daint, hosted at the Swiss National Supercomputing Centre (CSCS), Switzerland.

Supplementary data to this article can be found online at https://doi.org/10.1016/j.cplett.2021.138889.