key: cord-1028250-wb1zgv8m authors: Sang, Peng; Tian, Shu-Hui; Meng, Zhao-Hui; Yang, Li-Quan title: Anti-HIV drug repurposing against SARS-CoV-2 date: 2020-04-21 journal: RSC advances DOI: 10.1039/d0ra01899f sha: 808eff3e8746e9e75f7d567935f45308b221783f doc_id: 1028250 cord_uid: wb1zgv8m A novel severe acute respiratory syndrome human coronavirus (SARS HCoV) was identified from respiratory illness patients (named SARS-CoV-2 by ICTV) in December 2019 and has recently emerged as a serious threat to world public health. However, no approved drugs have been found to effectively inhibit the virus. Since it has been reported that HIV protease inhibitors can be used as anti-SARS drugs by targeting SARS-CoV-1 3CLpro, we chose six approved anti-HIV drugs and investigated their binding interactions with 3CLpro to evaluate their potential to become clinical drugs for the new coronavirus pneumonia (COVID-19) caused by SARS-CoV-2 infection. The molecular docking results indicate that the 3CLpro of SARS-CoV-2 has a higher binding affinity for all the studied inhibitors than does SARS-CoV-1. Two docking complexes (indinavir and darunavir) with high docking scores were further subjected to MM-PBSA binding free energy calculations to detail the molecular interactions between these two protease inhibitors and SARS HCoV 3CLpro. Our results show that, among the inhibitors tested, darunavir has the highest binding affinity with SARS-CoV-2 and SARS-CoV-1 3CLpro, indicating that it may have the potential to be used as an anti-COVID-19 clinical drug. The mechanism behind the increased binding affinity of HIV protease inhibitors toward SARS-CoV-2 3CLpro (as compared to SARS-CoV-1) was investigated by MD simulations. Our study provides insight into the possible role of structural flexibility during interactions between SARS HCoV 3CLpro and inhibitors and sheds light on structure-based design of anti-COVID-19 drugs targeting SARS-CoV-2 3CLpro. Beginning in December 2019, an outbreak of a new coronavirus pneumonia (named COVID-19 by the WHO) caused by a novel SARS HCoV (named SARS-CoV-2 by ICTV) has emerged as a serious threat to global health, with more than 2 000 000 worldwide cases resulting in more than 100 000 deaths as of 15 April 2020. SARS-CoV-2 has been identied as the seventh member of the coronavirus family. 1 Through whole genome sequence alignment analysis, SARS-CoV-2 was found to have higher sequence homology toward SARS-CoV-1, which caused the SARS outbreak in 2003. 2 The coronavirus genome encodes four structural proteins: spike glycoprotein (S), small envelope protein (E), matrix glycoprotein (M) and nucleocapsid protein (N). 3 In addition to these four structural genes, 3CLproa main protease required for coronavirus maturationis vital for the viral life cycle, making it an attractive target of anti-coronavirus drug development. [4] [5] [6] By sequence alignment, it has been found that SARS-CoV-2 3CLpro (3CLpro-2) and SARS-CoV-1 3CLpro (3CLpro-1) share remarkable 96% sequence identity (Fig. 1A) . The crystal structure of 3CLpro-2 (PDB ID: 6LU7) 7 is highly similar to that of its SARS sister 3CLpro-1 (PDB ID: 1UJ1); 8 the backbone root mean square deviation (RMSD) value between these two proteins is only 1.4Å (Fig. 1C ). Both 3CLpro-2 and 3CLpro-1 contain nine a-helices and 13 b-strands that make up three distinct domains: Domain I, Domain II and Domain III (Fig. 1B) . 9, 10 Similar to other CoV proteases, Domains I (residues 8-101) and II (residues 102-184) contain one antiparallel bbarrel, resembling the structure of trypsin-like serine proteases. Domain III (residues 201-306) consists of ve a-helices (a5-a9) connected by a long loop (residues 185-200) to Domain II. In contrast to the common Ser-His-Asp catalytic triad of serine proteases, 3CLpro-2 and 3CLpro-1 has a catalytic dyad composed of conserved residues H41 and C145. The main substrate-binding site of 3CLpro is formed by a cle between Domains I and II (Fig. 1B) . Although the 3CLpro-2 3D structure provides deep insight into the viral life cycle and facilitates the screening of anti-COVID-19 drugs, no approved drugs have been found to effectively inhibit the virus so far. Given both the emergency of this outbreak and previous reports that HIV protease inhibitors can be used as anti-SARS drugs by targeting 3CLpro-1, [11] [12] [13] we chose six public anti-HIV drugs and evaluated their potential to become clinical drugs for COVID-19 by means of molecular docking. Two of the six drug-3CLpro complexes (indinavir and darunavir) showed high docking scores and were further subjected to molecular dynamics (MD) simulation and molecular mechanics Poisson-Boltzmann surface area (MM-PBSA) binding free energy calculations. The molecular interactions between these two HIV protease inhibitors and the SARS HCoV 3CLpro were analyzed in detail, and the mechanism for the difference in binding ability between 3CLpro-2 and 3CLpro-1 and these inhibitors was also investigated. Crystal structures of 3CLpro-2 (PDB code: 6LU7) 7 and 3CLpro-1 (PDB code: 1UJ1) 8 were obtained from the Protein Data Bank (http://www.pdb.org), and any heteroatoms and water molecules were removed for molecular docking studies. Six HIV protease-inhibitor complex structures were downloaded from the Protein Data Bank (PDB codes: 1MUI [lopinavir], 2B60 Sequence alignment and 3D structure of 3CLpro-2 and 3CLpro-1. (A) Sequence alignment and secondary structures of 3CLpro-2 and 3CLpro-1. Secondary structures are illustrated above the corresponding amino acid sequence (red helix: a-helix, yellow arrow: b-sheet), and residue numbers are indicated above the primary sequence. The H41 and C145 residues that make up the catalytic dyad are highlighted in blue. (A) was generated using Aline. 14 (B) Ribbon representation of 3CLpro-2 (PDB code: 6LU7). 7 Structural elements are indicated by color; a-helices are red, b-sheets are yellow, and loops are green. The peptide-like inhibitor N3 is represented as stick model. The catalytic dyad residues H41 and C145 are shown as magenta stick models. (C) Superimposed 3D structures of 3CLpro-2 (PDB code: 6LU7, blue) 7 and 3CLpro-1 (PDB code: 1UJ1, green). 8 Fig. 2 ) was used for docking to 3CLpro-2 and 3CLpro-1. Molecular docking is a widely used approach in structure-based drug design. 16, 17 To evaluate the binding affinity of these HIV protease inhibitors with SARS HCoV 3CLpro, comparative molecular docking analysis was carried out using Autodock 4.2. 18 In docking simulations, the grid box was dened according the peptide-like inhibitor binding pocket of 3CLpro-2 (Fig. 3) . The size of the affinity map was set at 40Å Â 50Å Â 40 A, and spacing between the grid points was set to 0.375Å. Docking was performed with Lamarckian genetic algorithm and default parameters. The best docked conformations (3CLproinhibitor complexes) with the lowest docking energies were selected for further MD simulations and MM-PBSA binding free energy calculations. In order to explore the molecular interactions between HIV protease inhibitors and the SARS HCoV 3CLpro, as well as to investigate the mechanism behind the difference in binding ability between 3CLpro-2 and 3CLpro-1 and these inhibitors, two kinds of MD simulations were performed: SARS HCoV 3CLpro-inhibitor complex (3CLpro-2-inhibitor and 3CLpro-1inhibitor) and SARS HCoV 3CLpro free enzyme (free 3CLpro-2 and free 3CLpro-1). All simulations were carried out using the GROMACS-5.1.4 soware package 19 with the CHARMM36 allatom force eld (March, 2019). 20 The force elds of all HIV protease inhibitors were generated by the CGenFF server. 21, 22 Each simulation system was dissolved using the TIP3P water model 23 and centered in a dodecahedron box with a 1.0 nm minimum distance between the protein and the edge of the box. The steepest descent algorithm was used to minimize simulation energy. The systems were equilibrated by two continuous 500 ps position restraint simulations of 1000 kJ mol À1 nm À2 in the NVT and NPT ensembles. All equilibrated systems were then subjected to production MD runs. Other simulation parameters and conditions were the same as ref. 24 . The equilibrium of MD simulations were examined by computing the backbone root mean square standard deviation (RMSD) with respect to their starting structures as a function of simulation time. The results ( Fig. S1 and S2 †) show that the free 3CLpro-2 and 3CLpro-1 systems require only a few ps to reach stable RMSD values, and the 3CLpro-2-inhibitor and 3CLpro-1inhibitor systems require about 4000 ps and about 500 ps to reach an approximate equilibrium, respectively. The MM-PBSA method was used to compute the binding free energy of SARS HCoV 3CLpro-inhibitor complexes during simulation. A detailed description of the MM-PBSA method is presented in ref. 25 . In this study, the binding free energies of SARS HCoV 3CLpro to HIV protease inhibitors and N3 were calculated using the GROMACS tool g_mmpbsa. 26 When using MM-PBSA, the binding free energy of the protein and ligand was dened as For each subunit, the free energy, G, can be presented as where E MM represents the average molecular mechanical potential energy in vacuum, which includes electrostatic (E elec ) and van der Waals (E vdw ) interactions components and interprets them as G sol represents the solvation free energy, which includes both electrostatic (G polar ) and non-electrostatic (G nonpolar ) components and interprets them as Since the contribution of the entropic term (TS) is negligible when the computing models are very similar 11 and calculating the contribution of entropy to the binding free energy is challenging and time-consuming, the contribution of the entropic term is excluded in current version of g_mmpbsa. We chose six approved anti-HIV inhibitors to evaluate their potential to become clinical drugs for COVID-19. As shown in Table 1 , 3CLpro-2 has a lower binding energy for all the studied inhibitors than its sister 3CLpro-1, although both of their binding energies are higher than all of the HIV proteaseinhibitor complexes that served as positive controls. This indicates that the binding affinities of 3CLpro-2 towards inhibitors are higher than that of 3CLpro-1. Of the six HIV protease inhibitors, indinavir and darunavir were proven to have a higher binding affinity to 3CLpro-2, and their binding energy values are close to those of HIV protease inhibitors. When compared, the binding energy of the 3CLpro-2-darunavir complex (À10.24 kJ mol À1 ) is lower than that of its 3CLpro-2indinavir counterpart (À10.02 kJ mol À1 ), indicating that the binding affinity of darunavir towards 3CLpro-2 might be higher than that of indinavir. Since 3CLpro is essential for coronavirus replication, 9 the inhibitory effect of these two compounds on 3CLpro-2 indicates that they might have potential application as anti-COVID-19 clinical drugs. The binding modes of indinavir and darunavir in their docking complexes are shown in Fig. 3 . For 3CLpro-2 ( Fig. 4A and B) , the binding pockets are in a more closed state; indinavir and darunavir bind deeper into the pocket, both with 19 contact residues. In contrast, the 3CLpro-1 binding pocket is in a more open state (Fig. 4C and D) , and there are only 11 and 17 contact residues in 3CLpro-1-indinavir and 3CLpro-1-darunavir complexes, respectively. These additional contact residues likely strengthen the binding affinity between 3CLpro-2 and the tested inhibitors, potentially explaining why the binding energy values between them are lower than those of 3CLpro-1-inhibitor complexes. Of note, darunavir forms ve and three hydrogen bonds with 3CLpro-2 and 3CLpro-1, respectively. In contrast, no hydrogen bonds are formed between indinavir and either of the two 3CLpro species. Since hydrogen bonding plays an important role in the stability of the enzyme-inhibitor complex, 27 darunavir might be more suitable for treatment of COVID-19. In order to explore the binding mechanisms of SARS HCoV 3CLpro to indinavir and darunavir, four docked complex structures (i.e., 3CLpro-2-indinavir, 3CLpro-2-darunavir, 3CLpro-1-indinavir, 3CLpro-1-darunavir) and the crystal structure of 3CLpro-2 in complex with a peptide-like inhibitor N3 which was served as a control, were subjected to 20 ns molecular dynamics simulations. The simulation trajectories were used to calculate the binding free energy by the MM-PBSA method. As presented in Table 2 , the binding free energy of 3CLpro-2 is about 50% of that of 3CLpro-1 for both indinavir and darunavir, suggesting that these two inhibitors bind more tightly to the former. Close examination of Table 2 suggests that the primary forces driving the binding events for both 3CLpro-2 and 3CLpro-1 are vacuum potential energy (DE MM ) and nonpolar energies (DG nonpolar ). In contrast, polar energies (DG polar ) negatively contribute to the binding events. As listed in Table 2 , for 3CLpro-2, the binding free energy to N3 is higher than either of indinavir and darunavir. In accordance with the molecular docking results, the nal binding free energy value for darunavir and 3CLpro-2 is lower than that of indinavir (À72.11 kJ mol À1 for indinavir and À95.53 kJ mol À1 for darunavir), indicating the higher binding affinity of the former towards 3CLpro as compared to the latter. Detailed decomposition of the energy components reveals that the decreased binding free energy of darunavir is mainly due to the reduced negative contribution of DG polar (173.57 kJ mol À1 for indinavir and 121.27 kJ mol À1 for darunavir), although the positive contribution of DE MM is even lower than for indinavir (À226.53 kJ mol À1 for indinavir and À196.94 kJ mol À1 for darunavir). Taken together, these results suggest that darunavir could be used as template for structure-based design of SARS-CoV-2 3CLpro inhibitors, and may also have the potential to become an anti-COVID-19 clinical drug. The contribution of residues to the binding energy of 3CLpro-2-darunavir and 3CLpro-2-indinavir complex structures were calculated by the g_mmpbsa tool. As shown in Fig. 5A and B, for 3CLpro-2-darunavir, residues Met49, Met165, Pro168 and Gln189 make signicant positive contributions to binding. For 3CLpro-2-indinavir ( Fig. 5C and D) , besides residues Met49 and Met165, residues making signicant positive contributions also include Leu41 and Cys145. In contrast to 3CLpro-2-darunavir, there are two residues (Thr26 and Glu166) making signicant negative contributions to binding in 3CLpro-2indinavir, which might interpret the lower binding affinity of indinavir to 3CLpro-2 than darunavir. Taken together, darunavir which could be used as template for structure-based design of 3CLpro-2 inhibitors, might has the potential to become an anti-COVID-19 clinical drug. In order to investigate the mechanism behind the increased binding affinity of HIV protease inhibitors toward 3CLpro-2 compared to that of 3CLpro-1, 100 ns MD simulations were performed on these two free enzymes without any inhibitor. Based on the obtained MD trajectories, the root mean square uctuation (RMSF) for each residue was calculated in order to compare the structural exibility of 3CLpro-2 and 3CLpro-1. The resulting RMSFs are displayed in Fig. 6 as a function of residue number, and the 3D backbone representations of 3CLpro are colored according to their RMSF values. As shown in Fig. 6 , 3CLpro-2 has an higher overall exibility (or lower rigidity) than 3CLpro-1 in Domains I and II. Close examination of Fig. 6 reveals that 3CLpro-1 displays lower exibility in some regions of surface-exposed loops, especially those within the substrate binding pocket (Fig. 6B and C) . Interestingly, when examining the Domain III region, 3CLpro-2 is less exible (i.e., more rigid) than 3CLpro-1. It is well established that exibility plays a signicant role in protein function. 29 For example, higher exibility could enlarge the substrate binding pocket, thus increasing the kinetics of substrate entrance and product egress. 30 In addition, high exibility could also increase substrate binding affinity. 31 Therefore, the lower exibility in Domains I and II, especially within the substrate binding pocket, might explain why the binding affinity of HIV protease inhibitors toward 3CLpro-2 is higher than that of 3CLpro-1. A novel severe acute respiratory syndrome coronavirus (SARS-CoV-2) was identied from respiratory illness patients in December 2019, and has recently emerged as a serious threat to world public health. However, no approved drugs have been found to effectively inhibit the virus. Given the urgency of the current epidemic situation, it would be highly effective to repurpose old drugs for clinical treatment. It has been reported that HIV inhibitors can be used as anti-SARS clinical treatment drugs, as they target SARS-CoV-1 3CLpro. In this study, we chose six approved anti-HIV inhibitor drugs to evaluate and compare their binding affinities with SARS-CoV-2 and SARS-CoV-1 3CLpro by molecular docking and MM-PBSA binding free energy calculations. Our results show that, among all inhibitors, darunavir has the best binding affinity with SARS-CoV-2 and SARS-CoV-1 3CLpro, indicating that it might have the potential to become an anti-COVID-19 clinical drug. The mechanism behind the increased binding affinity of HIV protease inhibitors toward SARS-CoV-2 3CLpro over SARS-CoV-1 3CLpro was investigated by MD simulations. The calculated RMSF values for each residue during simulation indicate lower exibility in the Domain I and Domain II regions of SARS-CoV-1, especially within the substrate binding pocket; this might explain why the binding affinities of HIV protease inhibitors toward SARS-CoV-2 3CLpro are higher than those of SARS-CoV-1. Our study provides insight into the possible role of structural exibility in regulating interactions between SARS HCoV 3CLpro and inhibitors and sheds light on structure-based design of anti-COVID-19 drugs targeting SARS-CoV-2 3CLpro. The authors declare no conicts of interest for this work. The PyMOL Molecular Graphics System This study was funded by the National Natural Sciences Foundation of China (31860243, 31960198 and 31660015, 81860074) and Yunnan Applied Basic Research Projects (2017FH001-032 and 2017FB024). We greatly acknowledge the help of Roger DuBois in revising the manuscript.