key: cord-0820451-59yon7ji
authors: Khorsandi, Zahra; Afshinpour, Maral; Molaei, Fatemeh; Askandar, Rafee Habib; Keshavarzipour, Fariba; Abbasi, Maryam; Sadeghi-Aliabadi, Hojjat
title: Design and synthesis of novel phe-phe hydroxyethylene derivatives as potential coronavirus main protease inhibitors
date: 2021-03-30
journal: Journal of biomolecular structure & dynamics
DOI: 10.1080/07391102.2021.1905549
sha: cf36fc2ae6a749b9f2d0aa59e166d82830b903a0
doc_id: 820451
cord_uid: 59yon7ji

In response to the current pandemic caused by the novel SARS-CoV-2, we designed new compounds based on the structure of Lopinavir, an FDA-approved antiviral agent that is currently under further evaluation in clinical trials for COVID-19 patients. This is the first example of the preparation of Lopinavir isosteres in which the main core of Lopinavir is connected to various heterocyclic fragments. The main protease is proposed to play an important role in the life cycle of the coronavirus. Thus, the protease inhibition effect of the synthesized compounds was studied by molecular docking. All 10 molecules showed good docking scores. Molecular dynamics (MD) simulations also confirmed the stability of the best-designed compound in the Mpro active site.

Communicated by Ramaswamy H. Sarma

In recent months, the pandemic of the novel coronavirus has spread around the world. At the time of writing this manuscript (4 January 2021), the number of confirmed cases exceeded 83,910,386 and more than 1,839,660 deaths had been confirmed (WHO, 2020). SARS-CoV-2, the cause of COVID-19, is a member of the human betacoronaviruses, which also include SARS-CoV and MERS-CoV (Elfiky et al., 2017).
The mortality rates of SARS and MERS HCoV are higher than that of COVID-19 (10% and 36%, respectively, compared with 2-3%), but the new virus has spread remarkably fast within a few months (Hemida & Alnaeem, 2019; WHO, 2016).

Screening existing antiviral drugs is known as a fast and useful strategy against SARS-CoV-2. Given the COVID-19 pandemic and the time-consuming nature of the drug discovery process, finding a compound against the virus via drug repurposing seems a logical and essential strategy; however, drug discovery has to start somewhere. To find new drugs for the treatment of COVID-19, focusing on the chemical structures of drugs available against other viruses, including the similar SARS-CoV, Middle East respiratory syndrome coronavirus (MERS-CoV), human immunodeficiency virus (HIV), and hepatitis C virus (HCV), has been suggested (Cunningham et al., 2020; Ko et al., 2020; Zhou et al., 2020). For instance, the commonly used HIV treatment is based on protease inhibitors such as Lopinavir/Ritonavir, which were applied as preliminary candidates for the treatment of COVID-19 patients by targeting the main protease (Mpro, also called chymotrypsin-like protease, 3CLpro) (Morales et al., 2020). This protease is a potential target for the inhibition of CoV replication. In recent months, numerous studies have sought effective inhibitors of SARS-CoV-2 through in silico docking models (Ghosh et al., 2020; Kadil et al., 2020). Also, several antiviral medications such as Lopinavir, Zanamivir, Indinavir, Saquinavir, and Remdesivir display potential as main protease inhibitors and as treatments for COVID-19 (Hall & Ji, 2020). More importantly, some compounds such as Lopinavir proved beneficial in clinical trials for the treatment of SARS-CoV-2 infections. Most known main protease inhibitors have a similar shape, which matches the capacity of the receptor active site.
For example, the core unit of Lopinavir and Ritonavir is the L-phenylalanine-derived phe-phe hydroxyethylene isostere, whose amino group is functionalized in this study with different organic units (Scheme 1). Considering the growing application of these compounds as protease inhibitors, we designed novel protease inhibitors based on the Lopinavir structure and introduce a short and efficient method for their synthesis. Although several methods have been developed for the synthesis of these dipeptide derivatives, we present here a practical strategy that avoids the disadvantages of commonly available approaches, such as multiple complex steps, harsh reaction conditions, and low product yields (Bhaskar et al., 2008; Damo et al., 2006). Subsequently, the main protease-inhibiting potential of the synthesized compounds was investigated using computational studies, including molecular docking and molecular dynamics simulation.

Fragment-based drug design is a general method for designing new compounds and has been introduced as an impressive substitute for high-throughput screening of compounds in drug discovery (Kumar et al., 2012; Murray & Blundell, 2010). In this approach, fragments, that is, small organic moieties such as active heterocyclic rings, are fused to the main pharmacophore. Some new anticancer, anti-Alzheimer, and anti-malarial agents have been developed via this process (Tănase et al., 2014). Herein, to design new structures, some heterocyclic fragments were connected to the amino group of the phe-phe hydroxyethylene core. From basic knowledge of protease inhibitors, hydrogen-bonding potential and a high degree of hydrophobicity make an inhibitor more efficient in the blocking process (Sgrignani & Magistrato, 2012; Speck-Planche et al., 2012; Suvannang et al., 2011). Therefore, we applied non-toxic heterocyclic fragments that are known as some of the most important fragments in medicinal chemistry; their structures are given in Scheme 2.
The reported fragments are found in a range of compounds. For example, guanine is one of the four main nucleobases found in the nucleic acids DNA and RNA and has been characterized as a potent immunosuppressive and chemotherapeutic agent (Scheme 2b) (Chern et al., 1993). 8H-oxazolo[4,5-g]indole has been reported to show potent cytotoxic activity towards cancerous cell lines in diffuse malignant peritoneal mesothelioma (Scheme 2c,e). The chromeno[3,4-b]pyrrol-4(3H)-one framework is often present in marine alkaloids such as ningalins and lamellarins (Scheme 2d). These alkaloids also exhibit potent biological activities, including immunomodulatory, anti-HIV-1, multidrug-resistance (MDR) reversal, and phosphodiesterase-5 (PDE-5) inhibitory activities, as well as antianalgesic behavior (Fan et al., 2008; Imbri et al., 2014). Fused heterocyclic systems containing pyrazole (Scheme 2f) are pharmacologically important compounds, used as inhibitors of HIV-1, pesticides, fungicides, and antihypertensive and anticancer agents (Hassan et al., 1997; Min et al., 2006). The 1,2,4-triazoles (Scheme 2g) have also attracted widespread attention because of their diverse applications as antibacterial, antidepressant, antiviral, antitumor, and anti-inflammatory agents, pesticides, herbicides, dyes, lubricants, and analytical reagents (Dumas, 1999; Weng et al., 2012). Purine derivatives (Scheme 2h) were introduced as antiviral agents and also affect neuronal and muscle nicotinic acetylcholine receptors (Hřebabecký et al., 2012). Pyranopyrazoles are another important class of heterocyclic compounds that have been used as pharmaceutical constituents. Pyrano[2,3-c]pyrazoles (Scheme 2i,j) have shown analgesic, anticancer, antitumor, and anti-inflammatory activities (Khoobi et al., 2015).
In conclusion, proposing these newly designed structures as potential COVID-19 main protease inhibitors is logical because they combine known bioactive molecules with the phe-phe hydroxyethylene scaffold, which is the core of Lopinavir and Ritonavir, drugs recommended for the treatment of COVID-19 patients (Scheme 3). To confirm this view, the main interactions and binding energies of the designed compounds were studied by molecular docking and molecular dynamics (MD) simulation.

Scheme 1. Chemical structures of Lopinavir and Ritonavir.

Scheme 2. Some heterocyclic fragments used in different pharmaceutical products.

Herein, we report a new, green, and economical synthesis of the proposed compounds. The synthesis process is shown in Scheme 4. Our general synthetic strategy is similar to that employed for Ritonavir in the literature (Stoner et al., 2000). This six-step procedure gave the products in acceptable yields. Because it uses commonly available reagents and mild reaction conditions, the process is suitable for large-scale production and has been used to prepare these types of compounds in high yields (more than 85%). As illustrated in Scheme 4, the synthesis started from commercially available L-phenylalanine, which was protected with Boc and converted to the corresponding Boc-amino alcohol. The alcohol was oxidized to the corresponding aldehyde, and a one-carbon extension to the olefin was carried out using a standard Wittig reaction. These reactions were performed consecutively without purification. The cross-metathesis reaction of the olefin was done using the Hoveyda-Grubbs second-generation olefin metathesis catalyst. Although some reports on the synthesis of this olefin are available, they suffer from serious limitations such as more steps, toxic and expensive reagents, and low product yields. Therefore, a more practical approach is valuable for large-scale synthesis.
The next step is a hydroboration-oxidation reaction in the presence of Zn, which gave the final alcohol with excellent selectivity; this can be explained by a chelation control model. The synthesis of this pharmaceutically important core from phenylalanine has been reported previously, but it proceeds through a seven-step sequence using a large amount of metal catalyst. After deprotection, the amines were reacted with methyl 2-chloroacetate and the various amino heterocycles introduced above. The isolated yields of the final products (shown in Scheme 4) are given in Table 1. Detailed experimental procedures are given in the supplemental data. Herein, we report a short and efficient synthesis of novel molecular scaffolds containing the Ritonavir and Lopinavir core and heterocycles as synthons, in the hope of obtaining lead compounds as antiviral agents.

Scheme 3. Chemical structure of the newly designed and synthesized compounds as COVID-19 main protease inhibitors.

The present study focused on the main protease of COVID-19 (PDB ID 6LU7) as a potential target for COVID-19 inhibition (Jin et al., 2020). To find the interactions and binding energy between the COVID-19 Mpro protein and the proposed compounds, molecular docking was done with the AutoDock 4.2 program (Morris et al., 2009). Among the experimental X-ray structures of the COVID-19 Mpro protein, the crystallographic structure with PDB entry code 6LU7 was selected. First, a validation docking was performed between the COVID-19 Mpro protein and its co-crystallized ligand (N3). Analysis of all docked poses showed that the N3 ligand was located in the binding pocket. The main residues in this pocket were His41, Phe140, Leu141, Asn142, Gly143, Cys145, His163, His164, Met165, Glu166, Leu167, Pro168, Asp187, Arg188, Gln189 and Thr190. The initial coordinates of the ligand were used as the reference, and the root-mean-square deviation (RMSD) between the docked ligand and the reference was less than 2 Å. Two- and three-dimensional analyses for the N3 ligand are shown in Figure 1.
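The redocking check described above compares the docked pose with the crystallographic pose atom by atom. A minimal sketch of that comparison follows, assuming both poses list the same atoms in the same order and share the receptor coordinate frame, so no superposition is applied. The function name `pose_rmsd` and the coordinates are hypothetical and are not taken from the paper or from AutoDock output.

```python
import math

def pose_rmsd(docked, reference):
    """RMSD between two poses given as lists of (x, y, z) tuples.

    Assumes identical atom ordering and a shared coordinate frame
    (no fitting or superposition is performed).
    """
    if len(docked) != len(reference) or not docked:
        raise ValueError("poses must be non-empty and have the same number of atoms")
    squared = 0.0
    for (x1, y1, z1), (x2, y2, z2) in zip(docked, reference):
        squared += (x1 - x2) ** 2 + (y1 - y2) ** 2 + (z1 - z2) ** 2
    return math.sqrt(squared / len(docked))

# Hypothetical three-atom ligand, coordinates in angstroms (illustration only).
reference_pose = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (1.5, 1.5, 0.0)]
# Docked pose shifted by 0.5 angstrom along x relative to the reference.
docked_pose = [(x + 0.5, y, z) for (x, y, z) in reference_pose]

rmsd = pose_rmsd(docked_pose, reference_pose)
redocking_ok = rmsd < 2.0  # the 2 angstrom threshold quoted in the text
print(f"RMSD = {rmsd:.2f} A, within threshold: {redocking_ok}")
```

In practice the comparison is usually restricted to heavy atoms, and symmetry-equivalent atoms may need special handling; neither refinement is attempted here.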
Novel inhibitors were designed on the basis of the standard drugs Ritonavir and Lopinavir. Molecular docking of the designed ligands with the Mpro protein was carried out, and the poses of each ligand were ordered in terms of binding energy and clusters (Table 2). All compounds were placed well within the active site. Among the ten compounds, structure A showed the lowest binding energy and was therefore chosen for further studies. To confirm the stability of compound A in the active site of the Mpro protein, and to compare its interaction modes with those of Lopinavir as a standard drug, an MD simulation was performed. Two- and three-dimensional analyses for the synthesized compounds are shown in Figures S1 and S2. Compound A, Lopinavir and N3 were superimposed, as shown in Figure S3. It can be seen that compound A, like Lopinavir and N3, fit completely into the active site. The top 3 proposed inhibitors were chosen in terms of energy and main interactions in the active site (Figure 2). Figure 2 shows hydrogen bonds between Lopinavir (reference standard) and Ser144, Cys145, Glu166 and Arg188, and pi-alkyl interactions with Met165, Leu167, and Pro168. Compound A forms H-bonds with the Mpro amino acids Cys145, His164, Glu166 and Gln188, and pi-alkyl interactions with Cys145 and Met165. Compound B forms H-bonds with the Mpro amino acids Asn142, Cys145, Glu166 and Gln189, and two sigma and pi interactions with His41. Compound C forms H-bonds with the Mpro amino acids Cys145, Glu166 and Gln189, and a pi interaction with His41. All binding energies calculated using AutoDock are shown in Table 2. According to the molecular docking results, compound A was chosen for MD simulation. A 100 ns MD simulation was carried out to corroborate the stability of compound A in the Mpro active site. The interaction modes of compound A were also compared with those of an Mpro inhibitor used as a reference (Lopinavir).
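Stability over a trajectory of this kind is commonly summarized by the radius of gyration (Rg) of each frame and the per-atom root-mean-square fluctuation (RMSF). A minimal sketch of both quantities follows, assuming the frames have already been superimposed on a common reference. The function names and the two-atom toy trajectory are hypothetical; they are not the authors' analysis scripts or GROMACS tools.

```python
import math

def radius_of_gyration(coords, masses):
    """Mass-weighted radius of gyration of one frame (list of (x, y, z) tuples)."""
    total_mass = sum(masses)
    center = [sum(m * c[k] for m, c in zip(masses, coords)) / total_mass
              for k in range(3)]
    weighted_sq = sum(m * sum((c[k] - center[k]) ** 2 for k in range(3))
                      for m, c in zip(masses, coords))
    return math.sqrt(weighted_sq / total_mass)

def rmsf(trajectory):
    """Per-atom RMSF; trajectory[frame][atom] is an (x, y, z) tuple.

    Assumes all frames are already fitted to the same reference.
    """
    n_frames = len(trajectory)
    n_atoms = len(trajectory[0])
    values = []
    for i in range(n_atoms):
        mean = [sum(frame[i][k] for frame in trajectory) / n_frames
                for k in range(3)]
        msd = sum(sum((frame[i][k] - mean[k]) ** 2 for k in range(3))
                  for frame in trajectory) / n_frames
        values.append(math.sqrt(msd))
    return values

# Toy trajectory: two frames, two atoms of equal mass (illustration only, units nm).
toy_trajectory = [
    [(-1.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(-1.0, 0.0, 0.0), (3.0, 0.0, 0.0)],
]
toy_masses = [1.0, 1.0]

rg_per_frame = [radius_of_gyration(frame, toy_masses) for frame in toy_trajectory]
rmsf_per_atom = rmsf(toy_trajectory)
print(rg_per_frame)   # [1.0, 2.0]
print(rmsf_per_atom)  # [0.0, 1.0]
```

A flat Rg profile indicates that the protein stays compact, and low RMSF values at active-site residues indicate that they are held in place over the run.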
Root-mean-square deviation (RMSD), root-mean-square fluctuation (RMSF), and radius of gyration (Rg) were investigated as the time-dependent behaviors of the MD trajectories. To estimate the conformational stability of the Mpro protein during the simulation, the backbone RMSD of the complexes and the RMSD of the two ligands (compound A and Lopinavir) were studied. As shown in Figure 3(A), in the first 20 ns the RMSD profile of the Mpro-Lopinavir complex was more stable than that of the Mpro-A complex, but for the remaining simulation time the profile of the Mpro-A complex was more stable than that of the Mpro-Lopinavir complex. Overall, the RMSD profile did not change by more than 0.28 and 0.38 nm in the Mpro-A and Mpro-Lopinavir complexes, respectively. The RMSD plots of the two ligands (Figure 3(B)) show that the profiles of compound A and Lopinavir overlapped during the second 50 ns of the simulation. The RMSD results show that both ligands were highly stable in the active site during the MD simulation. The compactness of the protein (Rg) is displayed in Figure 4(A). The Rg profiles of the compound A and Lopinavir complexes overlapped, and the integrity of both complexes was preserved during the simulation. The change in protein flexibility (RMSF) was also investigated during the MD simulation. As shown in Figure 4(B), the RMSF profiles of both complexes overlapped for all amino acids. The main residues, Cys145 and His163, were more stable during the MD simulation. The average values of RMSD, RMSF and Rg calculated for the COVID-19 Mpro-A complex were 0.333, 0.092 and 2.191, respectively. The binding free energy was also computed for the Mpro-Lopinavir and Mpro-A complexes using g_mmpbsa. The resulting binding energy components are reported in Table 3. Given the stability of compound A in the active site of the Mpro protein, this compound could be proposed as a COVID-19 Mpro inhibitor.

Chemistry. General synthetic methods: 1H NMR and 13C NMR spectra were recorded on a 400 MHz spectrometer.
EI mass spectra were recorded on a Shimadzu (Japan) QP2010 S spectrometer. Thin-layer chromatography (TLC) on silica gel plates, with hexane and ethyl acetate, was used to check compound purity. Purification was performed by column chromatography on silica gel (60-120 mesh) with an ethyl acetate and hexane mixture as eluent. Detailed synthesis procedures and characterization data of the products are given in the supplementary data.

To find the main interactions and the binding energy of the designed compounds with the COVID-19 main protease as the receptor, molecular docking was done with the AutoDock 4.2 program (Morris et al., 2009). The pdb file of the COVID-19 main protease was taken from the Protein Data Bank (PDB ID: 6LU7). All water molecules, ligands, and ions were removed from the pdb file. Polar hydrogens were then added, and partial atomic charges were calculated by the Kollman method. The prepared file was saved in pdbqt format for use in the following steps. Three-dimensional structures of the designed compounds were drawn in MarvinSketch Ver. 5.7, ChemAxon (Cosconati et al., 2010). The partial charges of the atoms were determined according to the Gasteiger-Marsili procedure, and the non-polar hydrogens of the compounds were merged (Morris et al., 1998). A 50 × 50 × 50 Å (x, y, and z) grid box was centered on the protease binding pocket with a spacing of 0.375 Å in each dimension. Docking was performed with the Lamarckian genetic algorithm and an empirical scoring function, using a flexible method. The program was run for a total of 50 genetic algorithm runs. All runs were ranked in terms of binding energy and analyzed to obtain the best conformation and orientation of the ligand in the active site of the protein.
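The ranking step at the end of this procedure can be sketched in a few lines. The example below assumes each genetic algorithm run has already been reduced to a run number, a binding energy in kcal/mol, and a cluster label. The tuple layout, the function names, and all values are hypothetical illustrations, not results from the paper and not AutoDock's own output format.

```python
from collections import Counter

# Hypothetical (run_id, binding_energy_kcal_per_mol, cluster_id) records.
docking_runs = [
    (1, -7.2, 1),
    (2, -8.9, 2),
    (3, -8.4, 2),
    (4, -6.1, 3),
    (5, -8.7, 2),
]

def rank_by_energy(runs):
    """Sort runs from most to least favorable (most negative energy first)."""
    return sorted(runs, key=lambda run: run[1])

def most_populated_cluster(runs):
    """Return (cluster_id, member_count) for the largest cluster."""
    counts = Counter(cluster for _, _, cluster in runs)
    return counts.most_common(1)[0]

ranked = rank_by_energy(docking_runs)
best_run = ranked[0]
cluster_id, cluster_size = most_populated_cluster(docking_runs)
print(best_run)                  # (2, -8.9, 2)
print(cluster_id, cluster_size)  # 2 3
```

Considering cluster population alongside the lowest energy is a common sanity check: a low-energy pose that recurs across many independent runs is usually more trustworthy than an isolated one.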
The best-ranked candidate from the docking results was selected to evaluate its thermodynamic behavior and the stability of its binding mode in the Mpro binding pocket using molecular dynamics (MD) simulation. All MD simulations were performed with the GROMACS-2019.3 package (Abraham et al., 2015). The Amber99SB force field was used in the MD simulations (Cornell et al., 1995). Drug topology parameters were prepared with the AnteChamber PYthon Parser interfacE (ACPYPE) (Da Silva & Vranken, 2012). To determine which residues adopt non-standard ionization states, pKa values were obtained using the PROPKA 3.1 web server (Søndergaard et al., 2011). The TIP3P water model was selected (Jorgensen, 1983), and the ligand-protein complex was solvated in a dodecahedral water box. Na+ ions replaced solvent water molecules to neutralize the system. Energy minimization was performed, and the MD simulation began with two equilibration stages: 1) a 500 ps simulation in the NVT ensemble, at a constant number of particles, volume, and temperature; 2) a 1 ns simulation in the NPT ensemble, at a constant number of particles, pressure, and temperature. Finally, the MD simulation was run at 300 K for 100 ns. The Particle Mesh Ewald (PME) method and the linear constraint solver (LINCS) algorithm were used to compute long-range electrostatic interactions and to constrain covalent bonds, respectively. Structure visualization was done using VMD 1.8.6 (Humphrey et al., 1996) and PyMOL.

In summary, we designed and synthesized novel potential COVID-19 main protease inhibitors that contain the main core of Lopinavir connected to various heterocyclic fragments. Lopinavir is one of the few agents that have been successful in clinical trials for the treatment of COVID-19 patients. First, molecular docking was carried out to identify the interactions of these compounds with the main protease protein.
Our findings revealed that all designed structures docked successfully, but compound A exhibited the lowest binding energy within the protein pocket and could be considered a potential COVID-19 main protease inhibitor. The molecular dynamics simulation also supported this claim. Next, these structures were synthesized through a facile and efficient six-step route instead of available strategies that involve more than ten harsh reaction steps. Because of the similarity of the synthesized compounds to the Lopinavir and Ritonavir scaffold, they could be introduced as potential main protease inhibitors. However, further research is needed to validate these compounds using in vitro and in vivo methods and to pave the way for them in drug discovery.

GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX, 1-2, 19-25
Synthesis of a hydroxyethylene dipeptide isostere, a core unit of the HIV protease inhibitors ritonavir and lopinavir, and its C-5 epimer
Nucleosides. 5. Synthesis of guanine and formycin B derivatives as potential inhibitors of purine nucleoside phosphorylase
A second-generation force field for the simulation of proteins, nucleic acids, and organic molecules
Virtual screening with AutoDock: Theory and practice
Treatment of coronavirus disease 2019 in Shandong, China: A cost and affordability analysis
Stereoselective hydroazidation of amino enones: synthesis of the ritonavir/lopinavir core
ACPYPE - AnteChamber PYthon Parser interfacE
Discovery studio modeling environment
Process for the preparation of sulfentrazone (US Patent 5990315)
Quantitative structure-activity relationship and molecular docking revealed a potency of anti-hepatitis C virus drugs against human corona viruses
Lamellarins and related pyrrole-derived alkaloids from marine organisms
Identification of polyphenols from Broussonetia papyrifera as SARS CoV-2 main protease inhibitors using in silico docking and molecular dynamics simulation approaches
COVID-19: Travel health and the implications for sub-Saharan Africa
Studies on synthesis and cyclization reactions 2-(5-Amino-3-arylpyrazol-1-yl)-3-methylquinoxalines
Some One Health based control strategies for the Middle East respiratory syndrome coronavirus. One Health
Synthesis of novel azanorbornylpurine derivatives
VMD: Visual molecular dynamics
Synthetic approaches to the lamellarins: a comprehensive review
Structure of Mpro from SARS-CoV-2 and discovery of its inhibitors
Comparison of simple potential functions for simulating liquid water
In silico study of pharmacological treatments against SARS-CoV2 main protease
New tetracyclic tacrine analogs containing pyrano[2,3-c]pyrazole: efficient synthesis, biological assessment and docking simulation study
Arguments in favour of remdesivir for treating SARS-CoV-2 infections
Fragment Based Drug Design: From Experimental to computational approaches
Potential purine antagonists. VI. Synthesis of 1-Alkyl- and 1-Aryl-4-substituted Pyrazolo[3,4-d]pyrimidines
Going global - Travel and the 2019 novel coronavirus
Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function
AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility
Docking and ligand binding affinity: Uses and pitfalls
Influence of the membrane lipophilic environment on the structure and on the substrate access/egress routes of the human aromatase enzyme. A computational study
Improved treatment of ligands and coupling effects in empirical calculation and rationalization of pKa values
Role of ligand-based drug design methodologies toward the discovery of new anti-Alzheimer agents: Futures perspectives in fragment-based ligand design
Synthesis of HIV protease inhibitor ABT-378 (lopinavir)
Molecular docking of aromatase inhibitors
New carbocyclic nucleoside analogues with a bicyclo[2.2.1]heptane fragment as sugar moiety; synthesis, X-ray crystallography and anticancer activity
Crystal structure of 2-(pyridin-4-yl)-5-(undecylthio)-1,3,4-oxadiazole
A pneumonia outbreak associated with a new coronavirus of probable bat origin

This work was financially supported by Isfahan Pharmaceutical Sciences Research Center, School of Pharmacy, Isfahan University of Medical Sciences, Isfahan, Iran.

No potential conflict of interest was reported by the author(s).

http://orcid.org/0000-0002-3022-2507