key: cord-0792516-yeke2g9q authors: Aljarba, Nada H.; Hasnain, Md Saquib; Bin-Meferij, Mashael Mohammed; Alkahtani, Saad title: Design & discovery of small molecule COVID-19 inhibitor via dual approach based virtual screening and molecular simulation studies date: 2022-01-29 journal: J King Saud Univ Sci DOI: 10.1016/j.jksus.2022.101867 sha: e9921a74443e4a515dbcdae4ebe41f1eb4cda84a doc_id: 792516 cord_uid: yeke2g9q The emerged COVID-19 (SARS corona virus) pandemic leads to severe or fatal respiratory tract infections affecting millions of people worldwide since its outbreak. The situation needs the newer molecule to control the infections as the pandemic had very badly affected the health and socioeconomic conditions of human being. CoV-2 main protease is considered to be key enzyme by targeting which we can design or develop the drug candidate. The active fitting and binding of any molecule depends upon the shape and electrostatic properties of ligand complementary to the receptor site. In this study ZINC13 database, a drug like subset (13,195,609 molecules) was subjected to shape and electrostic based virtual screening (VROCS & EON software) and followed by molecular modelling studies using docking and molecular dynamics simulation. Further the drug ability of identified candidate was predicted by the SiteMap analysis. The best shape and electrostatic similarities were observed between ZINC19973962 and reference molecule. The Taminto(shape) and Tanimoto(electrostatic) was found to be 0.667 and 0.022 respectively. The molecule also displayed the identical binding pattern with docking score -7.964 and this interaction was further validated by the molecular dynamics simulations. The RMSD & RMSF values were found to be 1.5 Å and1.8 Å respectively suggesting the stability of complex and very low fluctuation in ligand-protein complex over the entire MD simulation run. SiteMap analysis showed the identical Dscore of reference and identified HIT that indicated the molecule ZINC19973962 would be the promising druggable candidate against COVID main protease enzyme and can be used as lead molecule for the development of anti-COVID molecule. Severe acute respiratory syndrome (SARS) coronavirus 2 causes severe highly prevalent and deadliest diseases including middle east respiratory syndrome (MERS) and SARS in human and animals. The later outbreak was first reported in Wuhan province of China and declared as pandemic by world health organization (WHO) in March 2020. (Surveillances 2020, Zumla and Niederman 2020) Since then millions of COVID cases and death associated with COVID infection have been reported worldwide. The data indicated that the transmission rate of SARS-CoV-2 is higher (2-2.5%) and fatality rate (5%) is lower than previously reported coronaviruses like SARS coronavirus (1.7-1.9% transmissibility, 9.5% fatality) and MERS coronavirus (<1% transmissibility, 34.4% fatality) (Mahase, 2020) . The major route of transmission is respiratory droplets generated during infected through person's cough, sneezes, speaks and contact with particles contaminated with virus droplets, whereas some reports suggested that transmission may occur through digestive tract (Tsang et al., 2020; Zhou, et al., 2020) . The incubation period of virus is 1-14 days which may increase up-to 19 days in asymptomatic carriers. The period from onset of symptoms to death may range from 6-41 days which depends upon immunity and age of patient. Patients with one or more comorbidities like hypertension, diabetes and cardiovascular disease have more fatality rates (Mazumder, et al., 2020; Zhao, et al., 2020) . The major symptoms of COVID-19 are difficulty breathing, cough, runny nose or symptoms associated with pneumonia. Severity of symptoms may be mild, moderate or severe depending on the clinical features. Mild infection is manifested by development of pneumonia or nonpneumonia whereas severe infection is manifested by breathlessness, acute respiratory failure or sometime multiple organ failure that led death (Lovato, et al., 2020; Yin, et al., 2020; Zare-Zardini, et al., 2020) . Clinical laboratory studies showed leucopenia and lymphopenia are the prime feature of COVID-19. In many patients abnormal myocardial zymograms was observed due elevated level of lactate dehydrogenase and creatinine kinase. Increased level of C-reactive protein was noted with high erythrocytes sedimentation rate and D-dimer were also noted (Aghagoli, et al., 2020; Cui, et al., 2020) . Since the outbreak various research groups and companies have been tested the library of existing antiviral drugs for the promising drug candidate or lead against the SARS-COV2 virus. Various reports have been published on generated in-silico data for development of effective candidate against this enzyme (Cao, et al., 2015; Tian, et al., 2021) , but none of them was found to be effective. Still there is tremendous demand for development of newer molecule that can cure the acute viral SARS-COV2 infection. Recently a few reports have been indicated the possible use of some compounds from natural or synthetic origin such as Kobophenol A, (Gangadevi, Badavath et al. 2021 ) Tideglusib, Carmofur, Shikonin, Ebselen, Disulfiram and PX-12 for their possible use in SARS-CoV-2 infection (Jin, et al., 2020) . The Kobophenol A was reported as an ACE2 Receptor inhibitor and binds with the Spike RBD Domain of SARS-CoV-2, whereas later all were COVID main protease inhibitors. In this study, we report the dual approach of ligand and protein based virtual screening by shape and electrostatic comparative study with previously reported inhibitor and molecular modellingbased discovery of novel HITs for the possible SARS-CoV-2 inhibitors. Comparative shape and electrostatic similarities indices based virtual screening of reference molecule and Zinc13 database (Irwin and Shoichet 2005) was done by using OpenEye toolkit. (OEChem) The modeling and SiteMap analysis was performed with Maestro and SiteMap module of Schrodinger LLC suite. (Schrodinger 2011) The molecular dynamics simulation was done by using Desmond V3. The protein structure CoV-2 main protease (PDB id 6LU7) (Jin, Du et al. 2020 ) was downloaded from the scientific collaboratory protein data bank (www.rcsb.org). (Berman, et al., 2000) . The downloaded raw protein was devoid of H bonds; some residues were missing that was updated by processing with protein preparation wizard to avoid any physicochemical constraint. All the procedures were successfully executed using above mentioned software running on ubuntu operating system 1TB machine (Intel-R, C™ i7-8700 CPU 3.20GHz, 3.19GHz, 16GB RAM 1TB). The basis of current study was based on the comparative virtual screening of reference molecule with Zinc13 database. The reason in differentiating the topological properties is that it takes account of molecules with shape similarity by superimposition of different molecules with known inhibitors. As we know the complementary topological are the primary determinants for active site of the protein (Sastry et al., 2011; Bitencourt-Ferreira and de Azevedo, 2018; Siddique et al., 2020) . Moreover, the stability of ligand-protein complex is also dependent on these interactions. Therefore, it was thought that we should consider these primary determinants at the preliminary screening of drug database. These forces are involved in electrostatic interactions and led to non-covalent bond formation between oppositely charged moieties. (Morrone et al., 2016; de Ávila and de Azevedo 2018; Siddique et al., 2018) . Thus, comparing shape complementary and electronic features of (Fig3) reference molecule with the Zinc database gave the important structural framework which realized the binding affinity and functional potency. The OpenEye ToolkitsROCS and EON algorithm were check for shape and electrostatic similarities respectively. During the shape and electrostatic screening, the molecules were screened depending on shape agreement with that of reference molecule. The obtained top HITS were used for comparative electrostatic correlations and their shape and electrostatic scores were counted for result analysis as shown in Figure3. Receptor grid is set of active site residues where molecule should get bind and modulate the activity of enzyme of protein. 'Receptor Grid Generation' wizard in Glide was used for grid generation with default parameters for partial cut-off (0.25) and scaling factor (1.0) without any force. The site was determined by literature reported and centroid of the selected residues was used for generation of grid. Ligprep utility in Glide module used for generation of various conformers, isomers of screened ligand. A highest number of 32 tautomers and stereoisomers were produced. OPLS 2005 force field was used for energy minimization and ligands were also desalted. The generated gird file with active site residues was used for docking of ligand using Glide software. The default 0.8 scaling factor (vdW) and 0.15 potential charge cut-off from were set. The analysis of results XP pose viewer utility was used by importing xpz file. MD simulation study was used to check the ligand-protein complex at target site in physiological conditions. It was done by using Desmond V3 module running workstation. The docked proteinligand complex was merged and used for generation for orthorhombic simulation box by the system builder utility of desmond module. It was built with Simple Point-Charge (SPC) explicit water model and the 10 Å distance was maintained between the solvent surfaces with protein surface. This solvated system was neutralized and 0.15 M concentration of physiological salt was maintained. This equilibrated system was used to MD simulations studies. The MD simulation was performed at 310.15 K temperatures over a 100 ns with constant temperature, constant pressure (1.0 bar). The CMS generated after the successful MD run was used for analysis of results using the simulation interaction analysis utility. The MD trajectory was written with 1000 frames during the entire simulation run but only initial protein backbone frames were aligned to understand the stability of ligand-protein complex. The RMSD, RMSF and interaction plots were used to understand the stability of complex. Single binding site region was evaluated by Sitemap tool of Schrödinger. All the molecules were separately tested for calculation of Dscore, hydrophilicity, hydrophobicity, H bond donor and acceptor properties. These computed properties are used to calculate the site score and Dscore. Depending on these scores, the druggability of target was also predicted. The binding site of Covid main protease (CYP1B1) was validated by comparing scores to all the molecules. (Halgren, 2007; Halgren, 2009 ). ADME drug likeliness properties are the crucial part of new drug development procedure as many molecules are withdrawn from the market due to their poor pharmacokinetic profiles. We used QikProp v3, Schrödinger, 2005 for prediction of ADME properties. To calculate the in-vitro and in-vivo drug likeliness properties, we considered various parameters such as H bond donor and acceptor, human oral absorption, molecular weight, Lipinski's rule of five and predicted aqueous solubility as these are the important determinants of how the body going to react with any external molecule. If all these came under the identical or normal values, then we consider the identified HIT to be taken into further stages of drug development processes. HTVS based on the shape and electrostatic complementary of reference molecule (Ebselen) with Zinc13 (13, 195, 609) was performed. The best scoring top 15000 molecules were further subjected to molecular modeling studies via XP mode in glide. The top 10 molecules were analyzed and top identifies molecule was used for molecular dynamics, SiteMap analysis and insilico ADME calculation for drug likeliness properties ( Figure. 2). The shape and electrostatic properties of any molecule are the primary topological determinants that govern the active fitting and binding at the catalytic site of protein. In the present study the top HIT was displayed the good structural similarities with reference molecule. Figure. 3B and 3C displayed the shape and electrostatic potential of the ZINC19973962 compared to reference molecule whereas figure.3A showed the complete overlapping of reference molecule with identified HIT. The Tanimoto shape and electrostaticscores of ZINC19973962 with reference molecule was found to be 0.667 and 0.22 respectively. The identified top HIT ZINC19973962 was chemically different from the reference molecule but it displayed the good shape and electrostatic complementary with reference molecule and these values suggested that ZINC19973962 could actively bind with the amino acid residues present at protein catalytic site. Based on the active binding and fitting of HITs at the active pocket it was considered that these molecules would be potential inhibitors of COVID main protease enzyme. But generated insilico data do not confirm the actual in vitro or in vivo biological activity. The identified best scoring top 1500 molecules from preliminary shape and electrostatic screening was led to further in silico studies. Molecular docking studies of these molecules were performed against the COVID main protease enzyme using 6LU7 pdb file. Before the run of docking studies, the software needs to be validated and it was simply done by extracting out the internal ligand and again redocking at its actual place without changing its states or generating any conformers. The obtained root means square deviation value 3.365 Å suggested the software is ready to use for study. The top 1500 HITs obtained from the preliminary screening were docked at the active site of COVID main protease. The key interaction between molecule and active site residues was determined by extra precision docking. From the docking studies compound with id ZINC19973962 displayed the good docking score of The reference and ZINC19973962 molecules were used for SiteMap analysis to check the drug ability of identified HIT. It was considered that identical scores and values could assure the The pharmacokinetic properties of any drug candidate are a very crucial aspect that should be taken in consideration in the drug discovery and development process. The reason behind this is many promising and strong druggable candidate was failed in the later stages of drug discovery processes due to poor pharmacokinetic profiling. This causes huge loss of time and cost of the drug development process that ultimately add the cost of the project. Hence, if we can check or predict the in-silico pharmacokinetic properties of HIT or lead molecule which is supposed to be taken into further stages of drug discovery process to avoid the later stage failure of drug discovery process. Here we also checked the in-silico pharmacokinetic profile of identified HIT i.e ZINC19973962. Different physicochemical properties that are going to play a major role in human body or behavior of body onto the drug candidate were determined by using qikprop utility of Schrodinger tool. The various physicochemical parameters of drug candidate like molecular weight, LogP, metabolism, logBB human oral absorption and polar surface area was predicted. For a molecule to be orally active, the violation of Lipniski's rule of five was checked. The results obtained were listed in table 3. The obtained values were suggested that the molecule was not violated the Lipniski's rule of five hence there is chance that molecule may be orally active. The obtained LogBB value indicated that the molecule will not cross the blood brain barrier and not be able to produce any CNS related toxicities. Thus, the obtained values suggested the fate of ZINC19973962 i.e absorption, distribution, metabolism and excretion. By this it was postulated that the identified HIT can absorb orally and reached to specific target in unaltered form to produce the desired effect. Cardiac involvement in COVID-19 patients: Risk factors, predictors, and complications: A review The protein data bank Development of a machine-learning model to predict Gibbs free energy of binding for protein-ligand complexes A screen of the NIH Clinical Collection small molecule library identifies potential anti-coronavirus drugs A 55-day-old female infant infected with 2019 novel coronavirus disease: presenting with pneumonia, liver injury, and heart damage Development of machine learning models to predict inhibition of 3-dehydroquinate dehydratase Kobophenol a inhibits binding of host ace2 receptor with spike rbd domain of sars-cov-2, a lead compound for blocking covid-19 The MM/PBSA and MM/GBSA methods to estimate ligandbinding affinities New Method for Fast and Accurate Binding-site Identification and Analysis Identifying and characterizing binding sites and assessing druggability ZINC− a free database of commercially available compounds for virtual screening Structure of M pro from SARS-CoV-2 and discovery of its inhibitors Upper airway symptoms in coronavirus disease 2019 (COVID-19) Coronavirus: covid-19 has killed more people than SARS and MERS combined, despite lower case fatality rate Geriatric care during public health emergencies: lessons learned from novel corona virus disease (COVID-19) pandemic SAnDReS a computational tool for statistical analysis of docking results and development of scoring functions There is no corresponding record for this reference Rapid shape-based ligand alignment and virtual screening method based on atom/feature-pair similarities and volume overlap scoring Schrodinger Software Suite Comparative Computational Studies on Selective CytochromeP450 1B1 Inhibitors Comparative Shape and Electrostatic Study of Highly Potent and Selective CYP1B1 Inhibitor: Assessment of Active Site of CYP1B1 by Binding Mode Analysis Using Site Map Tool Non-Carboxylic Acid Inhibitors Of Aldose Reductase Based On N-Substituted Thiazolidinedione Derivatives The epidemiological characteristics of an outbreak of 2019 novel coronavirus diseases (COVID-19)-China, 2020 An update review of emerging small-molecule therapeutic options for COVID-19 Temporal profiles of viral load in posterior oropharyngeal saliva samples and serum antibody responses during infection by SARS-CoV-2: an observational cohort study Concomitant neurological symptoms observed in a patient diagnosed with coronavirus disease 2019 Coronavirus disease 2019 (COVID-19) in children: prevalence, diagnosis, clinical symptoms, and treatment Advances in the relationship between coronavirus infection and cardiovascular diseases Clinical course and risk factors for mortality of adult inpatients with COVID-19 in China: a retrospective cohort study The explosive epidemic outbreak of novel coronavirus The authors extend their appreciation to the Deputyship for Research & Innovation, Ministry of Education in Saudi Arabia for funding this research work through the project number (PNU-DRI-Targeted-20-031). Many promising drug candidates were failed in later stages of the drug development due to lesser credibility of in-silico data. Hence, the more validated in-silico data is needed to consider any molecule as a suitable lead that can be taken into further stages of drug development. In the present study combined ligand and protein based physicochemical parameters were considered for HTVS based design and discovery of small molecule inhibitors of covid-19 main protease enzyme. Then top scoring 15000 molecules were subjected to molecular docking and dynamics simulation studies. Considering the fact that these two-sided physicochemical aspects and validated in-silico results and chances of lesser failure of later stage drug development process.The results obtained based on these studies were analyzed and ZINC19973962 was identified as top scoring HIT after the successful completion of computational studies. Based on topological similarity indices with reference molecule, identical binding pattern, stability of ligand-protein and drug ability the identified molecule would be the promising HIT and can used as a lead molecule for further development of small molecular inhibitors against SARS COVID main protease enzyme. The authors declare no conflicts of interest associated with this manuscript. The data generated or analyzed in this article are online publicly available without request. Nada H. Aljarba and Md Saquib Hasnain were performed the electrostatic study; Saad Alkahtani was performed the molecular docking studies. Mashael Bin-Meferij was acquired and analyzed the data. Nada H. Aljarba and Mashael Bin-Meferij were performed dynamics simulations and SiteMap analysis. Md Saquib Hasnain and Saad Alkahtani were involved in the conception and design of the study, data interpretation, and critically revised the manuscript.