key: cord-0068013-f7ucnmcx authors: Srivastava, Ruby title: Theoretical Studies on the Molecular Properties, Toxicity, and Biological Efficacy of 21 New Chemical Entities date: 2021-09-15 journal: ACS Omega DOI: 10.1021/acsomega.1c03736 sha: 4a30e424214c6576ec84bb5f8874f28ecbceaeb4 doc_id: 68013 cord_uid: f7ucnmcx [Image: see text] New chemical entities (NCEs) such as small molecules and antibody–drug conjugates have strong binding affinity for biological targets, which provide deep insights into structure-specific interactions for the design of future drugs. As structures of drugs increase in complexity, the importance of computational predictions comes into sharp focus. Knowledge of various computational tools enables us to predict the molecular properties, toxicity, and biological efficacy of the drugs and help the medicinal chemists to discover new drugs more efficiently. Newly approved drugs have higher affinities for proteins and nucleic acids and are applied for the treatment of human diseases. We have carried out the computational studies of 21 such NCEs, specifically small molecules and antibody–drug conjugates, and studied the biological efficacy of these drugs. Their bioactivity score and molecular and pharmacokinetic properties were evaluated using online computer software programs, viz., Molinspiration and Osiris Property Explorer. The SwissTargetPrediction tool was used for the efficient prediction of protein targets for the NCEs. The results indicated higher stability for the drug complexes due to a larger HOMO–LUMO gap. A high electrophilicity index reflects good electrophilic behavior and high reactivity of the drugs. Lipinski’s ‘‘rule of five’’ indicated that most of the drug complexes are likely to be orally active. These drugs also showed non-mutagenic, non-tumorigenic, non-irritant, and non-effective reproductive behavior. We hope that these studies will provide an insight into molecular recognition and definitely help the medicinal chemists to design new drugs in future. The pharmaceutical industry has been facing significant challenges since the past decade regarding the increased research and development (R&D) costs, looming patent expirations, and continuously declining number of new drug approvals. 1−3 Due to the loss of the roaring success of drugs' market share to generic competition, it has become a serious concern that innovation in first-in-class drug discovery has stagnated and the new approved drugs would not be able to replace the losses incurred by expiring patents. Latest data show that the U.S. Food and Drug Administration (FDA) has approved a total of 289 new molecular entities (NMEs), 89 biological license applications (BLAs), and 27 biosimilars between 2010 and 2019, which were evaluated according to curative areas, procedures, administrative routes, high quality selection, approval times, approval numbers by year, and expedited review categories. 4 Only 19 NMEs were approved in 2007, which has been the lowest number since 1983. The median number of NME and BLA approvals per year was 40, with a low of 21 in 2010 and a high of 59 in 2018. These factors indicate that the pharmaceutical industry should reinvent itself to stop the expected decline in productivity and bring innovative new drugs to the market. The significant contribution of computational and theoretical studies of quantum chemistry has allowed medicinal chemists to obtain more precise molecular properties and bioactivity of drugs in a shorter time. 5−8 Due to the evolution of computing data storage and higher processor performance, molecular modeling has been very efficient to solve drug-related issues without compromising the accuracy of the predicted data. 9−15 Investigation of the mechanism of action of drugs on therapeutic targets can be carried out using structure−activity relationship (SAR) and quantitative structure−activity relationship (QSAR) models. The structures of drugs and their activities or properties are studied using molecular modeling methods and statistical methods. As the structures of small-molecule drugs increase in complexity, computational capability comes into sharp focus. These computational methods are used to design therapeutically important drugs in the pharmaceutical industry. Being divided into two categories as structure-based and ligand-based, the energy evaluation of these two methods depends on force fields or empirical scoring functions. 16 Therefore, an increased understanding of these ligand−receptor interactions within target molecules is needed to design new drugs with higher specificity. Fragment-based drug discovery (FBDD) offers the possibility to encode an enormously large number of chemical structures both for similarity-based (2D) and structure-based (3D) de novo design applications. 17 Various FBDD tools can be used for the search and classification of molecules using physicochemical constraints, 18 scaffold hopping, 19 design of molecular architectures based on ligands, 20 structure-based search within large fragment spaces, 21 and similarity-driven library design. 22 Fragment-based drug discovery (FBDD) has also achieved great success in the discovery of highly specific protein kinase inhibitors (PKIs). Protein kinases (PKs) are considered as important drug targets in many diseases due to their vital role in cellular signaling 23 and in the treatment of multiple diseases. Another novel drug discovery strategy is targeted protein degradation using bifunctional small molecules (PROTACs), which is used to remove specific proteins from within cells. It is a potential strategy which offers therapeutic interventions that have not been achieved with the existing approaches. 24 Recently, the Pesticide And Drug Fragments (PADFrag) database with a distinct chemo-bioinformatic cross-referencing resource was designed by Yang et al., 25 which is the first database to define and explore the biologically relevant fragment space by comparing the pesticide and drug fragments. PADFrag combines detailed bioactive fragment data and potential targets and is used to analyze the privileged structures within known bioactive molecules, ab initio molecule library design, core fragment ACS Omega http://pubs.acs.org/journal/acsodf Article discovery for fragment-based drug design and prediction of fragment interactions, and other drug-related research. 25 Density functional theory (DFT) has offered a high level of accuracy with lesser computational time. Being cost effective, DFT techniques have provided significant amount of information in a limited time period for biological entities. A drug effect is produced by the molecular recognition of the ligand binding to the active site of a target molecule. It also determines the structure and binding interaction strength of the drug molecule and its target. In a lot of studies, the geometries of drug molecules are predicted efficiently by DFT. DFT also successfully predicts the possible interactions between a drug and a target consisting of ionic and covalent interactions, 26, 27 dipole−dipole interactions, ion−dipole interactions, hydrogen bonding, 28,29 charge transfer, and hydrophobic interactions. 30 DFT can also be utilized to explicate the reaction mechanisms of drug and precisely calculate the transition state, which is an important tool for drug design. Conceptual DFT (CDFT), originally developed by Parr and collaborators, 31−37 with several global and local reactivity descriptors helps us to understand various physicochemical processes. As global reactivity descriptors are connected with several electronic structure principles, they play a very important role in obtaining the physico-chemical information of the complexes. Various studies 38−42 have been carried out to predict the chemical properties and the assessment of chemical behavior of drug complexes in order to design and develop new molecules with important biological activities. The aim of our study is to understand the relationship between the drug structure and its activities and properties responsible for the drug efficiency using CDFT. Out of 11 therapeutic areas such as anti-infective/antibiotic, cardiovascu-lar and hematologic, neurological [central nervous system (CNS)], dermatologic, gastrointestinal, inflammatory and immunologic, metabolic, musculoskeletal, oncologic, reproductive, and respiratory drugs, we have selected 21 drugs from 10 therapeutic areas. Those drugs are lascufloxacin, pretomanid, relebactam, triclabendazole, esaxerenone, voxelotor, cenobamate, lasmiditan, mirogabalin, remimazolam, solriamfetol, ubrogepant, benvitimod, trifarotene, upadacitinib, sotagliflozin, alpelisib, erdafitinib, zanubrutinib, relugolix, and elexacaftor. The chemical structures of these 21 drugs are given in Figure 1a ,b. The optimized structures of the 21 selected drugs lascufloxacin (1), pretomanid (2), relebactam (3), triclabendazole (4), esaxerenone (5), voxelotor (6), cenobamate (7), lasmiditan (8), mirogabalin (9), remimazolam (10), solriamfetol (11), ubrogepant (12), benvitimod (13), trifarotene (14) , upadacitinib (15), sotagliflozin (16) , alpelisib (17) , erdafitinib (18) , zanubrutinib (19) , relugolix (20) , and elexacaftor (21) are given in Figure 2 , and the SMILES notations and chemical formulas are given in Table 1 . Several chemical reactivity descriptors, such as the chemical potential, global hardness, softness, electrophilicity index, and electronegativity, and HOMO−LUMO (HL) gap, and molecular descriptors have been calculated to understand the physicochemical and pharmacological aspects and the eco-toxicological characteristics of drugs, see Table 2 . The global reactivity descriptors describe the reactivity of the molecule as a whole. The calculated vertical IP values for the drugs are 3.01−8.14 eV and EA values are 0.18−2.27 eV in the water phase. A lower IP of complexes 13, 15, and 19 means a higher probability of losing an electron. It is clear from the trends observed in IP values that all these drug molecules are capable of electronic charge transfer. A higher EA for complexes 2, 6, 10 and 18 means a higher probability of gaining an electron. Chemical hardness (η) is directly correlated to the stability of the molecule, whereas softness (S) is a measure of its reactivity. The calculated chemical hardness (η), for each of the chemical species, is found to be lower for all complexes except complex 17. It means a little intramolecular charge transfer for complex 17. Electronegativity (χ), which is the negative of the electronic chemical potential (μ), is a measure of the tendency of a molecule to attract electrons. Electronegativity also measures the attraction of an atom for electrons in a covalent bond. For two unlikely covalently bonded atoms, the shared electrons will be more strongly attracted to the atom of greater electronegativity. In our studied drug complexes, the electronegativity is higher, 2.18−4.43 eV. The chemical potential (μ), which is a measure of the tendency of an electron to escape, is found to follow the opposite trend for the studied complexes. The electrophilicity index is considered as a better descriptor to provide information on electron transfer (chemical potential) and stability (hardness). Also, it has both the ability of an electrophile to acquire an additional electronic charge and the resistance of the system to exchange an electronic charge with the environment. A higher electrophilicity index means a highly reactive nature of the drug molecules. It also measures the propensity of chemical species to accept electrons. A high value of electrophilicity index describes a good electrophilic behavior, while a small value of electrophilicity index describes a good nucleophilic behavior. The electrophilicity index is higher for most of the complexes, 1.70−5.27 eV. The reactivity of the molecule can be demonstrated by the distribution of the orbital frontiers which is predicted with the help of the frontier molecular orbital (FMO) theory. The frontier orbital gap helps to characterize the chemical reactivity and kinetic stability 43 of the molecules. The HOMO and the LUMO determine the way by which it interacts with other species. The HOMO energy indicates the tendency of a molecule to donate electrons to empty molecular orbitals with a low energy of convenient molecules and the LUMO energy indicates the ability to accept electrons. The higher HL gap of the studied drug complexes reflects the high chemical stability and the low reactivity of the complexes. According to Lipinski's ''rule of five'', a candidate molecule will likely be orally active, if (i) the molecular weight (MW) < 500, (ii) the calculated octanol/water partition coefficient (log P) < 5, (iii) there were fewer than 5 hydrogen bond donors (HBD) (OH and NH groups), and (iv) there are less than 10 hydrogen bond acceptors (HBA) (notably N and O). 44 Molecular properties such as log P, topological polar surface area (TPSA), HBD and HBA, and MW, and violations of Lipinski's rule of five were determined to evaluate the druglikeness and drug score of the studied drugs. The Ghose, Pritchett, and Crippen method 45 was used to calculate the log P values. Log P is related to the lipophilicity of compounds and is useful for prediction of the absorption of drugs across the intestinal epithelium. In our studied drug complexes, only complexes 12 and 20 have MW > 500. The violation of log P values is seen in complexes 1 and 14 (a negative value for log P indicates higher affinity for the aqueous phase), while other drug complexes show log P (0.39−4.5) < 5, which is the upper limit for drug penetration through biomembranes. Thus, all these compounds have good bioavailability. Furthermore, HBA (as total number of nitrogen and oxygen atoms) is less than 10 for all the complexes except complexes 3 and 21, and HBD (as total number of nitrogen−hydrogen and oxygen−hydrogen bonds) is less than 5 for all the drugs except complex 3. The TPSA is calculated from the surfaces which are occupied by the oxygen and nitrogen atoms and by the hydrogen atoms attached to them. Thus, TPSA is closely linked to the hydrogen bonding potential of a compound. 46 Molecules with TPSA values > 140 Å 2 or more would show poor intestinal absorption, 47 and TPSA < 90 Å 2 is needed to penetrate the blood−brain barrier (and thus act on receptors in the CNS). . It should be noted that the log P and TPSA values are not sufficient to predict the oral absorption of a drug. 48 The drug complexes need to respect the rule of five. Log P can be correlated with PSA when the potential drugs are evaluated according to Hughes et al. 49 who proposed the criteria as log P < 4 and PSA > 75 Å 2 . As is known, two or more violations of the rule of five suggest the likelihood of bioavailability problems, 49 see Table 3 . According to the Lipinski rule, most of the drug complexes showed good bioavailability. Properties such as hydrogen bonding, hydrophobicity, electronic distribution, flexibility, size of molecules, and the presence of various characteristics of pharmacophores have an effect on the behavior of the molecule in a biological medium, including the bioavailability, reactivity, toxicity, protein affinity, metabolic stability, transport properties, and many more. The activity of all standard compounds and drugs has been rigorously analyzed according to the following four criteria of successful drug activity: activity on the G protein-coupled receptor (GPCR) ligand, modulation of ion channels, inhibition of http://pubs.acs.org/journal/acsodf Article kinases, proteases, and other enzymes, and activity on nuclear receptor ligands. Molinspiration Cheminformatics software was used to calculate the drug activities of the 21 drug complexes and they are reported in Table 4 . A molecule having a bioactivity score more than 0.00 is most likely to exhibit considerable biological activities, while molecules with values −0.50 to 0.00 are expected to be moderately active, and if the score is less than −0.50 it is presumed to be inactive. The results clearly reveal that the physiological actions of drug complexes might involve multiple mechanisms and could be due to the interactions with GPCR ligands, nuclear receptor ligands, and inhibitors of proteases and other enzymes. These bioactivity scores are highly suggestive of a moderate interaction with all drug targets. To further predict the higher efficiency and probable prediction of protein targets for the studied NCEs, we used the SwissTargetPrediction online tool. We found that the complexes 2, 3, 4, 8, 10, 17, 19 , and 20 have higher probability for kinase inhibitors, complexes 5, 9, 11, 14, and 18 for GPCR, and complexes 6 and 21 for enzyme inhibitors as potential targets. Complexes 15 and 16 have higher probability for nuclear receptors and electrochemical transporters as potential targets. Complexes 1 and 21 have estimated equal probability for kinase and GPCR as targets, while complex 13 has the estimated equal probability for lyase and GPCR as targets, see Figure 3 . Nowadays, structure-based designs are very common, but because of inappropriate absorption, distribution, metabolism, elimination, and toxicity (ADMET) properties, many potential drugs do not reach the clinic. Therefore, the risks of toxicity (mutagenicity, tumorigenicity, irritation, reproductive effect) and the physicochemical properties (drug-likeness and drug score) of the studied 21 drug compounds were calculated using Osiris software, see Table 5 . The toxicity risk predictor locates fragments in a molecule, indicating a potential toxicity risk. These alerts indicate that the structure drawn can be harmful concerning the specified risk category. The color code for these categories are red, green, and yellow. 50 Green color suggests a low toxic potential, yellow means mild toxicity, and red color indicates a high probability for toxicity. The results clearly indicate that all compounds except complexes 3, 4, 14, and 18 would be safe and expected to show low or no toxicity regarding mutagenicity, tumorigenicity, irritation, and effect on the reproductive system. According to the data presented in Table 5 , following are the toxic effects of the above-mentioned drug complexes: 3 (highly toxic mutagenic and highly toxicreproductive), 4 (slightly toxic mutagenic, highly toxicirritant, and highly toxicreproductive), 14 (highly toxicmutagenic and highly toxictumorigenic), and 18 (mildmutagenic). At the same time, most of the drugs have shown good results in screening with best drug score values (DS = 0.62−0.83), compared to the other drugs having lower values in Table 5 . The results indicate higher chemical stability for the studied 21 drug complexes. Good electrophilic behavior has been observed for the studied drug complexes due to the higher electrophilicity index. Most of the studied drugs are non-mutagenic, nontumorigenic, non-irritant, and without effects on reproduction, and they have good drug score values. The results indicate that the physiological actions of drug complexes might involve multiple mechanisms and could be due to the interactions with kinase inhibitors, GPCR ligands, nuclear receptor ligands, and inhibitors of proteases and other enzymes. The higher HL gap of the studied drug complexes predicted a high chemical stability and a low reactivity of the complexes. In the past few years, QSAR-based computational models have been used to predict the physicochemical properties, druglikeliness, and toxicity of drugs. On the basis of these data, recently developed artificial intelligence (AI) approaches, such as deep learning and relevant modeling approaches, are implemented for the safety and efficacy evaluations of drug molecules. The continued improvement of machine learning (ML) methods which compete with standard approaches and expert and skill has changed the world of computational medicinal chemistry. In near future, the basic scientific research Table 5 . DLDrug-Likeliness, DSDrug Score, MUTMutagenicity, TUMOTumorigenicity, IRRIIrritant Effect, and REPReproductive Effect for the Studied 21 Drug Complexes will move to a place where knowledge and understanding will overtake the quantitative results. The calculations have been performed with two different basis sets, M06 51 /6-31G** and B3LYP 52 /6-31G** from G16 53 software suites. Vibrational frequency analysis has been carried out for all the optimized structures and there is no negative frequency for the lowest energy structures. Finally, the lowest energy minima with M06/6-31G** basis set values were selected for further calculations. The energy values from the two methods are reported in Table S1 . The calculations were performed using water as a solvent by the integral equation formalism-polarized continuum model (IEF-PCM) according to the SMD solvation model. 54 GaussView 55 was used to draw the optimized structures of drug molecules. The output (pdb) structures were used as input to generate the SMILES notation from the Online SMILES Translator and Structure File Generator tool. Chemical reactivity theory was been applied to calculate the global descriptors for these NCEs. Molinspiration 56 software (www.molinspiration.com) was used to calculate various molecular properties and to predict the bioactivity score for drug targets including enzymes and nuclear receptors, kinase inhibitors, GPCR ligands, and ion channel modulators. Molecular properties such as log P, TPSA, HBD and HBA, and MW weight were also calculated. Lipinski's rule 44 of five was applied to evaluate the drug-likeness, and the bioactivity score of the drugs were also compared. Using Molinspiration Cheminformatics, all pharmacokinetic parameters were evaluated to predict the bioactivity of compounds. The online SwissTarget-Prediction 57 tool was used for the efficient prediction of protein targets for the NCEs. Osiris Property Explorer 58 was used to determine pharmacokinetic parameters such as toxicity potential, solubility, and overall drug-likeness of these NCEs. The results of virtual screening were evaluated and color-coded either green or red for properties such as the mutagenic effect, irritant effect, tumorigenicity, and effect on the reproductive system. Red color indicates the high risk of undesired effects, while green color shows drug-conformity behavior, compatibility, and safety in vivo. The optimized structure (CIF) coordinates of all 21 drug complexes are given in the Supporting Information. The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acsomega.1c03736. Formulas used, optimized energies (au) of the studied 21 drug complexes by M06/6-31G** and B3LYP/6-31G** basis sets, and optimized structure (CIF) coordinates of the studied 21 The manuscript is solely written by the corresponding author. The author declares no competing financial interest. R.S. acknowledges the financial assistance by the DST WOSA project (SR/WOS-A/CS-69/2018). R.S. is thankful to her mentor Dr. Shrish Tiwari, Bioinformatics, CSIR-Centre for Cellular and Molecular Biology, and Dr. G. Narahari Sastry, Director, CSIR-NEIST for the technical support. Why Has R&D Productivity Declined in the Pharmaceutical Industry? Diagnosing the Decline in Pharmaceutical R&D Efficiency How to Improve R&D Productivity: The Pharmaceutical Industry's Grand Challenge Computational Methods in Drug Discovery Towards reproducible computational drug discovery Computational Chemistry on a Budget: Supporting Drug Discovery with Limited Resources Progress on open chemoinformatic tools for expanding and exploring the chemical space Integrating molecular modelling methods to advance influenza A virus drug discovery Accuracy or novelty: what can we gain from target-specific machine-learning-based scoring functions in virtual screening? Briefings Bioinf Rationality over fashion and hype in drug design Structurebased drug repositioning over the human TMPRSS2 protease domain: search for chemical probes able to repress SARS-CoV-2 Spike protein cleavages Small data materials design with machine learning: When the average model knows best Transforming Computational Drug Discovery with Machine Learning and AI Fragment-based drug design facilitates selective kinase inhibitor discovery Exploring fragment spaces under multiple physicochemical constraints Recore: A fast and versatile method for scaffold hopping based on small molecule crystal structure conformations De novo design of molecular architectures by evolutionary assembly of drugderived building blocks Structure-based searching in large fragment spaces LoFT: Similarity-Driven Multiobjective Focused Library Design The protein kinase complement of the human genome Protac-Induced Protein Degradation in Drug Discovery: Breaking the Rules or Just Making New Ones? PADFrag: A Database Built for the Exploration of Bioactive Fragment Space for Drug Discovery Ionic to covalent bonding: a density functional theory study of linear and bent X2Y3 monomers Periodic trends in bond dissociation energies. A theoretical study Ab initio study of the binding strength of POSS-cation complexes Predicting the binding energies of H-bonded complexes: A comparative DFT study The appropriateness of density-functional theory for the calculation of molecular electronics properties Density Functional Theory of Atoms and Molecules Density Functional Theory of Electronic Structure Density Functional Theory. In Computational Medicinal Chemistry for Drug Discovery Density-functional theory of the electronic structure of molecules Chemical Reactivity and Dynamics within a Density-based Quantum Mechanical Framework An atom counting strategy towards analyzing the biological activity of sex hormones Computational Quantum Chemical Study, Drug-Likeness and In Silico Cytotoxicity Evaluation of Some Steroidal Anti-Inflammatory Drugs Electron-correlation based externally predictive QSARs for mutagenicity of nitrated-PAHs in Salmonella typhimurium TA100 Chemical reactivity theory (CRT) study of small drug-like biologically active molecules DFT calculations and POM analyses of cytotoxicity of some flavonoids from aerial parts of Cupressus sempervirens: Docking and identification of pharmacophore sites Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings Atomic physicochemical parameters for three-dimensional structure directed quantitative structure-activity relationships. III: Modeling hydrophobic interactions Evaluation of chromatographic descriptors for the prediction of gastro-intestinal absorption of drugs SwissADME: a free web tool to evaluate pharmacokinetics, drug-likeness and medicinal chemistry friendliness of small molecules Proteolysis targeting chimeras (PROTACs) in "beyond rule-of-five" chemical space: Recent progress and future challenges Physicochemical drug properties associated with in vivo toxicological outcomes Actelion's property explorer, Actelion's Pharmaceuticals Ltd A new local density functional for maingroup thermochemistry, transition metal bonding, thermochemical kinetics, and noncovalent interactions Development of the Colle-Salvetti correlation-energy formula into a functional of the electron density Universal Solvation Model Based on Solute Electron Density and on a Continuum Model of the Solvent Defined by the Bulk Dielectric Constant and Atomic Surface Tensions SwissTargetPrediction: Updated Data and New Features for Efficient Prediction of Protein Targets of Small Molecules Osiris property explorer