key: cord-0019149-2fh5nh1q
authors: Daoui, Ossama; Elkhattabi, Souad; Chtita, Samir; Elkhalabi, Rachida; Zgou, Hsaine; Benjelloun, Adil Touimi
title: QSAR, molecular docking and ADMET properties in silico studies of novel 4,5,6,7-tetrahydrobenzo[D]-thiazol-2-Yl derivatives derived from dimedone as potent anti-tumor agents through inhibition of C-Met receptor tyrosine kinase
date: 2021-07-03
journal: Heliyon
DOI: 10.1016/j.heliyon.2021.e07463
sha: 1c780a04c96ee1af1892cc6ae8179fe33cc2d03c
doc_id: 19149
cord_uid: 2fh5nh1q

A quantitative structure-activity relationship (QSAR) study is performed on 48 novel 4,5,6,7-tetrahydrobenzo[D]-thiazol-2 derivatives as anticancer agents capable of inhibiting c-Met receptor tyrosine kinase. The present study is conducted using multiple linear regression, multiple nonlinear regression and artificial neural networks. Three QSAR models are developed after partitioning the database into two sets (training and test) via the k-means method. The obtained values of the correlation coefficients by the three developed QSAR models are 0.90, 0.91 and 0.92, respectively. The resulting models are validated by using the external validation, leave-one-out cross-validation, Y-randomization test, and applicability domain methods. Moreover, we evaluated the drug-likeness properties of seven selected molecules based on their observed high activity to inhibit the c-Met receptor. The results of the evaluation showed that three of the seven compounds present drug-like characteristics. In order to identify the important active sites for the inhibition of the c-Met receptor responsible for the development of cancer cell lines, the crystallized form of the Crizotinib-c-Met complex (PDB code: 2WGJ) is used. These sites are used as references in the molecular docking test of the three selected molecules to identify the most suitable molecule for use as a new c-Met inhibitor. A comparative study is conducted based on the evaluation of the predicted properties of ADMET in silico between the candidate molecule and the Crizotinib inhibitor. The comparison results show that the selected molecule can be used as new anticancer drug candidates.

Recently, aromatic heterocyclic thiazole compounds have received increasing attention in the medicinal chemistry domain [1, 2] . Thiazoles and their derivatives have shown significant biological activities due to their anti-inflammatory properties [3] , anticonvulsifs [4] , insecticides [5] , antioxydants, anti-tumor, antihypertensives, pesticides [6] , and antidiabetic potential [7] . In particular, 1,3-thiazole structures have been successfully used as effective anti-cancer agents. For example, the 1,3-thiazole formulations S3U937 Figure 1 (A) [8] , and S8A375 Figure 1 (B) [9] , showed anticancer activities against different types of cancer [10] (Figure 1 ).

The c-Met receptor tyrosine kinase represents an interesting anti-cancer target [11] . Optimization of the series of 2-amino-5-aryl-3-benzyloxypyridine molecules used in the development of a new clinical candidate, namely Crizotinib (PF-02341066) ( Figure 2 ) [12] , which demonstrated strong inhibition of c-Met kinase tyrosine in Vitro and in Vivo tests, and shows excellent pharmaceutical properties and strong inhibition of tumor growth [13] . Crizotinib inhibits the c-Met kinase and perturbs the c-Met signaling pathway [14] . Overall, this agent inhibits the growth of cancer cells.

To develop a new molecule for use in a drug, a very long synthesis process must be followed.

For this reason, pharmaceutical industries are moving towards innovation and new research methods, including the prediction of molecule activities before their synthesis. The use of molecular modeling techniques such as QSAR and molecular docking have become very important methods [15] . In the context of the development of new molecules inhibiting c-Met, a molecular modeling study of 4,5,6,7-tetrahydrobenzo [D]-Thiazol-2-Y derivatives is performed in this work using the molecular series of these derivatives synthesized by Mohareb et al. [16] since these derivatives showed high biological activity and a high capacity to inhibit the c-Met protein more effectively than the Crizotinib inhibitor.

According to the studies conducted by James et al. [13] and Christensen et al. [14] , there is a significant relationship between c-Met kinase activity and tumor cell growth. It is therefore necessary to find effective inhibitors of c-Met to remove the tumor cells generated by the increased enzymatic activity of the c-Met protein. In this work, candidate inhibitors of the studied series are determined to be used in the inhibition of c-Met as novel anticancer agents. These are determined by QSAR and molecular docking analysis. In addition, the evaluation of ADMET properties in silico is performed to confirm the use of the best selected inhibitor that can be used as anticancer drug.

The rest of this work is organized as follows: the second section presents materials and methods. The third section includes the simulation results and discussions. As well as the final section concludes the performed work.

Quantitative structure-activity relationships (QSAR) are powerful methods used in drug discovery [15, 28] . We calculated multidimensional molecular descriptors (Constitutional, Topological, Physico-chemical, Geometrical, and Quantum) in order to identify the regions of space related to the inhibition of the enzymatic activity of the c-Met protein. To this aim, we construct QSAR models by using statistical methods [17] . In this study, we used a set of 48 molecules of 4,5,6,7-tetrahydrobenzo[d]-thiazol-2-yl derivatives to construct QSAR models. To develop these models, we used multiple linear regression (MLR) analysis, multiple nonlinear regression (MNLR) and artificial neural networks (ANN). The predictive capacity of the developed models is tested by several validation techniques that are: internal and external validations and Y-randomization methods. In addition, we also examined the applicability (AD) of the QSAR model obtained by MLR based on William's plot to detect outliers and external compounds [18] . In addition, we evaluated the drug-likeness properties of a set of molecules that achieved high biological activity against cancer cell growth [19] . The evaluation of drug-likeness properties is performed by respecting a set of basic rules that are: Lipinski [20] , Veber [21] and Egan rules [22] . After the selection of the candidate molecules, we perform the molecular docking of the selected compounds with the c-Met receptor. This is to identify the different types of interactions between the selected molecules and the active sites of c-Met, as well as to determine the positions and orientations of these molecules in the c-Met receptor pocket.

To perform the molecular modeling, we use the experimental IC 50 values of anti-cancer enzymatic activity data of a series of 48 compounds of novel 4,5,6,7-Tetrahydrobenzo[D]-Thiazol-2-Yl previously synthesized and presented by Mohareb et al. [16] . The observed activities IC 50 (nM) are converted to pIC 50 level (pIC 50 ¼ -logIC 50 ), which are presented in Table 1 .

In the present study, we have based on the computation of 15 different molecular descriptors of the studied series that belong to different classes (1D, 2D, 3D) to develop linear and non-linear mathematical models. Tables 2 and 3 show the computed descriptors by using Chem3D V16 [23] , ChemSketch12 [24] and Gaussian 09 software [25] . The topological, physico-chemical and geometrical descriptors are calculated after optimizing the energy of each compound using the MM2 method (force field method with gradient for root mean square (RMS) of 0.01 kcal/mol) [26] . The geometrical structure of the studied compounds is also optimized by the Becke's three-parameter hybrid method and the Lee-Yang-Parr B3LYP function using the 6-31G(d) basis to calculate the quantum chemical descriptors using the Gaussian 09W software [25, 26] . We calculate the molecular descriptors to be used in the development of 2D-QSAR models, as these descriptors represent the most important structural properties of the studied molecules. After calculating these descriptors for all the molecules of the studied series, we determine the quantitative relationship between these descriptors and the biological inhibitory activity of the c-Met enzyme. The quantitative relationship is constructed via statistical methods (MLR, MNLR and ANN) for representing QSAR models.

To construct QSAR models, we studied a series of 48 molecules obtained by in vitro synthesis [16] . All the studied molecules showed a strong enzymatic inhibitory activity of c-Met (pIC 50 > 8) . To develop the QSAR models in this study, we use the statistical methods presented below. [16] . 01 

PCA is a very efficient method for assembling the information encoded in components, and is widely used to understand the distribution of components and the links between them [29] . The PCA method is based on descriptive statistics, and the main objective of using this method is to extract as much information as possible from the database [30] . In this work, we use the PCA method to identify the different molecular descriptors that will contribute to the development of the QSAR models. The PCA method is applied using fifteen descriptors calculated for each molecule in the series of 4,5,6,7-Tetrahydrobenzo[D]-Thiazol-2-Yl derivatives.

After selecting the most important molecular descriptors that are uncorrelated through the PCA method, we divide the database into two sets (training and test). Thus, the training and test sets include 80% and 20% of the total data, respectively [31] . The training set is used to develop QSAR models, while the test set is used to evaluate the effectiveness of each developed model. The procedure of dividing the data set into training and test sets is performed by the k-means classification method [32] . In each cluster obtained after the K-means division method, one compound from each cluster is randomly selected as part of the test set, while the remaining compounds are selected as the training set. After this division, we obtain ten compounds for the test set and thirty-eight compounds for the training set.

To carry out the analysis of the relationship between the structure and activity of the studied compounds, the most widely used methods in the development of QSAR models are used, which are: MLR [33] , MNLR [30] , and ANN [34] . This is to build QSAR models capable of predicting the biological efficacy of molecules in inhibiting the enzymatic activity of the c-Met protein. Through the obtained QSAR models, the biological activity is related to the molecular descriptors. The descriptors of the obtained model represent very important parameters that can influence the inhibition of the enzymatic activity of the c-Met kinase protein. In our work, the MNLR and MLR models are developed with XLSTAT V. 2019 software [35] , and the ANN model is developed with MatlabV.2015a software [36] . In the statistical analysis of QSAR models, we rely on classical analytical approaches [37] , where the main parameters used in this approaches are: the determination coefficient (R 2 ) (Eq. 1), correlation coefficient, adjusted coefficient (R 2 adj ) (Eq. 2), the mean squared error (MSE) (Eq. 3), high F-value (F > 0.33) and the level of signification (p-value) is traditionally between 1% and 5% [38] . 

where Y obs is the value of the observed response, Y calc is the value of the predicted response, Y calc is the average value of observed/predicted responses, p is the number of explicative variables in the model, and n is the number of individuals.

After partitioning the dataset, we use the descriptors obtained by the PCA method to develop an accurate and statistically acceptable QSAR model using the MLR method. This is done in order to define the most important descriptors to be used as inputs in the development of other QSAR models via the MNLR and ANN techniques.

The MLR method is widely used in QSAR studies for molecular descriptor selection due to its simplicity and robustness [39] . MLR is also used to identify the descriptors used as input parameters in the development of QSAR models by the MNLR and ANN methods [40] . MLR is based on the hypothesis that the dependent variable is linked linearly to certain independent variables according to the following relationship given by Eq. (4).

Where Y is the dependent variable (biological activity to be predicted), X i are the independent variables (molecular descriptors), n is the number of molecular descriptors, a 0 is the constant in Eq. (4), a i represent the coefficients of the descriptors.

The MNLR method is a non-linear approach (exponential, logarithmic, polynomial...), which consists in determining the mathematical model that best describes the non-linear variation of a molecular property or biological activity (Y) as it relates to molecular descriptors (X i ) [41] . In this context, we use the polynomial model of the second order to build the QSAR model via the MNLR technique, based on the descriptors that are determined by the MLR model. The nonlinear relationship between molecular descriptors and biological activity is done according to Eq. (5).

Where: Y is the dependent variable (biological activity to be predicted), X i are the independent variables (the molecular descriptors), n is the number of molecular descriptors, a 0 is the constant in the equation of the model, a i and b i represent the coefficients of the descriptors in the equation of the model.

Artificial neural networks are used to augment the probability of characterization of the compound and to generate a predictive model linking all quantitative molecular descriptors obtained from the MLR model and the values of observed biological activities [42] . We develop the QSAR model based on the ANN method to confirm the accuracy of the selected molecular descriptors that are obtained by the MLR model. Moreover, the ANN model allows us to obtain the biological activity predictions for each molecule with high accuracy. The ANN model that we develop in this work is of the feed-forward type [34] . This method is based on the sigmoid transport function in the hidden layer and the linear transfer function in the output layer. The ANN architecture in this work is composed of three layers of neurons, called the input layer, hidden layer, and output layer, as shown in Figure 3 .

The input layer contains a number of neurons equal to or less than the number of descriptors obtained using the multiple linear regression model, and the output layer contains the predicted activity values. In order to determine the number of hidden neurons within the hidden layer, it is necessary to calculate the parameter ρ, this parameter being calculated according to the relationρ ¼ (number of weights)/(number of connections) [43] . According to the recommendations of some authors, the value of the parameter ρ should be between 1 and 3 to ensure that the ANN model is statistically acceptable, and that the forecasts obtained through this model are made in a way that ensures the contribution of all the elements of the database used [44, 45] .

After developing the QSAR models, it is necessary to perform certain statistical tests to confirm the validity of the proposed models. In this work, the developed QSAR models are validated by an internal and an external validation. Also, we perform a Y-randomization test to evaluate the efficiency of the original model obtained by the MLR method, and then we determine the applicability domain of the MLR model.

To validate the QSAR models developed by MLR, MNLR and ANN, we use the internal validation procedure named leave-one-out cross-validation (LOOCV) [46] . This validation is based on the calculation of the R 2 cv coefficient value by using Eq. (6). According to [46] the value of R 2 cv should be more than 0.5. This indicates that the developed model is robust in the internal prediction.

Where Y obs ðtrainÞ is the value of the observed response, Y calc ðtrainÞ is the value of the response predicted by Loo-cv, Y calc ðtrainÞ is the mean value of the observed/predicted responses.

In this test, we apply the QSAR models developed to predict the activities of the compounds of the test set. The test set contains compounds from the series of molecules studied in this work, but these compounds did not contribute to the development of the QSAR models. We assessed the external ability of the QSAR models to predict the activity of the test set molecules by calculating the coefficient R 2 test between the observed pIC 50 values and the predicted pIC 50 values after the inclusion of the test set. The importance of evaluating the value of R 2 test in the external validation of QSAR models has been described by Globarikh and Tropsha [46] . Accordingly, it has been described that when the value of R 2 test is greater than 0.5, the model is statistically acceptable in prediction and can be applied to new external data [47] .

The Y-randomization test is used to avoid the possibility of random correlation between descriptors and their corresponding biological activities in the model that was initially obtained by the MLR technique. Therefore, any random correlation between X values (molecular descriptors) and Y values (biological activity) will affect the efficiency and validity of the MLR model as well as the MNLR and ANN models. the Yrandomization test is distributing randomly the experimental properties/ activity values on the descriptors of the original model, and thanks to this distribution, new models are generated [48] . The QSAR model is considered acceptable and was not obtained by chance through the Y-randomization test, when the average random correlation coefficient (R 2 r ) of the randomly constructed models is less than the correlation coefficient (R 2 ) of the original non-random model [49] .

The applicability domain of the original QSAR model is obtained by MLR. This step is the last step in the validation of the developed QSAR models in the present work. The QSAR model cannot be considered as a universal model, because it is developed on a limited number of compounds that do not cover the total space chemical [50] . The applicability domain of the QSAR model is defined as a space that includes the chemical space of the molecules in the training set. This space contains the molecules with correctly predicted activities [50] . Thus, QSAR models cannot accurately predict the characteristics of all compounds involved. Therefore, determining the applicability domain of QSAR models is of great importance to identify molecules that are not correctly predicted in terms of activities. Hence, molecules that are outside the applicability domain of the QSAR model will not be considered for molecular modeling. In addition, the verification and validation process through the definition of the applicability domain is very important according to the Organization for Economic Cooperation and Development (OECD) [51] . In the absence of the required AD analyses, any QSAR model can predict the activity of any compound, even with a completely different structure than the molecules under study. There are several methods for defining the AD models [52] , but the most frequently used method is the determination of the leverage values effect (h i ¼

:::; n) for each compound, x i is the vector descriptive of the compound to be found, X is n Âðk À 1Þ matrix of the k descriptor values of the model for n compounds of the training set, and the exponent ð:Þ T refers to the transposition of the matrix/vector [51] . The Williams plot is used to determine the AD within a square zone [53] and the level of leverage h * (h * ¼ 3 Â ðkþ1Þ n ) [50] with n is the number of compounds in the training set and k is the number of selected descriptors in the model. When the leverage effect h for the compound is higher than the alert leverage h * of the same compound, indicates that this compound negatively affects the constructed model, so it is considered outside the applicability domain [18] .

In this work, we define the applicability domain of the Williams plot type by using MatlabV2015a software.

We note that many potential therapeutic agents fail to reach clinical trials because of their unfavorable parameters of ADMET (absorption, distribution, metabolism, elimination and toxicity) [50, 51] . The drug-likeness is the most recent method proposed to identify compounds that are recommended for use in drugs that must respect certain rules that are important: Lipinski's [20] , Veber's [21] and Igan's [22] rules. In this work, we predict the drug-likeness properties of compounds that have experimentally and predictively demonstrated excellent activity, so that these molecules are within the applicability of the previously determined QSAR model. The Lipinski, Veber and Igan rules are based on the evaluation of ADME properties of human drugs. These rules are very useful to discover drugs based on the 2D structure of small molecules, and on the bioavailability of these molecules by the oral administration [56] . Compounds whose physical and chemical properties do not meet at least two of the Lipinski, Veber and Igan rules are subject to a variety of problems in their pharmacokinetic properties related to ADMET. Less than 10% of drugs that reach the clinical trial phase do not meet any of these rules (Lipinski, Veber and Igan). We also evaluate two other factors, which are the number of Rotatable bonds (n-ROTB) and Topological polar surface area (TPSA) [57] . The prediction of these factors allows us to know if the molecule interacts with the receptor in a flexible mode or an inflexible one [58] . In this study, we evaluate the drug-likeness and pharmacokinetic in silico properties of the molecules to be selected as inhibitory agents of c-Met enzyme activity, using the online SwissADME [59] and pkCSM [60] servers, respectively.

Molecular docking has recently become an essential tool in drug discovery [61] , because of its ability to predict the conformation and mode of binding of the ligand to the receptor binding site. In this study, we dock the selected molecules by evaluating the drug-likeness properties with the c-Met receptor in order to identify the most appropriate candidate molecule for the inhibition of c-Met enzymatic activity. c-Met protein is also called tyrosine kinase Met protein or hepatocyte growth factor receptor (HGFR) [13] . Similar to most receptor tyrosine kinases (RTKs), hepatocyte growth factor (HGF) c-Met is also a regulator of many critical cellular processes, including embryonic development, cell growth, differentiation, vasodilation and tissue regeneration [62] . High enzymatic expression of HGF/c-Met also leads to the growth of different types of solid tumors in humans [63] . Inhibition of the high enzymatic activity of c-Met protein leads to inhibition of various cancer cell lines [13] . A large number of small molecules have been mentioned recently in the inhibition of c-Met enzymatic activity. However, the FDA has approved only one molecule as the lead anti-tumor drug in the c-Met inhibitor class, called Crizotinib (PF-02341066) [13] . The crystalline structure of c-Met (code PDB: 2WGJ) is obtained from the RCSB protein database [64] , The 2WGJ crystal complex is formed by the co-crystallized ligand of Crizotinib bound to the c-Met receptor.

Firstly, before performing the molecular docking, the c-Met protein is prepared by removing water molecules, the associated ligand (crizotinib) and all non-protein elements. Then, polar hydrogen atoms are added to the c-Met receptor structure. Next, the preparation of ligands that will be docked to c-Met consists of adding hydrogen atoms to these ligands and optimizing their structure. The site of binding is defined as the volume occupied by the co-crystallized ligand Crizotinib in the c-Met receptor pocket with a radius of 5 Å.

The protein and ligands as well as the ligand entry site into the c-Met protein pocket are prepared in the present work by using Discovery Studio 2016 software [65] . Also, AutoDock software (ADT) MGLTools 1.5.6 packages [66] are used in the re-docking of the Crizotinib ligand with the c-Met receptor and in the docking of molecules (5, 16, and 46) with c-Met receptor. The 3D grid is constructed using the AUTOGRID algorithm which determines the box grid where the total binding energy of the ligands with the receptor are measured [67] . The grid is defined for x ¼ 60, y ¼ 60, z ¼ 60 with a distance of 0.375 Å between the grid points. We then identified the coordinates x ¼ 21.70 Å, y ¼ 83.73 Å, z ¼ 4.28 Å as the docking site of the selected ligands in the c-Met protein pocket. Next, we dock the ligands with the c-Met protein using Lamarckian Genetic Algorithms (LGA) to obtain the best molecular docking match [67] .

This work explores molecular docking for two purposes. The first one consists in displaying the ligand visualization profiles in the c-Met receptor pocket, and comparing the realized binding energies between the ligands and the c-Met receptor. The second aim is the identification of the most important active sites of the c-Met protein, which are responsible for its enzymatic activity at the source of cancer, and to analyze the interactions that occur between the docked molecules and the identified active sites. Next, the candidate molecules for inhibition of c-Met enzyme activity are selected and their docking is studied. This is achieved based on the results of the drug-like evaluation of the compounds that show high biological activity predicted by the QSAR models. The binding modes of Crizotinib inhibitor with c-Met receptor and the active sites with which Crizotinib inhibitor interacts are identified according to the study reported by J. Jean Cui et al [13] . The active sites interacted with the Crizotinib are used as reference to predict the inhibition of c-Met enzymatic activity.

To validate the molecular docking procedure, we re-docked the crystallized ligand of Crizotinib to the c-Met receptor to determine the predicted binding energy between Crizotinib and the c-Met receptor, as well as to predict the reference active sites involved in the inhibition of c-Met activity. To ensure that the docking procedure is acceptable and valid, the range of the RMSD must be less than 2 Å according to Y. Westermaier et al. [68] .

After performing the PCA analyses, the following descriptors are selected: EVDW, LogP, αe, Pc, MW, MV, and ɳ as inputs for developing the QSAR model through the MLR technique. The seven aforementioned descriptors are selected among fifteen ones based on the correlation coefficient values. In fact, the descriptors of lowest correlation coefficients between them are selected as shown in the correlation matrix presented in Table 4 . Then, the values of these descriptors are attributed to all the 48 molecules studied in the form of a matrix of 7 columns and 48 rows. Then, the database obtained is divided into two sets (training and test). This division is done using the K-means technique. The outcomes of this division are presented in Table 5 . From these results, the following molecules (2, 6, 8, 14, 22, 23, 24, 26, 38 and 42) are selected for the test set, and the next molecules (1, 3, 4, 5, 7, 9, 10, 11, 12, 13, 15, 16, 17, 18, 19, 20, 21, 25, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 39, 40, 41, 43, 44, 45, 46, 47, and 48) are selected for the training set.

The resulting QSAR model via the MLR technique is given by Eq. (7) below. 7) is statistically acceptable. In addition, the achieved p-value that is less than 0.05 (Pr < 0.0001) indicates that the QSAR model equation is statistically significant with level greater than 95%. In addition, the value of the cross-validation correlation coefficient (R 2 cv ¼ 0:62), greater than 0.5, indicates the accuracy of the obtained QSAR model through MLR technique. The value of R 2 cv less than R 2 value indicates the fragility and weakness of the model when excluding any element of the training set. Figure 4 shows the relationship between the observed activity values and the predicted ones of (pIC 50 ). The latter are obtained by QSAR model based on the MLR technique for the molecules in both test and the training sets.

From Figure 4 , we notice that the distribution of observed and predicted pIC 50 values are significantly correlated, which is due to the low obtained MSE value. Thus, it is clear that the experimentally obtained values and the predicted ones from the QSAR model are correlated. Therefore, it is apparent that the seven descriptors in Eq. (7) show a strong linear correlation with the biological activity of pIC 50 that inhibits the enzymatic activity of the carcinogenic protein c-Met. In order to improve the relationship between the predicted activities obtained by the developed QSAR model via the MLR technique and the seven molecular descriptors, new QSAR models are developed using two different nonlinear techniques, namely the MNLR and ANN techniques. The following descriptors: EVDW, LogP, αe, Pc, MW, MV and ɳ are used as input parameters in these two techniques.

The nonlinear QSAR model obtained by the MNLR technique is presented in Eq. (8) À0:124 Â αe À 1:5 Â 10 À3 Â αe 2 þ 5:7 Â 10 À3 Â Pc À 2:729 Â 10 À7 Â Pc 2 À 1:66 Â 10 À2 Â MW þ 1:63 Â 10 À5 Â MW 2 þ6:5 Â 10 À3 Â MV À 3 Â 10 À5 Â MV 2 þ 0:345 Â η þ 0:064 Â η 2 

When developing a QSAR model by using the ANN technique, the following architecture 7-4-1 with ρ ¼ 1:027 is used. With a value of 1 < ρ < 3 it is apparent that the number 4 in the hidden layer is proportional to the number of descriptors 7 in the input layer in order to predict the pIC 50 values expressed as 1 in the output layer. The developed QSAR model via the ANN technique shows a high value of the determination coefficient (R 2 ¼ 0.84) and a low value of the mean square error (MSE ¼ 0.04), as well as the value of the cross-validation coefficient (R 2 cv ¼ 0:78) is lower than the R 2 value. These results confirm the efficiency of the QSAR model in the prediction of the anticancer biological activity of the studied molecules. Therefore, the selection of the seven descriptors EVDW, LogP, αe, Pc, MW, MV and ɳ in predicting pIC 50 values is a successful. From Figure 6 , the even distribution of candidate pIC 50 values in the training and the test sets ensures that the pIC 50 values obtained by the ANN model predictions are very close to the experimentally observed values.

We present in Table 6 the pIC 50 values predicted by the MLR, MNLR and ANN models that are developed in the present study. The three QSAR models developed are successfully internally validated. In order to test the accuracy of the predictive power of obtained QSAR models, we perform an external validation. In the following paragraph, we present the results of the performed test.

We perform the external validation test by evaluating the power of the QSAR models to predict the pIC 50 activity values of the molecules from the test set, by computing the coefficient of correlation R 2 test with R 2 test represents an important criterion in evaluating the performance of externally validated models in predicting the activities of molecules not involved in the development of the models. The achieved values of R 2 test are 0.67, 0.69 and 0.64 for the MLR, MNLR and ANN models, respectively. The R 2 test values of the three models are close to each other, and these values are also greater than 0.5. Hence, the external validation of the QSAR models ensures the strong power of these models to predict pIC 50 values.

In Table 7 , we present a summary of the results of internal and external validation on the QSAR models obtained in this study.

The comparison between the performance of MLR, MNLR and ANN models in terms of the coefficients (R, R 2 , MSE) indicates that all the developed models are statistically significant, and show high internal and external predictive ability.

Hence, MLR, MNLR, and ANN models are capable to correlate a strong quantitative relationship between molecular descriptors (LogP, Polarizability, Parachor, Molecular Weight, Molecular Volume and Chemical Hardness) and biological activity (pIC 50 ) that inhibits the enzymatic activity of c-Met protein. Thus, the QSAR models developed via MLR, MNLR and ANN techniques can be exploited to predict the activity values of other molecules that can be designed by making modifications to the structure of 4,5,6,7-Tetrahydrobenzo[D]-Thiazol-2-Yl derivatives in order to obtain new molecules with stronger biological activities than those observed. In this work, instead of designing new molecules and predicting their activities, we select the most apt molecules to inhibit the enzymatic activity of the c-Met protein among the series of 4,5,6,7-tetrahydrobenzo[D]-Thiazol-2-Yl derivatives that we study. For this selection, we rely on the high activity values obtained by the predictions of the QSAR models. Before selecting the molecules with the highest predicted biological activity, we perform two essential tests to validate the efficiency of the predicted pIC 50 values. The performed tests are: Y-randomization and applicability domain tests. Both tests are used to avoid the selection of one or more molecules whose activities are not correctly predicted in this study. 

In order to ensure the quality of the QSAR models that select the candidate molecules to inhibit c-Met activity, a Y-randomization test is performed on the original QSAR model that is obtained by the MLR technique. This test is performed to reduce the possibility of randomly obtaining a strong correlation between the seven descriptors in (Eq. 7) and the biological inhibitory activity of c-Met.

We perform the Y-randomization test by randomly distributing the Y values (pIC 50 ) fifty times without changing the seven X descriptors. We perform the Y randomization test by randomly distributing the Y values (pIC 50 ) fifty times without changing the seven X descriptors. The random distribution of Y values to X descriptors enabled the generation of 50 new QSAR models. Each new model has new values of (R r ; R 2 r and R 2 r;cv ). The results of the Y-randomization test for the first 50 iterations are presented in Table 8 . Through these results, we find that the values of R r , R 2 r , and R 2 r;cv obtained by the randomly constructed models are lower than the values of R; ; R 2 and R 2 cv obtained by the original model. These results confirm that the original model obtained in Eq. (7) is robust and that the correlation between the seven descriptors and biological activity is not due to chance. Thus, we can confirm that the pIC 50 values predicted by the QSAR models based on the seven descriptors presented in the original model (Eq. 7) are not due to chance.

In the present test we determine the applicability domain of the original QSAR model obtained by MLR technique by analyzing the relationship between residual values and the leverage effect. Figure 7 shows the applicability domain obtained by the Williams diagram. The leverage effect threshold value is h* ¼ 0.52 and the distribution of normalized residual values and leverage level values were calculated and determined in Figure 7 of the Williams plot. The pIC 50 values predicted by the QSAR model are correct and valid only for compounds within the applicability domain located to the left of the leverage threshold h* ¼ 0.52, while molecules outside the applicability domain are not predicted correctly.

In Figure 7 , we note that the molecules (2 and 8) in the test set have a standard deviation outside the AE x range (x ¼ 2.5), but even so, these molecules remain in the applicability domain adjacent to the other molecules. We note that only one molecule is outside the applicability domain and that is molecule 1 of the training set. It could be that the activity value of this compound was not correctly predicted due to the availability of incorrect experimental data for this molecule. For this reason, we will remove molecule 1 from the list of molecules we propose to use as novel inhibitors of c-Met enzymatic activity, and also exclude this molecule from molecular modeling for the rest of this study. Thus, we use only the molecules within the applicability domain situated to the left of the h* threshold as candidate group members for c-Met inhibition. From these molecules only those with the highest predicted pIC 50 values will be selected.

Based on the predicted pIC 50 values shown in Table 9 in the attached file, as well as the information obtained by the applicability domain, we selected seven compounds that presented high pIC 50 values in terms of c-Met enzymatic inhibitory activity. All of these seven molecules present higher predicted pIC 50 values than those experimentally observed in comparison with the drug Crizotinib. We report the predicted pIC 50 values for the selected molecules, as well as the experimental pIC 50 value for the Crizotinib drug in Table 9 .

The prediction results of pIC 50 values presented in Table 9 that are obtained by the three models MLR, MNLR and ANN show that these values are very close to each other. Therefore, it is difficult to favor one molecule over another and select it as the best inhibitor of c-Met enzymatic activity based only on the predicted pIC 50 values. Therefore, to select the most candidate molecules for inhibition of c-Met kinase enzymatic activity among the seven molecules, we perform an in-silico study in which we predict the drug-like ADME properties of these molecules. Based on the results of the evaluation of the ADME properties of each molecule, we will select only the molecules that have drug-like properties.

The first aim of this study is to predict the drug-like properties of the seven molecules (5, 9, 14, 16, 27, 40, 46) in order to describe the biological activity of these compounds and to investigate their beneficial or toxic effects on the organism if they are used in pharmaceutical applications. The second aim is to identify molecules with drug-like properties in order to study their docking with the c-Met receptor. In Table 10 , we present the evaluation results of the drug-like properties obtained by using the SwissADME online server.

The results presented in Table 10 indicate that the compounds (5, 16, 40 and 46) meet all the rules of Lipinski, Veber and Egan, which indicates the absence of oral bioavailability problems for these compounds. Where the compounds 5, 16 and 46 demonstrated a high absorption capacity. In contrast compound 40 showed a low absorption capacity. While it is noted that compounds 9, 14 and 27 showed a deviation on the high lipid affinity level (LogP >5) in Lipinski, Veber and Egan rules. This deviation results in higher metabolic turnover, lower solubility and poor oral absorption. Therefore, we will not consider the compounds (9, 14 and 27) as candidates for pharmacological use. Based on these results, we can select the compounds (5, 16 and 46) that do not cause any oral bioavailability problems and have drug-like properties. However, for the TPSA when it is less than 140 Å 2 and the number of rotatable bonds is less than 10, the compounds becomes more flexible and is more able to interact with the target receptor [69] . We note from Table 10 that the TPSA values for all seven compounds are less than 140 Å 2 and also have the n-ROTB value less than 10, so all seven compounds can interact flexibly with the c-Met receptor. We also predict the synthetic accessibility of the compounds presented in Table 10 . The evaluation of the synthetic accessibility of the molecule ranges from 1 (easy to synthesize) to 10 (very difficult to synthesize) [70] . We find that the S.A values for the three selected drug-like molecules (5, 16 and 46) are between 3.46 and 4.32, which means that the S.A values are far from 10 and close to 1, therefore the possibility to synthesize these molecules is very easy. Based on these results, we select the three compounds (5, 16 and 46) as the most potent candidates to inhibit the c-Met receptor and the best in terms of bioavailability in the human body, also in terms of the flexibility of these molecules when interacting with the c-Met receptor kinase. Moreover, the selected molecules are easy synthesis. We illustrate in Figure 8 the 3D structures of the three selected compounds based on the predicted drug-likeness properties.

In the next step, we test the docking of the three selected molecules with the c-Met receptor kinase to determine the most stable molecule in the c-Met receptor pocket. To do this, we compare the number and types of interactions that take place between the ligands (L5, L16, L46) and the active sites of the c-Met receptor, also we compare the binding energies of the ligands with the same receptor. Then, we select the most appropriate ligand among the three ligands to replace the in- hibitor Crizotinib to treat cancer by inhibiting the enzymatic activity of c-Met.

Before docking the ligands (L5, L16 and L46) with the c-Met receptor, a first step is to identify the active sites with which the co-crystallized ligand (Crizotinib) interacted within the c-Met receptor pocket. In this step, we visualize the structure of the 2WGJ crystal complex to identify the active sites. Figure 9 shows the most important active sites of the c-Met protein to which the Crizotinib inhibitor was bound (Met1160A, Pro1158A and Tyr1230A), which were reported by 2D PoseView interaction diagrams via the ProteinsPlus online server [71] . Figure 10 shows the active sites (Tyr1159, Met1160, Ala1108, Pro1158, Met1211, Val1092, Leu1157, Ala1226, Ala1221, Tyr1230, Asp1222, Leu1140 and Ile1084) in the c-Met protein with which the Crizotinib inhibitor co-crystallized was reacted. We find these interactions by analyzing 2D and 3D visualizations of the 2WGJ complex using Discovery Studio 2016 software. The 2D and 3D visualizations in Figure 10 of the types of interactions that have occurred between the active sites in the c-Met protein and the achieved co-crystallized ligand (Crizotinib) by analyzing the complex (2WGJ) via the Discovery Studio 2016 program are much clearer than the 2D visualization in Figure 9 that is obtained by using ProteinsPlus server. From these results, we consider the following active sites (Tyr1159, Met1160, Ala1108, Pro1158, Met1211, Val1092, Leu1157, Ala1226, Ala1221, Tyr1230, Asp1222, Leu1140, and Ile1084) shown in Figure 10 as the most important sites that conduct the inhibition of c-Met enzyme activity.

In the next step, we perform a re-docking of the Crizotinib with the c-Met receptor to confirm and select the most important reference sites active in the inhibition of c-Met activity, and to verify the ability of the AutoDock (ADT) software for implementing the protocol of molecular docking. Figure 11 shows the superimposed view of the conformation between the docked ligand and the native crystalline ligand in the c-Met receptor pocket, such that the RMSD value between the ligands is 1.428 Å.

After performing the molecular re-docking, we observe in Figure 11 that there is an almost perfect superposition between the original and redocked ligand in the c-Met receptor pocket. Furthermore, the value of RMSD (1.428 Å) less than 2 Å, which indicates the efficiency of the AutoDock software (ADT) in achieving excellent molecular docking, and therefore we perform the docking test of the selected ligands (5, 16 and 46) inside the c-Met receptor pocket depending on the program Auto-Dock (ADT) based on the LGA. Also, it appears from Figure 12 that the Crizotinib ligand interacts with the active sites (Ala1108, Met1160, Pro1158, Ala1226, ALa1221, Leu1159, Leu1140, Asp1222, Tyr1230, Val1092, Met1211 and Ile1084) It can be seen from the re-docking results that the interactions of Crizotinib occurred with the same active sites identified by visualizing the docking position in the crystal complex (2WGJ) in Figure 10 .

Through the identification of the most active sites (Ala1108, Met1160, Pro1158, Ala226, ALa1221, Leu1159, Leu1140, Asp1222, Tyr1230, Val1092, Met1211 and Ile1084) involved in the inhibition of the enzymatic activity of the c-Met receptor by Crizotinib inhibitor. We will take these sites as references that explain the enzymatic activity of c-Met, and thus any interaction between the ligands (5, 16 and 46) with any of the reference sites can cause an inhibitory response of c-Met. Inhibition of c-Met results in the blockage of cancer cell growth. Also, the values of binding energy that occur between the ligands (L5, L16 and L46) will be used to determine the most stable ligand in the c-Met receptor pocket.

We present the visualizations of the interaction profiles obtained by molecular docking of the ligands 5, 16, and 46 with the c-Met receptor in Figure 13 , as well as the binding energy values that are obtained between these ligands and the c-Met receptor. Based on the molecular docking results shown in Figure 13 , the ligands (5, 16, and 46) are docked into the c-Met receptor pocket through the interactions with the following residues: Tyr 1159, Met1160, Met1211, Tyr1230, Tyr1159, Ile1084, Val1092, Ala1108, Ala1221, Gly1163 and His1162. Table 11 summarizes the results of the interactions predictions between the ligands 5, 16, 46 and L ref with active sites in the c-Met receptor as well as the binding energies of these ligands with the same receptor.

From the complex X-A we can notice that the ligand 5, binds to four reference sites important in the inhibition of the enzymatic activity c-Met kinase, these sites are the following active residues: Met1160 (3.39 Ǻ), Ile1084 (2.24 Ǻ), Met1211 (3.97 Ǻ) and Tyr1230 (4.80 Ǻ) the interaction of ligand 5 with these residues is as follows: hydrogen bonding with Ile1084 and Met1160, by Pi-Sigma interaction with Met1211, and by Tshaped Pi-Pi and Pi-alkyl interactions with Tyr1230. Ligand 5 also binds to residue Tyr1159 (2.60 Ǻ), which we did not identify as an important site in the inhibition of c-Met enzymatic activity (see Figure 12 ). We further note that the value of the binding energy of ligand 5 with the c-Met receptor is equal to -7.08 kcal/mol.

From the complex X-B in Figure 13 we can observe that the ligand 16, binds to five important sites of reference which are important in the inhibition of c-Met kinase enzyme activity, these sites are: Met1160 (2.94), Ile1084 (4.12), Ile1084 (4.5 Ǻ), Ala1108 (3.63 Ǻ) and Val1092 (4.13 Ǻ), Val1092 (5.23 Ǻ) and Tyr1230 (5.05 Ǻ), Tyr1230 (5.35 Ǻ). The binding of ligand 16 to these residues is performed as follows: a hydrogen bonding interaction with Met1160, the interactions Pi-Pi Stacked and Pi-T-Shaped with Tyr1230, and the reactions Pi-alkyl with Ile1084, Ala1108 and Val1092. Ligand 16 also binds to residues Gly1163 (3.28 Ǻ) and Tyr1159 (4.01 Ǻ), we did not report these amino acids as important sites in the inhibition of c-Met enzyme activity, as previously shown in We also find that the binding energy of the ligand (L16) with the c-Met receptor (-8.27 kcal/ mol) is less than the binding energies of the ligands L5 (-7.08 kcal/mol) and L46 (-7.4 kcal/mol) with c-Met, which means that the ligand L16 is more stable and more localized in the pocket of the c-Met receptor than the ligands L5 and L46. Thus, the X-B complex shows better stability than the complexes X-A and X-C. Based on the comparison of the molecular docking results, the number of important sites with which the ligand L16 is bound, as well as its most stable binding energy permits to select the compound L16 as the best candidate for the inhibition of the enzymatic activity of the c-Met receptor among the three ligands (L5, 16 and L46). Therefore, based on these results, we can explain the difference in the inhibition mode of c-Met enzymatic activity by the molecules (5, 16 and 46) due to the placement of the root and the orientation of the structure to identified active sites as important references in c-Met inhibition. Moreover, we can conclude that the location and the characteristics of the roots in the structure of the compounds (5, 16 and 46) can influence the biological activity pIC 50 . This influence being reflected in the number of bonds that can be created between the compound and the active sites of the future target and also in the binding energy that is produced between the compound and the target receptor. We confirm this hypothesis by the 3D visualization of the complex (X-B). In Figure 14 , we notice that the phenyl ring level in ligand 16 is bound to the active site (5.35 Ǻ) Tyr1230 with a Pi-Pi Stacked interaction, and this reaction is identical to the interaction of the phenyl ring in the Crizontib ligand with the same active site Tyr1230 (3.92 Ǻ) in the L ref -A complex. In addition, the reaction is carried out by hydrogen bonding between the nitrogen atom of In terms of binding energy values, the drug is most potent when it has the lowest binding energy value (more negative value) [72] . This binding results in the formation of more interactions between the receptor and the ligand due to more free energy being liberated. As result, the desired target is more accessible compared to ligands with lower negative values. By comparing the values of the binding energies of the ligands (L5, L16 and L46) with the c-Met receptor obtained by the molecular docking protocol, we can interpret the lowest experimental IC 50 value for the molecule 16 (0.26 nM) compared to the experimental IC 50 values of the molecules 5 (0.32 nM) and 46 (0.32 nM). Therefore, ligand 16 has the lowest binding energy (-8.27 Kcal/mol) compared to ligands 5 (-7.08 Kcal/mol) and 46 (-7.40 kcal/mol). Thus, the low IC 50 value of molecule 16 reflects the high activity of this molecule in the therapeutic use. Also, we note that the binding energy value of Crizotinib ligand (-7.04 Kcal/mol) is higher than the binding energy values of the ligand (5, 16 and 46), which confirms the stronger biological activity of these molecules in inhibiting the enzymatic activity of c-Met protein compared to Crizotinib molecule. Based on the achieved results of the molecular docking predictions, it is clear that the structure of ligand 16 that is interacted with the largest number of active sites important in inhibiting the enzymatic activity of c-Met. In addition, ligand 16 is well inserted into the pocket of c-Met and produced better stability binding energy compared to L5 and L46 ligands. Hence, the structure of molecule 16 can be used to improve the inhibition of the enzymatic activity of c-Met protein. In addition, the proposal of other molecules based on the structure of molecule 16 can be achieved by identifying additional information about the properties of each region of the structure of compound 16. Also, we can modify the structure of molecule 16 and evaluating the effect of these modifications on the pIC 50 values in order to design new molecules based on the performed modifications. For this purpose, it is preferable to combine the study of 2D-QSAR with molecular docking and 3D-QSAR. In a future work, we will combine these studies to design new molecules that can be added to the 4,5,6,7-Tetrahydrobenzo[D]-Thiazol-2-Yl derivatives and we will investigate their anticancer drug kinetics.

Based on the present study results, we confirm the selection of compound 16 as a novel candidate agent for pharmacological use in the treatment of cancerous tumors resulting from the activity of the c-Met kinase enzyme. In order to confirm the validity of using the molecule 16 in the treatment of cancer by inhibiting the enzymatic activity of the c-Met protein, we perform in the following section an evaluation of the pharmacokinetic parameters in silico of this molecule. Also, a comparison will be performed between the Crizotinib and compound 16 in terms of ADMET properties.

Both compound 16 and Crizotinib are undergoing ADMET in the present in silico studies by using the pkCSM tool [73] . The letter is used to predict in silico ADMET (Absorption, Distribution, Metabolism, Excretion, Toxicity) properties of the selected compound 16, as well as the properties of the Crizotinib compound that is previously used as a drug. The results of the ADMET properties prediction are computed and then presented in Table 12 .

Based on the obtained results in Table 12 , we can conclude that:

-In terms of the percentage of absorption by the human intestines, a value less than 30% indicates that the absorption is low [74] . Compound 16 presented a absorption value higher than 94%, which guarantees a good absorption by the human intestine best than the Crizotinib compound (91.38%). -In terms of distribution indicators, the size of the distribution (VDss)

is considered high if its value is greater than 0.45 [75] . The standard value for blood-brain barrier (BBB) permeability is good if its value is more than 0.3 and poor if LogBB < -1 [76] . For the CNS index, compounds with LogPS > -2 are considered capable of penetrating the CNS, whereas compounds with LogPS < -3 are considered incapable of penetrating the CNS [77] . The distribution indices reported by molecule 16 indicated a better distribution capacity than Crizotinib. -In terms of metabolism, cytochrome P450 (CYP) is an important enzyme for detoxification. CYP enzymes are present in all tissues of the body [70] . This enzyme oxidizes foreign microorganisms to facilitate their excretion. Many drugs are inhibited by cytochrome CYP, and some can also be activated by it. Inhibitors of this enzyme may affect the metabolism of the drug, and the drug may have a reverse effect [78] . Therefore, it is indispensable to evaluate the ability of compounds to inhibit cytochromes (CYP). Up to now, 17 categories of CYPs have been identified in humans. Although only CYP1, CYP2, CYP3 and CYP4 are responsible in the metabolism of drugs, Thus, only the types (1A2, 2C9, 2C19, 2D6 and 3A4) are responsible for biotransformation for more than 90% of drugs pass the first step of metabolism [79] . The two isoforms 2D6 and 3A4 are mainly responsible for drug metabolism [80] . A study performed by A. Puccini et al. [81] indicates that the metabolism of Crizotinib in the liver is affected by an increase in CYP3A4 enzyme activity, which inhibited Crizotinib activity. For this reason, we are relying on the evaluation the effect of the compound 16 on the CYP3A4 enzyme (inhibitor or substrate) in order to predict the metabolic effect of CYP3A4 on the activity of the compound 16 that is proposed to use as drug. From the obtained results of the molecule 16 properties metabolism, we can see that this molecule can be a CYP3A4 substrate as well as an inhibitor of CYP3A4. This indicates that the metabolism of the compound 16 as drug is acceptable, and therefore molecule 16 can successfully reach the therapeutic target before being oxidized and excreted. -The drug clearance index in excretion properties is important for determining drug dosage ratios to achieve stable drug concentrations [75] . Where clearance is done in the liver and excretion in the kidneys. Therefore, lower value of the clearance index indicates that the higher the persistence of drugs in the body. We evaluate the excretion property in this study to determine the level of stability of molecule 16 as a drug in the body before its excretion. The predictive values of this index showed that the total clearance index of molecule 16 is 0.113 that is lower than the total clearance index of crizotinib (0.583), and therefore molecule 16 may persist in the body better than Crizotinib, it can be explained that the stability of molecule 16 for a longer period in the body compared to Crizotinib led to an increase in the activity of molecule 16 in the inhibition of the enzyme c-Met at a dose lower than the dose used by the Crizotinib molecule. -In terms of the toxicity indicator, it is necessary to check whether the predicted compounds are non-toxic. The letter indicator is important in the selection of drugs. The AMES test is widely used to evaluate the toxicity of compounds [82] . Therefore, in this work, we evaluate the toxicity of the molecules (16 and Crizotinib) based on AMES test predictions. According to the study presented in [75] , all compounds in the database are toxic, although we fortunately found in the in silico evaluation of the toxicity characteristic of molecule 16 that this molecule is not toxic.

Based on the obtained results in silico ADMET properties evaluation for both compounds 16 and Crizotinib, we find that compound 16 meets all the pharmacokinetic conditions that are evaluated in this study. Therefore, molecule 16 can be used in the future as a drug to treat cancer by inhibiting the enzymatic activity of the c-Met protein. Molecule 16 can also be used in the design of new compounds with stronger biological activities, with other properties and new uses.

This study contributed to the development of mathematical models that were able to determine the quantitative relationship between the biological anticancer activity and the molecular structure of a series of 4,5,6,7-tetrahydrobenzo[D]-thiazol-2-Yl derivatives. Also in this work, the molecular properties necessary for robust activity to inhibit the C-Met receptor tyrosine kinase and thus achieve an anti-cancer response were detected. The resulting QSAR models were analyzed and validated for statistical significance and predictive power through internal and external validations, as well as via a Y-randomization test and domain of applicability. The predictive ability of QSAR models that is obtained by three analysis methods (MLR, MNLR, ANN) has been shown that the proposed models are very strong for all these methods. The analysis of the developed QSAR model equations showed that the following seven important descriptors influence the biological inhibitory activity of the c-Met receptor: LogP, polarization, Parachor, molecular weight, molecular volume and chemical hardness. The descriptors identified in this work by QSAR models can be used effectively to predict the anticancer activity values of new compounds that can be designed based on the structure of 4,5,6,7-Tetrahydrobenzo[D]-Thiazol-2-Yl derivatives. This will allow to significantly reduce the drug development process and the cost of synthesis at the pharmaceutical chemistry laboratories. In this work, seven molecules with the highest c-Met inhibition activity were selected based on the predictions of the 2D-QSAR models that were developed. After performing an evaluation of the drug-likeness properties of the seven molecules, the results showed that the molecules (5, 16 and 46) had acceptable drug-likeness properties.

A molecular docking study that was performed for the molecules (5, 16 and 46) with the c-Met receptor, showed that the molecule 16 is the best candidate for the inhibition of c-Met carcinogenic activity. Where the molecule 16 established more interactions compared to the molecules (5 and 46) with the reference active sites in the c-Met receptor. Furthermore, molecular docking results showed that the binding energy of molecule 16 in the c-Met receptor pocket is higher than the binding energies of molecules 5 and 46 with the same receptor, indicating that molecule 16 is well stable with the c-Met receptor. In addition, the pharmacokinetic evaluation of ADMET properties in silico between molecule 16 and Crizotinib indicates that molecule 16 has better pharmacokinetic properties than Crizotinib. This could explain the higher biological activity observed for molecule 16 (pIC 50 ¼ 9.59) compared to the biological activity observed for Crizotinib (pIC 50 In future work, we will develop 3D-QSAR models based on the series of 4,5,6,7-Tetrahydrobenzo [D]-Thiazol-2-Yl derivatives, and we will adopt molecule 16 as a reference molecule in the design of novel inhibitors of c-Met enzymatic activity in order to propose new derivatives of 4,5,6,7-Tetrahydrobenzo [D]-Thiazol-2-Yl structure and also to predict its biological activities against the growth of non-small cell lung cancer cells as a model of cancer treatment.

Synthesis and evaluation of bis-thiazole derivatives as new anticancer agents

Cameroonian medicinal plants as potential candidates of SARS-CoV-2 inhibitors

Synthesis and biological evaluation of thiazolo-triazole derivatives

Synthesis and anticonvulsant activity of some 1,2,3,3a-tetrahydropyrrolo[2,1-b]benzothiazol-1-ones and pyrrolo[2,1-b]thiazole analogues

Synthesis and insecticidal activity of N-substituted (1,3-thiazole)alkyl sulfoximine derivatives

Design and synthesis of azolopyrimidoquinolines, pyrimidoquinazolines as antioxidant, anti-inflammatory and analgesic activities

Structural characterization and QSAR modeling of 1,2,4-triazole derivatives as α-glucosidase inhibitors

Synthesis and biological evaluation of a series of novel inhibitor of Nek2/Hec1 analogues

Discovery and optimization of 2-(4-substituted-pyrrolo [2, 3-b] pyridin-3-yl) methylene-4-hydroxybenzofuran-3 (2H)-ones as potent and selective ATP-competitive inhibitors of the mammalian target of rapamycin (mTOR)

Synthesis of some new pyrazole-based 1,3-thiazoles and 1,3,4-thiadiazoles as anticancer agents

C-MET as a new therapeutic target for the development of novel anticancer drugs

MET tyrosine kinase inhibitor crizotinib (PF-02341066) shows differential antitumor effects in non-small cell lung cancer according to MET alterations

Structure based drug design of crizotinib (PF-02341066), a potent and selective dual inhibitor of mesenchymal-epithelial transition factor (c-MET) kinase and anaplastic lymphoma kinase (ALK)

Crizotinib, a small-molecule dual inhibitor of the c-Met and ALK receptor tyrosine kinases

Medicinal chemistry and the molecular operating environment (MOE): application of QSAR and molecular docking to drug discovery

Synthesis and biological evaluation of novel 4, 5, 6, 7-tetrahydrobenzo [D]-Thiazol-2-Yl derivatives derived from dimedone with anti-tumor, c-met, tyrosine kinase and Pim-1 inhibitions

Best practices for QSAR model development, validation, and exploitation

On a simple approach for determining applicability domain of QSAR models

Predicting ADME properties in silico: methods and models

Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings

Molecular properties that influence the oral bioavailability of drug candidates

Prediction of drug absorption using multivariate statistics

ChemDraw and Chem3D

Structure Drawing Software for Academic and Personal Use, ACD/ChemSketch

Expanding the limits of computational chemistry

Conformational analysis. 130. MM2. A hydrocarbon force field utilizing V1 and V2 torsional terms

QSAR study of anti-Human African Trypanosomiasis activity for 2-phenylimidazopyridines derivatives using DFT and Lipinski's descriptors

Principal component analysis

Principal component analysis: a method for determining the essential dynamics of proteins

QSAR modelling of rat acute toxicity on the basis of PASS prediction

On selection of training and test sets for the development of predictive QSAR models

Linear QSAR regression models for the prediction of bioconcentration factors by physicochemical properties and structural theoretical molecular descriptors

The use of artificial neural networks in QSAR

XLSTAT version 2019.1, XLSTAT, Your data analysis solution

Best software & apps

Validation of QSAR models-strategies and importance

Basic validation procedures for regression models in QSAR and QSPR studies: theory and application

Understanding the Basics of QSAR for Applications in Pharmaceutical Sciences and Risk Assessment

Investigation of antileishmanial activities of acridines derivatives against promastigotes and amastigotes form of parasites using quantitative structure activity relationship analysis

QSPR studies of 9-aniliioacridine derivatives for their DNA drug binding properties based on density functional theory using statistical methods: model, validation and influencing factors

Gonz alez-Díaz, Multi-target spectral moment QSAR versus ANN for antiparasitic drugs against different parasite species

Kolmogorov's theorem and multilayer neural networks

Application of neural networks: quantitative structure-activity relationships of the derivatives of 2, 4-diamino-5-(substitutedbenzyl) pyrimidines as DHFR inhibitors

Applications of Neural Networks in Quantitative Structure-Activity Relationships of Dihydrofolate Reductase Inhibitors

Beware of q2!

QSAR study of unsymmetrical aromatic disulfides as potent avian SARS-CoV main protease inhibitors using quantum chemical descriptors and statistical methods

Y-randomization and its variants in QSPR/ QSAR

On various metrics used for validation of predictive QSAR models with applications in virtual screening and focused library design

Current status of methods for defining the applicability domain of (quantitative) structure-activity relationships: the report and recommendations of ecvam workshop 52

Principles of QSAR models validation: internal and external

Methods for reliability and uncertainty assessment and for applicability evaluations of classification-and regression-based QSARs

A statistically defined endpoint titer determination method for immunoassays

Structure-based virtual screening of influenza virus RNA polymerase inhibitors from natural compounds: molecular dynamics simulation and MM-GBSA calculation

QSAR study of N-substituted oseltamivir derivatives as potent avian influenza virus H5N1 inhibitors using quantum chemical descriptors and statistical methods

SwissADME: a free web tool to evaluate pharmacokinetics, drug-likeness and medicinal chemistry friendliness of small molecules

pkCSM: predicting small-molecule pharmacokinetic and toxicity properties using graph-based signatures

Discovery of potent SARS-CoV-2 inhibitors from approved antiviral drugs via docking and virtual screening

Cytoreductive antitumor activity of PF-2341066, a novel inhibitor of anaplastic lymphoma kinase and c-Met, in experimental models of anaplastic large-cell lymphoma

Discovery of novel dual c-Met/HDAC inhibitors as a promising strategy for cancer therapy

X-ray Structure of PF-02341066 bound to the kinase domain of c-Met

Free download: BIOVIA discovery Studio visualizer

MGLTools 1.5.6 RC3 release announcement -MGLTools

Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function

Virtual screening: an in silico tool for interlacing the chemical universe with the proteome

Nature as a treasure trove of potential anti-SARS-CoV drug leads: a structural/mechanistic rationale

Combined 3D-QSAR and molecular docking analysis of thienopyrimidine derivatives as Staphylococcus aureus inhibitors

Universit€ at hamburg -proteins plus server

Repositioning therapeutics for COVID-19: virtual screening of the potent synthetic and natural compounds as SARS-CoV-2 3CLpro inhibitors

Combined docking methods and molecular dynamics to identify effective antiviral 2, 5-diaminobenzophenonederivatives against SARS-CoV-2

Characterization of the human upper gastrointestinal contents under conditions simulating bioavailability/bioequivalence studies

pkCSM: predicting small-molecule pharmacokinetic and toxicity properties using graph-based signatures

Silibinin as potential tool against SARS-Cov-2: in silico spike receptor-binding domain and main protease molecular docking analysis, and in vitro endothelial protective effects

silico ADME and toxicity prediction of ceftazidime and its impurities

Avila-Z arraga, Synthesis, molecular docking, and in silico ADME/Tox profiling studies of new 1-aryl-5-(3-azidopropyl) indol-4-ones: potential inhibitors of SARS CoV-2 main protease

Cytochrome P450 enzymes in drug metabolism: regulation of gene expression, enzyme activities, and impact of genetic variation

Nonclinical evaluation of IQG-607, an antituberculosis candidate with potential use in combination drug therapy

Safety and tolerability of c-MET inhibitors in cancer

Differential toxicity of Disperse Red 1 and Disperse Red 13 in the Ames test, HepG2 cytotoxicity assay, and Daphnia acute toxicity test

Author contribution statement Ossama Daoui: Conceived and designed the experiments; Performed the experiments; Contributed reagents, materials, analysis tools or data; Wrote the paper.Souad Elkhattabi, Samir Chtita: Conceived and designed the experiments; Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data; Wrote the paper.Rachida Elkhalabi, Hsaine Zgou, Adil Touimi Benjelloun: Conceived and designed the experiments.

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Data will be made available on request.

The authors declare no conflict of interest.

No additional information is available for this paper.