Skip to main content

ORIGINAL RESEARCH article

Front. Pharmacol., 21 February 2023
Sec. Experimental Pharmacology and Drug Discovery
This article is part of the Research Topic Artificial Intelligence and Drug Discovery View all 13 articles

3D,2D-QSAR study and docking of novel quinazolines as potential target drugs for osteosarcoma

Zheng Lian&#x;Zheng Lian1Chenglin Sang&#x;Chenglin Sang2Nianhu LiNianhu Li3Honglin ZhaiHonglin Zhai2Wenzhe Bai
Wenzhe Bai3*
  • 1School of Clinical Medicine, Weifang Medical University, Weifang, China
  • 2Department of Orthopedics, The 960th Hospital of the Chinese People’s Liberation Army, Jinan, China
  • 3The First Clinical Medical School, Shandong University of Traditional Chinese Medicine, Jinan, China

Background: Quinazolines are an important class of benzopyrimidine heterocyclic compounds with a promising antitumor activity that can be used for the design and development of osteosarcoma target compounds.

Objective: To predict the compound activity of quinazoline compounds by constructing 2D- and 3D-QSAR models, and to design new compounds according to the main influencing factors of compound activity in the two models.

Methods: First, heuristic method and GEP (gene expression programming) algorithm were used to construct linear and non-linear 2D-QSAR models. Then a 3D-QSAR model was constructed using CoMSIA method in SYBYL software package. Finally, new compounds were designed according to molecular descriptors of 2D-QSAR model and contour maps of 3D-QSAR model. Several compounds with optimal activity were used for docking experiments with osteosarcoma related targets (FGFR4).

Results: The non-linear model constructed by GEP algorithm was more stable and predictive than the linear model constructed by heuristic method. A 3D-QSAR model with high Q2 (0.63) and R2 (0.987) values and low error values (0.05) was obtained in this study. The success of the model fully passed the external validation formula, proving that the model is very stable and has strong predictive power. 200 quinazoline derivatives were designed according to molecular descriptors and contour maps, and docking experiments were carried out for the most active compounds. Compound 19g.10 has the best compound activity with good target binding capability.

Conclusion: To sum up, the two novel QSAR models constructed were very reliable. The combination of descriptors in 2D-QSAR with COMSIA contour maps provides new design ideas for future compound design in osteosarcoma.

1 Introduction

Osteosarcoma is a malignant tumor characterized by proliferating tumor cells directly producing bone or osteoid tissue, also known as osteosarcoma, derived from mesenchymal tissue (Ottaviani and Jaffe, 2009). Osteosarcoma is the most common primary malignant bone tumor with a high degree of malignancy, rapid growth and early metastasis. Early diagnosis of the disease is difficult and the prognosis is poor with an incidence of about 4–5/106 (Ottaviani and Jaffe, 2009; Moore and Luu, 2014). The average median age of diagnosis was 15 years, and the most common occurrence was between 15 and 20 years old, with 60% occurring below 25 years of age. Conventional osteosarcomas (classic) originate from the bone marrow and account for approximately 80% of all types of osteosarcomas, classified as osteogenic (50%), chondrogenic (25%), and fibrogenic (25%). Other rare osteosarcoma subtypes include capillary dilatation, small cell, parabone, periosteal, highly malignant surface osteosarcoma, low malignant central osteosarcoma, multi-center osteosarcoma and secondary osteosarcoma (Paget’s disease), etc. (Geller and Gorlick, 2010). Traditional osteosarcomas tend to occur in the long bones of the extremities, most commonly around the knee joint (distal femur, proximal tibia) (Sanerkin, 1980; Botter et al., 2014), with approximately 91% occurring in the metaphysis and 9% in the diaphysis. Atypical osteosarcoma can still invade non-long bones (skull, pelvis, mandible, vertebrae), and its incidence increases progressively with age. Common initial symptoms of osteosarcoma are pain and swelling, localized painful masses and inflammatory reactions, which may be followed by varying degrees of joint motion limitation and pathologic fractures.

At present, the treatment mode for osteosarcoma is preoperative neoadjuvant chemotherapy + surgical resection + postoperative adjuvant chemotherapy (Bishop et al., 2016). The main first-line chemotherapy compounds for osteosarcoma are methotrexate (MTX), doxorubicin (ADM), cisplatin (DDP), ifosfamide (IFO), vincritin (VCR), epirubicin (EPI), cyclo-phosphamide (CTX) and etoposide (VP-16), etc., in which the MTX, ADM, DDP and IFO are the most commonly used (Ta et al., 2009). However, in the current clinical work on osteosarcoma chemotherapy, these compounds were found to have several serious side effects, such as kidney damage caused by methotrexate and cisplatin, cardiac inhibition by doxorubicin, and resistance to chemotherapy compounds. Therefore, there is an urgent need to develop and design new and more effective compounds for the treatment of osteosarcoma.

Fibroblast growth factor receptor 4 (FGFR4) is a tyrosine kinase receptor that selectively binds to fibroblast growth factor 19 (FGF19). FGF19 binds to FGFR4 and its co-receptor b-Klotho, leading to dimerization and autophosphorylation of FGFR4. Activated FGFR4 interacts with fibroblast growth factor receptor substrate 2 (FRS2), recruits growth factor receptor binding protein (GRB2) and affects downstream proteins mediating osteosarcoma cell proliferation (Vainikka et al., 1994; Ho et al., 2009; Wu et al., 2011). Quinazoline derivatives exert an inhibitory effect on osteosarcoma growth by inhibiting the phosphorylation and signaling pathways of FGFR4 (Querolle et al., 2015; Nandi and Bagchi, 2016; Voskoboynik et al., 2016).

QSAR (Quantitative structure-activity relationship) is a compound research method that uses mathematical models to describe the relationship between the structure of a molecule and certain biological activities of the molecule. This method has been widely used for compound activity prediction and the development of new compounds (Dearden, 2017). Under the guidance of this method, people have successfully designed quinolones, such as norfloxacin. At present, the research methods of QSAR are mainly divided into 2D-QSAR and 3D-QSAR (Roy et al., 2015). Since 2D-QSAR research cannot accurately describe the relationship between molecular 3D structure and physiological activity, 3D-QSAR is more accepted by the scientific community in later compound design research. However, the effect of molecular descriptors in the 2D structure-activity relationships on the production of new compounds is often ignored when using a 3D method to design new compounds, which leads to unsatisfactory activity results of newly designed compounds. The main objective of this experiment was our desire to find a method that combines 2D-QSAR and 3D-QSAR to design more reliable compounds targeting osteosarcoma fibroblast growth factor receptor 4 (FGFR4).

2 Experiment

2.1 Data set

The data for this experiment included a total of 37 quinazoline derivatives, and all compounds were obtained from the references (Pan et al., 2021). The structures and activity values of all compounds are shown in Table 1.

TABLE 1
www.frontiersin.org

TABLE 1. Structure and activity values of 37 compounds.

2.2 2D-QSAR research

2.2.1 2D-QSAR data processing

In machine learning, to evaluate the discrimination, stability, robustness and other model effects of supervised algorithms, and to avoid over-fitting of data, it is necessary to divide the data set into a training set and a test set in the ratio of 4:1 (Mitchell et al., 1990).

In this experiment, system time was used as a random seed to split the dataset into a training set containing 29 compounds and a training set containing 8 compounds in a ratio of 4:1. The training set was used to train supervised models, fit models, adjust parameters, select modal variables, and make other choices for the algorithms. The test set is used to evaluate the effectiveness of the training model without changing the parameters and effects of the model. Usually, the decision to retrain the model or to choose another algorithm is based on whether the validated model is overfitted or underfitted.

2.2.2 Calculation of the descriptions

In 2D-QSAR models, compounds are typically represented by molecular descriptors that can be statistically correlated with biological or even physicochemical properties. The molecular descriptors are calculated as follows: first, the molecular structures of 37 compounds were drawn in ChemDraw software. Then, to obtain stable molecular structures with the lowest energy, the molecular structures were optimized in HyperChem 7.5 software (Stewart, 1989) using successive MM + molecular mechanics force fields and the semi-empirical PM3 method (Ivanciuc, 1996), and then the optimized molecules were put into MOPAC6.0 software. Finally, the results were imported into CODESSA software (Katritzky et al., 2001) to calculate five classes of molecular descriptors: constitutional, geometrical, topological, electrostatic and quantum-chemical.

2.2.3 Linear modeling (Cao and Lin, 2003)

Heuristic Method (HM) is the method of descriptor screening in the CODESSA software, the greatest advantage of which is that it allows a complete search for a large number of molecular descriptors without checking all possible combinations of parameters. Meanwhile, the method allows to build the best linear regression equation. The steps of HM are as follows.

a. Selection of 1-parameter descriptor. The square of correlation coefficient (R2), F-test and t-test are used as selection criteria, and descriptors with low correlation with properties (activity) are removed. The descriptors without significant changes and those with high correlation are removed.

b. Selection of 2-Parameter Descriptor. R2 and F- tests are the criteria for analysis and selection.

c. Selection of n-parameter descriptors. After obtaining the two-parameter correlation coefficients with optimal statistical characteristics, descriptors not used in the previous selection process are added to establish a new correlation equation. The new correlation coefficient is verified by R2, F-test and standard deviation (S). Until the established correlation equation contains the maximum number of parameters. The process of adding descriptors starts with the correlation equation with the maximum fit-ness function value, which is defined as follows:

w=R2*F*n/N*S2

where R2 donates the square of the correlation coefficient, F is F-test value; n represents the number of samples, N is the number of descriptors and S is the standard deviation.

d. Output. Every time a descriptor is added, the optimal 10 correlation results are displayed, and every iteration starts with the results with the best correlation. The correlation between descriptors and the square of the cross-validation coefficient (R2CV) should be calculated during each cycle.

2.2.4 Establishment of non-linear model

GEP (Gene expression programming) is a new genetic algorithm that combines the advantages of genetic programming and genetic algorithm to solve complex problems with simple codes (Holland, 2005). According to the gene expression law of biological inheritance, GEP adopts equal-length linear symbols as the genetic code, and the individual phenotype as the expression tree. After a large number of operations, the algorithm can find the optimal solution. The process of GEP algorithm is described in detail as follows:

First, a certain number of chromosomes were randomly created as the initial population and then all chromosomes were translated into corresponding expression trees (ETs). Next, the fitness of each chromosome was measured according to a predefined fitness function to determine whether the fitness satisfies the termination criterion (a solution of the desired quality was found or a certain number of iterations had been completed). If the termination criteria were not met, the appropriate individuals were retained by an elitist roulette selection method. The selected individuals underwent genetic manipulation to form new individuals based on a certain probability, including variation, recombination and transposition. Finally, a new generation was created. Moreover, the chromosome of an individual consisted of one or more genes, represented by a fixed-size linear symbol string. The GEP gene consisted of two parts, which contain multiple gene element bits. The values of the gene element bits were taken from the set of terminal T and the set of function F (Teodorescu and Sherwood, 2008; Gharagheizi et al., 2012; Pham and Karaboga, 2012; Kaydani et al., 2014). Figure 1 summarizes the above operation process.

FIGURE 1
www.frontiersin.org

FIGURE 1. Flowchart of GEP algorithm.

By comparing the linear model and non-linear model in 2D-QSAR model, it is found that the non-linear model generated by GEP algorithm is more reliable and has stronger prediction power than the linear model generated by HM method. However, the 2D-QSAR model cannot accurately describe the relationship between molecular 3D structure and physiological activity, so it is necessary to continue the 3D-QSAR experiment.

2.3 3D-QSAR research

2.3.1 Data processing and structure optimization

In 3D-QSAR experiments, the IC50 values of all compounds need to be first converted to ′-log (IC50) +9′, which makes the data more stable and reduces the error caused by the original values in the experiment. Similar to 2D-QSAR experiment, the 3D-QSAR experiment also required randomly dividing the 37 quinazoline derivatives into training and test sets.

In the previous experiment, ChemDraw software was used to construct all 37 compounds, and in the 3D-QSAR experiment, these 37 compounds were put into SYBYL software for optimization and modeling. When processing data in SYBYL software, Tripos force field and Powell gradient algorithm were used to minimize COMSIA structure energy. Finally, the minimal structure was used as the initial conformation (Yu et al., 2015).

2.3.2 Conformational sampling and alignment

In 3D-QSAR experiments, the structure alignment of compounds will directly affect the establishment of subsequent 3D models, so it is very important to select appropriate structure alignment methods for compounds (Li et al., 2005; Patel et al., 2008; Ai et al., 2011). Ligand-based alignment was used in this experiment (Figure 2) since compound 19g was the compound with the best compound activity in the dataset, all compounds were aligned with compound 19g as a template.

FIGURE 2
www.frontiersin.org

FIGURE 2. Compound 19g serves as a template for the alignment of all compounds. (A) Common alignment structures in compound 19g (shown in bold), and (B) all compounds are aligned with 19g as a template.

2.3.3 COMSIA study

In 1994, Klebe et al. H1 proposed the method of Comparative Molecular Similarity Index Analysis (CoMSIA), an extension of CoMFA. Both methods share the same principles and are based on the following assumptions when the bond affinity of a molecule changes, its corresponding molecular properties also change, which are expressed in the form of molecular fields (Yu et al., 2015).

In CoMSIA, the use of a distance-dependent Gaussian functional form for the calculation of various molecular fields effectively avoids significant changes in potential energy and anomalous positions of atoms at lattice points near the molecular surface. In addition, there is no need to define cut-off values for the energy in CoMSIA. Compared to CoMFA, CoMSIA corresponds to a significantly improved contour map of the contribution of different molecular fields in space, which allows a more intuitive interpretation of the effect of different molecular fields on molecular activity (Yang et al., 2011a; Li et al., 2012; Hadni and Elhallaoui, 2020).

For CoMSIA analysis, a grid with side lengths of 2 was first generated in all the assembled regions of the superimposed molecules, and the boundaries of 4Å were used to determine the regions of all the superimposed molecules. The default probe provided by CoMSIA was used to calculate the steric field, electrostatic field, hydrophobic field and hydrogen bond field (including hydrogen bond acceptor and hydrogen bond donor) at each grid point. After obtaining the molecular field for each grid, Partial Least Square (PLS) method was used to establish the quantitative correlation model between molecular field parameters and affinity. Finally, the Leave One Out (LOO) method and Cross Validation (CV) method were used to test the statistical significance of the model and determine the number of principal components of the model. The number of principal components determined by the optimal interaction verification value was used to establish the 3D-QSAR model without interaction verification. The affinity of the compounds in the test set was predicted (Yan et al., 2020).

2.3.4 Validation of 3D-QSAR model

To demonstrate the stability of the QSAR model, the 3D-QSAR model needs to be evaluated using internal or external validation methods (Yan et al., 2020). In this experiment, external validation was selected to verify the 3D-QSAR model. The verification formula is as follows:

Rext2=1i=1ntestyiyi2i=1ntestyiytr2

where ntest refers to the number of compounds in the test set, ytr refers to the average value of compound activity in the training set, yi and yi, refer to the experimental value and predicted value of compound activity in the test set respectively. Generally, with Rext2 >0.5, the established model is considered to be robust and has good predictive ability in statistics (Yang et al., 2011b; Awasthi et al., 2018).

2.3.5 Molecular docking experiment

After introducing the compound molecules into the Sybyl software, the molecular mechanics of the ligand small molecules were optimized using the Cong-Grad gradient method. A Tripos force field was used in the optimization process with energy convergence to 0.01 kcal/(mol-Å) and a maximum number of 106 iterations. After the optimization of molecular mechanics, the conformation with the lowest energy was selected for further molecular docking studies. For the FGFR4 receptor (crystal structure of the protein from the RCSB Protein Data Bank, PDB ID:4xcu), crystal water molecules and hydrogenated atoms were removed and the original ligands in the protein were extracted to identify their binding sites for subsequent molecular docking.

Flexible docking between small molecule ligands and receptors was performed using Sybyl software. The ligand binding sites were used to generate target active pockets. Docking was subsequently performed using the Sybyl-Dock standard model with a threshold parameter of 0.5 and an expansion factor of 1. Molecular conformational changes were retained by 20. Total-Score function of the Sybyl-Dock module was used to score the interaction between the small molecule and the target, taking into account the effects of polarity, hydrophobicity and enthalpy. The higher the value, the better the interaction between the compound molecule and the protein crystal.

3 Results and discussion

3.1 HM

In this experiment, 593 descriptors for 37 compounds were calculated using CODESSA software. Also, to obtain a set of descriptors most relevant to the activity of osteosarcoma inhibitors, eight linear regression QSAR models were developed using the HM method with the number of molecular descriptors ranging from 1 to 8, where the effects of different numbers of descriptors on R2, R2cv and S2 is shown in Figure 3. The results show that the number of descriptors is proportional to the values of R2 and R2cv, and inversely proportional to S2. The model with three descriptors was selected as the best linear model for predicting the activity of osteosarcoma receptor inhibitors. Table 2 shows the details of these descriptors. In addition, to ensure that there is no multicollinearity between the molecular descriptors calculated in this experiment, the correlation coefficients between descriptors were calculated by CODESSA software, as shown in Table 3. The correlation coefficient between any two descriptors is less than 0.8, indicating that all descriptors are independent. Therefore, the constructed linear model has strong statistical reliability. The HM model is illustrated in Figure 4.

FIGURE 3
www.frontiersin.org

FIGURE 3. The effects of different numbers of descriptors on R2, R2cv and S2.

TABLE 2
www.frontiersin.org

TABLE 2. Details of the three selected descriptors.

TABLE 3
www.frontiersin.org

TABLE 3. Correlation coefficient between three descriptors.

FIGURE 4
www.frontiersin.org

FIGURE 4. Plots of measured and calculated log(IC50) by HM.

The linear model formula is as follows:

LogIC50=1.5952*103* AvgERC1.1630*102* HDSA2/TMSAQC+2.5642*104* MinNRO7.5167*101

From the coefficients of the three descriptors in the above formula, it can be seen that the absolute value of the coefficient of the descriptor “Min-NRO” is the largest, so the descriptor “Min-NRO” has the greatest influence on the linear model of this experiment.

3.2 GEP

The construction of non-linear model in this experiment is mainly realized by the GEP algorithm in APS software. All the functions used in the operation of GEP algorithm are shown in Table 4. Finally, a satisfactory non-linear model was obtained when GEP was used for 220 generations. The correlation coefficients of the training and testing sets of the non-linear model were 0.89 and 0.86, respectively, with average errors of 0.02 and 0.04.

TABLE 4
www.frontiersin.org

TABLE 4. All operation functions of GEP algorithm.

In addition, the non-linear model equation obtained by GEP algorithm (converted by C language) is as follows:

LogIC50=14y+2x+1+cosy/3xcosx+coscosx*x/y/z*cos2y+y/3xcos1

where x, y and z represent the descriptors Avg-ERC, HDSA-2/TMSA(QC) and Min-NRO, respectively.

In order to verify the predictive ability of the non-linear model, the formula was used to predict the IC50 value of all compounds. The specific results are shown in Figure 5, indicating that the predictive ability of the non-linear model is extremely reliable.

FIGURE 5
www.frontiersin.org

FIGURE 5. Experimental and predicted values for non-linear models. All IC50 values are converted to log(IC50).

3.3 COMSIA model results

In the 3D-QSAR experiments, eight groups of COMSIA models were obtained by permuting and combining the steric field (S), electrostatic field (E), hydrophobic (H), hydrogen bond donor field (D) and hydrogen bond acceptor field (A). Among them, the model composed of SEHDA has the best statistical results, and the details are shown in Table 5.

TABLE 5
www.frontiersin.org

TABLE 5. Statistical results of the optimal CoMSIA model.

3.4 External verification results

In order to prove the stability of the 3D-QSAR model constructed in this experiment, the external verification formula was used to verify the model with satisfactory results. The value of Rext2 obtained by external verification formula was 0.997, greater than 0.5, indicating that the CoMSIA model constructed in this experiment has a strong predictive ability. At the same time, in order to further demonstrate the predictive ability of the model, all compounds were put into the model to predict their compound activity, which can be found to be extremely reliable (Figure 6).

FIGURE 6
www.frontiersin.org

FIGURE 6. Experimental and predicted values of CoMSIA model. All compounds IC50 were converted to “−log(IC50)+9.”

3.5 COMSIA contour maps

One of the advantages of CoMSIA is the ability to isolate and observe the effects of various physical and chemical properties on biological activity through 3D correlation contour maps. This contour map helps to identify the important regions of the molecular field that affect the biological activity, and also to mark the molecular field features that contribute significantly to the active site of ligands and receptors (Chen et al., 2021; Wang et al., 2022).

In this experiment, the steric, electrostatic, hydrophobic, hydrogen bond donor and acceptor fields were constructed according to the compound 19g with the best compound activity, among which the electrostatic field had the highest contribution. In other words, the influence of electrostatic field on the compound activity should be taken into consideration in subsequent compound design experiments. Figure 7 shows the five contour maps.

FIGURE 7
www.frontiersin.org

FIGURE 7. Contour map of optimal compound 19g. (A) In the steric field, green represents favorable and yellow represents unfavorable. (B) In the electrostatic field, blue represents a positive electric field and red represents a negative electric field. (C) In the hydrophobic field, yellow represents favorable and white represents unfavorable. (D) Favorable (cyan) and unfavorable (purple) hydrogen bond donor fields. (E) Favorable (magenta) and unfavorable (red) hydrogen bond acceptor fields.

3.6 Design new compounds and predict activity

From the results of 2D-QSAR experiments, we know that the descriptor ‘Min nucleoph React index for an O Atom' has the greatest impact on the compound activity of compounds. Fukui atomic nucleophilic reaction index formula (Franke, 1984) has a very detailed explanation for this kind of descriptors, as expressed as follows:

NA=iACiHOMO2

where CiHOMO2 stands for highest occupied molecular orbital MO coefficients, NA stands for minimum nucleophilic reaction index. Through an in-depth study of this formula, it is easy to conclude that the regression coefficient of “Min nucleophilic reaction index of O atom” is positive, and the more negatively charged oxygen atoms in the molecule, the greater the nucleophilic reaction index. The larger the nucleophilic reaction index, the larger the IC50 value of the compound, and the lower the compound activity. According to this conclusion, the activity of a compound can be improved by increasing the valence of the oxygen atom or decreasing the number of oxygen atoms in the compound.

Electrostatic field contributed the most in the five CoMSIA contour maps, so electrostatic field is the main factor to be considered in compound design. Of course, there are other force fields that need to be considered. The biggest innovation of this study is the design of new compounds by combining the descriptor “minimum nuclear reaction index of O atoms” and CoMSIA contour maps. The specific combination method is to reduce the number of molecular oxygen atoms or increase the valence of oxygen atoms in the favorable region of contour map according to Fukui nucleophilic reaction formula. Finally, 200 new quinazolines were designed. Table 6 shows the ten compounds with the highest compound activity. Compound 19g.10 has the highest value of antitumor activity among these compounds and can be considered as a potential chemotherapeutic agent for the treatment of osteosarcoma. However, in order to prove the target binding capability of the newly designed compounds with osteosarcoma targets, it is necessary to continue with small molecule docking experiments.

TABLE 6
www.frontiersin.org

TABLE 6. The compounds of newly designed and their predicted values.

3.7 Molecular docking results

In docking experiments, compound 19g and the new compound 19g.10 were used as ligands for docking with the osteosarcoma-related target FGFR4, and Figure 8 represents the docking results for both compounds (yellow dashed lines represent hydrogen bonds).

FIGURE 8
www.frontiersin.org

FIGURE 8. Docking assay of compounds 19g, and 19g.10 with osteosarcoma related target (FGFR4, PDB ID: 4XCU).

From the docking conformation of compound 19g, the oxygen atom (located in the structure 2,4-Dichloro-1,5-dimethoxy-3-methylbenzene) formed a hydrogen bond with the protein residue ASP630, and the nitrogen atom of the quinazoline core, as well as the hinge region, formed a hydrogen bond with Ala553. This is the same binding mode as that of BLU9931, a selective inhibitor of FGFR4 first identified by Hagel M, Miduturu C et al. (Hagel et al., 2015).

From the docking conformation of compound 19.10, it appears that compound 19g.10 can form hydrogen bonds not only with residues ASP630 and ALA553 but also the N-methylpropanamide structure of the compound can form hydrogen bonds with residue ASN557. And the 1-fluoride-4-methylpiperazine structure can also form hydrogen bonds with the protein residue LYS471. The above results suggest that compound 19g.10 designed in this experiment does have potential as a chemotherapeutic candidate for osteosarcoma.

4 Conclusion

In this study, linear and non-linear 2D-QSAR models were obtained experimentally, and it can be seen from the experimental results that the descriptor Min-NRO has the greatest influence on compound activity. However, it is obvious that only 2D experiments are not enough, so it is necessary to continue with 3D conformational relationship experiments. Through the CoMSIA method, a 3D-QSAR model with high q2 (0.63) and r2 (0.987) values and low error values (0.05) was obtained. In addition, among the five contour maps of CoMSIA model, the contour map that contributes most to the compound activity of the compound is the electrostatic field.

Finally, 200 new quinazoline derivatives were designed according to the molecular descriptor Min-NRO and contour map. In order to further verify the correlation between the compounds and the targets of osteosarcoma, small molecule docking experiments were also performed in this study, indicating that compound 19g.10 also had good target binding capability and can be regarded as a potential target compound for osteosarcoma. In conclusion, this study designed a reliable QSAR model for quinazoline compounds, which provided new ideas for compound design and could provide important guidance for the development of future chemotherapeutic drugs for osteosarcoma.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

Conceptualization, ZL and WB; methodology, ZL; software, HZ; validation, CS; formalanalysis, ZL; investigation, ZL; datacuration, WB; writing—original draft preparation, ZL; writing—review and editing, WB. All authors have read and agreed to the published version of the manuscript.

Acknowledgments

The authors are grateful to the providers of APS software, CODESSA software and SYBYL software.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Ai, Y., Wang, S. T., Tang, C., Sun, P. H., and Song, F. J. (2011). 3D-QSAR and docking studies on pyridopyrazinones as BRAF inhibitors. Med. Chem. Res. 20 (8), 1298–1317. doi:10.1007/s00044-010-9468-1

CrossRef Full Text | Google Scholar

Awasthi, M., Singh, S., Pandey, V. P., and Dwivedi, U. N. (2018). CoMFA and CoMSIA-based designing of resveratrol derivatives as amyloid-beta aggregation inhibitors against Alzheimer's disease. Med. Chem. Res. 27 (4), 1167–1185. doi:10.1007/s00044-018-2138-4

CrossRef Full Text | Google Scholar

Bishop, M. W., Janeway, K. A., and Gorlick, R. (2016). Future directions in the treatment of osteosarcoma. Curr. Opin. Pediatr. 28 (1), 26–33. doi:10.1097/MOP.0000000000000298

PubMed Abstract | CrossRef Full Text | Google Scholar

Botter, S. M., Neri, D., and Fuchs, B. (2014). Recent advances in osteosarcoma. Curr. Opin. Pharmacol. 16, 15–23. doi:10.1016/j.coph.2014.02.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Cao, C., and Lin, Y. (2003). Correlation between the glass transition temperatures and repeating unit structure for high molecular weight polymers. J. Chem. Inf. Comput. Sci. 43 (2), 643–650. doi:10.1021/ci0202990

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, Q., Wang, F., and Zhou, B. (2021). Investigations of retinoic acid receptor-related orphan receptor-gamma t (RORγt) agonists: A combination of 3D-QSAR, molecular docking and molecular dynamics. J. Biomol. Struct. Dyn. 39 (10), 3501–3514. doi:10.1080/07391102.2020.1765873

PubMed Abstract | CrossRef Full Text | Google Scholar

Dearden, J. C. (2017). The history and development of quantitative structure-activity relationships (QSARs): addendum[J]. Int. J. Quantitative Structure-Property Relat. (IJQSPR) 2 (2), 36–46. doi:10.4018/ijqspr.2017070104

CrossRef Full Text | Google Scholar

Franke, R., Huebel, S., and Streich, W. J. (1985). Substructural QSAR approaches and topological pharmacophores[J]. Environ. Health Perspect. 61, 239–255. United States: Pharmacochemistry library.

PubMed Abstract | CrossRef Full Text | Google Scholar

Geller, D. S., and Gorlick, R. (2010). Osteosarcoma: A review of diagnosis, management, and treatment strategies. Clin. Adv. Hematol. Oncol. 8 (10), 705–718.

PubMed Abstract | Google Scholar

Gharagheizi, F., Ilani-Kashkouli, P., Farahani, N., and Mohammadi, A. H. (2012). Gene expression programming strategy for estimation of flash point temperature of non-electrolyte organic compounds. Fluid Phase Equilibria 329, 71–77. doi:10.1016/j.fluid.2012.05.015

CrossRef Full Text | Google Scholar

Hadni, H., and Elhallaoui, M. (2020). 2D and 3D-QSAR, molecular docking and ADMET properties in silico studies of azaaurones as antimalarial agents. New J. Chem. 44 (16), 6553–6565. doi:10.1039/c9nj05767f

CrossRef Full Text | Google Scholar

Hagel, M., Miduturu, C., Sheets, M., Rubin, N., Weng, W., Stransky, N., et al. (2015). First selective small molecule inhibitor of FGFR4 for the treatment of hepatocellular carcinomas with an activated FGFR4 signaling pathway. Cancer Discov. 5 (4), 424–437. doi:10.1158/2159-8290.CD-14-1029

PubMed Abstract | CrossRef Full Text | Google Scholar

Ho, H. K., Pok, S., Streit, S., Ruhe, J. E., Hart, S., Lim, K. S., et al. (2009). Fibroblast growth factor receptor 4 regulates proliferation, anti-apoptosis and alpha-fetoprotein secretion during hepatocellular carcinoma progression and represents a potential target for therapeutic intervention. J. hepatology 50 (1), 118–127. doi:10.1016/j.jhep.2008.08.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Holland, J. H. (2005). Genetic algorithms: Computer programs that “evolve” in ways that resemble natural selection can solve complex problems even their creators do not fully understand[J]. Sci. Am. 267, 1992.

Google Scholar

Ivanciuc, O. (1996). HyperChem release 4.5 for windows[J]. J. Chem. Inf. Comput. Sci. 36 (3), 612–614. United States: Hypercube.

CrossRef Full Text | Google Scholar

Katritzky, A. R., Perumal, S., Petrukhin, R., and KlEinpEtEr, E. (2001). Codessa-based theoretical QSPR model for hydantoin HPLC-RT lipophilicities. J. Chem. Inf. Comput. Sci. 41 (3), 569–574. doi:10.1021/ci000099t

PubMed Abstract | CrossRef Full Text | Google Scholar

Kaydani, H., Mohebbi, A., and Eftekhari, M. (2014). Permeability estimation in heterogeneous oil reservoirs by multi-gene genetic programming algorithm. J. Petroleum Sci. Eng. 123, 201–206. doi:10.1016/j.petrol.2014.07.035

CrossRef Full Text | Google Scholar

Li, X., Ye, L., Wang, X., Liu, H., Qian, X., Zhu, Y., et al. (2012). Molecular docking, molecular dynamics simulation, and structure-based 3D-QSAR studies on estrogenic activity of hydroxylated polychlorinated biphenyls. Sci. Total Environ. 441, 230–238. doi:10.1016/j.scitotenv.2012.08.072

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Y., Wang, Y. H., Yang, L., Zhang, S. W., Liu, C. H., and Yang, S. L. (2005). Comparison of steroid substrates and inhibitors of P-glycoprotein by 3D-QSAR analysis. J. Mol. Struct. 733 (1-3), 111–118. doi:10.1016/j.molstruc.2004.08.012

CrossRef Full Text | Google Scholar

Mitchell, T., Buchanan, B., DeJong, G., Dietterich, T., Rosenbloom, P., and Waibel, A. (1990). Machine learning. Annu. Rev. Comput. Sci. 4 (1), 417–433. doi:10.1146/annurev.cs.04.060190.002221

CrossRef Full Text | Google Scholar

Moore, D. D., and Luu, H. H. (2014). Orthopaedic oncology: primary and metastatic tumors of the skeletal system. Osteosarcoma[J]. United States: Orthopaedic oncology, 65–92.

CrossRef Full Text | Google Scholar

Nandi, S., and Bagchi, M. C. (2016). EGFr, FGFr and PDGFr: Emerging targets for anticancer compound design[J]. J. Cancer Res. Updat. 5 (3), 99–108.

Google Scholar

Ottaviani, G., and Jaffe, N. (2009). The epidemiology of osteosarcoma. Pediatr. Adolesc. osteosarcoma 152, 3–13. doi:10.1007/978-1-4419-0284-9_1

CrossRef Full Text | Google Scholar

Pan, C., Nie, W., Wang, J., Du, J., Pan, Z., Gao, J., et al. (2021). Design, synthesis and biological evaluation of quinazoline derivatives as potent and selective FGFR4 inhibitors. Eur. J. Med. Chem. 225, 113794. doi:10.1016/j.ejmech.2021.113794

PubMed Abstract | CrossRef Full Text | Google Scholar

Patel, P. D., Patel, M. R., Kaushik-Basu, N., and Talele, T. T. (2008). 3D QSAR and molecular docking studies of benzimidazole derivatives as hepatitis C virus NS5B polymerase inhibitors. J. Chem. Inf. Model. 48 (1), 42–55. doi:10.1021/ci700266z

PubMed Abstract | CrossRef Full Text | Google Scholar

Pham, D., and Karaboga, D. (2012). Intelligent optimisation techniques: Genetic algorithms, tabu search, simulated annealing and neural networks[M]. Springer Science & Business Media.

Google Scholar

Querolle, O., Angibaud, P., Colombel, H., Caron, V., Pilatte, I., Poncelet, V., et al. (2015). Abstract 3640: Discovery of quinazolinones as fibroblast growth factor receptor (FGFR1-4) kinase inhibitors. Cancer Res. 75, 3640. doi:10.1158/1538-7445.am2015-3640

CrossRef Full Text | Google Scholar

Roy, K., Kar, S., and Das, R. N. (2015). Understanding the basics of QSAR for applications in pharmaceutical sciences and risk assessment[M]. Academic Press.

Google Scholar

Sanerkin, N. G. (1980). Definitions of osteosarcoma, chondrosarcoma, and fibrosarcoma of bone. J. Cancer 46 (1), 178–185. doi:10.1002/1097-0142(19800701)46:1<178:aid-cncr2820460130>3.0.co;2-c

PubMed Abstract | CrossRef Full Text | Google Scholar

Stewart, J. P. P. (1990). MOPAC: a semiempirical molecular orbital program[J]. J. Comput. Aided Mol. Des. 4 (1), 1–103. United States: University of Indiana[J].

PubMed Abstract | CrossRef Full Text | Google Scholar

Ta, H. T., Dass, C. R., Choong, P. F. M., and Dunstan, D. E. (2009). Osteosarcoma treatment: State of the art. Cancer Metastasis Rev. 28 (1), 247–263. doi:10.1007/s10555-009-9186-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Teodorescu, L., and Sherwood, D. (2008). High energy Physics event selection with gene expression programming. Comput. Phys. Commun. 178 (6), 409–419. doi:10.1016/j.cpc.2007.10.003

CrossRef Full Text | Google Scholar

Vainikka, S., Joukov, V., Wennström, S., BergMan, M., Pelicci, P. G., and Alitalo, K. (1994). Signal transduction by fibroblast growth factor receptor-4 (FGFR-4). Comparison with FGFR-1. J. Biol. Chem. 269 (28), 18320–18326. doi:10.1016/s0021-9258(17)32309-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Voskoboynik, O. Y., Starosyla, S. A., Protopopo, M. V., and Volynets, H. P. (2016). Synthesis, anticancer and FGFR1 inhibitory activity of isoindolo [2, 1-a][1, 2, 4] triazino [2, 3-c] quinazoline derivatives[J]. Med. Clin. Chem. (1), 5–18.

Google Scholar

Wang, F., Yang, W., and Zhou, B. (2022). Studies on the antibacterial activities and molecular mechanism of GyrB inhibitors by 3D-QSAR, molecular docking and molecular dynamics simulation. Arabian J. Chem. 15 (6), 103872. doi:10.1016/j.arabjc.2022.103872

CrossRef Full Text | Google Scholar

Wu, A. L., Coulter, S., Liddle, C., Wong, A., Eastham-Anderson, J., French, D. M., et al. (2011). FGF19 regulates cell proliferation, glucose and bile acid metabolism via FGFR4-dependent and independent pathways. PloS one 6 (3), e17868. doi:10.1371/journal.pone.0017868

PubMed Abstract | CrossRef Full Text | Google Scholar

Yan, W., Lin, G., Zhang, R., Liang, Z., and Wu, W. (2020). Studies on the bioactivities and molecular mechanism of antioxidant peptides by 3D-QSAR, in vitro evaluation and molecular dynamic simulations. Food & Funct. 11 (4), 3043–3052. doi:10.1039/c9fo03018b

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, W., Shen, S., Mu, L., and Yu, H. (2011). Structure-activity relationship study on the binding of PBDEs with thyroxine transport proteins. Environ. Toxicol. Chem. 30 (11), 2431–2439. doi:10.1002/etc.645

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, Y., Qin, J., Liu, H., and Yao, X. (2011). Molecular dynamics simulation, free energy calculation and structure-based 3D-QSAR studies of B-RAF kinase inhibitors. J. Chem. Inf. Model 28, 680–692. doi:10.1021/ci100427j

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, Z., Li, X., Ge, C., Si, H., Cui, L., Gao, H., et al. (2015). 3D-QSAR modeling and molecular docking study on Mer kinase inhibitors of pyridine-substituted pyrimidines. Mol. Divers. 19 (1), 135–147. doi:10.1007/s11030-014-9556-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: quinazolines, structure-activity relationship, osteosarcoma, Targeted drugs, small molecule docking

Citation: Lian Z, Sang C, Li N, Zhai H and Bai W (2023) 3D,2D-QSAR study and docking of novel quinazolines as potential target drugs for osteosarcoma. Front. Pharmacol. 14:1124895. doi: 10.3389/fphar.2023.1124895

Received: 15 December 2022; Accepted: 31 January 2023;
Published: 21 February 2023.

Edited by:

Hongzong Si, Qingdao University, China

Reviewed by:

An Xiao Li, Institute of Modern Physics (CAS), China
Hongli Liu, Xuzhou Medical University, China

Copyright © 2023 Lian, Sang, Li, Zhai and Bai. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Wenzhe Bai, 60170120@sdutcm.edu.cn

These authors share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.