ORIGINAL RESEARCH article

Front. Pharmacol., 28 April 2025

Sec. Experimental Pharmacology and Drug Discovery

Volume 16 - 2025 | https://doi.org/10.3389/fphar.2025.1596216

This article is part of the Research TopicIntelligent Computing for Integrating Multi-Omics Data in Disease Diagnosis and Drug DevelopmentView all 8 articles

DTGHAT: multi-molecule heterogeneous graph transformer based on multi-molecule graph for drug-target identification

Xinchen Jiang,&#x;Xinchen Jiang1,2Lu Wen,&#x;Lu Wen2,3Wenshui Li,Wenshui Li1,2Deng Que,,
Deng Que1,2,4*Lu Ming,
Lu Ming1,2*
  • 1The National Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha, China
  • 2Hunan provincical key laboratory of Neurorestoratology, The Second Affiliated Hospital of Hunan Normal University, Changsha, China
  • 3Department of Ophthalmology, 921 Hospital of Joint Logistics Support Force People’s Liberation Army of China, (The Second Affiliated Hospital of Hunan Normal University), Changsha, China
  • 4Department of Neurology, 921 Hospital of Joint Logistics Support Force People’s Liberation Army of China, (The Second Affiliated Hospital of Hunan Normal University), Changsha, China

Introduction: Drug target identification is a fundamental step in drug discovery and plays a pivotal role in new therapies development. Existing computational methods focus on the direct interactions between drugs and targets, often ignoring the complex interrelationships between drugs, targets and various biomolecules in the human system.

Method: To address this limitation, we propose a novel prediction model named DTGHAT (Drug and Target Association Prediction using Heterogeneous Graph Attention Transformer based on Molecular Heterogeneous). DTGHAT utilizes a graph attention transformer to identify novel targets from 15 heterogeneous drug-gene-disease networks characterized by chemical, genomic, phenotypic, and cellular networks.

Result: In a 5-fold cross-validation study, DTGHAT achieved an area under the receiver operating characteristic curve (AUC) of 0.9634, which is at least 4% higher than current state-of-the-art methods. Characterization ablation experiments highlight the importance of integrating biomolecular data from multiple sources in revealing drug-target interactions. In addition, a case study on cancer drugs further validates DTGHAT’s effectiveness in predicting novel drug target identification. DTGHAT is free and available at: https://github.com/stella-007/DTGHAT.git.

1 Introduction

Influencing the success or failure of a drug candidate (Avorn, 2015; Wu et al., 2018). Selecting an appropriate molecular target profoundly impacts the safety and efficacy profile of therapeutic agents, as evidenced by numerous drug candidates that fail approval processes due to unforeseen side effects or inadequate therapeutic efficacy resulting from incomplete knowledge of their actual molecular targets (Agamah et al., 2020; Ji et al., 2025). Traditional experimental methods for identifying drug-target interactions (DTIs), such as affinity chromatography and protein microarrays, are labor-intensive, time-consuming, and costly, limiting the pace of drug discovery (Lotfi Shahreza et al., 2018; Atas Guvenilir and Doğan, 2023).

To overcome these limitations, computational methods have gained prominence by facilitating the rapid and cost-effective screening of potential drug-target pairs before experimental validation. Current computational approaches to DTI identification can generally be categorized into text-mining-based methods (Fleuren and Alkema, 2015; Zhu et al., 2005; Hewett et al., 2002; Chen et al., 2010; Fu et al., 2016; Ji et al., 2020b), biological feature-based methods (Lavecchia, 2015; Mayr et al., 2018; Lo et al., 2018; He et al., 2017; Liu et al., 2016), and network-based methods (Zhang et al., 2021; Ji et al., 2024; Luo et al., 2017; Lu et al., 2017; Vinayagam et al., 2016; Nascimento et al., 2016; Olayan et al., 2018; Hao et al., 2017; Zong et al., 2017; Wang et al., 2023). Text-mining methods rely on extracting semantic similarities from literature data but are hindered by natural language descriptions’ variability and ambiguity. Biological feature-based approaches utilize extracted chemical and molecular properties, applying machine-learning techniques like logistic matrix factorization and gradient boosting; however, these methods often overlook vital interaction networks between drugs and proteins, limiting their predictive power.

Network-based computational methods, particularly those leveraging network topology and interaction profiles, have improved accuracy by predicting unknown DTIs based on known associations. However, most existing network-based methods, including recent ones such as DDRO (Olayan et al., 2018; Ji et al., 2020a) and DNILMF (Hao et al., 2017; Ji et al., 2024), construct drug and protein networks independently and do not adequately incorporate associations between drug-protein pairs (DPPs). This omission neglects crucial insights available through the interconnected biological network of drugs and proteins. This hinders their performance in realistic datasets, especially for novel drugs or targets lacking known interactions.

Recent advancements in graph-based deep learning, especially graph convolutional networks (GCNs), have demonstrated substantial potential in capturing complex interactions within biological data by effectively modeling both local and global topological information. Zhao et al. (2021), Wei et al. (2024) introduced a method integrating GCN with deep neural networks (DNNs) to build a comprehensive drug-protein pair (DPP) network, where nodes represent specific drug-protein pairs, and edges encode their associations based on drug-drug and protein-protein interactions. This approach significantly enhances the ability to discern true DTIs by capturing previously ignored relationships within the network.

In this paper, we present a novel graph-based deep learning framework for drug-target recognition (Figure 1). Our method proposes a multi-view graph that captures various relationships (chemical structure similarities, genomic, pathway connections, and known interactions) between drugs, targets, and other biomolecules. A graph attention network (GAT) is then utilized to learn topology-aware features from each view of this graph, thereby effectively highlighting significant connections in the network. We developed a multi-scale feature fusion module that aggregates information from multiple graph views and different neighborhood scales to determine the most effective way to combine local and global graph context features. Importantly, we incorporate a priori knowledge of attributes of drugs and targets (such as chemical descriptors, target protein sequences, and other domain-specific features) into the model. These attributes enrich the learned representations and help address the cold-start problem by enabling the model to predict novel drugs or targets with few or no known interactions. Our model output is an end-to-end pipeline in which learned drug and target representations are connected and fed into a Multi-Layer Perceptron (MLP) classifier to identify novel drug targets. We trained and evaluated the model on known DTI data and used cross-validation and holdout tests to assess its performance. The results demonstrate that our graph-based approach significantly improves prediction accuracy over conventional methods.

Figure 1
www.frontiersin.org

Figure 1. The flowchart of DTGHAT. (A) Data sources and some symbols in this study. (B) Multi-molecule correlation graph. (C) Multiple heterogeneous graph construction and multi-view graph attention network for graph topology feature extraction of Drugs and Proteins. (D) Multi-layer perceptron for training and prediction with attribute and graph topology features of Drugs and Proteins.

2 Results

2.1 Performance evaluation under 5-fold cross-validation

We assessed the performance of our model using k-fold cross-validation (CV) using a benchmark dataset of known drug-target interactions. The dataset was split so that in each fold, 80% of the known interactions were used for training, 10% for validation, and 10% held out for testing. This was done by rotating through k=5 folds. We ensured that during each fold, the negative examples (non-interacting pairs) were sampled anew and that any cold-start cases (e.g., a drug with no interactions in the training fold) were noted for separate analysis. Our model achieved excellent prediction performance across these folds (Figure 2). The average ROC AUC score was 0.95 (with individual fold AUCs ranging from 0.93 to 0.97), indicating a high true-positive rate across various threshold settings. Similarly, the average PR AUC (AUPR) was 0.95, reflecting strong precision in recovering true interactions even among a large set of negatives (Table 1). For instance, under 5-fold CV, the model attained an AUC of 0.96 ± 0.01 and AUPR of 0.95 ± 0.01, demonstrating both high accuracy and low variance in performance. Other metrics were encouraging: the mean accuracy was around 91%, with a sensitivity (recall) of 0.92 and specificity of 0.91 at the optimal threshold, and an MCC exceeding 0.80, which underscores a strong correlation between predictions and true labels. These results outperform baseline computational methods for DTI prediction by a substantial margin. For comparison, we implemented a matrix factorization-based method and a classic similarity-based model (which uses a weighted nearest-neighbor approach for DTIs); those achieved AUCs in the mid-0.80s on the same data, far below our graph-derived model. Even a simpler GCN model without multi-view attention reached about 0.90 AUC.

Figure 2
www.frontiersin.org

Figure 2. The prediction performance of DTGHAT in 5-fold cross-validation test. (A) The Receiver operating characterisitc analysis results of DTGHAT in 5-fold cross-validation test. An enlarged view of the curves is provided in the lower right corner. (B) The Precision recall results of DTGHAT in 5-fold cross-validation test. An enlarged view of the curves is provided in the lower left corner.

Table 1
www.frontiersin.org

Table 1. The 5-fold cross-validation performance of DTGHAT.

2.2 Parameter analysis

To evaluate the impact of crucial hyperparameters on our model’s prediction performance, we performed a detailed parameter analysis by tuning several key factors, including the number of MLP layers, and embedding dimension size, used for final prediction.

2.2.1 Number of MLP layers

We tested models with varying numbers of MLP layers, ranging from 1 to 4 layers. The results showed that model performance improved as the number of layers increased, with diminishing returns after the third layer. Using 2 layers for the MLP led to the most suitable trade-off between performance and computation time, achieving an average AUC of 0.956 ± 0.01 in 5-fold cross-validation (Table 2).

Table 2
www.frontiersin.org

Table 2. Parameter analysis on MLP layer.

2.2.2 Embedding dimension

The size of the node embedding, a key factor in graph learning, was also optimized. We tested values ranging from 400 to 1,024 dimensions. The model’s performance showed a consistent increase with the embedding size, peaking at 64 dimensions. Larger dimensions (1,024) did not offer significant performance gains but led to longer training times and increased memory usage. Thus, an embeded dimension of 732 was selected for optimal results (Table 3).

Table 3
www.frontiersin.org

Table 3. Parameter analysis on Embedding layer.

2.3 Ablation experiment

To evaluate the contribution of each component of our framework, we conducted ablation experiments. We created several ablation versions of the model, each with one key component removed or replaced, and measured the performance drop relative to the full model. The components analyzed included: (1) Graph attention mechanism–we substituted the GAT with a standard GCN (graph convolutional network) that treats all neighbors equally (no attention weighting); (2) Prior attribute features–we removed the drug and target attribute vectors, relying only on learned graph embeddings for predictions.

The ablation results are summarized in Figure 3A (showing the average AUC and AUPR for each variant). Replacing the graph attention with a GCN led to a modest drop in performance (AUC 0.92), showing that while much of the gain comes from the overall graph framework, the attention mechanism still provides a boost by focusing on the most informative neighbors. The effect was more pronounced in cases where the graph had noisy connections; the GAT could down-weight those, whereas the GCN could not, leading to lower precision.

Figure 3
www.frontiersin.org

Figure 3. The comparison of ablation results of DTGHAT in 5-fold cross-validation test. (A) The comparison of ablation results for different network. (B) The comparison of ablation results for different feature fusion strategies.

After we exclude a priori knowledge attributes, we find a significant effect, especially on the subset of predictions involving cold-start entities. The overall AUC drops by about 2 percentage points (to 0.93) when attributes are excluded, while the AUC for the cold-start subset drops sharply as expected (to 0.5–0.6, which is essentially no better than random guessing in these cases) (Figure 3B). This highlights the fact that attribute characterization is indispensable for the promotion of new drugs or targets. With all components active, our model takes full advantage of their respective strengths: graph views bring in various relational signals, attention, and fusion intelligently combine these signals, and a priori characterization ensures solid baseline knowledge of each node. These ablation studies clearly show that each module of the model - GAT and attribute fusion - contributes to the overall performance, and removing any of them reduces the prediction accuracy.

2.4 Performance comparison with the state-of-the-art methods

To demonstrate the efficacy of our proposed method, we compared its performance with several state-of-the-art drug-target interaction prediction models. These models include: DeepDTA, GCN-DTI, and GraphDTA.

• DeepDTA: (Öztürk et al., 2018) A deep learning model that predicts drug-target interactions using convolutional neural networks (CNNs) with drug sequences and target sequences.

• GCN-DTI: (Zhao et al., 2021) A graph convolutional network (GCN) model that learns embeddings from drug-target interaction graphs, using only graph structure and node features.

• GraphDTA: (Nguyen et al., 2021) A graph-based method that incorporates drug-target interaction data into a graph neural network for DTI prediction, focusing on learning end-to-end representations for both drugs and targets.

The comparison results (Figure 4) demonstrate that our model outperforms these methods in terms of prediction accuracy, as measured by the Area Under the Curve (AUC) of the Receiver Operating Characteristic (ROC) and the Area Under the Precision-Recall Curve (AUPR). Specifically, our model showed an average AUC of 0.966 ± 0.0102 in 5-fold cross-validation, outperforming DeepDTA (AUC of 0.913), GCN-DTI (AUC of 0.920), and GraphDTA (AUC of 0.941). Our method also achieved an average AUPR of 0.964 ± 0.0097 in 5-fold cross-validation, higher than DeepDTA (AUPR = 0.857), GCN-DTI (AUPR = 0.865), and GraphDTA (AUPR = 0.912). We achieved an accuracy of 91% for known drug-target interactions, while DeepDTA and GCN-DTI performed at 85% and 87%, respectively.

Figure 4
www.frontiersin.org

Figure 4. Performance comparison of DTGHAT with the state-of-the-art methods.

2.5 Case study

To evaluate the performance and applicability of the DTGHAT model in identifying novel drug-target interactions (DTIs), we conducted a case study using the DrugBank dataset. This dataset contains both known drug-target pairs and extensive drug and protein information, including their molecular properties and interactions. The case study aims to test the model’s ability to predict unknown drug-target interactions not previously documented.

Using DTGHAT, we predicted several novel drug-target pairs, which were further validated against literature sources. Some of these predicted interactions were previously unknown and were not present in DrugBank (Knox et al., 2024). Notably.

• While acetaminophen’s effects on POLE have not been reported in DrugBank, previous research Prot et al.(2012) demonstrated acetaminophen-induced upregulation of POLE expression in hepatoma cells. This prediction is consistent with the biological findings and validates DTGHAT’s ability to detect novel interactions.

• Acetaminophen was found to influence the expression of MPO, a key protein involved in inflammation and liver damage. This interaction, although reported by Zheng et al. (2015) and others, was not previously included in DrugBank. Our model’s prediction suggests that DTGHAT could be useful for identifying drug effects beyond direct target interactions.

• The predicted interaction between Benzydamine and MAPK was previously noted in studies by Riboldi et al. (2003), but it was not available in DrugBank. This novel finding highlights the model’s capability to uncover complex signaling pathway interactions?

• The predicted interaction between Levodopa and epidermal growth factor receptor (EGFR) was previously suggested in studies investigating neuroprotective effects through EGFR activation in neurodegenerative diseases (Gao et al., 2024; Sun et al., 2024), but it was not available in DrugBank. This novel finding highlights the model’s ability to uncover additional neuroprotective effects of Levodopa beyond its dopaminergic action, specifically through modulation of the EGFR signaling pathway.

3 Discussion

In this study, we present a graph-based deep learning framework for drug target identification that leverages a methodology. That addresses key challenges in computational DTI prediction by integrating multiple data views, using attention mechanisms for feature learning, and incorporating prior domain knowledge of drugs and targets. We constructed a multi-view graph to capture complementary relationships (drug-drug, target–target, and drug–target interactions) and applied a graph attention network to each view to learn rich topology-aware embeddings. Through a novel multi-scale feature fusion module, we efficiently merged these embeddings, extracting both local and global patterns to form a unified representation for each drug and target. We also infused prior attributes (e.g., chemical structure and protein sequence features) into the model, which proved crucial for enhancing accuracy and enabling predictions for novel, unseen drugs or targets. The learned drug and target representations were fed into an MLP in an end-to-end fashion to predict potential associations.

For future research, several avenues could be pursued. One direction is to extend this framework to predict not just binary interactions but quantitative drug–target affinities or polypharmacology profiles. This would involve adjusting the loss function and output layer accordingly. Another direction is to incorporate temporal or condition-specific data–for example, context-specific interaction networks or time-course gene expression–to predict drug-target interactions under different biological states (such as healthy vs. diseased tissues). Additionally, integrating our approach with molecular docking simulations or structural models could further refine our predictions by ensuring they are physically plausible. Finally, as larger and more diverse datasets become available (e.g., from high-throughput screens or proteomics), our multi-view approach can naturally scale by adding new views, and the attention mechanism will help select the most informative signals from this wealth of data.

4 Materials and methods

4.1 Datasets

The dataset used for training and testing our model consists of known drug-target interactions (DTIs), obtained from publicly available databases such as DrugBank (Knox et al., 2024) and ChEMBL (Gaulton et al., 2012). These databases contain curated drug-target interactions from experimental sources and computational predictions. We also incorporate additional data from other sources like BindingDB (Gilson et al., 2016), STITCH (Kuhn et al., 2007), and TargetNet to increase drug-target pair coverage. Each drug-target pair is represented by the drug’s chemical structure and the target’s protein sequence, along with any available prior knowledge about their interactions (such as binding affinity, and known side effects). For each target protein, we extract sequence features (such as UniProtConsortium, 2015), which include functional annotations and known protein families.

Furthermore, we also include external data sources to construct the multi-view graph. For example, drug similarity graphs based on molecular fingerprints (Morgan fingerprints), protein sequence similarity graphs based on BLAST similarity scores, and known interaction networks are incorporated. All nodes (drugs and targets) in the graph are assigned feature vectors that capture the chemical properties and biological characteristics of the entities.

We partition the dataset into training, validation, and test sets with a ratio of 80% training, 10% validation, and 10% testing. This ensures that no drug-target pair overlaps between training and testing sets. This split ensures that our model generalizes well to unseen data and is not biased by prior interactions. Additionally, negative samples (drug-target pairs not known to interact) are generated by randomly selecting non-interacting drug-target pairs from the dataset, providing a balanced contrast for model training.

4.2 Multimolecular association graph construction

The heterogeneous biological entity graph is composed of various biomolecule types, such as drugs, proteins, and diseases. Each biomolecule type is represented as a node, and interactions (such as drug-target interactions, protein-protein interactions, and disease pathways) are represented as edges. This graph is carefully designed to capture both direct and indirect relationships between the biomolecules. This allows the model to learn topological patterns that reflect the underlying biological complexities of drug-target interactions. The graph construction process relies on publicly available databases, ensuring biologically relevant and up-to-date data. The graph construction can be expressed as follows Formula 1:

G=V,E(1)

where V represents the set of nodes (biomolecules), and E represents the set of edges (biomolecule relationships). Each edge eij in the graph represents the relationship between two nodes vi and vj, which could be of various types such as drug-target interactions, protein interactions, or disease pathways.

4.2.1 Multi-view feature fusion

Given the heterogeneous nature of the data, we adopt a multi-view feature fusion approach to integrate information from various graph perspectives. The embeddings learned from different views (such as drug similarity, target similarity, and drug-target interactions) are combined through a fusion module. This module adaptively weighs the importance of each view and learns the optimal feature combination for accurate prediction. The fusion of features can be mathematically represented as follows Formula 2:

z=i=1Nαihi(2)

where hi denotes the feature vector from the i-th view, and ai is the weight assigned to the i-th view, which is learned through training. The summation of all views gives the fused feature vector z, which is then used for downstream predictions.

4.2.2 Prior knowledge integration

To further enhance model performance, particularly in cold-start scenarios, we integrate prior knowledge about drugs and targets. This includes incorporating chemical descriptors (molecular fingerprints for drugs) and protein sequence embeddings. This integration improves the model’s ability to generalize across unseen drugs or targets. The prior knowledge incorporation is given by Formula 3:

fprior =concatfchem ,fseq (3)

where fchem represents the chemical descriptor features of the drug, and fseq represents the protein sequence embedding of the target. The concatenated features fprior are added to the graph feature vectors, enhancing the model’s predictive capacity.

4.3 Molecular heterogeneous graph transformer

The core of the DTGHAT model is the Graph Attention Transformer (GAT), which incorporates both the structural relationships of the graph and the feature representations of the nodes. GAT allows the model to focus on the most informative nodes and relationships, providing a better understanding of drug-target interactions. The graph attention mechanism can be expressed as Formula 4:

aij=softmaxQiKj(4)

where Qi and Kj represent the query and key vectors for nodes i and j, respectively. The attention score aij determines the importance of node j in the context of node i, and is used to weigh the contribution of node j’s features when updating node i.

Each node’s feature is then updated using the attention mechanism and message passing. The message passing can be formalized as Formula 5:

hil+1=ReLUjNiaijWlhjl(5)

where N(i) represents the set of neighbors of node i, W1 is the weight matrix at layer l, and hj(l) is the feature of node j at layer l. The ReLU activation ensures that only positive features are passed along.

The final learned node embeddings are then passed through a Multi-Layer Perceptron (MLP) to predict the likelihood of interaction between drug-target pairs Formula 6:

ŷij=MLPhihj(6)

where denotes the concatenation operation between the embeddings of the drug and target, and ŷij represents the predicted interaction score between the drug i and target j.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/supplementary material.

Author contributions

XJ: Writing – original draft, Writing – review and editing. LW: Writing – original draft, Writing – review and editing, Data curation. WL: Writing – review and editing, Validation, Supervision. DQ: Writing – review and editing. LM: Funding acquisition, Writing – review and editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by Key Project of Hunan Provincial Science and Technology Innovation (2020SK2102), Hunan Provincial Natural Science Foundation of China (2024JJ9487).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Agamah, F. E., Mazandu, G. K., Hassan, R., Bope, C. D., Thomford, N. E., Ghansah, A., et al. (2020). Computational/in silico methods in drug target and lead prediction. Briefings Bioinforma. 21, 1663–1675. doi:10.1093/bib/bbz103

PubMed Abstract | CrossRef Full Text | Google Scholar

Atas Guvenilir, H., and Doğan, T. (2023). How to approach machine learning-based prediction of drug/compound–target interactions. J. Cheminformatics 15, 16. doi:10.1186/s13321-023-00689-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Avorn, J. (2015). The $2.6 billion pill-methodologic and policy considerations. N. Engl. J. Med. 372, 1877–1879. doi:10.1056/NEJMp1500848

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, B., Dong, X., Jiao, D., Wang, H., Zhu, Q., Ding, Y., et al. (2010). Chem2bio2rdf: a semantic framework for linking and data mining chemogenomic and systems chemical biology data. BMC Bioinforma. 11, 1–13. doi:10.1186/1471-2105-11-255

PubMed Abstract | CrossRef Full Text | Google Scholar

Fleuren, W. W., and Alkema, W. (2015). Application of text mining in the biomedical domain. Methods 74, 97–106. doi:10.1016/j.ymeth.2015.01.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Fu, G., Ding, Y., Seal, A., Chen, B., Sun, Y., and Bolton, E. (2016). Predicting drug target interactions using meta-path-based semantic network analysis. BMC Bioinforma. 17, 1–10. doi:10.1186/s12859-016-1005-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Gao, W., Jing, S., He, C., Saberi, H., Sharma, H. S., Han, F., et al. (2024). Advancements in neurodegenerative diseases: pathogenesis and novel neurorestorative interventions. J. Neurorestoratology 13, 100176. doi:10.1016/j.jnrt.2024.100176

CrossRef Full Text | Google Scholar

Gaulton, A., Bellis, L. J., Bento, A. P., Chambers, J., Davies, M., Hersey, A., et al. (2012). Chembl: a large-scale bioactivity database for drug discovery. Nucleic acids Res. 40, D1100–D1107. doi:10.1093/nar/gkr777

PubMed Abstract | CrossRef Full Text | Google Scholar

Gilson, M. K., Liu, T., Baitaluk, M., Nicola, G., Hwang, L., and Chong, J. (2016). Bindingdb in 2015: a public database for medicinal chemistry, computational chemistry and systems pharmacology. Nucleic acids Res. 44, D1045–D1053. doi:10.1093/nar/gkv1072

PubMed Abstract | CrossRef Full Text | Google Scholar

Hao, M., Bryant, S. H., and Wang, Y. (2017). Predicting drug-target interactions by dual-network integrated logistic matrix factorization. Sci. Rep. 7, 40376. doi:10.1038/srep40376

PubMed Abstract | CrossRef Full Text | Google Scholar

He, T., Heidemeyer, M., Ban, F., Cherkasov, A., and Ester, M. (2017). Simboost: a read-across approach for predicting drug–target binding affinities using gradient boosting machines. J. cheminformatics 9, 24–14. doi:10.1186/s13321-017-0209-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Hewett, M., Oliver, D. E., Rubin, D. L., Easton, K. L., Stuart, J. M., Altman, R. B., et al. (2002). Pharmgkb: the pharmacogenetics knowledge base. Nucleic acids Res. 30, 163–165. doi:10.1093/nar/30.1.163

PubMed Abstract | CrossRef Full Text | Google Scholar

Ji, B., Wang, X., Wang, X., Xu, L., and Peng, S. (2025). scdca: deciphering the dominant cell communication assembly of downstream functional events from single-cell rna-seq data. Briefings Bioinforma. 26, bbae663. doi:10.1093/bib/bbae663

PubMed Abstract | CrossRef Full Text | Google Scholar

Ji, B., Zou, H., Xu, L., Xie, X., and Peng, S. (2024). Muscle: multi-view and multi-scale attentional feature fusion for microrna–disease associations prediction. Briefings Bioinforma. 25, bbae167. doi:10.1093/bib/bbae167

PubMed Abstract | CrossRef Full Text | Google Scholar

Ji, B. Y., You, Z. H., Cheng, L., Zhou, J. R., Alghazzawi, D., and Li, L. P. (2020a). Predicting mirna-disease association from heterogeneous information network with grarep embedding model. Sci. Rep. 10, 6658. doi:10.1038/s41598-020-63735-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Ji, B. Y., You, Z. H., Jiang, H. J., Guo, Z. H., and Zheng, K. (2020b). Prediction of drug-target interactions from multi-molecular network based on LINE network representation method. J. Transl. Med. 18, 1–11. doi:10.1186/s12967-020-02490-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Knox, C., Wilson, M., Klinger, C. M., Franklin, M., Oler, E., Wilson, A., et al. (2024). Drugbank 6.0: the drugbank knowledgebase for 2024. Nucleic acids Res. 52, D1265–D1275. doi:10.1093/nar/gkad976

PubMed Abstract | CrossRef Full Text | Google Scholar

Kuhn, M., von Mering, C., Campillos, M., Jensen, L. J., and Bork, P. (2007). Stitch: interaction networks of chemicals and proteins. Nucleic acids Res. 36, D684–D688. doi:10.1093/nar/gkm795

PubMed Abstract | CrossRef Full Text | Google Scholar

Lavecchia, A. (2015). Machine-learning approaches in drug discovery: methods and applications. Drug Discov. today 20, 318–331. doi:10.1016/j.drudis.2014.10.012

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, Y., Wu, M., Miao, C., Zhao, P., and Li, X. L. (2016). Neighborhood regularized logistic matrix factorization for drug-target interaction prediction. PLoS Comput. Biol. 12, e1004760. doi:10.1371/journal.pcbi.1004760

PubMed Abstract | CrossRef Full Text | Google Scholar

Lo, Y. C., Rensi, S. E., Torng, W., and Altman, R. B. (2018). Machine learning in chemoinformatics and drug discovery. Drug Discov. today 23, 1538–1546. doi:10.1016/j.drudis.2018.05.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Lotfi Shahreza, M., Ghadiri, N., Mousavi, S. R., Varshosaz, J., and Green, J. R. (2018). A review of network-based approaches to drug repositioning. Briefings Bioinforma. 19, 878–892. doi:10.1093/bib/bbx017

PubMed Abstract | CrossRef Full Text | Google Scholar

Lu, Y., Guo, Y., and Korhonen, A. (2017). Link prediction in drug-target interactions network using similarity indices. BMC Bioinforma. 18, 39–9. doi:10.1186/s12859-017-1460-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Luo, Y., Zhao, X., Zhou, J., Yang, J., Zhang, Y., Kuang, W., et al. (2017). A network integration approach for drug-target interaction prediction and computational drug repositioning from heterogeneous information. Nat. Commun. 8, 573. doi:10.1038/s41467-017-00680-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Mayr, A., Klambauer, G., Unterthiner, T., Steijaert, M., Wegner, J. K., Ceulemans, H., et al. (2018). Large-scale comparison of machine learning methods for drug target prediction on chembl. Chem. Sci. 9, 5441–5451. doi:10.1039/c8sc00148k

PubMed Abstract | CrossRef Full Text | Google Scholar

Nascimento, A. C., Prudêncio, R. B., and Costa, I. G. (2016). A multiple kernel learning algorithm for drug-target interaction prediction. BMC Bioinforma. 17, 46–16. doi:10.1186/s12859-016-0890-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Nguyen, T., Le, H., Quinn, T. P., Nguyen, T., Le, T. D., and Venkatesh, S. (2021). Graphdta: predicting drug–target binding affinity with graph neural networks. Bioinformatics 37, 1140–1147. doi:10.1093/bioinformatics/btaa921

PubMed Abstract | CrossRef Full Text | Google Scholar

Olayan, R. S., Ashoor, H., and Bajic, V. B. (2018). Ddr: efficient computational method to predict drug–target interactions using graph mining and machine learning approaches. Bioinformatics 34 (7), 1164–1173. doi:10.1093/bioinformatics/bty417

PubMed Abstract | CrossRef Full Text | Google Scholar

Öztürk, H., Özgür, A., and Ozkirimli, E. (2018). Deepdta: deep drug–target binding affinity prediction. Bioinformatics 34, i821–i829. doi:10.1093/bioinformatics/bty593

PubMed Abstract | CrossRef Full Text | Google Scholar

Prot, J. M., Bunescu, A., Elena-Herrmann, B., Aninat, C., Snouber, L. C., Griscom, L., et al. (2012). Predictive toxicology using systemic biology and liver microfluidic “on chip” approaches: application to acetaminophen injury. Toxicol. Appl. Pharmacol. 259, 270–280. doi:10.1016/j.taap.2011.12.017

PubMed Abstract | CrossRef Full Text | Google Scholar

Riboldi, E., Frascaroli, G., Transidico, P., Luini, W., Bernasconi, S., Mancini, F., et al. (2003). Benzydamine inhibits monocyte migration and mapk activation induced by chemotactic agonists. Br. J. Pharmacol. 140, 377–383. doi:10.1038/sj.bjp.0705428

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, H., Dong, C., Li, Y., Li, M., Zhang, H., Xu, X., et al. (2024). Topographic and temporal patterns of dyskinesia in multiple system atrophy with predominant parkinsonism. J. Neurorestoratology 12, 100145. doi:10.1016/j.jnrt.2024.100145

CrossRef Full Text | Google Scholar

UniprotConsortium (2015). Uniprot: a hub for protein information. Nucleic acids Res. 43, D204–D212. doi:10.1093/nar/gku989

PubMed Abstract | CrossRef Full Text | Google Scholar

Vinayagam, A., Gibson, T. E., Lee, H. J., Yilmazel, B., Roesel, C., Hu, Y., et al. (2016). Controllability analysis of the directed human protein interaction network identifies disease genes and drug targets. Proc. Natl. Acad. Sci. 113, 4976–4981. doi:10.1073/pnas.1603992113

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, L., Wong, L., You, Z. H., and Huang, D. S. (2023). Amdecda: attention mechanism combined with data ensemble strategy for predicting circrna-disease association. IEEE Trans. Big Data 10, 320–329. doi:10.1109/tbdata.2023.3334673

CrossRef Full Text | Google Scholar

Wei, M., Wang, L., Li, Y., Li, Z., Zhao, B., Su, X., et al. (2024). Biokg-cmi: a multi-source feature fusion model based on biological knowledge graph for predicting circrna-mirna interactions. Sci. China Inf. Sci. 67, 189104. doi:10.1007/s11432-024-4098-3

CrossRef Full Text | Google Scholar

Wu, Z., Li, W., Liu, G., and Tang, Y. (2018). Network-based methods for prediction of drug-target interactions. Front. Pharmacol. 9, 1134. doi:10.3389/fphar.2018.01134

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, X. M., Liang, L., Liu, L., and Tang, M. J. (2021). Graph neural networks and their current applications in bioinformatics. Front. Genet. 12, 690049. doi:10.3389/fgene.2021.690049

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhao, T., Hu, Y., Valsdottir, L. R., Zang, T., and Peng, J. (2021). Identifying drug–target interactions based on graph convolutional network and deep neural network. Briefings Bioinforma. 22, 2141–2150. doi:10.1093/bib/bbaa044

PubMed Abstract | CrossRef Full Text | Google Scholar

Zheng, Z., Sheng, Y., Lu, B., and Ji, L. (2015). The therapeutic detoxification of chlorogenic acid against acetaminophen-induced liver injury by ameliorating hepatic inflammation. Chemico-Biological Interact. 238, 93–101. doi:10.1016/j.cbi.2015.05.023

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhu, S., Okuno, Y., Tsujimoto, G., and Mamitsuka, H. (2005). A probabilistic model for mining implicit ‘chemical compound–gene’relations from literature. Bioinformatics 21, ii245–ii251. doi:10.1093/bioinformatics/bti1141

PubMed Abstract | CrossRef Full Text | Google Scholar

Zong, N., Kim, H., Ngo, V., and Harismendy, O. (2017). Deep mining heterogeneous networks of biomedical linked data to predict novel drug–target associations. Bioinformatics 33, 2337–2344. doi:10.1093/bioinformatics/btx160

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: drug target identities, deep learning, molecular heterogeneous graph transformer, biological entity graph, machine learning

Citation: Jiang X, Wen L, Li W, Que D and Ming L (2025) DTGHAT: multi-molecule heterogeneous graph transformer based on multi-molecule graph for drug-target identification. Front. Pharmacol. 16:1596216. doi: 10.3389/fphar.2025.1596216

Received: 19 March 2025; Accepted: 14 April 2025;
Published: 28 April 2025.

Edited by:

Lei Wang, Chinese Academy of Sciences (CAS), China

Reviewed by:

Bo-Ya Ji, Hunan University, China
Zou Haitao, Guilin University of Electronic Technology, China

Copyright © 2025 Jiang, Wen, Li, Que and Ming. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Lu Ming, bHVtaW5nY3MxNjNAMTYzLmNvbQ==; Deng Que, ZGVuZ3F1ZTNzQDE2My5jb20=

These authors share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.