- 1Faculty of Applied Sciences, Macao Polytechnic University, Macau, China
- 2Faculty of Computing, Harbin Institute of Technology, Harbin, Heilongjiang, China
- 3Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, Sichuan, China
- 4Editorial Office, Geriatric Hospital of Nanjing Medical University, Nanjing, Jiangsu, China
- 5Yangtze Delta Region lnstitute (Quzhou), University of Electronic Science and Technology of China, Quzhou, Zhejiang, China
Introduction: Drug combination therapy represents a promising strategy for addressing complex diseases, offering the potential for improved efficacy while mitigating safety concerns. However, conventional wet-lab experimentation for identifying optimal drug combinations is resource-intensive due to the vast combinatorial search space. To address this challenge, computational methods leveraging machine learning and deep learning have emerged to effectively navigate this space.
Methods: In this study, we introduce a Calibrated Deep Feature Aggregation (CDFA) framework for screening synergistic drug combinations. Concretely, CDFA utilizes a novel cell line representation based on the protein information and gene expression capturing complementary biological determinants of drug response. Besides, a novel feature aggregation network is proposed based on the Transformer to model the intricate interactions between drug pairs and cell lines through multi-head attention mechanisms, enabling discovery of non-linear synergy patterns. Furthermore, a method is introduced to quantify and calibrate the uncertainties associated with CDFA’s predictions, enhancing the reliability of the identified synergistic drug combinations.
Results: Experiments results have demonstrated that CDFA outperforms existing state-of-the-art deep learning models.
Discussion: The superior performance of CDFA stems from its biologically informed cell line representation, its ability to capture complex non-linear drug-cell interactions via attention mechanisms, and its enhanced reliability through uncertainty calibration. This framework provides a robust computational tool for efficient and reliable drug combination screening.
1 Introduction
Drug combination therapy has emerged as a mainstay in the clinical treatment of various cancers (Meng et al., 2023), including lung cancer (Nair et al., 2023; Cui et al., 2024), ovarian cancer (Kong et al., 2023), and pancreatic cancer (Jaaks et al., 2022). Compared with monotherapy, combination therapies often demonstrate enhanced efficacy, reduced drug resistance, and decreased toxicity. However, it is crucial to recognize that not all drug combinations yield synergistic effects; in fact, some combinations may even exhibit antagonistic effects (Wang T. et al., 2023). For instance, the concomitant administration of antibiotics inhibiting DNA synthesis and those targeting protein synthesis can stimulate bacterial growth (Bollenbach et al., 2009). Therefore, the precise identification of synergistic drug pairs for specific cell types is essential to harness the full potential of combination therapy (Wang T. et al., 2022).
Traditional laboratory experiments to screen for synergistic drug combinations from the vast pharmacological space are often time-consuming and resource-intensive. Moreover, drug combination trials can sometimes result in side effects or harmful reactions in patients. With the growing availability of high-throughput screening data (Jiang et al., 2024; Liu et al., 2021), computational methods have emerged as efficient preclinical strategies for identifying synergistic drug combinations (Cao et al., 2024).
With the accumulation of data and the advancement of related technologies in recent decades, classical machine learning (ML)-based approaches and deep learning (DL) techniques have been employed to model drug combination trials, showing promising results by leveraging a variety of drug and cell line features. As drug combination effect prediction can be formulated as a regression or a multi-class classification task, the early ML-based methods often used the classical machine learning, such as logistic regression (LR) (H et al., 2014), support vector machine (SVM), random forests (RF) (Breiman, 2001), and extreme gradient boosting (XGboost). As early as 2014, Huang H et al. used a logistic regression model to systematically predict the drug combinations based on clinical side-effect (H et al., 2014). Pavel Sidorov et al. predicted Synergism of Cancer Drug Combinations by using NCI-ALMANAC Data based on RF and XGboost models (Sidorov et al., 2019). These methods laid the groundwork for more advanced approaches. Recently, deep learning (DL) models have shown excellent performance in bio-sequence analysis, gene regulation, and other areas, for extracting various data features and fusing heterogeneous data (Wang T. et al., 2024; Zhu et al., 2025). As the data about drugs continues to expand, most ML-based work has shifted towards deep learning (DL) models, driven by significant advancements in neural network architectures. One notable early DL model is DeepSynergy (Preuer et al., 2018), which integrates genomic data and drug information to identify drug combinations by a fully connected neural networks. Building on this foundation, newer DL models have emerged, leveraging advanced architectures like Transformers (Wang T. et al., 2024), Graph Neural Networks (GNNs) (Zhang et al., 2024), and Auto-Encoders (Zhu et al., 2025). For instance, CCSynergy (Hosseini and Zhou, 2023), GTextSy (Yan and Zheng, 2024), MMGCSyn (Zhang et al., 2025) and MatchMaker (Kuru et al., 2022) are integrated DNN with drug and cell line features. Based on Transformers models, DeepTraSynergy (Rafiei et al., 2023) and TranSynergy (Liu and Xie, 2021) were developed to learn drug representations and incorporate auxiliary knowledge through a novel neural network design. MRHGNN (Chen et al., 2025) and DeepDDS (Wang JX. et al., 2022) employ various GNNs to extract drug features by modeling drugs as graphs, capturing their structural properties. Moreover, recent research has introduced hypergraph neural networks to model complex relationships between cell lines and drug pairs (Wang W. et al., 2024; Liu et al., 2022).
In addition to neural network design, the fusion mechanism plays a crucial role in drug combination synergy prediction models. Recent studies have focused on effectively combining drug and cell line information to improve predictive accuracy. In parallel, advances in biological sequence classification have demonstrated the benefits of integrating multiple types of information. For instance, the SBSM-Pro model (Wang YZ. et al., 2024) introduces a novel multiple kernel learning strategy to combine sequence similarity measures, significantly enhancing classification performance. Similarly, DFFNDDS (Xu et al., 2023) employs two distinct neural networks to fuse drug features and cell line information from both bit-wise and vector-wise perspectives. DualSyn (Chen et al., 2024) introduces two modules to capture high-order and global information, enhancing the model’s ability to understand complex interactions. SynergyX (Guo et al., 2024) utilizes mutual-attention and self-attention mechanisms to model drug-cell and drug-drug interactions, providing a more nuanced understanding of these relationships. CircRDRP (Wang Y. et al., 2024) uses a graph neural network model to predict the association of circRNA with drug resistance by combining disease context characteristics and deep learning techniques. MMSyn (Pang et al., 2024) and AttenSyn (Wang TS. et al., 2023) leverage attention mechanisms to integrate multiple drug and cell line features, allowing the model to focus on the most relevant aspects of the data. CLCDA (Wang YT. et al., 2023) is a collaborative deep learning-based model for predicting potential associations between circRNA and disease. Despite these significant contributions, many of these approaches still rely on late fusion mechanisms, where drug and cell line features are combined at a later stage in the model. This can limit the model’s ability to fully capture the intricate interactions between drugs and cell lines. To address the limitations of late fusion mechanisms, this study proposes the Calibrated Deep Feature Aggregation (CDFA) framework–a Transformer-based architecture that enables early-stage integration of proteomic features and gene expression profiles to capture intricate drug-drug-cell interactions. The design incorporates dedicated uncertainty calibration to ensure probabilistic reliability. Experimental validation demonstrates CDFA’s fusion efficacy: comprehensive testing across two benchmark datasets (spanning diverse cell lines and tissue types) confirms both the structural effectiveness and superior generalization of our approach.
2 Materials and methods
2.1 Synergy datasets
We assessed our method using two publicly available datasets: O'Neil (O'Neil et al., 2016) and NCI-ALMANAC (Holbeck et al., 2017). The O'Neil dataset comprised 23,062 drug combination samples involving 38 drugs and 39 human cancer cell lines. The NCI-ALMANAC dataset was relatively larger, containing 304,549 data points across 104 drugs and 60 cell lines. The synergy value for each sample is represented by the Loewe and combination scores for O'Neil and NCI-ALMANAC, respectively. The characteristics of the cell lines were represented by 651 gene expression values obtained from the COSMIC database (Forbes et al., 2015). Following established preprocessing steps (Liu et al., 2022), the final datasets included 18,950 and 74,139 drug-drug-cell line combinations for O'Neil and NCI-ALMANAC, respectively. Figure 1 depicts the distribution of synergy scores for both datasets. Notably, the left side of the distribution, centered around 30, constitutes more than half of the dataset. These values correspond to the negative pairs that exhibit either additive or antagonistic effects, indicating that a significant portion of the drug combinations do not show a synergistic benefit over the individual effects of the drugs. This observation underscores the complexity of identifying truly synergistic drug pairs and highlights the importance of systematic screening and computational approaches to optimize drug combination therapies.

Figure 1. The distribution of synergy scores in the O’Neil and NCI-ALMANAC datasets. The vertical axis represents sample frequency counts, while the horizontal axis displays synergy scores. (a) O’Neil dataset. (b) NCI-ALMANAC dataset.
To train and evaluate the model, we began by randomly selecting 90% of drug pairs and cell lines from each dataset to conduct three different experimental settings: random setting, cold cell line setting, and cold drug pair setting. The remaining 10% of the samples were set aside as an independent test set to evaluate generalization performance. For the random splitting setting, we divided the samples into five equal subsets. One subset served as the test set, while the remaining four were further split into training and validation sets in a 9:1 ratio. In the cold cell line setting, all the unique cell lines were divided into five equal groups randomly. The related samples which contain the cell line from one of these groups were used for testing, while the remaining samples were split into a 9:1 ratio as the training set and validation set. This ensured that the test set included only cell lines not present in the training set. For the cold drug pair setting, drug pairs were similarly partitioned into five equal groups. Four groups were used for training, with the test set containing only those drug pairs not seen during training. This ensured that the model was tested on an entirely new pair of drugs.
2.2 Problem formulation
In this study, we formulate the synergy prediction problem as a regression task. Let
2.3 Drug and cell line representations
A variety of molecular representations have been employed for drug combination prediction tasks. Fingerprints, such as ECFP and MHFP, are commonly used to encode compound structures. In this study, we adopted the MinHashed Atom-Pair fingerprint extended to four bonds (MAP4) as our molecular representation. MAP4 offers a versatile approach to representing diverse chemical structures.
Gene expression profiles have been commonly employed to represent cell lines in drug combination prediction tasks. In this study, we utilized gene expression data extracted from COSMIC, represented as 651-dimensional vectors (
2.4 Feature encoder
The weighted gene expression representation of a cell line is fed into a cell line feature encoder to learn abstract cell line representations. This encoder comprises three convolutional layers interleaved with pooling layers. The initial convolutional layer transforms the input into feature maps, which are subsequently downsampled using max-pooling. This process is repeated three times.
The MAP4 vector representing a drug is input into a drug feature encoder to extract high-level abstract features. The encoder consists of two fully connected (FC) layers followed by Gaussian Error Linear Units (GELU) (Hendrycks and Gimpel, 2016) and batch normalization. The resulting features serve as essential inputs for subsequent fusion operations. The formulation of the drug feature encoder can be summarized as follows (Equation 1):
where
We refer to these generated drug pair features as
2.5 Deep feature aggregation module
Given the drug pair features
Global feature fusion: This process aims to integrate drug and early cell line features, followed by reinforcing the fused global features back into the local cell features. We employ a transformer encoder for global feature fusion. The core idea of the transformer encoder is the attention mechanism. An attention function maps queries (
where
The multi-head attention mechanism consists of multiple attention heads, with each head conducting a linear transformation on the input vectors before performing the attention operation. Each attention head has its own set of trainable parameters, allowing it to potentially model an independent relationship between the input vectors. This is achieved by utilizing different parameters in the linear transformation step.
Then, for the
wherein
Then, the output of the multi-head attention mechanism is the linear transformation of the concatenation of the output vectors acquired from the attention heads (Equation 4):
where
Besides the attention mechanism, the transformer encoder also contains the residual and feed-forward neural network. Formally, the global feature fusion can be defined as follows (Equation 5):
where
Global to local cell line feature fusion: Inspired by recent findings that drugs can influence the synergistic or antagonistic effects of drug combinations through modulating key gene expression (Wu et al., 2023), we incorporate a global-to-local cell line feature fusion network to simulate drug-induced gene regulation effects. The local cell line feature is enhanced through multi-head attention where global features (
2.6 Synergy prediction module
The final synergy value of a drug combination is predicted using the output of the global feature fusion network (
Given a training dataset, that contains
2.7 Uncertainty quantification
We use an ensemble method to further enhance generalization and quantify the uncertainty of the CDFA. Specifically, we trained
2.8 Uncertainty recalibration
Calibration errors (Mervin et al., 2021) in probability estimates compromise reliability by creating discrepancies between predicted and true probabilities. Specifically, they refer to the discrepancy between the model’s predicted confidence and the actual observed frequency of correctness at that confidence level. For example, if a model assigns 80% confidence to a set of predictions, but only 70% of them are correct, this indicates a calibration error in that confidence range. Such miscalibration reduces the effectiveness of uncertainty estimates as indicators of trustworthiness in predictions.
To address this issue, a common strategy is to learn a recalibration function that adjusts the predicted uncertainties to better align with the true underlying probabilities. The recalibration function is often a non-linear uncertainty scaling function, learned using a hold-out validation dataset to create a calibration map, and is often assessed using metrics like Expected Calibration Error (ECE). In our method, we adopt a simple yet effective single-parameter scaling approach that adjusts only the uncertainty component
3 Results
3.1 Overview of the CDFA framework
CDFA is an ensemble deep learning framework for predicting the potential synergy effects of drug combinations based on the drugs’ molecular information and the cells’ gene expression. The overall architecture of CDFA is shown in Figure 2. It consists of three main components: the feature encoders for the drug pair and cell line, the feature aggregation module, and the synergy prediction module. First, MAP4 is used to represent diverse chemical structures of the paired drugs. Gene expression profiles are employed to represent cell lines in drug combination prediction tasks. Then, feature encoders are used to extract these three types of features separately. A novel feature aggregation network is involved based on the Transformer which tries to capture the intricate interactions between drug pairs and cell lines. Finally, the aggregated features are connected to another synergy prediction module. The subsequent sections of this section provide detailed evidence of the superiority of this computational framework.

Figure 2. The overview network of CDFA. Data flows sequentially from input to output through three core components: (1) feature encoders for drug pairs and cell lines, (2) feature aggregation module, and (3) synergy prediction module.
3.2 Comparison with existing models
To evaluate CDFA’s performance, we compared it with nine existing drug combination synergy prediction models: HypergraphSynergy, DeepSynergy, DTF, CombFM, Celebi’s method, PermuteDDS, MatchMaker, GTextSyn and MMGCSyn. We employed three common regression evaluation metrics to assess the performance of these methods: root mean squared error (RMSE), coefficient of determination (
As Table 1 shows, we compared CDFA’s performance with several models using the O'Neil dataset across three different experimental setups. In the random split scenario, where data is divided without specific constraints, the CDFA model outshone others with the lowest RMSE at 13.522, alongside the highest
As shown in Table 2, the consistent superiority of CDFA has also been demonstrated on the NCI-ALMANAC dataset. In the random split setup, CDFA exhibited the best performance with the lowest RMSE of 41.893, highest

Table 2. Performance comparison on the NCI-ALMANAC dataset. Bold values indicate the best performance.
The 10% of the samples of the O'Neil and NCI-ALMANAC datasets were set aside as an independent test set to evaluate these models’ generalization performance. In the independent test data section of the O'Neil and NCI-ALMANAC datasets, the superior performance of CDFA has been once again proven. As Table 3 shows, it illustrates the performance of various methods when applied to the independent test datasets. For the O'Neil dataset, CDFA demonstrates superior accuracy with the lowest RMSE of 15.111 and the highest

Table 3. Performance comparison on the independent test datasets. Bold values indicate the best performance.
Overall, CDFA consistently demonstrated strong performance, particularly excelling in the random split and cold drug pair settings. However, the poor performance of all methods in the cold cell line setting suggests that future research should focus on improving models' ability to generalize to new cell lines.
3.3 Tissue-specific analysis
Both previous studies and our own experiments have consistently demonstrated that model performance deteriorates significantly under the cold cell-line scenario, where test cell lines are entirely disjoint from those seen during training. This setting introduces substantial biological variability, making it difficult to disentangle whether performance degradation arises from tissue-specific effects or from the challenge of generalizing to unseen cell-line profiles.
To avoid this confounding factor, we also conducted a tissue-specific analysis on the O’Neil and NCI-ALMANAC datasets. The O’Neil dataset is built on testing 38 drugs on 39 cell lines representing multiple cancer types from six tissue origins. The NCI-ALMANAC dataset covers 104 drugs in 60 cell lines from nine tissue origins. As illustrated in Figures 3, 4, our analysis employs raincloud plots to visualize the distribution of MSE for the two independent test datasets. These plots combine box plots with kernel density estimates ('clouds') to visualize both the shape and central tendency of the error distributions, with outliers indicated by diamond markers.

Figure 3. Raincloud plots of MSE for O’Neil independent test dataset. The horizontal axis quantifies mean squared error (MSE) between true synergy scores and model predictions.

Figure 4. Raincloud plots of MSE for NCI-ALMANAC independent test dataset. The horizontal axis quantifies mean squared error (MSE) between true synergy scores and model predictions.
Our analysis reveals that although the MSE values of the median, second quartile, and third quartile are low, almost all tissues included by the two datasets have MSE values exceeding 500 and 2000, respectively. This suggests that while there is a small number of higher error values across most tissues, the central tendency of the error distribution may be relatively low. This pattern indicates that the model can achieve efficient prediction across different tissues. Our analysis confirms that the presence of high-error predictions—though limited in quantity—reveals significant variability in model performance. Such findings highlight the need for further investigation into the factors contributing to these higher errors and suggest that improvements in model accuracy and consistency are necessary for more reliable predictions across different tissue types.
3.4 Uncertainty results
Figure 5 displays the calibration curves of CDFA under various settings for both the O'Neil and NCI-ALMANAC datasets. The figures are organized from left to right, representing random splits, cold cell line settings, and cold drug pair settings, respectively. The first row showcases the O'Neil dataset, whereas the second row pertains to the NCI-ALMANAC dataset. The space between the calibration curves and the diagonal line represents the miscalibration area, which quantifies the extent of uncertainty calibration. As illustrated in Figure 5, CDFA’s recalibration algorithm successfully shifts the calibration curves closer to the diagonal line, thereby reducing the miscalibration area and improving the reliability of the predictions.

Figure 5. Calibration curves of CDFA. On the x-axis, it plots the expected confidence level, while the y-axis shows the observed proportion of correct predictions. A perfectly calibrated model lies on the diagonal line, where the observed proportion of correct outcomes exactly matches the stated confidence at every interval.
Figure 6 illustrates the relationship between prediction error and uncertainty, with uncertainty measured as the standard deviation (std). In this figure, red points indicate errors that do not fall within two standard deviations, while black and blue points represent errors that fall within one and two standard deviations, respectively. It is evident that the majority of the observed errors lie within two standard deviations, reflecting a reasonable alignment between the model’s predicted uncertainty and its actual prediction error.

Figure 6. Relationship between model error and uncertainty of all test datasets. The vertical axis represents the prediction errors, while the horizontal axis displays uncertainty measured as the standard deviation (std).
4 Conclusion
This study introduces an ensemble deep learning framework for predicting the potential synergy effects of drug combinations, showcasing superior performance relative to existing methods. A key innovation is the dual-level feature fusion mechanism, which integrates deep semantic features from various network modules, enhancing the model’s ability to capture complex interactions. The model leverages convolutional processing of the gene expression matrix to identify key gene signals relevant to drug response. Combined with a Transformer-based attention mechanism, this architecture enables context-aware re-weighting of gene importance under specific drug–cell interactions. This design emulates biological processes where only a subset of genes contribute significantly to the synergistic effect of drug combinations. Furthermore, the model’s prediction errors demonstrate robust generalization across tissues, as reflected in the consistent error distributions observed across different tissue types. Isolated high-error samples may correspond to biologically unique or complex cell lines, offering potential avenues for future investigation. Uncertainty estimation is integrated into the model, providing a critical safeguard against biased or overconfident predictions. This feature is especially valuable in guiding both the refinement of known synergies and the exploration of novel drug combinations. Additionally, the uncertainty estimation is integrated into the model, providing a critical safeguard against biased or overconfident predictions. This feature is especially valuable in guiding both the refinement of known synergies and the exploration of novel drug combinations. The uncertainty quantification and recalibration processes ensure that the model’s predictions are not only accurate but also reliable, offering a balanced approach to decision-making. While the experimental results demonstrate excellent performance on two datasets, further investigation is needed to assess the model’s robustness and generalization capabilities, particularly in scenarios involving new cell lines. Enhancing the interpretability of the model is another important area for future research, as it can provide deeper insights into the mechanisms underlying drug synergy and facilitate broader acceptance.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/supplementary material. All codes of CDFA can be accessed from https://github.com/TracyHIT/CDFA.
Author contributions
XK: Formal Analysis, Data curation, Writing – original draft. XyL: Writing – review and editing. QZ: Formal Analysis, Data curation, Writing – review and editing, Funding acquisition. TL: Writing – original draft. XmL: Formal Analysis, Data curation, Writing – review and editing, Funding acquisition.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This research was funded by the National Science and Technology Major Project (Grant No. 2022ZD0117700), and the National Natural Science Foundation of China (Grant No.62371347 and 62271174). The submission code of Macao Polytechnic University is fca.c4de.e9f0.1.
Acknowledgments
We sincerely appreciate the dedicated efforts of Ximei Luo and XK in primary data collection and analysis, with valuable assistance from QZ. Special thanks go to TL and XK for leading the manuscript writing, as well as Xiaoyan Liu for her meticulous proofreading and constructive suggestions. We also extend our gratitude to all colleagues and collaborators who provided insightful discussions and technical support throughout this research.
Finally, we thank the reviewers for their insightful comments, which helped improve the quality of this manuscript.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Bollenbach, T., Quan, S., Chait, R., and Kishony, R. (2009). Nonoptimal microbial response to antibiotics underlies suppressive drug interactions. Cell 139 (4), 707–718. doi:10.1016/j.cell.2009.10.025
Brent, R. P. (1971). An algorithm with guaranteed convergence for finding a zero of a function. Comput. J. 14 (4), 422–425. doi:10.1093/comjnl/14.4.422
Cao, C., Wang, C., Dai, Q., Zou, Q., and Wang, T. (2024). CRBPSA: circrna-rbp interaction sites identification using sequence structural attention model. BMC Biol. 22, 260. doi:10.1186/s12915-024-02055-0
Chen, M. J., Zhang, M., Yan, G. Y., Wang, G. H., and Qu, C. Q. (2025). MRHGNN: enhanced multimodal relational hypergraph neural network for synergistic drug combination forecasting. IEEE Trans. Neural Netw. Learn Syst., 1–13. doi:10.1109/TNNLS.2025.3553385
Chen, Z. H., Li, Z. M., Shen, X. Z., Liu, Y. S., Lin, X., Zeng, D. J., et al. (2024). DualSyn: a dual-level feature interaction method to predict synergistic drug combinations. Expert Syst. Appl. 257, 125065. doi:10.1016/j.eswa.2024.125065
Cui, X., Lin, Q., Chen, M., Wang, Y., Wang, Y., Wang, Y., et al. (2024). Long-read sequencing unveils novel somatic variants and methylation patterns in the genetic information system of early lung cancer. Comput. Biol. Med. 171, 108174. doi:10.1016/j.compbiomed.2024.108174
Forbes, S. A., Beare, D., Gunasekaran, P., Leung, K., Bindal, N., Boutselakis, H., et al. (2015). COSMIC: exploring the world's knowledge of somatic mutations in human cancer. Nucleic Acids Res. 43 (D1), D805–D811. doi:10.1093/nar/gku1075
Guo, Y., Hu, H. T., Chen, W. B., Yin, H., Wu, J., Hsieh, C. Y., et al. (2024). SynergyX: a multi-modality mutual attention network for interpretable drug synergy prediction. Briefings Bioinforma. 25 (2), 15. doi:10.1093/bib/bbae015
Hendrycks, D., and Gimpel, K. (2016). “Gaussian error linear units (gelus),” in arXiv preprint arXiv:160608415.
Holbeck, S. L., Camalier, R., Crowell, J. A., Govindharajulu, J. P., Hollingshead, M., Anderson, L. W., et al. (2017). The national cancer institute ALMANAC: a comprehensive screening resource for the detection of anticancer drug pairs with enhanced therapeutic activity. Cancer Res. 77 (13), 3564–3576. doi:10.1158/0008-5472.CAN-17-0489
Hosseini, S. R., and Zhou, X. B. (2023). CCSynergy: an integrative deep -learning framework enabling context -aware prediction of anti -cancer drug synergy. Brief. Bioinform 24 (1), bbac588. doi:10.1093/bib/bbac588
Huang, H., Zhang, P., Qu, X. A., Sanseau, P., and Yang, L. (2014). Systematic prediction of drug combinations based on clinical side-effects. Sci. Rep. 4 (1), 7160. doi:10.1038/srep07160
Jaaks, P., Coker, E. A., Vis, D. J., Edwards, O., Carpenter, E. F., Leto, S. M., et al. (2022). Effective drug combinations in breast, Colon and pancreatic cancer cells. Nature 603 (7899), 166–173. doi:10.1038/s41586-022-04437-2
Jiang, T., Guo, H. Z., Liu, Y. D., Li, G. Y., Cui, Z., Cui, X. R., et al. (2024). A comprehensive genetic variant reference for the Chinese population. Sci. Bull. 69 (24), 3820–3825. doi:10.1016/j.scib.2024.06.017
Kong, S., Moharil, P., Handly-Santana, A., Boehnke, N., Panayiotou, R., Gomerdinger, V., et al. (2023). Synergistic combination therapy delivered via layer-by-layer nanoparticles induces solid tumor regression of ovarian cancer. Bioeng. Transl. Med. 8 (2), e10429. doi:10.1002/btm2.10429
Kuru, H. I., Tastan, O., and Cicek, A. E. (2022). MatchMaker: a deep learning framework for drug synergy prediction. IEEE-ACM Trans. Comput. Biol. Bioinform 19 (4), 2334–2344. doi:10.1109/TCBB.2021.3086702
Liu, Q., and Xie, L. (2021). TranSynergy: mechanism-Driven interpretable deep neural network for the synergistic prediction and pathway deconvolution of drug combinations. PLoS Comput. Biol. 17 (2), e1008653. doi:10.1371/journal.pcbi.1008653
Liu, X., Song, C. Z., Liu, S. C., Li, M. L., Zhou, X. H., and Zhang, W. (2022). Multi-way relation-enhanced hypergraph representation learning for anti-cancer drug synergy prediction. Bioinformatics 38 (20), 4782–4789. doi:10.1093/bioinformatics/btac579
Liu, Y. D., Jiang, T., Gao, Y., Liu, B., Zang, T. Y., and Wang, Y. D. (2021). Psi-caller: a lightweight short read-based variant caller with high speed and accuracy. Front. Cell Dev. Biol. 9, 11. doi:10.3389/fcell.2021.731424
Meng, P., Wang, G. H., Guo, H. Z., and Jiang, T. (2023). Identifying cancer driver genes using a two-stage random walk with restart on a gene interaction network. Comput. Biol. Med. 158, 106810. doi:10.1016/j.compbiomed.2023.106810
Mervin, L. H., Johansson, S., Semenova, E., Giblin, K. A., and Engkvist, O. (2021). Uncertainty quantification in drug design. Drug Discov. Today 26 (2), 474–489. doi:10.1016/j.drudis.2020.11.027
Nair, N. U., Greninger, P., Zhang, X. H., Friedman, A. A., Amzallag, A., Cortez, E., et al. (2023). A landscape of response to drug combinations in non-small cell lung cancer. Nat. Commun. 14 (1), 3830. doi:10.1038/s41467-023-39528-9
O'Neil, J., Benita, Y., Feldman, I., Chenard, M., Roberts, B., Liu, Y. P., et al. (2016). An unbiased oncology compound screen to identify novel combination strategies. Mol. Cancer Ther. 15 (6), 1155–1162. doi:10.1158/1535-7163.MCT-15-0843
Pang, Y., Chen, Y. H., Lin, M. J., Zhang, Y. H., Zhang, J. Q., and Wang, L. (2024). MMSyn: a new multimodal deep learning framework for enhanced prediction of synergistic drug combinations. J. Chem. Inf. Model 64 (9), 3689–3705. doi:10.1021/acs.jcim.4c00165
Preuer, K., Lewis, R. P. I., Hochreiter, S., Bender, A., Bulusu, K. C., and Klambauer, G. (2018). DeepSynergy: predicting anti-cancer drug synergy with deep learning. Bioinformatics 34 (9), 1538–1546. doi:10.1093/bioinformatics/btx806
Rafiei, F., Zeraati, H., Abbasi, K., Ghasemi, J. B., Parsaeian, M., and Masoudi-Nejad, A. (2023). DeepTraSynergy: drug combinations using multimodal deep learning with transformers. Bioinformatics 39 (8), btad438. doi:10.1093/bioinformatics/btad438
Sidorov, P., Naulaerts, S., Arley-Bonnet, J., Pasquier, E., and Ballester, P. J. (2019). Predicting synergism of cancer drug combinations using NCI-ALMANAC data. Front. Chem. 7, 13. doi:10.3389/fchem.2019.00509
Wang, J. X., Liu, X. J., Shen, S. Y., Deng, L., and Liu, H. (2022b). DeepDDS: deep graph neural network with attention mechanism to predict synergistic drug combinations. Briefings Bioinforma. 23 (1), bbab390. doi:10.1093/bib/bbab390
Wang, T., Renteria, M. E., and Peng, J. (2022a). Editorial: data mining and statistical methods for knowledge discovery in diseases based on multimodal omics. Front. Genet. 13, 895796. doi:10.3389/fgene.2022.895796
Wang, T., Shu, H., Hu, J. L., Wang, Y. T., Chen, J., Peng, J. J., et al. (2024a). Accurately deciphering spatial domains for spatially resolved transcriptomics with stCluster. Brief. Bioinform 25 (4), bbae329. doi:10.1093/bib/bbae329
Wang, T., Yang, J., Xiao, Y., Wang, J., Wang, Y., Zeng, X., et al. (2023a). DFinder: a novel end-to-end graph embedding-based method to identify drug–food interactions. Bioinformatics 39 (1), btac837. doi:10.1093/bioinformatics/btac837
Wang, T. S., Wang, R. H., and Wei, L. Y. (2023b). AttenSyn: an attention-based deep graph neural network for anticancer synergistic drug combination prediction. J. Chem. Inf. Model 64 (7), 2854–2862. doi:10.1021/acs.jcim.3c00709
Wang, W., Yuan, G., Wan, S., Zheng, Z., Liu, D., Zhang, H., et al. (2024b). A granularity-level information fusion strategy on hypergraph transformer for predicting synergistic effects of anticancer drugs. Briefings Bioinforma. 25 (1), bbad522. doi:10.1093/bib/bbad522
Wang, Y., Shen, W., Shen, Y., Feng, S., Wang, T., Shang, X., et al. (2024d). Integrative graph-based framework for predicting circRNA drug resistance using disease contextualization and deep learning. IEEE J. Biomed. health Inf. 1–12. doi:10.1109/JBHI.2024.3457271
Wang, Y. T., Liu, X. M., Shen, Y. W., Song, X. R., Wang, T., Shang, X. Q., et al. (2023c). Collaborative deep learning improves disease-related circRNA prediction based on multi-source functional information. Briefings Bioinforma. 24 (2), bbad069. doi:10.1093/bib/bbad069
Wang, Y. Z., Zhai, Y. X., Ding, Y. J., and Zou, Q. (2024c). SBSM-Pro: support bio-sequence machine for proteins. Sci. China-Information Sci. 67 (11), 212106. doi:10.1007/s11432-024-4171-9
Wu, L., Gao, J., Zhang, Y., Sui, B., Wen, Y., Wu, Q., et al. (2023). A hybrid deep forest-based method for predicting synergistic drug combinations. Cell Rep. methods. 3 (2), 100411. doi:10.1016/j.crmeth.2023.100411
Xu, M. D., Zhao, X. W., Wang, J. Y., Feng, W., Wen, N. F., Wang, C. Y., et al. (2023). DFFNDDS: prediction of synergistic drug combinations with dual feature fusion networks. J. Cheminformatics 15 (1), 33. doi:10.1186/s13321-023-00690-3
Yan, S. Y., and Zheng, D. (2024). A deep neural network for predicting synergistic drug combinations on cancer. Interdiscip. Sci. 16 (1), 218–230. doi:10.1007/s12539-023-00596-6
Zhang, T., Zhang, X., Wu, Z., Ren, J., Zhao, Z., Zhang, H., et al. (2024). VGAE-CCI: variational graph autoencoder-based construction of 3D spatial cell-cell communication network. Brief. Bioinform 26 (1), bbae619. doi:10.1093/bib/bbae619
Zhang, Y. Q., Yuan, H., Liu, Y. H., Xiong, S. W., Zhou, Z. G., Xu, Y. G., et al. (2025). MMGCSyn: explainable synergistic drug combination prediction based on multimodal fusion. Futur Gener. Comp. Syst. 168, 107784. doi:10.1016/j.future.2025.107784
Keywords: drug combination, deep learning, feature fusion, transformer, synergistic drug
Citation: Kang X, Liu X, Zou Q, Li T and Luo X (2025) CDFA: Calibrated deep feature aggregation for screening synergistic drug combinations. Front. Pharmacol. 16:1608832. doi: 10.3389/fphar.2025.1608832
Received: 09 April 2025; Accepted: 30 June 2025;
Published: 23 July 2025.
Edited by:
Xinyu Wang, Philadelphia College of Osteopathic Medicine (PCOM), United StatesReviewed by:
Sayed-Rzgar Hosseini, Indiana State University, United StatesYanglan Gan, Donghua University, China
Copyright © 2025 Kang, Liu, Zou, Li and Luo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Tiantian Li, bGl0aWFudGlhbkBqc3BnaC5jb20=; Ximei Luo, bHVveGltZWlAdWVzdGMuZWR1LmNu