CDFA: Calibrated deep feature aggregation for screening synergistic drug combinations

Kang, Xiaorui; Liu, Xiaoyan; Zou, Quan; Li, Tiantian; Luo, Ximei

doi:10.3389/fphar.2025.1608832

ORIGINAL RESEARCH article

Front. Pharmacol., 23 July 2025

Sec. Pharmacology of Anti-Cancer Drugs

Volume 16 - 2025 | https://doi.org/10.3389/fphar.2025.1608832

CDFA: Calibrated deep feature aggregation for screening synergistic drug combinations

XK
Xiaorui Kang ¹
XL
Xiaoyan Liu ²
QZ
Quan Zou ^1,3
TL
Tiantian Li ⁴^*
XL
Ximei Luo ^3,5^*

1. Faculty of Applied Sciences, Macao Polytechnic University, Macau, China
2. Faculty of Computing, Harbin Institute of Technology, Harbin, Heilongjiang, China
3. Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, Sichuan, China
4. Editorial Office, Geriatric Hospital of Nanjing Medical University, Nanjing, Jiangsu, China
5. Yangtze Delta Region lnstitute (Quzhou), University of Electronic Science and Technology of China, Quzhou, Zhejiang, China

Article metrics

View details

Citations

1,3k

Views

394

Downloads

Abstract

Introduction:

Drug combination therapy represents a promising strategy for addressing complex diseases, offering the potential for improved efficacy while mitigating safety concerns. However, conventional wet-lab experimentation for identifying optimal drug combinations is resource-intensive due to the vast combinatorial search space. To address this challenge, computational methods leveraging machine learning and deep learning have emerged to effectively navigate this space.

Methods:

In this study, we introduce a Calibrated Deep Feature Aggregation (CDFA) framework for screening synergistic drug combinations. Concretely, CDFA utilizes a novel cell line representation based on the protein information and gene expression capturing complementary biological determinants of drug response. Besides, a novel feature aggregation network is proposed based on the Transformer to model the intricate interactions between drug pairs and cell lines through multi-head attention mechanisms, enabling discovery of non-linear synergy patterns. Furthermore, a method is introduced to quantify and calibrate the uncertainties associated with CDFA’s predictions, enhancing the reliability of the identified synergistic drug combinations.

Results:

Experiments results have demonstrated that CDFA outperforms existing state-of-the-art deep learning models.

Discussion:

The superior performance of CDFA stems from its biologically informed cell line representation, its ability to capture complex non-linear drug-cell interactions via attention mechanisms, and its enhanced reliability through uncertainty calibration. This framework provides a robust computational tool for efficient and reliable drug combination screening.

1 Introduction

Drug combination therapy has emerged as a mainstay in the clinical treatment of various cancers (Meng et al., 2023), including lung cancer (Nair et al., 2023; Cui et al., 2024), ovarian cancer (Kong et al., 2023), and pancreatic cancer (Jaaks et al., 2022). Compared with monotherapy, combination therapies often demonstrate enhanced efficacy, reduced drug resistance, and decreased toxicity. However, it is crucial to recognize that not all drug combinations yield synergistic effects; in fact, some combinations may even exhibit antagonistic effects (Wang T. et al., 2023). For instance, the concomitant administration of antibiotics inhibiting DNA synthesis and those targeting protein synthesis can stimulate bacterial growth (Bollenbach et al., 2009). Therefore, the precise identification of synergistic drug pairs for specific cell types is essential to harness the full potential of combination therapy (Wang T. et al., 2022).

Traditional laboratory experiments to screen for synergistic drug combinations from the vast pharmacological space are often time-consuming and resource-intensive. Moreover, drug combination trials can sometimes result in side effects or harmful reactions in patients. With the growing availability of high-throughput screening data (Jiang et al., 2024; Liu et al., 2021), computational methods have emerged as efficient preclinical strategies for identifying synergistic drug combinations (Cao et al., 2024).

With the accumulation of data and the advancement of related technologies in recent decades, classical machine learning (ML)-based approaches and deep learning (DL) techniques have been employed to model drug combination trials, showing promising results by leveraging a variety of drug and cell line features. As drug combination effect prediction can be formulated as a regression or a multi-class classification task, the early ML-based methods often used the classical machine learning, such as logistic regression (LR) (H et al., 2014), support vector machine (SVM), random forests (RF) (Breiman, 2001), and extreme gradient boosting (XGboost). As early as 2014, Huang H et al. used a logistic regression model to systematically predict the drug combinations based on clinical side-effect (H et al., 2014). Pavel Sidorov et al. predicted Synergism of Cancer Drug Combinations by using NCI-ALMANAC Data based on RF and XGboost models (Sidorov et al., 2019). These methods laid the groundwork for more advanced approaches. Recently, deep learning (DL) models have shown excellent performance in bio-sequence analysis, gene regulation, and other areas, for extracting various data features and fusing heterogeneous data (Wang T. et al., 2024; Zhu et al., 2025). As the data about drugs continues to expand, most ML-based work has shifted towards deep learning (DL) models, driven by significant advancements in neural network architectures. One notable early DL model is DeepSynergy (Preuer et al., 2018), which integrates genomic data and drug information to identify drug combinations by a fully connected neural networks. Building on this foundation, newer DL models have emerged, leveraging advanced architectures like Transformers (Wang T. et al., 2024), Graph Neural Networks (GNNs) (Zhang et al., 2024), and Auto-Encoders (Zhu et al., 2025). For instance, CCSynergy (Hosseini and Zhou, 2023), GTextSy (Yan and Zheng, 2024), MMGCSyn (Zhang et al., 2025) and MatchMaker (Kuru et al., 2022) are integrated DNN with drug and cell line features. Based on Transformers models, DeepTraSynergy (Rafiei et al., 2023) and TranSynergy (Liu and Xie, 2021) were developed to learn drug representations and incorporate auxiliary knowledge through a novel neural network design. MRHGNN (Chen et al., 2025) and DeepDDS (Wang JX. et al., 2022) employ various GNNs to extract drug features by modeling drugs as graphs, capturing their structural properties. Moreover, recent research has introduced hypergraph neural networks to model complex relationships between cell lines and drug pairs (Wang W. et al., 2024; Liu et al., 2022).

In addition to neural network design, the fusion mechanism plays a crucial role in drug combination synergy prediction models. Recent studies have focused on effectively combining drug and cell line information to improve predictive accuracy. In parallel, advances in biological sequence classification have demonstrated the benefits of integrating multiple types of information. For instance, the SBSM-Pro model (Wang YZ. et al., 2024) introduces a novel multiple kernel learning strategy to combine sequence similarity measures, significantly enhancing classification performance. Similarly, DFFNDDS (Xu et al., 2023) employs two distinct neural networks to fuse drug features and cell line information from both bit-wise and vector-wise perspectives. DualSyn (Chen et al., 2024) introduces two modules to capture high-order and global information, enhancing the model’s ability to understand complex interactions. SynergyX (Guo et al., 2024) utilizes mutual-attention and self-attention mechanisms to model drug-cell and drug-drug interactions, providing a more nuanced understanding of these relationships. CircRDRP (Wang Y. et al., 2024) uses a graph neural network model to predict the association of circRNA with drug resistance by combining disease context characteristics and deep learning techniques. MMSyn (Pang et al., 2024) and AttenSyn (Wang TS. et al., 2023) leverage attention mechanisms to integrate multiple drug and cell line features, allowing the model to focus on the most relevant aspects of the data. CLCDA (Wang YT. et al., 2023) is a collaborative deep learning-based model for predicting potential associations between circRNA and disease. Despite these significant contributions, many of these approaches still rely on late fusion mechanisms, where drug and cell line features are combined at a later stage in the model. This can limit the model’s ability to fully capture the intricate interactions between drugs and cell lines. To address the limitations of late fusion mechanisms, this study proposes the Calibrated Deep Feature Aggregation (CDFA) framework–a Transformer-based architecture that enables early-stage integration of proteomic features and gene expression profiles to capture intricate drug-drug-cell interactions. The design incorporates dedicated uncertainty calibration to ensure probabilistic reliability. Experimental validation demonstrates CDFA’s fusion efficacy: comprehensive testing across two benchmark datasets (spanning diverse cell lines and tissue types) confirms both the structural effectiveness and superior generalization of our approach.

2 Materials and methods

2.1 Synergy datasets

We assessed our method using two publicly available datasets: O'Neil (O'Neil et al., 2016) and NCI-ALMANAC (Holbeck et al., 2017). The O'Neil dataset comprised 23,062 drug combination samples involving 38 drugs and 39 human cancer cell lines. The NCI-ALMANAC dataset was relatively larger, containing 304,549 data points across 104 drugs and 60 cell lines. The synergy value for each sample is represented by the Loewe and combination scores for O'Neil and NCI-ALMANAC, respectively. The characteristics of the cell lines were represented by 651 gene expression values obtained from the COSMIC database (Forbes et al., 2015). Following established preprocessing steps (Liu et al., 2022), the final datasets included 18,950 and 74,139 drug-drug-cell line combinations for O'Neil and NCI-ALMANAC, respectively. Figure 1 depicts the distribution of synergy scores for both datasets. Notably, the left side of the distribution, centered around 30, constitutes more than half of the dataset. These values correspond to the negative pairs that exhibit either additive or antagonistic effects, indicating that a significant portion of the drug combinations do not show a synergistic benefit over the individual effects of the drugs. This observation underscores the complexity of identifying truly synergistic drug pairs and highlights the importance of systematic screening and computational approaches to optimize drug combination therapies.

FIGURE 1

To train and evaluate the model, we began by randomly selecting 90% of drug pairs and cell lines from each dataset to conduct three different experimental settings: random setting, cold cell line setting, and cold drug pair setting. The remaining 10% of the samples were set aside as an independent test set to evaluate generalization performance. For the random splitting setting, we divided the samples into five equal subsets. One subset served as the test set, while the remaining four were further split into training and validation sets in a 9:1 ratio. In the cold cell line setting, all the unique cell lines were divided into five equal groups randomly. The related samples which contain the cell line from one of these groups were used for testing, while the remaining samples were split into a 9:1 ratio as the training set and validation set. This ensured that the test set included only cell lines not present in the training set. For the cold drug pair setting, drug pairs were similarly partitioned into five equal groups. Four groups were used for training, with the test set containing only those drug pairs not seen during training. This ensured that the model was tested on an entirely new pair of drugs.

2.2 Problem formulation

In this study, we formulate the synergy prediction problem as a regression task. Let denote the set of the training samples where denote the drug pair and is the cell lines, and denotes the number of training samples. Also, the corresponding synergy effect is represented by the label . The paper aims at learning a drug combination function , given a drug pair and a cell line, can generate the target value .

2.3 Drug and cell line representations

A variety of molecular representations have been employed for drug combination prediction tasks. Fingerprints, such as ECFP and MHFP, are commonly used to encode compound structures. In this study, we adopted the MinHashed Atom-Pair fingerprint extended to four bonds (MAP4) as our molecular representation. MAP4 offers a versatile approach to representing diverse chemical structures.

Gene expression profiles have been commonly employed to represent cell lines in drug combination prediction tasks. In this study, we utilized gene expression data extracted from COSMIC, represented as 651-dimensional vectors (), where each element corresponds to the expression level of a specific gene. In the most of the deep learning-based models treat the gene expression as a vector which does not satisfy the biomedical meaning which each gene expression should be treated separately. In the bio-mechanism of drug synergy, only a part of genes contributes to the synergy effect. So, we treat the 651-dimensional vectors () as a matrix . In the following work, we use the CNN to extract the important genes to simulate the bio-mechanism.

2.4 Feature encoder

The weighted gene expression representation of a cell line is fed into a cell line feature encoder to learn abstract cell line representations. This encoder comprises three convolutional layers interleaved with pooling layers. The initial convolutional layer transforms the input into feature maps, which are subsequently downsampled using max-pooling. This process is repeated three times.

The MAP4 vector representing a drug is input into a drug feature encoder to extract high-level abstract features. The encoder consists of two fully connected (FC) layers followed by Gaussian Error Linear Units (GELU) (Hendrycks and Gimpel, 2016) and batch normalization. The resulting features serve as essential inputs for subsequent fusion operations. The formulation of the drug feature encoder can be summarized as follows (Equation 1):where is one of the input features and denotes the corresponding generated feature. BN represents 1day batch normalization. represents an FC layer with neurons. During the feature extraction stage, we project drug features and cell line feature into the same dimension to obtain higher-quality information for use in the subsequent modules.

We refer to these generated drug pair features as and cell line feature as .

2.5 Deep feature aggregation module

Given the drug pair features and extracted feature of weight gene expression of cell line , we first use a global max pooling operation to obtain the global cell line feature as . We treat the drug pair features and global cell line feature as whole global features and the as the local cell line feature. The deep feature aggregation module can be decomposed into two parts: 1) global feature fusion, and 2) global to local feature fusion. Details are discussed as follows:

Global feature fusion: This process aims to integrate drug and early cell line features, followed by reinforcing the fused global features back into the local cell features. We employ a transformer encoder for global feature fusion. The core idea of the transformer encoder is the attention mechanism. An attention function maps queries (), keys (), and values () to an output as follows (Equation 2):where is the dimensionality of the query vector.

The multi-head attention mechanism consists of multiple attention heads, with each head conducting a linear transformation on the input vectors before performing the attention operation. Each attention head has its own set of trainable parameters, allowing it to potentially model an independent relationship between the input vectors. This is achieved by utilizing different parameters in the linear transformation step.

Then, for the head, three weight matrices are used to project Q, K, and V, respectively, to a lower dimension ; then, an attention function is performed (Equation 3).wherein .

Then, the output of the multi-head attention mechanism is the linear transformation of the concatenation of the output vectors acquired from the attention heads (Equation 4):where is the number of heads and is a trainable weight matrix.

Besides the attention mechanism, the transformer encoder also contains the residual and feed-forward neural network. Formally, the global feature fusion can be defined as follows (Equation 5):where and represent layer normalization and feed-forward neural network, respectively.

Global to local cell line feature fusion: Inspired by recent findings that drugs can influence the synergistic or antagonistic effects of drug combinations through modulating key gene expression (Wu et al., 2023), we incorporate a global-to-local cell line feature fusion network to simulate drug-induced gene regulation effects. The local cell line feature is enhanced through multi-head attention where global features ()are incorporated using a Transformer decoder. This enables adaptive re-weighting of gene expressions based on cross-tissue biological patterns, with layer normalization and residual connections stabilizing feature refinement. This process can be mathematically expressed as follows (Equation 6):

2.6 Synergy prediction module

The final synergy value of a drug combination is predicted using the output of the global feature fusion network () and the global-to-local cell line feature fusion network (). Specifically, is flattened into a 1D vector, and global max pooling is applied to to obtain another 1D vector. These vectors are then fed into separate multi-layer fully connected layers to refine their abstract features. Finally, the refined features are concatenated and passed through a final FC layer to predict the synergy value .

Given a training dataset, that contains samples with ground-truth synergy scores and the corresponding values predicted by our method, we can train the deep learning model in an end-to-end fashion using the mean squared error (MSE) loss as the loss function.

2.7 Uncertainty quantification

We use an ensemble method to further enhance generalization and quantify the uncertainty of the CDFA. Specifically, we trained distinct model replicas. Each replica shares the same neural network architecture and settings but uses a different initial random seed for parameter initialization. This ensures that while the models are structurally identical, they develop unique parameter values during training, leading to diverse predictions and a more robust uncertainty estimation. For every input drug combination, each model generates a predicted synergy value, denoted as . The final synergy prediction, , is determined by averaging these individual predictions. Meanwhile, the uncertainty associated with this prediction, , is quantified by calculating the standard deviation of the individual predictions from the ensemble.

2.8 Uncertainty recalibration

Calibration errors (Mervin et al., 2021) in probability estimates compromise reliability by creating discrepancies between predicted and true probabilities. Specifically, they refer to the discrepancy between the model’s predicted confidence and the actual observed frequency of correctness at that confidence level. For example, if a model assigns 80% confidence to a set of predictions, but only 70% of them are correct, this indicates a calibration error in that confidence range. Such miscalibration reduces the effectiveness of uncertainty estimates as indicators of trustworthiness in predictions.

To address this issue, a common strategy is to learn a recalibration function that adjusts the predicted uncertainties to better align with the true underlying probabilities. The recalibration function is often a non-linear uncertainty scaling function, learned using a hold-out validation dataset to create a calibration map, and is often assessed using metrics like Expected Calibration Error (ECE). In our method, we adopt a simple yet effective single-parameter scaling approach that adjusts only the uncertainty component . We achieve this by multiplying with a scaling factor , while keeping the predicted synergy value unchanged. This choice is motivated by the fact that , as the model’s point estimate, already captures the optimal synergy prediction and should not be altered during post-hoc calibration. Instead, we rescale by a positive scalar factor , resulting in the recalibrated output . The scaling factor is optimized using Brent’s method (Brent, 1971) to ensure that the recalibrated uncertainties accurately reflect the true probability of correctness. The objective is to minimize the miscalibration, quantified by ECE, on a separate validation set. This optimization ensures that the adjusted uncertainties more accurately reflect the true likelihood of correct predictions across confidence levels. The result is an uncertainty estimate that is better aligned with the model’s empirical behavior and more trustworthy for downstream decision-making.

3 Results

3.1 Overview of the CDFA framework

CDFA is an ensemble deep learning framework for predicting the potential synergy effects of drug combinations based on the drugs’ molecular information and the cells’ gene expression. The overall architecture of CDFA is shown in Figure 2. It consists of three main components: the feature encoders for the drug pair and cell line, the feature aggregation module, and the synergy prediction module. First, MAP4 is used to represent diverse chemical structures of the paired drugs. Gene expression profiles are employed to represent cell lines in drug combination prediction tasks. Then, feature encoders are used to extract these three types of features separately. A novel feature aggregation network is involved based on the Transformer which tries to capture the intricate interactions between drug pairs and cell lines. Finally, the aggregated features are connected to another synergy prediction module. The subsequent sections of this section provide detailed evidence of the superiority of this computational framework.

FIGURE 2

3.2 Comparison with existing models

To evaluate CDFA’s performance, we compared it with nine existing drug combination synergy prediction models: HypergraphSynergy, DeepSynergy, DTF, CombFM, Celebi’s method, PermuteDDS, MatchMaker, GTextSyn and MMGCSyn. We employed three common regression evaluation metrics to assess the performance of these methods: root mean squared error (RMSE), coefficient of determination (), and Pearson’s Correlation Coefficient (PCC).

As Table 1 shows, we compared CDFA’s performance with several models using the O'Neil dataset across three different experimental setups. In the random split scenario, where data is divided without specific constraints, the CDFA model outshone others with the lowest RMSE at 13.522, alongside the highest at 0.651 and PCC at 0.808. When tested on unseen cell lines (cold cell line setting), HypergraphSynergy led with the highest of 0.252 and the lowest RMSE of 19.537. However, CDFA maintained a competitive edge despite not leading in every metric. For the cold drug pair setting, where models predict outcomes for drug combinations not encountered during training, CDFA performed exceptionally well, achieving the lowest RMSE (15.976), highest (0.511), and a PCC of 0.717, demonstrating its strength in handling unseen drug pairs.

TABLE 1

	Randon split			Cold cell line setting			Cold drug pair setting
	RMSE	R2	PCC	RMSE	R2	PCC	RMSE	R2	PCC
CDFA	13.522	0.651	0.808	19.597	0.25	0.53	15.976	0.511	0.717
PermuteDDS	13.721	0.641	0.801	19.668	0.243	0.522	16.152	0.501	0.709
HypergraphSynergy	14.727	0.586	0.775	19.537	0.252	0.533	17.346	0.42	0.656
DeepSynergy	14.87	0.584	0.765	23.89	0.195	0.426	17.28	0.433	0.663
ComboFM	16.86	0.451	0.702	20.82	0.142	0.396	18.62	0.376	0.635
DTF	14.73	0.594	0.775	21.11	0.132	0.535	17.37	0.429	0.671
Celebi’s method	16.34	0.5	0.708	20.6	0.179	0.473	19.1	0.309	0.572
MatchMaker	17.4948	0.4162	0.6466	28.5376	−0.7616	0.3628	17.7172	0.399	0.6332
GTextSyn	16.231	0.497	0.709	20.866	0.144	0.457	18.186	0.367	0.625
MMGCSyn	17.138	0.439	0.69	25.754	−0.342	0.316	18.837	0.317	0.605

Performance comparison on the O’Neil dataset. Bold values indicate the best performance.

As shown in Table 2, the consistent superiority of CDFA has also been demonstrated on the NCI-ALMANAC dataset. In the random split setup, CDFA exhibited the best performance with the lowest RMSE of 41.893, highest of 0.552, and highest PCC of 0.746. Under the cold cell line condition, HypergraphSynergy performed best with an RMSE of 53.398, of 0.273, and PCC of 0.538. In the cold drug pair scenario, CDFA once again stood out, achieving the lowest RMSE (50.522), sub-optimal (0.346), and highest PCC (0.593), underscoring its effectiveness in predicting responses for novel drug combinations.

TABLE 2

	Randon split			Cold cell line setting			Cold drug pair setting
	RMSE	R2	PCC	RMSE	R2	PCC	RMSE	R2	PCC
CDFA	41.893	0.552	0.746	53.819	0.259	0.536	50.522	0.346	0.593
PermuteDDS	43.053	0.527	0.726	54.128	0.242	0.519	51.58	0.318	0.569
HypergraphSynergy	43.89	0.508	0.719	53.398	0.273	0.538	52.609	0.291	0.543
DeepSynergy	44.44	0.491	0.701	54.56	0.23	0.322	53.5	0.262	0.526
ComboFM	48.27	0.399	0.651	54.67	0.245	0.531	53.89	0.267	0.526
DTF	47.03	0.43	0.678	54.73	0.223	0.517	53.47	0.263	0.531
Celebi’s method	47.31	0.423	0.653	53.49	0.259	0.516	55.83	0.196	0.456
MatchMaker	51.7316	0.3168	0.5642	64.6824	0.3644	−0.0652	55.7034	0.2028	0.4588
GTextSyn	47.425	0.426	0.657	56.369	0.187	0.479	55.511	0.208	0.483
MMGCSyn	47.793	0.417	0.659	60.353	0.067	0.454	54.523	0.519	0.236

Performance comparison on the NCI-ALMANAC dataset. Bold values indicate the best performance.

The 10% of the samples of the O'Neil and NCI-ALMANAC datasets were set aside as an independent test set to evaluate these models’ generalization performance. In the independent test data section of the O'Neil and NCI-ALMANAC datasets, the superior performance of CDFA has been once again proven. As Table 3 shows, it illustrates the performance of various methods when applied to the independent test datasets. For the O'Neil dataset, CDFA demonstrates superior accuracy with the lowest RMSE of 15.111 and the highest of 0.660. PermuteDDS trails closely behind with similarly strong results, showing almost no difference from CDFA. On the NCI-ALMANAC dataset, CDFA retains its leadership by achieving the best RMSE at 42.307 and the highest value at 0.508, confirming its robustness in both precision and explanatory capability. Although PermuteDDS performs well, it still lags slightly behind CDFA across all metrics. The remaining methods exhibit higher RMSE figures and lower values, suggesting they are less precise and less effective compared to our method.

TABLE 3

	Randon split			Cold cell line setting
	RMSE	R2	PCC	RMSE	R2	PCC
CDFA	15.111	0.660	0.818	42.307	0.508	0.713
PermuteDDS	15.144	0.659	0.821	43.338	0.484	0.696
HypergraphSynergy	16.710	0.585	0.788	43.730	0.474	0.693
DeepSynergy	16.840	0.578	0.765	45.325	0.435	0.670
ComboFM	16.080	0.541	0.754	46.370	0.457	0.685
DTF	16.150	0.548	0.752	49.860	0.372	0.700
Celebi’s method	16.500	0.529	0.728	45.860	0.469	0.688
MatchMaker	20.725	0.361	0.6466	51.259	0.2778	0.5282
GTextSyn	18.931	0.466	0.686	48.026	0.366	0.612
MMGCSyn	19.834	0.412	0.647	48.312	0.358	0.619

Performance comparison on the independent test datasets. Bold values indicate the best performance.

Overall, CDFA consistently demonstrated strong performance, particularly excelling in the random split and cold drug pair settings. However, the poor performance of all methods in the cold cell line setting suggests that future research should focus on improving models' ability to generalize to new cell lines.

3.3 Tissue-specific analysis

Both previous studies and our own experiments have consistently demonstrated that model performance deteriorates significantly under the cold cell-line scenario, where test cell lines are entirely disjoint from those seen during training. This setting introduces substantial biological variability, making it difficult to disentangle whether performance degradation arises from tissue-specific effects or from the challenge of generalizing to unseen cell-line profiles.

To avoid this confounding factor, we also conducted a tissue-specific analysis on the O’Neil and NCI-ALMANAC datasets. The O’Neil dataset is built on testing 38 drugs on 39 cell lines representing multiple cancer types from six tissue origins. The NCI-ALMANAC dataset covers 104 drugs in 60 cell lines from nine tissue origins. As illustrated in Figures 3, 4, our analysis employs raincloud plots to visualize the distribution of MSE for the two independent test datasets. These plots combine box plots with kernel density estimates ('clouds') to visualize both the shape and central tendency of the error distributions, with outliers indicated by diamond markers.

FIGURE 3

FIGURE 4

Our analysis reveals that although the MSE values of the median, second quartile, and third quartile are low, almost all tissues included by the two datasets have MSE values exceeding 500 and 2000, respectively. This suggests that while there is a small number of higher error values across most tissues, the central tendency of the error distribution may be relatively low. This pattern indicates that the model can achieve efficient prediction across different tissues. Our analysis confirms that the presence of high-error predictions—though limited in quantity—reveals significant variability in model performance. Such findings highlight the need for further investigation into the factors contributing to these higher errors and suggest that improvements in model accuracy and consistency are necessary for more reliable predictions across different tissue types.

3.4 Uncertainty results

Figure 5 displays the calibration curves of CDFA under various settings for both the O'Neil and NCI-ALMANAC datasets. The figures are organized from left to right, representing random splits, cold cell line settings, and cold drug pair settings, respectively. The first row showcases the O'Neil dataset, whereas the second row pertains to the NCI-ALMANAC dataset. The space between the calibration curves and the diagonal line represents the miscalibration area, which quantifies the extent of uncertainty calibration. As illustrated in Figure 5, CDFA’s recalibration algorithm successfully shifts the calibration curves closer to the diagonal line, thereby reducing the miscalibration area and improving the reliability of the predictions.

FIGURE 5

Figure 6 illustrates the relationship between prediction error and uncertainty, with uncertainty measured as the standard deviation (std). In this figure, red points indicate errors that do not fall within two standard deviations, while black and blue points represent errors that fall within one and two standard deviations, respectively. It is evident that the majority of the observed errors lie within two standard deviations, reflecting a reasonable alignment between the model’s predicted uncertainty and its actual prediction error.

FIGURE 6

4 Conclusion

This study introduces an ensemble deep learning framework for predicting the potential synergy effects of drug combinations, showcasing superior performance relative to existing methods. A key innovation is the dual-level feature fusion mechanism, which integrates deep semantic features from various network modules, enhancing the model’s ability to capture complex interactions. The model leverages convolutional processing of the gene expression matrix to identify key gene signals relevant to drug response. Combined with a Transformer-based attention mechanism, this architecture enables context-aware re-weighting of gene importance under specific drug–cell interactions. This design emulates biological processes where only a subset of genes contribute significantly to the synergistic effect of drug combinations. Furthermore, the model’s prediction errors demonstrate robust generalization across tissues, as reflected in the consistent error distributions observed across different tissue types. Isolated high-error samples may correspond to biologically unique or complex cell lines, offering potential avenues for future investigation. Uncertainty estimation is integrated into the model, providing a critical safeguard against biased or overconfident predictions. This feature is especially valuable in guiding both the refinement of known synergies and the exploration of novel drug combinations. Additionally, the uncertainty estimation is integrated into the model, providing a critical safeguard against biased or overconfident predictions. This feature is especially valuable in guiding both the refinement of known synergies and the exploration of novel drug combinations. The uncertainty quantification and recalibration processes ensure that the model’s predictions are not only accurate but also reliable, offering a balanced approach to decision-making. While the experimental results demonstrate excellent performance on two datasets, further investigation is needed to assess the model’s robustness and generalization capabilities, particularly in scenarios involving new cell lines. Enhancing the interpretability of the model is another important area for future research, as it can provide deeper insights into the mechanisms underlying drug synergy and facilitate broader acceptance.

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/supplementary material. All codes of CDFA can be accessed from https://github.com/TracyHIT/CDFA.

Author contributions

XK: Formal Analysis, Data curation, Writing – original draft. XyL: Writing – review and editing. QZ: Formal Analysis, Data curation, Writing – review and editing, Funding acquisition. TL: Writing – original draft. XmL: Formal Analysis, Data curation, Writing – review and editing, Funding acquisition.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This research was funded by the National Science and Technology Major Project (Grant No. 2022ZD0117700), and the National Natural Science Foundation of China (Grant No.62371347 and 62271174). The submission code of Macao Polytechnic University is fca.c4de.e9f0.1.

Acknowledgments

We sincerely appreciate the dedicated efforts of Ximei Luo and XK in primary data collection and analysis, with valuable assistance from QZ. Special thanks go to TL and XK for leading the manuscript writing, as well as Xiaoyan Liu for her meticulous proofreading and constructive suggestions. We also extend our gratitude to all colleagues and collaborators who provided insightful discussions and technical support throughout this research.

Finally, we thank the reviewers for their insightful comments, which helped improve the quality of this manuscript.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1
BollenbachT.QuanS.ChaitR.KishonyR. (2009). Nonoptimal microbial response to antibiotics underlies suppressive drug interactions. Cell139 (4), 707–718. 10.1016/j.cell.2009.10.025
- CrossRef
- Google Scholar
2
BreimanL. (2001). Random forests. Mach. Learn.45, 5–32. 10.1023/a:1010933404324
- CrossRef
- Google Scholar
3
BrentR. P. (1971). An algorithm with guaranteed convergence for finding a zero of a function. Comput. J.14 (4), 422–425. 10.1093/comjnl/14.4.422
- CrossRef
- Google Scholar
4
CaoC.WangC.DaiQ.ZouQ.WangT. (2024). CRBPSA: circrna-rbp interaction sites identification using sequence structural attention model. BMC Biol.22, 260. 10.1186/s12915-024-02055-0
- CrossRef
- Google Scholar
5
ChenM. J.ZhangM.YanG. Y.WangG. H.QuC. Q. (2025). MRHGNN: enhanced multimodal relational hypergraph neural network for synergistic drug combination forecasting. IEEE Trans. Neural Netw. Learn Syst., 1–13. 10.1109/TNNLS.2025.3553385
- CrossRef
- Google Scholar
6
ChenZ. H.LiZ. M.ShenX. Z.LiuY. S.LinX.ZengD. J.et al (2024). DualSyn: a dual-level feature interaction method to predict synergistic drug combinations. Expert Syst. Appl.257, 125065. 10.1016/j.eswa.2024.125065
- CrossRef
- Google Scholar
7
CuiX.LinQ.ChenM.WangY.WangY.WangY.et al (2024). Long-read sequencing unveils novel somatic variants and methylation patterns in the genetic information system of early lung cancer. Comput. Biol. Med.171, 108174. 10.1016/j.compbiomed.2024.108174
- CrossRef
- Google Scholar
8
ForbesS. A.BeareD.GunasekaranP.LeungK.BindalN.BoutselakisH.et al (2015). COSMIC: exploring the world's knowledge of somatic mutations in human cancer. Nucleic Acids Res.43 (D1), D805–D811. 10.1093/nar/gku1075
- CrossRef
- Google Scholar
9
GuoY.HuH. T.ChenW. B.YinH.WuJ.HsiehC. Y.et al (2024). SynergyX: a multi-modality mutual attention network for interpretable drug synergy prediction. Briefings Bioinforma.25 (2), 15. 10.1093/bib/bbae015
- CrossRef
- Google Scholar
10
HendrycksD.GimpelK. (2016). “Gaussian error linear units (gelus),” in arXiv preprint arXiv:160608415.
- Google Scholar
11
HolbeckS. L.CamalierR.CrowellJ. A.GovindharajuluJ. P.HollingsheadM.AndersonL. W.et al (2017). The national cancer institute ALMANAC: a comprehensive screening resource for the detection of anticancer drug pairs with enhanced therapeutic activity. Cancer Res.77 (13), 3564–3576. 10.1158/0008-5472.CAN-17-0489
- CrossRef
- Google Scholar
12
HosseiniS. R.ZhouX. B. (2023). CCSynergy: an integrative deep -learning framework enabling context -aware prediction of anti -cancer drug synergy. Brief. Bioinform24 (1), bbac588. 10.1093/bib/bbac588
- CrossRef
- Google Scholar
13
HuangH.ZhangP.QuX. A.SanseauP.YangL. (2014). Systematic prediction of drug combinations based on clinical side-effects. Sci. Rep.4 (1), 7160. 10.1038/srep07160
- CrossRef
- Google Scholar
14
JaaksP.CokerE. A.VisD. J.EdwardsO.CarpenterE. F.LetoS. M.et al (2022). Effective drug combinations in breast, Colon and pancreatic cancer cells. Nature603 (7899), 166–173. 10.1038/s41586-022-04437-2
- CrossRef
- Google Scholar
15
JiangT.GuoH. Z.LiuY. D.LiG. Y.CuiZ.CuiX. R.et al (2024). A comprehensive genetic variant reference for the Chinese population. Sci. Bull.69 (24), 3820–3825. 10.1016/j.scib.2024.06.017
- CrossRef
- Google Scholar
16
KongS.MoharilP.Handly-SantanaA.BoehnkeN.PanayiotouR.GomerdingerV.et al (2023). Synergistic combination therapy delivered via layer-by-layer nanoparticles induces solid tumor regression of ovarian cancer. Bioeng. Transl. Med.8 (2), e10429. 10.1002/btm2.10429
- CrossRef
- Google Scholar
17
KuruH. I.TastanO.CicekA. E. (2022). MatchMaker: a deep learning framework for drug synergy prediction. IEEE-ACM Trans. Comput. Biol. Bioinform19 (4), 2334–2344. 10.1109/TCBB.2021.3086702
- CrossRef
- Google Scholar
18
LiuQ.XieL. (2021). TranSynergy: mechanism-Driven interpretable deep neural network for the synergistic prediction and pathway deconvolution of drug combinations. PLoS Comput. Biol.17 (2), e1008653. 10.1371/journal.pcbi.1008653
- CrossRef
- Google Scholar
19
LiuX.SongC. Z.LiuS. C.LiM. L.ZhouX. H.ZhangW. (2022). Multi-way relation-enhanced hypergraph representation learning for anti-cancer drug synergy prediction. Bioinformatics38 (20), 4782–4789. 10.1093/bioinformatics/btac579
- CrossRef
- Google Scholar
20
LiuY. D.JiangT.GaoY.LiuB.ZangT. Y.WangY. D. (2021). Psi-caller: a lightweight short read-based variant caller with high speed and accuracy. Front. Cell Dev. Biol.9, 11. 10.3389/fcell.2021.731424
- CrossRef
- Google Scholar
21
MengP.WangG. H.GuoH. Z.JiangT. (2023). Identifying cancer driver genes using a two-stage random walk with restart on a gene interaction network. Comput. Biol. Med.158, 106810. 10.1016/j.compbiomed.2023.106810
- CrossRef
- Google Scholar
22
MervinL. H.JohanssonS.SemenovaE.GiblinK. A.EngkvistO. (2021). Uncertainty quantification in drug design. Drug Discov. Today26 (2), 474–489. 10.1016/j.drudis.2020.11.027
- CrossRef
- Google Scholar
23
NairN. U.GreningerP.ZhangX. H.FriedmanA. A.AmzallagA.CortezE.et al (2023). A landscape of response to drug combinations in non-small cell lung cancer. Nat. Commun.14 (1), 3830. 10.1038/s41467-023-39528-9
- CrossRef
- Google Scholar
24
O'NeilJ.BenitaY.FeldmanI.ChenardM.RobertsB.LiuY. P.et al (2016). An unbiased oncology compound screen to identify novel combination strategies. Mol. Cancer Ther.15 (6), 1155–1162. 10.1158/1535-7163.MCT-15-0843
- CrossRef
- Google Scholar
25
PangY.ChenY. H.LinM. J.ZhangY. H.ZhangJ. Q.WangL. (2024). MMSyn: a new multimodal deep learning framework for enhanced prediction of synergistic drug combinations. J. Chem. Inf. Model64 (9), 3689–3705. 10.1021/acs.jcim.4c00165
- CrossRef
- Google Scholar
26
PreuerK.LewisR. P. I.HochreiterS.BenderA.BulusuK. C.KlambauerG. (2018). DeepSynergy: predicting anti-cancer drug synergy with deep learning. Bioinformatics34 (9), 1538–1546. 10.1093/bioinformatics/btx806
- CrossRef
- Google Scholar
27
RafieiF.ZeraatiH.AbbasiK.GhasemiJ. B.ParsaeianM.Masoudi-NejadA. (2023). DeepTraSynergy: drug combinations using multimodal deep learning with transformers. Bioinformatics39 (8), btad438. 10.1093/bioinformatics/btad438
- CrossRef
- Google Scholar
28
SidorovP.NaulaertsS.Arley-BonnetJ.PasquierE.BallesterP. J. (2019). Predicting synergism of cancer drug combinations using NCI-ALMANAC data. Front. Chem.7, 13. 10.3389/fchem.2019.00509
- CrossRef
- Google Scholar
29
WangJ. X.LiuX. J.ShenS. Y.DengL.LiuH. (2022b). DeepDDS: deep graph neural network with attention mechanism to predict synergistic drug combinations. Briefings Bioinforma.23 (1), bbab390. 10.1093/bib/bbab390
- CrossRef
- Google Scholar
30
WangT.RenteriaM. E.PengJ. (2022a). Editorial: data mining and statistical methods for knowledge discovery in diseases based on multimodal omics. Front. Genet.13, 895796. 10.3389/fgene.2022.895796
- CrossRef
- Google Scholar
31
WangT.ShuH.HuJ. L.WangY. T.ChenJ.PengJ. J.et al (2024a). Accurately deciphering spatial domains for spatially resolved transcriptomics with stCluster. Brief. Bioinform25 (4), bbae329. 10.1093/bib/bbae329
- CrossRef
- Google Scholar
32
WangT.YangJ.XiaoY.WangJ.WangY.ZengX.et al (2023a). DFinder: a novel end-to-end graph embedding-based method to identify drug–food interactions. Bioinformatics39 (1), btac837. 10.1093/bioinformatics/btac837
- CrossRef
- Google Scholar
33
WangT. S.WangR. H.WeiL. Y. (2023b). AttenSyn: an attention-based deep graph neural network for anticancer synergistic drug combination prediction. J. Chem. Inf. Model64 (7), 2854–2862. 10.1021/acs.jcim.3c00709
- CrossRef
- Google Scholar
34
WangW.YuanG.WanS.ZhengZ.LiuD.ZhangH.et al (2024b). A granularity-level information fusion strategy on hypergraph transformer for predicting synergistic effects of anticancer drugs. Briefings Bioinforma.25 (1), bbad522. 10.1093/bib/bbad522
- CrossRef
- Google Scholar
35
WangY.ShenW.ShenY.FengS.WangT.ShangX.et al (2024d). Integrative graph-based framework for predicting circRNA drug resistance using disease contextualization and deep learning. IEEE J. Biomed. health Inf.1–12. 10.1109/JBHI.2024.3457271
- CrossRef
- Google Scholar
36
WangY. T.LiuX. M.ShenY. W.SongX. R.WangT.ShangX. Q.et al (2023c). Collaborative deep learning improves disease-related circRNA prediction based on multi-source functional information. Briefings Bioinforma.24 (2), bbad069. 10.1093/bib/bbad069
- CrossRef
- Google Scholar
37
WangY. Z.ZhaiY. X.DingY. J.ZouQ. (2024c). SBSM-Pro: support bio-sequence machine for proteins. Sci. China-Information Sci.67 (11), 212106. 10.1007/s11432-024-4171-9
- CrossRef
- Google Scholar
38
WuL.GaoJ.ZhangY.SuiB.WenY.WuQ.et al (2023). A hybrid deep forest-based method for predicting synergistic drug combinations. Cell Rep. methods.3 (2), 100411. 10.1016/j.crmeth.2023.100411
- CrossRef
- Google Scholar
39
XuM. D.ZhaoX. W.WangJ. Y.FengW.WenN. F.WangC. Y.et al (2023). DFFNDDS: prediction of synergistic drug combinations with dual feature fusion networks. J. Cheminformatics15 (1), 33. 10.1186/s13321-023-00690-3
- CrossRef
- Google Scholar
40
YanS. Y.ZhengD. (2024). A deep neural network for predicting synergistic drug combinations on cancer. Interdiscip. Sci.16 (1), 218–230. 10.1007/s12539-023-00596-6
- CrossRef
- Google Scholar
41
ZhangT.ZhangX.WuZ.RenJ.ZhaoZ.ZhangH.et al (2024). VGAE-CCI: variational graph autoencoder-based construction of 3D spatial cell-cell communication network. Brief. Bioinform26 (1), bbae619. 10.1093/bib/bbae619
- CrossRef
- Google Scholar
42
ZhangY. Q.YuanH.LiuY. H.XiongS. W.ZhouZ. G.XuY. G.et al (2025). MMGCSyn: explainable synergistic drug combination prediction based on multimodal fusion. Futur Gener. Comp. Syst.168, 107784. 10.1016/j.future.2025.107784
- CrossRef
- Google Scholar
43
ZhuP. F.ShuH.WangY. T.WangX. F.ZhaoY.HuJ. L.et al (2025). MAEST: accurately spatial domain detection in spatial transcriptomics with graph masked autoencoder. Brief. Bioinform26 (2), bbaf086. 10.1093/bib/bbaf086
- CrossRef
- Google Scholar

Summary

Keywords

drug combination, deep learning, feature fusion, transformer, synergistic drug

Citation

Kang X, Liu X, Zou Q, Li T and Luo X (2025) CDFA: Calibrated deep feature aggregation for screening synergistic drug combinations. Front. Pharmacol. 16:1608832. doi: 10.3389/fphar.2025.1608832

Received

09 April 2025

Accepted

30 June 2025

Published

23 July 2025

Volume

16 - 2025

Edited by

Xinyu Wang, Philadelphia College of Osteopathic Medicine (PCOM), United States

Reviewed by

Sayed-Rzgar Hosseini, Indiana State University, United States

Yanglan Gan, Donghua University, China

Updates

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Tiantian Li, litiantian@jspgh.com; Ximei Luo, luoximei@uestc.edu.cn

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Pharmacology of Anti-Cancer Drugs

ORIGINAL RESEARCH article

CDFA: Calibrated deep feature aggregation for screening synergistic drug combinations

Abstract

1 Introduction