Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Immunol., 13 January 2026

Sec. Cancer Immunity and Immunotherapy

Volume 16 - 2025 | https://doi.org/10.3389/fimmu.2025.1658122

This article is part of the Research TopicInnovations in Cancer Imaging and Radiomics through Explainable Artificial IntelligenceView all 7 articles

Deep learning-based multimodal approach for non-invasive prediction and prognostic analysis of immune and angiogenic biomarkers in extrahepatic cholangiocarcinoma

Jiong Liu,&#x;Jiong Liu1,2†Zhitao Cheng&#x;Zhitao Cheng3†Ruidan Yang,Ruidan Yang1,2Yinfei Fan,Yinfei Fan1,2Yue ShuYue Shu4Yong Tang,*Yong Tang3,5*Jian Shu,*Jian Shu1,2*
  • 1Department of Radiology, The Affiliated Hospital of Southwest Medical University, Luzhou, Sichuan, China
  • 2Precision Imaging and Intelligent Analysis Key Laboratory of Luzhou, Luzhou, Sichuan, China
  • 3School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, China
  • 4Department of Oncology, The Affiliated Hospital of Southwest Medical University, Luzhou, China
  • 5Institute of Intelligent Chinese Medicine, Chongqing University of Chinese Medicine, Chongqing, China

Objective: This study aims to develop a deep learning (DL)-based multimodal framework that integrates magnetic resonance imaging (MRI), clinical, and laboratory data to predict programmed death ligand 1 (PD-L1) and vascular endothelial growth factor (VEGF) expression in Extrahepatic cholangiocarcinoma (eCCA) patients and assess the prognostic value.

Methods: A retrospective cohort study involving 96 patients with eCCA was conducted across two institutions. A total of 16050 raw MRI images (11505 T1WI, 2371 T2WI, 2372 DWI) and 1570 tumor-containing images (990 T1WI, 289 T2WI, 291 DWI) were analyzed. Radiomic feature extraction was performed manually segmented tumor regions from MRI scans. The multimodal DL framework integrated DL features extracted from images and radiomic features as well as clinical-laboratory features through a repeated attention mechanism. Prognostic stratification was performed using Cox regression analysis to predict overall survival (OS) and evaluate the clinical utility of the model.

Results: The DL framework demonstrated moderate predictive performance for PD-L1 expression (AUC = 0.71) and good predictive capability for VEGF expression (AUC = 0.85) in the test cohort. The combination of DL-based imaging features and radiomic data outperformed single-modality approaches. Prognostic analysis revealed significant associations of model-predicted PD-L1 and VEGF expression with OS in eCCA patients. The Cox model-based nomogram demonstrated significant survival stratification (p = 0.006), with performance comparable to traditional immunohistochemistry-based methods.

Conclusion: Our findings highlighted the potential of integrating DL and radiomics for non-invasive, preoperative biomarker profiling, offering a promising tool for personalized treatment strategies and improved clinical decision-making in eCCA.

1 Introduction

Extrahepatic cholangiocarcinoma (eCCA) is a rare malignant epithelial tumor originating from the perihilar and distal bile ducts. It is characterized by high invasiveness, heterogeneity, and poor prognosis (1). Accounting for 70%–90% of all CCA cases, eCCA has a higher global incidence compared to intrahepatic cholangiocarcinoma (iCCA), at approximately 1.02 per 100,000 population (2, 3). Due to subtle and non-specific early symptoms, approximately 80% of eCCA patients are diagnosed at advanced stages, thus missing the optimal window for curative surgery (4). Even for those undergoing R0 resection, the recurrence rate remains as high as 75%, and the five-year overall survival (OS) rate is only about 5% for patients with advanced disease (5, 6). Given the high mortality rates of eCCA, there is an urgent clinical need to develop novel therapeutic approaches and identify effective biomarkers capable of predicting patient prognosis and treatment response.

In recent years, immunotherapy and targeted therapy have emerged as promising options for advanced eCCA. Programmed death ligand 1 (PD-L1), a critical molecule in tumor immune evasion, inhibits T-cell activation by binding to programmed death protein 1 (PD-1), allowing tumor cells to evade immune surveillance (7). Similarly, angiogenesis, mediated by vascular endothelial growth factor (VEGF), plays a crucial role in tumor growth and metastasis. Elevated VEGF expression has been found strongly associated with poor prognosis in eCCA, making it a potential biomarker for anti-angiogenic therapies (810). Furthermore, the interaction between angiogenesis and tumor immune microenvironment (TIME) suggests that a dual blockade of PD-L1/PD-1 immune checkpoints and VEGF-mediated angiogenesis may synergistically improve treatment outcomes (1113).

Despite these advancements, the preoperative identification of PD-L1 and VEGF expression in eCCA remains challenging. Immunohistochemical staining, currently the gold standard for biomarker detection, relies on tumor specimens obtained through invasive procedures, which limits its practicality for repeated or dynamic assessments. Magnetic resonance imaging (MRI), a widely used modality for preoperative diagnosis and staging of eCCA (14), traditionally focused on morphological features, limiting its utility in capturing molecular or immune characteristics. However, advances in radiomics and deep learning (DL) have demonstrated great potential for the non-invasive prediction of molecular biomarkers in various tumor types (15, 16). By extracting and analyzing high-dimensional data from imaging modalities, DL models can uncover subtle patterns related to the tumor microenvironment, facilitating comprehensive biomarker accessment without invasive procedures.

This study aims to develop a DL-based multimodal framework that combines MRI radiomic features with clinical and laboratory data to predict PD-L1 and VEGF expression as well as survival prognosis in eCCA patients. By providing a non-invasive, preoperative biomarker assessment tool, this study aims to improve personalized treatment strategies and predict patient survival outcomes, ultimately enhancing clinical decision-making and patient prognosis.

2 Materials and methods

2.1 Patient enrollment

A retrospective analysis was conducted in eCCA patients diagnosed at Institution A between January 2011 and December 2021, and at Institution B between January 2021 and December 2023. The inclusion criteria were: (1) postoperatively confirmed eCCA through pathological tissue biopsy; (2) availability of complete preoperative multi-sequence MRI images; and (3) comprehensive clinical data. The exclusion criteria were: (1) poor image quality with significant artifacts; (2) lesions too small (short axis < 5 mm) to accurately delineate tumor boundaries; and (3) patients who had undergone any prior treatments (such as surgery, chemotherapy, or interventional therapy) before MRI scanning. A total of 96 eCCA patients meeting the inclusion and exclusion criteria from both centers were included (Figure 1). Due to the relatively small sample size from each center, patient data from both centers were combined into a unified dataset to increase the reliability of statistical inferences. This study was approved by the Ethics Committee of the Institution A (KY2023041), and the requirement for informed consent was waived.

Figure 1
Flowchart depicting patient selection for a study. Institution A initially had 141 eCCA patients; exclusions resulted in 87 patients. Institution B had 21 patients; exclusions left 9 patients. Total of 96 patients enrolled. Patients were categorized into PD-L1 positive (47), PD-L1 negative (49), VEGF positive (46), and VEGF negative (50).

Figure 1. Flowchart of the patient selection process.

In this study, 15 common clinical features and laboratory indicators were analyzed, including patient age, gender, tumor location, and key biomarkers such as carbohydrate antigen 19-9 (CA19-9), CA50, carcinoembryonic antigen (CEA), alpha-fetoprotein (AFP), and ferritin (FER). Additionally, liver function markers including alkaline phosphatase (ALP), γ-glutamyl transpeptidase (γ-GTP), aspartate aminotransferase (AST), alanine aminotransferase (ALT), total bilirubin (TBIL), and direct bilirubin (DBIL) were included, along with systemic inflammatory markers such as the neutrophil-to-lymphocyte ratio (NLR).

2.2 Evaluation of PD-L1 and VEGF expression

The expression of PD-L1 and VEGF was detected using the immunohistochemistry (IHC) Envision method (Figure 2). The primary antibodies used were the PD-L1 SP142 antibody (Beijing Zhongshan Golden Bridge Biotechnology Co., Ltd.) and the VEGF antibody (Luzhou Keyang Biotechnology Co., Ltd.). The evaluation of both markers was interpreted by pathologists with over 10 years of experience in pathological diagnosis.

Figure 2
Microscopic immunohistochemical images of tissue sections at ×200 magnification are shown in four panels labeled a through d. Panel a shows PD-L1–negative staining with minimal chromogenic signal. Panel b shows PD-L1–positive staining with increased brown coloration. Panel c shows VEGF-negative staining with low staining intensity. Panel d shows VEGF-positive staining with more prominent brown staining. The panels illustrate variations in staining intensity and distribution across markers.

Figure 2. Immunohistochemical Staining (×200 Magnification). (a) PD-L1 Negative; (b) PD-L1 Positive; (c) VEGF Negative; (d) VEGF Positive.

PD-L1 expression was quantified using the Combined Positive Score (CPS), which was calculated by dividing the total number of PD-L1-positive tumor cells and surrounding immune cells by 100 tumor cells. Given the low expression rate of PD-L1 in eCCA patients and in reference to related studies on other malignancies, a CPS ≥ 1 was classified as weakly positive, while a CPS ≥ 50 was categorized as strongly positive. Both groups were collectively defined as PD-L1 positive, whereas patients with a CPS < 1 were classified as PD-L1 negative.

VEGF expression was determined by cytoplasmic staining ranging from light yellow to brown, and the percentage of positive cells relative to the total number of cells was calculated. No positive cells (0%) indicated no VEGF expression. A positive cell percentage below 25% was considered weakly positive, between 25% and 50% was moderately positive, and more than 50% was strongly positive. Finally, patients with no or weak VEGF expression were classified as VEGF negative, while others were categorized as VEGF positive.

2.3 MRI protocol

Institution A conducted standard upper abdominal imaging using a 3.0T whole-body superconducting MRI system (Achieva, Philips, Netherlands) equipped with a 16-channel phased-array torso coil. Similarly, Institution B performed comparable scans utilizing a 3.0T whole-body superconducting MRI system (Skyra, Siemens, Germany) with an 18-channel phased-array torso coil. The imaging protocols in both institutions were designed to comprehensively cover the biliary system. Core sequences included axial T1-weighted imaging (T1WI), axial T2-weighted imaging with spectral attenuated inversion recovery (T2WI-SPAIR), and axial diffusion-weighted imaging (DWI), among others. Detailed scanning parameters are summarized in Table 1.

Table 1
www.frontiersin.org

Table 1. Scanning parameters for upper abdominal MRI in both institutions.

2.4 Lesion segmentation

All MRI images were retrieved and stored in the Picture Archiving and Communication Systems (PACS), and subsequently imported into the open-source software 3D-Slicer (version 4.11, https://www.slicer.org/). Two radiologists (R1 and R2), both with more than five years of experience, confirmed the lesions and established the criteria for delineating the volume of interest (VOI) for the whole tumor: (1) the tumor was outlined slice-by-slice along its borders in the three imaging sequences; (2) internal tumor components such as vessels, necrotic cysts, and hemorrhage were included; (3) surrounding vessels and adjacent dilated bile ducts were excluded (Figure 3).

Figure 3
Flowchart illustrating a medical imaging process for eCCA patients across two institutions. The process includes patient enrollment, image processing with multisequence MRI images, lesion ROI segmentation, and radiomics feature extraction. Data set splitting includes details about PD-L1 and VEGF images. A multimodal fusion framework integrates tumor mask attention using ResNet 18 for feature extraction, with a classifier for the result. ResNet 18 details include convolution layers and residual blocks.

Figure 3. Study schematic.

2.5 Radiomic feature extraction and consistency assessment

Before feature extraction, all images were resampled into 3mm×3mm×3mm voxel size to standardize the imaging data. Additionally, the “bin width” parameter was set to 25, and the gray-level co-occurrence matrix (GLCM) feature was forced to symmetry. Using the Pyradiomics package, radiomic features were extracted from the three MRI sequences for each patient, including 19 first-order statistical features, 16 three-dimensional shape features, 75 texture features, and high-order features derived from wavelet and LoG filters. All features adhered to the Image Biomarker Standardization Initiative (IBSI) guidelines. In total, 851 T1WI features, 1316 T2WI features, and 1316 DWI features were extracted for each patient (Figure 3).

To assess the stability of the extracted radiomic features, images from 20 randomly selected eCCA patients were used for consistency testing across the three MRI sequences. Two radiologists (R1 and R2) outlined the tumors independently, and radiologist R1 repeated the delineation after one week. The intraclass correlation coefficient (ICC) was calculated using Python (version 3.7, https://www.python.org/), with a two-way random-effects model and absolute agreement to evaluate both inter-observer and intra-observer reliability. Radiomic features with ICC values ≥ 0.75 for both inter-observer and intra-observer assessments were retained. This stability assessment was performed independently for each of the three MRI sequences (T1WI, T2WI, DWI). After ICC analysis, 502 T1WI features, 1113 T2WI features, and 1088 DWI features were retained, resulting in a total of 2703 radiomic features.

2.6 Multimodal DL framework construction

Our DL architecture integrated three distinct data modalities: multi-sequence MRI images, radiomic features, and clinical parameters. For imaging analysis, we employed the ResNet18 backbone to extract high-dimensional features from both the original images and their corresponding segmented tumor regions across all sequences. A total of 16050 raw MRI images (11505 T1WI, 2371 T2WI, 2372 DWI) and 1570 tumor-containing images (990 T1WI, 289 T2WI, 291 DWI) were analyzed. Patient-specific feature aggregation was achieved through the global average pooling of tumor region features per sequence, generating a 1000-dimensional latent representation. These imaging embeddings were subsequently combined with radiomic features and clinic-laboratory variables through a hierarchical feature fusion network, forming a combined, high-dimensional vector of 5718 features. This combined vector was then fed into the hierarchical feature fusion network. The fusion network is composed of multiple dense (fully-connected) layers forming a bottleneck structure (5718 features → 2048 → 512 → 512 → 64), and employs strong regularization (e.g., L2 weight decay). Furthermore, to address the dimensional disparity between modalities while preserving critical clinical information, we implemented a repeated attention mechanism that dynamically reweights clinical features at multiple network stages. This architecture compels the model to learn a compact, low-dimensional representation by automatically learning to assign low weights to redundant or uninformative features during training. This in-model, non-linear dimensionality reduction is an intrinsic property of the DL architecture and is designed to mitigate the risk of overfitting from the high-dimensional inputs. The final classification layer employs sigmoid activation for binary outcome prediction. For each biomarker prediction task, we implemented a stratified random sampling strategy: 10 PD-L1 positive and 10 PD-L1 negative patients were reserved as an independent test cohort for PD-L1 modeling, with the remaining cases undergoing four-fold cross-validation on the training cohort. To further improve the robustness and generalizability of the DL model, oversampling-based data augmentation techniques were applied to the multi-sequence MRI images. An identical approach was applied to VEGF prediction. The overall framework was illustrated in Figure 3.

To evaluate the contributions of automatically extracted DL features and manually designed radiomic features, we conducted two ablation experiments. The first experiment combined clinical features with raw imaging data, while the second integrated clinical and radiomic features without imaging data. All ablation models maintained identical architectural hyperparameters and training protocols to isolate modality-specific effects. Performance metrics were compared using DeLong’s test for receiver operating characteristic (ROC) curves.

2.7 Prognostic stratification

The entire cohort was used for prognostic analysis to maximize the statistical capability and ensure the clinical generalizability. All patients were followed up by telephone and medical record system queries. Survival outcomes were rigorously defined in accordance with RECIST 1.1 guidelines for oncology endpoints. OS was calculated from the date of pathological diagnosis to death from any causes, with surviving patients censored at last confirmed follow-up. To assess the combined predictive role of PD-L1 and VEGF, we utilized prediction scores from our multimodal framework in a Cox proportional hazards regression analysis. These scores, along with survival status and OS time, formed the basis of a Cox model predicting 1-, 2-, and 3-year OS, presented through nomograms for personalized survival predictions.

Using X-tile software (version 3.6.1, Yale University), optimal cutoff values were determined from nomogram scores to classify patients into high-risk and low-risk groups. Kaplan-Meier (KM) curves were compared with survival outcomes between these groups, with log-rank tests confirming statistical significance. To validate our stratification approach biologically, we developed a parallel Cox model using IHC-confirmed PD-L1 and VEGF status. Patients were similarly stratified, and KM curves with log-rank tests were used to compare survival outcomes. This allowed comparison between DL-based and traditional IHC-based risk stratification.

2.8 Statistical analysis

Statistical analyses of all patient clinical information, laboratory data, and pathological results were performed using SPSS software (version 27.0, IBM). Continuous variables were described as mean ± standard deviation (x̅ ± sd) if normally distributed; otherwise, they were reported as median (interquartile range, IQR). Independent t-tests were applied for normally distributed variables, while the Mann-Whitney U test was used for non-normally distributed variables. Categorical variables were described as numbers and percentages, and the Pearson's Chi-squared test was used for analysis. All statistical tests were two-sided, with a p-value < 0.05 considered statistically significant. ROC curves were used to evaluate the predictive performance of the model, including the area under the curve (AUC) and accuracy. Additionally, calibration curves and decision curve analysis were plotted to assess the model goodness-of-fit and clinical net benefit.

3 Results

3.1 Patient characteristics

The study cohort consisted of 96 eCCA patients, including 47 PD-L1 positive and 49 PD-L1 negative cases. The median age was 59 years (range 50–65) for PD-L1 positive patients and 58 years (range 53–62) for PD-L1 negative patients. There was no significant gender difference between the groups (p = 0.863). Tumor location was significantly different between the PD-L1 groups, with more patients in the PD-L1 positive group having distal CCA (76.6% vs. 57.1%, p = 0.043).

For VEGF expression, 46 patients were classified as VEGF positive, and 50 were classified as VEGF negative. The median age of VEGF positive patients was 56 years (range 50–62), while VEGF negative patients had a median age of 62 years (range 54–67), with a significant difference (p = 0.026). Tumor location also varied between VEGF groups, with a higher proportion of VEGF positive patients presenting with distal CCA (65.2% vs. 50%, p = 0.014). No significant differences were observed in all tumor markers and liver function indicators. Patients with PD-L1 positivity and VEGF negativity tended to have longer median survival times, though survival status and OS times did not significantly differ between the PD-L1 and VEGF groups. Baseline characteristics were summarized in Table 2.

Table 2
www.frontiersin.org

Table 2. Baseline characteristics comparison.

3.2 Multimodal DL framework performance

The multimodal DL model showed varying performance in predicting PD-L1 (Figure 4) and VEGF (Figure 5) expression. The ROC curve analysis revealed an AUC of 0.71 (95% CI, 0.670–0.749) for PD-L1 in the test cohort, indicating moderate performance. In contrast, the prediction for VEGF expression demonstrated higher accuracy, with an AUC of 0.85 (95% CI, 0.810–0.869) in the test cohort, reflecting strong predictive capability. In ablation experiments, the model that combined both imaging data and radiomic features outperformed models utilizing only one modality. Furthermore, DeLong’s test demonstrated that the differences between the models were statistically significant (Table 3). The calibration curves for the PD-L1 and VEGF tasks demonstrated good agreement with the actual outcomes, and decision curve analysis indicated that the use of the multimodal DL model nomogram provided substantial clinical net benefit across a wide range of decision thresholds.

Figure 4
Figure 4 presents model outputs for PD-L1 status prediction. Panel a shows a violin plot illustrating the distribution of predicted scores for two classes. Panels b through e display performance evaluation plots in the test cohort, including a receiver operating characteristic curve, a confusion matrix, a calibration curve comparing predicted and observed probabilities, and a decision curve analysis showing net benefit across different threshold probabilities.

Figure 4. Prediction of PD-L1 status. (a) Violin plot showing the distribution of predicted scores for PD-L1 positive (1) and negative (0) cases. (b–e) Performance evaluation of PD-L1 prediction in the test cohort: (b) ROC curve, (c) Confusion matrix, (d) Calibration curve, (e) Decision curve analysis.

Figure 5
Figure 5 shows model results for VEGF status prediction. Panel a displays a violin plot of predicted score distributions for two classes. Panels b through e include performance evaluation visualizations from the test cohort, consisting of a receiver operating characteristic curve, a confusion matrix, a calibration curve comparing predicted and observed probabilities, and a decision curve analysis illustrating net benefit across a range of threshold probabilities.

Figure 5. Prediction of VEGF status. (a) Violin plot illustrating the distribution of predicted scores for VEGF positive (1) and negative (0) cases. (b–e) Performance evaluation of VEGF prediction in the test cohort: (b) ROC curve, (c) Confusion matrix, (d) Calibration curve, (e) Decision curve. analysis. Calibration curve, (e) Decision curve analysis.

Table 3
www.frontiersin.org

Table 3. Performance comparison among models.

3.3 Prognostic stratification and survival prediction

Multivariable Cox regression analysis identified both PD-L1 (HR = 0, 95% CI: 0–0.19; p = 0.043) and VEGF (HR = 8.01, 95% CI: 1.51–42.36; p = 0.014) prediction scores as independent prognostic factors for OS. Using the constructed nomogram (Figure 6a) and the optimal risk stratification cutoff of 63.9, patients were categorized into high-risk (n = 63) and low-risk (n = 33) groups. KM analysis of these risk-stratified groups revealed significantly worse survival in high-risk patients (median OS: 23 months; log-rank p = 0.006) (Figure 6b). The IHC-based risk stratification also demonstrated comparable survival discrimination, with high-risk patients having a median OS of 27 months (log-rank p = 0.012) (Figure 6c).

Figure 6
Panel a displays risk assessment scales for points, PDL1 score, VEGF score, total points, linear predictor, and survival probabilities at 12, 24, and 36 months. Panel b shows a survival probability curve with high and low-risk groups, highlighting a significant difference (p = 0.0064). Panel c presents another survival probability curve for high and low-risk groups with a significant level (p = 0.012). Each curve includes a number at risk table over time.

Figure 6. Prognostic analysis results. (a) Cox nomogram for individualized survival probability prediction. The “Points” axis assigns a score for a patient’s predicted PD-L1 and VEGF levels. The sum of these two scores (“Total Points”) is projected downward to obtain the “Linear Predictor” and further to the survival probability axes for 12, 24, and 36 months (Pro, probability). (b) Kaplan-Meier curve analysis of overall survival stratified by the risk groups derived from the multimodal deep learning framework. (c) Kaplan-Meier curve analysis of overall survival based on risk stratification using traditional immunohistochemistry methods.

4 Discussion

This study presented a multimodal DL framework that non-invasively predicts PD-L1 and VEGF expression and survival outcomes in eCCA patients by integrating DL features, radiomic features, and clinical data. Our approach achieved robust preoperative molecular profiling and reliable risk stratification. These findings highlighted the potential for integrating imaging-based analyses into personalized eCCA treatment strategies.

Emerging immunotherapies and targeted treatments have provided new therapeutic perspectives for advanced CCA. However, the benefits have primarily been observed in iCCA patients, with only a small subset of eCCA patients showing similar responses (5, 17). The main reason for this disparity lies in the significant molecular, genetic, and histopathological differences between eCCA and iCCA (12, 18). Additionally, eCCA suffers from lower rates of molecular analysis and the difficulty in identifying suitable biomarkers for rare targets (17). Our results demonstrated the feasibility of preoperative prediction for PD-L1 and VEGF status using non-invasive multimodal data, with superior performance for VEGF (AUC = 0.85) compared to PD-L1 (AUC = 0.71). The differential predictive accuracy for PD-L1 and VEGF highlights intrinsic biological and methodological challenges. VEGF's strong performance likely arises from its uniform tumor expression, which can be detected through macroscopic imaging patterns of angiogenic signaling (19). This finding aligns with several other studies on VEGF expression prediction, which consistently report AUC values exceeding 0.8 (2023). In contrast, PD-L1’s heterogeneous expression, driven by dynamic immune-tumor interactions (24), may reduce the sensitivity of image-based models (25). Our hierarchical fusion model outperformed single-modality approaches, demonstrating the added value of combining DL-derived imaging features with handcrafted radiomics. The attention mechanism further optimized feature integration, dynamically prioritizing clinical variables like tumor location, which correlated significantly with PD-L1/VEGF positivity in our cohort. This supports recent genomic studies suggesting molecular gradients in CCA, potentially influenced by microenvironmental factors such as oxygen levels and immune infiltration (26).

Prognostic analysis identified the model-predicted PD-L1 and VEGF as significant factors for OS in eCCA patients, reflecting their important roles in tumor biology. The association between PD-L1 expression and survival may indicate a tumor microenvironment more responsive to immune checkpoint inhibition (1, 27), while VEGF-driven angiogenesis likely contributes to metastatic dissemination (9). Our DL-based Cox nomogram demonstrated risk stratification comparable to traditional IHC, effectively recapitulating biologically grounded prognostic hierarchies. The model's predictions can provide a probabilistic estimate of biomarker status when a biopsy is contraindicated, high-risk, or inconclusive. This information, while not replacing IHC, can be incorporated into multidisciplinary team discussions to inform therapeutic decisions. For example, it can assist in risk-stratifying patients or guiding the selection of first-line systemic therapies (e.g., prioritizing anti-angiogenic agents in cases with a high predicted probability of VEGF positivity). These findings suggest the nomogram’s potential to inform therapeutic decision-making, such as prioritizing high-risk patients for VEGF inhibitors like bevacizumab, while directing PD-L1-positive cases toward early PD-1/PD-L1 blockade. This approach aligns with a phase II trial evaluating the combination of anti-angiogenic antibodies and immune checkpoint inhibitors in patients with advanced CCA (28).

Despite the promising results, several limitations should be considered. First, the retrospective design and modest sample size, resulting from the low incidence and high detection cost, highlight the need for external validation in larger, prospective cohorts. Second, although the model demonstrated strong performance in VEGF prediction, further refinement is necessary for PD-L1 prediction, which remains an area of ongoing investigation. Third, the biological interpretability of DL and radiomic features remain unclear. Future studies should aim to correlate key features with spatial transcriptomics or multiplex immunohistochemistry data to enhance understanding.

5 Conclusions

Our study established a proof-of-concept for non-invasive, DL-driven biomarker profiling in eCCA. By transforming routine MRI into a multidimensional descriptor of tumor immune and angiogenic states, our framework could enable real-time therapeutic decision-making, particularly in resource-limited settings. As precision oncology shifts toward immune-microenvironment modulation, such tools may become indispensable for unlocking the full potential of targeted therapies in biliary tract cancers.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by Ethics Committee of the Affiliated Hospital of Southwest Medical University (Ref: KY2023041). The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants' legal guardians/next of kin because This study was retrospective and carried minimal risk.

Author contributions

JL: Formal analysis, Writing – original draft, Data curation, Conceptualization. ZC: Software, Formal analysis, Writing – original draft, Methodology. RY: Writing – review & editing, Investigation, Visualization. YF: Investigation, Writing – review & editing. YS: Validation, Writing – review & editing. YT: Supervision, Project administration, Writing – review & editing. JS: Funding acquisition, Resources, Writing – review & editing, Supervision.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This work was supported by the National Natural Science Foundation of China (Grant No. 82272077).

Acknowledgments

We would like to thank the Chongqing People's Hospital for providing external data.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Montal R, Sia D, Montironi C, Leow WQ, Esteban-Fabró R, Pinyol R, et al. Molecular classification and therapeutic targets in extrahepatic cholangiocarcinoma. J Hepatol. (2020) 73:315–27. doi: 10.1016/j.jhep.2020.03.008

PubMed Abstract | Crossref Full Text | Google Scholar

2. Valle JW, Kelley RK, Nervi B, Oh D-Y, and Zhu AX. Biliary tract cancer. Lancet. (2021) 397:428–44. doi: 10.1016/S0140-6736(21)00153-7

PubMed Abstract | Crossref Full Text | Google Scholar

3. Banales JM, Marin JJG, Lamarca A, Rodrigues PM, Khan SA, Roberts LR, et al. Cholangiocarcinoma 2020: the next horizon in mechanisms and management. Nat Rev Gastroenterol Hepatol. (2020) 17:557–88. doi: 10.1038/s41575-020-0310-z

PubMed Abstract | Crossref Full Text | Google Scholar

4. Izquierdo-Sanchez L, Lamarca A, La Casta A, Buettner S, Utpatel K, Klümpen HJ, et al. Cholangiocarcinoma landscape in Europe: Diagnostic, prognostic and therapeutic insights from the ENSCCA Registry. J Hepatol. (2022) 76:1109–21. doi: 10.1016/j.jhep.2021.12.010

PubMed Abstract | Crossref Full Text | Google Scholar

5. Wheless M, Agarwal R, Goff L, Lockney N, Padmanabhan C, and Heumann T. Current standards, multidisciplinary approaches, and future directions in the management of extrahepatic cholangiocarcinoma. Curr Treat Options Oncol. (2024) 25:127–60. doi: 10.1007/s11864-023-01153-5

PubMed Abstract | Crossref Full Text | Google Scholar

6. Li Y, Yu J, Zhang Y, Peng C, Song Y, and Liu S. Advances in targeted therapy of cholangiocarcinoma. Ann Med. (2024) 56:2310196. doi: 10.1080/07853890.2024.2310196

PubMed Abstract | Crossref Full Text | Google Scholar

7. Yi M, Niu M, Xu L, Luo S, and Wu K. Regulation of PD-L1 expression in the tumor microenvironment. J Hematol Oncol. (2021) 14:10. doi: 10.1186/s13045-020-01027-5

PubMed Abstract | Crossref Full Text | Google Scholar

8. Ghalehbandi S, Yuzugulen J, Pranjol MZI, and Pourgholami MH. The role of VEGF in cancer-induced angiogenesis and research progress of drugs targeting VEGF. Eur J Pharmacol. (2023) 949:175586. doi: 10.1016/j.ejphar.2023.175586

PubMed Abstract | Crossref Full Text | Google Scholar

9. Mariotti V, Fiorotto R, Cadamuro M, Fabris L, and Strazzabosco M. New insights on the role of vascular endothelial growth factor in biliary pathophysiology. JHEP reports: Innovation hepatology. (2021) 3:100251. doi: 10.1016/j.jhepr.2021.100251

PubMed Abstract | Crossref Full Text | Google Scholar

10. Yoshikawa D, Ojima H, Iwasaki M, Hiraoka N, Kosuge T, Kasai S, et al. Clinicopathological and prognostic significance of EGFR, VEGF, and HER2 expression in cholangiocarcinoma. Br J cancer. (2008) 98:418–25. doi: 10.1038/sj.bjc.6604129

PubMed Abstract | Crossref Full Text | Google Scholar

11. Ramjiawan RR, Griffioen AW, and Duda DG. Anti-angiogenesis for cancer revisited: Is there a role for combinations with immunotherapy? Angiogenesis. (2017) 20:185–204. doi: 10.1007/s10456-017-9552-y

PubMed Abstract | Crossref Full Text | Google Scholar

12. Ilyas SI, Affo S, Goyal L, Lamarca A, Sapisochin G, Yang JD, et al. Cholangiocarcinoma - novel biological insights and therapeutic strategies. Nat Rev Clin Oncol. (2023) 20:470–86. doi: 10.1038/s41571-023-00770-1

PubMed Abstract | Crossref Full Text | Google Scholar

13. Murciano-Goroff YR, Warner AB, and Wolchok JD. The future of cancer immunotherapy: microenvironment-targeting combinations. Cell Res. (2020) 30:507–19. doi: 10.1038/s41422-020-0337-2

PubMed Abstract | Crossref Full Text | Google Scholar

14. Brindley PJ, Bachini M, Ilyas SI, Khan SA, Loukas A, Sirica AE, et al. Cholangiocarcinoma. Nat Rev Dis primers. (2021) 7:65. doi: 10.1038/s41572-021-00300-2

PubMed Abstract | Crossref Full Text | Google Scholar

15. Zhang X, Zhang Y, Zhang G, Qiu X, Tan W, Yin X, et al. Prospective clinical research of radiomics and deep learning in oncology: A translational review. Crit Rev oncology/hematology. (2022) 179:103823. doi: 10.1016/j.critrevonc.2022.103823

PubMed Abstract | Crossref Full Text | Google Scholar

16. Avanzo M, Wei L, Stancanello J, Vallières M, Rao A, Morin O, et al. Machine and deep learning methods for radiomics. Med physics. (2020) 47:e185–202. doi: 10.1002/mp.13678

PubMed Abstract | Crossref Full Text | Google Scholar

17. Spencer K, Pappas L, Baiev I, Maurer J, Bocobo AG, Zhang K, et al. Molecular profiling and treatment pattern differences between intrahepatic and extrahepatic cholangiocarcinoma. J Natl Cancer Inst. (2023) 115:870–80. doi: 10.1093/jnci/djad046

PubMed Abstract | Crossref Full Text | Google Scholar

18. Bragazzi MC, Ridola L, Safarikia S, Matteo SD, Costantini D, Nevi L, et al. New insights into cholangiocarcinoma: multiple stems and related cell lineages of origin. Ann Gastroenterol. (2018) 31:42–55. doi: 10.20524/aog.2017.0209

PubMed Abstract | Crossref Full Text | Google Scholar

19. Patel SA, Nilsson MB, Le X, Cascone T, Jain RK, and Heymach JV. Molecular mechanisms and future implications of VEGF/VEGFR in cancer therapy. Clin Cancer Res. (2023) 29:30–9. doi: 10.1158/1078-0432.CCR-22-1366

PubMed Abstract | Crossref Full Text | Google Scholar

20. Fan T, Li S, Li K, Xu J, Zhao S, Li J, et al. A potential prognostic marker for recognizing VEGF-positive hepatocellular carcinoma based on magnetic resonance radiomics signature. Front Oncol. (2022) 12:857715. doi: 10.3389/fonc.2022.857715

PubMed Abstract | Crossref Full Text | Google Scholar

21. Wang J, Hu Y, Zhou X, Bao S, Chen Y, Ge M, et al. A radiomics model based on DCE-MRI and DWI may improve the prediction of estimating IDH1 mutation and angiogenesis in gliomas. Eur J radiology. (2022) 147:110141. doi: 10.1016/j.ejrad.2021.110141

PubMed Abstract | Crossref Full Text | Google Scholar

22. Wang Q, Zhang Y, Zhang E, Xing X, Chen Y, Nie K, et al. A multiparametric method based on clinical and CT-based radiomics to predict the expression of p53 and VEGF in patients with spinal giant cell tumor of bone. Front Oncol. (2022) 12:894696. doi: 10.3389/fonc.2022.894696

PubMed Abstract | Crossref Full Text | Google Scholar

23. Yang C, Zhang ZM, Zhao ZP, Wang ZQ, Zheng J, Xiao HJ, et al. Radiomic analysis based on magnetic resonance imaging for the prediction of VEGF expression in hepatocellular carcinoma patients. Abdominal Radiol (New York). (2024) 49:3824–33. doi: 10.1007/s00261-024-04427-0

PubMed Abstract | Crossref Full Text | Google Scholar

24. Zhang N, Yang X, Piao M, Xun Z, Wang Y, Ning C, et al. Biomarkers and prognostic factors of PD-1/PD-L1 inhibitor-based therapy in patients with advanced hepatocellular carcinoma. biomark Res. (2024) 12:26. doi: 10.1186/s40364-023-00535-z

PubMed Abstract | Crossref Full Text | Google Scholar

25. Nimmagadda S. Quantifying PD-L1 expression to monitor immune checkpoint therapy: opportunities and challenges. Cancers. (2020) 12:3173. doi: 10.3390/cancers12113173

PubMed Abstract | Crossref Full Text | Google Scholar

26. Deng M, Ran P, Chen L, Wang Y, Yu Z, Cai K, et al. Proteogenomic characterization of cholangiocarcinoma. Hepatol (Baltimore Md). (2023) 77:411–29. doi: 10.1002/hep.32624

PubMed Abstract | Crossref Full Text | Google Scholar

27. Xia T, Li K, Niu N, Shao Y, Ding D, Thomas DL, et al. Immune cell atlas of cholangiocarcinomas reveals distinct tumor microenvironments and associated prognoses. J Hematol Oncol. (2022) 15:37. doi: 10.1186/s13045-022-01253-z

PubMed Abstract | Crossref Full Text | Google Scholar

28. Hack SP, Verret W, Mulla S, Liu B, Wang Y, Macarulla T, et al. IMbrave 151: a randomized phase II trial of atezolizumab combined with bevacizumab and chemotherapy in patients with advanced biliary tract cancer. Ther Adv Med Oncol. (2021) 13:17588359211036544. doi: 10.1177/17588359211036544

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: deep learning, extrahepatic cholangiocarcinoma, magnetic resonance imaging, programmed death ligand 1, vascular endothelial growth factor

Citation: Liu J, Cheng Z, Yang R, Fan Y, Shu Y, Tang Y and Shu J (2026) Deep learning-based multimodal approach for non-invasive prediction and prognostic analysis of immune and angiogenic biomarkers in extrahepatic cholangiocarcinoma. Front. Immunol. 16:1658122. doi: 10.3389/fimmu.2025.1658122

Received: 02 July 2025; Accepted: 23 December 2025; Revised: 23 December 2025;
Published: 13 January 2026.

Edited by:

Mathew Kurian Vithayathil, Imperial College, United Kingdom

Reviewed by:

Qi Li, The First Affiliated Hospital of Xi'an Jiaotong University, China
Basen Li, Huazhong University of Science and Technology, China

Copyright © 2026 Liu, Cheng, Yang, Fan, Shu, Tang and Shu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yong Tang, dGFuZ3lvbmdAdWVzdGMuZWR1LmNu; Jian Shu, c2h1amlhbm5jQHN3bXUuZWR1LmNu

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.