Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Immunol.

Sec. Cancer Immunity and Immunotherapy

Volume 16 - 2025 | doi: 10.3389/fimmu.2025.1681072

This article is part of the Research TopicThe Insights of Multi-Omics into the Microenvironment After Tumor Metastasis: A Paradigm Shift in Molecular Targeting Modeling and Immunotherapy for Advanced Cancer PatientsView all 18 articles

Integrating Deep Learning Features from Mammography with SHAP Values for a Machine Learning Model Predicting Over 5-Year Recurrence of Breast Ductal Carcinoma In Situ Post-Lumpectomy

Provisionally accepted
Yupeng  ShaYupeng Sha1Yuan  QuanYuan Quan1Yi  DuYi Du1Shuqi  YangShuqi Yang2Ming  NiuMing Niu1Xiaoshuan  LiangXiaoshuan Liang1Shanshan  SunShanshan Sun1Tong  LiTong Li1Shu  GongShu Gong1Jiguang  HanJiguang Han1*
  • 1Third Affiliated Hospital of Harbin Medical University, Harbin, China
  • 2Fujian Medical University Affiliated First Quanzhou Hospital, Quanzhou, China

The final, formatted version of the article will be published soon.

Background In women with ductal carcinoma in situ (DCIS) undergoing breast-conserving surgery, recurrence rates within 5 years are 18-25%. Mammograms offer rich tumor data for patient stratification, but current prediction methods focus on clinicopathological factors, overlooking imaging insights. Methods We retrospectively analyzed 140 DCIS patients from Harbin Medical University Cancer Hospital (2011-2020, followed up to 2025). Preoperative digital mammograms and clinicopathological data were collected, with mammographic features extracted using pyradiomics and supervised by a senior radiologist. Feature selection employed 10-fold cross-validated LASSO regression. The dataset was split into training (n=100) and validation (n=40) sets (10:4 ratio). Sixteen machine learning algorithms combining mammographic deep learning features and clinicopathological variables were developed and compared for predicting DCIS recurrence. Model performance was assessed using ROC, sensitivity, specificity, PPV, NPV, and SHAP values for interpretation. Results Of the 140 patients, 34 (24.3%) experienced recurrence. The Gradient Boosting Machine (GBM) algorithm had the best predictive performance, with an AUC of 0.918 (95% CI 0.873-0.963) in the test set. SHAP values indicated that the mammographic signature (MS) was the most significant predictor, followed by Ki-67 index and histological grade. Patients not receiving radiotherapy had higher recurrence rates (41.2%) than those who did (17.9%, p<0.05). Decision curve analysis validated the model's clinical utility across various risk thresholds. Conclusion Our study developed an interpretable GBM model incorporating mammographic and clinical data to predict DCIS recurrence (AUC=0.918). Key predictors were mammographic signature, Ki-67, and tumor grade, offering clinicians a practical tool for personalized postoperative management.

Keywords: ductal carcinoma in situ, Breast-conserving surgery, Mammography, deep learning, Recurrence

Received: 06 Aug 2025; Accepted: 29 Aug 2025.

Copyright: © 2025 Sha, Quan, Du, Yang, Niu, Liang, Sun, Li, Gong and Han. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Jiguang Han, Third Affiliated Hospital of Harbin Medical University, Harbin, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.