ORIGINAL RESEARCH article
Front. Immunol.
Sec. Cancer Immunity and Immunotherapy
Volume 16 - 2025 | doi: 10.3389/fimmu.2025.1681072
This article is part of the Research TopicThe Insights of Multi-Omics into the Microenvironment After Tumor Metastasis: A Paradigm Shift in Molecular Targeting Modeling and Immunotherapy for Advanced Cancer PatientsView all 18 articles
Integrating Deep Learning Features from Mammography with SHAP Values for a Machine Learning Model Predicting Over 5-Year Recurrence of Breast Ductal Carcinoma In Situ Post-Lumpectomy
Provisionally accepted- 1Third Affiliated Hospital of Harbin Medical University, Harbin, China
- 2Fujian Medical University Affiliated First Quanzhou Hospital, Quanzhou, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Background In women with ductal carcinoma in situ (DCIS) undergoing breast-conserving surgery, recurrence rates within 5 years are 18-25%. Mammograms offer rich tumor data for patient stratification, but current prediction methods focus on clinicopathological factors, overlooking imaging insights. Methods We retrospectively analyzed 140 DCIS patients from Harbin Medical University Cancer Hospital (2011-2020, followed up to 2025). Preoperative digital mammograms and clinicopathological data were collected, with mammographic features extracted using pyradiomics and supervised by a senior radiologist. Feature selection employed 10-fold cross-validated LASSO regression. The dataset was split into training (n=100) and validation (n=40) sets (10:4 ratio). Sixteen machine learning algorithms combining mammographic deep learning features and clinicopathological variables were developed and compared for predicting DCIS recurrence. Model performance was assessed using ROC, sensitivity, specificity, PPV, NPV, and SHAP values for interpretation. Results Of the 140 patients, 34 (24.3%) experienced recurrence. The Gradient Boosting Machine (GBM) algorithm had the best predictive performance, with an AUC of 0.918 (95% CI 0.873-0.963) in the test set. SHAP values indicated that the mammographic signature (MS) was the most significant predictor, followed by Ki-67 index and histological grade. Patients not receiving radiotherapy had higher recurrence rates (41.2%) than those who did (17.9%, p<0.05). Decision curve analysis validated the model's clinical utility across various risk thresholds. Conclusion Our study developed an interpretable GBM model incorporating mammographic and clinical data to predict DCIS recurrence (AUC=0.918). Key predictors were mammographic signature, Ki-67, and tumor grade, offering clinicians a practical tool for personalized postoperative management.
Keywords: ductal carcinoma in situ, Breast-conserving surgery, Mammography, deep learning, Recurrence
Received: 06 Aug 2025; Accepted: 29 Aug 2025.
Copyright: © 2025 Sha, Quan, Du, Yang, Niu, Liang, Sun, Li, Gong and Han. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Jiguang Han, Third Affiliated Hospital of Harbin Medical University, Harbin, China
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.