ORIGINAL RESEARCH article
Front. Oncol.
Sec. Genitourinary Oncology
Volume 15 - 2025 | doi: 10.3389/fonc.2025.1661695
This article is part of the Research TopicLeveraging Artificial Intelligence for Biomarker Discovery in Prostate CancerView all 3 articles
Interpretable Machine Learning Models Based on Multi-Dimensional Fusion Data for Predicting Positive Surgical Margins in Robot-Assisted Radical Prostatectomy: A Retrospective Study
Provisionally accepted- 1First Affiliated Hospital of Chongqing Medical University, Chongqing, China
- 2The Second People's Hospital of Neijiang, Neijiang, China
- 3Guang'an People's Hospital, Guang'an, China
- 4First Peoples Hospital of Neijiang, Neijiang, China
- 5Chongqing Medical University, Chongqing, China
- 6University-Town Hospital of Chongqing Medical University, Chongqing, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Objective This study aimed to develop and validate interpretable machine learning (ML) models based on multi-dimensional fusion data for predicting positive surgical margins (PSM) in robot-assisted radical prostatectomy (RARP). Methods Patients who underwent RARP at our institution between January 2016 and July 2025 were enrolled. Demographic, clinical, biopsy pathology data, and MRI-derived anatomical features (measured using ITK-SNAP on axial, sagittal, and coronal planes) were collected. Feature selection was performed using intraobserver and interobserver correlation coefficients (ICCs), low-variance filtering, univariable logistic regression, Spearman’s correlation analysis, the least absolute shrinkage and selection operator (LASSO) algorithm, and the Boruta algorithm. Six ML models were constructed, with performance evaluated using area under the curve (AUC), calibration curves, and decision curve analyses (DCA) to identify the optimal model. Five-fold and ten-fold cross-validation were used to assess the optimal model’s generalizability, and its interpretability was evaluated via Shapley Additive exPlanations (SHAP) analysis. Results A total of 347 patients were included, comprising a training set (n=193, January 2016–December 2024), validation set (n=84, January 2016–December 2024), and test set (n=70, January 2025–July 2025). From 164 initial features, 7 key features were retained through a four-step screening. The Random Forest (RF) model outperformed other models, achieving AUCs of 0.99 (95% CI: 0.97–1.00) in the training set, 0.88 (95% CI: 0.80–0.95) in the validation set, and 0.97 (95% CI: 0.94–1.00) in the test set. Calibration curve and decision curve analyses confirmed its strong clinical utility. Five-fold cross-validation for the RF model showed fold-specific AUCs of 0.82–0.92. Ten-fold cross-validation showed fold-specific AUCs of 0.80–0.99. SHAP analysis revealed five novel spatial anatomical features (such as Sagittal plane-posterior spatial anatomical structure index, Coronal plane-Left anatomical structure interval) were negatively associated with PSM risk, while the number of positive biopsy cores and clinical tumor stage were positively associations. Conclusions Multi-dimensional fusion data combined with ML models improves PSM prediction accuracy in RARP. The RF model, with excellent performance and interpretability, shows promise for preoperative PSM risk stratification, facilitates optimized clinical decision-making, and supports personalized treatment discussions during preoperative planning, but requires prospective and external validation before clinical implementation.
Keywords: prostate cancer, Robot-assisted radical prostatectomy (RARP), multi-dimensional fusion data, Multiparametric magnetic resonance imaging (mpMRI), machine learning, interpretation
Received: 08 Jul 2025; Accepted: 17 Sep 2025.
Copyright: © 2025 Liu, Zhou, Dong, Liu, Luo, Luo, Su, Santigie, Wang, Liu, Zhang, Qiu, Jiang, Han, Zhang, He and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Jindong Zhang, 204969@hospital.cqmu.edu.cn
Jiang He, 78644774@qq.com
Delin Wang, wangdelin_cqmu@163.com
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.