Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Surg.

Sec. Surgical Oncology

Volume 12 - 2025 | doi: 10.3389/fsurg.2025.1588208

Clinical Decision System for Renal Cell Carcinoma Integrating Interpretable Machine Learning Algorithms

Provisionally accepted
  • 1Seventh Affiliated Hospital, Sun Yat-sen University, Shenzhen, China
  • 2Xuzhou Central Hospital, Xuzhou, Jiangsu Province, China
  • 3Sun Yat-sen University, Shenzhen Campus, Shenzhen, Guangdong, China
  • 4The Affiliated Hospital of Southwest Medical University, Luzhou, Sichuan, China

The final, formatted version of the article will be published soon.

Background: Kidney cancer is a highly heterogeneous oncologic disease with historically poor prognosis. Precise assessment of the risk of distal metastasis can facilitate risk stratification and improve prognosis for kidney cancer patients. Methods: Data from the Surveillance, Epidemiology, and End Results (SEER) database, we identified 40527 kidney cancer patients diagnosed between 2010 and 2017 were obtained. LASSO, univariate and multivariate logistic regression analyses were employed to screen independent risk factors for distal metastasis. Six machine learning (ML) algorithms including logistic regression (LR), Naïve Bayes Classifier (NBC), Decision Tree (DT), Random Forest (RF), Gradient boosting machine (GBM) and Extreme gradient boosting (XGB), were further applied to build the predictive models. After testing with ten-fold cross-validation and receiver operating characteristic (ROC) analysis, the model with the highest area under curve (AUC) was selected as the best performing model to establish the risk predictive nomogram and web calculator. Results: In distal metastasis risk prediction, the XGB model had the best performance in both training (AUC=0.91) and testing (AUC=0.851) datasets among the six ML algorithms. Variables including marital status, sequence number, primary site, grade, pathological type, T-stage, N-stage, the calculated risk of XGB, surgical and radiation treatment were incorporated to establish a nomogram to predict the 1-, 3-, and 5-years survival probability. The calibration plots, decision curve analysis (DCA), ROC curves and Kaplan-Meier (KM) curves all verified the predictive utility of the nomogram. Conclusions: We established a favorable prediction for the occurrence of distal metastasis with the ML model. The nomogram based on XGB algorithm can contribute to identify high-risk patients and provide optimal clinical strategies.

Keywords: Kidney cancer, Distal metastasis, nomogram, machine learning, predictive model

Received: 05 Mar 2025; Accepted: 24 Sep 2025.

Copyright: © 2025 Teng, Zhang, Zhang, Pang and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Jun Pang, pangjun2@mail.sysu.edu.cn
Chao Yang, 675871623@qq.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.