ORIGINAL RESEARCH article
Front. Physiol.
Sec. Computational Physiology and Medicine
Volume 16 - 2025 | doi: 10.3389/fphys.2025.1665383
This article is part of the Research TopicMedical Knowledge-Assisted Machine Learning Technologies in Individualized Medicine Volume IIView all 24 articles
Interpretable machine learning models to predict survival in esophageal cancer: a study based on the SEER database and external validation in China
Provisionally accepted- First People's Hospital of Kashi, Kashgar Prefecture, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Objective:We developed interpretable machine learning(ML) models to predict the overall survival(OS) of esophageal cancer patients. This approach aims to make our modeling results more interpretable and transparent. Methods:We collected the clinicopathological information of esophageal cancer patients from the SEER database and divided them into training and validation sets at a ratio of 7:3. Meanwhile, we obtained an external validation cohort from the First People's Hospital of Kashi in Xinjiang, China. Using LASSO and multivariate Cox regression analyses, we identified relevant risk factors and combined them to develop CoxPH and six ML models: Random Survival Forest(RSF), Gradient Boosting with Component Linear(GLMboost), decision tree(dt), boosted tree(bt), DeepSurv, and neural multi-task logistic regression(NMTLR). We evaluated the predictive performance of these ML models using the C-index, integral cumulative/dynamic AUC, integral Brier score, Kolmogorov-Smirnov (KS) test and Cramer-von Mises (CvM) test. For interpretability assessment, we employed three complementary methods: (1)time-dependent variable importance to quantify feature contribution across follow-up periods; (2)partial correlation survival plots to visualize individual variable effects; and (3)aggregated survival SHapley additive interpretation(SurvSHAP) plots with mean absolute deviation metrics to validate feature impact stability at both individual and population levels. Results:The final ML model consisted of 11 factors: grade, stage, T stage, N stage, M stage, radiotherapy, chemotherapy, bone metastasis, liver metastasis, lung metastasis, and age. Our predictive models demonstrate significant discriminative power; in particular, the NMTLR model performs best. For the training, validation, and external validation sets, the area under the curve (AUC) for one-, three-, and five-year OS was higher than 0.81, and the integrated Brier score was consistently lower than 0.175. interpretability analyses confirmed consistent predictive logic: M stage, N stage, age, grade, bone metastases, liver metastases, lung metastases and radiotherapy were identified as the most influential predictors via quantifiable SurvSHAP values and time-dependent importance weights, with their effects visually validated through partial correlation survival curves. Conclusions:The NMTLR prognostic model is the most effective at predicting the OS of esophageal cancer patients. It helps physicians correctly assess patient survival and provides valuable information for diagnosis and prognosis evaluation.
Keywords: esophageal cancer, Interpretable machine learning, overall survival, SurvSHAP, Deep learning models
Received: 14 Jul 2025; Accepted: 15 Sep 2025.
Copyright: © 2025 Tuersun, Abudoubari, Abudouwake, Tuerdi, Maimaitiyiming, Nijiati, Qiu and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Jianquan Wang, wjqks@163.com
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.