Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Endocrinol.

Sec. Cancer Endocrinology

Volume 16 - 2025 | doi: 10.3389/fendo.2025.1656191

This article is part of the Research TopicAI-Powered Insights: Predicting Treatment Response and Prognosis in Breast CancerView all 11 articles

Machine learning algorithms for individualised prediction of prognosis in breast cancer liver metastases and the prognostic impact of primary tumor surgery: a multicenter study

Provisionally accepted
Qunchen  ZhangQunchen Zhang1*Chunmei  ChenChunmei Chen1Jun-dong  WuJun-dong Wu2Bo  XuBo Xu3,4Weiwen  LiWeiwen Li1Chengming  ZhongChengming Zhong5Zhibing  YanZhibing Yan1Qipeng  ZhongQipeng Zhong1Ronggang  LiRonggang Li1Tao  ShaoTao Shao1Yan  DongYan Dong1Yutong  FangYutong Fang2Yong  LiYong Li1
  • 1Jiangmen Central Hospital, Jiangmen, China
  • 2Shantou University Medical College Cancer Hospital, Shantou, China
  • 3The First Affiliated Hospital of Jinan University, Guangzhou, China
  • 4Guangzhou First People's Hospital, Guangzhou, China
  • 5Jiangmen Xinhui Maternal and Child Health Hospital, Jiangmen, China

The final, formatted version of the article will be published soon.

Background The prognosis of patients with breast cancer liver metastasis (BCLM) is generally poor, and there are no specific treatment guidelines. Accurate prognostic tools are needed to estimate survival and support individualized management. Methods The study cohort consisted of 4,817 patients diagnosed with BCLM from the SEER database spanning 2010 to 2020. Candidate predictors were screened using univariate and multivariate Cox regression. Five machine-learning algorithms—Random Forest (RF), Logistic Regression, XGBoost, Decision Tree, and Gradient Boosting—were trained to predict 6-month, 1-, 3-, and 5-year overall survival (OS) and breast cancer–specific survival (BCSS). Labels at each horizon were handled with inverse probability-of-censoring weighting, and performance was assessed with IPCW-AUC, accuracy, F1 score, calibration, and decision curve analysis. External validation included 124 BCLM patients from two Chinese hospitals. To evaluate the effect of primary tumor surgery (PTS), we modeled PTS as a time-dependent exposure and performed time-dependent Cox analyses with time-varying effects, Simon–Makuch curves, piecewise Cox modeling, 2-month landmark analysis, and E-value calculations. Results RF was the top performer for both OS and BCSS across horizons (training AUCs = 0.840–0.899; internal test AUCs = 0.763–0.787), with good calibration and net benefit. External validation showed consistent discrimination (AUCs 0.779–0.815). SHAP analyses highlighted chemotherapy, age, subtype, and surgery as dominant contributors. In time-dependent analyses, PTS was associated with reduced risks of death (OS: HR 0.80, 95% CI 0.72–0.88) and breast cancer–specific mortality (BCSS: HR 0.77, 95% CI 0.69–0.86); findings were directionally consistent in piecewise and landmark analyses, and E-values (≥1.81) supported robustness to moderate unmeasured confounding. Conclusion We developed and externally validated robust RF-based models for predicting OS and BCSS in BCLM. Our results indicate that PTS is associated with longer survival and lower breast cancer-specific mortality in carefully selected patients, supporting consideration within individualized, multidisciplinary decision-making.

Keywords: Breast cancer liver metastases, machine learning, prognosis, Surgery, random forest

Received: 29 Jun 2025; Accepted: 23 Sep 2025.

Copyright: © 2025 Zhang, Chen, Wu, Xu, Li, Zhong, Yan, Zhong, Li, Shao, Dong, Fang and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Qunchen Zhang, qczhang2014@163.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.