Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Public Health

Sec. Digital Public Health

This article is part of the Research TopicAdvancing Healthcare AI: Evaluating Accuracy and Future DirectionsView all 19 articles

Use of Machine Learning Models to Predict Mortality in Dialysis Patients

Provisionally accepted
Junmin  HuangJunmin Huang1LU  ChenLU Chen1Hongying  LuoHongying Luo1Junhao  SongJunhao Song2Ziqian  BiZiqian Bi2Keyu  ChenKeyu Chen2Chia  Xin LiangChia Xin Liang2Ming  LiuMing Liu2Tianyang  WangTianyang Wang2Benji  PengBenji Peng3Zihan  WeiZihan Wei1Zhiqing  HuangZhiqing Huang1Li  ZhihangLi Zhihang1Xincheng  LiuXincheng Liu1Hongjiu  ZhouHongjiu Zhou1Weihuang  ZhangWeihuang Zhang1Wen  WenWen Wen1Mian-Na  LuoMian-Na Luo1Shujun  WangShujun Wang1Huafeng  LiuHuafeng Liu1Chunjie  TianChunjie Tian1Jibin  GuanJibin Guan4Joe  YeongJoe Yeong5Yongzhi  XuYongzhi Xu1*Peng  WangPeng Wang1*Junfeng  HaoJunfeng Hao1*
  • 1Affiliated Hospital of Guangdong Medical University, Zhanjiang, China
  • 2AI Agent Lab, Vokram Group, London, United Kingdom, London, United Kingdom
  • 3AppCubic, Atlanta, Georgia, USA, Atlanta, United States
  • 4University of Minnesota Masonic Cancer Center, Minneapolis, United States
  • 5Singapore General Hospital Division of Pathology, Singapore, Singapore

The final, formatted version of the article will be published soon.

Background Mortality among maintenance hemodialysis patients remains high, and traditional statistical models often fail to capture complex clinical relation-ships. This study aimed to systematically develop, compare, and validate 19 machine learning algorithms for predicting all-cause mortality in maintenance hemodialysis patients. Methods This retrospective study included data from 538 maintenance hemodialysis patients (2018.1–2023.12), with 70% used for training and 30% for testing. Each model underwent hyperparameter optimization based on three performance metrics (accuracy, F1-score, and ROC Area Under the Curve [AUC]) to evaluate the impact of different clinical priorities. Results Gradient boosting models demonstrated consistent superiority, with performance outcomes highly sensitive to the selected optimization target. XGBoost optimized for accuracy achieved an F1 score of 0.683 and a ROC AUC of 0.899. AdaBoost optimized for F1 score attained the highest ROC AUC of 0.903 and an F1 score of 0.682. AdaBoost also demonstrated robust performance across optimization strategies, suggesting its suitability for clinical implementation where balanced risk prediction is essential. Conclusion A systematic ML framework can yield tailored, high-performing models for mortality risk stratification in maintenance hemodialysis patients, with significant potential to enhance identification and management of high-risk individuals in clinical practice.

Keywords: Maintenance hemodialysis, Mortality prediction, machine learning, gradient boosting, risk stratification

Received: 10 Aug 2025; Accepted: 18 Nov 2025.

Copyright: © 2025 Huang, Chen, Luo, Song, Bi, Chen, Liang, Liu, Wang, Peng, Wei, Huang, Zhihang, Liu, Zhou, Zhang, Wen, Luo, Wang, Liu, Tian, Guan, Yeong, Xu, Wang and Hao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Yongzhi Xu, lxyzhi@126.com
Peng Wang, wangpeng@gdmu.edu.cn
Junfeng Hao, ygzhjf85@gmail.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.