Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Med.

Sec. Nephrology

Volume 12 - 2025 | doi: 10.3389/fmed.2025.1615950

SHAP combined with machine learning to predict mortality risk in maintenance hemodialysis patients: a retrospective study

Provisionally accepted
Peng  ShuPeng Shu*Xia  WangXia WangJie  ChenJie ChenFang  XuFang Xu*
  • Central Hospital of Wuhan, Huazhong University of Science and Technology, Wuhan, China

The final, formatted version of the article will be published soon.

Background: Patients undergoing maintenance hemodialysis face a high mortality rate, yet effective tools for predicting mortality risk in this population are lacking. This study aims to develop an interpretable machine learning model to predict mortality risk among maintenance hemodialysis patients. Methods: A retrospective analysis was conducted on clinical data from 512 maintenance hemodialysis patients treated at The Central Hospital of Wuhan between January 2021 and October 2024. The dataset included 50 feature variables. The data were split into a training set (70%) and a test set (30%). Five machine learning models-Random Forest, Extreme Gradient Boosting, Support Vector Machine, Logistic Regression, and K-Nearest Neighbor-were trained and evaluated for predicting patient mortality risk, using metrics such as the F1 score, precision, accuracy, AUC-ROC, and recall. SHAP values were used to assess the contribution of each feature in the best-performing model.Results: The K-Nearest Neighbor model achieved the highest AUC-ROC of 0.9792 (95% CI: 0.9600-0.9929). SHAP analysis identified key factors influencing predictions, including dialysis duration, creatinine levels, white blood cell ratio, blood phosphorus concentration, and unconjugated iron. Conclusion: The K-Nearest Neighbor model demonstrated high efficacy in predicting mortality risk among hemodialysis patients. SHAP analysis highlighted critical risk factors. While these findings show promise for future clinical research, they should be interpreted with caution due to the study's retrospective design and the need for external validation.

Keywords: hemodialysis, Predictive Modeling, machine learning, mortality risk, Shap

Received: 22 Apr 2025; Accepted: 16 Jun 2025.

Copyright: © 2025 Shu, Wang, Chen and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Peng Shu, Central Hospital of Wuhan, Huazhong University of Science and Technology, Wuhan, China
Fang Xu, Central Hospital of Wuhan, Huazhong University of Science and Technology, Wuhan, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.