Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Physiol.

Sec. Renal Physiology and Pathophysiology

This article is part of the Research TopicCardiovascular–Kidney–Metabolic Syndrome: Interorgan Crosstalk, Pathophysiology, and TherapeuticsView all 11 articles

Developing an explainable machine learning model using body composition to predict cardiovascular mortality in initial dialysis patients: a multicenter study

Provisionally accepted
  • 1Qilu Hospital of Shandong University, Jinan, China
  • 2Institute of Nephrology, Zhongda Hospital, Southeast University, Nanjing, China
  • 3The Affiliated Taizhou People's Hospital of Nanjing Medical University, Taizhou, China
  • 4Changzhou First People's Hospital, Changzhou, China
  • 5Affiliated Hospital of Yangzhou University, Yangzhou, China
  • 6Jiangsu Province Geriatric Hospital, Nanjing, China

The final, formatted version of the article will be published soon.

Cardiovascular disease (CVD) is the leading cause of death in patients receiving dialysis, and accurate risk prediction at dialysis initiation remains limited. We developed and validated a machine learning model integrating CT-derived body composition features to predict CVD related mortality in initial dialysis patients. Patients initiating dialysis between 2014 and 2020 from three tertiary hospitals were used for model training and internal validation, with patients from a fourth center for external validation. Clinical characteristics and laboratory variables were collected, and body composition parameters were assessed using opportunistic CT scans. Feature selection was performed using univariable logistic regression and LASSO regression. Eight machine learning algorithms were trained, and model performance was assessed using discrimination, calibration, and decision curve analysis. Model interpretability was evaluated using Shapley Additive Explanations (SHAP), and a web-based risk calculator was developed. Among 1051 incident dialysis patients, 645 were assigned to the training and internal validation cohorts and 406 to the external validation cohort. Eight key predictors were identified, including age, diabetes, CVD, history of cardiac intervention, dialysis modality, skeletal muscle density, hemoglobin, and serum creatinine. CatBoost demonstrated the best performance, with an area under the receiver operating characteristic curve of 0.843 in internal validation and 0.799 in external validation, along with good calibration and clinical net benefit. SHAP analysis identified CVD, skeletal muscle density, and hemoglobin as major contributors. An explainable machine learning model incorporating CT-derived body composition features accurately predicts CVD-related mortality in initial dialysis patients. This model may facilitate early risk stratification and targeted prevention strategies at dialysis initiation.

Keywords: Cardiovascular disease mortality, Dialysis, machine learning, risk prediction, Skeletal muscle density

Received: 16 Dec 2025; Accepted: 27 Jan 2026.

Copyright: © 2026 Wang, Wei, Yu, Zheng, Cao, Li, Wang, Hou, Xu, Yang and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Jian Xu
Xiang-dong Yang
Bin Wang

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.