Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Med.

Sec. Nephrology

Volume 12 - 2025 | doi: 10.3389/fmed.2025.1598065

Ensemble Machine Learning for Predicting Renal Function Decline in Chronic Kidney Disease: Development and External Validation

Provisionally accepted
Hong  ChenHong Chen*Yuping  HuangYuping HuangLizhen  ChenLizhen Chen
  • the 95th Hospital of Putian in China RongTong Medical Health Corporation, Putian, China

The final, formatted version of the article will be published soon.

Chronic kidney disease (CKD) is recognized as one of the most significant global health challenges to day. Its effective management relies on timely intervention which is needed when there are indication s of renal function deterioration. Current predictive approaches tend to be inaccurate and superficial. We engineered and verified a machine learning model to predict the risk of renal function deterioratio n using data from 1,200 patients with CKD from different clinics. The study population was carefully chosen through strict and standardized inclusion and exclusion criteria, extensive collection of clinic al information, and precise laboratory evaluation. We developed an ensemble model based on the alg orithms of Random Forest, XGBoost, and Light GBM, employing advanced feature selection and hyp erparameter tuning. The model achieved an AUC of 0.89 (95% CI: 0.87-0.91) which is outstanding, o utperforming traditional Cox models (AUC: 0.82, 95% CI: 0.79-0.85) and average machine learning t echniques (AUC: 0.85, 95% CI: 0.83-0.87). In a SHAP analysis, key predictive factors were eGFR, a ge, and urinary protein-creatinine ratio. The model showed impressive calibration (slope: 0.96, 95% CI: 0.94-0.98) and strong performance across diverse patient subgroups, especially in high-risk popul ations where prediction was strongest. The model was found to be generalizable by internal validation using five-fold cross-validation and external validation in three medical centers. Analysis of resource utilization shows a 60.6% reduction in computational needs relative to traditional methods with unm atched prediction accuracy. Our model serves as an effective solution toward early risk stratification i n CKD management for timely and tailored intervention while optimizing resources. Such a novel ap proach provides a substantial improvement to predictive modelling of kidney function decline and eq uips clinicians with an effective tool for data-driven decision-making. Our code is publicly available at: https://gitee.com/forest-AI/CDK-Model.

Keywords: machine learning, chronic kidney disease progression, risk prediction modeling, Clinical decision support, precision nephrology

Received: 22 Mar 2025; Accepted: 25 Sep 2025.

Copyright: © 2025 Chen, Huang and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Hong Chen, xtog09@163.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.