ORIGINAL RESEARCH article
Front. Med.
Sec. Nephrology
Volume 12 - 2025 | doi: 10.3389/fmed.2025.1598065
Ensemble Machine Learning for Predicting Renal Function Decline in Chronic Kidney Disease: Development and External Validation
Provisionally accepted- the 95th Hospital of Putian in China RongTong Medical Health Corporation, Putian, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Chronic kidney disease (CKD) is recognized as one of the most significant global health challenges to day. Its effective management relies on timely intervention which is needed when there are indication s of renal function deterioration. Current predictive approaches tend to be inaccurate and superficial. We engineered and verified a machine learning model to predict the risk of renal function deterioratio n using data from 1,200 patients with CKD from different clinics. The study population was carefully chosen through strict and standardized inclusion and exclusion criteria, extensive collection of clinic al information, and precise laboratory evaluation. We developed an ensemble model based on the alg orithms of Random Forest, XGBoost, and Light GBM, employing advanced feature selection and hyp erparameter tuning. The model achieved an AUC of 0.89 (95% CI: 0.87-0.91) which is outstanding, o utperforming traditional Cox models (AUC: 0.82, 95% CI: 0.79-0.85) and average machine learning t echniques (AUC: 0.85, 95% CI: 0.83-0.87). In a SHAP analysis, key predictive factors were eGFR, a ge, and urinary protein-creatinine ratio. The model showed impressive calibration (slope: 0.96, 95% CI: 0.94-0.98) and strong performance across diverse patient subgroups, especially in high-risk popul ations where prediction was strongest. The model was found to be generalizable by internal validation using five-fold cross-validation and external validation in three medical centers. Analysis of resource utilization shows a 60.6% reduction in computational needs relative to traditional methods with unm atched prediction accuracy. Our model serves as an effective solution toward early risk stratification i n CKD management for timely and tailored intervention while optimizing resources. Such a novel ap proach provides a substantial improvement to predictive modelling of kidney function decline and eq uips clinicians with an effective tool for data-driven decision-making. Our code is publicly available at: https://gitee.com/forest-AI/CDK-Model.
Keywords: machine learning, chronic kidney disease progression, risk prediction modeling, Clinical decision support, precision nephrology
Received: 22 Mar 2025; Accepted: 25 Sep 2025.
Copyright: © 2025 Chen, Huang and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Hong Chen, xtog09@163.com
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.