- 1Department of Clinical Laboratory, The Second Affiliated Hospital of Wannan Medical College, Wuhu, China
- 2Department of Laboratory Medicine, The Second People’s Hospital of Lianyungang, Lianyungang, China
- 3Department of Laboratory Medicine, The Second People’s Hospital of Lianyungang Affiliated with Kangda College of Nanjing Medical University, Lianyungang, China
Background: Diabetic kidney disease (DKD) is the leading cause of end-stage renal disease globally, yet early diagnosis remains challenging due to conventional biomarker limitations, including UACR variability and reduced eGFR sensitivity. While machine learning shows promise in diabetes prediction, its application to early DKD identification using routine parameters remains underexplored. This study aimed to develop and validate machine learning models incorporating routine blood and biochemical parameters for early DKD prediction.
Methods: This retrospective study analyzed 3,114 diabetic patients from the Second Affiliated Hospital of Wannan Medical College (EDN1) and 1,496 patients from NHANES 2005-2018 (EDN2) for external validation. Early DKD was defined as UACR 30–300 mg/g with eGFR ≥60 ml/min/1.73m². Seven machine learning algorithms were compared. Feature importance was assessed using SHAP framework, and Mendelian randomization explored causal relationships.
Results: Among 3,114 patients, 1,333 (42.8%) had early DKD. Logistic regression achieved optimal performance (AUC = 0.689, sensitivity=40.5%, specificity=81.3%). Top predictors included triglyceride-glucose index (TyG), gender, creatinine, globulin, and age. External validation confirmed significant associations for HbA1c, globulin, TyG, and neutrophil-to-albumin ratio.
Conclusions: The machine learning model successfully identified early DKD using routine parameters, with TyG index, HbA1c, and globulin as key predictors, demonstrating potential as a cost-effective screening tool.
1 Introduction
Diabetic kidney disease (DKD), the most common and severe microvascular complication of diabetes, has become the leading cause of end-stage renal disease (ESRD) worldwide (1). Its prevalence continues to rise alongside the global diabetes epidemic, imposing a substantial burden on patients’ quality of life and public health systems (2). Pathological changes in early-stage DKD exhibit a degree of reversibility; however, once the disease progresses to clinical proteinuria, renal function often deteriorates irreversibly (3). Therefore, early identification and intervention are crucial for delaying disease progression and improving prognosis (4).
Currently, the diagnosis of early-stage diabetic kidney disease in clinical practice relies primarily on the urine albumin-to-creatinine ratio (UACR) and estimated glomerular filtration rate (eGFR) (5). Nevertheless, UACR is subject to marked fluctuations throughout the day and can be affected by various confounders such as infections, physical activity, and changes in blood pressure. At the same time, eGFR shows limited sensitivity in identifying early or mild kidney dysfunction, which increases the likelihood of missed diagnoses in the initial stages of DKD. In addition, these assessments are not widely adopted in primary healthcare or routine physical check-ups, causing many high-risk patients to lose the chance for early detection (6). Recent studies indicate that DKD development is closely associated with systemic metabolic disorders and chronic inflammation (7). Routine blood and biochemical markers—including complete blood count, liver and kidney function, and lipid profiles—contain critical information about metabolic and inflammatory states (8). Composite indices derived from these markers, such as the triglyceride-glucose index (TyG), systemic immune-inflammation index (SII), and neutrophil-to-albumin ratio (NPAR), have been demonstrated to associate closely with insulin resistance and cardiovascular risk (9). However, their systematic application in comprehensive prediction of early DKD remains to be further explored (10).
With the rapid advancement of medical big data and artificial intelligence technologies, machine learning (ML) has emerged as a powerful tool for uncovering hidden patterns within high-dimensional, nonlinear clinical data (11). In diabetes research, ML models have been successfully applied to disease risk stratification and early diagnosis, even identifying individuals with normal fasting blood glucose levels who harbor diabetes, demonstrating predictive performance superior to traditional methods (12). However, existing research has focused primarily on predicting diabetes itself or its late-stage complications (13). Studies developing ML prediction models for early-stage diabetic kidney disease based on widely accessible routine examination indicators remain relatively scarce (14). Furthermore, most models lack independent external validation, limiting their clinical implementation and application.
To address this research gap, this study integrates 55 conventional blood and biochemical indicators (including baseline markers and inflammatory-metabolic composite indicators). Utilizing multiple machine learning algorithms such as logistic regression (LR), random forest (RF), and XGBoost, we constructed a model for identifying early-stage DKD (defined as UACR 30–300 mg/g and eGFR ≥ 60 ml/min/1.73m²). By systematically comparing algorithm performance, we selected the optimal predictive model. Feature importance analysis and model interpretability were conducted using the SHAP (Shapley Additive Explanations) framework. External validation using the National Health and Nutrition Examination Survey (NHANES) database assessed generalization capability. Furthermore, Mendelian randomization (MR) analysis explored causal relationships between potential risk factors and DKD, providing genetic evidence for risk mechanisms. This study aims to develop an efficient, cost-effective, and clinically scalable tool for early DKD diagnosis, offering novel strategies for early screening and personalized risk management in diabetic nephropathy. The complete analysis process is shown in Figure 1.
2 Methods
2.1 Data collection and processing
Retrospective study data were sourced from The Second Affiliated Hospital of Wannan Medical College, while external independent validation data were obtained from the NHANES database covering 2005–2018. The two datasets were designated as EDN1 and EDN2, respectively. The initial step involved data cleaning to exclude samples with missing values and outliers. Diabetic patients with HbA1c ≥ 6.5% were identified based on WHO criteria for diabetes. Early diabetic nephropathy was defined as eGFR ≥ 60 ml/min/1.73m² and 30 ≤UACR <300 mg/g, while the simple diabetes group was defined as eGFR ≥ 60 ml/min/1.73m² and UACR < 30 mg/g. After preprocessing, 3,114 and 1,496 samples were retained for EDN1 and EDN2, respectively. The dataset was first split into training (70%, n=2,180) and validation (30%, n=934) sets using stratified random sampling to maintain class proportions. All hyperparameter tuning was performed exclusively within the training set using 5-fold stratified cross-validation. The final model was selected based on mean AUC across the 5 folds, then retrained on the entire training set with optimal hyperparameters before evaluation on the held-out validation set. This nested approach prevents information leakage and provides unbiased performance estimates. Subsequently, EDN1 was split into training and validation sets using a 7:3 ratio via stratified random sampling, while EDN2 served as an independent test set. While an a priori sample size calculation was not performed due to the retrospective study design, post-hoc analysis confirmed adequate statistical power. Following the ‘events per variable’ principle (minimum 10 EPV), our 1,333 early DKD cases with 10 predictors yielded 133 EPV, substantially exceeding recommendations (15). Additionally, recent machine learning guidelines suggest samples >1,000 are adequate for binary classification tasks (16), and our combined cohort (n=4,610) exceeds this threshold.
All datasets contained 55 clinical characteristics, including: Gender, Age, Glycated Hemoglobin (HbA1c), Alanine Aminotransferase (ALT), Aspartate Aminotransferase (AST), Albumin (ALB), Globulin (GLO), Alkaline Phosphatase (ALP), Total Bilirubin (TBIL), Direct Bilirubin (DBIL), Creatinine (CREA), Urea (UREA), Uric Acid (UA), Triglycerides (TG), Total Cholesterol (CHOL), Low-Density Lipoprotein Cholesterol (LDL-C), High-Density Lipoprotein Cholesterol (HDL-C), Gamma-Glutamyl Transpeptidase (GGT), Fasting Blood Glucose (GLU), White Blood Cell Count (WBC), Red Blood Cell Count (RBC), Hemoglobin (HGB), Platelet Count (PLT), Hematocrit (HCT), Mean Corpuscular Volume (MCV), Mean Corpuscular Hemoglobin (MCH), Mean Corpuscular Hemoglobin Concentration (MCHC), Neutrophil Percentage (NEUT%), Lymphocyte Percentage (LYMPH%), Monocyte Percentage (MONO%), Basophil Percentage (BASO%), Eosinophil Percentage (EO%), Neutrophil Absolute Count (NEUT#), Lymphocyte Absolute Count (LYMPH#), Monocyte Absolute Count (MONO#), Eosinophil Absolute Count (EO#), Basophil Absolute Count (BASO#), Red Cell Distribution Width (RDW), Triglyceride-Glucose Index (TyG), Systemic Immune Inflammation Index (SII), Systemic Inflammatory Response Index (SIRI), Neutrophil-Lymphocyte Ratio (NLR), Platelet-Lymphocyte Ratio (PLR), Neutrophil-High-Density Lipoprotein Ratio (NHR), Monocyte-to-High-Density Lipoprotein Ratio (MHR), Platelet-to-High-Density Lipoprotein Ratio (PHR), Lymphocyte-to-High-Density Lipoprotein Ratio (LHR), Neutrophil-to-Monocyte Ratio (NMR), Adaptive Immune Inflammation Index (AISI), Monocyte-to-Lymphocyte Ratio (MLR), Neutrophil-Albumin Ratio (NPAR), Neutrophil-High-Density Lipoprotein-Hemoglobin Ratio (NHHR), Uric Acid-High-Density Lipoprotein Ratio (UHR), Atherosclerosis Index (AIP), and Stress-Induced Hyperglycemia Ratio (SHR). The formulae for composite indicators are provided in (Supplementary Table 1). Outliers were identified using the interquartile range (IQR) method, where values exceeding Q3 + 3×IQR or below Q1 - 3×IQR were flagged. This conservative threshold (3×IQR rather than 1.5×IQR) was selected to retain clinically plausible extreme values while excluding likely data entry errors. Additionally, physiologically implausible values were manually reviewed (e.g., negative laboratory values, HbA1c >20%). Given the presence of categorical imbalance across all datasets, normalization was performed on all datasets.
2.2 Machine learning methods
Seven machine learning algorithms were implemented and compared:
Logistic Regression (LR) belongs to generalized linear models and is widely used for binary classification problems. This model estimates the probability of an event occurring through linear combinations and sigmoid function transformations, with output range (0,1) representing the likelihood of a sample belonging to a specific category.
Decision Tree (DT) is a nonparametric supervised learning method based on tree structure that recursively partitions data through a series of rules to perform classification tasks. Its model structure is intuitive and interpretable but sensitive to noise and prone to overfitting.
Random Forest (RF) is an ensemble learning algorithm that constructs multiple decision trees and integrates their predictions to enhance generalization ability and robustness. This algorithm effectively handles high-dimensional features and exhibits strong tolerance for outliers and noise.
Support Vector Machine (SVM) is a classification model based on the maximum margin principle, utilizing kernel functions to address non-linear problems. Its optimization objective is to find the optimal hyperplane that separates samples of different classes.
XGBoost (eXtreme Gradient Boosting) is an efficient machine learning algorithm based on the gradient boosting framework. It iteratively constructs a series of weak classifiers (decision trees) while optimizing the loss function to progressively reduce bias and variance.
LightGBM (Light Gradient Boosting Machine) is another efficient gradient boosting algorithm based on decision tree ensembles. It employs histogram-based optimization strategies and leaf growth policies to significantly accelerate training speed and reduce memory usage.
Deep Neural Networks (DNNs) are neural network models with multiple hidden layers that can automatically extract high-level feature representations through multi-layer nonlinear transformations.
To eliminate the impact of inconsistent feature scales on model performance, all continuous features underwent standardization. Hyperparameter tuning employed five-fold cross-validation combined with grid search and manual fine-tuning strategies. All hyperparameters were selected using the area under the curve (AUC) as the performance metric within the training set via 5-fold cross-validation.
2.3 Model performance evaluation
To comprehensively evaluate the performance of classification models, this study employs multiple widely recognized metrics, including sensitivity, specificity, balanced accuracy, and area under the receiver operating characteristic curve (AUC). Given the significant class imbalance between the validation and test sets, these metrics provide a more holistic reflection of the model’s overall discriminative capability across majority and minority classes. The calculation formulas for each metric are as follows. True positives (TP) represent the number of correctly predicted positive samples, false positives (FP) denote the number of negative samples incorrectly predicted as positive, true negatives (TN) indicate the number of correctly predicted negative samples, and false negatives (FN) signify the number of positive samples misclassified as negative. All metrics range from 0 to 1, with higher values indicating superior model performance.
2.4 Model interpretation
To address the inherent “black box” nature of machine learning models, this study adopted the Shapley Additive Explanations (SHAP) framework based on cooperative game theory, providing a unified and interpretable method for quantifying feature importance. SHAP quantitatively assesses each feature’s marginal contribution to model output by assigning a SHAP value to each feature within each sample, thereby revealing the model’s decision-making mechanism at both individual and global levels.
2.5 Mendelian randomization analysis
To systematically evaluate the potential causal association between HbA1c levels and diabetic nephropathy, this study employed Mendelian randomization (MR) analysis using the TwoSampleMR software package (v0.5.6). Genetic instrumental variables were sourced from the GWAS database. Single nucleotide polymorphism (SNP) selection criteria included genome-wide significance (p <5×10-8), F-statistic >10 (to exclude weak instrument bias), and low linkage disequilibrium (r² <0.01). Causal effect estimates were derived using multiple methods to enhance robustness and reliability.
2.6 Statistical analysis
All statistical analyses were performed using R (version 4.4.3) and Python (version 3.10). The nonparametric Wilcoxon signed-rank test was employed to test for differences in characteristics between patient groups, with P <0.05 set as the threshold for statistical significance. Based on the finalized LR model, a nomogram model for individualized prediction was constructed using the rms R package.
3 Results
3.1 Baseline data results
We collected diabetes patient data from The Second Affiliated Hospital of Wannan Medical College between 2019 and 2025, enrolling a total of 3,114 eligible patients. These patients were categorized into those with uncomplicated diabetes and those with early diabetic nephropathy based on UACR criteria. (Table 1) describes the characteristics of individuals with and without early diabetic nephropathy, indicating that demographic and blood test indicators hold considerable potential for distinguishing early diabetic nephropathy patients within the diabetic population.
Table 1. Characteristics of patients with early diabetic nephropathy and those with diabetes mellitus alone.
3.2 Regression analysis for screening diagnostic markers
To investigate the association between early diabetic nephropathy and multiple clinical indicators, this study employed univariate and multivariate binary logistic regression analysis (Table 2). Univariate analysis revealed that multiple variables—including gender (male vs. female, OR = 0.69), age (OR = 1.02), HbA1c, GLO, TBIL, CREA, LDL, WBC, TyG index, and NPAR—were significantly associated with early-stage DKD (all P<0.05). Multivariate analysis further identified gender (male, OR = 0.63), age (OR = 1.02), HbA1c (OR = 1.06), GLO (OR = 1.05), TBIL (OR = 0.98), CREA (OR = 1.01), LDL (OR = 0.84), WBC (OR = 1.05), TyG index (OR = 1.39), and NPAR (OR = 1.05) as independent risk factors for early DKD (all P<0.05). Results indicate that male gender, increasing age, poor glycemic control, hyperglobulinemia, elevated creatinine, high white blood cell count, and elevated TyG index are significantly associated with increased risk of early-stage DKD (Supplementary Figure 1).
Table 2. Univariate and multivariate logistic regression analysis of patients with early diabetic nephropathy and patients with diabetes mellitus alone.
3.3 Comparative performance of machine learning methods
Supplementary Table 2 summarizes the performance of seven machine learning models on the validation set. The logistic regression model achieved the highest AUC of 0.689, with accuracy of 63.9%, precision of 61.8%, and F1 score of 0.489. While most models exhibited moderate sensitivity (Figure 2), LR demonstrated the highest sensitivity (40.5%) among all compared methods, whereas LightGBM achieved the highest specificity (92.1%). All models demonstrated comparable performance in accuracy and precision, yet sensitivity remained relatively low—likely due to the imbalanced nature of the early diabetic nephropathy dataset. Given the clinical priority of identifying early-stage kidney disease cases, we selected the logistic regression model due to its superior discriminative capability (AUC) and relatively higher sensitivity.
Figure 2. Performance comparison of multiple machine learning models in predicting early diabetic nephropathy (A) Receiver operating characteristic (ROC) curves for seven machine learning models on the training set. The area under the curve (AUC) value for each model is annotated next to its respective curve. The dashed grey line represents a classifier with no discriminative power (AUC = 0.5). (B) ROC curves for the same models evaluated on the independent test set. (C) Calibration curves for the models on the training set. The diagonal dashed line represents perfect calibration, where predicted probabilities perfectly match observed event frequencies. The degree of deviation from this line for each model’s curve reflects its calibration error. (D) Calibration curves for the models on the test set. (E) Decision curve analysis (DCA) for the models on the training set. The y-axis represents the net benefit calculated across a range of threshold probabilities (x-axis). The grey solid line assumes no patients have the event (‘Treat None’), and the solid black line assumes all patients have the event (‘Treat All’). A model curve higher than these lines indicates clinical utility across that threshold range. (F) Decision curve analysis for the models on the test set.
3.4 Feature importance ranking
To determine the characteristics contributing most significantly to predicting early diabetic nephropathy, we ranked their importance based on weights in the logistic regression model (Figures 3A, B). The top five variables were TyG, gender, CREA, GLO, and age. Among these, TyG exhibited the highest positive contribution, indicating its crucial role in identifying early nephropathy. Gender, CREA, and GLO demonstrated substantial predictive weights, suggesting that renal function indicators and globulin levels hold key positions in the predictive model. To develop a disease risk prediction tool, we employed logistic regression to construct a nomogram model that quantifies the predictive contribution of each clinical feature (Figure 3C).
Figure 3. SHAP feature importance analysis and nomogram construction for disease risk prediction model (A) SHAP feature importance bar chart, showing the average absolute SHAP value of each feature. Larger values indicate a more significant impact on model predictions. (B) SHAP bee swarm plot, showing the distribution of feature values and their corresponding SHAP values for each sample. Color represents high and low feature values (red for high, blue for low), and the degree of horizontal spread reflects the stability of the feature’s influence. (C) Logistic regression nomogram, used to calculate total points based on individual feature values (such as gender, age, HBA1c), and map them to risk probabilities, supporting rapid assessment of disease risk in clinical practice.
3.5 NHANES database validation
Data from the NHANES database (2005-2018) were selected for external validation. After exclusion, 1,496 diabetic patients remained (Supplementary Table 3), with 371 (24.8%) classified into the early diabetic nephropathy group and 1,125 (75.2%) into the simple diabetes group. The mean GLO level was higher in the early diabetic nephropathy group than in the simple diabetes group (31.69 [SD, 4.93] vs 30.64 [SD, 4.57]; P<0.001). Similarly, mean HbA1c (8.54 [SD, 2.08] vs 7.93 [SD, 1.62]; P<0.001), TyG (9.48 [SD, 0.84] vs 9.27 [SD, 0.74]; P<0.001), and NPAR (14.56 [SD, 2.64] vs 14.02 [SD, 2.53]; P<0.001) were significantly higher in the early DKD group.
3.6 NHANES database external validation analysis
To investigate independent associations between multiple metabolic indicators and disease risk, univariate and multivariate logistic regression models were employed (Supplementary Table 4). In multivariate analysis, HbA1c (OR = 1.14, 95% CI: 1.06–1.22, P<0.001), GLO (OR = 1.05, 95% CI: 1.02–1.07, P<0.001), TyG (OR = 1.27, 95% CI: 1.07–1.51, P = 0.007), and NPAR (OR = 1.08, 95% CI: 1.03–1.14, P<0.001) maintained significant independent associations. Restricted cubic spline analysis revealed nonlinear relationships (Figures 4A–D): HbA1c (P-overall <0.001; P-non-linear=0.099), GLO (P-overall <0.001; P-non-linear=0.205), and TyG (P-overall <0.001; P-non-linear=0.057) exhibited significant overall association trends, demonstrating clear dose-response relationships. Mendelian randomization analysis confirmed HbA1c as a risk factor for diabetic nephropathy (Figures 4E, F).
Figure 4. Analysis of the association between key indicators and the risk of diabetic nephropathy (A) The RCS curve for HbA1c and diabetic nephropathy risk demonstrates the relationship between OR values (95% CI) and HbA1c levels. P-overall <0.001 indicates overall significance, while P-non-linear = 0.099 indicates non-linearity is not significant. (B) RCS curve of GLO versus diabetic nephropathy risk. OR values (95% CI) show a positive correlation. P-overall <0.001 indicates overall significance, while P-non-linear = 0.205 indicates non-linearity is not significant. (C) RCS curve of TyG versus diabetic nephropathy risk. OR (95% CI) indicates positive correlation. P-overall <0.001 indicates overall significance. P-non-linear = 0.057 indicates near-significant nonlinearity. (D) RCS curve for NPAR and diabetic nephropathy risk. OR (95% CI) shows weak positive correlation, P-overall = 0.058 indicates marginal overall significance, P-non-linear = 0.449 indicates non-linearity is not significant. (E) Mendelian randomization scatterplot illustrating the effects of genetic variation (SNP) on HbA1c (exposure) and diabetic nephropathy (outcome). Dotted lines represent fitted curves from different MR methods (e.g., MR Egger), suggesting causal associations. (F) Mendelian randomization forest plot listing effect sizes and 95% CIs for each SNP on HbA1c and diabetic nephropathy. The bottom summary line displays the overall causal estimate (e.g., inverse variance weighted), supporting a causal effect of HbA1c.
4 Discussion
This study successfully constructed and validated a machine learning predictive model for early diabetic kidney disease based on routine blood and biochemical indicators. By comparing seven machine learning algorithms, logistic regression was selected as the optimal model (AUC = 0.689) and underwent external validation in the NHANES database. Glycated hemoglobin (HbA1c), globulin (GLO), and the triglyceride-glucose index (TyG) were identified as the most predictive factors. Mendelian randomization analysis further provided genetic evidence for a causal relationship between HbA1c and DKD.
Compared to previous applications of machine learning in diabetes diagnosis, this study extends the predictive target to identification of early-stage DKD (17, 18). Although ensemble algorithms like XGBoost and LightGBM demonstrate superior performance in certain medical prediction tasks, they did not surpass LR performance in this study (19, 20). This phenomenon may be attributed to the scarcity of early-stage DKD samples and significant class imbalance, which makes complex ensemble algorithms more prone to overfitting on the majority class (21). The LR model, leveraging its inherent regularization mechanism and linear assumption, demonstrates superior generalization capability under limited sample conditions (22).
The model’s relatively low sensitivity (40.5%) reflects the inherent diagnostic challenges of early-stage DKD. Pathological alterations in early-stage DKD are relatively subtle, with limited variations in conventional blood markers. Combined with the heterogeneity of disease progression, this makes it difficult for prediction models based on traditional biochemical indicators to achieve ideal sensitivity. However, considering this model is primarily positioned as a primary screening tool, its high specificity (81.3%) helps reduce false-positive results, thereby avoiding unnecessary consumption of medical resources and patient anxiety.
The TyG index ranked first in feature importance, underscoring the central role of insulin resistance in DKD pathogenesis. Insulin resistance accelerates glomerulosclerosis and interstitial fibrosis through multiple mechanisms, including activation of the renin-angiotensin-aldosterone system, promotion of advanced glycation end-product formation, and induction of oxidative stress and inflammatory responses (23, 24). As a convenient assessment tool for insulin resistance, the TyG index’s prominent position in DKD prediction provides a clear target for early clinical intervention.
The inclusion of CREA and GLO as traditional markers of renal function and inflammation in the model has solid theoretical foundation (25). Notably, even in early-stage DKD patients with eGFR ≥60 ml/min/1.73 m², subtle changes in CREA retain predictive value, suggesting that serum creatinine’s sensitivity in assessing early renal impairment may be underestimated (26). Elevated GLO reflects chronic systemic inflammation, a key driver of DKD progression (27). The inclusion of the novel indicator NPAR further supports the role of the “inflammation-malnutrition-atherosclerosis” syndrome in DKD pathogenesis (28). To assess multicollinearity, variance inflation factors (VIF) were calculated for all predictors. While moderate collinearity was observed between TyG and its component variables (TG, glucose), no features exceeded the VIF>10 threshold for exclusion. Comparative analysis of base laboratory values alone, composite indices alone, and combined features demonstrated that the combined approach achieved optimal predictive performance, though a VIF-pruned model offered comparable results with improved coefficient stability (Supplementary Table 7).
External validation using the NHANES database significantly enhances study reliability. Validation results demonstrate that key predictors (HbA1c, GLO, TyG, NPAR) retain substantial predictive value in an independent U.S. population, confirming the cross-population consistency of these biomarkers. Restricted cubic spline analysis revealed dose-response relationships between HbA1c, GLO, and TyG with DKD risk, providing a refined basis for clinical risk stratification. However, the weakened predictive effects of certain traditional indicators in external validation suggest the influence of population heterogeneity. Differences in genetic backgrounds, lifestyles, and healthcare systems between Chinese and American populations may influence biomarker expression and predictive value (29, 30).
Mendelian randomization analysis provides robust genetic evidence for the causal relationship between HbA1c and DKD, overcoming limitations of observational studies such as confounding factors and reverse causality (31). This finding validates the fundamental role of strict glycemic control in primary prevention of DKD and provides theoretical support for the critical role of HbA1c in our predictive model (32).
The nomogram developed in this study provides clinicians with an intuitive and user-friendly risk assessment tool (33). Predictive models based on routine examination indicators offer advantages of high cost-effectiveness and strong accessibility, making them particularly suitable for implementation in primary care settings and resource-limited regions (34). Clinicians can rapidly identify individuals at high risk for DKD based on patients’ routine laboratory results, thereby implementing targeted monitoring and intervention strategies.
This study has several limitations: First, the single-center retrospective design limits research representativeness; a multicenter prospective study would better validate model generalizability. Second, the relatively scarce early-stage DKD samples may cause category imbalance affecting model performance; future studies may consider optimizing training strategies using methods such as Synthetic Minority Over-sampling Technique (SMOTE). Third, cross-sectional studies cannot assess the impact of dynamic changes on predictive accuracy; longitudinal cohort studies would provide more comprehensive risk assessment. Fourth, the absence of kidney biopsy confirmation means that some patients classified as early DKD may have had CKD from other etiologies. This diagnostic ambiguity, inherent to clinical practice, may contribute to the model’s moderate sensitivity. However, this reflects real-world screening scenarios where biopsy is rarely performed for early-stage kidney disease. Finally, the model did not incorporate kidney-specific biomarkers or genetic risk scores, which may limit further improvements in discriminatory performance.
5 Conclusions
This study successfully developed a machine learning model based on routine blood and biochemical indicators for prediction and risk stratification of early diabetic kidney disease. Key predictors such as HbA1c, GLO, and TyG were identified, providing valuable methodological contributions for non-invasive, low-cost screening of early DKD. This approach holds promise for playing a significant role in clinical practice, particularly in primary care settings and resource-limited environments where comprehensive kidney function assessment may not be readily available.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.
Ethics statement
The study was approved by the Ethics Committee of Wannan Medical College (WNMC-AWE-2025199). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.
Author contributions
WY: Validation, Methodology, Supervision, Writing – original draft. D-dP: Conceptualization, Writing – review & editing, Writing – original draft. KY: Project administration, Investigation, Writing – original draft, Resources. J-jG: Conceptualization, Supervision, Methodology, Funding acquisition, Writing – review & editing, Software, Project administration, Formal analysis, Writing – original draft, Resources, Data curation, Validation, Investigation, Visualization. R-xC: Supervision, Writing – review & editing, Methodology, Conceptualization, Investigation, Writing – original draft, Software, Funding acquisition, Visualization, Formal analysis, Project administration, Validation, Data curation, Resources.
Funding
The author(s) declared that financial support was received for this work and/or its publication. The Natural Science Foundation of Anhui Education Department (2025AHGXZK31079), Wuhu Municipal Science and Technology Program Project (2024kj057), Key Project Research Fund of Wannan Medical College (WK2024ZZD35), Wannan Medical College Middle-aged and Young Research Fund 2024 (WK2024ZQNZ84), Lianyungang Cancer prevention and Treatment Science and Technology Development Program Project (QN2023010),General Project of Scientific Research Development Fund of Nanjing Medical University Kangda College (KD2023KYJJ060).
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fendo.2026.1720574/full#supplementary-material
References
1. Wang H, Liu D, Zheng B, Yang Y, Qiao Y, Li S, et al. Emerging role of ferroptosis in diabetic kidney disease: molecular mechanisms and therapeutic opportunities. Int J Biol Sci. (2023) 19:2678–94. doi: 10.7150/ijbs.81892
2. Meng L, Lu Y, Wang X, Cheng C, Xue F, Xie L, et al. NPRC deletion attenuates cardiac fibrosis in diabetic mice by activating PKA/PKG and inhibiting TGF-beta1/Smad pathways. Sci Adv. (2023) 9:eadd4222. doi: 10.1126/sciadv.add4222
3. Kang Y, Jin Q, Zhou M, Li Z, Zheng H, Li D, et al. Predictive value of bone metabolism markers in the progression of diabetic kidney disease: a cross-sectional study. Front Endocrinol. (2024) 15:1489676. doi: 10.3389/fendo.2024.1489676
4. He J, Wang X, Zhu P, Wang X, Zhang Y, Zhao J, et al. Identification and validation of an explainable early-stage chronic kidney disease prediction model: a multicenter retrospective study. EClinicalMedicine. (2025) 84:103286. doi: 10.1016/j.eclinm.2025.103286
5. Ni J, Wu P, Lu X, and Xu C. Examining the cross-sectional relationship of platelet/high-density lipoprotein cholesterol ratio with depressive symptoms in adults in the United States. BMC Psychiatry. (2024) 24:427. doi: 10.1186/s12888-024-05878-x
6. Yan P, Li J, Zhang Y, Dan X, Wu X, Zhang X, et al. : association of circulating carbohydrate antigen 19–9 level with type 2 diabetic kidney disease in chinese adults: A cross-sectional study. Diabetes Metab Syndr Obes. (2024) 17:467–77. doi: 10.2147/DMSO.S434972
7. Kaliannan K, Robertson RC, Murphy K, Stanton C, Kang C, Wang B, et al. Estrogen-mediated gut microbiome alterations influence sexual dimorphism in metabolic syndrome in mice. Microbiome. (2018) 6:205. doi: 10.1186/s40168-018-0587-0
8. Giustarini D, Colella S, Dalle-Donne I, and Rossi R. Homocysteine: canary in the coal mine or hidden threat? A biochemical study on the role of plasma thiols. FASEB J. (2025) 39:e70772. doi: 10.1096/fj.202500677RR
9. Lu J, Shen Y, Liu X, Mao Y, Jing L, Yang Z, et al. Increased prevalence of cardio-cerebrovascular risk factors during the COVID-19 pandemic lockdown: a large, single center, cross-sectional study. BMC Med. (2025) 23:414. doi: 10.1186/s12916-025-04193-w
10. Ye S, Zhang M, Zheng X, Li S, Fan Y, Wang Y, et al. : YAP1 preserves tubular mitochondrial quality control to mitigate diabetic kidney disease. Redox Biol. (2024) 78:103435. doi: 10.1016/j.redox.2024.103435
11. Bettencourt C, Skene N, Bandres-Ciga S, Anderson E, Winchester LM, Foote IF, et al. : Artificial intelligence for dementia genetics and omics. Alzheimers Dement. (2023) 19:5905–21. doi: 10.1002/alz.13427
12. Peng T, Miao R, Xiong H, Lin Y, Fan D, Ren J, et al. Prediction of insulin resistance in nondiabetic population using lightGBM and cohort validation of its clinical value: cross-sectional and retrospective cohort study. JMIR Med Inform. (2025) 13:e72238. doi: 10.2196/72238
13. Zhu ZX, Peng YX, Ye JH, and Wei Y. The economic burden of colorectal cancer in type 2 diabetes mellitus (T2DM) in shanghai, China. Cancer Med. (2025) 14:e70651. doi: 10.1002/cam4.70651
14. Lv K, Cui C, Fan R, Zha X, Wang P, Zhang J, et al. : Detection of diabetic patients in people with normal fasting glucose using machine learning. BMC Med. (2023) 21:342. doi: 10.1186/s12916-023-03045-9
15. Peduzzi P, Concato J, Kemper E, Holford TR, and Feinstein AR. A simulation study of the number of events per variable in logistic regression analysis. J Clin Epidemiol. (1996) 49:1373–9. doi: 10.1016/S0895-4356(96)00236-3
16. Riley RD, Ensor J, Snell KIE, Harrell FE Jr., Martin GP, Reitsma JB, et al. Calculating the sample size required for developing a clinical prediction model. BMJ. (2020) 368:m441. doi: 10.1136/bmj.m441
17. Jiang X, Liu X, Qu X, Zhu P, Wo F, Xu X, et al. Integration of metabolomics and peptidomics reveals distinct molecular landscape of human diabetic kidney disease. Theranostics. (2023) 13:3188–203. doi: 10.7150/thno.80435
18. Chan L, Nadkarni GN, Fleming F, McCullough JR, Connolly P, Mosoyan G, et al. Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease. Diabetologia. (2021) 64:1504–15. doi: 10.1007/s00125-021-05444-0
19. Jin T and Halili A. Predicting the risk of depression in older adults with disability using machine learning: an analysis based on CHARLS data. Front Artif Intell. (2025) 8:1624171. doi: 10.3389/frai.2025.1624171
20. Su T, Zhang P, Zhang B, Liu Z, Xie Z, Li X, et al. Risk prediction of stroke-associated pneumonia in acute ischemic stroke with atrial fibrillation using machine learning models. Front Artif Intell. (2025) 8:1595101. doi: 10.3389/frai.2025.1595101
21. O’Brien R and Ishwaran H. A random forests quantile classifier for class imbalanced data. Pattern Recognit. (2019) 90:232–49. doi: 10.1016/j.patcog.2019.01.036
22. Chiu WY, Zhang C, Gao R, Hu Z, and Chen G. Accurate prediction of open-circuit voltages of lithium-ion batteries via delta learning. J Chem Theory Comput. (2025) 21:5230–5. doi: 10.1021/acs.jctc.5c00168
23. Fan Y, Ye D, Zhou K, Cai L, and Yu L. The novel triglyceride–glucose-weighted adjusted waist index as a supplementary diagnostic tool for heart failure: evidence of improved reclassification beyond traditional TyG-related indices from a cross-sectional study. Cardiovasc Diabetol. (2025) 24:329. doi: 10.1186/s12933-025-02896-1
24. Stanton AM, Vaduganathan M, Chang LS, Turchin A, Januzzi JL Jr., and Aroda VR. Asymptomatic diabetic cardiomyopathy: an underrecognized entity in type 2 diabetes. Curr Diabetes Rep. (2021) 21:41. doi: 10.1007/s11892-021-01407-2
25. Schiffl H and Lang SM. Current approach to successful liberation from renal replacement therapy in critically ill patients with severe acute kidney injury: the quest for biomarkers continues. Mol Diagn Ther. (2021) 25:1–8. doi: 10.1007/s40291-020-00498-z
26. Li J, Zhou Q, Liu Z, and Zou H. Association of insulin resistance with chronic kidney disease in individuals without diabetes in a community population in South China. BMC Nephrol. (2024) 25:437. doi: 10.1186/s12882-024-03866-7
27. Wang W, Ke B, Wang C, Xiong X, Feng X, and Yan H. Targeting ion channel networks in diabetic kidney disease: from molecular crosstalk to precision therapeutics and clinical innovation. Front Med. (2025) 12:1607701. doi: 10.3389/fmed.2025.1607701
28. Andrade-Sierra J, Pazarin-Villasenor L, Yanowsky-Escatell FG, Diaz-de la Cruz EN, Garcia-Sanchez A, Cardona-Munoz EG, et al. The influence of the severity of early chronic kidney disease on oxidative stress in patients with and without type 2 diabetes mellitus. Int J Mol Sci. (2022) 23(19):11196. doi: 10.3390/ijms231911196
29. Qiu J, Li J, Xu S, Yang J, Zeng H, Zhang Y, et al. et al: Triglyceride glucose-weight-adjusted waist index as a cardiovascular mortality predictor: incremental value beyond the establishment of TyG-related indices. Cardiovasc Diabetol. (2025) 24:306. doi: 10.1186/s12933-025-02873-8
30. Xiong Z, Lai Z, Li S, Zhang H, Wang Y, Tan L, et al. Potential kidney risks associated with clinical doses of omeprazole: in vivo and in vitro studies. Clin Exp Pharmacol Physiol. (2025) 52:e70050. doi: 10.1111/1440-1681.70050
31. Al Ageeli E. Exploring the genetic roles of diet and other modifiable risk factors in the risk of angina: A causal investigation using mendelian randomization in UK biobank and finnGen cohorts. Life (Basel). (2024) 14. doi: 10.3390/life14070905
32. Rein M, Ben-Yacov O, Godneva A, Shilo S, Zmora N, Kolobkov D, et al. et al: Effects of personalized diets by prediction of glycemic responses on glycemic control and metabolic health in newly diagnosed T2DM: a randomized dietary intervention pilot trial. BMC Med. (2022) 20:56. doi: 10.1186/s12916-022-02254-y
33. Hou Q, Zhao Y, Lin Z, Chen T, Di X, Wang X, et al. et al: correlation between systemic immune inflammation index(Sll) and outcome after occlusion in patients with post-infarction ventricular septal rupture. J Inflammation Res. (2025) 18:6641–52. doi: 10.2147/JIR.S518540
Keywords: diabetic kidney disease, early diagnosis, machine learning, risk prediction, routine blood parameters
Citation: Yong W, Peng D-d, Ye K, Gao J-j and Cao R-x (2026) Machine learning model based on routine blood and biochemical parameters for early diagnosis of diabetic kidney disease. Front. Endocrinol. 17:1720574. doi: 10.3389/fendo.2026.1720574
Received: 08 October 2025; Accepted: 05 January 2026; Revised: 28 December 2025;
Published: 28 January 2026.
Edited by:
Cem Haymana, University of Health Sciences, TürkiyeReviewed by:
Ashwin Dhakal, The University of Missouri, United StatesSheng Ding, The Central Hospital of Wuhan, China
Copyright © 2026 Yong, Peng, Ye, Gao and Cao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jun-jie Gao, d25tY2p5a2pqQDE2My5jb20=; Ruo-xue Cao, Y2FvcnVveHVlOTQ5MEAxNjMuY29t
†These authors have contributed equally to this work
Wei Yong1†