Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Endocrinol., 15 December 2025

Sec. Clinical Diabetes

Volume 16 - 2025 | https://doi.org/10.3389/fendo.2025.1692917

This article is part of the Research TopicHarnessing Machine Learning for Enhanced Biomedical Diagnosis and Early Disease Detection: Bridging Data Science and HealthcareView all 9 articles

Development and external validation of machine-learning based models to predict diabetic foot ulcer in diabetes population

Yanji Zhang&#x;Yanji Zhang1†Yingying Tian&#x;Yingying Tian2†Yang JianYang Jian1Zairong WeiZairong Wei1Silang CaiSilang Cai3Gaofengi Zhang*Gaofengi Zhang2*Chengliang Deng*Chengliang Deng1*
  • 1Department of Burns and Plastic Surgery, Affiliated Hospital of Zunyi Medical University, Zunyi, Guizhou, China
  • 2Department of Radiology, The Second Affiliated Hospital of Zunyi Medical University, Zunyi, Guizhou, China
  • 3Department of Medical Education, Kweichow Moutai Hospital, Zunyi, Guizhou, China

Background: Diabetic foot ulcer (DFU) is a common and serious complication in patients with diabetes, which affects the quality of life greatly as well as brings high risk for mortality. Identification of high-risk individuals, as early as possible is important for efficient intervention and prevention. This study systematically evaluates and summarizes the diagnostic accuracy of machine learning approaches for predicting DFU risk in diabetic patients.

Methods: This study adhered to the TRIPOD+AI (Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis, Extended for Artificial Intelligence) guidelines. Using data from the National Health and Nutrition Examination Survey (NHANES) 1999–2004 to determine diagnosis of DFU related clinical characteristics, laboratory indicators and lifestyle-related variables. The diagnostic performance of its models trained using Logistic Regression (LR), K-Nearest Neighbors (KNN), Random Forest (RF), Extreme Gradient Boosting (XGBoost) and Support Vector Machine (SVM) classifiers were compared. An independent testing dataset collected from the Second Affiliated Hospital of Zunyi Medical University was used to conduct external validation.

Results: This study included 1, 857 participants from NHANES and 807 individuals recruited at the testing dataset. Key predictors identified in NHANES were numbness in extremities, direct HDL cholesterol, lymphocyte, white blood cell, segmented neutrophils, and BMI. Among them, the RF was identified as having the highest area under receiver operating characteristic curve (AUC) for NHANES at 0.81. The RF model also had the highest discriminative performance in external validation (as measured by an AUC of 0.79). Other models also provided good results in external validation: XGBoost had an AUC of 0.76, SVM reached 0.72, KNN reached 0.70, and LR received a score of 0.69.

Conclusion: The ability of machine learning models to predict DFU risk was good in a combined population cohort when measured using common metrics but varied across distinct regions. These results support future clinical evaluation of these models and underscore the need to select algorithms a priori based on the target patient population.

1 Introduction

Diabetes mellitus is a chronic metabolic disorder, which has remained as one of the largest global public health problems steadily increasing prevalence. An estimated 463 million adults live with diabetes globally and, without effective prevention strategies in place the number is likely to rise to 693 million by the year of 2045 (1). The data show China at the top of global rankings, with some 116.4 million sufferers overall. About 40% population of diabetics is unaware of their disease; moreover, companion disorders are chronic in the same way source (2). Complications such as cardiovascular disease, nephropathy, retinopathy, and diabetic foot ulcer (DFU) largely contribute to the risk of disability, death, and healthcare burden (3).

DFU is one of the serious chronic diabetic complications, usually involuing skin breakdown, infection, and even deep tissue destruction in the foot, mainly caused by neuropathy and microvascular disease (4). Worldwide, the risk of lower limb amputation in patients with DFU throughout their lifetime is as high as 20%, and 5-year mortakity rate is between 50% and 70% (5). In China, DFU prevalence is still on the increase, and most cases are only diagnosed at advanced stages, resulting in increased recurrence and amputation rates (6). Delayed recognition and intervention greatly elevate the risks of infection, gangrene, amputation, and even death, seriously affecting patients’ quality of life and causing a heavy healthcare burden (7, 8). Thus, the early detection of individuals at high risk and timely, personalized intervention are of great importance for ulcer prevention and outcome inprovement in patients.

The development of foot ulcer in patients with diabetes is a complex phenomenon involving both intrinsic and extrinsic factors. Existing literature has shown that the prevalence of DFU is closely associated with a constellation of clinical characteristice, biochemical markers, and lifestyle factors (9). Evidence has demonstrated that patients with diabetic retinopathy have a 1.74 time higher risk of developing DFU, whereas peripheral vascular disease has been identified as an independent risk factor for DFU in particular among Asian populations (10). A large-scale observational study in Europe also provided further evidence that improvement in glycemic and lipid control in patients with type 1 diabetic (T1D) and the elimination of smoking and alcohol consumption in patients with type 2 diabetes (T2D) were effective strategies to reduce the risk of DFU (11). In addition, numerous studies have reported that increased concentrations of HbA1c and C-reactive protein (CRP) were positively correlated with the severity of DFU-as reflected by increased Wagner grades-and were significantly associated with an elevated risk of DFU (12; 13).

Considering the multifactorial nature of DFU and the growing availability of clinical and biochemical datasets, machine learning (ML) methods have emerged as promising tools for integrating complex risk factors and strengthening early risk diagnosis (14). Compared with traditional statistical methods, ML algorithms have better ability in handling high-dimensional data, detecting nonlinear relationships, and optimizing diagnostic efficiency (15). Previous research has initially suggested the prospects of ML models in predicting the risk of DFU. However, limitations like reliance on single data types (e.g., clinical characteristics or images), inability to incorporate multidimensional variables, and absence of external validation have hindered the generalizability of such models to diverse populations (16; 17).

The purpose of this research is to determine DFU risk factors based on the 1999–2004 NHANES data and to construct ML models to predict DFU risk. Both model generalizability and clinical usefulness were tested with an external dataset. Through the incorporation of multidimensional variables and the comparing of algorithm performance, it lays the groundwork for the early detection and indivdualized risk determination of DFU.

2 Materials and methods

2.1 Study design and population

This study was conducted and reported in accordance with the TRIPOD+AI (Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis, Extended for Artificial Intelligence) guidelines to ensure transparency and clarity in the development and validation of the prediction model (18). This research used information from two sources: the National Health and Nutrition Examination Survey (NHANES) and a separate clinical dataset from the Second Affiliated Hospital of Zunyi Medical University in China.

NHANES is a nationally representative survey conducted biennially by the National Center for Health Statistics (NCHS) to assess the health and nutritional status of the U.S. population through household interviews and mobile examination centers. The program was approved by the NCHS Ethics Review Board, and all participants provided written informed consent. For this study, NHANES data from 1999 to 2004 were used, yielding an initial sample of 31, 126 participants. Participants without diabetes, those under 40 years of age, and individuals without skin ulceration or whose ulcers/sores had healed within 4 weeks of onset were excluded from the study. (Figure 1). More information about NHANES is available at: https://wwwn.cdc.gov/nchs/nhanes.

Figure 1
Flowchart depicting participant selection. From NHANES (1999-2004), 31,126 participants; 29,090 excluded for being non-diabetic; 2,036 included; 179 excluded based on age, ulcers, healing criteria, leaving 1,857. From Zunyi (2023-2025), 11,537 participants; 8,584 non-diabetic, 1,567 excluded due to amputation history or data issues; 1,386 included; 579 further excluded, leaving 807 for validation. Finals: DFU 162, non-DFU 1,659; DFU 140, non-DFU 667.

Figure 1. Flow chart of subject inclusion and exclusion.

Clinical data of diabetic patients who were admitted to the Second Affiliated Hospital of Zunyi Medical University between June 2023 and May 2025 were retrospectively collected for the creation of the external validation cohort. To ensure consistency, participants were selected based on the same inclusion criteria set for the NHANES dataset. Additionally, patients who had foot ulcers due to non-diabetic causes (like trauma, autoimmune diseases, or malignancies), a reported histort of lower limb amputation, or incomplete clinical or laboratory data were excluded from the study. The study was approved by the relevant institutional ethics committee (No. KYLL-2025-128), and written informed consent was obtained from all participants.

2.2 Definition of diabetes and diabetic foot ulcers

Participants were classified as having diabetes mellitus if they met at least one of the following criteria: 1) glycohemoglobin (HbA1c) of ≥ 6.5%; 2) random blood glucose of ≥ 11.1 mmol/L; 3) fasting blood glucose of ≥ 7.0 mmol/L; 4) current use of anti-glycemic drugs; or 5) a prior diagnosis of diabetes made by a healthcare provider. DFU was harmonized across cohorts using the “unhealed ≥4 weeks” construct. In NHANES, DFU was self-reported; in the external validation cohort, DFU was clinician-verified according to the Chinese Guidelines for the Diagnosis and Treatment of Diabetic Foot, using the same ≥ 4-week observation window (1921).

2.3 Data collection

Data were collected from both cohort in four different domains: demographics including age, gender, and body mass index (BMI); medical history such as hypertension, hypercholesterolemia, coronary heart disease (CHD), stroke, retinopathy, numbness in hands or feet, leg pain while walking, and kidney weakness; lifestyle behaviors including smoking and alcohol consumption; and laboratory parameters such as HbA1c, fasting blood glucose, random blood glucose, total cholesterol, direct high-density lipoprotein cholesterol (HDL-C), urine creatinine, urine albumin, serum creatinine, albumin, total calcium, phosphorus, potassium, CRP, white blood cell count (WBC), segmented neutrophils, lymphocytes, and hemoglobin. Detailed definitions and descriptions of all study variables were provided in Supplementary Table S1.

2.4 Statistical analysis

The analyses were run separately using R version 4.2.2 and Python version 3.12. Predictors were processed with one-hot encoding for categorical variables and z-score standardization for continuous variables; ‘don’t know/refused’ were treated as missing after unit harmonization and LOD code handling. The continuous variables were reported as means ± standard deviations (SD), and categorical variables were reported as proportions. T-tests for continuous variables and χ2 tests for categorical variables were used to etermine baseline differences between DFU participants and non-DFU participants. Variables significant in univariate analyses were entered into a multivariable logistic regression to identify independent risk factors, from which coefficients, odds ratios (ORs), 95% confidence intervals (CIs), and p-values were estimated. The statistical significance level was set at P < 0.05.

To develop and evaluate predictive models, five machine learning algorithms were used, such as logistic regression (LR), random forest (RF), extreme gradient boosting (XGBoost), k-nearest neighbors (KNN), and support vector machine (SVM). The NHANES dataset was subjected to random splitting into a training set of 70% and an internal validation set of 30%. Missing values were imputed by mode for categorical variables and by median for continuous variables (Supplementary Table S2). Hyperparameter optimization was carried out on the training dataset using stratified 5-fold cross-validation: grid search for LR, RF, SVM, KNN, and XGBoost, and randomized search followed by a reduced-grid search for XGBoost. To counter class imbalance, class weights (and scale_pos_weight for XGBoost) were used, achieving better performance without risking severe overfitting (Supplementary Table S3). The performance of the models was evaluated on the internal validation set and external validation set using several metrics that included the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, precision, and F1 score. The optimal threshold was determined by maximizing Youden’s J statistic to achieve the best balance between sensitivity and specificity. In addition, decision curve analysis and calibration curve methodology were used to evaluate the clinical usefulness and the agreement between the predicted and actual outcomes of the models, respectively.

3 Results

3.1 Participant characteristics

Table 1 shows the baseline characteristics of all participants included. Overall, 2, 664 patients were enrolled, of whom 1, 857 were from the NHANES cohort (162 with DFU and 1, 695 without DFU) and 807 were from the external validation cohort (140 with DFU and 667 without DFU). Within the NHANES cohort, no significant between-group differences were noted for age (65.37 ± 11.95 vs. 65.20 ± 11.94 years) or gender distribution (male: 56.17% vs. 50.86%, P = 0.23) between the DFU and non-DFU groups. The DFU group, however, had significantly higher prevalence rates of retinopathy, numbness in the extremities, pain in the legs when walking, impaired renal function, and abnormal levels of a number of inflammation- and kidney-associated biomarkers (e.g., urine albumin, segmented neutrophils, creatinine). In the external validation cohort, DFU patients tended to be slightly older than non-DFU patients (62.03 ± 11.35 vs. 60.14 ± 8.97 years), and the percentage of males was similarly slightly higher (58.57% vs. 49.78%), although neither of these differences was statistically significant. Relative to the non-DFU group, DFU patients had worse metabolic and vascular profiles, with higher prevalence rates of retinopathy, neuropathy, stroke, and hypertension, as well as significant abnormalities in laboratory measures such as glycohemoglobin, CRP, creatinine, and hemoglobin.

Table 1
www.frontiersin.org

Table 1. Characteristics of study participants.

3.2 Logistic regression analysis

In order to detect independent predictors of DFU, we conducted univariate and multivariate logistic regression analysis (Table 2). Six variables were independently related: numbness in the extremities (Coef = 1.34, OR = 3, 82, 95% CI: 2.68–5.49, P < 0.01), direct HDL cholesterol (Coef =1.54, OR = 4.65, 95% CI: 1.66-13.02, P < 0.01), lymphocyte (Coef = -2.76, OR = 0.06, 95% CI: 0.01-0.14, P = 0.01), WBC (Coef = 2.03, OR = 7.63, 95% CI: 1.54-37.77, P = 0.01), segmented neutrophils (Coef = -0.91, OR = 0.40, 95% CI: 0.30-0.54, P = 0.02), and BMI (Coef = 0.07, OR = 1.07, 95% CI: 1.01-1.14, P = 0.04).

Table 2
www.frontiersin.org

Table 2. Multivariate logistic regression analysis of risk factors based on NHANES cohort.

3.3 Development and validation of ML models

We trained 5 machine learning models on the NHANES dataset and internally validated their predictive performance (Table 3, Figure 2a). The results indicated that, RF had the highest AUC (0.81, 95% CI: 0.76-0.87), followed by XGBoost (0.79, 95% CI: 0.73-0.85), LR (0.76, 95% CI: 0.70-0.82), KNN (0.74, 95% CI: 0.67-0.81), and SVM (0.74, 95% CI: 0.68-0.8). Supplementary Figure S1 shows the top ten features for the RF model on the NHANES dataset, including BMI, lymphocyte count, numbness in extremities, and diabetic retinopathy, among others. The complete search spaces and the best hyperparameters for all models are summarized in Supplementary Table S3. The model correctly identified 27 DFU cases (true positives) and 377 non-DFU cases (true negatives). These findings suggest that the RF model exhibits strong predictive capacity in detecting DFU risk (Supplementary Figure S2a). Decision curve analysis resulted in positive net benefit for the RF model for 5–20% thresholds, signifying its use utility (Figure 3a). The calibration curve for RF model had acceptable agreement along with significant calibration slope and intercept, denoting Brier score 0.06, calibration slope 1.44, and calibration intercept -0.82. (Figure 4a).

Table 3
www.frontiersin.org

Table 3. Performance comparison of DFU prediction models using NHANES and external validation cohorts.

Figure 2
ROC curves comparing five machine learning models: Random Forest, Logistic Regression, KNN, SVM, and XGBoost. The left graph shows results for NHANES data with AUC values ranging from 0.739 to 0.813. The right graph shows external validation, with AUC values from 0.689 to 0.792. Random Forest consistently performs best in both sets.

Figure 2. Prediction performance by AUCs for the five machine-learning models: (a) NHANES cohort; (b) external validation cohort.

Figure 3
Two line graphs display decision curve analysis for different models: Panel A (NHANES) and Panel B (External). Each graph plots net benefit against threshold probability for Random Forest, Logistic, KNN, SVM, and XGBoost models, indicated by different colored lines. Dotted lines represent 'Treat-all' and 'Treat-none' strategies. Both panels show variations in model performance, with Random Forest and XGBoost generally providing higher net benefits across threshold probabilities.

Figure 3. Decision curve analysis for clinical models in two cohorts. (a) NHANES cohort; (b) External validation cohorts; Net benefit is plotted against threshold probability; dashed lines denote “treat-all” and “treat-none”.

Figure 4
Two calibration plots compare predicted probabilities against observed frequencies for different models. The left plot shows NHANES data, and the right plot shows external data. Models include Random Forest, Logistic, KNN, SVM, and XGBoost. A dashed line indicates perfect calibration. A legend with Brier score, slope, and intercept values is provided for each model.

Figure 4. Calibration curves for clinical models in two cohorts. (a) NHANES cohort; (b) External validation cohorts. Observed outcome frequency versus predicted probability; the 45° line indicates perfect calibration.

3.4 External validation of ML models

In the external validation set, RF yielded the highest AUC (0.79, 95% CI: 0.76-0.83), followed by XGBoost (0.76, 95% CI: 0.73-0.80), SVM (0.72, 95% CI: 0.68–0.76), KNN (0.70, 95% CI: 0.65-0.74), and LR (0.69, 95% CI: 0.65-0.73). Although KNN and SVM showed high sensitivity (0.844 and 0.845, respectively), RF exhibited the best overall discriminative performance in the external validation cohort according to AUC(Table 3, Figure 2b). Confusion matrix analysis indicated that the RF model accurately identified 104 DFU cases (true positives) and 484 non-DFU cases (true negatives). These findings indicate that the RF model exhibits strong specificity and overall classification performance in identifying individuals at risk for DFU (Supplementary Figure S2b). The RF model likewise maintained positive net benefit over 5–20% thresholds, supporting clinical utility (Figure 3b). The calibration curve of the RF model showed agreement with some degree of deviation from ideal calibration: Brier score 0.13, calibration slope 5.01, calibration intercept −4.90 (Figure 4b).

3.5 SHAP-based model interpretability analysis

The SHAP analysis was carried out for assessing the contribution and significance of each feature to the RF model. The top 10 features contributing to the RF model were, in descending order, urine albumin, BMI, numbness in extremities, potassium, serum albumin, creatinine, retinopathy, HbA1c, segmented neutrophils, and fasting blood glucose (Figure 5a). The top features tend to include positively valued SHAP for urine albumin, BMI, numbness, and glycemic parameters (HbA1c and fasting glucose) indicative of higher risk for DFU, and negatively valued SHAP for serum albumin, showing a protective association. The features for potassium, creatinine, and segmented neutrophils tend to exhibit mixed direction association (Figure 5b).

Figure 5
Bar chart and violin plot comparing factors affecting a model's output using SHAP values. Panel (a) displays average SHAP values indicating feature importance, with urine albumin and BMI as top features. Panel (b) shows feature value distributions, highlighting the impact direction and value range for each factor, colored from low to high values.

Figure 5. SHAP diagram for predicting DFU using the random forest model. (a) SHAP bar plot of feature importance, Mean absolute SHAP values rank features by global importance; (b) SHAP summary plot of feature importance, Each point shows a patient-level SHAP value (red = higher feature value, blue = lower).

4 Discussion

Our results showed that DFU has partially independent associations with several important factors, namely the numbness of limbs, direct HDL-cholesterol, lymphocyte, white blood cell, segmented neutrophils, and BMI. The risk of DFU increases with numbness in limbs, higher levels of direct HDL-cholesterol, white blood cell, and BMI, and is strongly positively related to the white blood cell count, while it is inversely related to the lymphocyte and segmented neutrophils. The RF model has the highest performance for both internal and external validation, and the decision curve analysis clearly identified net benefit for both 5% - 20% thresholds. The calibration curve of the RF model showed acceptable overall agreement but systematic calibration bias, especially in the external validation, where the high calibration slope (5.01) suggests model overconfidence in low-risk predictions. SHAP values for analysis identified that the important factors for predicting DFU include variables like urine albumin, BMI, numbness in limbs, potassium, and albumin.

To date, several studies have attempted to build risk-prediction models for DFU. Researchers constructed a conventional logistic regression model based on five predictors, such as BMI and foot signs, which showed good performance with an AUC of 0.787 in internal validation (22). Chen et al. employed LASSO and logistic regression to screen out significant risk factors and constructed a nomogram prediction model for DFU in older patients with type 2 diabetes, with an AUC of 0.851 in internal validation (23). Another study, based on a large patient cohort with type 2 diabetes, developed and validated a diabetic foot risk score prediction model by extracting risk ratios through systematic review and meta-analysis, and adopting a weighter scoring approach to nine clinical factors, showing good discriminative ability (24). In addition to conventional model-based research, recent studies have increasingly resorted to machine learning techniques. Two studies based on data from type 2 diabetes patients in Malaysia adopted several ML methods to construct prediction models for DFU, and both showed that ML models consistently outperformed conventional methods (25; 26). Our results confirm previous work, offering additional validation across regions (NHANES, a cohort study in China, demonstrating greater generalizability for RF). Our study presents operating points for deployment (thresholds from confusion matrices) for better interpretation using SHAP, pointing towards factors for improvement (Supplementary Table S4).

Our multivariate logistic regression found six independent predictors of DFU: numbness in extremities, direct HDL cholesterol, lymphocyte, white blood cell, segmented neutrophils, and BMI. Numbness in the extremities, being a characteristic sign of diabetic peripheral neuropathy, is positively correlated with DFU most strongly, indicating multisystem nerve damage that compromises protective sensation and skin integrity, thus predisposing to a high risk of foot ulceration and poor healing (27). Direct HDL cholesterol can also undergo glycation and oxidation, forming “dysfunctional HDL, “ disrupting the function of reverse cholesterol efflux and endothelial and anti-inflammatory properties, leading to compromised microvascular perfusion, thereby contributing to the development and progression of DFU (28). Anemia can indicate lymphopenia, thereby disrupting the body’s ability to properly modulate the immune system, leading to inflammation, impaired cell proliferation and differentiation, and then impaired remodeling, contributing to the delayed healing process, thereby increasing the risk for DFU (29). An elevated white blood cell level can indicate chronic inflammation or occult infection, thereby increasing the risk for microvascular endothelial and cell damage, leading to inflammation, oxidative damage, and thereby increasing the risk for DFU (30). An abnormal number of segmented neutrophils can indicate a predisposition to altered cellular functions, specifically impaired chemotaxis and impaired or diminished phagocytosis, excessive production of NET, thereby predisposing the body to inflammation, leading to delayed closure of the wound, thereby contributing to the risk for DFU (31). Moreover, higher BMI triggers insulin resistance and inflammation, causing microvascular/neuropathic damage, higher plantar pressure, and impaired healing—thereby increasing DFU risk. (24).

It is interesting to note that the ML model showed similar predictive performance in both internal validation in U.S. NHANES data and external validation in a Chinese clinical cohort, with strong cross-population generalizability. This may be due to the fact that the model was based on several key predictive factors-e.r., urine albumin, peripheral neuropathy, abnormal blood glucose, and BMI — that have high generalizability and clinical interpretability across ethnicities, geographical locations, and healthcare system (3234). These predictors are not only closely related to the fundamental pathophysiological processes of diabetic foot ulceration- e.g., nerve dysfunction, microvascular disease, immunosuppression, and impaired tissue repair—but are relatively easy to access and standardize, thereby avoiding the confounding effects of data quality and measurement variability (35). However, in light of model’s systematic miscalibration, we recommend localized model recalibration strategies aligned with local epidemiology, along with deployment strategies for it as an EHR-based clinical decision support system utilizing FHIR/CDS hooks for automated data entry. This will display ‘risk tier + key drivers + recommended actions’ output for clinicians, along with a ‘patient calculator, ‘ version control system, bias detection, and periodic recalibration system accordingly. Additionally, threshold-based risk stratification could guide clinical decisions: high-risk patients (threshold >20%) should receive frequent foot exams and specialist referrals, while moderate-risk patients (threshold 10%-20%) should be monitored with lifestyle interventions.

Nonetheless, this study has several limitations that should be noted. First, the NHANES data used to develop the models is cross-sectional in nature, limiting the extent to which causal or temporal relationships can be inferred. Second, the external validation may have introduced selection bias due to its study design and the significant recruitment time gap with the NHANES cohort, potentially affecting the comparability and generalizability of the results. Third, because the DFU outcome in NHANES was self-reported without adjudication, non-differential misclassification is possible and would be expected to primarily affect calibration. Fourth, sparse data in the low-probability range may introduce systematic miscalibration, reducing the reliability of low-risk decision thresholds. Finally, we acknowledge potential measurement bias in biochemical markers across cohorts, and the absence of imaging and plantar-pressure data further limits the completeness of DFU risk modeling.

5 Conclusion

In this study, machine-learning models performed strongly for DFU risk prediction, with the RF model showing the best overall performance across cohorts. Future prospective validation should adopt a pragmatic, real-time EHR pilot with continuous monitoring (discrimination, calibration, net benefit, equity) and periodic recalibration.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Ethics statement

The studies involving humans were approved by the Biomedical Research Ethics Review Committee of the Affiliated Hospital of Zunyi Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

YZ: Writing – original draft, Formal Analysis, Validation, Data curation. YT: Data curation, Project administration, Writing – original draft. ZW: Data curation, Writing – review & editing, Investigation. YJ: Writing – review & editing, Data curation. SC: Funding acquisition, Visualization, Writing – review & editing, Software. CD: Supervision, Funding acquisition, Conceptualization, Writing – review & editing. GZ: Conceptualization, Methodology, Writing – review & editing, Funding acquisition, Resources.

Funding

The author(s) declared that financial support was received for this work and/or its publication. 1.Key Project of Guizhou Provincial Science and Technology Plan of China (No. ZK2021-011); 2. Collaborative Innovation Center of Chinese Ministry of Education (No. 2020-39); 3. Scientific Research and Talent Training Funds of Kweichow Moutai Hospital (MTYK, No.2022-13).

Acknowledgments

We would like to thank participants from the National Health and Nutrition Examination Survey (NHANES) for their contribution to the study.

Conflict of interest

The authors declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fendo.2025.1692917/full#supplementary-material

References

1. Wu B, Niu Z, and Hu F. Study on risk factors of peripheral neuropathy in type 2 diabetes mellitus and establishment of prediction model. Diabetes Metab J. (2021) 45:526–38. doi: 10.4093/dmj.2020.0100

PubMed Abstract | Crossref Full Text | Google Scholar

2. Kolarić V, Svirčević V, Bijuk R, and Zupančič V. CHRONIC COMPLICATIONS OF DIABETES AND QUALITY OF LIFE. Acta clinica Croatica. (2022) 61:520–7. doi: 10.20471/acc.2022.61.03.18

PubMed Abstract | Crossref Full Text | Google Scholar

3. Yu MG, Gordin D, Fu J, Park K, Li Q, and King GL. Protective factors and the pathogenesis of complications in diabetes. Endocrine Rev. (2024) 45:227–52. doi: 10.1210/endrev/bnad030

PubMed Abstract | Crossref Full Text | Google Scholar

4. Huang F, Lu X, Yang Y, Yang Y, Li Y, Kuai L, et al. Microenvironment-based diabetic foot ulcer nanomedicine. Advanced Sci (Weinheim Baden-Wurttemberg Germany). (2023) 10:e2203308. doi: 10.1002/advs.202203308

PubMed Abstract | Crossref Full Text | Google Scholar

5. McDermott K, Fang M, Boulton AJM, Selvin E, and Hicks CW. Etiology, epidemiology, and disparities in the burden of diabetic foot ulcers. Diabetes Care. (2023) 46:209–21. doi: 10.2337/dci22-0043

PubMed Abstract | Crossref Full Text | Google Scholar

6. Gong H, Ren Y, Li Z, Zha P, Bista R, Li Y, et al. Clinical characteristics and risk factors of lower extremity amputation in the diabetic inpatients with foot ulcers. Front Endocrinol. (2023) 14:1144806. doi: 10.3389/fendo.2023.1144806

PubMed Abstract | Crossref Full Text | Google Scholar

7. Shi H, Yuan X, Yang X, Huang R, Fan W, and Liu G. A novel diabetic foot ulcer diagnostic model: identification and analysis of genes related to glutamine metabolism and immune infiltration. BMC Genomics. (2024) 25:125. doi: 10.1186/s12864-024-10038-2

PubMed Abstract | Crossref Full Text | Google Scholar

8. Wang X, Yuan CX, Xu B, and Yu Z. Diabetic foot ulcers: Classification, risk factors and management. World J Diabetes. (2022) 13:1049–65. doi: 10.4239/wjd.v13.i12.1049

PubMed Abstract | Crossref Full Text | Google Scholar

9. Sharma H, Sharma S, Krishnan A, Yuan D, Vangaveti VN, Malabu UH, et al. The efficacy of inflammatory markers in diagnosing infected diabetic foot ulcers and diabetic foot osteomyelitis: Systematic review and meta-analysis. PloS One. (2022) 17:e0267412. doi: 10.1371/journal.pone.0267412

PubMed Abstract | Crossref Full Text | Google Scholar

10. Fan Z, Liu Y, Xie H, Yang Q, Zhang G, Zhang P, et al. Analysis of risk factors for foot ulcers in diabetes patients with neurovascular complications. BMC Public Health. (2025) 25:792. doi: 10.1186/s12889-025-21639-1

PubMed Abstract | Crossref Full Text | Google Scholar

11. Eckert AJ, Zimny S, Altmeier M, Dugic A, Gillessen A, Bozkurt L, et al. Factors associated with diabetic foot ulcers and lower limb amputations in type 1 and type 2 diabetes supported by real-world data from the German/Austrian DPV registry. J Diabetes. (2024) 16:e13531. doi: 10.1111/1753-0407.13531

PubMed Abstract | Crossref Full Text | Google Scholar

12. Ardelean A, Balta DF, Neamtu C, Neamtu AA, Rosu M, Pilat L, et al. Pentraxin-3 and other inflammatory markers for an infected diabetic foot ulcer diagnosis: A prospective study. Diagnostics (Basel Switzerland). (2023) 13:2366. doi: 10.3390/diagnostics13142366

PubMed Abstract | Crossref Full Text | Google Scholar

13. Su G, Yuan X, Yuan G, Sun Y, Zhang D, Liu W, et al. Exploration of predictive risk factors for diabetic foot in patients with diabetes in Beijing: analysis of 5-year follow-up data of patients with diabetes mellitus in a single center in Beijing. Front Endocrinol. (2024) 15:1441997. doi: 10.3389/fendo.2024.1441997

PubMed Abstract | Crossref Full Text | Google Scholar

14. Liu J, Liu C, Liu Z, Zhou Y, Li X, and Yang Y. Spatial analysis of air pollutant exposure and its association with metabolic diseases using machine learning. BMC Public Health. (2025) 25:831. doi: 10.1186/s12889-025-22077-9

PubMed Abstract | Crossref Full Text | Google Scholar

15. Guan H, Wang Y, Niu P, Zhang Y, Zhang Y, Miao R, et al. The role of machine learning in advancing diabetic foot: a review. Front Endocrinol. (2024) 15:1325434. doi: 10.3389/fendo.2024.1325434

PubMed Abstract | Crossref Full Text | Google Scholar

16. Khandakar A, Chowdhury MEH, Ibne Reaz MB, Md Ali SH, Hasan MA, Kiranyaz S, et al. A machine learning model for early detection of diabetic foot using thermogram images. Comput Biol Med. (2021) 137:104838. doi: 10.1016/j.compbiomed.2021.104838

PubMed Abstract | Crossref Full Text | Google Scholar

17. Liu L, Bi B, Gui M, Zhang L, Ju F, Wang X, et al. . Development and internal validation of an interpretable risk prediction model for diabetic peripheral neuropathy in type 2 diabetes: a single-centre retrospective cohort study in China. BMJ Open. (2025) 15:e092463. doi: 10.1136/bmjopen-2024-092463

PubMed Abstract | Crossref Full Text | Google Scholar

18. Collins GS, Moons KGM, Dhiman P, Riley RD, Beam AL, Van Calster B, et al. TRIPOD+AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods. BMJ (Clinical Res ed.). (2024) 385:e078378. doi: 10.1136/bmj-2023-078378

PubMed Abstract | Crossref Full Text | Google Scholar

19. Gu YQ, Ran ZW, Guo LR, Guo JM, Ju S, Cao YM, et al. Chinese guide for diagnosis and treatment of diabetes foot. Chin J Clin Physicians. (2024) 52:1287–96. doi: 10.3969/j.issn.2095-8552.2024.11.007

Crossref Full Text | Google Scholar

20. Li Z, Jian Y, and Wei Z. Association between monocyte to lymphocyte ratio and diabetic foot ulcer in the population of the US with diabetes based on the 1999–2004 National Health and Nutrition Examination Survey data: a retrospective cross-sectional study. Front Endocrinol. (2024) 15:1361393. doi: 10.3389/fendo.2024.1361393

PubMed Abstract | Crossref Full Text | Google Scholar

21. Xing JH, Yu Y, Teng L, Hui XY, and Guo WG. Association of blood heavy metals with diabetic foot ulcers in U.S. Adults with diabetes: insights from the 1999–2004 NHANES data. Diabetes therapy: research Treat Educ Diabetes related Disord. (2025) 16:1255–66. doi: 10.1007/s13300-025-01730-x

PubMed Abstract | Crossref Full Text | Google Scholar

22. Lv J, Li R, Yuan L, Huang FM, Wang Y, He T, et al. Development and validation of a risk prediction model for foot ulcers in diabetic patients. J Diabetes Res. (2023) 2023:1199885. doi: 10.1155/2023/1199885

PubMed Abstract | Crossref Full Text | Google Scholar

23. Chen D, Wang M, Shang X, Liu X, Liu X, Ge T, et al. Development and validation of an incidence risk prediction model for early foot ulcer in diabetes based on a high evidence systematic review and meta-analysis. Diabetes Res Clin Pract. (2021) 180:109040. doi: 10.1016/j.diabres.2021.109040

PubMed Abstract | Crossref Full Text | Google Scholar

24. Shao Z, Wang Z, Bi S, and Zhang J. Establishment and validation of a nomogram for progression to diabetic foot ulcers in elderly diabetic patients. Front Endocrinol. (2023) 14:1107830. doi: 10.3389/fendo.2023.1107830

PubMed Abstract | Crossref Full Text | Google Scholar

25. Wang X, Shengmei Z, BingQian W, Wen L, Shuyan G, Hanbei C, et al. Enhancing diabetic foot ulcer prediction with machine learning: A focus on Localized examinations. Heliyon. (2024) 10:e37635. doi: 10.1016/j.heliyon.2024.e37635

PubMed Abstract | Crossref Full Text | Google Scholar

26. Wang Y, Jiang C, Xing YQ, Zou L, Song M, Qu X, et al. Establishing a clinical prediction model for diabetic foot ulcers in type 2 diabetic patients with lower extremity arteriosclerotic occlusion using machine learning. Sci Rep. (2025) 15:11737. doi: 10.1038/s41598-025-96502-9

PubMed Abstract | Crossref Full Text | Google Scholar

27. Hicks CW and Selvin E. Epidemiology of peripheral neuropathy and lower extremity disease in diabetes. Curr Diabetes Rep. (2019) 19:86. doi: 10.1007/s11892-019-1212-8

PubMed Abstract | Crossref Full Text | Google Scholar

28. Bonilha I, Zimetti F, Zanotti I, Papotti B, and Sposito AC. Dysfunctional high-density lipoproteins in type 2 diabetes mellitus: molecular mechanisms and therapeutic implications. J Clin Med. (2021) 10:2233. doi: 10.3390/jcm10112233

PubMed Abstract | Crossref Full Text | Google Scholar

29. Serban D, Papanas N, Dascalu AM, Kempler P, Raz I, Rizvi AA, et al. Significance of neutrophil to lymphocyte ratio (NLR) and platelet lymphocyte ratio (PLR) in diabetic foot ulcer and potential new therapeutic targets. Int J lower extremity wounds. (2024) 23:205–16. doi: 10.1177/15347346211057742

PubMed Abstract | Crossref Full Text | Google Scholar

30. Chen H, Mei S, Zhou Y, and Dai J. Systematic review and meta-analysis of the diagnostic accuracy of inflammatory markers for infected diabetic foot ulcer. J Tissue viability. (2024) 33:598–607. doi: 10.1016/j.jtv.2024.09.007

PubMed Abstract | Crossref Full Text | Google Scholar

31. Yang S, Gu Z, Lu C, Zhang T, Guo X, Xue G, et al. Neutrophil extracellular traps are markers of wound healing impairment in patients with diabetic foot ulcers treated in a multidisciplinary setting. Adv Wound Care. (2020) 9:16–27. doi: 10.1089/wound.2019.0943

PubMed Abstract | Crossref Full Text | Google Scholar

32. Qi H, Zhang T, Hou L, Li Q, Huang R, and Ma L. Research progress on risk prediction models for the diabetic foot. Acta diabetologica. (2025). doi: 10.1007/s00592-025-02505-3

PubMed Abstract | Crossref Full Text | Google Scholar

33. Wang S, Xia C, Zheng Q, Wang A, and Tan Q. Machine learning models for predicting the risk of hard-to-heal diabetic foot ulcers in a chinese population. Diabetes Metab syndrome obesity: Targets Ther. (2022) 15:3347–59. doi: 10.2147/DMSO.S383960

PubMed Abstract | Crossref Full Text | Google Scholar

34. Wang M, Chen D, Fu H, Xu H, Lin S, Ge T, et al. Development and validation of a risk prediction model for the recurrence of foot ulcer in type 2 diabetes in China: A longitudinal cohort study based on a systematic review and meta-analysis. Diabetes/metabolism Res Rev. (2023) 39:e3616. doi: 10.1002/dmrr.3616

PubMed Abstract | Crossref Full Text | Google Scholar

35. Feng X, Zhao R, Yang T, Wang N, and Wang G. Development and validation of a nomogram for predicting diabetic foot ulcer risk in patients with type 2 diabetes mellitus. Front Endocrinol. (2025) 16:1555163. doi: 10.3389/fendo.2025.1555163

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: machine learning, diabetic foot ulcer, NHANES, external validation, diagnosis

Citation: Zhang Y, Tian Y, Jian Y, Wei Z, Cai S, Zhang G and Deng C (2025) Development and external validation of machine-learning based models to predict diabetic foot ulcer in diabetes population. Front. Endocrinol. 16:1692917. doi: 10.3389/fendo.2025.1692917

Received: 28 August 2025; Accepted: 26 November 2025; Revised: 24 November 2025;
Published: 15 December 2025.

Edited by:

Arvind Mukundan, National Chung Cheng University, Taiwan

Reviewed by:

Sunil Upadhyay, Institute of Technology and Science Mohan Nagar Ghaziabad, India
Claudia C. Colmenares-Mejia, Fundación Cardiovascular de Colombia, Colombia

Copyright © 2025 Zhang, Tian, Jian, Wei, Cai, Zhang and Deng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Chengliang Deng, Y2hlbGlhZGVuZ0BzaW5hLmNvbQ==; Gaofengi Zhang, emhhbmdnYW9mZW5nNzY3N0AxNjMuY29t

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.