- 1Department of Nursing, Suzhou BenQ Medical Center, Suzhou, China
- 2Department of Preventive Medicine, Medical College, Tarim University, Alar, China
- 3College of Medical Engineering and Technology, Xinjiang Medical University, Urumqi, China
- 4Institute of Medical Engineering Interdisciplinary Research, Xinjiang Medical University, Urumqi, China
Background: Diabetic peripheral neuropathy (DPN) is a prevalent and highly disabling complication of diabetes mellitus, associated with markedly increased rates of disability and mortality. Timely intervention and effective management have been consistently shown to substantially reduce the risk of DPN onset and progression.
Methods: This retrospective cohort study analyzed 1, 004 hospitalized patients with type 2 diabetes mellitus (T2DM) admitted to the endocrinology department of a hospital in Jiangsu Province, China. A risk prediction model for DPN was developed using the Random Forest (RF) algorithm, while logistic regression analysis was employed to identify the major risk factors. The overarching aim was to provide a reliable risk assessment tool for clinical application.
Findings: Five principal factors were identified as significantly associated with DPN risk: age (OR = 1.257, 95% CI [1.188–1.367], p < 0.001), serum 25(OH)D3 levels (OR = 0.791, 95% CI [0.759–0.854], p < 0.001), duration of diabetes (OR = 1.431, 95% CI [1.285–1.617], p < 0.001), glycated hemoglobin (HbA1c) (OR = 1.236, 95% CI [1.197–1.391], p < 0.001), and glycated serum protein (GSP) (OR = 1.091, 95% CI [1.047–1.201], p = 0.017). A DPN risk prediction model incorporating these variables achieved an area under the receiver operating characteristic curve (AUC) of 0.829 (95% CI: 0.802–0.857), demonstrating excellent discriminatory performance.
Interpretation: The Random Forest–based DPN risk prediction model successfully identified five critical risk factors, offering a solid theoretical foundation for personalized strategies in DPN prevention and management among patients with diabetes. This model exhibits high predictive utility in clinical practice.
Introduction
Diabetes has become one of the most serious and common chronic diseases in today’s society. It not only leads to life-threatening complications with a high rate of disability but also incurs significant treatment and management costs, significantly shortening patients’ life expectancy (1). According to statistics from the International Diabetes Federation (IDF), the global prevalence of diabetes has reached epidemic levels. In 2021, approximately 537 million adults (aged 20-79) were living with diabetes, accounting for about 10% of the global population. It is estimated that by 2030, the number of affected individuals will increase to 643 million, and by 2045, this number will further rise to 783 million. During this period, despite an expected 20% growth in the global population, the number of people with diabetes will increase by 46% (2, 3), creating a significant disease burden on society.
Diabetic Peripheral Neuropathy (DPN) is a common and severely detrimental complication of diabetes, with an incidence rate as high as 50% (4). In its early stages, DPN often presents with no or mild non-specific symptoms, which are usually only detectable through specific tests (5, 6). If not identified and managed early, severe complications such as foot ulcers and gangrene may develop, ultimately leading to an increased risk of amputation (7). Therefore, early diagnosis and intensive intervention are crucial for improving subclinical DPN and reducing the risk of its progression to clinical DPN. However, early diagnosis remains a major challenge in clinical practice.
Currently, most studies focused on the assessment and prediction of DPN primarily concentrate on symptoms and signs that have already manifested in patients, such as limb numbness, pain, and foot ulcers. However, recent research indicates that DPN may also occur in patients with prediabetes. In these individuals, up to 50% exhibit no obvious symptoms, which leads to a failure to identify the condition in a timely manner (8, 9). In recent years, machine learning methods have demonstrated significant value in the early identification and risk prediction of DPN (10, 11). These approaches can extract underlying patterns from multidimensional clinical data, thereby enhancing the accuracy and efficiency of predictions. Among various machine learning algorithms, the Random Forest (RF) model stands out as an effective method for handling nonlinear relationships and missing data. It assigns importance scores to each feature variable, enabling the identification of variables that exert a critical influence on classification outcomes (12, 13). Moreover, the RF method does not require the consideration of multicollinearity or complex variable selection processes.
This study aims to apply the RF method in conjunction with logistic regression to identify the primary risk factors for DPN in diabetic patients and to develop a practical DPN risk prediction model. The goal is to provide clinical practitioners with a powerful decision-support tool.
Methods
Study design and population
This study is a retrospective observational study that collects data from the Electronic Medical Records (EMR) and digital medical record systems. A total of 1, 004 patients with Type 2 Diabetes Mellitus were included in the study, all of whom were hospitalized in the endocrinology department of a hospital in Jiangsu Province from July 2021 to December 2023. The study protocol was approved by the Ethics Committee of Suzhou BenQ Hospital (SZMJYY2022102001) and informed consent was waived. The research was carried out in accordance with relevant guidelines and regulations.
Inclusion criteria were as follows: (a). patients between 20–79 years old, (b). the diagnostic criteria for T2DM was based on the Guideline for the prevention and treatment of type 2 diabetes mellitus in China (2020 edition) (14), or (c). all research participants were able to communicate independently. Exclusion criteria were as follows: (a). incomplete clinical data, and (b). informed consent was unsigned.
Data completeness was high; variables with missing values (less than 5% for any single variable) were imputed using the median for continuous variables and the mode for categorical variables to ensure robustness of the model.
Data collection
All baseline clinical characteristics, including age, leukocyte, neutrophil, eosinophil, lymphocyte, hemoglobin, platelet, total cholesterol, high-density lipoprotein, low-density lipoprotein, direct bilirubin, unconjugated bilirubin, total bilirubin, aspartate aminotransferase, alanine aminotransferase, body mass index, systolic blood pressure, diastolic blood pressure, serum creatinine, hemoglobin A1c, glycosylated serum protein, apolipoprotein A1, apolipoprotein B, fasting blood glucose, triglyceride, blood urea nitrogen, cystatin C, 1-hour postprandial blood glucose, 2-hour postprandial blood glucose, 25-hydroxyvitamin D3, blood glucose, homocysteine, alcohol, gender, nation, smoking, diabetic peripheral neuropathy [The diagnostic criteria for diabetic peripheral neuropathy was based on the Guideline for the prevention and treatment of type 2 diabetes mellitus in China (2020 edition)].
Diagnostic criteria included: (a) abnormal temperature perception (tested with a Tip-therm), (b) abnormal vibration perception (tested with a 128-Hz tuning fork), (c) decreased or absent sensation on nylon filament examination (10-g monofilament), (d) absent ankle reflex, and (e) abnormal nerve conduction velocity (NCV). NCS was performed by trained technicians using standard electromyography equipment. Abnormal NCV was defined as values below the lower limit of the laboratory’s reference range (adjusted for age and height). A diagnosis of DPN required the presence of at least two abnormal findings from criteria (a)-(d) OR one abnormal clinical sign plus abnormal NCV (criterion e). All assessments were performed during hospitalization, and data were extracted from structured electronic medical records.
Statistical analysis
For continuous variables that follow a normal distribution with homogeneity of variance, the results are presented as mean ± standard deviation (Mean ± SD), and comparisons are made using independent sample t-tests. If the data do not follow a normal distribution, the results are described using the median and interquartile range (P25, P75), and comparisons are made using the rank-sum test. For categorical variables, the results are presented as frequency percentages (%) and comparisons are made using chi-square tests or Fisher’s exact test. The Random Forest (RF) method combined with logistic regression analysis is used to screen for significant predictive variables. All statistical tests are two-tailed, with a significance level set at 0.05. All statistical analyses are performed using R software (version 4.2.1; accessed on October 31, 2022, at https://www.r-project.org).
Random Forest: Random Forest (RF) is an ensemble learning method based on decision tree classifiers, widely used in the field of bioinformatics. The basic principle is as follows: if the original training set contains N samples with M features, RF selects N samples from the original training set using Bootstrap resampling and randomly selects M features to train a fully grown decision tree. This process is repeated multiple times to generate a set of decision trees. Ultimately, RF aggregates the outputs of these decision trees into an ensemble model and generates the final prediction through a voting mechanism. Therefore, the number of decision trees and the randomly selected features are crucial for building an accurate RF model. RF model was implemented using the `randomForest` package in R. Key hyperparameters were optimized via out-of-bag (OOB) error estimation and 10-fold cross-validation. The final model used 500 trees (`ntree=500`), and the number of variables randomly sampled as candidates at each split (`mtry`) was set to 5 (approximately the square root of the total number of predictors). The Gini impurity index was used for node splitting. Model performance was evaluated using the OOB error estimate and the area under the ROC curve (AUC) derived from the cross-validation procedure.
Results
Patient characteristics
A total of 1, 004 Type 2 Diabetes Mellitus (T2DM) patients were included in this study. Table 1 presents the detailed baseline demographic and clinical characteristics of the two groups. Among them, 515 patients were diagnosed with Diabetic Peripheral Neuropathy (DPN), with an incidence rate of 51.29%. Of the DPN patients, 305 were male (59.22%) and 210 were female (40.78%) (Table 1).
Characteristics’ selection
During the construction of the RF model, we optimized key parameters (setting the number of random features to 5 and the number of decision trees to 500) and trained the model through repeated resampling of all samples, thereby generating the corresponding RF model. Subsequently, the importance scores of the model variables were ranked. The higher the score, the greater the impact of the corresponding variable on model classification. Through this process, we identified five significant variables (Figure 1).
These potential predictive factors included age, 25(OH)D3, duration of diabetes, HbA1c, and GSP. We then performed multivariate logistic regression analysis on these five candidate variables. Ultimately, age (OR = 1.257, 95% CI [1.188, 1.367], p < 0.001); 25(OH)D3 (OR = 0.791, 95% CI [0.759, 0.854], p < 0.001); duration of diabetes (OR = 1.431, 95% CI [1.285, 1.617], p < 0.001); HbA1c (OR = 1.236, 95% CI [1.197, 1.391]); and GSP (OR = 1.091, 95% CI [1.047, 1.201], p = 0.017) were all statistically significant and were thus selected for the development of the predictive model (Table 2). There was no multiple collinearity among the risk factors included in the model, and the maximum variance expansion factor (VIF) was 1.766 while the lowest was 1.099. Therefore, we construct the clinical risk prediction model with these 5 statistically significant variables.
In this study, the five selected variables demonstrated significant predictive accuracy, with an AUC value of 0.829 (95% CI: 0.802-0.857), sensitivity of 0.734, and specificity of 0.814 (as shown in Figure 2).
Discussion
In this study, the prevalence of Diabetic Peripheral Neuropathy (DPN) among all Type 2 Diabetes Mellitus (T2DM) patients was approximately 51.3% (515/1004). Analysis using the Random Forest (RF) model revealed that disease duration, 25(OH)D3 levels, age, HbA1c, and Glycated Serum Protein (GSP) were significant risk factors for the occurrence of DPN. The results of logistic regression analysis also indicated that these five factors were significant risk factors for DPN. Based on the identification of DPN risk factors by the RF model, its prediction accuracy of 82.9% suggests that this model holds high clinical value.
In most epidemiological studies on Diabetic Peripheral Neuropathy (DPN), the duration of diabetes and patient age, due to their immutable nature, have become key factors in the occurrence of DPN (15–17). As the duration of diabetes extends, patients’ age tends to increase as well, which further raises the likelihood of developing DPN. Notably, a specific cross-sectional study highlighted a significant positive correlation between the duration of diabetes and the prevalence of DPN, indicating that the onset of DPN is closely related to the duration of diabetes, typically occurring between 8 and 16 years of disease progression (16). Furthermore, a retrospective study also found that the duration of diabetes and patient age were the two most critical risk factors for DPN, particularly the duration of diabetes (17). Longer diabetes duration is often closely associated with chronic hyperglycemia, which activates various biochemical pathways, inducing oxidative stress, leading to diabetic neuronal damage and ischemia (18). One possible explanation is that patients may have already developed diabetes and DPN but, for various reasons, were not diagnosed in time. As the disease progresses, prolonged poor blood glucose control accelerates the onset of DPN (19). This finding helps explain why the duration of diabetes and age were selected as key predictive variables in the final model of this study.
In recent years, the relationship between vitamin D (primarily present in the body as 25(OH)D3) and microvascular complications of type 2 diabetes has garnered widespread attention from scholars both domestically and internationally. Among these complications, diabetic peripheral neuropathy (DPN) has emerged as a major area of research. Existing studies suggest a growing clarity in the correlation between vitamin D and DPN. One six-month prospective randomized controlled trial (20) included 150 patients with type 2 diabetes, with 50 patients receiving oral hypoglycemic treatment alone, 50 receiving empagliflozin treatment, and 50 receiving a combination of empagliflozin and vitamin D. The results demonstrated that the addition of vitamin D to empagliflozin treatment significantly improved the symptoms of diabetic neuropathy. Another prospective placebo-controlled trial (21) conducted by D. Shehab included a treatment group (n = 57) and a placebo group (n = 55), showing that short-term oral vitamin D3 supplementation effectively improved vitamin D levels and neuropathy symptoms in patients with type 2 diabetes. Additionally, several studies (22–24) consistently indicate a close relationship between DPN and vitamin D deficiency. In this study, serum 25(OH)D levels were significantly reduced in patients with type 2 diabetes and concurrent DPN, aligning with previous findings that vitamin D deficiency is associated with the onset of DPN.
Vitamin D, as a crucial neurotrophic factor, plays a vital role in the health of the nervous system. One of its actions is to upregulate the expression of the vitamin D receptor (VDR), thereby promoting the production of nerve growth factor (NGF). NGF is an essential protein for the development, maintenance, and repair of neurons in the peripheral nervous system (25, 26). Further research has shown that vitamin D deficiency may exacerbate diabetic peripheral neuropathy-related pain perception by increasing the number of axons containing calcitonin gene-related peptide (CGRP), which plays a critical role in pain conduction and regulation. In diabetic patients, this mechanism is closely linked to the manifestation of neuropathic pain. Thus, vitamin D deficiency may aggravate pain induced by nerve damage through the aforementioned pathways (26). Vitamin D, as a neurotrophic factor, promotes nerve health by upregulating neurotrophic factors such as nerve growth factor (NGF). Furthermore, vitamin D influences insulin sensitivity and secretion, and its deficiency is associated with insulin resistance, which may exacerbate metabolic dysfunction in diabetes. While vitamin D supplementation has shown promise in improving neuropathic symptoms in some studies, the causal relationship and optimal dosing require further investigation (27, 28). It is also noteworthy that there is a significant association between vitamin D deficiency and insulin resistance. Research (29) indicates that patients with vitamin D deficiency have a markedly increased risk of developing insulin resistance, which not only makes diabetes control more difficult but may also accelerate the progression of metabolic syndrome. It is crucial to note that the majority of evidence linking vitamin D to DPN remains observational and associative. Interventional trials have yielded mixed results, and publication bias may favor positive findings. Therefore, while our study identifies low 25(OH)D3 as a significant risk marker, it does not establish causality. The potential therapeutic role of vitamin D supplementation in preventing or treating DPN requires validation through large-scale, well-designed randomized controlled trials.
Currently, the pathogenesis of diabetic peripheral neuropathy (DPN) remains incompletely understood. The main hypotheses involve inflammation, oxidative stress, and mitochondrial dysfunction (30, 31). In type 2 diabetes mellitus (T2DM), key triggers for these metabolic events are hyperglycemia, and effective blood glucose control is fundamental to the successful management of T2DM and its complications (32). Existing research (33) highlights that one of the main features of T2DM patients is chronic hyperglycemia, and plasma glucose regulation is closely associated with the onset of DPN. Studies show a significant correlation between glucose variability and the occurrence of DPN in T2DM patients (34). Furthermore, elevated HbA1c levels (which reflect poor blood glucose control) have been confirmed to be significantly associated with an increased risk of DPN in diabetic patients (35). In fact, fasting blood glucose is an immediate diagnostic marker that can be influenced by various factors, while HbA1c, representing blood glucose control over the past 2–3 months, is considered a more reliable biomarker for blood glucose control (36). It is widely believed that the higher the HbA1c level, the greater the risk of DPN. The results of this study also show that an increase in HbA1c significantly raises the risk of DPN. However, HbA1c levels can be influenced by several factors, including anemia, hemoglobin lifespan, age, pregnancy, and racial differences. Additionally, because HbA1c reflects a “delayed effect” of blood glucose changes, it cannot capture recent fluctuations in blood glucose, whereas diabetic microvessels are more sensitive to short-term changes in blood glucose (37). In this context, glycated serum proteins (GSP) serve as a supplementary indicator and hold significant value. Due to the relatively short half-life of albumin (17–19 days), GSP can more accurately reflect a patient’s blood glucose control over the past 2–3 weeks. The results of this study indicate that GSP is an independent risk factor for DPN. Although we observed that for each 1-unit increase in GSP, the odds ratio (OR) for DPN risk was 1.091, its predictive ability for DPN incidence on its own may be limited. Previous studies have not identified GSP as an independent risk factor for DPN, likely because in those studies, HbA1c had a higher predictive value. Therefore, the potential mechanisms and predictive value of GSP in DPN occurrence still require further investigation.
The random forest (RF) model, composed of numerous decision trees, can automatically identify the most critical input variables and handle both continuous and categorical variables, making it widely used in clinical applications (38).
The existing research on prediction model of diabetes peripheral neuropathy (DPN) has included as many as nine predictive factors, including diabetes retinopathy and diabetes nephropathy (7). As we all know, these microvascular complications of diabetes often occur simultaneously with DPN, which has a strong correlation and significant impact. Existing research suggests that these microvascular complications share some common risk factors, such as HbA1c, age, and disease duration. If these comorbidities are included in the prediction model, it may lead to bias in the model’s predictions. Therefore, excluding the influence of these factors can more accurately reveal the pathogenesis of DPN and improve the reliability of its prediction model. In this study, we analyzed various risk factors and indicators associated with early DPN. By using the RF model to automatically identify relevant variables, we avoided the limitations of solely observing individual indicators, as individual markers may have biases and one-sidedness. Finally, combining multivariate logistic regression, we identified five risk factors, achieving an 82.9% predictive accuracy, which offers greater generalizability and practicality compared to previous studies (7, 17).
However, this study also has certain limitations. First, the retrospective, single-center design introduces risks of selection and information bias. Hospitalized patients may not be representative of the general T2DM population, potentially overestimating DPN prevalence and risk factor associations. Second, To some extent, the potential bias introduced by other microvascular complications, such as diabetic retinopathy and diabetic nephropathy, has not been ruled out. Therefore, in future studies, we will consider the impact of these comorbidities to more accurately reflect the progression of the disease in the real world. Thirdly, our study cohort consisted of hospitalized patients, who may represent a more severe spectrum of T2DM with potentially higher complication rates. The cross-sectional design limits causal inference, and pre-existing diagnoses or treatments could have influenced both risk factor levels and DPN status. To address these, our future research plan includes a prospective, multicenter cohort study to externally validate the model’s performance and generalizability across diverse healthcare settings.
To summarize, this study successfully identified five key risk factors for diabetic peripheral neuropathy (DPN) using the Random Forest (RF) model, providing a valuable basis for the early diagnosis and personalized management of DPN. The findings hold significant clinical implications.
Data availability statement
The data analyzed in this study is subject to the following licenses/restrictions: The data underlying this article will be shared on reasonable request to the corresponding authors. Requests to access these datasets should be directed to c2VuZ2FuOTlAMTYzLmNvbQ==.
Ethics statement
The studies involving humans were approved by The Ethics Committee of Suzhou BenQ Hospital (SZMJYY2022102001). The studies were conducted in accordance with the local legislation and institutional requirements.
Author contributions
YnL: Writing – original draft, Writing – review & editing. YsL: Methodology, Writing – original draft. GS: Funding acquisition, Methodology, Resources, Writing – review & editing. BH: Funding acquisition, Software, Supervision, Writing – review & editing.
Funding
The author(s) declared that financial support was received for this work and/or its publication. This study was also supported by the Natural Science Foundation of Xinjiang Uygur Autonomous Region (Grant No.2022D01A311), President’s Fund of Tarim University (Grant No.TDZKSS202418) and South Xinjiang Key Industry Innovation and Development Support Plan of Xinjiang Production and Construction Corps (Grant No.2022DB005).
Conflict of interest
The authors declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Heald AH, Stedman M, Davies M, Livingston M, Alshames R, Lunt M, et al. Estimating life years lost to diabetes: outcomes from analysis of National Diabetes Audit and Office of National Statistics data. Cardiovasc Endocrinol Metab. (2020) 9:183–5. doi: 10.1097/XCE.0000000000000210
2. Sun H, Saeedi P, Karuranga S, Pinkepank M, Ogurtsova K, Duncan BB, et al. IDF Diabetes Atlas: Global, regional and country-level diabetes prevalence estimates for 2021 and projections for 2045. Diabetes Res Clin Pract. (2022) 183:109119. doi: 10.1016/j.diabres.2021.109119
3. Magliano DJ and Boyko EJ. IDF Diabetes Atlas 10th edition scientific committee. IDF DIABETES ATLAS [Internet]. 10th ed. Brussels: International Diabetes Federation (2021). Available online at: https://diabetesatlas.org/media/uploads/sites/3/2025/02/IDF_Atlas_10th_Edition_2021.pdf (Accessed December 6, 2021).
4. Elafros MA, Andersen H, Bennett DL, Savelieff MG, Viswanathan V, Callaghan BC, et al. Towards prevention of diabetic peripheral neuropathy: Clinical presentation, pathogenesis, and new treatments. Lancet Neurol. (2022) 21:922–36. doi: 10.1016/S1474-4422(22)00188-0
5. Association AD. 10. Microvascular complications and foot care: standards of medical care in diabetes-2018[J. Diabetes Care. (2018) 41:S105. doi: 10.2337/dc18-S010
6. Stoian A, Muntean C, Babă DF, Manea A, Dénes L, Simon-Szabó Z, et al. Update on biomarkers of chronic inflammatory processes underlying diabetic neuropathy. Int J Mol Sci. (2024) 25:10395. doi: 10.3390/ijms251910395
7. Wang L, Li J, Lin Y, Yuan H, Fang Z, Fei A, et al. Establishment and external validation of an early warning model of diabetic peripheral neuropathy based on random forest and logistic regression. BMC Endocr Disord. (2024) 24:196. doi: 10.1186/s12902-024-01728-9
8. Callaghan BC, Gao L, Li Y, Zhou X, Reynolds E, Banerjee M, et al. Diabetes and obesity are the main metabolic drivers of peripheral neuropathy. Ann Clin Trans Neurol. (2018) 5:397–405. doi: 10.1002/acn3.531
9. Callaghan BC, Xia R, Reynolds E, Banerjee M, Rothberg AE, Burant CF, et al. Association between metabolic syndrome components and polyneuropathy in an obese population. JAMA Neurol. (2016) 73:1468–76. doi: 10.1001/jamaneurol.2016.3745
10. Ma Y, Wang Z, Yao Z, Lu B, and He Y. Machine learning in the prediction of diabetic peripheral neuropathy: a systematic review. BMC Med Inform Decis Mak. (2025) 25:344. doi: 10.1186/s12911-025-03201-6
11. Sheikh MM, Balachandra M, VG N, and Maiya AG. Predicting diabetic peripheral neuropathy through advanced plantar pressure analysis: a machine learning approach. Sci Rep. (2025) 15:20962. doi: 10.1038/s41598-025-07774-0
12. Wang L, Li J, Lin Y, Yuan H, Fang Z, Fei A, et al. Establishment and external validation of an early warning model of diabetic peripheral neuropathy based on random forest and logistic regression[J. BMC Endoc Disord. (2024) 24:1–14. doi: 10.1186/s12902-024-01728-9
13. Yang L, Wu H, Jin X, Zheng P, Hu S, Xu X, et al. Study of cardiovascular disease prediction model based on random forest in eastern China[J. Sci Rep. (2020) 10:5245. doi: 10.1038/s41598-020-62133-5
14. Chinese Diabetes Society. . Guideline for the prevention and treatment of type 2 diabetes mellitus in China (2020 edition). Chin J Diabetes Mellitus. (2021) 13:315–409. doi: 10.3760/cma.j.cn115791-20210221-00095
15. Yang Y, Zhao B, Wang Y, Lan H, Liu X, Hu Y, et al. Diabetic neuropathy: cutting-edge research and future directions. Signal Transduct Targ Ther. (2025) 10:132. doi: 10.1038/s41392-025-02175-1
16. Hindi E, Almusally BA, Bawareth R, Alhamadah W, Alfaraj R, Almwled A, et al. Diabetic polyneuropathy in type 1 and type 2 diabetes mellitus: A cross-sectional study. Cureus. (2022) 14:e30004. doi: 10.7759/cureus.30004
17. Li Y, Li Y, Deng N, Shi H, Caika S, Sen G, et al. Training and external validation of a predict nomogram for type 2 diabetic peripheral neuropathy. Diagnostics. (2023) 13:1265. doi: 10.3390/diagnostics13071265
18. Edwards JL, Vincent AM, Cheng HT, and Feldman EL. Diabetic neuropathy: mechanisms to management. Pharmacol Ther. (2008) 120:1–34. doi: 10.1016/j.pharmthera.2008.05.005
19. Wei Z, Wang X, Lu L, Li S, Long W, Zhang L, et al. Construction of an early risk prediction model for type 2 diabetic peripheral neuropathy based on random forest. CIN: Comput Inform Nurs. (2024) 42:665–74. doi: 10.1097/CIN.0000000000001157
20. Mehta S, Nain P, Agrawal BK, Singh RP, Kaur J, Maity S, et al. Effectiveness of empagliflozin with vitamin D supplementation in peripheral neuropathy in type 2 diabetic patients. Cureus. (2021) 13:e20208. doi: 10.7759/cureus.20208
21. Shehab D, Al-Jarallah K, Abdella N, Mojiminiyi OA, and Al Mohamedy H. Prospective evaluation of the effect of short-term oral vitamin D supplementation on peripheral neuropathy in type 2 diabetes mellitus. Med Princ Pract. (2015) 24:250. doi: 10.1159/000375304
22. Sun X, Yang X, Zhu X, Ma Y, Li X, Zhang Y, et al. Association of vitamin D deficiency and subclinical diabetic peripheral neuropathy in type 2 diabetes patients. Front Endocrinol. (2024) 15:1354511. doi: 10.3389/fendo.2024.1354511
23. Rahaman KM, Remli R, Tan HJ, Mustafa N, Abdul Manaf MR, and Ng CF. The association of vitamin D level with diabetic peripheral neuropathy: A comparative crosssectional study. Neurol Asia. (2023) 28:663–671. doi: 10.54029/2023eww
24. Li K, Chen Y, Xie J, Cai W, Pang C, Cui C, et al. How vitamins act as novel agents for ameliorating diabetic peripheral neuropathy: A comprehensive overview. Ageing Res Rev. (2023) 91:102064. doi: 10.1016/j.arr.2023.102064
25. Carlson AN and Kenny AM. Is vitamin D insufficiency associated with peripheral neuropathy? Endocrinol. (2007) 17:319–25. doi: 10.1097/TEN.0b013e3181594284
26. Ines M and Musiol D. 1, 25-dihydroxyvitamin D3 induction of nerve growth factor in L929 mouse fibroblasts: effect of vitamin D receptor regulation and potency of vitamin D3 analogs1. Endocrinology. (1997) 138:12–8. doi: 10.1210/endo.138.1.4858
27. Palomer X, González-Clemente JM, BlancoVaca F, and Mauricio D. Role of vitamin D in the pathogenesis of type 2 diabetes mellitus. Diabetes Obes Metab. (2008) 10:185–97. doi: 10.1111/j.1463-1326.2007.00710.x
28. Boucher BJ, Mannan N, Noonan K, Hales CN, and Evans SJ. Glucose intolerance and impairment of insulin secretion in relation to vitamin D deficiency in east London Asians. Diabetologia. (1995) 38:1239–45. doi: 10.1007/BF00422375
29. Boucher BJ, John WG, and Noonan K. Hypovitaminosis D is associated with insulin resistance and beta cell dysfunction. Am J Clin Nutr. (2004) 79:820–5. doi: 10.1093/ajcn/79.5.820
30. Feldman EL, Nave KA, Jensen TS, and Bennett DLH. New horizons in diabetic neuropathy: mechanisms, bioenergetics, and pain. Neuron. (2017) 93:1296–313. doi: 10.1016/j.neuron.2017.02.005
31. Ocak Ö, Silan F, and Şahin EM. Melatonin receptor gene polymorphisms as a risk factor in patients with diabetic peripheral neuropathy. Diabetes/metabol Res Rev. (2022) 38:e3573. doi: 10.1002/dmrr.3573
32. Tang Y, Zhang P, Li L, and Li J. Diabetic peripheral neuropathy and glycemia risk index in type 2 diabetes: A cross-sectional study. Diabetes Metab Syndr Obes. (2024) 17:4191–8. doi: 10.2147/DMSO.S482824
33. Zhang H, Chen Y, Zhu W, Niu T, Song B, Wang H, et al. The mediating role of HbA1c in the association between elevated low-density lipoprotein cholesterol levels and diabetic peripheral neuropathy in patients with type 2 diabetes mellitus. Lipids Health Dis. (2023) 22:102. doi: 10.1186/s12944-023-01865-5
34. Xu F, Zhao LH, Su JB, Chen T, Wang XQ, Chen JF, et al. The relationship between glycemic variability and diabetic peripheral neuropathy in type 2 diabetes with well-controlled HbA1c. Diabetol Metab Syndr. (2014) 6:139. doi: 10.1186/1758-5996-6-139
35. Liu X, Xu Y, An M, and Zeng Q. The risk factors for diabetic peripheral neuropathy: a meta-analysis. PloS One. (2019) 14:e0212574. doi: 10.1371/journal.pone.0212574
36. Sherwani SI, Khan HA, Ekhzaimy A, Masood A, and Sakharkar MK. Significance of HbA1c test in diagnosis and prognosis of diabetic patients. biomark Insights. (2016) 11:95–104. doi: 10.4137/bmi.s38440
37. Dai D, Shen Y, Lu J, Wang Y, Zhu W, Bao Y, et al. Association between visit-to-visit variability of glycated albumin and diabetic retinopathy among patients with type 2 diabetes—A prospective cohort study. J Diabetes Its Complicat. (2021) 35:107971. doi: 10.1016/j.jdiacomp.2021.107971
Keywords: 25(OH)D3, diabetic peripheral neuropathy, GPS, random forest, type 2diabetes mellitus (T2DM)
Citation: Li Y, Li Y, Sen G and Hu B (2026) Deep exploration and precise identification of key risk factors for diabetic peripheral neuropathy using the random forest algorithm. Front. Endocrinol. 16:1740545. doi: 10.3389/fendo.2025.1740545
Received: 06 November 2025; Accepted: 11 December 2025; Revised: 10 December 2025;
Published: 07 January 2026.
Edited by:
Zoltan Kender, Heidelberg University, GermanyReviewed by:
Dimitrios Tsilingiris, Heidelberg University Hospital, GermanyAleksandra Araszkiewicz, Poznan University of Medical Sciences, Poland
Copyright © 2026 Li, Li, Sen and Hu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yongsheng Li, bWF0aDAxMTVAMTI2LmNvbQ==; Gan Sen, c2VuZ2FuOTlAMTYzLmNvbQ==; Bin Hu, MTEwNTc1NzE0OEBxcS5jb20=
†These authors have contributed equally to this work
Yongnan Li1†