A study on predicting impaired fasting glucose risk in Chinese adults based on individual characteristics

Lin, Yijun; Wu, Wenxu; Liang, Xiaoyan; Zhou, Liping; Li, Gan; Kang, Cuiling; Li, Wuzhen; Huang, Chunyi; Tian, Feng

doi:10.3389/fmed.2025.1584626

ORIGINAL RESEARCH article

Front. Med., 09 June 2025

Sec. Family Medicine and Primary Care

Volume 12 - 2025 | https://doi.org/10.3389/fmed.2025.1584626

This article is part of the Research TopicNew Trends in Type 2 Diabetes Diagnosis and Management in Primary Care, volume IIView all 12 articles

A study on predicting impaired fasting glucose risk in Chinese adults based on individual characteristics

Yijun Lin¹^†

Wenxu Wu^2,3^†

Xiaoyan Liang¹

Liping Zhou¹

Gan Li¹

Cuiling Kang¹

Wuzhen Li⁴

Chunyi Huang^1*‡

Feng Tian^1*‡

¹Department of Health Management Centre, The Eighth Affiliated Hospital of Southern Medical University (The First People's Hospital of Shunde), Foshan, Guangdong, China
²Department of Health Examination Center, The First People's Hospital of Nanning, Nanning, China
³Department of Health Examination Center, The Fifth Affiliated Hospital of Guangxi Medical University, Nanning, China
⁴School of Politics and Public Administration, South China Normal University, Guangzhou, Guangdong, China

Introduction: This study aimed to develop a nomogram for early detection of impaired fasting glucose (IFG), predicting the 5-year risk in Chinese adults due to its link to various diseases.

Materials and methods: This retrospective cohort study included 28,875 participants without IFG at baseline, randomly divided them to a training set and a validation set. We developed four predictive models—LASSO, full, stepwise, and MFP—ultimately selecting the LASSO model for nomogram development due to its simplicity and predictive performance. Four prediction model performance was assessed through ROC analysis, calibration curves, and decision curve analysis, with external validation using Shunde Hospital (n = 18,618) and NHANES (n = 2,038) dataset.

Results: We developed a nomogram to predict the risk of IFG by incorporating parameters including age, body mass index (BMI), systolic blood pressure (SBP), fasting plasma glucose (FPG), and triglycerides (TG), which demonstrated performance with AUCs of 0.8167 and 0.8155 in the training and validation set, respectively. External validation achieved AUC 0.9665 (Shunde Hospital dataset) and 0.9171 (NHANES).

Conclusions: Our nomogram provides a personalized, validated approach for assessing 5-year IFG risk in Chinese adults, offering a practical screening tool for primary healthcare and resource-constrained environments.

Introduction

Diabetes has become one of the most serious public health challenges in the world, and its incidence and prevalence continue to rise globally (1). In 2024, ~589 million adults aged 20–79 worldwide are suffering from diabetes globally, representing 11.1% of this age group; this figure is expected to rise to 853 million by 2050, accounting for 13.0% (2). According to data from the 11th edition of the International Diabetes Alliance (IDF) Diabetes Atlas, although the prevalence of diabetes in China is very high, there are indeed a large number of undiagnosed cases, and the undiagnosed rate of diabetes is as high as 49.7% (2). The American Diabetes Association (ADA) acknowledged impaired fasting glucose (IFG) as an indicator of prediabetes as early as 1997 (3). The incidence of IFG globally was 5.8% in 2021, accounting for 298 million individuals. It is anticipated that this figure will increase to 6.5%, representing 414 million individuals, by the year 2045 (4). Often without noticeable symptoms, IFG serves as a sign of a hidden and potentially dangerous early stage of abnormal glucose metabolism (5). Consequently, IFG may be easily disregarded, yet it is intricately associated with type 2 diabetes mellitus (4) and cardiovascular diseases (CVD) (6), as well as cardiometabolic multimorbidity (7), heart failure (8), chronic kidney disease (9), and cerebral hemorrhage (10).

In recent years, many risk prediction models for diabetes have been developed (11–13). The Finnish Diabetes Risk Score (14) (FINDRISC) and the ADA risk assessment tools (15) are two widely used diabetes risk prediction models. While FINDRISC is mainly designed for European populations without verified applicability to the Chinese population, the ADA risk tool emphasizes diabetes risk but does not offer a specific approach for predicting IFG. Few studies have specifically focused on modeling the risk of IFG, and most existing research primarily explores its risk factors through cross-sectional analysis (16). Therefore, it is crucial to develop reliable risk assessment models that enable individuals to assess their risk of impaired fasting glucose, especially given the lack of such models for the adult population across multiple centers in China. Our study is currently underway to create and validate personalized nomograms for predicting IFG in a diverse cohort of Chinese adults across 32 sites and 11 cities. The goal of this research is to provide clinical professionals with a reliable tool for accurately identifying individuals at risk and conducting timely screenings. The model is designed to be cost-effective and easily accessible for widespread use.

Materials and methods

Study design and participants

Based on the data from the China Rich Healthcare Group's database, this retrospective cohort study was conducted with a 5-year follow-up period focusing on IFG as the dependent variable (17). IFG was categorized into two groups: non-IFG and IFG. The data utilized in this study was obtained from the publicly accessible, non-profit database DATADRYAD (http://www.DatadRyad.org) established by the Rich Healthcare Group. The data, sourced from Chen et al. (17), is publicly available and originates from the study “Association of body mass index and age with incident diabetes in Chinese adults: a population-based cohort study.” It can be accessed in the Dryad Digital Repository at http://dx.doi.org/10.1136/bmjopen-2018-021768. A total of 685,277 participants aged 20 and above who underwent at least two standard health examinations between 2010 and 2016 were included in the initial study. In the baseline study, demographic and clinical variables including age, gender, smoking status, alcohol consumption, family history of diabetes, body mass index (BMI), systolic and diastolic blood pressures (SBP and DBP), fasting plasma glucose (FPG), total cholesterol (TC), triglycerides (TG), low-density and high-density lipoprotein cholesterol (LDL-C and HDL-C), serum urea nitrogen (BUN), serum creatinine (Scr), alanine aminotransferase (ALT), years of follow-up, and censor of IFG at follow-up were collected. The initial study employed the following criteria to exclude participants: (1) absence of weight, height, and gender data; (2) BMI falling outside the range of 15–55 kg/m²; (3) visit intervals shorter than 2 years; (4) lack of FPG levels; and (5) presence of diabetes or indeterminate diabetes status. The final sample size comprised 211,833 participants. Furthermore, individuals without baseline data essential for assessing the risk of developing IFG were also excluded. The participant selection process is depicted in Figure 1. Our analysis included a total of 28,875 subjects. The data collected is anonymous, and the Rich Healthcare Group Review Board waived the need for informed consent due to the observational design of the study.

Figure 1

Figure 1. Flowchart of study participants.

Variable measurement

At the health check center, every participant completed an individual questionnaire inquiring about demographics, lifestyle, medical history, and family history of chronic diseases. An experienced staff member conducted the initial examination, which involved anthropometric measurements and laboratory biochemical testing. Participants were weighed and measured with a precision of 0.1 kg without wearing shoes. Blood pressure was measured using a standard mercury sphygmomanometer, and BMI was calculated by dividing weight by height squared. During the study, fasting venous blood samples were obtained from participants after a minimum fasting period of 10 h. Plasma glucose levels were measured using glucose oxidase, and a Beckman 5800 autoanalyzer was employed to evaluate FPG, TC, TG, LDL-C, HDL-C, BUN, Scr, and ALT. Standardized conditions and consistent procedures were maintained throughout the data collection process, with laboratory methods rigorously standardized through extensive quality checks at both internal and external levels.

Definition of outcome

According to the Chinese Type II Diabetes Prevention and Control Guidelines (2017 Edition), FBG between 6.1 and 6.9 mmol/L was consider as IFG (18). Patients were censored either at the time of the diagnosis or at the last visit, whichever comes first.

Statistical analysis

The training and validation sets were randomly allocated to all eligible participants. Through collinearity screening, variables displaying significant interference were excluded. The variance inflation factor (VIF) was calculated for each variable, and those with a VIF >5 were removed to address severe multicollinearity (19), which can significantly decrease the model's statistical stability and predictive accuracy (Supplementary Table S2).

Normally distributed continuous variables were reported as means and standard deviations, skewed variables as medians, and categorical variables as frequencies or percentages. The study utilized student t-tests to assess disparities between the training and validation sets for normally distributed continuous variables, Wilcoxon rank-sum tests for non-normally distributed continuous variables, and chi-square tests for categorical variables. Standardized differences of < 0.10 for a given covariate indicate a relatively small imbalance (20). Logit regression models were employed to ascertain the significance of each variable in identifying independent risk factors associated with IFG.

This study compared four distinct risk prediction models with the aim of developing a straightforward and dependable model for risk prediction. Initially, a comprehensive model incorporating all available risk factors (full model) was constructed, followed by a bidirectional stepwise selection process guided by the Akaike information criterion to streamline the model (stepwise model). Subsequently, the multivariable fractional polynomials algorithm was employed to identify crucial variables through a backward elimination process, culminating in a highly practical model (MFP model). Finally, the model underwent initial variable screening utilizing the Least Absolute Shrinkage and Selection Operator, a method known for its ability to streamline high-dimensional data and identify key predictors (21).

LASSO was chosen for its reduced number of variables and robust predictive capabilities during our comprehensive analysis (22), offering clinical interpretability and practical value compared to more complex models by significantly simplifying model complexity. ROC curves were generated, and the AUC along with 95% CI were calculated for both the training and validation datasets. The LASSO method was utilized to construct a nomogram that converts the regression coefficients obtained from a multivariate logistic regression analysis into a scoring system ranging (23). The variable with the largest absolute β coefficient was allocated 100 points. Subsequently, the cumulative points for all independent variables were computed and transformed into predicted probabilities of developing IFG. Each patient's nomogram score denoted their position within the predictive model, which was then juxtaposed with the observed 5-year incidence of IFG risk deciles in the training dataset.

To evaluate the agreement between the predicted and the actual 5-year risk of incident IFG, the Hosmer–Lemeshow test was used across deciles of predicted risk. The calibration of the model was further examined using a calibration plot. To address potential overfitting and assess model stability, 500 bootstrap resamples were applied to derive bias-corrected AUC estimates and enhance the credibility of the nomogram. To further evaluate the clinical utility of the LASSO regression model, we performed decision curve analysis (DCA) in the training sets. Standardized net benefit was plotted against high-risk thresholds to assess and visualize the added value of the LASSO nomogram in predicting 5-year IFG risk.

External validation was performed using two independent cohorts: (1) 18,618 individuals from routine health examinations at the Health Management Department of Shunde Hospital, Southern Medical University (January 2021–September 2022), and (2) 2,038 participants from NHANES 2017–2018. For both datasets, individuals with IFG at baseline were excluded. Missing values were imputed using the “mice” package in R.

All statistical analyses were performed using R software, version 4.3 (The R Foundation for Statistical Computing, http://www.R-project.org/). All tests were two-tailed, and a P-value of < 0.05 was considered statistically significant.

Results

Baseline characteristics of study participants

A total of 28,875 qualified participants were included in this study, with men comprising 64.27% and women 35.73%. The average age was 42.54 ± 12.37 years. During a median follow-up period of 2.96 years (range: 2.001–4.999 years), 948 participants were diagnosed with IFG. The average BMI was 23.44 ± 3.26 kg/m²; the mean SBP and DBP were 119.26 ± 15.57 and 74.64 ± 10.41 mmHg, respectively. The mean FPG level was 4.91 ± 0.54 mmol/L, with HDL-C and LDL-C averaging 1.34 ± 0.30 and 2.74 ± 0.68 mmol/L. Baseline levels of BUN and Scr were 4.68 ± 1.15 mmol/L and 72.14 ± 15.19 μmol/L, respectively. The mean follow-up duration was 2.99 ± 0.85 years. We summarize the demographic, clinical, and anthropometric characteristics of participants, showing no significant differences between the training set (n = 14,413) and validation set (n = 14,462) at baseline (all P > 0.05; Table 1). We also categorized participants based on IFG status (Supplementary Table S1).

Table 1

Table 1. Baseline characteristics of the training and validation sets.

Identification of independent risk factors for IFG

Univariate and multivariate logistic regression analyses were conducted to identify risk factors for incident IFG. In univariate analysis, all examined variables were significant predictors of IFG (all P < 0.05). After adjustment in the multivariate model, age (OR = 1.03), BMI (OR = 1.08), SBP (OR = 1.00), DBP (OR = 1.01), FPG (OR = 5.85), TG (OR = 1.21), HDL-C (OR = 0.67), ALT (OR = 1.00), Scr (OR = 0.98), and a family history of diabetes (OR = 1.78) remained significantly associated with the risk of IFG (all P < 0.05). In contrast, gender, LDL-C, BUN, smoking status, and alcohol consumption were not significantly associated with IFG (all P > 0.05; Table 2).

Table 2

Table 2. Risk predictors for incident diabetes in the univariate and multivariate analysis.

Development and validation of IFG risk prediction models

We constructed four risk prediction models—the full model, stepwise model, MFP model, and LASSO model (Supplementary Figure S1). Given its simplicity and robust predictive performance, the LASSO model was selected for nomogram development, identifying five key predictors: age, BMI, SBP, FPG, and TG (Figure 2). The 5-year IFG risk can be estimated using the following formula: −17.71490 + 0.03257 × age (years) + 0.09353 × BMI (kg/m²) + 0.01376 × SBP (mmHg) + 1.6520 × FPG (mmol/L) + 0.18660 × TG (mmol/L). The predictive performance details are presented in Table 3, with the LASSO model demonstrating comparable accuracy to other models.

Figure 2

Figure 2. Nomogram to predict the risk of IFG for Chinese adults. The patient's score for each risk predictor is plotted on the appropriate scale. The patient's score for each risk predictor is plotted on the appropriate scale and vertical lines are drawn from that value to the top Points scale to obtain the corresponding scores. All scores are summed to obtain the total points score. The total points score is plotted on the bottom Total Points scale. The corresponding value shows the predicted probability of incident IFG.

Table 3

Table 3. Prediction performance of LASSO, MFP, full and stepwise model for the risk of diabetes.

Prediction performance and clinical utility of the LASSO model

The AUC values of the LASSO model for the training and validation sets were 0.8167 and 0.8155, respectively (Table 3). Additionally, bootstrap validation demonstrated consistent AUC values for the prediction nomogram. The predictive performance of the full, stepwise, and MFP models is also presented in Table 3 and Supplementary Figure S2. Calibration of the LASSO model was assessed using both a calibration plot (Figure 3) and the Hosmer–Lemeshow test. The calibration plot showed that the predicted probabilities closely matched the actual incidence of IFG in the training set, indicating good overall calibration. In addition, the Hosmer–Lemeshow test assessed the agreement between predicted and observed risks across deciles of predicted risk, demonstrating no statistically significant difference (P > 0.05), which further supports the reliable calibration of the nomogram (Supplementary Figure S3).

Figure 3

Figure 3. Calibration plot for LASSO regression model in predicting IFG.

We conducted decision curve analysis (DCA) to evaluate the clinical utility of the LASSO model (Figure 4). The DCA results demonstrated that across a range of high-risk thresholds (0.0–1.0), the standardized net benefit curve of the LASSO model consistently exceeded the baseline strategies of “no treatment line” (black line) and “all treatment line” (light gray line). This indicates that clinicians would achieve clinical benefit when using the LASSO model for decision-making within the risk prediction threshold range. The area between the model curve and the two baseline lines represents the clinical utility of the model, with greater distance between the model curve and baseline lines indicating higher clinical value of the nomogram. Furthermore, Supplementary Figure S4 provided a comparative analysis of decision curves for four models (full model, stepwise model, MFP model, and LASSO model) in both training and validation cohorts. The results indicated that despite using the fewest predictors, the LASSO model demonstrated clinical utility comparable to more complex models. This further confirms the rationale for selecting the LASSO model to develop the final risk prediction nomogram, which achieves maximum clinical utility and simplicity while maintaining high predictive performance.

Figure 4

Figure 4. The decision curve analysis of the LASSO model for 5-year IFG risk in the training cohort. The black line represents the net benefit when none of the participants are considered to develop IFG, while the light gray line represents the net benefit when all participants are considered to develop IFG. The area between the “no treatment line” (black line) and “all treatment line” (light gray line) in the model curve indicates the clinical utility of the model. The farther the model curve is from the black and light gray lines, the better the clinical use of the nomogram.

External validation

External validation was performed using two independent datasets. The first cohort included 18,618 participants from the Health Management Department of Shunde Hospital, Southern Medical University. The second cohort comprised 2,038 participants from the NHANES 2017–2018 survey. The AUC values for external validation were 0.9665 for the Shunde Hospital cohort and 0.9171 for the NHANES cohort (Supplementary Tables S3, S4). At the optimal threshold, the Shunde Hospital cohort showed a specificity of 88.83% and a sensitivity of 93.66%, while the NHANES cohort demonstrated a specificity of 81.48% and a sensitivity of 86.51%, indicating excellent predictive performance of the nomogram in both external populations.

Sensitivity analysis

For the sensitivity analysis, individuals with IFG at baseline were first excluded from the overall population in the original study, resulting in 202,402 participants included for analysis. Multiple imputations were then performed for the remaining population with missing data on relevant variables. The LASSO model was applied to this imputed cohort, achieving an AUC of 0.8308. At the optimal threshold, the specificity and sensitivity were 73.76 and 78.14%, respectively (Supplementary Table S4).

Discussion

This retrospective cohort study developed and validated a personalized nomogram for predicting 5-year IFG risk in Chinese adults using five readily available clinical parameters (age, BMI, SBP, FPG, and TG). The LASSO regression model demonstrated predictive performance through internal validation and was tested in the external validation dataset. Decision curve analysis evaluated the model's clinical applicability at different risk thresholds, providing clinicians with a potential risk assessment tool.

Numerous studies have indicated that IFG plays a significant role as a transitional phase between normal health and the onset of DM. IFG is an independent risk factor for the development of DM (24). Effective prevention and management of DM have been shown to hinge upon the reduction of IFG rates (25). Early identification and intervention for DM have also proven to be effective strategies in the management of the disease. Compared to the clinical and laboratory parameter nomogram model developed by Wang et al. (16) based on over 4,000 individuals, our model differs in terms of the number of predictive factors and model complexity. The model includes six predictive factors: age, systolic blood pressure, BMI, albumin, urea, and triglycerides. Its AUC values are 0.783 for the training set and 0.7891 for the validation set. Our model demonstrates certain potential for clinical application in predictive performance. Compared to another IFG prediction model developed using the extreme gradient boosting (XGBoost) algorithm (26), our study explores a different approach to model interpretability. This model achieved an AUC value of 0.7391 in the validation set and included key predictive factors such as SBP, waist circumference, fatty liver, and serum creatinine. While machine learning algorithms like XGBoost offer powerful predictive capabilities, they often present challenges in clinical interpretation due to their complex underlying mechanisms. Our nomogram addresses this challenge by providing a visually intuitive representation that facilitates understanding for both healthcare professionals and patients. Our study employed both internal and external validation to thoroughly assess its generalizability, drawing parallels with Byeon's model (27), which achieved an AUC of 0.751 in predicting IFG risk among non-diabetic individuals in South Korea. To further evaluate the clinical applicability of our model, we conducted DCA across a wide range of risk thresholds. While many existing models have not extensively explored this analytical approach, our research aimed to provide a more nuanced understanding of the model's practical utility.

The IFG predictive nomogram model developed in this research demonstrates good clinical practicality, particularly in primary healthcare and resource-limited settings. The model relies solely on five routine clinical parameters, requiring no additional complex tests, a characteristic that enables its flexible application across different medical resource environments. In primary healthcare and community health service centers, this model can serve as a supplementary tool to routine physical examinations, enabling large-scale screening and the timely identification of individuals at high risk for IFG. It is especially practical in resource-limited rural areas, where risk assessment can be completed using only basic diagnostic equipment. In urban general hospitals, the model can be integrated into the routine assessment processes of health management centers and embedded within electronic health record systems to achieve automated risk evaluation, providing a basis for physicians to develop personalized health management plans. In the field of public health, this model can be used for population-level monitoring of IFG risk, offering scientific evidence for policy-making. With its characteristics of simplicity, cost-effectiveness, and ease of operation, this model is expected to become an effective tool for early identification and intervention of prediabetes in primary healthcare institutions.

Within the model, five specific risk factors were identified as significantly correlated with IFG, a finding consistent with prior research demonstrating these factors as key determinants of IFG (28, 29). A s individuals age, the risk of developing diabetes increases due to age-related changes in pancreatic β cells, including reduced glucose sensitivity and insulin secretion. This age-related glucose intolerance is often associated with insulin resistance and β-cell dysfunction (30). Additionally, aging human pancreatic islets may experience a decrease in mitochondrial DNA copy number, further impairing insulin secretion (31). Obesity is known to substantially elevate the likelihood of developing a range of metabolic disorders, particularly affecting the function of pancreatic β-cells by promoting excessive fat deposition in the liver and pancreas. Research has demonstrated a correlation between obesity and heightened levels of pancreatic fat accumulation (32). Additionally, regarding blood pressure, in the study by Sasaki et al. (33), it is established that the rate of hypertension significantly increases from normal fasting glucose to isolated IFG, indicative of a direct association between IFG and elevated hypertension risk. Our nomogram indicates that individuals with elevated FPG levels exhibit a greater propensity for IFG, with FPG emerging as a distinct risk factor in this context. This relationship is likely attributable to FPG's substantial impact on insulin responsiveness and sensitivity (34). Triglycerides play a crucial role in lipid storage and metabolic regulation within adipose tissue (35). The presence of excessive adipose tissue can significantly worsen insulin resistance by generating proinflammatory cytokines and lipid metabolites that induce insulin resistance (36). This highlights the complex interplay between adiposity, inflammation, and metabolic dysfunction, ultimately contributing to the development of IFG. As such, the inclusion of the five risk predictors in our models is justified.

Strengths and limitations of this study

The strengths of our study include a large sample size derived from a diverse participant pool across multiple centers, the selection of the LASSO model for its simplicity and predictive performance in developing four predictive models for clinical feasibility, the ability for clinicians to efficiently assess an individual's risk of impaired fasting glucose using a formula for risk prediction, the verification of findings through internal and external validations, and the mitigation of selection and information biases through a retrospective cohort design.

Several limitations should be acknowledged in our study. First, this research is primarily a secondary retrospective analysis, with inherent constraints in the original dataset. The data lacks comprehensive information on variables such as waist-to-hip ratio, detailed medical history, and comprehensive lifestyle factors—all of which could potentially influence IFG development. Second, although methodologically sound, our utilization of multiple imputation techniques to address missing data may potentially introduce bias into the analysis, thereby affecting the precision of our estimates. Additionally, this study used data from a cohort study conducted during 2010–2016, and we acknowledge that these relatively older data may impose certain limitations, potentially not fully reflecting current trends in the non-communicable disease burden among the Chinese population. Given the limitations of current research, future studies should prioritize comprehensive prospective research, improved baseline measurements, and the integration of lifestyle and genetic factor assessments to enhance the accuracy and generalizability of models for predicting the risk of IFG.

Conclusion

A personalized prediction nomogram was developed and validated to assess the 5-year risk of developing IFG among Chinese adults based on age, BMI, SBP, FPG, and TG levels. The nomogram exhibited excellent predictive accuracy during both training and validation phases, indicating its potential for generalizability. This tool aids clinicians in identifying individuals at high risk for IFG through a straightforward and dependable approach.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: The 'DATADRYAD' database (http://www.Datadryad.org) provides access to data.

Ethics statement

The studies involving humans were approved by the rich healthcare group review board. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants' legal guardians/next of kin because for this retrospective study, no approval or informed consent was required by the institutional Ethics Committee.

Author contributions

YL: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft. WW: Conceptualization, Data curation, Methodology, Writing – original draft, Formal analysis, Project administration, Resources. XL: Data curation, Formal analysis, Project administration, Visualization, Writing – original draft. LZ: Methodology, Project administration, Resources, Visualization, Writing – original draft. GL: Investigation, Software, Writing – original draft. CK: Writing – original draft, Resources. WL: Visualization, Writing – original draft. CH: Validation, Writing – review & editing, Conceptualization, Methodology. FT: Supervision, Validation, Visualization, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This study was supported by Clinical Research Startup Program of Shunde Hospital, Southern Medical University (CRSP2022008) and Scientific Research Start Plan of Shunde Hospital, Southern Medical University (SRSP2021011).

Acknowledgments

The authors thank the field investigators for their contribution and the participants for their cooperation.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Gen AI was used in the creation of this manuscript.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2025.1584626/full#supplementary-material

References

1. GBD 2021 Diabetes Collaborators. Global, regional, and national burden of diabetes from 1990 to 2021, with projections of prevalence to 2050: a systematic analysis for the global burden of disease study 2021. Lancet. (2023) 402: 203–34. doi: 10.1016/S0140-6736(23)01301-6

PubMed Abstract | Crossref Full Text | Google Scholar

2. Sun H, Saeedi P, Karuranga S, Pinkepank M, Ogurtsova K, Duncan BB, et al. IDF diabetes atlas: global, regional and country-level diabetes prevalence estimates for 2021 and projections for 2045. Diabetes Res Clin Pract. (2022) 183: 109119. doi: 10.1016/j.diabres.2021.109119

PubMed Abstract | Crossref Full Text | Google Scholar

3. Report of the expert committee on the diagnosis and classification of diabetes mellitus. Diabetes Care. (1997) 20: 1183–97. doi: 10.2337/diacare.20.7.1183

PubMed Abstract | Crossref Full Text | Google Scholar

4. Rooney MR, Fang M, Ogurtsova K, Ozkan B, Echouffo-Tcheugui JB, Boyko EJ, et al. Global prevalence of prediabetes. Diabetes Care. (2023) 46:1388–94. doi: 10.2337/dc22-2376

PubMed Abstract | Crossref Full Text | Google Scholar

5. Diagnosis and classification of diabetes: standards of care in diabetes-2024. Diabetes Care. (2024) 47: S20–42. doi: 10.2337/dc24-S002

PubMed Abstract | Crossref Full Text | Google Scholar

6. Cai X, Zhang Y, Li M, Wu JH, Mai L, Li J, et al. Association between prediabetes and risk of all cause mortality and cardiovascular disease: updated meta-analysis. BMJ. (2020) 370:m2297. doi: 10.1136/bmj.m2297

PubMed Abstract | Crossref Full Text | Google Scholar

7. Guo Z, Wu S, Zheng M, Xia P, Li Q, He Q, et al. Association of impaired fasting glucose with cardiometabolic multimorbidity: the Kailuan study. J Diabetes Investig. (2025) 16:129–36. doi: 10.1111/jdi.14316

PubMed Abstract | Crossref Full Text | Google Scholar

8. Lind V, Hammar N, Lundman P, Friberg L, Talbäck M, Walldius G, et al. Impaired fasting glucose: a risk factor for atrial fibrillation and heart failure. Cardiovasc Diabetol. (2021) 20:227. doi: 10.1186/s12933-021-01422-3

PubMed Abstract | Crossref Full Text | Google Scholar

9. Echouffo-Tcheugui JB, Perreault L, Ji L, Dagogo-Jack S. Diagnosis and management of prediabetes: a review. JAMA. (2023) 329:1206–16. doi: 10.1001/jama.2023.4063

PubMed Abstract | Crossref Full Text | Google Scholar

10. Jin C, Li G, Rexrode KM, Gurol ME, Yuan X, Hui Y, et al. Prospective study of fasting blood glucose and intracerebral hemorrhagic risk. Stroke. (2018) 49:27–33. doi: 10.1161/STROKEAHA.117.019189

PubMed Abstract | Crossref Full Text | Google Scholar

11. Xiong XL, Zhang RX Bi Y, Zhou WH Yu Y, Zhu DL. Machine learning models in type 2 diabetes risk prediction: results from a cross-sectional retrospective study in Chinese adults. Curr Med Sci. (2019) 39:582–8. doi: 10.1007/s11596-019-2077-4

PubMed Abstract | Crossref Full Text | Google Scholar

12. Štiglic G, Kocbek P, Cilar L, Fijačko N, StoŽer A, Zaletel J, et al. Development of a screening tool using electronic health records for undiagnosed type 2 diabetes mellitus and impaired fasting glucose detection in the Slovenian population. Diabet Med. (2018) 35:640–9. doi: 10.1111/dme.13605

PubMed Abstract | Crossref Full Text | Google Scholar

13. Elizalde-Barrera CI, Rubio-Guerra AF, Lozano-Nuevo JJ, Olvera-Gomez JL. Triglycerides and waist to height ratio are more accurate than visceral adiposity and body adiposity index to predict impaired fasting glucose. Diabetes Res Clin Pract. (2019) 153:49–54. doi: 10.1016/j.diabres.2019.05.019

PubMed Abstract | Crossref Full Text | Google Scholar

14. Lindström J, Tuomilehto J. The diabetes risk score: a practical tool to predict type 2 diabetes risk. Diabetes Care. (2003) 26:725–31. doi: 10.2337/diacare.26.3.725

PubMed Abstract | Crossref Full Text | Google Scholar

15. ADA. Screening for type 2 diabetes. Diabetes Care. (2004) 27(Suppl 1): S11–4. doi: 10.2337/diacare.27.2007.S11

Crossref Full Text | Google Scholar

16. Wang C, Zhang X, Li C, Li N, Jia X, Zhao H. Construction and validation of a model for predicting impaired fasting glucose based on more than 4000 general population. Int J Gen Med. (2023) 16:1415–28. doi: 10.2147/IJGM.S409426

PubMed Abstract | Crossref Full Text | Google Scholar

17. Chen Y, Zhang XP, Yuan J, Cai B, Wang XL, Wu XL, et al. Association of body mass index and age with incident diabetes in Chinese adults: a population-based cohort study. BMJ Open. (2018) 8:e021768. doi: 10.1136/bmjopen-2018-021768

PubMed Abstract | Crossref Full Text | Google Scholar

18. Clinical guidelines for prevention and treatment of type 2 diabetes mellitus in the elderly in china (2022 edition). Zhonghua Nei Ke Za Zhi. (2022) 61: 12–50. doi: 10.3760/cma.j.cn112138-20211027-00751

PubMed Abstract | Crossref Full Text | Google Scholar

19. Kim JH. Multicollinearity and misleading statistical results. Korean J Anesthesiol. (2019) 72:558–69. doi: 10.4097/kja.19087

PubMed Abstract | Crossref Full Text | Google Scholar

20. Normand ST, Landrum MB, Guadagnoli E, Ayanian JZ, Ryan TJ, Cleary PD, et al. Validating recommendations for coronary angiography following acute myocardial infarction in the elderly: a matched analysis using propensity scores. J Clin Epidemiol. (2001) 54:387–98. doi: 10.1016/S0895-4356(00)00321-8

Crossref Full Text | Google Scholar

21. Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J Stat Softw. (2010) 33:1–22. doi: 10.18637/jss.v033.i01

PubMed Abstract | Crossref Full Text | Google Scholar

22. Kidd AC, McGettrick M, Tsim S, Halligan DL, Bylesjo M, Blyth KG. Survival prediction in mesothelioma using a scalable lasso regression model: instructions for use and initial performance using clinical predictors. BMJ Open Respir Res. (2018) 5:e000240. doi: 10.1136/bmjresp-2017-000240

PubMed Abstract | Crossref Full Text | Google Scholar

23. Chen L, Liu C, Ye Z, Huang S, Liang T, Li H, et al. Predicting surgical site infection risk after spinal tuberculosis surgery: development and validation of a nomogram. Surg Infect. (2022) 23:564–75. doi: 10.1089/sur.2022.042

PubMed Abstract | Crossref Full Text | Google Scholar

24. Geva M, Shlomai G, Berkovich A, Maor E, Leibowitz A, Tenenbaum A, et al. The association between fasting plasma glucose and glycated hemoglobin in the prediabetes range and future development of hypertension. Cardiovasc Diabetol. (2019) 18:53. doi: 10.1186/s12933-019-0859-4

PubMed Abstract | Crossref Full Text | Google Scholar

25. Wang Y, Wang L, Su Y, Zhong L, Peng B. Prediction model for the onset risk of impaired fasting glucose: a 10-year longitudinal retrospective cohort health check-up study. BMC Endocr Disord. (2021) 21:211. doi: 10.1186/s12902-021-00878-4

PubMed Abstract | Crossref Full Text | Google Scholar

26. Cui Q, Pu J, Li W, Zheng Y, Lin J, Liu L, et al. Study on risk factors of impaired fasting glucose and development of a prediction model based on extreme gradient boosting algorithm. Front Endocrinol. (2024) 15:1368225. doi: 10.3389/fendo.2024.1368225

PubMed Abstract | Crossref Full Text | Google Scholar

27. Byeon H. Exploring the risk factors of impaired fasting glucose in middle-aged population living in South Korean communities by using categorical boosting machine. Front Endocrinol. (2022) 13:1013162. doi: 10.3389/fendo.2022.1013162

PubMed Abstract | Crossref Full Text | Google Scholar

28. Zhao Q, Zhen Q, Li Y, Lv R, Zhang K, Qiao Y, et al. Prevalence and risk factors of impaired fasting glucose among adults in northeast China: a cross-sectional study. Endocr Pract. (2018) 24:677–83. doi: 10.4158/EP-2018-0046

PubMed Abstract | Crossref Full Text | Google Scholar

29. Zhang FL, Xing YQ, Guo ZN, Wu YH, Liu HY, Yang Y. Prevalence and risk factors for diabetes and impaired fasting glucose in northeast China: results from the 2016 China national stroke screening survey. Diabetes Res Clin Pract. (2018) 144:302–13. doi: 10.1016/j.diabres.2018.09.005

PubMed Abstract | Crossref Full Text | Google Scholar

30. Chen S, Li H, Huang C, Li Y, Cai J, Luo T, et al. Study on the relationship between KCNQ1 gene-environment interaction and abnormal glucose metabolism in the elderly in a county of Hechi City, Guangxi. Br J Nutr. (2024) 132:979–87. doi: 10.1017/S0007114524001284

PubMed Abstract | Crossref Full Text | Google Scholar

31. Cree LM, Patel SK, Pyle A, Lynn S, Turnbull DM, Chinnery PF, et al. Age-related decline in mitochondrial DNA copy number in isolated human pancreatic islets. Diabetologia. (2008) 51:1440–3. doi: 10.1007/s00125-008-1054-4

PubMed Abstract | Crossref Full Text | Google Scholar

32. Wang L, Li Y, Li R, Luan J, Cao K, Liu T, et al. Diverse associations between pancreatic intra-, inter-lobular fat and the development of type 2 diabetes in overweight or obese patients. Front Nutr. (2024) 11:1421032. doi: 10.3389/fnut.2024.1421032

PubMed Abstract | Crossref Full Text | Google Scholar

33. Sasaki N, Ozono R, Higashi Y, Maeda R, Kihara Y. Association of insulin resistance, plasma glucose level, and serum insulin level with hypertension in a population with different stages of impaired glucose metabolism. J Am Heart Assoc. (2020) 9:e015546. doi: 10.1161/JAHA.119.015546

PubMed Abstract | Crossref Full Text | Google Scholar

34. Phimphilai M, Pothacharoen P, Chattipakorn N, Kongtawelert P. Receptors of advanced glycation end product (rage) suppression associated with a preserved osteogenic differentiation in patients with prediabetes. Front Endocrinol. (2022) 13:799872. doi: 10.3389/fendo.2022.799872

PubMed Abstract | Crossref Full Text | Google Scholar

35. Jannat APN, Zabihi-Mahmoudabadi H, Ebrahimi R, Yekaninejad MS, Hashemnia S, Meshkani R, et al. Principal component analysis of adipose tissue gene expression of lipogenic and adipogenic factors in obesity. BMC Endocr Disord. (2023) 23:94. doi: 10.1186/s12902-023-01347-w

PubMed Abstract | Crossref Full Text | Google Scholar

36. Ren L, Xuan L, Li A, Yang Y, Zhang W, Zhang J, et al. Gamma-aminobutyric acid supplementation improves olanzapine-induced insulin resistance by inhibiting macrophage infiltration in mice subcutaneous adipose tissue. Diabetes Obes Metab. (2024) 26:2695–705. doi: 10.1111/dom.15585

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: impaired fasting glucose, nomogram, risk score, prediction performance, LASSO

Citation: Lin Y, Wu W, Liang X, Zhou L, Li G, Kang C, Li W, Huang C and Tian F (2025) A study on predicting impaired fasting glucose risk in Chinese adults based on individual characteristics. Front. Med. 12:1584626. doi: 10.3389/fmed.2025.1584626

Received: 27 February 2025; Accepted: 20 May 2025;
Published: 09 June 2025.

Edited by:

I-Shiang Tzeng, National Taipei University, Taiwan

Reviewed by:

Godfrey Mutashambara Rwegerera, University of Botswana, Botswana
Yu-Han Liu, Taiwan Paramedicine Service, Taiwan

Copyright © 2025 Lin, Wu, Liang, Zhou, Li, Kang, Li, Huang and Tian. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Chunyi Huang, bTE4MTI4NzYxODI4QDE2My5jb20=; Feng Tian, cXF0aW5hQHNtdS5lZHUuY24=

^†These authors share first authorship

^‡ORCID: Chunyi Huang orcid.org/0009-0005-7435-915X
Feng Tian orcid.org/0000-0002-3715-2281

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.