Machine learning-based prediction of hypoglycemia severity in hospitalized diabetic patients

Jia, Hongjian; Zhang, Jietao

doi:10.3389/fendo.2025.1634358

ORIGINAL RESEARCH article

Front. Endocrinol., 17 September 2025

Sec. Clinical Diabetes

Volume 16 - 2025 | https://doi.org/10.3389/fendo.2025.1634358

This article is part of the Research TopicAI in Healthcare: Transforming Clinical Risk Prediction, Medical Large Language Models, and BeyondView all 12 articles

Machine learning-based prediction of hypoglycemia severity in hospitalized diabetic patients

Hongjian Jia

Jietao Zhang^*

Department of General Practice, The Affiliated Hospital of Qingdao University, Qingdao, China

Objective: To identify risk factors for hypoglycemia in hospitalized patients with type 2 diabetes mellitus (T2DM) and develop predictive models for hypoglycemia severity based on machine learning algorithms.

Methods: Adult non-pregnant hospitalized patients diagnosed with T2DM were retrospectively enrolled from the electronic medical record system of the Affiliated Hospital of Qingdao University. Patients were categorized into hypoglycemia groups (mild, moderate-to-severe) or a non-hypoglycemia group based on inpatient venous plasma glucose levels. After data preprocessing, univariate and multivariate analyses were conducted to identify significant predictors. Three predictive models (XGBoost, Random Forest [RF], and Logistic Regression) were subsequently constructed and validated to evaluate their predictive performances.

Results: From an initial cohort of 8,947 patients, 1,798 patients were included after data screening. Among the evaluated models, the RF model demonstrated the highest predictive accuracy (93.3%) and Kappa coefficient (0.873), followed by XGBoost (accuracy: 92.6%, Kappa: 0.860). Logistic regression exhibited comparatively lower performance (accuracy: 83.8%, Kappa: 0.685). The macro-average area under the ROC curve (AUC) values for RF, XGBoost, and logistic regression were 0.960, 0.955, and 0.788, respectively, highlighting the superior discriminative capability of the RF model. While both XGBoost and RF models identified glycemic control metrics and glucose variability as core predictors for hypoglycemia, the RF model additionally emphasized medication usage, whereas XGBoost prioritized basal metabolic parameters.

Conclusions: The RF model outperformed XGBoost and conventional logistic regression in predicting hypoglycemia severity among hospitalized T2DM patients. The results emphasize the importance of closely monitoring glucose levels and glucose variability during diabetes management to prevent hypoglycemia. The developed model provides a foundation for implementing preventive strategies to reduce hypoglycemia occurrence in hospitalized patients with T2DM.

Introduction

Diabetes mellitus (DM) is a major chronic disease worldwide, with prevalence rates steadily rising over recent decades (1). Diabetes and its associated complications pose significant threats to patient health and quality of life, greatly impacting patients’ daily activities and potentially leading to mortality in severe cases. Among these complications, hypoglycemia is particularly critical to prevent and manage due to its frequent occurrence and substantial health risks for diabetic patients (2). Hypoglycemic events in diabetic patients are multifactorial, influenced by medications, diet, lifestyle, and comorbidities (3–5). Recently, stringent glycemic control strategies have been associated with an increased risk of hypoglycemia. However, hypoglycemia can undermine the long-term benefits gained from good glycemic management, underscoring the importance of carefully balancing the benefits and risks of intensive glucose management.

Hospitals play a central role in glycemic control, adjustments of antidiabetic medications, and individualized care for diabetic patients, placing heightened responsibilities on healthcare professionals. Patients with diabetes commonly exhibit multiple comorbidities that contribute independently to hypoglycemia risk. Studies have identified older age, renal impairment, liver dysfunction, poor nutritional status, inappropriate medication use, and debilitating diseases as critical risk factors for hypoglycemia. Clinical practitioners thus must analyze these risk factors systematically to accurately assess the probability of hypoglycemia and implement effective preventive measures.

Predicting hypoglycemia accurately, however, remains challenging due to the complex interplay of various clinical and biological factors. Although multiple studies have confirmed insulin therapy (6), impaired renal function, and suboptimal glycemic control as significant predictors of hypoglycemia, reliably forecasting such events remains difficult. Traditional logistic regression models have been widely employed to identify risk factors but are limited by their assumption of linear relationships between predictors and outcomes.

Given the limitations of traditional statistical methods in capturing complex clinical relationships, this study was motivated to explore advanced machine learning algorithms for predicting hypoglycemia severity in hospitalized T2DM patients. Thus, the current study aimed to develop and systematically compare three predictive models—multinomial logistic regression, XGBoost, and RF—to identify independent risk factors and predict hypoglycemia severity. We further conducted a feature importance analysis to highlight clinically significant predictors of hypoglycemia risk. This comparative approach is expected to improve early clinical intervention and personalized management. The paper is structured as follows: the methods section describes the study population, data collection, and model development; the results section presents patient characteristics and model performances; the discussion elaborates clinical implications and comparative insights from the modeling results; finally, limitations and future research directions are provided.

Literature review

Hypoglycemia remains a significant challenge in managing hospitalized patients with type 2 diabetes mellitus (T2DM), often resulting from complex interactions among clinical factors such as medication usage, renal function, and glycemic variability. Traditionally, logistic regression models have been widely employed to identify predictors of hypoglycemia risk due to their interpretability and simplicity. However, these models assume linear relationships and may not effectively capture intricate interactions among clinical variables, limiting their predictive capabilities in clinical practice.

With advancements in machine learning techniques, models such as Extreme Gradient Boosting (XGBoost) and Random Forest (RF) have shown remarkable performance in predicting complex clinical outcomes due to their ability to capture nonlinear interactions and handle high-dimensional data. Several previous studies have explored machine learning approaches to predict hypoglycemia in diabetic patients. For instance, Melih Agraz et al. used multi-view collaborative training of machine learning models on imbalanced datasets to improve hypoglycemia prediction (7). Harald Witte et al. used XGBoost alone to train models related to glycemic decompensation (8). Christopher Duckworth et al. also used XGBoost alone to predict glycemic decompensation (hyperglycemia and hypoglycemia) (9). Mai Shi et al. developed a multidimensional ML model based on electronic health records (EHR) to predict hypoglycemia in the elderly population (10). However, most of these studies have focused solely on hypoglycemia or hyperglycemia, lacking analysis of the severity of hypoglycemia. Additionally, few studies have used real hospital patient data to comprehensively compare multiple models. This study addresses this shortcoming by evaluating logistic regression, random forest, and XGBoost models to perform multi-class classification of hypoglycemia severity using clinical variables.

Methods

Study design and population

This retrospective study was conducted using data from the electronic medical record system of the Affiliated Hospital of Qingdao University. Initially, 8,947 adult, non-pregnant hospitalized patients with a confirmed diagnosis of T2DM were identified. After applying inclusion and exclusion criteria, a total of 1,798 patients were included in the final analysis.

Data collection

Clinical and demographic data were extracted from electronic medical records, including demographic characteristics, clinical features, laboratory findings, and antidiabetic medication use. Variables collected were age, gender, body mass index (BMI) classification, Charlson Comorbidity Index (CCI), glycated hemoglobin (HbA1c), mean blood glucose levels, serum creatinine, C-peptide, lipid profile, and use of antidiabetic medications (e.g., insulin, metformin, DPP - 4 inhibitors).

Patient grouping

Patients were categorized into three groups based on venous plasma glucose levels measured during hospitalization:

● Normal glycemia: >3.9 mmol/L

● Mild hypoglycemia: 3.0 – 3.9 mmol/L

● Moderate-to-severe hypoglycemia: <3.0 mmol/L

Statistical analysis

Univariate analyses were conducted using the chi-square test for categorical variables and the Kruskal-Wallis test for continuous variables. Variables with a P-value <0.05 were entered into a multivariate multinomial logistic regression analysis. Model performance was evaluated using overall accuracy, Kappa statistic, the area under the receiver operating characteristic (ROC) curve (AUC), and confusion matrices. All statistical analyses were performed using R software and SPSS.

Machine learning model development

Three predictive models were developed and evaluated: multinomial logistic regression, XGBoost, and Random Forest. Hyperparameters for the XGBoost and RF models were optimized using cross-validation. Multiclass ROC curves were constructed using a One-vs-Rest approach to visualize and compare the models’ predictive capabilities.

No independent testing set was separated in this study. Model development and evaluation were entirely conducted using 5-fold cross-validation on the full dataset. During training, hyperparameters of XGBoost and Random Forest were optimized within each fold using grid search. For XGBoost, the number of boosting rounds (nrounds), tree depth (max_depth), and learning rate (eta) were tuned. For Random Forest, the number of trees (ntree) and the number of variables randomly selected at each split (mtry) were adjusted. The best parameter set in each case was selected based on the highest average accuracy and Kappa coefficient across the cross-validation folds. Final performance metrics, including accuracy, Kappa coefficient, and area under the ROC curve (AUC), represent the average results across the cross-validation folds. This strategy ensured internal validation while minimizing overfitting and allowing fair comparison across models.

Model performance evaluation

Model performance evaluation was based on three key metrics: overall accuracy, Cohen’s Kappa coefficient, and area under the receiver operating characteristic curve (AUC). These metrics were calculated for each fold and averaged to obtain the final performance results. Additionally, multiclass ROC curves were generated using a One-vs-Rest approach to evaluate model discrimination across the three hypoglycemia severity classes.

Results

Patient characteristics

A total of 1,798 hospitalized diabetic patients were included in the final analysis. Based on inpatient blood glucose measurements, patients were divided into three groups: normoglycemic, mild hypoglycemia, and moderate-to-severe hypoglycemia. Significant differences were observed among the three groups in age, Charlson Comorbidity Index (CCI), the number of glucose-lowering medication classes, triglycerides (TG), serum creatinine, glycated hemoglobin (HbA1c), C-peptide, mean glucose levels, cholesterol, gender distribution, BMI classification, and use of various glucose-lowering medications (including SGLT2 inhibitors, α-glucosidase inhibitors, metformin, thiazolidinediones, and insulin) (all P < 0.05). Baseline characteristics of the study population are detailed in Table 1.

Table 1

Table 1. Baseline characteristics of patients by hypoglycemia severity group.

To visualize the distributional characteristics of numeric clinical variables across hypoglycemia severity groups, boxplots were generated (Figure 1). These plots provide an intuitive depiction of intergroup variability and outliers, complementing the statistical summary presented in Table 1. Categorical variables were not included in this visualization due to the nature of boxplots being suited for continuous variables only.

Figure 1

Box plots displaying values for Age, C-Peptide, Charlson, Cholesterol, Creatinine, Glucose Average (GlucoseAvg), HbA1c, and Triglycerides (TG) across Normal, Mild, and Moderate-to-severe blood sugar groups. Each plot shows variations in data distribution, with a legend indicating group colors: Normal in green, Mild in orange, and Moderate-to-severe in blue.

Figure 1. Boxplots of eight clinical variables across hypoglycemia severity groups (Normal, Mild, and Moderate-to-severe).

Univariate and Multivariate Logistic Regression Analyses

In the univariate logistic regression analysis (Table 2), gender, Charlson comorbidity index, number of glucose-lowering medication classes, BMI classification, triglycerides (TG), serum creatinine, HbA1c, C-peptide, mean glucose levels, cholesterol, and use of specific hypoglycemic drugs (metformin, α-glucosidase inhibitors, insulin) were significantly associated with the risk of hypoglycemia (P < 0.05). Specifically, an increased Charlson index, lower BMI, reduced C-peptide levels, decreased TG, elevated HbA1c, elevated creatinine levels, and insulin use were linked to a higher risk of hypoglycemia.

Table 2

Table 2. Univariate multinomial logistic regression analysis for predictors of hypoglycemia severity.

In the multivariate multinomial logistic regression analysis (Table 3), independent factors associated with mild hypoglycemia included older age (OR = 0.966, 95% CI: 0.951 – 0.981, P < 0.001), higher Charlson comorbidity index (OR = 2.684, 95% CI: 2.121 – 3.398, P < 0.001), increased creatinine (OR = 1.005, 95% CI: 1.003 – 1.007, P < 0.001), elevated HbA1c (OR = 10.570, 95% CI: 7.554 – 14.789, P < 0.001), lower mean glucose levels (OR = 0.545, 95% CI: 0.479 – 0.619, P < 0.001), and insulin use (OR = 0.205, 95% CI: 0.127 – 0.332, P < 0.001).

Table 3

Table 3. Multivariate multinomial logistic regression results for mild and moderate-to-severe hypoglycemia.

For moderate-to-severe hypoglycemia, independent predictors included older age (OR = 0.973, 95% CI: 0.954 – 0.993, P = 0.007), elevated Charlson comorbidity index (OR = 2.744, 95% CI: 2.141 – 3.517, P < 0.001), increased serum creatinine (OR = 1.005, 95% CI: 1.003 – 1.007, P < 0.001), elevated HbA1c (OR = 10.047, 95% CI: 7.127 – 14.163, P < 0.001), lower mean glucose levels (OR = 0.633, 95% CI: 0.554 – 0.723, P < 0.001), and insulin use (OR = 0.170, 95% CI: 0.086 – 0.336, P < 0.001). Additionally, overweight status was a protective factor against moderate-to-severe hypoglycemia (OR = 0.511, 95% CI: 0.302 – 0.865, P = 0.012), while underweight status significantly increased the risk (OR = 3.654, 95% CI: 1.045 – 12.781, P = 0.043).

Overall, the multivariate logistic regression model demonstrated good fit (χ² = 1585.430, df = 40, P < 0.001; Cox-Snell R² = 0.586; Nagelkerke R² = 0.703; McFadden R² = 0.492).

Machine learning model evaluation

The predictive performances of multinomial logistic regression, XGBoost, and random forest (RF) models were evaluated using receiver operating characteristic (ROC) curves constructed through a One-vs-Rest approach. ROC curves for each model are illustrated individually in Figures 2–4, and a direct comparative analysis among the three models is presented in Figure 5.

Figure 2

Receiver Operating Characteristic (ROC) curve comparing three conditions: normal (blue line, AUC 0.938), mild hypoglycemia (orange line, AUC 0.889), and moderate-to-severe hypoglycemia (red line, AUC 0.754). The curve plots sensitivity against specificity, with a diagonal reference line.

Figure 2. One-vs-rest ROC curves for multinomial logistic regression model.

Figure 3

ROC curve comparing three conditions: normal (blue), mild hypoglycemia (orange), and moderate-to-severe (red). The curves are plotted with specificity on the x-axis and sensitivity on the y-axis. Area under the curve (AUC) scores are 0.991 for normal, 0.980 for mild hypoglycemia, and 0.943 for moderate-to-severe.

Figure 3. One-vs-rest ROC curves for the XGBoost model.

Figure 4

ROC curve graph showing sensitivity versus specificity for three conditions: Normal (blue line, AUC: 0.991), Mild hypoglycemia (orange line, AUC: 0.983), and Moderate-to-severe (red line, AUC: 0.952). The graph includes a diagonal reference line.

Figure 4. One-vs-rest ROC curves for random forest model.

Figure 5

ROC curves compare model performance, plotting sensitivity versus specificity for logistic regression (Normal, Mild, Moderate-to-severe), XGBoost (Normal, Mild, Moderate-to-severe), and Random Forest (Normal, Mild, Moderate). The diagonal line represents random chance.

Figure 5. Comparison of ROC curves across logistic regression, XGBoost, and random forest models.

The multinomial logistic regression model yielded area under the curve (AUC) values of 0.938 for predicting normoglycemia, 0.889 for mild hypoglycemia, and 0.754 for moderate-to-severe hypoglycemia. The XGBoost model demonstrated notably higher AUCs of 0.991, 0.980, and 0.943, respectively. Similarly, the RF model achieved AUCs of 0.991, 0.983, and 0.952 for the three respective severity categories.

Comparative ROC analysis (Figure 5) clearly indicated superior discriminative performance of both XGBoost and RF models across all hypoglycemia severity categories compared to logistic regression.

Model performance comparison

To comprehensively evaluate predictive performances, we compared the overall accuracy, Kappa coefficient, and macro-average AUC of the three models (Table 4). The RF model achieved the highest accuracy (93.3%) and Kappa coefficient (0.873), followed closely by XGBoost (accuracy: 92.6%, Kappa: 0.860). Logistic regression demonstrated relatively lower performance, with an accuracy of 83.8% and Kappa of 0.685. Consistently, the macro-average AUC values for RF, XGBoost, and logistic regression were 0.960, 0.955, and 0.788, respectively, further emphasizing the superior predictive capability of the tree-based machine learning algorithms.

Table 4

Table 4. Performance comparison of three models: Accuracy, AUC, and Kappa.

To further evaluate the consistency and robustness of model performance, we applied 5-fold cross-validation and visualized the distribution of prediction accuracy and Kappa coefficient for each model (Figure 6). The Random Forest model consistently demonstrated the highest predictive performance (mean accuracy ≈ 93.2%, Kappa ≈ 0.872), followed closely by XGBoost (accuracy ≈ 92.6%, Kappa ≈ 0.860). Logistic regression showed comparatively lower performance (accuracy ≈ 84.0%, Kappa ≈ 0.682). These values were consistent with the results in Table 4, confirming the superiority of tree-based models in this context.

Figure 6

Box plots comparing model performance for Logistic, RF, and XGBoost. The left plot shows accuracy scores; Logistic has the lowest, XGBoost the highest. The right plot shows Kappa scores, with similar trends. Models are color-coded: red for Logistic, green for RF, blue for XGBoost.

Figure 6. Boxplots comparing model performance (Accuracy and Kappa coefficient) across three classifiers using 5-fold cross-validation.

Note: The boxplot metrics are derived from 5-fold cross-validation, while Table 4 reports final model performance on evaluation sets.

Feature importance analysis

Feature importance analysis (Figure 7) identified mean blood glucose level and HbA1c as the two most critical predictors in both XGBoost and RF models. Subsequent important predictors in the XGBoost model included creatinine, C-peptide, and triglycerides (TG). In the RF model, following mean glucose and HbA1c, important predictors included C-peptide, insulin usage, and the Charlson comorbidity index.

Figure 7

Two bar charts display the top 10 feature importances for Random Forest and XGBoost models. The Random Forest chart shows GlucoseAvg and HbA1c as the most important features. The XGBoost chart also highlights GlucoseAvg and HbA1c at the top. Both charts rank features like C_Peptide, Insulin, and Creatinine among others. The Random Forest importance is measured by MeanDecreaseGini, while XGBoost uses Gain.

Figure 7. Top 10 feature importance.

Discussion

In the present study, we developed and systematically compared three predictive models—multinomial logistic regression, XGBoost, and random forest (RF)—to identify key predictors of hypoglycemia severity among hospitalized patients with diabetes. Our results demonstrated that both machine learning models (XGBoost and RF) exhibited superior discriminative performance compared to traditional logistic regression, especially for detecting moderate-to-severe hypoglycemia, this has some modeling studies with the same results (11). We utilized the One-vs-Rest strategy, a standard and effective approach for generating ROC curves in multiclass classification problems, allowing robust evaluation and comparison of each model’s predictive performance across hypoglycemia categories.

As detailed in Table 4, the random forest model demonstrated the best overall predictive performance, with the highest accuracy (93.3%), Kappa coefficient (0.873), and macro-average AUC (0.960). These values reflect excellent classification agreement and discrimination for hypoglycemia severity prediction. The XGBoost model showed slightly lower but still robust performance (accuracy 92.6%, Kappa 0.860, AUC 0.955), confirming its strength in handling complex nonlinear data. In contrast, the logistic regression model yielded lower accuracy (83.8%) and Kappa (0.685), along with a substantially lower AUC (0.788), possibly due to its linear assumptions and inability to capture complex feature interactions. These findings highlight the superior performance of ensemble learning models in identifying subtle patterns and interactions in clinical data, and support their potential value in hypoglycemia risk stratification.

The dataset used to construct our predictive models comprised common clinical and laboratory variables routinely available for hospitalized patients with type 2 diabetes mellitus (T2DM). The logistic regression model identified seven statistically significant predictors (P < 0.05), while both XGBoost and RF models highlighted the top 10 most influential variables. Notably, six predictors—age, Charlson Comorbidity Index (CCI), serum creatinine, HbA1c, mean glucose levels, and insulin use—emerged consistently as significant factors across all three models. A number of previous modeling studies and factor analysis studies have similarly given the same conclusions for one or more of these factors (12–14). While previous studies have individually confirmed the significance of one or several of these factors, our research uniquely integrated multiple models for comprehensive comparison, offering a broader perspective than single-model studies previously reported.

A key advantage of our study lies in its reliance on routinely collected clinical data. This facilitates rapid assessment of hypoglycemia risk without additional specialized testing, thus granting medical staff valuable time to implement preventive strategies and appropriate treatments.

Regarding predictive variables, HbA1c has consistently been identified as a crucial factor in hypoglycemia research. Prior studies have reported that both excessively high HbA1c levels (>9%) and overly stringent glycemic control (HbA1c <7%) are associated with increased hypoglycemia risk (15). Additionally, insulin use within these HbA1c ranges further exacerbates hypoglycemia risk. Both our univariate and multivariate analyses supported these findings, reinforcing that elevated HbA1c levels and insulin therapy are critical risk factors for hypoglycemia in diabetic patients.

In recent years, growing attention has focused on the association between the Charlson Comorbidity Index (CCI) and hypoglycemia. Most prior studies investigating comorbidities typically examined only one or a few components of the CCI, such as cardiovascular disease, renal impairment, or malignancy (16). These comorbid conditions have been associated with glucose instability and heightened risk of adverse glycemic events. Patients with multiple chronic diseases often experience altered drug metabolism, polypharmacy, and malnutrition, potentially interfering with glucose regulation and insulin sensitivity. Incorporating the CCI into all three predictive models in our study enhances its utility as a comprehensive clinical indicator. Unlike individual diagnoses, the CCI provides an aggregated measure of overall disease burden, capturing complex interactions between comorbidities and thereby improving the generalizability and interpretability of the predictive models. Integrating the CCI into predictive assessments may assist clinicians in accurately stratifying risk, particularly among elderly hospitalized patients or those with multiple chronic diseases, who inherently have higher susceptibility to hypoglycemic episodes (17). Early identification of high-CCI patients could facilitate personalized monitoring, medication adjustments, and tailored nutritional plans, thereby mitigating hypoglycemia risk.

Lastly, we conducted feature importance analyses for the top ten variables in the XGBoost and RF models. Both models consistently identified glycemic control and glucose variability as key predictors. Additionally, XGBoost emphasized basal metabolic parameters (e.g., creatinine and C-peptide), whereas the RF model placed greater emphasis on medication use. These findings highlight nuanced differences between machine learning approaches, underscoring their ability to provide targeted clinical insights into managing and preventing hypoglycemic events in hospitalized diabetic patients.

Conclusions

In this study, we developed and compared three prediction models - multinomial logistic regression, XGBoost, and random forest - to identify factors associated with hypoglycemia severity in hospitalized diabetic patients. Our results showed that both machine learning models outperformed the traditional logistic regression model in terms of predictive performance, with higher accuracy and AUC values in all hypoglycemia categories. This time, the three models were designed with all commonly used clinical indicators, without the need for special tests, to prevent hypoglycemia as early as possible in type 2 diabetic patients and to avoid the adverse consequences caused by hypoglycemia. In a comparison of the three models and the focus of the models, the six variables of age, Charlson comorbidity index, creatinine, glycosylated hemoglobin, mean blood glucose level, and insulin use were consistently identified as core predictors in almost all models (7, 18, 19), This further enhances their clinical relevance. Future studies should focus on the prospective validation and practical application of these models to assess their clinical utility and impact on patient safety.

Strengths and limitations

This study has several strengths. First, it comprehensively compared the performance of traditional and ensemble machine learning models in predicting hypoglycemia severity, highlighting the superior capability of tree-based algorithms. Second, it utilized a multiclass classification approach to stratify hypoglycemia into different severity levels, which has greater clinical relevance than binary classification. Third, model performance was rigorously evaluated using stratified 5-fold cross-validation, and visual comparisons were provided through ROC curves and boxplots, enhancing interpretability. Finally, the study was based on a large cohort of hospitalized patients with diabetes, offering real-world insights.

However, some limitations should be acknowledged. The data were derived from a single center, which may limit the generalizability of the findings. External validation using independent datasets from other institutions is needed. Additionally, the reliance on internal cross-validation without a separate test set may lead to optimistic performance estimates. Some potentially important predictors, such as postprandial glucose or nutritional interventions, were not available in the dataset. Moreover, the models have not yet been tested in real-time clinical settings, and their clinical utility remains to be validated.

Data availability statement

The datasets presented in this article are not readily available because The datasets generated and/or analyzed during the current study are not publicly available due to The Affiliated Hospital of Qingdao University but are available from the corresponding author on reasonable request. Requests to access the datasets should be directed to Hongjian Jia, amhqMjAwMDE4QDE2My5jb20=.

Ethics statement

The studies involving humans were approved by The Affiliated Hospital of Qingdao University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.

Author contributions

HJ: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. JZ: Supervision, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Acknowledgments

This manuscript was previously made available as a preprint on Research Square (DOI: https://doi.org/10.21203/rs.3.rs-6619286/v1) (20).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Ong KL, Stafford LK, Mclaughlin SA, et al. Global, regional, and national burden of diabetes from 1990 to 2021, with projections of prevalence to 2050: a systematic analysis for the Global Burden of Disease Study 2021. Lancet (London, England). (2023) 402(10397):203–34. doi: 10.1016/S0140-6736(23)01301-6

PubMed Abstract | Crossref Full Text | Google Scholar

2. Yunir E, Nugraha ARA, Rosana M, Kurniawan J, Iswati E, Sarumpaet A, et al. Risk factors of severe hypoglycemia among patients with type 2 diabetes mellitus in outpatient clinic of tertiary hospital in Indonesia. Sci Rep. (2023) 13(1):16259. doi: 10.1038/s41598-023-43459-2

PubMed Abstract | Crossref Full Text | Google Scholar

3. Bonaventura A, Montecucco F, and Dallegri F. Update on strategies limiting iatrogenic hypoglycemia. Endocr. Conn. (2015) 4(3):R37–45. doi: 10.1530/EC-15-0044

PubMed Abstract | Crossref Full Text | Google Scholar

4. Akirov A, Amitai O, Masri-Iraqi H, Diker-Cohen T, Shochat T, Eizenberg Y, et al. Predictors of hypoglycemia in hospitalized patients with diabetes mellitus. Internal Emergency Med. (2018) 13(3):343–50. doi: 10.1007/s11739-018-1787-0

PubMed Abstract | Crossref Full Text | Google Scholar

5. Borzì V, Frasson S, Gussoni G, Di Lillo M, Gerloni R, Augello G, et al. Risk factors for hypoglycemia in patients with type 2 diabetes, hospitalized in internal medicine wards: Findings from the FADOI-DIAMOND study. Diabetes Res Clin Pract. Available online at: https://www.diabetesresearchclinicalpractice.com/article/S0168-8227(16)00069-3/abstract.

PubMed Abstract | Google Scholar

6. Khunti K, Alsifri S, Aronson R, Cigrovski Berković M, Enters-Weijnen C, Forsén T, et al. Rates and predictors of hypoglycaemia in 27–585 people from 24 countries with insulin-treated type 1 and type 2 diabetes: the global HAT study. Diabetes Obes Metab. (2016) 18(9):907–15. doi: 10.1111/dom.12689

PubMed Abstract | Crossref Full Text | Google Scholar

7. Agraz M, Deng Y, Karniadakis GE, and Mantzoros CS. Enhancing severe hypoglycemia prediction in type 2 diabetes mellitus through multi-view co-training machine learning model for imbalanced dataset. Sci Rep. (2024) 14(1):22741. doi: 10.1038/s41598-024-69844-z

PubMed Abstract | Crossref Full Text | Google Scholar

8. Witte H, Nakas C, Bally L, and Leichtle AB. Machine learning prediction of hypoglycemia and hyperglycemia from electronic health records: algorithm development and validation. JMIR Form. Res. (2022) 6:e36176. doi: 10.2196/36176

PubMed Abstract | Crossref Full Text | Google Scholar

9. Duckworth C, Guy MJ, Kumaran A, O’Kane AA, Ayobi A, Chapman A, et al. Explainable machine learning for real-time hypoglycemia and hyperglycemia prediction and personalized control recommendations. J Diabetes Sci Technol. (2024) 18(1):113–23. doi: 10.1177/19322968221103561

PubMed Abstract | Crossref Full Text | Google Scholar

10. Shi M, Yang A, Lau ESH, Luk AOY, Ma RCW, Kong APS, et al. A novel electronic health record-based, machine-learning model to predict severe hypoglycemia leading to hospitalizations in older adults with diabetes: A territory-wide cohort and modeling study. PloS Med. (2024) 21:e1004369. doi: 10.1371/journal.pmed.1004369

PubMed Abstract | Crossref Full Text | Google Scholar

11. Niu W, Liu C, Wang S, Zhang R, Zhang X, Jia H, et al. Interdisciplinary nursing research. Available online at: https://journals.lww.com/inr/fulltext/2025/03000/machine_learning_based_prediction_model_for.4.aspx.

Google Scholar

12. Chen NC, Chen CL, and Shen FC. The risk factors of severe hypoglycemia in older patients with dementia and type 2 diabetes mellitus. J Pers. Med. (2022) 12:67. doi: 10.3390/jpm12010067

PubMed Abstract | Crossref Full Text | Google Scholar

13. González-Vidal T, Rivas-Otero D, Gutiérrez-Hurtado A, Alonso Felgueroso C, Martínez Tamés G, Lambert C, et al. Hypoglycemia in patients with type 2 diabetes mellitus during hospitalization: associated factors and prognostic value. Diabetol Metab Syndr. (2023) 15(1):249. doi: 10.1186/s13098-023-01212-9

PubMed Abstract | Crossref Full Text | Google Scholar

14. Akirov A, Amitai O, Masri-Iraqi H, Diker-Cohen T, Shochat T, Eizenberg Y, et al. Predictors of hypoglycemia in hospitalized patients with diabetes mellitus. Intern Emerg Med. (2018) 13(3):343–50. doi: 10.1007/s11739-018-1787-0

PubMed Abstract | Crossref Full Text | Google Scholar

15. Lipska KJ, Warton EM, Huang ES, Moffet HH, Inzucchi SE, Krumholz HM, et al. HbA1c and risk of severe hypoglycemia in type 2 diabetes. Diabetes Care. (2013) 36(15): 3535–42. doi: 10.2337/dc13-0610

PubMed Abstract | Crossref Full Text | Google Scholar

16. Pratiwi C, Mokoagow MI, Made Kshanti IA, and Soewondo P. The risk factors of inpatient hypoglycemia: A systematic review. Heliyon. (2020) 6(5):e03913. doi: 10.1016/j.heliyon.2020.e03913

PubMed Abstract | Crossref Full Text | Google Scholar

17. Al-Azayzih A, Kanaan RJ, Altawalbeh SM, Alzoubi KH, Kharaba Z, and Jarab A. Prevalence and predictors of hypoglycemia in older outpatients with type 2 diabetes mellitus. PloS One. (2024) 19(8):e0309618. doi: 10.1371/journal.pone.0309618

PubMed Abstract | Crossref Full Text | Google Scholar

18. Zhang RT, Liu Y, Sun C, Wu QY, Guo H, Wang GM, et al. Predicting hypoglycemia in elderly inpatients with type 2 diabetes: the ADOCHBIU model. Front Endocrinol. (2024) 15:1366184. doi: 10.3389/fendo.2024.1366184

PubMed Abstract | Crossref Full Text | Google Scholar

19. Ruan Y, Bellot A, Moysova Z, Tan GD, Lumb A, Davies J, et al. Predicting the risk of inpatient hypoglycemia with machine learning using electronic health records. Diabetes Care. (2020) 43(7):1504–11. doi: 10.2337/dc19-1743

PubMed Abstract | Crossref Full Text | Google Scholar

20. Jia H and Zhang J. Machine learning-based prediction of hypoglycemia severity in hospitalized diabetic patients. Res Square. (2025). doi: 10.21203/rs.3.rs-6619286/v1

Crossref Full Text | Google Scholar

Keywords: type 2 diabetes mellitus, hypoglycemia, risk prediction, machine learning, clinical analysis

Citation: Jia H and Zhang J (2025) Machine learning-based prediction of hypoglycemia severity in hospitalized diabetic patients. Front. Endocrinol. 16:1634358. doi: 10.3389/fendo.2025.1634358

Received: 24 May 2025; Accepted: 01 September 2025;
Published: 17 September 2025.

Edited by:

Hong Sun, Jiaxing University, China

Reviewed by:

Jayakumar Kaliappan, Vellore Institute of Technology (VIT), India
Alaa F. Sheta, Southern Connecticut State University, United States

Copyright © 2025 Jia and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jietao Zhang, cWRzemp0MTYzQDE2My5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.