Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Med., 31 October 2025

Sec. Family Medicine and Primary Care

Volume 12 - 2025 | https://doi.org/10.3389/fmed.2025.1638927

Construction and validation of a clinical prediction model for diabetic ketoacidosis


Chen Xu&#x;Chen XuXingwen Jiang&#x;Xingwen JiangQuanan He*Quanan He*Peng Xu*Peng Xu*
  • Department of Accident and Emergency, Yangtze River Shipping General Hospital, Wuhan, Hubei, China

Background: Diabetes ketoacidosis (DKA) is a common and serious acute complication of diabetes mellitus. Globally, its incidence is on the rise, posing a serious threat to the life, health and quality of life of diabetic patients. In current clinical practice, although a variety of indicators are used to determine DKA, these indicators often have a lag and cannot effectively predict the occurrence of DKA at an early stage, resulting in some patients missing the best time for treatment.

Objective: To investigate the risk factors for the development of DKA and establish a prediction model based on the information of type II diabetes mellitus.

Methods: A total of 288 patients were collected in this study out of which 74 patients developed DKA. The patients enrolled in this study were randomly divided into a training set and a validation set according to a ratio of 7:3, with 201 patients in the training set and 87 patients in the validation set. The patients’ past medical history, dietary habits and relevant information during hospitalization were collected separately to study the correlation factors affecting the emergence of DKA in patients and to establish a prediction model.

Results: Possibly relevant factors were included in a one-way logistic regression, and after analyzing the results: history of infection, dietary status, duration of diabetes mellitus longer than 3 years, history of smoking, history of alcohol consumption, abnormalities in liver function, abnormalities in HbA1c, and hypokalaemia were potential risk factors for the development of DKA, P < 0.2; The data obtained were further included in a multifactorial review: history of infection, dietary status (intemperate diet), duration of diabetes mellitus more than 3 years, HbA1c abnormality, and hypokalaemia were predictive factors for DKA (P < 0.05).

Conclusion: This model provides a predictive tool for clinicians to identify high-risk patients with DKA at an early stage, which can help to take targeted preventive and intervention measures before the onset of the disease. However, the model was developed and internally validated using hospital-based data and that external validation is required before wider clinical application.

1 Introduction

1.1 Background and epidemiology of diabetes mellitus

Diabetes mellitus, a global chronic disease, is showing a high prevalence and high growth worldwide. According to the International Diabetes Federation, the number of people with diabetes worldwide reached 537 million in 2021 and is expected to increase to 783 million by 2045 (13).

1.2 Clinical significance of diabetic ketoacidosis

As one of the most serious acute complications of diabetes mellitus, diabetic ketoacidosis (DKA) is not to be underestimated (35). Domestically, studies have shown that the incidence of DKA among diabetic inpatients is about 10%–15%, while the incidence reported abroad varies from region to region and is about 4%–16% (68). DKA has a rapid onset and rapid progression, and once treatment is delayed, patients are very likely to fall into a coma or even face the risk of death, which undoubtedly significantly increases the chances of death of diabetic patients and the cost of medical care (912).

1.3 Problem statement of this study

Currently, scholars at home and abroad have conducted a lot of research on the pathogenesis, diagnosis and treatment of DKA. However, due to the complexity of the pathogenesis of DKA, which involves the intertwining of many factors, early diagnosis and prevention have always faced many difficulties, and most of the existing diagnostic methods rely on the detection of patients after the appearance of obvious symptoms, which makes it difficult to make an effective prediction of the disease in the early stage of its occurrence (11, 1315). Although some foreign studies are committed to constructing prediction models, there is still room for improvement in the accuracy and universality of the models; domestic research in this field started relatively late, and further in-depth exploration is needed in the construction and validation of the models.

1.4 Research objectives

Therefore, it is urgent to construct an accurate and effective clinical prediction model for DKA. The aim of this study was to construct an accurate clinical prediction model for DKA by exploring in depth the risk factors associated with DKA and to validate the model rigorously. The construction of an accurate and effective clinical prediction model for DKA is of great significance for clinical prevention and treatment. On the one hand, it can help clinicians identify patients with high risk of DKA at an early stage, so that timely interventions can be taken to reduce the risk of DKA in diabetic patients; on the other hand, by intervening in advance, the prognosis of patients can be effectively improved, and the pain and medical burden of patients can be reduced, which can provide a powerful support for the health management of diabetic patients.

2 Materials and methods

2.1 Research object

This study was conducted on 288 patients with type II diabetes mellitus, 170 males and 108 females, who were admitted to our department from January 2022 to December 2023 for treatment. Ethics approval and consent to participate: We obtained a waiver of informed consent from the Ethics Committee. The Ethics Committee of Yangtze River Shipping General Hospital approved the study (Ethics approval number: WY-2023-032-12). This study adhered to the Declaration of Helsinki.

2.2 Research methods

This study was a retrospective case-control study. A total of 288 patients were collected, of which 74 patients developed DKA. Past medical history, dietary habits and information about the patients during hospitalization were collected separately to study the correlating factors affecting the development of DKA in the patients.

2.3 Collecting indicators

Gender, age, history of infection, dietary status, hypertension, hyperlipidaemia, duration of diabetes mellitus, history of smoking, history of alcohol consumption, abnormalities in liver function, abnormalities in renal function, abnormalities in HbA1c, hyponatraemia, and incidence of hypokalaemia were collected from the patients in this study.

2.4 Sampling strategy and sample size calculation

2.4.1 Sampling strategy

In this study, we implemented a meticulously structured retrospective sampling approach to pinpoint patients with type 2 diabetes mellitus who were hospitalized between January 2022 and December 2023, utilizing the hospital’s electronic medical record system (Donghua HIS). We leveraged ICD-10 codes (particularly E11.6, which specifically indicates type 2 diabetes with ketoacidosis) in conjunction with clinical diagnostic criteria to effectively differentiate a case group, consisting of patients diagnosed with diabetic ketoacidosis (DKA), from a control group, made up of type 2 diabetes patients without a history of DKA. These two groups were then carefully matched in a 1:3 ratio.

2.4.2 Sample size calculation

For the sample size calculation, we utilized the variable multiplication approach, which involves multiplying the number of predictor variables (in this case, 14) by a factor ranging from 10 to 20. This method yielded a minimum required sample size spanning from 140 to 280 cases. Ultimately, we included a total of 288 cases, thereby comfortably meeting the event count prerequisite essential for conducting a robust multiple regression analysis.

2.5 Inclusion and exclusion criteria

2.5.1 Inclusion criteria

(i) Meets the diagnostic criteria for Clinical guidelines for prevention and treatment of type 2 diabetes mellitus in the elderly in China (2022 edition) (16)

(ii) Duration of the disease ≥ 1 year;

(iii) Age ≥ 18 years;

(iv) Patients in the study group presented with polyuria, irritable thirst and excessive drinking with nausea, vomiting, deep and fast breathing, and rotten-apple flavor in the mouth, as well as the appearance of a positive ketone body test in the blood with a pH < 7.35;

(v) Complete and faultless clinical data.

2.5.2 Exclusion criteria

(i) Presence of history of surgery in the abdomen, gastrointestinal tumors and regular use of laxatives;

(ii) Those who have suffered from acute or chronic gastroenteritis or other malignant conditions;

(iii) Those who suffer from serious diseases of other systems, such as circulatory and urinary systems, or mental abnormalities;

(iv) Those currently taking anticholinergics, antispasmodics, and antidepressants;

(v) Those who are receiving other treatments or participating in other clinical studies.

2.6 Interpretation of some of the validation indicators mentioned in this study

(i) Infection history: clinical symptoms of local tissue or organ infection such as infection and fever were confirmed after the patient was admitted to the hospital.

(ii) Dietary status: including diabetic diet and unrestrained diet, mainly used to analyze patients’ dietary status and living status before hospitalization.

Diabetes diet: A standardized dietary regimen adhering to scientific principles for blood glucose control, emphasizing low-glycemic-index foods, quantified carbohydrate intake, and balanced combinations of high-fiber vegetables and lean protein. It strictly restricts added sugars and unhealthy fats while maintaining regular meal timing. The goal is to stabilize blood glucose variability, prevent acute complications (e.g., diabetic ketoacidosis), and fulfill individual nutritional requirements. Adjustments should be made dynamically based on the patient’s blood glucose levels, weight, and physical activity.

Unrestricted diet: This refers to eating habits that violate diabetes management goals. Patients may frequently consume high-sugar sweets, high-fat fried foods, and refined carbohydrates, accompanied by irregular meals, binge eating, or drinking on an empty stomach. This can cause blood sugar levels to fluctuate dramatically or remain uncontrolled for long periods of time.

(iii) Abnormal liver function: refers to the existence of cirrhosis and hepatitis in the patient’s past.

(iv) Abnormal renal function: patients with nephritis, nephrotic syndrome and other conditions in the past.

2.7 Statistical methods

SPSS 25.0 was used for data processing and statistical analysis in this study. Quantitative data conforming to normal distribution were expressed as mean ± standard deviation, and the differences between groups were analyzed by independent sample t-test. Comparisons between groups that did not follow a normal distribution were performed using non-parametric tests. Data for qualitative data were expressed as number of cases and percentage, and chi-square tests were used to determine whether there were differences between groups. Firstly, the patients were divided into the training set and validation set in a 7:3 ratio by randomization, and then analyzed based on the onset of DKA and various clinically relevant indicators in the training set. Potential risk factors for DKA were subsequently identified based on one-way logistic regression analysis of the collected data. For the univariate analysis, exposure factors with P ≤ 0.2 were selected and included in the multivariate analysis (17, 18). An independent risk factor for DKA was derived with DKA, and P < 0.05 was considered a statistically significant difference. Subsequent internal validation of the model further confirmed the reliability of the predictive model derived in this study.

3 Results

3.1 Table of values assigned to the relevant indicators in this study

See Table 1 for details.

TABLE 1
www.frontiersin.org

Table 1. Table of values for relevant indicators.

3.2 Table of baseline characteristics of patients in both groups

This study randomly divided the patients included in this experimental study into a training set and a validation set in a ratio of 7:3. There were 201 patients in the training set and 87 patients in the validation set. The general information of the two groups of patients was included in the statistical study, P > 0.05. None of the differences in baseline characteristics between the two groups were statistically significant. See Table 2 for details.

TABLE 2
www.frontiersin.org

Table 2. Comparison of baseline features between training and validation sets.

3.3 One-way logistic regression analysis

The training set of 201 patients was included in the statistical analysis, of which 55 patients developed DKA. Possibly relevant factors were included in one-way logistic regression: history of infection, dietary status, duration of diabetes mellitus longer than 3 years, history of smoking (P < 0.096, OR = 1.70, 95% CI = 0.91–3.18), history of alcohol consumption (P < 0.001, OR = 3.30, 95% CI = 1.72–6.36), liver function abnormalities (P = 0.078, OR = 1.89, 95% CI = 0.93–3.82), HbA1c abnormalities, and hypokalaemia were potential risk factors for the development of DKA in patients with type II diabetes mellitus, P < 0.2. See Table 3 for details.

TABLE 3
www.frontiersin.org

Table 3. One-factor logistic regression analysis based on training set.

3.4 Multifactor logistic regression analysis

Incorporate risk factors identified through univariate analysis into multivariate analysis: history of infection (P < 0.001, OR = 5.67, 95% CI = 2.34–13.78), dietary status (intemperate diet, P < 0.01, OR = 14.66, 95% CI = 5.89–36.51), duration of diabetes mellitus for more than 3 years (P = 0.006, OR = 3.45, 95% CI = 1.43–8.28), HbA1c abnormality (P = 0.047, OR = 2.08, 95 CI = 0.90–4.80), and hypokalaemia (P = 0.032, OR = 3.20, 95 CI = 1.10∼9.28) were the factors showing predictive associations with DKA. See Table 4 for details.

TABLE 4
www.frontiersin.org

Table 4. Multifactor logistic regression analysis based on training set.

3.5 Plotting of nomograms

A nomogram of the risk of developing DKA in patients with type II diabetes mellitus was constructed based on five predictive factors tested by multifactorial logistic regression analysis, see Figure 1 for details. Assign Nomo score to each independent risk factor. Sum up the total score based on the patient’s clinical characteristics, then locate on the total points axis. The value on the vertically downward corresponding Risk axis is the probability of DKA. The score of each independent predictor corresponds to the upper limit of the score of each independent predictor. The total score for each subject was the sum of each independent predictor score. The probability of developing diabetic ketoacidosis was determined by the total score on the risk axis for developing DKA in patients with type II diabetes. The model was subsequently validated internally: Internal validation of the nomograms was carried out using the Bootstrap method in R software with 1,000 repetitive samples. The calibration curve is close to the ideal curve, indicating that the nomogram predicts the incidence of DKA in type II diabetic patients with a high degree of agreement with the actual incidence, reflecting a certain predictive performance, see Figure 2 for details. The nomogram has a ROC curve for the training set with an AUC of 0.888 (95% CI = 0.846–0.929) and a ROC curve for the validation set with an AUC of 0.853 (95% CI = 0.780–0.925), see Figure 3 for details. It shows that the nomogram is a good discriminator of type II diabetic patients at high risk of developing DKA. The decision curve (DCA) for this nomogram shows that the model provides more net benefits than the all intervene or none intervene strategies when the threshold probability of an individual is greater than 0.05 in this column. This finding suggests that the nomogram model has a certain clinical application in predicting the occurrence of DKA in patients with type II diabetes mellitus, see Figure 4 for details.

FIGURE 1
Nomogram displaying risk assessment based on factors such as history of infection, dietary status, duration of diabetes, HbA1c abnormality, and hypokalemia. Points correlate with risk on a scale of 0 to 350, with risk spanning 0.1 to 0.9.

Figure 1. Nomogram predicts risk of DKA in type II diabetes patients.

FIGURE 2
Calibration plot showing actual probability against predicted probability for a model predicting diabetic ketoacidosis (DKA). The plot contains three lines: dotted for apparent, solid for bias-corrected, and dashed for ideal predictions. The apparent and bias-corrected lines closely follow the ideal line, indicating good model calibration. Ticks on top denote data distribution. The plot notes B equals 1000 repetitions for bootstrapping and shows a mean absolute error of 0.018 with a sample size of 201.

Figure 2. Internal validation of nomograms: calibration curves.

FIGURE 3
ROC curve comparing two models, training (blue line) and validation (orange line). Sensitivity is on the vertical axis, and 1-Specificity on the horizontal axis. The training model has an AUC of 0.891, and the validation model has an AUC of 0.834. A dashed diagonal line represents random performance.

Figure 3. Validation of nomograms: ROC curves.

FIGURE 4
Line graph showing net benefit versus high risk threshold. The blue line represents the model, which decreases as the risk threshold increases from 0 to 1. The gray lines represent ‘All’ and ‘None,’ showing constant levels. The net benefit initially peaks and then declines, crossing the horizontal axis near 0.4 and fluctuating near zero beyond 0.6.

Figure 4. Decision curves in nomogram models.

4 Discussion

As a serious acute complication of diabetes mellitus, diabetic ketoacidosis (DKA) seriously affects patients’ health and quality of life, and has been a key area of concern in medical research at home and abroad. The construction of accurate and effective clinical prediction models for DKA is of great significance for early intervention and reduction of adverse outcomes. Some studies surface that patients’ age, type of diabetes, physiological and biochemical indicators such as blood glucose, blood ketone bodies, acid-base balance, electrolytes, clinical factors such as infections and insulin use, as well as lifestyle factors such as diet and stress, are closely related to DKA, and that genetic factors also have some influence (19).

4.1 An analysis of the results of clinical prediction models for DKA and the current situation at home and abroad

Two hundred and eighty-eight patients were included in this study, of whom 74 developed DKA. After univariate and multivariate analyses, a history of infection, dietary status without dietary moderation, duration of diabetes mellitus for more than 3 years, HbA1c abnormality, and hypokalaemia were determined to be the factors showing predictive associations with DKA (P < 0.05). Infection (P < 0.001, OR = 5.67, 95% CI = 2.34–13.78) is a common and important cause of DKA. An overseas study of adult DKA patients showed that about 40%–60% of DKA episodes were associated with infection (20, 21). Infection triggers a stress response in the body, prompting an increase in the secretion of catecholamines, cortisol and other glucagon, leading to a rapid rise in blood sugar; At the same time, metabolic disorders in the infected state accelerate lipolysis and increase free fatty acids, which in turn promotes ketogenesis and ultimately induces DKA (22, 23). In addition, other scholars have shown that respiratory and urinary tract infections are the most common types of infections in patients with DKA, and that poor infection control is strongly associated with recurrence of DKA (23, 24). Infection plays an important role in the pathogenesis of DKA, consistent with the results of the present study. Also, in terms of diet, there is growing interest in the association between binge eating and DKA. Overseas studies have found that irregular diets and diets high in sugar and fat can lead to large fluctuations in blood glucose, and long-term blood glucose instability can increase the risk of DKA. When patients consume too much high-sugar food, blood glucose rises rapidly, exceeding the regulatory capacity of pancreatic islet cells, insulin secretion is relatively or absolutely insufficient, lipolysis is intensified, and ketone bodies are produced in excess, thus triggering DKA (25, 26). There are also relevant reports in China that emphasize the importance of a reasonable diet for glycaemic control and prevention of DKA in diabetic patients, which further validates the rationality of dietary status (P < 0.01, OR = 14.66, 95% CI = 5.89–36.51) as an independent risk factor in this study (27). As the duration of diabetes increases, the patient’s insulin function declines. At the same time, studies have shown that patients with a disease duration of more than 3 years (P = 0.006, OR = 3.45, 95% CI = 1.43–8.28) have a significantly higher risk of developing DKA than those with a shorter disease duration (2830). Prolonged hyperglycaemia has a toxic effect on pancreatic islet cells, causing a gradual decrease in their ability to secrete insulin, making glycaemic control more difficult, and making DKA more likely to occur in the event of stress factors, such as infections and poor diet. In addition, HbA1c (P = 0.047, OR = 2.08, 95 CI = 0.90–4.80) reflects the average blood glucose level over the past 2–3 months and is an important indicator for assessing long-term glycaemic control. A number of studies at home and abroad have confirmed that elevated HbA1c levels are positively associated with the risk of DKA (31, 32). An abnormal HbA1c means that the patient’s long-term glycaemic control is poor and metabolic disorders persist, increasing the likelihood of DKA. Finally, this study also confirms that hypokalaemia (P = 0.032, OR = 3.20, 95 CI = 1.10∼9.28) cannot be ignored in the pathogenesis of DKA. Insulin deficiency and hyperglycaemia result in the transfer of extracellular fluid potassium ions to the cells, while polyuria causes potassium loss, ultimately resulting in hypokalaemia. Hypokalaemia, in turn, further affects the effect of insulin action, exacerbates metabolic disorders and induces DKA (3335).

4.2 Advantages of a clinical prediction model for DKA

The prediction model constructed in this study has unique advantages compared with the latest research literature at home and abroad. The AUC of the ROC curve for this nomogram was 0.888 (95% CI = 0.846–0.929). Some foreign studies have constructed models that incorporate more variables, but the models are complex and have limited clinical utility (36, 37). In contrast, the sample sizes of some domestic studies are small, and the stability and generalisability of the models are insufficient (38, 39). The predictive factors included in this study’s model were rigorously screened to ensure both model simplicity and good predictive efficacy. The probable reason for this is that the sample size of this study was moderate and covered diabetic patients with different characteristics, making the screening of risk factors more representative. In addition, the data were subjected to rigorous statistical analysis during the study to further optimize the model. In terms of intrinsic mechanisms, these predictive factors cover a number of key aspects such as infection, metabolism, diet, etc., which can comprehensively reflect the pathogenesis of DKA, thus improving the accuracy of model prediction.

4.3 Limitations

Although this study has achieved some results in the construction and validation of clinical prediction models for diabetic ketoacidosis (DKA), it inevitably has some limitations. Firstly, this is a single-center study with the sample originating from only one hospital. There are differences in genetic background, living habits, and medical conditions among populations in different regions, which may result in samples that are not fully representative of all diabetic populations, limiting the generalisability of the model when applied across different geographic regions. For example, some regions may have unique environmental factors or high prevalence of diseases that influence the pathogenesis of DKA, and a single-center sample that does not encompass these differences may reduce the predictive accuracy of the model in other regions. Secondly, this study only included common clinical factors such as history of infection, dietary status, and duration of diabetes, but did not consider the influence of emerging factors such as genes and gut flora on the development of DKA. Genetic polymorphisms may affect individual susceptibility to DKA, and different combinations of genes may result in different risks and courses of DKA when the organism is exposed to the same triggers. Gut flora are involved in the regulation of human metabolism, and their imbalance may interfere with glycaemic homeostasis and fat metabolism, which in turn may affect the development of DKA. By ignoring these factors, the model may omit key variables and fail to fully and accurately predict the risk of DKA onset. In addition, the study follow-up period was relatively short. The development of DKA may be dynamically affected by multiple factors such as long-term lifestyle and disease progression, which are difficult to capture in short-term follow-up and are not sufficient for the validation of long-term predictive efficacy of the model. In the long term, the patient’s physical condition, medication use, etc. may change, and the impact of these factors on the predictive accuracy of the model cannot be adequately assessed in a short period of time. Additionally, given that this study is retrospective in nature, despite the comprehensive variables collected and the fact that all variables were retrieved by researchers through the Donghua His medical record system, the study is inevitably subject to confounding bias and selection bias. Furthermore, retrospective studies are susceptible to reverse causality. This study primarily employed univariate and multivariate analyses to identify predictive factors for diabetic ketoacidosis (DKA), thereby constructing corresponding predictive models. And these methods represent the most fundamental approaches. Constrained by sample size limitations, we did not undertake more advanced statistical analyses such as propensity score weighting or competing risks models. The events-per-variable ratio may be low, raising the risk of overfitting. Although the AUC values for both the training and validation sets are relatively high, a certain discrepancy persists between them, making it impossible to completely rule out the possibility of overfitting. Additionally, calibration and decision curve results are mentioned qualitatively but not quantified. Furthermore, the predictive models lacked external validation sets to assess their generalisability, and non-linear methods such as Bayesian networks were not utilized to explore interactions between variables. Therefore, future studies should conduct prospective cohort studies with large sample sizes across multiple centers and perform Mendelian randomization studies using genetic variation as an instrumental variable to further exclude reverse causality.

4.4 Future direction and prospects

In the future, we plan to conduct a multi-center, large-sample study on the clinical prediction model of DKA, including diabetic patients from different regions and races, to comprehensively consider the effects of geographic and genetic differences on the onset of DKA, so as to improve the general applicability of the model. At the same time, the intrinsic links between genes, intestinal flora and other emerging factors and DKA will be explored in depth, and advanced gene detection technology and intestinal flora analysis will be used to clarify their specific roles in the pathogenesis of DKA, which will be incorporated into the prediction model to enhance the accuracy and comprehensiveness of the model. In addition, the follow-up time is greatly extended to closely track the long-term health status of patients, dynamically monitor the changes of various influencing factors, verify the long-term predictive effect of the model, and provide a more reliable predictive tool for the clinic, which will help in the early prevention and precise treatment of DKA.

5 Conclusion

Through the in-depth profiling in this study, we have successfully identified history of infection, dietary status without dietary moderation, duration of diabetes mellitus up to 3 years or more, HbA1c abnormality, and hypokalaemia as predictive factors for the development of DKA. Clinical prediction models constructed on the basis of these findings enable clinicians to identify patients at high risk of DKA early. In practice, doctors can use this model to develop targeted prevention and intervention strategies in advance, before the disease has even started. For example, focusing on strengthening the prevention and control of infection, paying close attention to the patient’s dietary status and giving reasonable dietary guidance, optimizing the blood glucose management plan according to the patient’s specific condition, and so on. Through these effective measures, it is expected to effectively reduce the incidence of DKA and the morbidity and mortality rate, and thus significantly improve the overall prognosis of diabetic patients, providing a more solid guarantee for the healthy life of patients.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Ethics Committee of Yangtze River Shipping General Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because this study retrospectively collected clinical data from patients and built models, rather than prospectively collecting clinical data from patients. Therefore, the ethics committee granted an exemption from informed consent.

Author contributions

CX: Writing – original draft. XJ: Writing – original draft. QH: Writing – review & editing. PX: Writing – review & editing.

Funding

The authors declare that no financial support was received for the research and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Zheng Y, Ley SH, Hu FB. Global aetiology and epidemiology of type 2 diabetes mellitus and its complications. Nat Rev Endocrinol. (2018) 14:88–98. doi: 10.1038/nrendo.2017.151

PubMed Abstract | Crossref Full Text | Google Scholar

2. Wong ND, Sattar N. Cardiovascular risk in diabetes mellitus: epidemiology, assessment and prevention. Nat Rev Cardiol. (2023) 20:685–95. doi: 10.1038/s41569-023-00877-z

PubMed Abstract | Crossref Full Text | Google Scholar

3. Ali MK, Pearson-Stuttard J, Selvin E, Gregg EW. Interpreting global trends in type 2 diabetes complications and mortality. Diabetologia. (2022) 65:3–13. doi: 10.1007/s00125-021-05585-2

PubMed Abstract | Crossref Full Text | Google Scholar

4. Dal Canto E, Ceriello A, Rydén L, Ferrini M, Hansen TB, Schnell O, et al. Diabetes as a cardiovascular risk factor: an overview of global trends of macro and micro vascular complications. Eur J Prev Cardiol. (2019) 26:25–32. doi: 10.1177/2047487319878371

PubMed Abstract | Crossref Full Text | Google Scholar

5. Ooi YG, Sarvanandan T, Hee NK, Lim QH, Paramasivam SS, Ratnasingam J, et al. Risk prediction and management of chronic kidney disease in people living with type 2 diabetes mellitus. Diabetes Metab J. (2024) 48:196–207. doi: 10.4093/dmj.2023.0244

PubMed Abstract | Crossref Full Text | Google Scholar

6. Chen J, Zeng H, Ouyang X, Zhu M, Huang Q, Yu W, et al. The incidence, risk factors, and long-term outcomes of acute kidney injury in hospitalized diabetic ketoacidosis patients. BMC Nephrol. (2020) 21:48. doi: 10.1186/s12882-020-1709-z

PubMed Abstract | Crossref Full Text | Google Scholar

7. Lu S, Wei D, Yin C, Xiong J, Zhu L, Yan S, et al. Correlation between PCT, 25(OH)D, PTX-3, AMS levels and the severity of diabetic ketoacidosis complicated by pancreatitis. BMC Endocr Disord. (2021) 21:136. doi: 10.1186/s12902-021-00792-9

PubMed Abstract | Crossref Full Text | Google Scholar

8. Hua Y, Ding N, Jing H, Xie Y, Wu H, Wu Y, et al. Association between SOFA score and risk of acute kidney injury in patients with diabetic ketoacidosis: an analysis of the MIMIC-IV database. Front Endocrinol. (2024) 15:1462330. doi: 10.3389/fendo.2024.1462330

PubMed Abstract | Crossref Full Text | Google Scholar

9. Fayfman M, Pasquel FJ, Umpierrez GE. Management of hyperglycemic crises: diabetic ketoacidosis and hyperglycemic hyperosmolar state. Med Clin North Am. (2017) 101:587–606. doi: 10.1016/j.mcna.2016.12.011

PubMed Abstract | Crossref Full Text | Google Scholar

10. Umpierrez G, Korytkowski M. Diabetic emergencies – ketoacidosis, hyperglycaemic hyperosmolar state and hypoglycaemia. Nat Rev Endocrinol. (2016) 12:222–32. doi: 10.1038/nrendo.2016.15

PubMed Abstract | Crossref Full Text | Google Scholar

11. Dhatariya KK, Vellanki P. Treatment of diabetic ketoacidosis (DKA)/hyperglycemic hyperosmolar state (HHS): novel advances in the management of hyperglycemic crises (UK Versus USA). Curr Diab Rep. (2017) 17:33. doi: 10.1007/s11892-017-0857-4

PubMed Abstract | Crossref Full Text | Google Scholar

12. Glaser N, Fritsch M, Priyambada L, Rewers A, Cherubini V, Estrada S, et al. ISPAD clinical practice consensus guidelines 2022: diabetic ketoacidosis and hyperglycemic hyperosmolar state. Pediatr Diabetes. (2022) 23:835–56. doi: 10.1111/pedi.13406

PubMed Abstract | Crossref Full Text | Google Scholar

13. Li Y, Liu B, Anand V, Dunne JL, Lundgren M, Ng K, et al. Predicting type 1 diabetes onset using novel survival analysis with biomarker ontology. AMIA Annu Symp Proc. (2020) 2020:727–36.

Google Scholar

14. Wolfsdorf JI, Glaser N, Agus M, Fritsch M, Hanas R, Rewers A, et al. ISPAD clinical practice consensus guidelines 2018: diabetic ketoacidosis and the hyperglycemic hyperosmolar state. Pediatr Diabetes. (2018) 19:155–77. doi: 10.1111/pedi.12701

PubMed Abstract | Crossref Full Text | Google Scholar

15. Stoner GD. Hyperosmolar hyperglycemic state. Am Fam Phys. (2017) 96:729–36.

Google Scholar

16. Chinese Elderly Type 2 Diabetes Prevention and Treatment of Clinical Guidelines Writing Group, Geriatric Endocrinology and Metabolism Branch of Chinese Geriatric Society, Geriatric Endocrinology and Metabolism Branch of Chinese Geriatric Health Care Society, Geriatric Professional Committee of Beijing Medical Award Foundation, National Clinical Medical Research Center for Geriatric Diseases (PLA General Hospital). [Clinical guidelines for prevention and treatment of type 2 diabetes mellitus in the elderly in China (2022 edition)]. Zhonghua Nei Ke Za Zhi. (2022) 61:12–50. doi: 10.3760/cma.j.cn112138-20211027-00751

PubMed Abstract | Crossref Full Text | Google Scholar

17. Wang C, Zou Y, Pan C, Shao L, Ding Z, Zhang Y, et al. Prognostic significance of chemokines CCL11 and CCL5 modulated by low-density lipoprotein cholesterol in colon cancer patients with normal body mass index. Ann Transl Med. (2021) 9:202. doi: 10.21037/atm-20-1604

PubMed Abstract | Crossref Full Text | Google Scholar

18. Zhang Y, Chen L, Chen H. A meta-analysis of the correlation between non-steroidal anti-inflammatory drugs and recurrent colorectal adenomatous polyps. Am J Transl Res. (2021) 13:2432–8.

Google Scholar

19. Liu Y, Mo W, Wang H, Shao Z, Zeng Y, Bi J. Feature selection and risk prediction for diabetic patients with ketoacidosis based on MIMIC-IV. Front Endocrinol. (2024) 15:1344277. doi: 10.3389/fendo.2024.1344277

PubMed Abstract | Crossref Full Text | Google Scholar

20. Guo L, Wang J, Li L, Yuan L, Chen S, Wang H, et al. A multicentre, prospective, non-interventional study evaluating the safety of dapagliflozin in patients with type 2 diabetes in routine clinical practice in China (DONATE). BMC Med. (2023) 21:212. doi: 10.1186/s12916-023-02906-7

PubMed Abstract | Crossref Full Text | Google Scholar

21. Gorchane A, Ach T, Sahli J, Hadj Salah NB, Ben Lasfar N, Ben Abdelkrim A, et al. Unmasking the escalation: a comparative analysis of diabetic ketoacidosis severity before and during the COVID-19 pandemic in a Tunisian pioneer study. Expert Rev Endocrinol Metab. (2024) 19:393–8. doi: 10.1080/17446651.2024.2353723

PubMed Abstract | Crossref Full Text | Google Scholar

22. von Herrath M, Bain SC, Bode B, Clausen JO, Coppieters K, Gaysina L, et al. Anti-interleukin-21 antibody and liraglutide for the preservation of β-cell function in adults with recent-onset type 1 diabetes: a randomised, double-blind, placebo-controlled, phase 2 trial. Lancet Diabetes Endocrinol. (2021) 9:212–24. doi: 10.1016/s2213-8587(21)00019-x

PubMed Abstract | Crossref Full Text | Google Scholar

23. Dejgaard TF, Frandsen CS, Hansen TS, Almdal T, Urhammer S, Pedersen-Bjergaard U, et al. Efficacy and safety of liraglutide for overweight adult patients with type 1 diabetes and insufficient glycaemic control (Lira-1): a randomised, double-blind, placebo-controlled trial. Lancet Diabetes Endocrinol. (2016) 4:221–32. doi: 10.1016/s2213-8587(15)00436-2

PubMed Abstract | Crossref Full Text | Google Scholar

24. den Hollander NH, Roep BO. From disease and patient heterogeneity to precision medicine in type 1 diabetes. Front Med. (2022) 9:932086. doi: 10.3389/fmed.2022.932086

PubMed Abstract | Crossref Full Text | Google Scholar

25. Maki KC, Phillips-Eakley AK, Smith KN. The effects of breakfast consumption and composition on metabolic wellness with a focus on carbohydrate metabolism. Adv Nutr. (2016) 7:613s–21s. doi: 10.3945/an.115.010314

PubMed Abstract | Crossref Full Text | Google Scholar

26. She Y, Wang K, Makarowski A, Mangat R, Tsai S, Willing BP, et al. Effect of high-fat and low-fat dairy products on cardiometabolic risk factors and immune function in a low birthweight swine model of diet-induced insulin resistance. Front Nutr. (2022) 9:923120. doi: 10.3389/fnut.2022.923120

PubMed Abstract | Crossref Full Text | Google Scholar

27. Magkos F, Hjorth MF, Astrup A. Diet and exercise in the prevention and treatment of type 2 diabetes mellitus. Nat Rev Endocrinol. (2020) 16:545–55. doi: 10.1038/s41574-020-0381-5

PubMed Abstract | Crossref Full Text | Google Scholar

28. Roussel R, Riveline JP, Vicaut E, de Pouvourville G, Detournay B, Emery C, et al. important drop in rate of acute diabetes complications in people with type 1 or type 2 diabetes after initiation of flash glucose monitoring in france: the RELIEF study. Diabetes Care. (2021) 44:1368–76. doi: 10.2337/dc20-1690

PubMed Abstract | Crossref Full Text | Google Scholar

29. Michaelis M, Shochat T, Shimon I, Akirov A. Features and long-term outcomes of patients hospitalized for diabetic ketoacidosis. Diabetes Metab Res Rev. (2021) 37:e3408. doi: 10.1002/dmrr.3408

PubMed Abstract | Crossref Full Text | Google Scholar

30. Zhang Z, Zhao L, Lu Y, Xiao Y, Zhou X. Insulin resistance assessed by estimated glucose disposal rate and risk of incident cardiovascular diseases among individuals without diabetes: findings from a nationwide, population based, prospective cohort study. Cardiovasc Diabetol. (2024) 23:194. doi: 10.1186/s12933-024-02256-5

PubMed Abstract | Crossref Full Text | Google Scholar

31. Rossing P. HbA1c and beyond. Nephrol Dial Transplant. (2023) 38:34–40. doi: 10.1093/ndt/gfab243

PubMed Abstract | Crossref Full Text | Google Scholar

32. Vora H, Kaur P. Prediabetes and diabetes in India: an HbA1c based epidemiology study. Diabetes Res Clin Pract. (2024) 217:111889. doi: 10.1016/j.diabres.2024.111889

PubMed Abstract | Crossref Full Text | Google Scholar

33. Muneer M, Akbar I. Acute metabolic emergencies in diabetes: DKA, HHS and EDKA. Adv Exp Med Biol. (2021) 1307:85–114. doi: 10.1007/5584_2020_545

PubMed Abstract | Crossref Full Text | Google Scholar

34. Dhatariya KK. The management of diabetic ketoacidosis in adults-an updated guideline from the Joint british diabetes society for inpatient care. Diabet Med. (2022) 39:e14788. doi: 10.1111/dme.14788

PubMed Abstract | Crossref Full Text | Google Scholar

35. Mustafa OG, Haq M, Dashora U, Castro E, Dhatariya KK. Management of hyperosmolar hyperglycaemic state (HHS) in adults: an updated guideline from the joint british diabetes societies (JBDS) for inpatient care group. Diabet Med. (2023) 40:e15005. doi: 10.1111/dme.15005

PubMed Abstract | Crossref Full Text | Google Scholar

36. Xie W, Li Y, Meng X, Zhao M. Machine learning prediction models and nomogram to predict the risk of in-hospital death for severe DKA: a clinical study based on MIMIC-IV, eICU databases, and a college hospital ICU. Int J Med Inform. (2023) 174:105049. doi: 10.1016/j.ijmedinf.2023.105049

PubMed Abstract | Crossref Full Text | Google Scholar

37. Vandervelden C, Lockee B, Barnes M, Tallon EM, Williams DD, Kahkoska A, et al. Predicting and ranking diabetic ketoacidosis risk among youth with type 1 diabetes with a clinic-to-clinic transferrable machine learning model. Diabetes Technol Ther. (2025) 27:271–82. doi: 10.1089/dia.2024.0484

PubMed Abstract | Crossref Full Text | Google Scholar

38. Fan T, Wang J, Li L, Kang J, Wang W, Zhang C. Predicting the risk factors of diabetic ketoacidosis-associated acute kidney injury: a machine learning approach using XGBoost. Front Public Health. (2023) 11:1087297. doi: 10.3389/fpubh.2023.1087297

PubMed Abstract | Crossref Full Text | Google Scholar

39. Xiang Z, Hu J, Bu S, Ding J, Chen X, Li Z. Machine learning based prediction models for the prognosis of COVID-19 patients with DKA. Sci Rep. (2025) 15:2633. doi: 10.1038/s41598-025-85357-9

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: type II diabetes, diabetes ketoacidosis, multifactor logistic regression analysis, nomogram, dietary status, history of infection, HbA1c abnormality, hypokalaemia

Citation: Xu C, Jiang X, He Q and Xu P (2025) Construction and validation of a clinical prediction model for diabetic ketoacidosis. Front. Med. 12:1638927. doi: 10.3389/fmed.2025.1638927

Received: 31 May 2025; Accepted: 13 October 2025;
Published: 31 October 2025.

Edited by:

Rahnuma Ahmad, Medical College for Women and Hospital, Bangladesh

Reviewed by:

Mainul Haque, National Defence University of Malaysia, Malaysia
Chaoran Liang, Shandong University, China
Md. Ahsanul Haq, International Centre for Diarrhoeal Disease Research (ICDDR), Bangladesh

Copyright © 2025 Xu, Jiang, He and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Quanan He, MTEyMDM2NjgxNEBxcS5jb20=; Peng Xu, NjI1MjcxOTQwQHFxLmNvbQ==

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.