- 1Department of Pharmacy, The First Hospital of Lanzhou University, Lanzhou, China
- 2School of Pharmacy, Lanzhou University, Lanzhou, China
- 3The Second School of Clinical Medicine, Lanzhou University, Lanzhou, China
- 4Department of Pharmacy, The Third Affiliated Hospital of Xi'an Medical College, Xi'an, China
- 5Suzhou Medical College, Soochow University, Suzhou, China
Background: Metabolic dysfunction-associated steatotic liver disease (MASLD) is a major comorbidity in type 2 diabetes mellitus (T2DM), yet early prediction models tailored to this population are limited. This study aimed to develop and validate a novel diagnostic predictive model for MASLD in adults with T2DM.
Methods: A total of 4,726 T2DM patients were retrospectively analyzed. Candidate predictors were screened by least absolute shrinkage and selection operator (LASSO) regression, and a multivariable logistic regression model was built. Significant variables were integrated into a diagnostic predictive nomogram (DPN), with online and Excel-based calculators developed. Model performance was comprehensively evaluated and compared with four established models for fatty liver disease across training, internal, and external (NHANES) validation datasets. Subgroup analyses assessed generalizability.
Results: Eight independent predictors (sex, age, body mass index, alanine aminotransferase, albumin, diabetes duration, triglycerides, and high-density lipoprotein cholesterol) were included in the final model. The DPN achieved robust discrimination in training set (AUC: 0.775, 95% CI: 0.759–0.791), validation set (0.767, 95% CI: 0.742–0.791), and test set (0.794, 95% CI: 0.749–0.839) compared to existing models. NRI and IDI confirmed improved predictive capacity (P < 0.05). Calibration curves were excellent in the training (P = 0.936, Brier score = 0.184), validation (P = 0.956, Brier score = 0.189), and test sets (P = 0.687, Brier score = 0.156). DCA and CIC further demonstrated higher clinical net benefit. Subgroup analyses confirmed stability and broad applicability.
Conclusions: The DPN is a clinically practical and resource-efficient screening tool that enables early risk stratification for MASLD in patients with T2DM. Its implementation could streamline screening pathways and facilitate timely intervention in routine clinical practice.
1 Introduction
Diabetes mellitus is a heterogeneous metabolic disorder characterized by impairments in both glucose and lipid metabolism. It has emerged as the fastest-growing global health burden, with the number of affected individuals continuing to rise. According to the latest report from the International Diabetes Federation (IDF), the global number of adults living with diabetes is projected to reach 643 million by 2030 and 784 million by 2045 (1). Currently, over 90% of diabetes cases are attributed to type 2 diabetes mellitus (T2DM), a condition primarily driven by varying degrees of pancreatic β-cell dysfunction and insulin resistance (IR) (2). Despite ongoing advances in the therapeutic landscape, T2DM remains associated with a substantial burden of complications and comorbidities. Epidemiological data indicate that the prevalence of end-stage renal disease (ESRD), diabetic retinopathy (DR), cardiovascular disease (CVD), and chronic liver disease among individuals with diabetes ranges from 12%−55%, 2.6%, 40%, and 12.3%−57%, respectively (3).
Metabolic dysfunction-associated steatotic liver disease (MASLD), the term recently adopted to replace non-alcoholic fatty liver disease (NAFLD), is now encompassed under the umbrella of steatotic liver disease (SLD). This nomenclature shift, driven by a 2023 international consensus, aims to reduce stigma and improve disease awareness (4). The pathophysiological mechanisms linking T2DM and MASLD are multifactorial and intertwined, primarily driven by IR (5). In the diabetic state, hyperinsulinaemia coupled with IR enhances lipolysis, leading to elevated circulating free fatty acids (FFAs). These FFAs are taken up by the liver, promoting hepatic triglyceride accumulation. Simultaneously, dysregulation of key transcription factors like sterol regulatory element-binding protein 1c (SREBP-1c) exacerbates de novo lipogenesis. This lipid overload induces endoplasmic reticulum stress (ERS) and mitochondrial dysfunction, resulting in increased reactive oxygen species (ROS) production, impaired β-oxidation, and heightened inflammation (6).
Furthermore, the coexistence of MASLD has been shown to exert a deleterious effect on the long-term prognosis and survival of individuals with T2DM. For example, individuals without diabetes who present with MASLD at baseline exhibit a significantly increased risk of developing T2DM over time, even after adjusting for conventional diabetes-related risk factors (7). A population-based study conducted in Italy further demonstrated that patients with diabetes have a two- to three-fold higher risk of mortality from chronic liver diseases, particularly MASLD, compared to non-diabetic individuals (8). Conversely, findings from a 10-year cohort study revealed that histological improvement in MASLD was independently associated with a nearly 70% reduction in the risk of incident T2DM (9). Collectively, these findings underscore the clinical importance of early detection and longitudinal monitoring of MASLD in the management of patients with T2DM.
A variety of imaging modalities are currently available for the screening and diagnosis of MASLD; however, each method presents specific limitations that hinder their widespread application. Ultrasound and computed tomography (CT) are commonly used non-invasive tools for initial MASLD assessment (10). Nevertheless, ultrasound results are susceptible to operator-dependent variability and can be influenced by factors such as abdominal adiposity and disease staging (11). CT, although less operator-dependent, exposes patients to ionizing radiation, has limited sensitivity for detecting early or mild hepatic steatosis, and is associated with relatively high costs, restricting its routine use (12). Magnetic resonance imaging (MRI) and its advanced techniques, including proton density fat fraction (PDFF), chemical shift (including encoded MRI and magnetic resonance spectroscopy), offer high accuracy and reproducibility in quantifying hepatic fat content. However, these methods are time-intensive, expensive, and contraindicated in certain clinical settings (e.g., acute intracranial hemorrhage), thus rendering them unsuitable for large-scale screening (13). The controlled attenuation parameter (CAP), an emerging tool derived from transient elastography, is simple, rapid, and non-invasive. Despite its convenience, CAP accuracy may be compromised by various confounders such as diabetes, obesity, limited intercostal space, hepatic inflammation, and fibrosis (14). Liver biopsy remains the reference standard for definitive diagnosis and histological staging of MASLD. However, its invasive nature introduces procedural risks, including bleeding, infection, and bile leakage, and sampling limitations can lead to false-negative findings (15). Additionally, all of the aforementioned diagnostic modalities must be conducted in clinical settings, further limiting their accessibility for early-stage or asymptomatic individuals. Given these limitations, there is an urgent need to develop a novel screening tool that is non-invasive, reproducible, cost-effective, and easily accessible.
In recent years, clinical prediction models have garnered increasing attention in the medical community due to their high diagnostic accuracy, broad applicability across diverse populations, and operational simplicity. Several models originally developed for NAFLD are now highly relevant for identifying MASLD, as they primarily utilize metabolic parameters. For instance, the Framingham Steatosis Index (FSI) has emerged as a practical tool for identifying hepatic steatosis, achieving areas under the receiver operating characteristic curve (AUCs) of 0.845 and 0.760 in internal and external validation cohorts, respectively (16). Similarly, both the Zhejiang University Index (ZJU) (17) and the Hepatic Steatosis Index (HSI) (18) were developed using data from general adult populations and demonstrated excellent discriminatory power (AUC > 0.80). In addition, the triglyceride-glucose (TyG) index, a low-cost and reliable surrogate marker of insulin resistance (IR), has gained substantial traction in MASLD screening. Numerous large-scale studies leveraging national and regional databases have confirmed the TyG index's predictive utility for MASLD in individuals from the general population (19, 20). However, it is important to note that the majority of existing models were established and validated in general populations and do not account for the unique metabolic characteristics of individuals with T2DM. These models often assume a uniform MASLD risk across all subgroups, which may result in suboptimal risk estimation in diabetic populations. Given the distinct pathophysiological interplay between IR, β-cell dysfunction, and hepatic steatosis in T2DM, there remains a critical need to develop tailored prediction tools that address the specific risk factors and clinical profiles of this high-risk group.
In light of the practical demands of T2DM management and the aforementioned limitations of existing screening models, the present study aimed to develop a novel, individualized, and clinically applicable diagnostic tool for MASLD tailored specifically to the T2DM population. This tool, the Diagnostic Predictive Nomogram (DPN), was constructed using readily available demographic, anthropometric, and laboratory parameters. To ensure its robustness and clinical utility, the model's predictive performance was rigorously evaluated and externally validated, with the goal of facilitating broader applicability across diverse clinical settings.
2 Materials and methods
2.1 Study population
All data used in this study were obtained from hospitalized patients. Data were retrospectively extracted from the Hospital Information System (HIS) of the First Hospital of Lanzhou University, covering the period from January 2016 to June 2022. Eligible participants met the following inclusion criteria: (1) confirmed diagnosis of T2DM prior to hospital admission, based on the diagnostic criteria of the World Health Organization (WHO) (21); (2) age≥18 years; and (3) availability of complete clinical baseline and abdominal ultrasound imaging data. Exclusion criteria were as follows: (1) diagnosis of T1DM, gestational diabetes mellitus (GDM), or other specific types of diabetes; (2) presence of acute diabetic complications, such as diabetic ketoacidosis or hyperosmolar hyperglycaemic coma; (3) current diagnosis of drug-induced hepatitis, viral hepatitis, or autoimmune hepatitis; (4) missing data on alcohol consumption or documented history of alcohol abuse/excessive drinking; (5) presence of insulinoma, pituitary adenoma, Cushing's syndrome, or secondary pancreatic injury; (6) current use of medications known to affect glucose metabolism, including thyroid hormones, glucocorticoids, or growth hormones; (7) presence of severe infections, stress-related conditions, or cancer cachexia; and (8) prior history of fatty liver disease. This study was approved by the Ethics Committee of the First Hospital of Lanzhou University (Approval No. LDYYLL2021-294). Written informed consent was obtained from all participants. The study was conducted in accordance with the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) guidelines (22).
2.2 Patient screening process
A total of 7,246 participants were initially identified. After applying the predefined inclusion and exclusion criteria, the following individuals were excluded: those aged < 18 years (n = 153), those with non-T2DM diagnoses (n = 132), individuals with missing key biochemical data (n = 1,442), those without abdominal ultrasound records (n = 232), individuals with a history of excessive alcohol consumption (n = 258), those diagnosed with chronic liver diseases (n = 221), and patients with malignant tumors (n = 82). After exclusions, a final cohort of 4,726 patients with confirmed T2DM was included in the analysis. A detailed flowchart of the participant selection process is presented in Figure 1. All 4,726 hospitalized T2DM patients included in this study were admitted due to diabetes-related complications. The primary indications for hospitalization included diabetic distal symmetrical polyneuropathy (DSPN, 89.9%), diabetic peripheral vascular disease (DPVD, 68.5%), and diabetic nephropathy (DN, 61.2%). Among the common comorbidities, 1,756 patients had MASLD (originally diagnosed as fatty liver/NAFLD). Concurrently, hypertension (51.8%), hyperlipidaemia (25.3%), atherosclerosis (15.2%), and cardiovascular diseases (23.1%) were the major coexisting conditions, collectively constituting the clinical indications associated with hospitalization.
2.3 Sample estimation
It is generally recommended that a minimum of 10 outcome events be available for each candidate variable considered in the development of a predictive model (23). In this study, 29 variables were planned for inclusion, necessitating at least 290 outcome events. Given the reported prevalence of MASLD among patients with T2DM ranges from 50% to 75%, the estimated minimum required sample size would range from 387 to 580 individuals. The final cohort of 4,726 patients included in this study meets this requirement, thereby ensuring adequate statistical power for model development.
2.4 Data collection
This study collected comprehensive clinical data from hospitalized patients with T2DM, including demographic characteristics, anthropometric parameters, and biochemical indicators. Demographic data encompassed sex, age, duration of diabetes, smoking and alcohol consumption history, as well as medical history related to hypertension and diabetes-associated complications. These variables were obtained through standardized questionnaires administered by trained healthcare professionals. Anthropometric measurements, including height, weight, systolic blood pressure (SBP), and diastolic blood pressure (DBP), were recorded by certified staff using calibrated instruments and standardized protocols. Height and weight were measured using an automatic anthropometer (accuracy: 0.1 cm and 0.1 kg, respectively), and body mass index (BMI) was calculated as weight divided by height squared (kg/m2). Blood pressure was measured in the seated position after a 5–10 min rest period; three consecutive readings were taken using an electronic sphygmomanometer, and the average was used for analysis. Biochemical parameters, including comprising fasting blood glucose (FBG), uric acid (UA), alanine aminotransferase (ALT), aspartate aminotransferase (AST), alkaline phosphatase (ALP), γ-glutamyl transpeptidase (GGT), total bilirubin (TBIL), direct bilirubin (DBIL), indirect bilirubin (IBIL), total protein (TP), albumin (ALB), globulin (GLB), total cholesterol (TC), triglycerides (TG), high-density lipoprotein cholesterol (HDL-C), and low-density lipoprotein cholesterol (LDL-C), were measured using venous blood samples collected after an overnight fast of at least 8 h. All samples were analyzed in a central laboratory using a fully automated biochemical analyzer (LABOSPECT, Japan), with all procedures performed under standardized conditions to ensure consistency and accuracy.
2.5 Research outcomes
The primary outcome of this study was the first-time diagnosis of MASLD during the index hospitalization, with no prior history of hepatic steatosis or related liver conditions. MASLD was diagnosed by two experienced ultrasound physicians using a high-resolution ultrasound imaging system, based on standardized diagnostic criteria. Characteristic imaging features included: (1) an intact liver capsule with a significantly enlarged, rounded liver contour; (2) dense echogenic foci diffusely distributed within the hepatic parenchyma, with attenuation of echo intensity from superficial to deep regions, and a foggy appearance of the diaphragm and liver base due to impaired sound transmission; (3) reduced visibility of intrahepatic tubular structures, diminished vascular echoes, blurred vascular branches, and a decreased liver-to-kidney parenchymal echogenicity ratio; and (4) in cases of diffuse hepatic steatosis, a homogeneous distribution of steatosis throughout the liver with uniformly increased echogenicity and no evidence of focal heterogeneity. The diagnosis of MASLD was assigned to patients who exhibited these imaging features and had pre-existing T2DM—a cardinal criterion for metabolic dysfunction—after the exclusion of other identifiable causes of fatty liver, including alcohol abuse, viral hepatitis, drug-induced liver disease, or total parenteral nutrition (24).
2.6 Complications and concomitant conditions
Hypertension was defined as a SBP ≥140 mmHg and/or DBP ≥90 mmHg on three separate occasions in the absence of antihypertensive therapy. Individuals with a prior history of hypertension who maintained blood pressure below 140/90 mmHg while on antihypertensive medications were still classified as hypertensive (25). DN was diagnosed in patients with type 2 diabetes mellitus after excluding other causes of chronic kidney disease, based on an estimated glomerular filtration rate (eGFR) < 60 mL/min/1.73 m2 or a urinary albumin-to-creatinine ratio >30 mg/g persisting for more than 3 months (26). DSPN was defined as a neuropathy occurring after the onset of diabetes and diagnosed when patients exhibited at least two of the following clinical features: impaired temperature or vibration sensation, slowed nerve conduction velocity, reduced or absent distal sensation in the feet, or diminished ankle reflexes (27). DPVD was defined as peripheral arterial atherosclerosis confirmed by ultrasound imaging in the context of a known diagnosis of diabetes mellitus (28).
2.7 Existing prediction models for fatty liver disease
In order to evaluate the performance of the model we developed, this study also comprehensively compared the performance of DPN with four previously reported prediction models for fatty liver disease. These four models are:
(1) ZJU = BMI (kg/m2) + FPG (mmol/L) + TG (mmol/L) + 3 × [ALT(U/L)/AST(U/L)] (+ 2, if for women);
(2) HSI = 8 × [ALT(U/L)/AST(U/L)] + BMI (kg/m2)(+ 2, if T2DM; + 2, if for women);
(3) TyG = Ln [TG (mg/dL) × FPG (mg/dL)/2];
(4) FSI = −7.981 + 0.011 × Age (years) – 0.146 × Sex (female = 1, male = 0) + 0.173 × BMI (kg/m2) + 0.007 × TG (mg/dL)+ 0.593 × Hypertension (yes = 1, no = 0) + 0.789 × T2DM (yes = 1, no = 0) + 11 × [ALT(U/L)/AST(U/L)] > 1.33 (yes = 1, no = 0).
2.8 Model establishment
The study cohort comprising 4,726 patients was randomly divided into a training set (n = 3,310) and a validation set (n = 1,416) in a 7:3 ratio using the “Caret” package in R. The DPN was constructed based on the training set. To preliminarily explore associations between candidate variables and MASLD, univariate logistic regression analyses were performed. To further reduce dimensionality, eliminate irrelevant or multicollinear variables, and prevent model overfitting, least absolute shrinkage and selection operator (LASSO) regression was applied using the “glmnet” package with 10-fold cross-validation. Variables with non-zero coefficients were retained as potential predictors. The regularization parameter (lambda.lse), corresponding to the minimum mean squared error and optimal model performance, was selected for subsequent analysis. Following LASSO selection, variance inflation factors (VIFs) were calculated to assess multicollinearity among the remaining variables, and restricted cubic spline (RCS) analyses were conducted to explore potential non-linear relationships between continuous predictors and MASLD risk. Subsequently, multivariate logistic regression with stepwise variable selection based on the Akaike Information Criterion (AIC) was performed using the “MASS” package to develop the final prediction model. To enhance clinical utility and accessibility, an interactive web-based calculator was developed using the “Shiny” package in R. Additionally, a probability reference table based on the DPN was provided in the Supplementary material for practical application in healthcare settings.
2.9 Evaluation of model performance
The performance of the DPN was evaluated from three perspectives: discrimination, calibration, and clinical utility. First, the discriminative ability of the model was assessed by plotting receiver operating characteristic (ROC) curves using the “pROC” package in R. The AUC was calculated to quantify the ability of DPN, as well as existing models including the ZJU, HSI, TyG, and FSI, to distinguish MASLD in patients with T2DM. The Delong test was applied to compare AUCs between models, with statistical significance defined as P < 0.05. To further evaluate incremental predictive value, the net reclassification index (NRI) and integrated discrimination improvement (IDI) were computed. Additional diagnostic performance metrics derived from ROC analysis, including sensitivity (SEN), specificity (SPE), positive predictive value (PPV), negative predictive value (NPV), positive likelihood ratio (PLR), negative likelihood ratio (NLR), diagnostic odds ratio (DOR), Youden's index, and optimal cutoff value, were also calculated. Second, model calibration was assessed using calibration plots generated with the “calibrate” package to compare predicted probabilities with observed outcomes. A Brier score < 0.25 and a non-significant Hosmer–Lemeshow test (P > 0.05) were considered indicative of good calibration. Finally, the clinical utility of the DPN was evaluated using decision curve analysis (DCA) and clinical impact curves (CIC) to compare the net clinical benefit across various threshold probabilities against those of the ZJU, HSI, TyG, and FSI models. The DCA and CIC plots were generated using the “dca.R” function in R.
2.10 Model validation
To evaluate the generalizability and robustness of the diagnostic predictive nomogram (DPN), both internal and external validation were performed. Internal validation was conducted using the previously defined validation subset of the study cohort (n = 1,416). For external validation, an independent dataset was extracted from the National Health and Nutrition Examination Survey (NHANES) database, covering the period from 2017 to March 2020, comprising 471 eligible individuals. Details of the validation process are presented in Supplementary Figure S1. NHANES is a nationally representative, cross-sectional survey conducted by the National Center for Health Statistics (NCHS) to assess the health and nutritional status of the U.S. population. The survey protocol was approved by the NCHS Research Ethics Review Board, and all participants provided written informed consent. Additional information regarding NHANES is available on the official website: https://www.cdc.gov/nchs/nhanes/index.html.
2.11 Statistical analysis
All statistical analyses and graphing were performed using SPSS 27.0 (IBM Corp., Armonk, NY, USA) and R software 4.3.1 (R Foundation for Statistical Computing, Vienna, Austria). Among the 29 predictive variables included, the proportion of missing values was less than 30% for all variables. Therefore, we imputed missing values using the k-nearest neighbor algorithm (KNN). For the description of baseline data, when continuous variables were normally distributed, they were expressed as mean ± standard deviation, and comparisons between groups were performed using t-tests. When the measurement data were not normally distributed, the median and interquartile range [M (P25-P75)] were used, and the Mann-Whitney U test was used for comparisons between the two groups. The count data were expressed as frequencies and percentages, and the chi-square test was used for comparisons between the two groups. A P-value < 0.05 was considered statistically significant.
3 Results
3.1 Basic characteristics of the research subjects
This study initially included 7,246 patients, with 4,726 meeting the inclusion criteria. The inclusion process is shown in Figure 1. Among the 4,726 patients with T2DM, the average age was 59.43 years, with 61.3% being male (n = 2,901). The enrolled population was divided into the MASLD group (n = 1,756) and the non-MASLD group (n = 2,970) based on whether MASLD was first diagnosed during hospitalization. Table 1 describes the basic characteristics and metabolic parameters of the two groups of patients. Compared with the non-MASLD group, the MASLD group had higher body weight and BMI, younger age, shorter disease duration, and lower proportions of males, DSPN, and DPVD (P < 0.05). Moreover, the MASLD group had higher levels of DBP, FBG, TC, TG, LDL-C, UA, ALT, AST, ALP, TP, ALB, TBIL, DBIL, and IBIL, and lower levels of HDL-C and GLB (P < 0.05). Notably, the four MASLD-related predictive parameters (ZJU, HSI, TyG, and FSI) were higher in the MASLD group than in the non-MASLD group (P < 0.001). For the development and validation of the predictive model, the patients were randomly divided into the training set (n = 3,310) and the validation set (n = 1,416) in a 7:3 ratio. The average age of patients in the training set was 59.65 years, with 2,039 patients (61.6%) being male, and the prevalence of MASLD was 37.07% (n = 1,227). Patients in the validation set were similar to those in the training set in most clinical characteristics, with no significant differences between the two groups (P3 > 0.05, see Supplementary Table S1).
3.2 Screening of important variables in the training set
A correlation heatmap was used to preliminarily assess the association between the aforementioned 27 variables (excluding height and weight, which are closely related to BMI) related to demographic factors, laboratory factors, anthropometric measurements, lifestyle, comorbidities, and MASLD (Figure 2A). We further used univariate logistic regression to quantify the association between the aforementioned variables and MASLD in patients with T2DM, with 22 variables showing a significant association with MASLD (Supplementary Table S2, P < 0.05). To ultimately develop a model with the fewest variables, no overfitting, and optimal performance, we used the LASSO algorithm to perform dimensionality reduction and variable selection on the aforementioned 27 variables, selecting an appropriate penalty coefficient λ via 10-fold cross-validation. As the penalty coefficient λ increased, the coefficients of some of the included variables were gradually compressed to 0 (Figure 2B). At λ = 0.0232 (1 standard error of λ), the number of selected variables was minimal, and the mean squared error was small (Figure 2C). At this point, 8 variables with non-zero coefficients were obtained, namely gender, age, BMI, ALT, ALB, disease duration, TG, and HDL (Supplementary Table S2). Their VIF values ranged from a maximum of 1.272 to a minimum of 1.029, with all VIF values < 5, indicating no multicollinearity among them.
Figure 2. Correlation heat map and variable selection in the training set. (A) The correlation heat map is used to determine the relationship between variables. (B) Select the optimal parameter λ in LASSO regression, with the x-axis representing logλ and the y-axis representing the coefficient. (C) Relationship with the number of variables, with the lower end of the x-axis representing logλ, the upper end of the x-axis representing the number of variables, and the y-axis representing the mean square error.
3.3 Establishment of DPN in the training set
After LASSO regression, the aforementioned eight predictive factors were selected as the optimal variables. These eight variables were further incorporated into a stepwise AIC-based multivariate logistic regression analysis, which revealed that gender, age, BMI, ALT, ALB, disease duration, TG, and HDL were all statistically significant (P < 0.05, Figure 3A and Supplementary Table S3). RCS analysis further confirmed that the continuous variables among the aforementioned predictive factors (age, BMI, ALT, ALB, disease duration, TG, and HDL) exhibited either a nonlinear relationship (Poverall < 0.05, Pnon − linear < 0.05) or a linear relationship (Poverall < 0.05, Pnon − linear > 0.05) with MASLD risk, as shown in Supplementary Figure S2. These variables were used to construct the DPN and presented in the form of nomogram that quantifies MASLD risk (Figure 3B). Moreover, for the convenience of clinical medical staff, we further developed an interactive application based on the R package Shiny (a MASLD probability scale based on this program is also included in the Supplementary material). The uniform resource locator (URL) for this program is https://wangyazhi.shinyapps.io/DynNomapp/. The model summary is shown in Supplementary Figure S3.
Figure 3. Forest plot and diagnostic prediction nomogram. (A) Forest plot visualization of the results of stepwise AIC-selected multiple logistic regression. (B) DPN shows the predicted probability of MASLD, ranging from 0 to 200. A vertical line is drawn for each predictor, and the corresponding score is recorded. The scores for each predictor are added together to obtain the total score. The total score is the predicted probability of MASLD.
3.4 Performance evaluation of DPN
ZJU, HSI, TyG, and FSI are several previously reported predictive models that are widely used in the assessment of fatty liver disease, including MASLD. To comprehensively evaluate the performance, applicability, and stability of DPN, this study calculated and compared multiple performance metrics, including ROC curves, SEN, and SPE across the training set, validation set (internal validation set), and test set (external validation set from the NHANES database). We also conducted calibration and clinical utility assessments, as well as subgroup analyses.
3.5 Comparison of performance parameters between DPN and other existing models
Compared with ZJU, HSI, TyG, and FSI, DPN had the highest AUC values for predicting MASLD risk in the training set (0.775, 95% CI: 0.759–0.791), validation set (0.767, 95% CI: 0.742–0.791) and test set (0.794, 95% CI: 0.749–0.839), and these AUC values were statistically significantly different from those of the aforementioned four models (P < 0.05), as shown in Tables 2–4 and Figure 4. This indicated that DPN has the best discriminatory ability for MASLD and non-MASLD. Moreover, DPN demonstrated excellent SEN and SPE for predicting MASLD risk in the training set (SEN 0.785, 95% CI: 0.761–0.807; SPE 0.640, 95% CI: 0.619–0.661), the validation set (SEN 0.760, 95% CI: 0.721–0.795; SPE 0.666, 95% CI: 0.634–0.697), and test set (SEN 0.799, 95% CI: 0.752–0.840; SPE 0.672, 95% CI: 0.585–0.748). Compared to other models, Youden's index and DOR were the highest. Tables 2–4 described the specific details of other performance parameters (including PPV, NPV, PLR, NLR, and cutoff values). Moreover, alternative rule-in/rule-out cut-offs are provided in the Supplementary Table S4.
Table 2. Comparison of the performance of DPN and other existing models in predicting MASLD in the training set.
Table 3. Comparison of the performance of DPN and other existing models in predicting MASLD in the validation set.
Table 4. Comparison of the performance of DPN and other existing models in predicting MASLD in the NHANES dataset.
Figure 4. ROC curves of DPN and other models on the training set (A), validation set (B), and NHANES dataset (C).
3.6 The incremental value of DPN in identifying MASLD in patients with T2DM
Since the AUC increment is not intuitive or sensitive, this study calculated the NRI and IDI to further assess the predictive capability of DPN compared to other models. In the training set, validation set, and test set, the NRI and IDI of DPN were significantly higher than those of ZJU, HSI, TyG, and FSI (P < 0.05), as shown in Tables 2–4. This indicated that DPN has a superior ability to identify the risk of MASLD in the T2DM population compared to existing models.
3.7 Assessment of the calibration and clinical utility of DPN
When the calibration curve closely follows the diagonal line, it indicates that the model's predicted probability is more consistent with the actual probability of disease occurrence, resulting in higher prediction accuracy (P > 0.05, Brier score < 0.25). The calibration curves indicated that DPN demonstrated good calibration in the training set (P = 0.936, Brier score = 0.184), validation set (P = 0.956, Brier score = 0.189), and test set (P = 0.687, Brier score = 0.156), as shown in Figure 5. The DCA (Figure 6) and CIC curves (Figure 7) demonstrated the clinical utility of DPN. DCA results indicated that when DPN was within the majority of threshold ranges in the training set (5%−78%), validation set (5%−75%), and test set (11%−98%), its net benefit consistently exceeded that of other predictive models. CIC results suggested that when the threshold probability was ≥60%, the predicted high-risk MASLD population in the training set, validation set, and test set of the T2DM population highly aligned with the actual MASLD incidence population. These findings confirmed the high clinical utility of DPN.
Figure 5. Calibration assessment of DPN in the training set (A), validation set (B), and NHANES set (C). The x-axis represents the predicted probability of MASLD by DPN, and the y-axis represents the actual probability.
Figure 6. Clinical utility assessment of DPN and other existing models through decision curve analysis in the training set (A), validation set (B), and NHANES set (C). The x-axis represents the threshold probability, and the y-axis represents the net benefit (calculated by subtracting the relative harm from the benefit).
Figure 7. Clinical utility assessment of DPN in the training set (A), validation set (B), and NHANES set (C) using clinical impact curves. The x-axis represents the high-risk threshold, and the y-axis represents the number of high-risk individuals.
3.8 Subgroup analysis
Subgroup analyses confirmed that the DPN exhibited robust discriminatory performance, applicability, and stability across various clinically relevant subgroups, including sex (male and female), age ( ≤ 60 years and >60 years), duration of diabetes ( ≤ 20 years and >20 years), and hypertension status (with and without hypertension) (Supplementary Tables S5–S12; Supplementary Figures S4–S7). In all subgroups, the DPN consistently demonstrated higher AUC values compared with existing models, with most comparisons reaching statistical significance (P < 0.05), as shown in Supplementary Figure S4. Furthermore, calibration plots, DCA, and CIC across subgroups reaffirmed that the DPN maintained good calibration and clinical utility irrespective of subgroup stratification (Supplementary Figures S5–S7).
4 Discussion
MASLD (formerly known as NAFLD) has become one of the most prevalent chronic liver diseases globally, with its historical prevalence estimated at approximately 25% in the general adult population (29). Among individuals with T2DM, the prevalence increases by two- to three-fold, making T2DM one of the most potent risk factors for the progression of MASLD to metabolic dysfunction-associated steatohepatitis (MASH), advanced fibrosis, or cirrhosis (30). Longitudinal data further underscore the clinical implications of this interplay. In a 9-year follow-up study, patients with T2DM and MASLD exhibited a 2.2-fold increase in all-cause mortality compared to those without MASLD (31), highlighting the urgent need for proactive hepatic screening and intervention in this high-risk population.
Currently, imaging modalities such as contrast-enhanced ultrasound, CT, MRS, and liver biopsy are not feasible for large-scale MASLD screening due to their high cost, technical complexity, and limited accessibility. In contrast, demographic, anthropometric, and routine biochemical data are widely available in clinical practice, particularly in primary and secondary healthcare settings. Therefore, there is a clear need for a simple, cost-effective, and non-invasive screening tool based on these readily obtainable indicators to enable early identification of MASLD in patients with T2DM. In response to this need, this study developed a novel DPN tailored for the T2DM population. To promote individualized clinical application, the DPN was further deployed as an interactive web-based calculator and an Excel algorithm tool. To comprehensively evaluate the model's performance, we conducted internal and external validation across multiple datasets. The DPN consistently outperformed existing models—including ZJU, HSI, TyG, and FSI—in the training, validation, and test sets, with all AUC values exceeding 0.75 and all comparisons reaching statistical significance (P < 0.05). Moreover, DPN demonstrated significant incremental predictive value, as reflected by positive NRI and IDI (all P < 0.05). Calibration plots, DCA, and CIC uniformly demonstrated good model calibration, high clinical net benefit, and broad applicability. Subgroup analyses across stratified populations with T2DM further confirmed the robustness, reliability, and generalizability of the DPN.
Previous studies, both domestic and international, have primarily focused on developing predictive models for fatty liver disease (formerly NAFLD) in general adult populations. Representative examples included the ZJU Index in China (17), the HSI in South Korea (32), and the FSI in the United States (16). In addition, the TyG index has been increasingly recognized as a useful surrogate marker for assessing metabolic risk and disease progression in MASLD (33). Some recent studies have further incorporated machine learning algorithms to enhance predictive accuracy and computational efficiency (34). However, the applicability and predictive performance of these models in individuals with T2DM remain uncertain. Moreover, there is a paucity of screening tools specifically tailored for the T2DM population. Xue et al. (35) and Zhang et al. (36) both developed MASLD risk prediction models using logistic regression in T2DM cohorts. While these models demonstrated good discriminative ability in internal validation (AUCs of 0.756 and 0.848 in training sets; 0.755 and 0.809 in validation sets), several limitations were noted. These include the inclusion of a large number of predictors, lack of external validation in independent populations, absence of head-to-head model comparisons, and limited practical utility due to complex variable requirements. In contrast, the present study addresses these limitations by developing and validating a simplified eight-variable DPN, which demonstrated superior predictive performance compared to existing models across three independent datasets. Additionally, the model was implemented as an interactive web-based calculator and an Excel algorithm tool, enhancing its accessibility and clinical utility. Collectively, these findings highlight the practicality, scalability, and robustness of the DPN as a screening tool for MASLD in populations with T2DM.
Among 27 candidate variables, eight were ultimately included in the model (including gender, age, disease duration, BMI, TG, HDL-C, ALT, and ALB) based on LASSO regression and stepwise AIC selection. Gender has long been recognized as a determinant of susceptibility to metabolic diseases, including T2DM, CVD, hypertension, and MASLD (37). Women with T2DM exhibit a significantly higher risk of CVD than men, with carotid stenosis and ischaemic stroke more prevalent in female patients with metabolic syndrome (38). MASLD, as the hepatic manifestation of metabolic syndrome, also appears to disproportionately affect women with T2DM. A recent study showed that women with prediabetes or T2DM had a higher MASLD risk than men (39), a finding supported by Korean data indicating increased MASLD prevalence in women with elevated fasting glucose or antidiabetic treatment (40). This may be attributed to greater metabolic dysfunction in women, reflected in higher visceral adiposity, dyslipidemia, homeostatic model assessment of insulin resistance (HOMA-IR), and inflammatory markers (e.g., hsCRP), all of which promote hepatic IR and inflammation—key drivers of MASLD pathogenesis (41). Consistent with these findings, this model identified female as an independent risk factor for MASLD in patients with T2DM.
In this study, T2DM patients with MASLD were younger and had a shorter disease duration. The findings of Zhang et al. (36) and Xue et al. (35) were consistent with ours. This suggested that T2DM patients with MASLD were younger compared to those without MASLD. Data indicate that the prevalence of MASLD is often higher in middle-aged populations than in older populations, which may be due to factors such as high work stress, poor lifestyle habits (e.g., irregular diet and lack of exercise), and the widespread presence of metabolic syndrome.
BMI is a widely used anthropometric measure for evaluating obesity and predicting the risk of obesity-related complications, including diabetes, dyslipidaemia, MASLD, and CVD (42). Cohort studies have demonstrated that both central and general obesity are strongly associated with MASLD in T2DM patients, with BMI independently linked to MASLD development (43). Interventions such as dietary modification, pharmacotherapy, and bariatric surgery aim to reduce hepatic steatosis by improving BMI in patients with T2DM or MASLD (44–46). In this study, BMI emerged as a key predictor of MASLD, with significantly higher values observed in affected individuals. These findings underscore the clinical value of BMI in MASLD risk stratification and highlight the importance of weight management in preventing hepatic complications in T2DM.
TG and HDL-C are routinely assessed lipid markers in T2DM management and were included as key predictors in the DPN. Elevated TG and reduced HDL-C levels have been consistently associated with increased risk of T2DM and prediabetes (47), and contribute to the development and progression of MASLD (48). Verwer et al. (49) reported that hepatic steatosis in T2DM was closely linked to postprandial hypertriglyceridaemia, which led to TG enrichment of HDL particles. Impaired clearance of TG-rich HDL compromises its antioxidant function and promotes endothelial dysfunction. Recent studies have proposed lipid ratios as effective markers for identifying individuals at high risk of T2DM and MASLD (50). Li et al. (51) found that the TG/HDL-C ratio had the highest diagnostic value for MASLD in newly diagnosed T2DM patients, with an AUC of 0.732, sensitivity of 73.8%, and specificity of 60.1%. These findings suggest that these lipid parameters may serve as promising candidates for future model refinement.
ALT, a liver-specific enzyme, is commonly elevated in patients with T2DM and MASLD, reflecting hepatocellular injury (52). In Cen's predictive model, ALT ranked as the second most important predictor of MASLD after TG (53), while Mor et al. (54) also identified ALT as a valuable early marker for MASLD in newly diagnosed T2DM patients. Furthermore, elevated ALT has been linked to increased metabolic activity and IR in this population (55). Consistent with prior studies, our findings supported the inclusion of ALT as a key variable in MASLD risk prediction. ALB, the most abundant plasma protein synthesized by the liver, serves critical roles in antioxidation, anti-inflammation, and molecular transport (56). Although evidence on the relationship between ALB and MASLD in T2DM is limited, prior studies have reported conflicting findings. For example, some associate lower albumin levels with metabolic dysfunction and T2DM risk (57), while others link higher albumin to IR (58). In this study, ALB levels were higher in T2DM patients with MASLD, potentially reflecting the influence of hepatoprotective or renoprotective therapies. This observation warrants further investigation.
The MASLD risk prediction tool developed in this study offers a practical and non-invasive approach for early screening in patients with T2DM. First, due to the marked differences in metabolic profiles and MASLD prevalence between T2DM and healthy populations (60%−70% vs. 25%), a T2DM-specific model is warranted. Moreover, the model incorporates routinely collected demographic, anthropometric, and laboratory parameters, ensuring accessibility and ease of application in clinical settings. Third, across three independent cohorts (training, internal validation, and external testing), the model consistently outperformed existing tools in terms of discrimination, calibration, and clinical utility, demonstrating robust stability and generalizability. Lastly, while liver biopsy remains the diagnostic gold standard, our web-based, non-invasive model offers a feasible alternative for large-scale screening and risk stratification in T2DM management.
Nevertheless, this study has several limitations. (1) Although the study included a large sample size of 4,726 patients, the data were derived from a single source. Future multicenter prospective studies with larger cohorts are needed to further validate the model's performance across diverse populations. (2) Some predictors in the model (e.g., age and diabetes duration) were collected via questionnaires, which may introduce recall bias affecting prediction accuracy. (3) Beyond the eight identified predictors, additional factors including dietary patterns, sleep duration, ethnicity, medication use, and comorbidities may influence hepatic metabolism and should be incorporated in future research. (4) The final model contains relatively numerous variables. Developing more parsimonious models with fewer predictors while maintaining high performance warrants further investigation. (5) This study has a retrospective design, and the ultrasound assessments were conducted jointly by physicians; therefore, inter-observer agreement data could not be provided. Future studies should incorporate formal inter-observer agreement analysis for further validation. (6) It is important to acknowledge that the performance of our prediction model is inherently limited by the diagnostic accuracy of abdominal ultrasound for hepatic steatosis. (7) The Fatty Liver Index (FLI) could not be included in the model comparison due to insufficient WC data in our datasets. (8) This model was developed and validated in hospitalized patients with T2DM, who have fundamentally distinct clinical characteristics from outpatients. Therefore, its applicability is currently limited to inpatient risk stratification, and future validation in outpatient cohorts is warranted to assess generalizability.
5 Conclusions
In summary, this study developed and validated a clinically applicable prediction model for assessing MASLD risk in patients with T2DM, presented as an interactive web-based tool. The model demonstrated strong discriminative power, calibration, and clinical utility across multiple cohorts. Owing to its simplicity, accessibility, and generalizability, this tool may facilitate early identification of high-risk individuals, thereby improving risk stratification and guiding timely interventions in the management of T2DM.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding authors.
Ethics statement
The studies involving humans were approved by the Ethics Committee of the First Hospital of Lanzhou University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
MZ: Resources, Writing – review & editing, Writing – original draft, Formal analysis, Visualization, Methodology, Project administration, Investigation, Supervision. YW: Software, Formal analysis, Writing – original draft, Resources, Methodology, Writing – review & editing, Investigation, Validation, Data curation. ML: Software, Formal analysis, Writing – original draft, Methodology, Writing – review & editing, Investigation, Validation, Data curation. QS: Software, Formal analysis, Writing – original draft, Resources, Methodology, Writing – review & editing, Validation, Data curation. XW: Data curation, Project administration, Writing – original draft, Conceptualization, Methodology, Visualization, Investigation, Resources, Writing – review & editing. YZ: Data curation, Validation, Supervision, Investigation, Writing – review & editing, Software, Formal analysis, Resources, Writing – original draft.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This research was supported by the National Natural Science Foundation of China (grant numbers 82460732, 82060676, 81960680, and U21A20424), the Natural Science Foundation of Gansu (no. 21ZD4FA014), and the First Hospital of Lanzhou University intramural fund (ldyyyn2019-30 and ldyyyn2022-89). The Engineering Research Centre of Prevention and Control for Clinical Medication Risk, Gansu Province, supported the study. This research was also supported by the Department of Laboratory Medicine of the First Hospital of Lanzhou University, the Gansu Provincial Engineering and Technology Research Centre for Clinical Medication Risk Prevention and Control and Lanzhou Institute of Chemical Physics, Chinese Academy of Sciences.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnut.2025.1692394/full#supplementary-material
References
1. Sun H, Saeedi P, Karuranga S, Pinkepank M, Ogurtsova K, Duncan BB, et al. IDF Diabetes Atlas: global, regional and country-level diabetes prevalence estimates for 2021 and projections for 2045. Diabetes Res Clin Pract. (2022) 183:109119. doi: 10.1016/j.diabres.2021.109119
2. Pearson ER. Type 2 diabetes: a multifaceted disease. Diabetologia. (2019) 62:1107–12. doi: 10.1007/s00125-019-4909-y
3. Bedi O, Aggarwal S, Trehanpati N, Ramakrishna G, Krishan P. Molecular and pathological events involved in the pathogenesis of diabetes-associated nonalcoholic fatty liver disease. J Clin Exp Hepatol. (2019) 9:607–18. doi: 10.1016/j.jceh.2018.10.004
4. Targher G, Byrne CD, Tilg H. MASLD: a systemic metabolic disorder with cardiovascular and malignant complications. Gut. (2024) 73:691–702. doi: 10.1136/gutjnl-2023-330595
5. Qi X, Li J, Caussy C, Teng GJ, Loomba R. Epidemiology, screening, and co-management of type 2 diabetes mellitus and metabolic dysfunction-associated steatotic liver disease. Hepatology. (2024). doi: 10.1097/HEP.0000000000000913. [Epub ahead of print].
6. Scoditti E, Sabatini S, Carli F, Gastaldelli A. Hepatic glucose metabolism in the steatotic liver. Nat Rev Gastroenterol Hepatol. (2024) 21:319–34. doi: 10.1038/s41575-023-00888-8
7. Chen SC, Tsai SP, Jhao JY, Jiang WK, Tsao CK, Chang LY. Liver fat, hepatic enzymes, alkaline phosphatase and the risk of incident type 2 diabetes: a prospective study of 132,377 adults. Sci Rep. (2017) 7:4649. doi: 10.1038/s41598-017-04631-7
8. Zoppini G, Fedeli U, Gennaro N, Saugo M, Targher G, Bonora E. Mortality from chronic liver diseases in diabetes. Am J Gastroenterol. (2014) 109:1020–5. doi: 10.1038/ajg.2014.132
9. Yamazaki H, Tsuboya T, Tsuji K, Dohke M, Maguchi H. Independent association between improvement of nonalcoholic fatty liver disease and reduced incidence of type 2 diabetes. Diabetes Care. (2015) 38:1673–9. doi: 10.2337/dc15-0140
10. Newsome PN, Loomba R. Therapeutic horizons in metabolic dysfunction-associated steatohepatitis. J Clin Invest. (2025) 135:e186425. doi: 10.1172/JCI186425
11. Selvaraj EA, Mózes FE, Jayaswal ANA, Zafarmand MH, Vali Y, Lee JA, et al. Diagnostic accuracy of elastography and magnetic resonance imaging in patients with NAFLD: a systematic review and meta-analysis. J Hepatol. (2021) 75:770–85. doi: 10.1016/j.jhep.2021.04.044
12. Sarkar S, Matsukuma KE, Spencer B, Chen S, Olson KA, Badawi RD, et al. Dynamic positron emission tomography/computed tomography imaging correlate of nonalcoholic steatohepatitis. Clin Gastroenterol Hepatol. (2021) 19:2441–43. doi: 10.1016/j.cgh.2020.10.029
13. Oh JH, Ahn SB, Cho S, Nah EH, Yoon EL, Jun DW. Diagnostic performance of non-invasive tests in patients with MetALD in a health check-up cohort. J Hepatol. (2024) 81:772–80. doi: 10.1016/j.jhep.2024.05.042
14. Petroff D, Blank V, Newsome PN. Shalimar, Voican CS, Thiele M, et al. Assessment of hepatic steatosis by controlled attenuation parameter using the M and XL probes: an individual patient data meta-analysis. Lancet Gastroenterol Hepatol. (2021) 6:185–98. doi: 10.1016/S2468-1253(20)30357-5
15. Chowdhury AB, Mehta KJ. Liver biopsy for assessment of chronic liver diseases: a synopsis. Clin Exp Med. (2023) 23:273–85. doi: 10.1007/s10238-022-00799-z
16. Long MT, Pedley A, Colantonio LD, Massaro JM, Hoffmann U, Muntner P, et al. Development and validation of the Framingham steatosis index to identify persons with hepatic steatosis. Clin Gastroenterol Hepatol. (2016) 14:1172–80.e2. doi: 10.1016/j.cgh.2016.03.034
17. Zheng K, Yin Y, Guo H, Ma L, Liu R, Zhao T, et al. Association between the ZJU index and risk of new-onset non-alcoholic fatty liver disease in non-obese participants: a Chinese longitudinal prospective cohort study. Front Endocrinol. (2024) 15:1340644. doi: 10.3389/fendo.2024.1340644
18. Chang JW, Lee HW, Kim BK, Park JY, Kim DY, Ahn SH, et al. Hepatic steatosis index in the detection of fatty liver in patients with chronic hepatitis B receiving antiviral therapy. Gut Liver. (2021) 15:117–27. doi: 10.5009/gnl19301
19. Zhao J, Fan H, Wang T, Yu B, Mao S, Wang X, et al. TyG index is positively associated with risk of CHD and coronary atherosclerosis severity among NAFLD patients. Cardiovasc Diabetol. (2022) 21:123. doi: 10.1186/s12933-022-01548-y
20. Yetim A, Sahin M, Kandemir I, Bulakçi B, Aksakal MT, Karapinar E, et al. Evaluation of the ability of insulin resistance and lipid-related indices to predict the presence of NAFLD in obese adolescents. Lipids Health Dis. (2024) 23:208. doi: 10.1186/s12944-024-02144-7
21. Alberti KG, Zimmet PZ. Definition, diagnosis and classification of diabetes mellitus and its complications. Part 1: diagnosis and classification of diabetes mellitus provisional report of a WHO consultation. Diabet Med. (1998) 15:539–53. doi: 10.1002/(SICI)1096-9136(199807)15:7<539::AID-DIA668>3.0.CO;2-S
22. Collins GS, Moons KGM, Dhiman P, Riley RD, Beam AL, Van Calster B, et al. TRIPOD+AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods. Bmj. (2024) 385:e078378. doi: 10.1136/bmj-2023-078378
23. Slieker RC, van der Heijden A, Siddiqui MK, Langendoen-Gort M, Nijpels G, Herings R, et al. Performance of prediction models for nephropathy in people with type 2 diabetes: systematic review and external validation study. Bmj. (2021) 374:n2134. doi: 10.1136/bmj.n2134
24. Chan WK, Chuah KH, Rajaram RB, Lim LL, Ratnasingam J, Vethakkan SR. Metabolic dysfunction-associated steatotic liver disease (MASLD): a state-of-the-art review. J Obes Metab Syndr. (2023) 32:197–213. doi: 10.7570/jomes23052
25. Colantonio LD, Booth JN, 3rd, Bress AP, Whelton PK, Shimbo D, et al. 2017 ACC/AHA blood pressure treatment guideline recommendations and cardiovascular risk. J Am Coll Cardiol. (2018) 72:1187–97. doi: 10.1016/j.jacc.2018.05.074
26. Madero M, Levin A, Ahmed SB, Carrero JJ, Foster B, Francis A, et al. Evaluation and management of chronic kidney disease: synopsis of the kidney disease: improving global outcomes 2024 clinical practice guideline. Ann Intern Med. (2025) 178:705–13. doi: 10.7326/ANNALS-24-01926
27. Pop-Busui R, Boulton AJ, Feldman EL, Bril V, Freeman R, Malik RA, et al. Diabetic neuropathy: a position statement by the American diabetes association. Diabetes Care. (2017) 40:136–54. doi: 10.2337/dc16-2042
28. Yang SL, Zhu LY, Han R, Sun LL Li JX, Dou JT. Pathophysiology of peripheral arterial disease in diabetes mellitus. J Diabetes. (2017) 9:133–40. doi: 10.1111/1753-0407.12474
29. Pimpin L, Cortez-Pinto H, Negro F, Corbould E, Lazarus JV, Webber L, et al. Burden of liver disease in Europe: epidemiology and analysis of risk factors to identify prevention policies. J Hepatol. (2018) 69:718–35. doi: 10.1016/j.jhep.2018.05.011
30. Targher G, Corey KE, Byrne CD, Roden M. The complex link between NAFLD and type 2 diabetes mellitus - mechanisms and treatments. Nat Rev Gastroenterol Hepatol. (2021) 18:599–612. doi: 10.1038/s41575-021-00448-y
31. Adams LA, Harmsen S, St Sauver JL, Charatcharoenwitthaya P, Enders FB, Therneau T, et al. Nonalcoholic fatty liver disease increases risk of death among patients with diabetes: a community-based cohort study. Am J Gastroenterol. (2010) 105:1567–73. doi: 10.1038/ajg.2010.18
32. Lee JH, Kim D, Kim HJ, Lee CH, Yang JI, Kim W, et al. Hepatic steatosis index: a simple screening tool reflecting nonalcoholic fatty liver disease. Dig Liver Dis. (2010) 42:503–8. doi: 10.1016/j.dld.2009.08.002
33. Song K, Lee HW, Choi HS, Park G, Lee HS, Kim SJ, et al. Comparison of the modified TyG indices and other parameters to predict non-alcoholic fatty liver disease in youth. Biology. (2022) 11:685. doi: 10.3390/biology11050685
34. Huang G, Jin Q, Mao Y. Predicting the 5-year risk of nonalcoholic fatty liver disease using machine learning models: prospective cohort study. J Med Internet Res. (2023) 25:e46891. doi: 10.2196/46891
35. Xue M, Yang X, Zou Y, Liu T, Su Y, Li C, et al. A non-invasive prediction model for non-alcoholic fatty liver disease in adults with type 2 diabetes based on the population of Northern Urumqi, China. Diabetes Metab Syndr Obes. (2021) 14:443–54. doi: 10.2147/DMSO.S271882
36. Zhang Y, Shi R, Yu L, Ji L, Li M, Hu F. Establishment of a risk prediction model for non-alcoholic fatty liver disease in type 2 diabetes. Diabetes Ther. (2020) 11:2057–73. doi: 10.1007/s13300-020-00893-z
37. Chew NWS, Ng CH, Tan DJH, Kong G, Lin C, Chin YH, et al. The global burden of metabolic disease: Data from 2000 to 2019. Cell Metab. (2023) 35: 414–28.e3. doi: 10.1016/j.cmet.2023.02.003
38. Kalyani RR, Lazo M, Ouyang P, Turkbey E, Chevalier K, Brancati F, et al. Sex differences in diabetes and risk of incident coronary artery disease in healthy young and middle-aged adults. Diabetes Care. (2014) 37:830–8. doi: 10.2337/dc13-1755
39. Succurro E, Marini MA, Fiorentino TV, Perticone M, Sciacqua A, Andreozzi F, et al. Sex-specific differences in prevalence of nonalcoholic fatty liver disease in subjects with prediabetes and type 2 diabetes. Diabetes Res Clin Pract. (2022) 190:110027. doi: 10.1016/j.diabres.2022.110027
40. Park SH, Jeon WK, Kim SH, Kim HJ, Park DI, Cho YK, et al. Prevalence and risk factors of non-alcoholic fatty liver disease among Korean adults. J Gastroenterol Hepatol. (2006) 21:138–43. doi: 10.1111/j.1440-1746.2005.04086.x
41. De Ponti FF, Liu Z, Scott CL. Understanding the complex macrophage landscape in MASLD. JHEP Rep. (2024) 6:101196. doi: 10.1016/j.jhepr.2024.101196
42. Elmaleh-Sachs A, Schwartz JL, Bramante CT, Nicklas JM, Gudzune KA, Jay M. Obesity management in adults: a review. Jama. (2023) 330:2000–15. doi: 10.1001/jama.2023.19897
43. Herath HMM, Kodikara I, Weerarathna TP, Liyanage G. Prevalence and associations of non-alcoholic fatty liver disease (NAFLD) in Sri Lankan patients with type 2 diabetes: a single center study. Diabetes Metab Syndr. (2019) 13:246–50. doi: 10.1016/j.dsx.2018.09.002
44. Xiao Y, Liu Y, Zhao L, Zhou Y. Effect of 5:2 fasting diet on liver fat content in patients with type 2 diabetic with nonalcoholic fatty liver disease. Metab Syndr Relat Disord. (2022) 20:459–65. doi: 10.1089/met.2022.0014
45. Deng M, Wen Y, Yan J, Fan Y, Wang Z, Zhang R, et al. Comparative effectiveness of multiple different treatment regimens for nonalcoholic fatty liver disease with type 2 diabetes mellitus: a systematic review and Bayesian network meta-analysis of randomised controlled trials. BMC Med. (2023) 21:447. doi: 10.1186/s12916-023-03129-6
46. Chaim FDM, Pascoal LB, Chaim FHM, Palma BB, Damázio TA, da Costa LBE, et al. Histological grading evaluation of non-alcoholic fatty liver disease after bariatric surgery: a retrospective and longitudinal observational cohort study. Sci Rep. (2020) 10:8496. doi: 10.1038/s41598-020-65556-2
47. Hsu H, Hsu P, Cheng MH, Ito Y, Kanda E, Schaefer EJ, et al. Lipoprotein subfractions and glucose homeostasis in prediabetes and diabetes in Taiwan. J Atheroscler Thromb. (2019) 26:890–914. doi: 10.5551/jat.48330
48. Li Y, Yang P, Ye J, Xu Q, Wu J, Wang Y. Updated mechanisms of MASLD pathogenesis. Lipids Health Dis. (2024) 23:117. doi: 10.1186/s12944-024-02108-x
49. Verwer BJ, Scheffer PG, Vermue RP, Pouwels PJ, Diamant M, Tushuizen ME, et al. Is related to post-prandial triglyceride-enrichment of HDL particles in association with endothelial and HDL dysfunction. Liver Int. (2020) 40:2439–44. doi: 10.1111/liv.14597
50. Liu J, Fu Q, Su R, Liu R, Wu S, Li K, et al. Association between nontraditional lipid parameters and the risk of type 2 diabetes and prediabetes in patients with nonalcoholic fatty liver disease: from the national health and nutrition examination survey 2017-2020. Front Endocrinol. (2024) 15:1460280. doi: 10.3389/fendo.2024.1460280
51. Li R, Kong D, Ye Z, Zong G, Hu K, Xu W, et al. Correlation of multiple lipid and lipoprotein ratios with nonalcoholic fatty liver disease in patients with newly diagnosed type 2 diabetic mellitus: a retrospective study. Front Endocrinol. (2023) 14:1127134. doi: 10.3389/fendo.2023.1127134
52. Mikolasevic I, Domislovic V, Turk Wensveen T, Delija B, Klapan M, Juric T, et al. Screening for nonalcoholic fatty liver disease in patients with type 2 diabetes mellitus using transient elastography - a prospective, cross sectional study. Eur J Intern Med. (2020) 82:68–75. doi: 10.1016/j.ejim.2020.08.005
53. Cen C, Wang W, Yu S, Tang X, Liu J, Liu Y, et al. Development and validation of a clinical and laboratory-based nomogram to predict nonalcoholic fatty liver disease. Hepatol Int. (2020) 14:808–16. doi: 10.1007/s12072-020-10065-7
54. Mor A, Svensson E, Rungby J, Ulrichsen SP, Berencsi K, Nielsen JS, et al. Modifiable clinical and lifestyle factors are associated with elevated alanine aminotransferase levels in newly diagnosed type 2 diabetes patients: results from the nationwide DD2 study. Diabetes Metab Res Rev. (2014) 30:707–15. doi: 10.1002/dmrr.2539
55. Mansour A, Motamed S, Hekmatdoost A, Karimi S, Mohajeri-Tehrani MR, Abdollahi M, et al. Factors related to hypermetabolism in individuals with type 2 diabetes mellitus and non-alcoholic fatty liver disease. Sci Rep. (2023) 13:3669. doi: 10.1038/s41598-023-30945-w
56. Jin M, Zhu S, Hou Y. Insight on serum albumin: from structure and biological properties to functional biomaterials for bone repair. ACS Biomater Sci Eng. (2023) 9:2235–50. doi: 10.1021/acsbiomaterials.2c01553
57. Chang DC, Xu X, Ferrante AW Jr, Krakoff J. Reduced plasma albumin predicts type 2 diabetes and is associated with greater adipose tissue macrophage content and activation. Diabetol Metab Syndr. (2019) 11:14. doi: 10.1186/s13098-019-0409-y
Keywords: type 2 diabetes mellitus, metabolic dysfunction-associated steatotic liver disease, diagnostic predictive nomogram, NHANES database, clinical practice
Citation: Zhang M, Wang Y, Liu M, Shi Q, Wu X and Zhou Y (2025) Establishment and evaluation of a novel practical tool for the screening of metabolic dysfunction-associated steatotic liver disease in patients with type 2 diabetes mellitus. Front. Nutr. 12:1692394. doi: 10.3389/fnut.2025.1692394
Received: 25 August 2025; Revised: 17 November 2025;
Accepted: 27 November 2025; Published: 12 December 2025.
Edited by:
Berenice M. Román Calleja, National Institute of Medical Sciences and Nutrition Salvador Zubirán (INCMNSZ), MexicoReviewed by:
Azam Doustmohammadian, Iran University of Medical Sciences, IranCarlos Fernando Martínez-Cabrera, Centro de Investigación y Gastroenterología, Mexico
Copyright © 2025 Zhang, Wang, Liu, Shi, Wu and Zhou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Xin'an Wu, d3V4YUBsenUuZWR1LmNu; Yan Zhou, emhvdXlhbjIxQGx6dS5lZHUuY24=
†These authors have contributed equally to this work
Mingkang Zhang1,2†