- 1Department of Gastroenterology, The Affiliated Hospital of Southwest Medical University, Luzhou Sichuan, China
- 2Department of Critical Care Medicine, The Affiliated Hospital of Southwest Medical University, Luzhou Sichuan, China
- 3Health Management Center, The Affiliated Hospital of Southwest Medical University, Luzhou, Sichuan, China
Introduction: With the rising incidence of metabolic dysfunction-associated fatty liver disease (MAFLD) in the elderly population, this study aimed to develop an optimal screening model by comparing ten different machine learning (ML) algorithms to identify high-risk elderly individuals using routine health examination data.
Methods: The study included 2,635 individuals aged 60 years and older who underwent annual health examinations at the Health Management Center of Southwest Medical University Affiliated Hospital from January to December 2024. Initial feature selection was performed using the least absolute shrinkage and selection operator (LASSO) regression, followed by univariate and multivariate logistic regression analysis to identify nine independent predictive factors. Predictive models were constructed using 10 ML algorithms, and model performance was evaluated based on discriminative ability, calibration ability, and clinical utility. Feature importance was visualized and individual-level interpretability was provided using the Shapley Additive exPlanations (SHAP) method.
Results: The final analysis included nine variables. After 10-fold cross-validation and hyperparameter tuning, the Random Forest (RF) model performed best, achieving an area under the curve (AUC) of 0.892 (95% CI: 0.870–0.914) in the validation cohort. Feature importance analysis revealed that the TyG-BMI index, height, and albumin levels played significant roles in predicting MAFLD risk.
Discussion: Machine learning models, particularly the random forest algorithm, can effectively predict the risk of MAFLD in the elderly population. These models may assist clinicians in early screening and intervention, thereby improving patient outcomes.
Introduction
Metabolic-associated fatty liver disease (MAFLD), formerly known as non-alcoholic fatty liver disease (NAFLD), is a condition strongly associated with metabolic dysfunction, including obesity, type 2 diabetes mellitus, insulin resistance, and metabolic syndrome (1). With the global acceleration of population aging, the prevalence of MAFLD is rising among older adults (2).
Metabolic-associated fatty liver disease not only impairs liver function but is also closely linked to a range of extrahepatic complications. Studies have shown that MAFLD significantly increases the risk of both fatal and non-fatal cardiovascular events, and patients with MAFLD are more likely to develop chronic kidney disease and type 2 diabetes compared to healthy individuals (3–5). Moreover, MAFLD can progress to non-alcoholic steatohepatitis (NASH), liver fibrosis, cirrhosis, or even hepatocellular carcinoma (HCC), posing a serious threat to patients’ health and survival (6). These complications are more prevalent in the elderly, further exacerbating the disease burden. Therefore, early identification of MAFLD in older adults is crucial for reducing healthcare costs, improving prognosis, and enhancing quality of life.
Abdominal ultrasonography is a widely used diagnostic method for detecting hepatic steatosis and offers high accuracy in identifying moderate to severe fatty liver. However, its sensitivity is limited for mild cases and is highly dependent on the operator’s expertise and interpretation (7). Liver biopsy remains the gold standard for diagnosing MAFLD, as it allows for direct histological assessment of hepatic pathology and severity. Nevertheless, due to its invasive nature, high cost, and low feasibility in routine screening, especially among older adults with multiple comorbidities, its clinical applicability is limited (8, 9). In addition, many MAFLD patients—especially the elderly—may remain asymptomatic in the early stages, making timely and accurate diagnosis particularly challenging.
Machine learning (ML) has emerged as a powerful predictive tool in the field of medicine (10–12). Unlike traditional statistical models, which rely on predefined assumptions and explicit mathematical formulations, ML makes no assumptions about the underlying data structure. It is capable of analyzing high-dimensional data and capturing complex nonlinear relationships. Furthermore, the use of SHapley Additive exPlanations (SHAP) enhances the interpretability of ML models by combining optimal credit allocation with local interpretability (13). As a result, ML is increasingly applied in clinical diagnostic research.
This study aims to develop and validate machine learning models to predict the risk of MAFLD among older adults, utilizing SHAP to visualize and interpret key predictors. The goal is to assist clinicians in identifying high-risk individuals and supporting early clinical interventions.
Methods
Participants
This cross-sectional study was conducted between January 2024 and December 2024 at the Health Management Center of the Affiliated Hospital of Southwest Medical University. The study population comprised older adults who underwent annual health examinations, including abdominal ultrasonography. Inclusion criteria were as follows: (1) age ≥ 60 years; (2) completion of abdominal ultrasound examination; and (3) availability of complete clinical data. Exclusion criteria included: (1) age < 60 years; (2) a confirmed history of liver diseases or previous liver surgery, such as primary hepatocellular carcinoma, large hepatic cysts, or cirrhosis; and (3) incomplete clinical data. Based on these criteria, a total of 3,175 individuals with complete abdominal ultrasound data were initially assessed. After excluding 383 cases with missing data and 157 cases with major liver diseases, 2,635 participants were included in the final analysis. Among them, 1,693 were male (64.25%) and 942 were female (35.75%), with a mean age of 67.79 ± 7.07 years. Of the total participants, 878 (33.32%) were diagnosed with MAFLD and 1,757 (66.68%) were non-MAFLD. The diagnosis of MAFLD was based on ultrasonographic findings consistent with hepatic steatosis. All procedures complied with relevant ethical regulations and guidelines. All procedures in this study were conducted in accordance with the relevant guidelines and regulations. Due to the retrospective nature of the study, the requirement for written informed consent was waived. The study was approved by the Ethics Committee of the Affiliated Hospital of Southwest Medical University (Approval No. KY2025195).
Data collection
Demographic, anthropometric, medical history, and laboratory data were extracted from the hospital’s electronic medical examination system. The collected variables included: Demographic Data: Age and sex. Anthropometric Measurements: Body mass index (BMI), systolic blood pressure (SBP), diastolic blood pressure (DBP), waist circumference (WC), hip circumference (HC), waist-to-hip ratio (WHR), height, and weight. Medical History: History of diabetes and history of hypertension (self-reported or clinically documented). Laboratory Tests: γ-glutamyl transpeptidase (GGT), alanine aminotransferase (ALT), aspartate aminotransferase (AST), AST/ALT ratio, low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol (HDL-C), total cholesterol (TC), total bilirubin (TBIL), direct bilirubin (DBIL), indirect bilirubin (IBIL), total protein (TP), globulin (GLO), triglycerides (TG), albumin (ALB), albumin-to-globulin ratio (A/G), and fasting plasma glucose (FPG). In addition, the triglyceride-glucose index (TyG) and its related parameters were calculated using the following formulas (14, 15):
TyG index = ln [TG (mg/dL) × FPG (mg/dL)/2].
TyG-BMI = TyG × BMI.
TyG-WC = TyG × WC.
TyG-WHR = TyG × WHR.
Diagnostic criteria for MAFLD
In this study, all enrolled participants underwent abdominal ultrasonography performed by experienced radiologists at a tertiary medical center. The diagnosis of hepatic steatosis was primarily based on the following sonographic features: increased hepatic echogenicity (“bright liver”) and/or unclear visualization of intrahepatic structures (16). The diagnosis of metabolic dysfunction-associated fatty liver disease (MAFLD) was established based on the presence of hepatic steatosis on ultrasound in addition to at least one of the following three criteria (17): Overweight or obesity (defined as BMI ≥ 23 kg/m2 for Asian populations); Type 2 diabetes mellitus; Lean or normal weight (BMI < 23 kg/m2 for Asian populations) with the presence of two or more of the following metabolic risk abnormalities: (1) Waist circumference (WC) ≥ 90 cm in men or ≥ 80 cm in women; (2) Blood pressure ≥ 130/85 mmHg or under antihypertensive treatment; (3) Triglycerides (TG) ≥ 1.70 mmol/L or receiving lipid-lowering therapy; (4) HDL-C < 1.0 mmol/L in men or < 1.3 mmol/L in women, or receiving specific treatment; (5) Prediabetes (FPG 5.6–6.9 mmol/L or HbA1c 5.7–6.4%); (6) Homeostasis Model Assessment of Insulin Resistance (HOMA-IR) ≥ 2.5; (7) High-sensitivity C-reactive protein (hs-CRP) ≥ 2 mg/L.
Statistical analysis and model development
All statistical analyses were conducted using R software (version 4.4.2), with a two-tailed p-value < 0.05 considered statistically significant. Continuous variables were expressed as mean ± standard deviation if normally distributed, or as median (interquartile range) if not. Group comparisons were performed using the t-test for normally distributed variables and the Mann–Whitney U test for non-normally distributed variables. Categorical variables were presented as frequencies (percentages) and compared using the chi-square test or Fisher’s exact test, as appropriate. We examined the missing rates of all variables included in the study. To ensure the accuracy and stability of the model, variables with a missing rate exceeding 10% were excluded from the analysis, while missing data for the remaining variables were imputed using the Multiple Imputation by Chained Equations (MICE) method.
In this study, we used a stratified random sampling method to divide the dataset into a training set and a validation set. All participants were first stratified according to their MAFLD status, and then randomly assigned within each stratum to either the training set (70%) or the validation set (30%). The training set consisted of 1,844 individuals, and the validation set included 791 individuals. The training set was used for model development, while the validation set was used to evaluate model performance. Comparability between the two datasets was assessed, and no statistically significant differences were observed (p > 0.05). Variable selection was initially performed using least absolute shrinkage and selection operator (LASSO) regression on the training set. LASSO regression was implemented with the glmnet package in R, incorporating L1 regularization to penalize model complexity by shrinking some coefficients to zero, thereby achieving feature selection. The issue of class imbalance was addressed by introducing the Synthetic Minority Over-sampling Technique (SMOTE) algorithm (18). Subsequently, variables were further filtered through univariate logistic regression followed by multivariate logistic regression, resulting in the identification of nine independent predictors. The variance inflation factor (VIF) was calculated for each variable, and all VIF values were below 5, indicating no significant multicollinearity. To further eliminate the impact of multicollinearity on variable selection, we calculated the Pearson correlation coefficient between TyG-BMI and BMI, which was found to be 0.842. According to relevant literature (19–21), when the Pearson correlation coefficient exceeds 0.85, it is necessary to exclude one of the variables that has a weaker association with the outcome. Therefore, after comprehensive consideration, this study retains both TyG-BMI and BMI. The flowchart of this study is shown in Figure 1.
Based on a comprehensive consideration of methodological diversity, predictive performance, and clinical interpretability. Ten machine learning algorithms were employed to construct predictive models, including logistic regression (LR), support vector machine (SVM), gradient boosting machine (GBM), neural network (NN), random forest (RF), extreme gradient boosting (XGBoost), k-nearest neighbor (KNN), adaptive boosting (AdaBoost), light gradient boosting machine (LightGBM), and categorical boosting (CatBoost). Ten-fold cross-validation was used to ensure model robustness, and grid search was applied to optimize the hyperparameters of each algorithm.
Model evaluation and interpretability
During hyperparameter tuning, the model with the highest area under the receiver operating characteristic (ROC) curve (AUC) was selected as the optimal model. The model was developed using the training set and internally validated using the optimal model. Model performance was evaluated based on AUC, sensitivity, specificity, F1-score, accuracy, precision, and Brier score. Additionally, calibration curves and decision curve analysis (DCA) were plotted to assess the model’s calibration and to demonstrate its potential clinical utility. To enhance model interpretability, SHapley Additive exPlanations (SHAP) were used to generate summary plots, waterfall plots, force plots, and feature importance rankings. This approach quantitatively illustrates the contribution of each feature to the model’s predictions (22, 23), thereby improving transparency and offering insight into how individual variables influence the model output.
Results
Baseline characteristics
All older adults were randomly divided into a training set (n = 1844, 70%) and a validation set (n = 791, 30%). Except for the variable hip circumference, no statistically significant differences were observed in baseline characteristics between the two groups (p > 0.05), indicating a balanced distribution of covariates (Table 1 and see Supplementary Material 1 for detailed information). Among the participants in the training set, 619 were diagnosed with MAFLD, yielding a prevalence rate of 33.57%. Significant differences in baseline characteristics were found between the MAFLD and non-MAFLD groups. Older adults with MAFLD exhibited notably abnormal metabolic indicators, including elevated levels of blood glucose, blood lipids, BMI, and liver function markers. Moreover, the prevalence of hypertension and diabetes was significantly higher in the MAFLD group compared to the non-MAFLD group (Table 2).
Predictor selection
Based on cross-validation of the least absolute shrinkage and selection operator (LASSO) regression, two regularization parameters (λ) were determined: λ.min (0.002995174) and λ.1se (0.01101739). To achieve an optimal balance between model complexity and predictive accuracy, λ.1se (0.01101739)—which corresponded to the minimum cross-validation error—was selected as the optimal parameter. A total of 13 predictors were initially selected in the training set: sex, diabetes, AST/ALT, ALT, ALB, A/G, DBIL, HDL-C, TyG-BMI, WHR, BMI, SBP, and height. The LASSO selection process is illustrated in Figure 2. Subsequently, univariate and multivariate logistic regression analyses were performed to further refine the variable selection, and 9 independent predictors were ultimately identified: diabetes, ALT, ALB, A/G, HDL-C, TyG-BMI, BMI, SBP, and height (Table 3). Variance inflation factor (VIF) values were calculated for all variables, with all values below 5, indicating the absence of multicollinearity among predictors.
Figure 2. Clinical feature selection via the lasso regression model. (A) The partial likelihood deviance (binomial deviance) curve was plotted vs. log (lambda). The dotted vertical lines represent the optimal predictors using the minimum criteria (min. criteria) and the 1 SE of the minimum criteria (1-SE criteria). (B) Lasso coefficients of a total of 13 clinical features. Dynamic process diagram of lasso screening variables.
Model development and performance evaluation
In this study, 10 machine learning models were developed to assess the risk of MAFLD among older adults. A 10-fold cross-validation with grid search was applied to obtain the optimal hyperparameters for nine machine learning algorithms (excluding logistic regression, LR). Detailed information on the optimal hyperparameters for each model is available in Supplementary Material 1. Risk prediction models were subsequently constructed based on the optimal hyperparameters for each algorithm. The area under the receiver operating characteristic curve (AUC) was first used as the primary metric to evaluate model discrimination. In the validation set, the AUC values for each model were as follows: LR (0.884), SVM (0.887), GBM (0.889), NN (0.859), RF (0.892), XGBoost (0.876), KNN (0.867), Adaboost (0.822), LightGBM (0.854), and CatBoost (0.889). Among these, the random forest (RF) model demonstrated the best discriminatory performance. Further evaluation of model performance included accuracy, sensitivity, specificity, precision, F1 score, and Brier score. Detailed metrics for all 10 models are presented in Table 4. Notably, the RF model achieved the highest F1 score (0.739) and sensitivity (0.919), along with the lowest Brier score (0.125), indicating excellent predictive capability and calibration. Additionally, calibration curves and decision curve analysis (DCA) were plotted to assess the models’ calibration and clinical utility in both the training and validation sets (see ROC curves, calibration curves, and DCA in Figure 3). Taking all performance metrics into account, the RF model demonstrated the best overall performance, with strong calibration and clinical applicability, making it the most suitable predictive model in this study.
Figure 3. Comparison of the ROC curves for 10 machine learning models. (A) Comparison of ROC curves in the training set, (B) comparison of ROC curves on the validation set. (C) Comparison of calibration curves in the training set, (D) comparison of calibration curves on the validation set. (E) Comparison of DCA in the training set, (F) comparison of DCA on the validation set. LR, logistic regression; SVM, support vector machine; GBM, Gradient Boosting Machine; NN, NeuralNetwork; RF, random forest; XGBoost, eXtreme Gradient Boosting; KNN, K-Nearest Neighbor; Adaboost, Adaptive Boosting; LightGBM, Light Gradient Boosting Machine; CatBoost, Categorical Boosting.
Model interpretability
To further interpret the results of the RF model, SHAP (SHapley Additive exPlanations) value-based visualizations were employed. As shown in Figure 4A, a summary (beeswarm) plot illustrates the distribution of SHAP values for each feature. In this plot, each point represents an individual patient; the X-axis indicates the magnitude and direction of the feature’s impact on the model output, while the Y-axis ranks the features by importance. Features positioned higher on the Y-axis have a greater influence on model predictions. The analysis identified nine key predictors for MAFLD in older adults: TyG-BMI, height, ALB, BMI, A/G, ALT, HDL-C, SBP, and diabetes. Among them, TyG-BMI, height, and ALB were the top three contributors to model predictions. Figures 4B,C present a detailed case study using SHAP waterfall and force plots to illustrate the prediction process for a specific individual. The waterfall plot reveals how the model prediction is formed by sequentially adding the SHAP values of individual features to a baseline value. The force plot offers a more intuitive visual summary of the collective “push and pull” effect of features on the prediction outcome for that patient. Additionally, Figure 4D displays a bar chart of feature importance ranked by their mean absolute SHAP values, clearly highlighting the relative contribution of each variable to the RF model. Features appearing at the top of the chart exert the most significant influence on the model’s predictions.
Figure 4. (A) Hive plot of the SHAP values of the model constructed by the RF algorithm. Vertical coordinates show the importance of the features, sorted in descending order of variable importance, while the variables above are more important to the model. For horizontal positions, the ‘Shap value’ shows whether the effect of this value is related to higher or lower predictions. The color of each SHAP value point indicates whether the observed value is high (yellow) or low (purple). (B) The waterfall plot of SHAP values for the model constructed by the RF algorithm. (C) SHAP value force plot of the model constructed using the RF algorithm. (D) The SHAP variable importance ranking plot of the model constructed using the RF algorithm.
Discussion
Metabolic-associated fatty liver disease (MAFLD) has a global prevalence of 38.77%, affecting more than one-third of the world’s population (24). A systematic review and meta-analysis forecast that by 2030, approximately 314.58 million people in China will be diagnosed with MAFLD (25). MAFLD has become an increasingly serious public health issue, imposing significant socioeconomic burdens. Epidemiological evidence indicates that the prevalence of MAFLD exhibits a distinct age-dependent pattern, with elderly individuals bearing a substantially higher burden of risk factors (26). Therefore, this study aims to develop machine learning models to enable early identification of high-risk elderly populations with MAFLD, thereby reducing medical and socioeconomic costs.
Our study identified TyG-BMI, height, albumin (ALB), body mass index (BMI), albumin/globulin ratio (A/G), alanine aminotransferase (ALT), systolic blood pressure (SBP), and diabetes as risk factors for MAFLD in the elderly, while high-density lipoprotein cholesterol (HDL-C) served as a protective factor. SHAP visualization further highlighted TyG-BMI, height, and ALB as the three most critical independent predictors.
TyG-BMI, a widely studied marker of metabolic dysregulation in recent years, integrates triglycerides (TG), fasting plasma glucose (FPG), and BMI, providing a comprehensive reflection of insulin resistance and metabolic abnormalities (27). Yang et al. (28) demonstrated a positive association between TyG-BMI and MAFLD, which remained significant after adjustments in multiple models. Additionally, a study based on the U. S. National Health and Nutrition Examination Survey (NHANES) data showed that TyG-BMI was significantly associated with all-cause mortality in MAFLD patients and had strong predictive value across different populations (29). Our findings that TyG-BMI is an independent predictor of MAFLD align with these previous reports.
Height emerged as a key predictor of MAFLD in our study, potentially related to differences in fat distribution among the elderly. Prior studies have shown significant correlations between height and both fat distribution and metabolic dysfunction, with taller individuals generally exhibiting higher basal metabolic rates and healthier fat distribution patterns (30–32). Albumin, synthesized by the liver (33), reflects hepatic synthetic function and reserve capacity. Chen et al. (34) reported that MAFLD patients tend to have lower ALB levels, indicating some degree of hepatic impairment. Li et al. (35) also found that decreased ALB levels were associated with an increased risk of MAFLD, potentially due to ALB’s anti-inflammatory and antioxidant properties. Our results corroborate these findings, confirming ALB as a risk factor for MAFLD in the elderly.
In addition to these three key predictors, BMI, A/G, ALT, SBP, and diabetes were also identified as risk factors for MAFLD in older adults. Studies have established a significant association between BMI and MAFLD risk, with BMI serving as a reliable predictor for MAFLD occurrence (36, 37). Due to hepatic fat accumulation and inflammation, immune activation leads to increased globulin synthesis, resulting in decreased A/G ratio. This change reflects hepatic synthetic function and overall health, indirectly indicating MAFLD risk (38). A prospective cohort study demonstrated that persistently high-normal ALT levels were significantly associated with increased risk of incident MAFLD, underscoring the importance of ALT monitoring for early identification of high-risk individuals (39). Furthermore, numerous studies have reported that MAFLD patients often present with hypertension and diabetes, with SBP ≥ 130 mmHg and diabetes significantly positively correlated with MAFLD risk (40–42).
HDL-C facilitates the transport of cholesterol from peripheral tissues to the liver for metabolism and excretion. One study indicated that low HDL-C levels may increase the risk of liver fibrosis and hepatocellular carcinoma in MAFLD patients, suggesting that higher HDL-C levels might be protective against MAFLD development, consistent with our findings (43).
Among the models developed, random forest (RF) demonstrated superior predictive accuracy and high sensitivity, making it the optimal model for predicting MAFLD risk in elderly populations. RF achieved the highest area under the ROC curve (AUC), with calibration curves closely aligned with the ideal line, and decision curve analysis (DCA) showing maximal net benefit across different threshold probabilities. At the same time, SHAP visualization was used to enhance the model’s interpretability, with the creation of hive plots, force plots, waterfall plots, and importance ranking plots for visual representation. These visualizations highlight how these factors interact and influence the prevalence of MAFLD in the elderly population. This interpretability ensures that the model is a transparent tool that clinicians and researchers can trust.
This study has several limitations. First, the RF model exhibited near-perfect performance on the training set, indicating a risk of overfitting. Although 10-fold cross-validation and regularization techniques were applied, further validation through nested cross-validation, early stopping, ensemble methods, or external validation on larger datasets is needed to ensure model robustness and generalizability. Second, all participants in this study were recruited from the Affiliated Hospital of Southwest Medical University, and the representativeness and regional applicability of the study population require further external validation using multi-center, large-scale clinical data to assess the generalizability of the findings. Third, this study is a cross-sectional study, and all sample data were drawn from the population undergoing health examinations at this hospital in 2024. Data from a single year may be subject to temporal and selection biases and cannot reflect the dynamic progression of the disease over time. Future studies should conduct prospective validation over longer follow-up periods and multiple time points to further ensure the robustness of the model. Fourth, the diagnosis of fatty liver disease in this study was based on abdominal ultrasound findings, which generally provide lower-level evidence compared to liver biopsy or magnetic resonance imaging (MRI). In addition, one of the diagnostic criteria for MAFLD is a plasma high-sensitivity C-reactive protein (hs-CRP) level ≥2 mg/L; however, this parameter was not routinely measured in the examined population. Other important factors affecting MAFLD risk, such as lifestyle habits and dietary patterns, were also not systematically recorded, which may have affected the accuracy of the prediction. Future research should aim to incorporate more comprehensive and detailed data to further enhance model performance and interpretability.
Conclusion
The increasing prevalence of MAFLD among the elderly population has drawn considerable public attention, underscoring the need for large-scale early screening models tailored to this demographic. In this study, 10 machine learning models were developed and their performances compared, with the random forest model identified as the optimal predictor for MAFLD. Furthermore, SHAP visualization was employed to elucidate the interactions between various risk factors and MAFLD. The findings demonstrate that the proposed MAFLD screening model exhibits satisfactory predictive performance, offering a novel, cost-effective approach for the prevention and early detection of MAFLD in the elderly.
Data availability statement
Publicly available datasets were analyzed in this study. This data can be found here: Science Data Bank (ScienceDB), DOI: 10.57760/sciencedb.26204.
Ethics statement
The studies involving humans were approved by Ethics Committee of the Affiliated Hospital of Southwest Medical University, Luzhou, Sichuan Province, China. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because due to the retrospective nature of the study, the requirement for written informed consent was waived. This study utilized de-identified health examination data that were collected as part of routine clinical care. The research posed minimal risk to participants as it involved only the analysis of existing medical records without any additional interventions or procedures. All personal identifiers were removed from the dataset to protect patient privacy, and the study methodology met the criteria for waiver of informed consent as outlined in research ethics guidelines for retrospective studies using anonymized clinical data.
Author contributions
YZ: Data curation, Formal analysis, Writing – review & editing, Writing – original draft, Methodology, Conceptualization, Visualization, Investigation. CY: Methodology, Writing – original draft, Conceptualization, Visualization, Project administration, Data curation, Writing – review & editing. XY: Data curation, Methodology, Investigation, Writing – review & editing. XZ: Investigation, Writing – review & editing, Data curation. GX: Resources, Project administration, Supervision, Conceptualization, Writing – review & editing.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Acknowledgments
We thank the Health Management Center of the Affiliated Hospital of Southwest Medical University for providing us with the data. We thank the patients for allowing us to share their clinical data.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2025.1678076/full#supplementary-material
References
1. Eslam, M, Sanyal, AJ, and George, J. MAFLD: a consensus-driven proposed nomenclature for metabolic associated fatty liver disease. Gastroenterology. (2020) 158:1999–2014.e1. doi: 10.1053/j.gastro.2019.11.312
2. Zeng, J, Qin, L, Jin, Q, Yang, RX, Ning, G, Su, Q, et al. Prevalence and characteristics of MAFLD in Chinese adults aged 40 years or older: a community-based study. Hepatobiliary Pancreat Dis Int. (2022) 21:154–61. doi: 10.1016/j.hbpd.2022.01.006
3. Kasper, P, Martin, A, Lang, S, Kütting, F, Goeser, T, Demir, M, et al. NAFLD and cardiovascular diseases: a clinical review. Clin Res Cardiol. (2021) 110:921–37. doi: 10.1007/s00392-020-01709-7
4. Younossi, ZM, Golabi, P, de Avila, L, Paik, JM, Srishord, M, Fukui, N, et al. The global epidemiology of NAFLD and NASH in patients with type 2 diabetes: A systematic review and meta-analysis. J Hepatol. (2019) 71:793–801. doi: 10.1016/j.jhep.2019.06.021
5. Ciardullo, S, Ballabeni, C, Trevisan, R, and Perseghin, G. Liver stiffness, albuminuria and chronic kidney disease in patients with NAFLD: a systematic review and meta-analysis. Biomolecules. (2022) 12:105. doi: 10.3390/biom12010105
6. Guo, Z, Wu, D, Mao, R, Yao, Z, Wu, Q, and Lv, W. Global burden of MAFLD, MAFLD related cirrhosis and MASH related liver cancer from 1990 to 2021. Sci Rep. (2025) 15:7083. doi: 10.1038/s41598-025-91312-5
7. Huang, YL, Sun, C, Wang, Y, Cheng, J, Wang, SW, Wei, L, et al. Ultrasound-guided attenuation parameter for identifying metabolic dysfunction-associated steatotic liver disease: a prospective study. Ultrasonography. (2025) 44:134–44. doi: 10.14366/usg.24204
8. Gopal, P, Hu, X, Robert, ME, and Zhang, X. The evolving role of liver biopsy: current applications and future prospects. Hepatol Commun. (2025) 9:628. doi: 10.1097/HC9.0000000000000628
9. Thomaides-Brears, HB, Alkhouri, N, Allende, D, Harisinghani, M, Noureddin, M, Reau, NS, et al. Incidence of complications from percutaneous biopsy in chronic liver disease: a systematic review and meta-analysis. Dig Dis Sci. (2022) 67:3366–94. doi: 10.1007/s10620-021-07089-w
10. Du, J, Tao, X, Zhu, L, Qi, W, Min, X, Deng, H, et al. A risk prediction system for depression in middle-aged and older adults grounded in machine learning and visualization technology: a cohort study. Front Public Health. (2025) 13:1606316. doi: 10.3389/fpubh.2025.1606316
11. Du, J, Tao, X, Zhu, L, Wang, H, Qi, W, Min, X, et al. Development of a visualized risk prediction system for sarcopenia in older adults using machine learning: a cohort study based on CHARLS. Front Public Health. (2025) 13:1544894. doi: 10.3389/fpubh.2025.1544894
12. Du, J, Yang, S, Zeng, Y, Ye, C, Chang, X, and Wu, S. Visualization obesity risk prediction system based on machine learning. Sci Rep. (2024) 14:22424. doi: 10.1038/s41598-024-73826-6
13. Bifarin, OO. Interpretable machine learning with tree-based shapley additive explanations: application to metabolomics datasets for binary classification. PLoS One. (2023) 18:e0284315. doi: 10.1371/journal.pone.0284315
14. Chang, M, Shao, Z, and Shen, G. Association between triglyceride glucose-related markers and the risk of metabolic-associated fatty liver disease: a cross-sectional study in healthy Chinese participants. BMJ Open. (2023) 13:e070189. doi: 10.1136/bmjopen-2022-070189
15. Ren, Q, Huang, Y, Liu, Q, Chu, T, Li, G, and Wu, Z. Association between triglyceride glucose-waist height ratio index and cardiovascular disease in middle-aged and older Chinese individuals: a nationwide cohort study. Cardiovasc Diabetol. (2024) 23:247. doi: 10.1186/s12933-024-02336-6
16. Zhang, YN, Fowler, KJ, Hamilton, G, Cui, JY, Sy, EZ, Balanay, M, et al. Liver fat imaging-a clinical overview of ultrasound, CT, and MR imaging. Br J Radiol. (2018) 91:20170959.
17. Eslam, M, Fan, JG, Yu, ML, Wong, VWS, Cua, IH, Liu, CJ, et al. The Asian Pacific association for the study of the liver clinical practice guidelines for the diagnosis and management of metabolic dysfunction-associated fatty liver disease. Hepatol Int. (2025) 19:261–301. doi: 10.1007/s12072-024-10774-3
18. Huang, G, Jin, Q, and Mao, Y. Predicting the 5-year risk of nonalcoholic fatty liver disease using machine learning models: prospective cohort study. J Med Internet Res. (2023) 25:e46891. doi: 10.2196/46891
19. Yu, Y, Yang, Y, Li, Q, Yuan, J, and Zha, Y. Predicting metabolic dysfunction associated steatotic liver disease using explainable machine learning methods. Sci Rep. (2025) 15:12382. doi: 10.1038/s41598-025-96478-6
20. Huang, AA, and Huang, SY. Comparison of model feature importance statistics to identify covariates that contribute most to model accuracy in prediction of insomnia. PLoS One. (2024) 19:e0306359. doi: 10.1371/journal.pone.0306359
21. Huang, AA, and Huang, SY. Dendrogram of transparent feature importance machine learning statistics to classify associations for heart failure: a reanalysis of a retrospective cohort study of the Medical Information Mart for Intensive Care III (MIMIC-III) database. PLoS One. (2023) 18:e0288819. doi: 10.1371/journal.pone.0288819
22. Sylvester, S, Sagehorn, M, Gruber, T, Atzmueller, M, and Schöne, B. SHAP value-based ERP analysis (SHERPA): increasing the sensitivity of EEG signals with explainable AI methods. Behav Res Methods. (2024) 56:6067–81. doi: 10.3758/s13428-023-02335-7
23. Wang, X, Chen, H, Wang, L, and Sun, W. Machine learning for predicting all-cause mortality of metabolic dysfunction-associated fatty liver disease: a longitudinal study based on NHANES. BMC Gastroenterol. (2025) 25:376. doi: 10.1186/s12876-025-03946-4
24. Chan, KE, Koh, TJL, Tang, ASP, Quek, J, Yong, JN, Tay, P, et al. Global prevalence and clinical characteristics of metabolic-associated fatty liver disease: a meta-analysis and systematic review of 10 739 607 individuals. J Clin Endocrinol Metab. (2022) 107:2691–700. doi: 10.1210/clinem/dgac321
25. Estes, C, Anstee, QM, Arias-Loste, MT, Bantel, H, Bellentani, S, Caballeria, J, et al. Modeling NAFLD disease burden in China, France, Germany, Italy, Japan, Spain, United Kingdom, and United States for the period 2016-2030. J Hepatol. (2018) 69:896–904. doi: 10.1016/j.jhep.2018.05.036
26. Eguchi, Y, Hyogo, H, Ono, M, Mizuta, T, Ono, N, Fujimoto, K, et al. Prevalence and associated metabolic factors of nonalcoholic fatty liver disease in the general population from 2009 to 2010 in Japan: a multicenter large retrospective study. J Gastroenterol. (2012) 47:586–95. doi: 10.1007/s00535-012-0533-z
27. Li, C, Zhang, Z, Luo, X, Xiao, Y, Tu, T, Liu, C, et al. The triglyceride-glucose index and its obesity-related derivatives as predictors of all-cause and cardiovascular mortality in hypertensive patients: insights from NHANES data with machine learning analysis. Cardiovasc Diabetol. (2025) 24:47. doi: 10.1186/s12933-025-02591-1
28. Yang, X, Rao, H, Yuan, Y, Hu, N, Zhang, X, Zeng, Y, et al. Correlation analysis of the triglyceride-glucose index and related parameters in metabolic dysfunction-associated fatty liver disease. Sci Rep. (2025) 15:23. doi: 10.1038/s41598-024-84809-y
29. Chen, Q, Hu, P, Hou, X, Sun, Y, Jiao, M, Peng,, et al. Association between triglyceride-glucose related indices and mortality among individuals with non-alcoholic fatty liver disease or metabolic dysfunction-associated steatotic liver disease. Cardiovasc Diabetol. (2024) 23:232. doi: 10.1186/s12933-024-02343-7
30. Cai, J, Lin, C, Lai, S, Liu, Y, Liang, M, Qin, Y, et al. Waist-to-height ratio, an optimal anthropometric indicator for metabolic dysfunction associated fatty liver disease in the Western Chinese male population. Lipids Health Dis. (2021) 20:145. doi: 10.1186/s12944-021-01568-9
31. Hosseini, SA, Alipour, M, Sarvandian, S, Haghighat, N, Bazyar, H, and Aghakhani, L. Assessment of the appropriate cutoff points for anthropometric indices and their relationship with cardio-metabolic indices to predict the risk of metabolic associated fatty liver disease. BMC Endocr Disord. (2024) 24:79. doi: 10.1186/s12902-024-01615-3
32. Agbim, U, Carr, RM, Pickett-Blakely, O, and Dagogo-Jack, S. Ethnic disparities in adiposity: focus on non-alcoholic fatty liver disease, visceral, and generalized obesity. Curr Obes Rep. (2019) 8:243–54. doi: 10.1007/s13679-019-00349-x
33. Gremese, E, Bruno, D, Varriano, V, Perniola, S, Petricca, L, and Ferraccioli, G. Serum albumin levels: a biomarker to be repurposed in different disease settings in clinical practice. J Clin Med. (2023) 12:6017. doi: 10.3390/jcm12186017
34. Chen, J, Dan, L, Tu, X, Sun, Y, Deng, M, Chen, X, et al. Metabolic dysfunction-associated fatty liver disease and liver function markers are associated with Crohn's disease but not Ulcerative Colitis: a prospective cohort study. Hepatol Int. (2023) 17:202–14. doi: 10.1007/s12072-022-10424-6
35. Li, XM, Liu, SL, He, YJ, and Shu, JC. Using new indices to predict metabolism dysfunction-associated fatty liver disease (MAFLD): analysis of the national health and nutrition examination survey database. BMC Gastroenterol. (2024) 24:109. doi: 10.1186/s12876-024-03190-2
36. Wang, B, Yang, Y, Yin, Z, and Yang, W. The causal impact of body mass index on metabolic biomarkers and nonalcoholic fatty liver disease risk. Sci Rep. (2025) 15:10314. doi: 10.1038/s41598-024-84165-x
37. Duan, SJ, Ren, ZY, Zheng, T, Peng, HY, Niu, ZH, Xia, H, et al. Atherogenic index of plasma combined with waist circumference and body mass index to predict metabolic-associated fatty liver disease. World J Gastroenterol. (2022) 28:5364–79. doi: 10.3748/wjg.v28.i36.5364
38. Yang, SS, Li, JT, Yan, SG, and Jiao, JZ. Research progress in complement receptor of the immunoglobulin superfamily in regulating liver immunity. Zhongguo Yi Xue Ke Xue Yuan Xue Bao. (2024) 46:603–9. doi: 10.3881/j.issn.1000-503X.15803
39. Chen, JF, Wu, ZQ, Liu, HS, Yan, S, Wang, YX, Xing, M, et al. Cumulative effects of excess high-normal alanine aminotransferase levels in relation to new-onset metabolic dysfunction-associated fatty liver disease in China. World J Gastroenterol. (2024) 30:1346–57. doi: 10.3748/wjg.v30.i10.1346
40. Ballestri, S, Zona, S, Targher, G, Romagnoli, D, Baldelli, E, Nascimbeni, F, et al. Nonalcoholic fatty liver disease is associated with an almost twofold increased risk of incident type 2 diabetes and metabolic syndrome. Evidence from a systematic review and meta-analysis. J Gastroenterol Hepatol. (2016) 31:936–44. doi: 10.1111/jgh.13264
41. Mantovani, A, Byrne, CD, Bonora, E, and Targher, G. Nonalcoholic fatty liver disease and risk of incident type 2 diabetes: a meta-analysis. Diabetes Care. (2018) 41:372–82. doi: 10.2337/dc17-1902
42. Yuan, M, He, J, Hu, X, Yao, L, Chen, P, Wang, Z, et al. Hypertension and NAFLD risk: Insights from the NHANES 2017-2018 and Mendelian randomization analyses. Chin Med J. (2024) 137:457–64. doi: 10.1097/CM9.0000000000002753
Keywords: machine learning, metabolic-associated fatty liver disease, predictive model, older adults, random forest
Citation: Zeng Y, Yang C, Yang X, Zhang X and Xia G (2025) Predicting the risk of metabolic-associated fatty liver disease in the elderly population in China: construction and evaluation of interpretable machine learning models. Front. Med. 12:1678076. doi: 10.3389/fmed.2025.1678076
Edited by:
Sri Krishnan, Toronto Metropolitan University, CanadaReviewed by:
George Grant, Independent Researcher, Aberdeen, United KingdomDu Jinsong, Zaozhuang University, China
Copyright © 2025 Zeng, Yang, Yang, Zhang and Xia. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Guodong Xia, ODk0MjQyMTMwQHFxLmNvbQ==
†These authors have contributed equally to this work and share first authorship
Chaobing Yang2†