Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Physiol.

Sec. Computational Physiology and Medicine

This article is part of the Research TopicMedical Knowledge-Assisted Machine Learning Technologies in Individualized Medicine Volume IIView all 29 articles

Development and Validation of a High-Performance Clinical Predictive Model for Early Identification of Non-Alcoholic Fatty Liver Disease

Provisionally accepted
Tong  LiangTong Liang1Junli  RenJunli Ren2*
  • 1Gansu University of Chinese Medicine, Lanzhou, China
  • 2Gansu Provincial Maternal and Child Health Hospital, Lanzhou, China

The final, formatted version of the article will be published soon.

Background: Non-alcoholic fatty liver disease (NAFLD) remains a significant global health challenge, imposing substantial clinical and economic burdens. There is an urgent need to develop reliable predictive tools for early identification and intervention. Methods: This study drew on Dryad database data to create and verify a clinical NAFLD predictive model, incorporating key parameters from 1,592 subjects randomly split into training and validation groups. We employed logistic regression on the training set to construct the model, visualized and internally validated it in R, and gauged its net benefit via decision curve analysis. The validation set underwent external assessment, with performance metrics including F1 score, precision, and recall. Results: The model showed strong discrimination, with an receiver operating characteristic curve area of 0.80 (95% confidence interval: 0.77-0.82) in training and 0.78 in validation, indicating high accuracy in NAFLD risk prediction. Calibration tests showed close alignment between predicted and actual risks, with mean absolute error values of 0.016 (training) and 0.012 (validation). Comprehensive metrics (F1 score: 0.76, precision: 0.71, recall: 0.82) reinforced its robustness and clinical value. Conclusion: This study's results confirm the effective creation of an NAFLD predictive tool boasting high calibration accuracy and outstanding performance. Leveraging readily available clinical data, the model offers a scalable, economical approach to NAFLD, poised to pioneer a new paradigm for its precise prevention and control, and enable personalized prevention and efficient resource allocation.

Keywords: diabetes, Non-alcoholic fatty liver disease, Prediction model, prevention, Tobacco

Received: 18 Oct 2025; Accepted: 20 Jan 2026.

Copyright: © 2026 Liang and Ren. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Junli Ren

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.