Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Med., 09 February 2026

Sec. Hepatobiliary Diseases

Volume 13 - 2026 | https://doi.org/10.3389/fmed.2026.1709412

This article is part of the Research TopicBridging Molecular Drivers with New Therapies for Chronic Liver Disease ProgressionView all articles

Exploring the prediction model and core genes for coronary artery disease in non-obese steatotic liver disease patients

  • 1Department of Hepatology, The First Hospital of Jilin University, Changchun, Jilin, China
  • 2Department of Gastroenterology, The Second Hospital of Dalian Medical University, Dalian, China
  • 3Department of Geriatric, The Second Hospital of Dalian Medical University, Dalian, China

Backgrounds and aims: Non-obese steatotic liver disease (SLD) refers to a metabolic disorder characterized by ectopic fat deposition in the liver, but without increased subcutaneous adipose tissue and normal body mass index (BMI) in patients. Emerging evidence indicates that non-obese SLD is associated with coronary artery disease (CAD). However, the mechanisms underlying their mutual relationship remain undefined.

Methods: We retrospectively analyzed 8,722 subjects and constructed a prediction model for diagnosing CAD in non-obese SLD patients. Then, public datasets from the Gene Expression Omnibus (GEO) were retrieved for further bioinformatics analysis, and machine learning algorithms were used to screen candidate core genes.

Results: Through the analysis of clinical data, we found that the risk of CAD in non-obese SLD patients was significantly higher than that in obese SLD patients and individuals without SLD. We constructed a nomogram for predicting CAD in non-obese SLD patients, and the area under the curve for training and validation sets was 0.846 and 0.732, respectively. We analyzed the non-obese SLD dataset (GSE89632) and CAD dataset (GSE113079) and overlapped the differentially expressed genes (DEGs) in these two datasets. We found that there were 28 overlapping upregulated DEGs and 66 overlapping downregulated DEGs. The protein–protein interaction network generated a 94-edge network, and the top 40 hub genes were selected using the maximal clique centrality algorithm. The candidate core genes, including HNF4A and LTBP4, were screened based on machine learning algorithms. The receiver operating characteristic results showed that these two genes have considerable diagnostic value for non-obese SLD and CAD.

Conclusion: We found a close correlation between non-obese SLD and CAD. Our study developed a novel diagnostic model to predict CAD in non-obese SLD patients with promising predictive performance. In addition, through comprehensive bioinformatics analysis and machine learning algorithms, two key core genes, HNF4A and LTBP4, were identified to be associated with both non-obese SLD and CAD.

1 Introduction

Metabolic dysfunction-associated steatotic liver disease (MASLD) is one of the most common liver diseases and affects approximately 25% of the global population, which is the most common type of steatotic liver disease (SLD) (1, 2). Traditionally, MASLD is considered to be associated with overweight or obesity. Nonetheless, SLD has also been increasingly reported in normal-weight individuals. Non-obese SLD is considered ‘metabolically unhealthy normal weight’ SLD, characterized by no significant increase in subcutaneous fat accompanied by ectopic fat deposition in the liver (3, 4). The pathogenesis of non-obese SLD is different from conventional MASLD due to different metabolic and gut microbiota profiles (5, 6).

Cardiovascular diseases are the predominant global cause of death, with an estimated 18 million people dying from these diseases per year, which accounts for 31% of total deaths (7). Coronary artery disease (CAD) refers to the accumulation of lipids and immune cells in the subendothelial layer of the coronary arteries, or atherosclerosis, which is the primary cause of death from cardiovascular diseases, accounting for nearly 45% of such cases (810). Patients with MASLD are associated with a higher risk of cardiovascular events (11, 12). A recent large-scale retrospective cohort study demonstrated that cardiovascular diseases were one of the leading causes of mortality in patients with non-cirrhotic steatotic liver disease (13). In previous studies that compared obese MASLD patients and non-obese SLD patients, non-obese SLD individuals may lead to a more severe prognosis, including a higher risk of cardiovascular disease (14), severe liver disease (15, 16), hypertension, dyslipidemia, metabolic syndromes (17, 18), and all-cause mortality (19). However, other studies have also shown controversial conclusions. For example, some previous research showed that non-obese SLD had a lower prevalence of cardiovascular events (20, 21), hypertension, diabetes (DM), and metabolic syndromes (22) compared to obese MASLD patients. These discrepancies indicate that obese MASLD and non-obese SLD differ significantly in their clinical associations, particularly regarding cardiovascular events, though the exact underlying pathogenesis remains unclear.

In order to further study the relationship between non-obese SLD and CAD, we recruited subjects who completed coronary computed tomography angiography (CCTA) screening to assess coronary atherosclerosis and liver imaging to assess liver status. In the study, we analyzed baseline clinical characteristics and metabolic parameters to identify risk factors for CAD within the non-obese SLD cohort. Subsequently, we constructed the prediction model of CAD in non-obese SLD and explored shared core genes linking the two conditions through bioinformatics analysis and machine learning.

2 Methods

2.1 Clinical data analysis

2.1.1 Study population

A retrospective cross-sectional study was performed, including patients hospitalized at the Second Hospital of Dalian Medical University from January 2016 to June 2024. We collected the clinical data of patients who underwent liver imaging examinations and CCTA scans due to thoracalgia or physical examination. Existing literature indicates a high degree of concordance (>96%) between NAFLD and MASLD definitions, with several studies suggesting they may be used interchangeably in retrospective contexts (23, 24). Since a portion of our participants lacked recorded values for waist circumference, quantitative blood pressure reading or glycated hemoglobin measurements, for the diagnosis of SLD or MAFLD, we still implemented the diagnostic criteria for NAFLD: requires (a) evidence of hepatic steatosis by imaging, and (b) no causes of secondary hepatic fat accumulation such as significant alcohol consumption, use of steatogenic medication or hereditary disorders (25). The diagnostic criteria for CAD are that coronary artery stenosis reaches at least 50% under CCTA scan. Weight (in kilograms) and height (in centimeters) were objectively measured by nurses at the time of admission. SLD patients were divided into two groups according to body mass index (BMI) levels (BMI < 25 kg/m2 as non-obese SLD group, BMI ≥ 25 mg/m2 as obese MASLD group). Participants with normal ultrasound findings were defined as the healthy control group. This study was approved by the Ethics Committee of the Second Hospital of Dalian Medical University. This study used anonymous data, and informed consent was obtained before the CCTA scan.

2.1.2 Collection of clinical data

Baseline characteristics of enrolled patients were collected, including demographic data (age and sex), anthropometric measurements (height and weight), and medical history (specifically hypertension and diabetes). Laboratory assessments included serum levels of apolipoprotein B/apolipoprotein A1 [ApoB/A1], albumin [Alb], alanine aminotransferase [ALT], absolute Neutrophil Count [ANC], aspartate aminotransferase [AST], creatinine [Cr], fasting blood glucose [FBG], hemoglobin [HGB], hemoglobin A1c [HbA1c], high-density lipoprotein cholesterol [HDL-C], low-density lipoprotein cholesterol [LDL-C], white blood cell [WBC], platelet count [PLT], red blood cell [RBC], total bilirubin [TBIL], total cholesterol [TC], triglyceride [TG], and uric acid [UA]. Imaging modalities included abdominal CT scan, abdominal ultrasound, and CCTA scan. Anthropometric data were collected on the day of admission, and laboratory tests were conducted after overnight fasting within 24 h after admission. BMI was calculated by dividing a patient’s weight (in kg) by the square of his height (in meters).

2.1.3 Statistical analyses

Continuous variables were assessed for normality using the Shapiro–Wilk test. As the majority of the parameters did not follow a normal distribution, they were presented as medians along with interquartile ranges (25th–75th percentiles) and were compared by the Mann–Whitney U-test. Categorical variables (gender, prevalence of hypertension, DM, and CAD) were expressed as counts and percentages and analyzed by chi-square tests. Non-obese SLD patients were randomly divided into a training set and a validation set at a ratio of 3:2. After assessing for multicollinearity, multivariate stepwise logistic regression analysis was conducted to evaluate the association strength of the independent risk factors for CAD in non-obese SLD, and odds ratios (ORs) with 95% confidence intervals (CIs) were displayed. Selected variables were incorporated into a nomogram to predict the risk of CAD in non-obese SLD patients. The area under the curve (AUC) of the receiver operating characteristic (ROC) was used to assess the discriminative ability of the prediction models in both sets. Calibration curve and decision curve analysis (DCA) were performed to evaluate the accuracy and clinical utility of the prediction models. Statistical analyses were performed by SPSS (version 27) and R software (version 4.2.3). The packages used included “readr,” “MASS,” “rms,” and “pROC.” A two-sided p-value of < 0.05 was considered statistically significant.

2.2 Transcriptomics analyses

2.2.1 Data collection and processing

We downloaded two microarray datasets from the Gene Expression Omnibus (GEO1). The GSE89632 dataset contains liver tissue gene expression profiles, including both non-obese SLD patients and obese MASLD patients (GSE89632: n = 6 non-obese SLD, n = 31 obese MASLD). GSE113079 contains peripheral blood mononuclear cells gene expression profiles, including CAD patients and healthy controls (GSE113079: n = 93 CAD, n = 48 controls). Then we normalized the raw data and calculated the average expression when multiple probes referred to the same gene symbol. Detailed dataset characteristics are provided in Supplementary Table 1.

2.2.2 Identification of differentially expressed genes in CAD and non-obese SLD

We identified differentially expressed genes (DEGs) in non-obese SLD and CAD by the “limma” R package. A two-sided p-value of < 0.05 was used as the chosen standard for DEGs. Genes with log2 fold change (log2 FC) > 0 were considered as upregulated genes, and genes with log2 FC < 0 as downregulated genes. “randomcoloR” and “venn” packages were used to identify and show the overlap of DEGs between GSE89632 and GSE113079 datasets.

2.2.3 Construction of protein–protein interaction and identification of hub genes

Overlapping DEGs were analyzed by the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) database2 to construct a protein–protein interaction (PPI) network. We set the parameters to “Homo sapiens” with a minimum interaction score threshold of 0.4. Hub DEGs were visualized by Cytoscape (Version 3.9.1) plug-in CytoHubba.3 The maximal clique centrality (MCC) algorithm screened out nodes with the top 40 highest scores as hub genes.

2.2.4 Screening optimal core genes via machine learning

Support vector machine-recursive feature elimination (SVM-RFE), least absolute shrinkage and selection operator (LASSO), and random forest (RF) were used to identify the optimal core genes in GSE89632. SVM-RFE was implemented using the “sigFeature” and “e1071” R packages to select the most relevant gene subset and prevent overfitting, with feature precedence determined by AvgRank. The optimal parameter λ was determined by 10-fold cross-validation of LASSO with the “glmnet” R package. RF is a supervised machine learning method that is performed by the “randomForest” R package. In this study, genes with importance > 0 were considered as candidate genes. Subsequently, by crossing the significant genes identified by these three machine learning methods, the optimal core genes were screened out.

2.2.5 Exploring the expression levels of core genes in non-obese populations

The expression levels of the optimal core genes in the datasets (GSE89632 and GSE113079) were calculated using the “ggplot2” R package. Boxplots were generated to compare core gene expression levels between the disease groups (non-obese SLD and CAD) and healthy groups. Statistical significance was assessed by the Wilcoxon test (p < 0.05).

2.2.6 Evaluation of diagnostic efficacy

To evaluate the diagnostic efficiency of core genes for non-obese SLD and CAD, AUCs were calculated by the “pROC” R package.

2.2.7 Statistical analyses

All statistical analyses and graph generation for bioinformatics studies were performed using R software (version 4.2.3). The differences between the two groups were assessed using the Wilcoxon signed-rank test. A two-sided p-value of < 0.05 was considered statistically significant.

3 Results

3.1 Demographic and clinical characteristics of the control, non-obese SLD, and obese MASLD group

A total of 8,722 participants who underwent both hepatic imaging examination and CCTA scan were enrolled in the analysis. Based on the liver imaging results and BMI, the participants were categorized into three groups: the control group (6,726, 77.12%), the non-obese SLD group (517, 5.92%), and the obese MASLD group (1,479, 16.96%). The clinical characteristics of this group are presented in Table 1.

Table 1
www.frontiersin.org

Table 1. Demographic and clinical characteristics of participants in the control, non-obese SLD, and obese MASLD groups.

Compared to the obese MASLD group, the non-obese SLD group exhibited significantly lower BMI, RBC, HBG, Cr, UA, TG, and ALT levels, as well as a lower proportion of males and prevalence of hypertension. However, age, FBG, TC, HDL, and prevalence of CAD were significantly higher in the non-obese SLD group. No significant differences were observed in HbA1c, WBC, ANC, PLT, urea, LDL, Apo B/A1, AST, Alb, TB, or prevalence of DM between these two groups.

Compared to the control group, the non-obese SLD group showed significantly elevated levels of FBG, HbA1c, PLT, TC, TG, LDL, Apo B/A1, ALT, AST, and Alb, along with a higher prevalence of DM and CAD. While age, BMI, urea, Cr, HDL, and the proportion of males were significantly lower. There were no differences in WBC, ANC, RBC, HGB, UA, TB, or the prevalence of hypertension levels. In the comparison between the obese MASLD and control groups, BMI, FBG, HbA1c, WBC, ANC, RBC, PLT, UA, TC, TG, Apo B/A1, ALT, AST, Alb, the proportion of males, and the prevalence of hypertension and DM were significantly increased in the obese MASLD group, while age and HDL were significantly decreased. No significant differences were found in urea, Cr, LDL, TB levels, or the prevalence of DM. Notably, the prevalence of CAD in non-obese SLD was significantly higher than that in both the control group and obese MASLD group, suggesting that non-obese SLD is closely related to the occurrence of CAD.

3.2 Demographic and clinical characteristics of non-obese SLD patients with or without CAD in the training set

Due to the higher probability of CAD in non-obese SLD, we further explored potential risk factors for CAD in non-obese SLD. The non-obese SLD patients were randomly divided into a training set (n = 310) and a validation set (n = 207). Baseline characteristics were comparable between the two sets, except for PLT and TBIL levels (Supplementary Table 2). We further compared the clinical features of non-obese SLD patients with or without CAD in the training set (Table 2). Compared to those without CAD, non-obese SLD patients with CAD were significantly older and exhibited higher levels of FBG, HbA1c, WBC, ANC, TC, HDL, and LDL. Additionally, this group showed a higher proportion of males and a higher prevalence of DM and hypertension. No significant differences were observed in RBC, HGB, PLT, Urea, Cr, UA, TG, ALT, AST, Alb, or TBIL levels between these two groups.

Table 2
www.frontiersin.org

Table 2. Demographic and clinical characteristics of non-obese SLD patients with or without CAD in the training set.

3.3 Independent risk factors of non-obese SLD patients with CAD in the training set

To determine independent risk factors for CAD in non-obese SLD patients, we conducted a multivariate logistic regression analysis. After excluding variables with multicollinearity, the factors with statistical differences between the non-obese SLD patients with or without CAD in the training set were included in the multivariate logistic regression analysis (Supplementary Table 3). The results indicated that male sex, history of DM, age, and elevated TC (with odds ratios of 3.266, 2.961, 1.079, and 2.420, respectively) were independently associated with the presence of CAD in non-obese SLD.

3.4 Constructing and validating a nomogram for predicting CAD in non-obese SLD patients

To further predict the risk of CAD in patients with non-obese SLD in the clinic, we constructed a nomogram based on the results of multivariate logistic regression analysis. In this nomogram, each variable was assigned a score on the points scale. The total score was calculated by summing these individual scores and was then projected onto the total points scale to estimate the probability of CAD (Figure 1A).

Figure 1
Panel A shows a clinical nomogram scoring system based on diabetes, gender, age, and total cholesterol with risk and point calculation. Panel B is a ROC curve with area under the curve 0.846, while panel C shows another ROC curve with area under the curve 0.732. Panel D presents a decision curve analysis comparing net benefits in training and validation sets. Panels E and F provide calibration plots of predicted versus observed probabilities, each with bias-corrected and apparent lines, alongside ideal reference lines.

Figure 1. Construction and validation of a nomogram for predicting CAD in non-obese SLD patients. (A) A nomogram for predicting CAD in non-obese SLD in the training set. (B) ROC curve of the nomogram for diagnosing CAD in non-obese SLD in the training set. (C) ROC curve of the nomogram for diagnosing CAD in non-obese SLD in the validation set. (D) DCAs for the prediction of CAD in the training set and validation set. The x-axis is the threshold probability, and the y-axis is the net benefit. (E) The calibration curve of the prediction model in the training set. (F) The calibration curve of the prediction model in the validation set.

The AUC of the logistic regression model in the training set was 0.846, and the cutoff value, specificity, and sensitivity were 0.167, 66.7, and 89.6%, respectively (Figure 1B). The accuracy, positive predictive value (PPV), and negative predictive value (NPV) of this model were 74.5, 47.6, and 95.0%, respectively (Supplementary Table 4). Next, the model was evaluated in the validation set. The AUC of the logistic regression model was 0.732, and the cutoff value, specificity, and sensitivity were 0.238, 68.7, and 74.5%, respectively (Figure 1C). The accuracy, PPV, and NPV were 70.2, 45.5, and 88.5%, respectively (Supplementary Table 4). Both DCAs in the training set and validation set lie above the “All” and “None” curves, indicating significant net clinical benefit (Figure 1D). Both calibration curves in both sets were close to the baselines, demonstrating good consistency between predicted and actual CAD probabilities (Figures 1E,F).

3.5 Identification of DEGs in the non-obese SLD dataset and CAD dataset

Based on the established significant association between non-obese SLD and CAD, we sought to investigate potential shared hub genes involved in their interaction. To this end, we analyzed two relevant datasets: the non-obese SLD dataset (GSE89632, liver tissue expression profiles) and the CAD dataset (GSE113079, peripheral blood mononuclear cell expression profiles). There were 477 DEGs (161 upregulated genes and 316 downregulated genes) in the non-obese SLD dataset and 4,622 DEGs (2089 upregulated genes and 2,533 downregulated genes) in the CAD dataset. The expression patterns of the top 60 significant DEGs (ranked by p-value) were visualized in heatmaps (Figures 2A,B), and the overall distribution of DEGs was presented in volcano plots (Figures 2C,D). There were 28 overlapping upregulated DEGs and 66 overlapping downregulated DEGs between the non-obese SLD and CAD datasets, as shown in the Venn diagram in Figures 2E,F.

Figure 2
Panel A shows a heatmap illustrating gene expression levels for control and non-obese SLD groups, while panel B presents a heatmap comparing PC and CAD groups; both use a blue-to-red scale. Panel C and panel D display volcano plots indicating significantly upregulated and downregulated genes based on log fold change and negative log p-values, with differentially expressed genes highlighted. Panel E and panel F feature Venn diagrams comparing gene overlap between GSE89632 and GSE113079 datasets, highlighting shared and unique gene counts within intersections.

Figure 2. Screening of DEGs in non-obese SLD dataset (GSE98632) and CAD dataset (GSE113079). (A) Heatmap of the top 60 DEGs for the non-obese SLD dataset. (B) Heatmap of the top 60 DEGs for the CAD dataset. (C) The volcano plots indicate the DEGs for the non-obese SLD dataset. (D) The volcano plots indicate the DEGs for the CAD dataset. Vertical line indicated log2 FC and horizontal line indicated -log10 (p-values). (E) Venn diagram of overlapping upregulated DEGs in the non-obese SLD dataset and CAD dataset. (F) Venn diagram of overlapping downregulated DEGs in the non-obese SLD dataset and CAD dataset.

3.6 Construction of the PPI network and identification of hub genes

After merging the DEGs that were simultaneously upregulated or downregulated in both datasets, we obtained 94 common genes. Subsequently, the most significant genes were recognized by the plug-in CytoHubba APP of Cytoscape, and the top 40 genes with the highest node scores from the MCC algorithm were considered hub genes (Figure 3A).

Figure 3
Panel A displays a heatmap of gene names with varying color intensities. Panel B is a line graph showing feature number versus 1–SVM accuracy. Panel C is a similar line graph depicting 1–SVM error. Panel D presents LASSO regression coefficient paths by log lambda. Panel E displays binomial deviance across log lambda values in a LASSO plot. Panel F is a random forest error plot with error rate versus tree number. Panel G is a bar plot ranking gene importance, with colored dots indicating importance scores. Panel H is a Venn diagram showing gene overlap among RandomForest, LASSO, and SVM-REF methods.

Figure 3. PPI network for DEGs and machine-learning algorithm screening for optimal core genes of CAD in non-obese SLD. (A) PPI network of the top 40 genes with the highest node scores from the MCC algorithm. (B) The accuracy was the highest when the feature was 13 for SVM-RFE analysis after 100 folds. (C) The error of the classifier was the smallest when the feature was 13 for SVM-RFE analysis after 100 folds. (D,E) The lowest binomial deviance of the LASSO regression curve corresponds to the most suitable candidate number of genes (n = 10). (F) The relationship between the number of trees and the error rate of the RF. (G) Ranking of genes based on the importance score calculated from RF. (H) Venn diagram shows that two core genes are screened by three machine-learning algorithms.

3.7 Integrating SVM-RFE, LASSO, and RF for screening the core genes

In order to further narrow down the candidate biomarkers, we further performed three machine-learning algorithms (SVM-RFE, LASSO, and RF) for the 40 DEGs obtained above. We performed an in-depth screening by SVM-RFE with 100 folds, and the results showed that when 13 genes were included, the classifier had the smallest error and the highest accuracy (Figures 3B,C). We further screened the core genes by LASSO regression. We identified 10 candidate genes with the lowest binomial deviance (Figures 3D,E). Furthermore, we determined seven genes using the RF algorithm with an importance score > 0 (Figures 3F,G). A Venn diagram showed there were two core genes named hepatocyte nuclear factor 4α (HNF4A) and latent transforming growth factor β (TGFβ) binding protein 4 (LTBP4) that were crossed among these three machine learning algorithms (Figure 3H). These two genes were selected for subsequent analysis.

3.8 Exploring the expression and predictive values of the core genes based on the datasets

The expression level of HNF4A was significantly higher (Figure 4A), while that of LTBP4 was significantly lower (Figure 4B) in the non-obese SLD group compared to healthy individuals in GSE89632. The AUCs of HNF4A and LTBP4 for diagnosing non-obese SLD were 0.909 and 0.806 (Figures 4C,D). The expression level of HNF4A was significantly higher (Figure 4E), while LTBP4 was significantly lower (Figure 4F) in the CAD group compared to healthy individuals in GSE113079. The AUCs of HNF4A and LTBP4 for diagnosing CAD were 0.745 and 0.746 (Figures 4G,H). These results confirmed that both core genes possessed significant predictive value for both diseases.

Figure 4
Eight-panel figure showing boxplots and ROC curves. Panels A and E display higher HNF4A expression in non-obese SLD and CAD groups versus controls, with significant p-values. Panels B and F show LTBP4 expression, also with significant differences. Panels C and D present ROC curves for NHF4A and LTBP4 diagnosing non-obese SLD, with AUCs of 0.909 and 0.806. Panels G and H show ROC curves for NHF4A and LTBP4 diagnosing CAD, with AUCs of 0.745 and 0.746. Blue represents control and red represents disease groups.

Figure 4. Expression patterns and diagnostic efficacy of core genes in non-obese SLD and CAD based on the datasets. Expression levels of HNF4A (A) and LTBP4 (B) in non-obese SLD. The ROC curves of HNF4A (C) and LTBP4 (D) for diagnosing non-obese SLD. Expression levels of HNF4A (E) and LTBP4 (F) in CAD. The ROC curves of HNF4A (G) and LTBP4 (H) for diagnosing CAD.

4 Discussion

SLD is generally considered to be associated with obesity, but it can also occur in individuals with BMI < 25 kg/m2, a phenotype termed “non-obese SLD.” A meta-analysis reported that non-obese SLD accounts for 20% of the non-obese population (26). According to previous reports, liver-related mortality (26) and all-cause mortality (19) are significantly higher in non-obese SLD patients compared to their obese counterparts. These findings underscore the high prevalence and poor prognosis of this condition; therefore, increased clinical attention is warranted for this specific population.

Non-obese SLD accounts for 36–45% of the global SLD population (2629). In our retrospective study, non-obese SLD accounted for 25.9% of the total SLD patients, which is lower than the reported value in previous investigations. This discrepancy may be that most subjects in our study were suspected of CAD, and obesity is an explicit risk factor for CAD. Consequently, our cohort likely included a disproportionately higher number of obese individuals. Compared to obese MASLD patients, non-obese SLD patients exhibited lower levels of FBG, UA, ALT, and TG, and a lower prevalence of hypertension, generally indicating a more favorable metabolic profile. However, the non-obese SLD group also showed high-risk characteristics, such as significantly increased TC levels and a higher prevalence of CAD. These findings suggest that the increased risk of CAD in non-obese SLD patients may be driven by dysregulated cholesterol metabolism. The association between abnormal cholesterol levels and CAD is well-documented. A study involving 4,467,942 veterans concluded that a lower serum cholesterol level was associated with a reduced risk of CAD (30). Another study reached similar conclusions that an elevated level of total cholesterol indicates an increased mortality risk of CAD (31, 32). Therefore, we hypothesize that the pathogenesis of CAD in non-obese SLD involves aberrant cholesterol metabolism, a mechanism that warrants further investigation.

In this study, a logistic regression model for predicting CAD in non-obese SLD was created with available laboratory parameters. A nomogram was constructed incorporating four independent predictors: history of DM, male sex, age, and TC. To verify the clinical usability of this model, calibration curves and DCA were performed in the training set and the validation set. We found that the nomogram we built was highly effective in predicting CAD in non-obese SLD. To the best of our knowledge, this is the first study to use such a comprehensive approach to construct a predictive model for CAD in non-obese SLD patients. The performance of this model was stable in both the training set and validation set. In addition, our nomogram offers distinct advantages over traditional criteria. Nomogram is a user-friendly tool that could guide clinical decision-making for doctors, as it can transform a complex statistical prediction model into a single numerical estimate based on different event probabilities. By summing the scores of different predictive variables, clinicians can quickly estimate an individual patient’s risk without complex calculations. By integrating multiple clinical variables, the nomogram provides a more precise prediction compared to single-factor assessments.

In this study, HNF4A and LTBP4 were selected as core genes based on an integrated bioinformatic analysis of the non-obese SLD dataset (GSE89632) and CAD (GSE113079) dataset. We also found that HNF4A and LTBP4 showed good accuracy of diagnosis for both non-obese SLD and CAD, with AUCs exceeding 0.7.

HNF4A is a pivotal liver transcription factor belonging to the nuclear receptor superfamily, highly expressed in the liver, intestine, kidney, and pancreas. The expression of HNF4A in MASLD remains controversial. Multiple studies have shown that HNF4A was overexpressed in the liver tissues of mice with a high-sugar diet and MASLD patients (33, 34), whereas some other studies have demonstrated that HNF4A was significantly reduced in high-fat diet-fed diabetic mice and MASH patients (35, 36). Our analysis revealed elevated HNF4A levels in both CAD and non-obese SLD patients. It plays a crucial role in the regulation of glucose and lipid metabolisms (37, 38). Thymiakou et al. (39) found that liver steatosis and liver triglyceride content increased, whereas the concentration of serum total cholesterol, HDL cholesterol, and triglycerides decreased in mice with liver-specific ablation of HNF4A. HNF4A deficiency promotes lipid accumulation within the liver while lowering cholesterol and triglyceride levels within the serum (40). In addition, our data indicated higher serum cholesterol levels in the non-obese SLD cohort. Thus, we hypothesized that the overexpression of HNF4A might influence hepatic fat deposition by disrupting cholesterol metabolism, thereby contributing to CAD in non-obese individuals. However, the specific mechanism remains unclear. Further investigation into the structural and functional properties of HNF4A may uncover novel mechanisms of systemic cholesterol homeostasis, offering potential therapeutic strategies.

The second core gene identified was LTBP4, which belongs to the LTBP family. LTBP4 can maintain the latent (inactive) state of TGF-β in the extracellular matrix by binding to TGF-β (41). Therefore, a decrease in LTBP4 expression leads to aberrant TGF-β activity. TGF-β plays a role in controlling cell growth and differentiation, and is involved in the pathogenesis of MASLD and CAD (42, 43). Several studies have found that the expression of TGF-β increases in NASH livers and stimulates the development of fibrosis (4446). Particularly, Ling Yang et al. reported that TGF-β signaling promotes hepatocyte steatosis, inflammatory cell infiltration, inflammatory cytokine production, and fibrosis, and participates in the pathogenesis of steatohepatitis by regulating cell death and lipid metabolism in the MASLD mouse model (46). In the context of CAD, patients with severe coronary artery obstruction (>50% stenosis in multiple major vessel lumens) have significantly elevated levels of active TGF-β1 compared to healthy individuals or those with mild obstruction (47, 48). In our study, LTBP4 was downregulated in both non-obese SLD and CAD. Therefore, we hypothesized that the inhibition of LTBP4 aggravated non-obese SLD and CAD by unleashing the TGF-β signaling pathway. Future research on LTBP4 may help to develop new therapeutic strategies, particularly for patients with comorbid non-obese SLD and CAD.

Our study has several limitations. First, as a retrospective single-center study, potential selection bias is inevitably present in the prediction model for CAD in non-obese SLD patients. Therefore, the conclusions drawn from this study need to be validated through large-scale, multi-center cohorts. Second, the core genes were identified from public datasets with limited sample sizes. Although we interpreted the research results based on comprehensive bioinformatics analysis, in vitro and in vivo experiments, experimental verification is essential. Nevertheless, it cannot be denied that our analysis possesses multiple notable strengths. Thirdly, to ensure that no critical biological signals were overlooked, we adopted more inclusive criteria during the initial screening phase. To remedy this issue, these initial candidates were subsequently refined through a rigorous, multi-layered bioinformatic pipeline—integrating PPI network analysis with three distinct machine learning algorithms. This ‘consensus-based’ strategy effectively filtered out noise and isolated highly credible core genes, thereby ensuring the robustness and validity of our conclusions. To our knowledge, this is the first study to construct a diagnostic model for CAD in non-obese SLD using common clinical indicators. This model may obviate the need for invasive procedures, enabling early risk stratification. We also utilized comprehensive bioinformatics analysis and machine learning to uncover the molecular link between non-obese SLD and CAD. In addition, for the diagnosis of CAD, previous studies mostly used clinical indicators to calculate risk scores to infer the presence of CAD, resulting in a lack of accuracy in diagnosis. CCTA is a well-validated non-invasive method. In our study, we used CCTA for evaluating coronary artery disease, thereby significantly improving diagnostic accuracy. We believe that this study contributes to elucidating new pathogenic mechanisms and may provide new therapeutic strategies for patients with CAD and non-obese SLD.

5 Conclusion

Non-obese SLD patients face a higher risk of CAD compared to the general population or obese patients. Our study developed a novel diagnostic model incorporating gender, age, history of DM, and TC to predict CAD in non-obese SLD patients. This model demonstrated robust predictive performance in both training (AUC = 0.846) and validation (AUC = 0.732) sets. Furthermore, comprehensive bioinformatics analysis and machine learning algorithms identified two core genes, HNF4A and LTBP4, which were associated with both conditions and possessed significant diagnostic value. These findings suggest potential pathogenic mechanisms involving HNF4A and LTBP4, warranting further molecular validation to explore their therapeutic potential.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the study protocol was approved by the Institutional Ethics Committee of the Second Hospital of Dalian Medical University, Dalian, China. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

YZ: Formal analysis, Investigation, Methodology, Writing – original draft. XS: Writing – original draft. MH: Writing – original draft. XG: Writing – original draft. SH: Funding acquisition, Validation, Visualization, Writing – review & editing. PG: Funding acquisition, Validation, Visualization, Writing – review & editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. PG was supported by fund from, Special project for medicine and health talents in Jilin Province, JLSWSRCZX2025-20. SH was supported by fund from General Program of Basic Scientific Research Project in 2022, Education Department of Liaoning Province, LJKMZ20221295. MH was by the National Natural Science Foundation of China (No. 82100620), the cultivating scientific research project of the Second Hospital of Dalian Medical University (No. YJ20250026). XG was supported by fund from United Fund of the Second Affiliated Hospital of Dalian Medical University and Dalian Institute of Chemical Physics, Chinese Academy of Sciences (UF-QN-202007).

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that Generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2026.1709412/full#supplementary-material

Footnotes

References

1. Huang, DQ, Wong, V, Rinella, ME, Wong, VWS, Boursier, J, Lazarus, JV, et al. Metabolic dysfunction-associated steatotic liver disease in adults. Nat Rev Dis Primers. (2025) 11:14. doi: 10.1038/s41572-025-00599-1,

PubMed Abstract | Crossref Full Text | Google Scholar

2. Chui, Z, Xue, Y, and Xu, A. Hormone-based pharmacotherapy for metabolic dysfunction-associated fatty liver disease. Med Rev. (2024) 4:158–68. doi: 10.1515/mr-2024-0007

Crossref Full Text | Google Scholar

3. Iacobini, C, Pugliese, G, Blasetti Fantauzzi, C, Federici, M, and Menini, S. Metabolically healthy versus metabolically unhealthy obesity. Metabolism. (2019) 92:51–60. doi: 10.1016/j.metabol.2018.11.009,

PubMed Abstract | Crossref Full Text | Google Scholar

4. Long, MT, Noureddin, M, and Lim, JK. AGA clinical practice update: diagnosis and Management of Nonalcoholic Fatty Liver Disease in lean individuals: expert review. Gastroenterology. (2022) 163:764–774.e1. doi: 10.1053/j.gastro.2022.06.023,

PubMed Abstract | Crossref Full Text | Google Scholar

5. Eslam, M, Sanyal, AJ, and George, J. MAFLD: a consensus-driven proposed nomenclature for metabolic associated fatty liver disease. Gastroenterology. (2020) 158:1999–2014.e1. doi: 10.1053/j.gastro.2019.11.312,

PubMed Abstract | Crossref Full Text | Google Scholar

6. Eslam, M, El-Serag, HB, Francque, S, Sarin, SK, Wei, L, Bugianesi, E, et al. Metabolic (dysfunction)-associated fatty liver disease in individuals of normal weight. Nat Rev Gastroenterol Hepatol. (2022) 19:638–51. doi: 10.1038/s41575-022-00635-5,

PubMed Abstract | Crossref Full Text | Google Scholar

7. Benjamin, EJ, Virani, SS, Callaway, CW, Chamberlain, AM, Chang, AR, Cheng, S, et al. Heart disease and stroke statistics-2018 update: a report from the American Heart Association. Circulation. (2018) 137:e67–e492. doi: 10.1161/CIR.0000000000000558

Crossref Full Text | Google Scholar

8. Eijsvogels, TM, Molossi, S, Lee, DC, Emery, MS, and Thompson, PD. Exercise at the extremes: the amount of exercise to reduce cardiovascular events. J Am Coll Cardiol. (2016) 67:316–29. doi: 10.1016/j.jacc.2015.11.034,

PubMed Abstract | Crossref Full Text | Google Scholar

9. Marwick, TH, Cho, I, Ó Hartaigh, B, and Min, JK. Finding the gatekeeper to the cardiac catheterization laboratory: coronary CT angiography or stress testing. J Am Coll Cardiol. (2015) 65:2747–56. doi: 10.1016/j.jacc.2015.04.060,

PubMed Abstract | Crossref Full Text | Google Scholar

10. Mensah, GA, Fuster, V, Murray, C, and Roth, GAGlobal Burden of Cardiovascular Diseases and Risks Collaborators. Global burden of cardiovascular diseases and risks, 1990-2022. J Am Coll Cardiol. (2023) 82:2350–473. doi: 10.1016/j.jacc.2023.11.007,

PubMed Abstract | Crossref Full Text | Google Scholar

11. Simon, TG, Roelstraete, B, Hagström, H, Sundström, J, and Ludvigsson, JF. Non-alcoholic fatty liver disease and incident major adverse cardiovascular events: results from a nationwide histology cohort. Gut. (2022) 71:1867–75. doi: 10.1136/gutjnl-2021-325724,

PubMed Abstract | Crossref Full Text | Google Scholar

12. Alon, L, Corica, B, Raparelli, V, Cangemi, R, Basili, S, Proietti, M, et al. Risk of cardiovascular events in patients with non-alcoholic fatty liver disease: a systematic review and meta-analysis. Eur J Prev Cardiol. (2022) 29:938–46. doi: 10.1093/eurjpc/zwab212,

PubMed Abstract | Crossref Full Text | Google Scholar

13. Ochoa-Allemant, P, Hubbard, RA, Kaplan, DE, and Serper, M. Cause-specific mortality in patients with steatotic liver disease in the United States. J Hepatol. (2025) 83:13. doi: 10.1016/j.jhep.2025.05.013,

PubMed Abstract | Crossref Full Text | Google Scholar

14. Kim, Y, Han, E, Lee, JS, Lee, HW, Kim, BK, Kim, MK, et al. Cardiovascular risk is elevated in lean subjects with nonalcoholic fatty liver disease. Gut Liver. (2022) 16:290–9. doi: 10.5009/gnl210084,

PubMed Abstract | Crossref Full Text | Google Scholar

15. Wang, Q, You, H, Ou, X, Zhao, X, Sun, Y, Wang, M, et al. Non-obese histologically confirmed NASH patients with abnormal liver biochemistry have more advanced fibrosis. Hepatol Int. (2019) 13:766–76. doi: 10.1007/s12072-019-09982-z,

PubMed Abstract | Crossref Full Text | Google Scholar

16. Denkmayr, L, Feldman, A, Stechemesser, L, Eder, SK, Zandanell, S, Schranz, M, et al. Lean patients with non-alcoholic fatty liver disease have a severe histological phenotype similar to obese patients. J Clin Med. (2018) 7:562. doi: 10.3390/jcm7120562,

PubMed Abstract | Crossref Full Text | Google Scholar

17. Kwon, YM, Oh, SW, Hwang, SS, Lee, C, Kwon, H, and Chung, GE. Association of nonalcoholic fatty liver disease with components of metabolic syndrome according to body mass index in Korean adults. Am J Gastroenterol. (2012) 107:1852–8. doi: 10.1038/ajg.2012.314,

PubMed Abstract | Crossref Full Text | Google Scholar

18. Lee, SW, Lee, TY, Yang, SS, Tung, CF, Yeh, HZ, and Chang, CS. Risk factors and metabolic abnormality of patients with non-alcoholic fatty liver disease: either non-obese or obese Chinese population. Hepatobiliary Pancreat Dis Int. (2018) 17:45–8. doi: 10.1016/j.hbpd.2018.01.007,

PubMed Abstract | Crossref Full Text | Google Scholar

19. Ahmed, OT, Gidener, T, Mara, KC, Larson, JJ, Therneau, TM, and Allen, AM. Natural history of nonalcoholic fatty liver disease with Normal body mass index: a population-based study. Clin Gastroenterol Hepatol. (2022) 20:1374–1381.e6. doi: 10.1016/j.cgh.2021.07.016,

PubMed Abstract | Crossref Full Text | Google Scholar

20. Fracanzani, AL, Petta, S, Lombardi, R, Pisano, G, Russello, M, Consonni, D, et al. Liver and cardiovascular damage in patients with lean nonalcoholic fatty liver disease, and association with visceral obesity. Clin Gastroenterol Hepatol. (2017) 15:1604–1611.e1. doi: 10.1016/j.cgh.2017.04.045,

PubMed Abstract | Crossref Full Text | Google Scholar

21. Arvind, A, Henson, JB, Osganian, SA, Nath, C, Steinhagen, LM, Memel, ZN, et al. Risk of cardiovascular disease in individuals with nonobese nonalcoholic fatty liver disease. Hepatol Commun. (2022) 6:309–19. doi: 10.1002/hep4.1818,

PubMed Abstract | Crossref Full Text | Google Scholar

22. Akyuz, U, Yesil, A, and Yilmaz, Y. Characterization of lean patients with nonalcoholic fatty liver disease: potential role of high hemoglobin levels. Scand J Gastroenterol. (2015) 50:341–6. doi: 10.3109/00365521.2014.983160,

PubMed Abstract | Crossref Full Text | Google Scholar

23. Hagström, H, Vessby, J, Ekstedt, M, and Shang, Y. 99% of patients with NAFLD meet MASLD criteria and natural history is therefore identical. J Hepatol. (2024) 80:e76–7. doi: 10.1016/j.jhep.2023.08.026,

PubMed Abstract | Crossref Full Text | Google Scholar

24. Younossi, ZM, Paik, JM, Stepanova, M, Ong, J, Alqahtani, S, and Henry, L. Clinical profiles and mortality rates are similar for metabolic dysfunction-associated steatotic liver disease and non-alcoholic fatty liver disease. J Hepatol. (2024) 80:694–701. doi: 10.1016/j.jhep.2024.01.014,

PubMed Abstract | Crossref Full Text | Google Scholar

25. Chalasani, N, Younossi, Z, Lavine, JE, Diehl, AM, Brunt, EM, Cusi, K, et al. The diagnosis and management of non-alcoholic fatty liver disease: practice guideline by the American Association for the Study of Liver Diseases, American College of Gastroenterology, and the American Gastroenterological Association. Hepatology. (2012) 55:2005–23. doi: 10.1002/hep.25762,

PubMed Abstract | Crossref Full Text | Google Scholar

26. Ye, Q, Zou, B, Yeo, YH, Li, J, Huang, DQ, Wu, Y, et al. Global prevalence, incidence, and outcomes of non-obese or lean non-alcoholic fatty liver disease: a systematic review and meta-analysis. Lancet Gastroenterol Hepatol. (2020) 5:739–52. doi: 10.1016/S2468-1253(20)30077-7

Crossref Full Text | Google Scholar

27. Wang, AY, Dhaliwal, J, and Mouzaki, M. Lean non-alcoholic fatty liver disease. Clin Nutr. (2019) 38:975–81. doi: 10.1016/j.clnu.2018.08.008,

PubMed Abstract | Crossref Full Text | Google Scholar

28. Mellinger, JL, Pencina, KM, Massaro, JM, Hoffmann, U, Seshadri, S, Fox, CS, et al. Hepatic steatosis and cardiovascular disease outcomes: an analysis of the Framingham heart study. J Hepatol. (2015) 63:470–6. doi: 10.1016/j.jhep.2015.02.045,

PubMed Abstract | Crossref Full Text | Google Scholar

29. Sinn, DH, Gwak, GY, Park, HN, Kim, JE, Min, YW, Kim, KM, et al. Ultrasonographically detected non-alcoholic fatty liver disease is an independent predictor for identifying patients with insulin resistance in non-obese, non-diabetic middle-aged Asian adults. Am J Gastroenterol. (2012) 107:561–7. doi: 10.1038/ajg.2011.400,

PubMed Abstract | Crossref Full Text | Google Scholar

30. Nguyen, XT, Ho, YL, Li, Y, Ho, Y‐L, Song, RJ, Leung, KH, et al. Serum cholesterol and impact of age on coronary heart disease death in more than 4 million veterans. J Am Heart Assoc. (2023) 12:e030496. doi: 10.1161/JAHA.123.030496,

PubMed Abstract | Crossref Full Text | Google Scholar

31. Corti, MC, Guralnik, JM, Salive, ME, Harris, T, Ferrucci, L, Glynn, RJ, et al. Clarifying the direct relation between total cholesterol levels and death from coronary heart disease in older persons. Ann Intern Med. (1997) 126:753–60. doi: 10.7326/0003-4819-126-10-199705150-00001,

PubMed Abstract | Crossref Full Text | Google Scholar

32. Cholesterol Treatment Trialists’ (CTT) CollaborationBaigent, C, Blackwell, L, Emberson, J, Holland, LE, Reith, C, Bhala, N, et al. Efficacy and safety of more intensive lowering of LDL cholesterol: a meta-analysis of data from 170,000 participants in 26 randomised trials. Lancet. (2010) 376:1670–81. doi: 10.1016/S0140-6736(10)61350-5

Crossref Full Text | Google Scholar

33. Yang, H, Arif, M, Yuan, M, Li, X, Shong, K, Türkez, H, et al. A network-based approach reveals the dysregulated transcriptional regulation in non-alcoholic fatty liver disease. iScience. (2021) 24:103222. doi: 10.1016/j.isci.2021.103222,

PubMed Abstract | Crossref Full Text | Google Scholar

34. Baciu, C, Pasini, E, Angeli, M, Schwenger, K, Afrin, J, Humar, A, et al. Systematic integrative analysis of gene expression identifies HNF4A as the central gene in pathogenesis of non-alcoholic steatohepatitis. PLoS One. (2017) 12:e0189223. doi: 10.1371/journal.pone.0189223,

PubMed Abstract | Crossref Full Text | Google Scholar

35. Xu, Y, Zalzala, M, Xu, J, Li, Y, Yin, L, and Zhang, Y. A metabolic stress-inducible miR-34a-HNF4α pathway regulates lipid and lipoprotein metabolism. Nat Commun. (2015) 6:7466. doi: 10.1038/ncomms8466,

PubMed Abstract | Crossref Full Text | Google Scholar

36. Huang, KW, Reebye, V, Czysz, K, Ciriello, S, Dorman, S, Reccia, I, et al. Liver activation of hepatocellular nuclear factor-4α by small activating RNA rescues dyslipidemia and improves metabolic profile. Mol Ther Nucleic Acids. (2020) 19:361–70. doi: 10.1016/j.omtn.2019.10.044,

PubMed Abstract | Crossref Full Text | Google Scholar

37. Stoffel, M, and Duncan, SA. The maturity-onset diabetes of the young (MODY1) transcription factor HNF4alpha regulates expression of genes required for glucose transport and metabolism. Proc Natl Acad Sci USA. (1997) 94:13209–14. doi: 10.1073/pnas.94.24.13209,

PubMed Abstract | Crossref Full Text | Google Scholar

38. Hayhurst, GP, Lee, YH, Lambert, G, Ward, JM, and Gonzalez, FJ. Hepatocyte nuclear factor 4alpha (nuclear receptor 2A1) is essential for maintenance of hepatic gene expression and lipid homeostasis. Mol Cell Biol. (2001) 21:1393–403. doi: 10.1128/MCB.21.4.1393-1403.2001,

PubMed Abstract | Crossref Full Text | Google Scholar

39. Thymiakou, E, Othman, A, Hornemann, T, and Kardassis, D. Defects in high density lipoprotein metabolism and hepatic steatosis in mice with liver-specific ablation of hepatocyte nuclear factor 4A. Metabolism. (2020) 110:154307. doi: 10.1016/j.metabol.2020.154307,

PubMed Abstract | Crossref Full Text | Google Scholar

40. Krapivner, S, Iglesias, MJ, Silveira, A, Tegnér, J, Björkegren, J, Hamsten, A, et al. DGAT1 participates in the effect of HNF4A on hepatic secretion of triglyceride-rich lipoproteins. Arterioscler Thromb Vasc Biol. (2010) 30:962–7. doi: 10.1161/ATVBAHA.109.201426,

PubMed Abstract | Crossref Full Text | Google Scholar

41. Su, CT, and Urban, Z. LTBP4 in health and disease. Genes. (2021) 12:795. doi: 10.3390/genes12060795,

PubMed Abstract | Crossref Full Text | Google Scholar

42. Massagué, J, and Wotton, D. Transcriptional control by the TGF-beta/Smad signaling system. EMBO J. (2000) 19:1745–54. doi: 10.1093/emboj/19.8.1745,

PubMed Abstract | Crossref Full Text | Google Scholar

43. Deng, Z, Fan, T, Xiao, C, Tian, H, Zheng, Y, Li, C, et al. TGF-β signaling in health, disease, and therapeutics. Signal Transduct Target Ther. (2024) 9:61. doi: 10.1038/s41392-024-01764-w,

PubMed Abstract | Crossref Full Text | Google Scholar

44. Bataller, R, and Brenner, DA. Liver fibrosis. J Clin Invest. (2005) 115:209–18. doi: 10.1172/JCI24282,

PubMed Abstract | Crossref Full Text | Google Scholar

45. Seki, E, De Minicis, S, Osterreicher, CH, Kluwe, J, Osawa, Y, Brenner, DA, et al. TLR4 enhances TGF-beta signaling and hepatic fibrosis. Nat Med. (2007) 13:1324–32. doi: 10.1038/nm1663

Crossref Full Text | Google Scholar

46. Yang, L, Roh, YS, Song, J, Zhang, B, Liu, C, Loomba, R, et al. Transforming growth factor beta signaling in hepatocytes participates in steatohepatitis through regulation of cell death and lipid metabolism in mice. Hepatology. (2014) 59:483–95. doi: 10.1002/hep.26698,

PubMed Abstract | Crossref Full Text | Google Scholar

47. Wang, XL, Liu, SX, and Wilcken, DE. Circulating transforming growth factor beta 1 and coronary artery disease. Cardiovasc Res. (1997) 34:404–10. doi: 10.1016/S0008-6363(97)00033-3,

PubMed Abstract | Crossref Full Text | Google Scholar

48. Low, EL, Baker, AH, and Bradshaw, AC. TGFβ, smooth muscle cells and coronary artery disease: a review. Cell Signal. (2019) 53:90–101. doi: 10.1016/j.cellsig.2018.09.004,

PubMed Abstract | Crossref Full Text | Google Scholar

Glossary

Alb - albumin

ALT - alanine aminotransferase

ANC - absolute neutrophil count

ApoB/A1 - apolipoprotein B/ Apolipoprotein A1

AST - aspartate aminotransferase

AUC - the area under the receiver operating characteristics curve

BMI - body mass index

CAD - coronary artery disease

CCTA - coronary computed tomography angiography

CI - confidence interval

Cr - creatinine

DCA - decision curve analysis

DEGs - differentially expressed genes

DM - diabetes

FBG - fasting blood glucose

GEO - Gene Expression Omnibus

HGB - hemoglobin

HbA1c - hemoglobin A1c

HDL-C - high-density lipoprotein cholesterol

LASSO - the least absolute shrinkage and selection operator

LDL-C - low-density lipoprotein cholesterol

MCC - maximal clique centrality

NAFLD - non-alcoholic fatty liver disease

NPV - negative predictive value

OR - odds ratio

PLT - platelet count

PPI - protein–protein interaction

PPV - positive predictive value

RBC - red blood cell

RDW - red cell distribution width

RF - random forest

ROC - receiver operating characteristic

SVM-RFE - support vector machine-recursive feature elimination

TBIL - total bilirubin

TC - total cholesterol

TG - triglyceride

UA - uric acid

WBC - white blood cell

SLD - steatotic liver disease

MASLD - metabolic dysfunction-associated steatotic liver disease

Keywords: core genes, coronary artery disease, machine learning, metabolic dysfunction-associated steatotic liver disease, nomogram, prediction model

Citation: Zhang Y, Song X, Han M, Gao X, Han S and Gao P (2026) Exploring the prediction model and core genes for coronary artery disease in non-obese steatotic liver disease patients. Front. Med. 13:1709412. doi: 10.3389/fmed.2026.1709412

Received: 20 September 2025; Revised: 22 January 2026; Accepted: 22 January 2026;
Published: 09 February 2026.

Edited by:

Nidhi Jalan-Sakrikar, Mayo Clinic, United States

Reviewed by:

Anthony Lagina, Wayne State University, United States
Melika Amelimojarad, Iran University of Medical Sciences, Iran

Copyright © 2026 Zhang, Song, Han, Gao, Han and Gao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Shuanglin Han, aGFuc2h1YW5nbGluQGRtdS5lZHUuY24=; Pujun Gao, Z3BqQGpsdS5lZHUuY24=

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.