Using machine learning methods to investigate the role of volatile organic compounds in non-alcoholic fatty liver disease

Shen, Chih-Hao; Huang, Ruei-Hao; Li, Yaw-Kuen; Chu, Ta-Wei; Pei, Dee

doi:10.3389/fmolb.2025.1631265

ORIGINAL RESEARCH article

Front. Mol. Biosci., 06 August 2025

Sec. Molecular Diagnostics and Therapeutics

Volume 12 - 2025 | https://doi.org/10.3389/fmolb.2025.1631265

Using machine learning methods to investigate the role of volatile organic compounds in non-alcoholic fatty liver disease

Chih-Hao Shen ¹

Ruei-Hao Huang ²

Yaw-Kuen Li ²^*

Ta-Wei Chu ^3,4^*

Dee Pei ^5,6^*

1. Division of Pulmonary and Critical Care Medicine, Department of Medicine, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan
2. Center for Emergent Functional Matter Science, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
3. Department of Obstetrics and Gynecology, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan
4. Department CEO, MJ Health Research Foundation, Taipei, Taiwan
5. Department of Medicine, Medical School, Fu Jen Catholic University, New Taipei City, Taiwan
6. Division of Endocrinology and Metabolism, Department of Internal Medicine, School of Medicine, College of Medicine, Fu Jen Catholic University Hospital, New Taipei City, Taiwan

Article metrics

View details

Citations

2,2k

Views

358

Downloads

Abstract

Aims:

Approximately 25%–30% of the global population is affected by non-alcoholic fatty liver disease (NAFLD). This study aimed to explore whether NAFLD could be effectively detected using 341 volatile organic compounds (VOCs) via 10 machine learning (Mach-L) algorithms in a cohort of 1,501 individuals.

Methods:

Participants were selected from the Taiwan MJ cohort, which includes comprehensive demographic, biochemical, lifestyle, and VOCs data. NAFLD was diagnosed by experienced gastroenterologists. Exhaled breath samples were collected using a 1.0-L aluminum bag (late expiratory fraction) and analyzed with selected-ion flow-tube mass spectrometry. Ten Mach-L techniques were employed to evaluate two predictive models: Model 1 (demographic, lifestyle, and biochemical data), and Model 2 (Model 1 + VOCs), assessed using area under the receiver operating characteristic curve (AUC).

Results:

Subjects with NAFLD had significantly higher values for age, BMI, blood pressure, and other biomedical markers, except for eGFR and HDL-C. Key predictors of NAFLD included BMI, triglycerides (TG), uric acid (UA), fasting plasma glucose (FPG), γ-GT, gender, LDL-C, and sleep duration. The addition of VOCs to Model 1 improved the AUC from 0.722 ± 0.149 to 0.770 ± 0.264 (p < 0.001). Ten VOCs were identified as the most influential, in order of importance: 2-propanol, acetone, butyl 2-methylbutanoate, diethylethanolamine, urethane, β-caryophyllene, furfural, tridecane, 4-methyloctanoic acid, and (S)-2-methyl-1-butanol.

Conclusion:

Incorporating VOCs into traditional demographic, biochemical, and lifestyle data significantly enhanced the model’s predictive performance. This suggests that VOCs may be associated with the underlying pathophysiology of NAFLD.

Introduction

Non-alcoholic fatty liver disease (NAFLD) is defined as the presence of macrovesicular steatosis in more than 5% of hepatocytes without other identifiable causes, such as alcohol consumption or medication use. NAFLD progresses from simple steatosis to non-alcoholic steatohepatitis, fibrosis, and eventually cirrhosis, making it one of the leading causes of chronic liver disease worldwide (Younossi et al., 2016; Ghevariya et al., 2014). The global prevalence of NAFLD has increased from 15% in 2005 to 25%–30% in 2023, reflecting the global rise in obesity rates (Quek et al., 2023). In Taiwan, a similar trend has been observed, with two studies estimating that 11.4%–41% of the general population may be affected by NAFLD (Chen et al., 2006; Lin et al., 2005). Consequently, early detection and prevention of NAFLD have become key priorities for healthcare providers and policymakers.

Traditionally, multiple logistic regression (MLR) has been used to analyze the relationship between risk factors and disease outcomes in medical research. The performance of MLR models is commonly evaluated using the area under the receiver operating characteristic curve (AUC). Recently, machine learning (Mach-L)—a branch of artificial intelligence that allows algorithms to learn from past data without explicit programming—has emerged as a competitive and often superior approach to MLR (Marateb et al., 2014; Ye et al., 2020; Nusinovici et al., 2020). Unlike MLR, Mach-L can model complex, nonlinear interactions among multiple variables, making it more suitable for disease prediction tasks (Miller and Brown, 2018). Mach-L in the medical field involves using computer algorithms to analyze large amounts of healthcare data, helping with tasks. These tools can detect patterns in medical images, electronic health records, and other data faster and often more accurately than humans, leading to earlier diagnoses, better patient care, and more efficient healthcare delivery (Arkoudis and Papadakos, 2025).

For over five decades, researchers have shown increasing interest in volatile organic compounds (VOCs) emitted from the human body. In 1971, Nobel laureate Linus Pauling reported that human breath contains approximately 250 VOCs (Machado and Cortez-Pinto, 2014). Later, in 1999, Maurice and Manousou (2018) identified more than 3,400 VOCs in exhaled breath. Alterations in VOC concentrations can reflect disease states, such as cancer (Wei et al., 2020). As a result, breath-derived VOCs have been proposed as biomarkers for detecting metabolic changes associated with various diseases. There have been studies investigated the relationships between VOCs and NAFLD in the past. However, most of these studies focused on how VOCs affect or damage liver. The proposed mechanisms included metabolic dysregulation, oxidative stress, and cell death (Lang and Beier, 2018; Liu et al., 2023; Duan et al., 2025). Their goals were different from the present study. Analytical techniques like gas chromatography-mass spectrometry (GC-MS) have confirmed these associations in numerous studies (Samudrala et al., 2014; Markar et al., 2019; Ratiu et al., 2020; Keogh and Riches, 2022; Chung et al., 2022). However, while many studies have explored VOC-based disease identification, few have utilized Mach-L techniques for VOC profiling (Tsou et al., 2021; Shaffie et al., 2022; Sukaram et al., 2023).

In this study, we employed 10 different Mach-L algorithms to develop predictive models for NAFLD using health examination data combined with exhaled VOC profiles. The performance of these models was compared to evaluate their potential utility in clinical screening for NAFLD. Finally, by applying Shapley addictive explanation to examine the directions and strengths of impacts.

Materials and methods

This study utilized data from the ongoing Taiwan MJ cohort, a prospective cohort collected through health examinations conducted by the MJ Health Screening Centers in Taiwan (Wu et al., 2017). The dataset includes over 100 essential biological indicators such as anthropometric measurements, blood tests, and imaging tests, among others.

The data were obtained from MJ clinic. At the time of their health checkups, participants provided general consent forms for future anonymous research. This database was maintained by the Interpretation Foundation of MJ Health Research Foundation. All or part of the data used in this study were authorized and provided by the foundation (Authorization Code: MJHRF2022009A). However, it is important to note that all interpretations and conclusions in this study are those of the authors and do not necessarily represent the views of the MJ Health Research Foundation.

The study protocol was reviewed and approved by the Institutional Review Board of National Yang Ming Chiao Tung University, Taiwan (IRB No. NCTU-REC-109-074E). All participants signed a written informed consent form after receiving a thorough explanation of the study’s purpose, procedures, and potential risks by trained research assistants. These assistants ensured that all explanations were delivered using clear and understandable language, allowing participants to fully comprehend the study. After ample time for questions and deliberation, participants who provided informed and voluntary consent signed the consent form.

A total of 2,152 participants who underwent both medical ultrasound diagnosis for NAFLD and three sessions of exhaled breath volatile organic compounds (VOCs) collection (a total of 6,363 records) were included initially.

The inclusion criteria are:

1. Subjects between 30-70
2. With data of VOCs

Our exclusion criteria are:

1. Having significant medical diseases such as myocardial infarction, stroke, or cancers
2. Having drinking alcohol habit
3. Miss important data such as age, body mass index (BMI) or blood pressure

After excluding 651 records due to data loss or specific conditions, the final analysis included 1,501 individuals, as shown in Figure 1.

FIGURE 1

Flowchart depicting data processing of subjects undergoing medical ultrasound for non-alcoholic fatty liver diagnosis. It starts with 2,152 subjects with three breath VOCs measurements averaged. Then, exclusion of 651 incomplete records results in 1,501 subjects with complete information for modeling. — The participants selection scheme.

Clinical assessments and biochemical analyses

Details of obtaining basic parameters such as BMI, blood pressure, collecting blood samples, and questionnaires could be referred to our previous publication.

Fatty liver diagnosis

The diagnosis of fatty liver was based on ultrasound features, including increased hepatic parenchymal brightness, liver-to-kidney contrast, deep beam attenuation, visible intrahepatic vessels, and gallbladder wall definition. Qualitative grading classified fatty liver into mild, moderate, or severe, corresponding to grades 1 to 3, respectively, with grade 0 representing a normal liver (Mahale et al., 2018; Dasarathy et al., 2009). For the purpose of this study, grades 1–3 were collectively defined as having fatty liver (NAFLD).

Variable selection

Seventeen clinical variables potentially associated with NAFLD were selected (listed in Table 1) as independent variables. NAFLD status (yes/no) was used as the dependent categorical variable.

TABLE 1

	Men	Women
N	653	848
Age (year)	42 ± 12	46 ± 12
Body mass index (kg/m²)	21.3 ± 2.52	24.9 ± 3.84***
Systolic blood pressure (mmHg)	111.2 ± 15.9	119.8 ± 16.9***
Diastolic blood pressure (mmHg)	72.1 ± 10.3	77.4 ± 11.1***
Fasting plasma glucose (mg/dL)	96.5 ± 9.36	104.0 ± 17.0***
Total bilirubin (mg/dL)	1.09 ± 0.388	1.04 ± 0.401*
Albumin (g/dL)	4.40 ± 0.222	4.43 ± 0.229***
Alkaline phosphatase (U/L)	56.7 ± 16.3	64.3 ± 18.9***
Serum glutamic oxaloacetic transaminase (U/L)	21.4 ± 6.97	24.7 ± 14.6***
Serum glutamic pyruvic transaminase (U/L)	20.8 ± 13.2	32.1 ± 29.4***
γ-Glutamyl transferase (U/L)	19.5 ± 14.9	31.6 ± 33.6***
Estimated glomerular filtration rate (ml/min/1.73 m2)	86.1 ± 13.2	83.3 ± 13.7***
Uric acid (mg/dL)	5.17 ± 1.17	6.02 ± 1.43***
Triglyceride (mg/dL)	69.1 ± 33.8	115.6 ± 74.2***
High density lipoprotein cholesterol (mg/dL)	58.2 ± 16.4	57.6 ± 15.0
Low density lipoprotein cholesterol (mg/dL)	110.9 ± 28.7	126.9 ± 35.3***
Alpha-fetoprotein (ng/mL)	2.78 ± 1.91	3.05 ± 1.86**
Beta-caryophyllene (87-44-5)	0.020 ± 0.148	0.010 ± 0.102
Furfural (98-01-1)	5.05 ± 15.4	4.69 ± 9.11
Tridecane (629-50-5)	1.08 ± 3.62	1.89 ± 7.48*
Butyl 2-methylbutanoate (15706-73-7)	0.134 ± 0.521	0.133 ± 0.683
Diethylethanolamine (100-37-8)	3.03 ± 9.19	2.44 ± 5.72
4-methyloctanoic acid (54947-74-9)	0.170 ± 0.743	0.119 ± 0.735
(S)-2-methyl-1-butanol (1565-80-6)	4.89 ± 37.5	3.85 ± 8.74
Urethane (51-79-6)	20.7 ± 64.1	18.2 ± 29.3
2-propanol (67-63-0)	71.4 ± 161.1	61.2 ± 205.4
Acetone (67-64-1)	1,221.1 ± 3,008.5	1,053.7 ± 4,885.6

Smoking	n (%)
Non-smoker	1140 (75)
Less than 10 cigarettes/week	59 (3.9)
10–20 cigarettes/week	129 (8.6)
20 cigarettes/week	54 (3.4)
More than 20 cigarettes/week	119 (7.6)

Sleeping	n (%)
Less than 6 h/day	17 (4.7)
6 h/day	375 (25)
7 h/day	713 (47.5)
8 h/day	347 (24.9)
9 h/day	39 (2.6)
More than 9 h/day	10 (7)

The demographic, biochemistry, and volatile organic compounds data of the study cohort.

Protocol for breath sample collection

All volunteer participants remained in a designated room under resting conditions for at least 10 min prior to sample collection. To minimize contamination, each participant was asked to rinse their mouth with unchlorinated water before exhaling through a mouthpiece connected to a three-way direct-connect valve.

Initially, exhaled breath passed through the first outlet, which was connected to a gas bag (SKC Inc., Eighty-Four, PA, United States) to estimate the volume of exhaled air. Once the volume of the initial exhalation reached approximately 0.3 L, the valve was switched to the second outlet, which was attached to a 1.0-L aluminum bag. This second bag was used to collect the late expiratory fraction, which is more representative of alveolar air and thus suitable for volatile organic compound (VOC) analysis.

To ensure adequate sample volume for analysis, the collection procedure was repeated two to three times as necessary. All collected breath samples were sealed, stored at room temperature (25°C), and analyzed within 48 h.

To validate the stability of VOCs under these storage conditions, a time-dependent analysis was conducted on ten breath samples. Samples were analyzed twice daily over three consecutive days. Comparison of the quantitative VOC data indicated that the majority of compounds remained stable during the storage period.

VOCs analysis using SIFT-MS

A selected-ion flow-tube mass spectrometry system (SIFT-MS; VOICE200 Ultra, Syft Technologies, Christchurch, New Zealand) was employed to analyze volatile organic compounds (VOCs) in the collected late expiratory breath fraction. It is a quantitative mass spectrometry technique used for real-time analysis of trace volatile compounds, especially volatile organic compounds (VOCs), in air, breath, or headspace above liquids without the need for sample preparation or chromatographic separation (Španěl and Smith, 2020).

In this method, selected precursor ions (H₃O⁺, NO⁺, and O₂⁺) are injected into a nitrogen carrier gas within the flow tube. When breath samples are introduced, VOCs present in the sample undergo ionization, resulting in the formation of characteristic product ions. These product ions are detected by a quadrupole mass spectrometer, which measures the count rates of both precursor and product ions in real time.

For VOCs with significant product ion overlap that could not be resolved using the tolerance setting, concentrations were reported on a relative scale. Statistical models were constructed based on both absolute concentrations and these relative measures to ensure robustness in VOC profiling and interpretation.

To assure the reproducibility and data reliability we standardized and calibrated with the following methods:

1. Traceable reference materials: Use primary standards (e.g., NIST-traceable mixtures) for instrument calibration and secondary/working standards for routine checks (RI-URBANS, 2024; Dusanter et al., 2025).
2. Matrix modifiers: Add salt solutions (e.g., NaCl) to normalize partitioning behavior of VOCs in complex samples, reducing bias from dissolved solutes or organic components (U.S. Environmental Protection Agency, 2014; Final Report, 2012).
3. Dynamic calibration: For instruments like PTR-MS, use gas standards with known VOC concentrations and proton transfer rate constants to calculate normalized sensitivities (Dusanter et al., 2025).

Machine learning-based analysis technology

While numerous studies have explored the application of VOC measurements for disease identification (Ratiu et al., 2020; Keogh and Riches, 2022; Chung et al., 2022), relatively few have focused on utilizing Mach-L techniques specifically for VOC profiling (Tsou et al., 2021; Shaffie et al., 2022; Sukaram et al., 2023). In this study, we employed ten distinct Mach-L algorithms to construct predictive models for diagnosing non-alcoholic fatty liver disease (NAFLD) based on VOCs collected from exhaled breath. To assess the impact of VOCs, models were developed both with and without VOC data, and their predictive performances were compared.

The ten machine learning techniques applied are as follows:

•Random Forest (RF): An ensemble learning method utilizing multiple unpruned decision trees for classification (Breiman, 2001).
• C5.0 Decision Trees (C5.0): A rule-based model using entropy, information gain, and gain ratio for decision tree construction (Quinlan, 2004).
•Stochastic Gradient Boosting (SGB): Combines bagging and boosting to construct additive regression tree models (Friedman, 2001).
•Multivariate Adaptive Regression Splines (MARS): A non-parametric regression technique using piecewise polynomial functions (Friedman, 1991).
•Classification and Regression Tree (CART): A decision tree model built using Gini impurity for splitting nodes (Breiman et al., 1984).
•Least Absolute Shrinkage and Selection Operator (Lasso): A linear model applying L1 regularization to perform feature selection (Hastie et al., 2015).
•Ridge Regression (Ridge): Similar to Lasso but uses L2 regularization for coefficient shrinkage (Hoerl and Kennard, 2000).
•Extreme Gradient Boosting (XGBoost): An optimized gradient boosting algorithm designed for speed and performance (Meng et al., 2016).
•Gradient Boosting with Categorical Features (CatBoost): A boosting technique optimized for categorical features using an ordered boosting method (Dorogush et al., 2018).
•Light Gradient Boosting Machine (LightGBM): A fast, histogram-based gradient boosting algorithm designed for efficiency and scalability (Ke et al., 2017).

Although Mach-L algorithms are capable of identifying key predictor variables, relying on a single method may lead to suboptimal and biased feature selection. To overcome this limitation, variable ensemble strategies are often employed, which integrate the outputs from multiple algorithms. Prior research indicates that such ensemble approaches enhance variable selection robustness, reducing both bias and variance (Pes, 2020; Moghimi et al., 2018; Tuli et al., 2019).

In this study, the variable importance values generated by each Mach-L model were averaged. The top 10 VOCs, ranked by average importance across all models, were selected for further discussion.

All analyses were conducted using the R programming language (version 4.1.2, R Core Team, Vienna, Austria) and RStudio (version 1.1.453) (R Core Team, 2017; RStudio Team, 2015). The following R packages were employed for model development: random Forest, C50, gbm, RWeka, kernlab, earth, rpart, glmnet, XGBoost, LightGBM, and cat boost. Heatmaps were visualized using the pheatmap package (version 2.6.2) (Browne, 2000; Gu, 2022).

To train and evaluate each Mach-L model, an 80/20 train-test split was used. The training set (80%) was used to construct models, while the testing set (20%) evaluated predictive performance. Hyperparameter tuning was conducted using 10-fold cross-validation (CV) to ensure optimal performance for each algorithm. The final model for each method was selected based on the best-performing configuration. Cross-validation procedures were executed using the caret package (version 6.0-93) (Kuhn, 2022).

In order to understand the directions and impacts of the variables, XGboost SHAP was applied using the following Python packages: SHAP, the core package for computing and visualizing SHAP values, provides interpretability for model predictions and feature importance. Pandas, a powerful library for data manipulation and preprocessing, was used to manage datasets, clean data, and prepare inputs for SHAP analysis. NumPy, a fundamental package for numerical computations, supported array operations and numerical calculations required by SHAP. Matplotlib, a plotting library for creating static, interactive, and animated visualizations, was employed to generate SHAP plots, including summary plots, bar plots, and waterfall plots.te feature contributions to specific predictions.

Performance evaluation metrics

To comprehensively evaluate the predictive performance of the Mach-L, we employed a range of widely accepted performance measures, as recommended in previous studies (Dias Canedo and Cordeiro Mendes, 2020; Hussain et al., 2021; Tomer and Sharma, 2022). Specifically, the following metrics were utilized in our analysis: accuracy (ACC), sensitivity (Sens), and specificity (Spec). These metrics provide an overall understanding of the model’s classification capabilities.

However, when dealing with imbalanced datasets, traditional metrics such as ACC, Sens, and Spec can be misleading, as they tend to be disproportionately influenced by the majority class distribution. To mitigate this issue, we additionally calculated balanced accuracy (BA) and area under the receiver operating characteristic curve (AUC)—both of which are considered more robust and reliable indicators for evaluating model performance under class imbalance conditions (

di Biase et al., 2020

;

Hashim et al., 2021

•Balanced Accuracy (BA) accounts for imbalanced data by averaging sensitivity and specificity.
•AUC provides a threshold-independent measure of a model’s ability to distinguish between classes.

The definitions and formulas for all performance metrics used in this study are detailed in (Tharwat, 2021).

To assess the impact of volatile organic compounds (VOCs) on model performance, we compared each Mach-L model’s predictive ability with and without VOC features. We applied DeLong’s test for pairwise comparison of AUC values between these two scenarios across all models (DeLong et al., 1988), allowing for a statistically grounded evaluation of VOCs’ contribution to predictive improvement.

Results

A total of 1,501 participants were included in the present study. Table 1 presents the demographic and clinical characteristics of the participants, stratified by the presence or absence of non-alcoholic fatty liver disease (NAFLD).

As expected, participants diagnosed with NAFLD exhibited significantly higher values across several variables, including age, body mass index (BMI), blood pressure, and various biochemical markers, compared to those without NAFLD. The only exceptions were estimated glomerular filtration rate (eGFR) and high-density lipoprotein cholesterol (HDL-C), which did not follow the same trend.

Among all examined variables, the most influential predictors for identifying NAFLD were found to be: BMI, Triglycerides (TG), Uric acid (UA), Fasting plasma glucose (FPG), Gamma-glutamyl transferase (GGT), Gender, Low-density lipoprotein cholesterol (LDL-C) and Sleeping hours.

In parallel, VOC profiling using 10 different machine learning (Mach-L) techniques identified 10 key VOCs as significant predictors for NAFLD. Ranked from most to least important, these compounds are: 2-Propanol, Acetone, Butyl 2-methylbutanoate, Diethylethanolamine, Urethane, β -Caryophyllene, Furfural, Tridecane, 4-Methyloctanoic acid and (S)-2-Methyl-1-butanol.

Table 2 displays the comparative concentrations of these 10 VOCs in subjects with and without NAFLD, along with their corresponding rankings based on variable importance across the Mach-L models.

TABLE 2

Characteristic (mean ± SD)	NAFLD	Control groups	P value
n (%)	848 (56.50%)	653 (43.50%)
2-propanol	61.28 ± 205.46	71.42 ± 161.10	0.284
Acetone	1053.72 ± 4885.64	1221.07 ± 3008.48	0.414
butyl 2-methylbutanoate	0.13 ± 0.68	0.13 ± 0.52	0.955
diethylethanolamine	2.45 ± 5.73	3.03 ± 9.20	0.158
urethane	18.24 ± 29.31	20.72 ± 64.05	0.360
beta-caryophyllene	0.01 ± 0.10	0.02 ± 0.15	0.296
furfural	4.69 ± 9.11	5.05 ± 15.38	0.599
tridecane	1.89 ± 7.48	1.08 ± 3.62	0.006
4-methyloctanoic acid	0.12 ± 0.74	0.17 ± 0.74	0.190
(S)-2-methyl-1-butanol	3.85 ± 8.75	4.90 ± 37.47	0.485
methanamide	10.32 ± 16.59	12.21 ± 74.70	0.528
1-nonene	6.51 ± 16.03	5.23 ± 8.66	0.049
isobutane	160.08 ± 392.34	165.93 ± 510.95	0.808
trimethylamine	87.84 ± 306.13	104.05 ± 242.21	0.252
6-methyl-5-hepten-2-one	1.92 ± 2.50	1.93 ± 3.19	0.911
pyridine	1.60 ± 2.75	1.49 ± 2.51	0.414
benzoic acid	0.06 ± 0.22	0.08 ± 0.28	0.129
3-buten-2-one	14.13 ± 27.59	15.42 ± 55.93	0.587
propyl acetate	10.98 ± 14.32	15.81 ± 116.80	0.293
propyne	0.97 ± 3.55	0.68 ± 1.77	0.035

t-test comparing volatile organic compounds in subjects with and without NAFLD.

Data are presented as means ± standard deviation (SD) or numbers (%) as in the case; P values of excess statistically significant are from the t-test comparing subjects with and without NAFLD. All the statistical tests of independence were two-sided. Abbreviations: NAFLD, non-alcoholic fatty liver disease.

Model performance evaluation

The predictive performance of all 10 machine learning (Mach-L) methods is summarized in Table 3. Across all methods, Model 2—which incorporated volatile organic compounds (VOCs)—demonstrated superior performance compared to Model 1, which only included demographic, biochemical, and lifestyle variables. Specifically, accuracy (ACC), sensitivity (Sens), specificity (Spec), BA, and AUC were all improved in Model 2.

TABLE 3

Methods	ACC		Sens		Spec		AUC		BA
Methods	Model 1	Model 2	Model 1	Model 2	Model 1	Model 2	Model 1	Model 2	Model 1	Model 2
RF	0.734	0.777	0.639	0.858	0.799	0.713	0.777	0.848	0.719	0.785
C5.0	0.691	0.688	0.762	0.843	0.643	0.563	0.683	0.765	0.702	0.703
SGB	0.718	0.784	0.705	0.769	0.726	0.796	0.773	0.854	0.716	0.783
MARS	0.731	0.781	0.680	0.769	0.765	0.790	0.768	0.846	0.723	0.780
CART	0.714	0.781	0.598	0.716	0.793	0.832	0.696	0.781	0.696	0.774
Lasso	0.718	0.787	0.689	0.754	0.737	0.814	0.768	0.866	0.713	0.784
Ridge	0.741	0.744	0.631	0.881	0.816	0.635	0.780	0.832	0.723	0.758
XGBoost	0.688	0.791	0.885	0.828	0.553	0.761	0.784	0.861	0.719	0.794
CatBoost	0.744	0.787	0.697	0.769	0.777	0.802	0.779	0.860	0.737	0.786
LightGBM	0.761	0.787	0.697	0.866	0.805	0.725	0.792	0.860	0.751	0.795

Results of machine learning in Model 1 (without VOCs) and Model 2 (with VOCs).

VOC, volatile organic compound; ACC, accuracy; Sens, sensitivity; Spec, specificity; AUC, area under curve; BA, balanced accuracy; Model 1, without VOCs; Model 2, with VOCs; RF, random forest; C5.0, C5.0 decision trees; SGB, stochastic gradient boosting; MARS, multivariate adaptive regression splines; CART, classification and regression tree; Lasso, least absolute shrinkage and selection operator; Ridge, ridge regression; XGBoost, extreme gradient boosting; CatBoost, gradient boosting with categorical features support; LightGBM, light gradient boosting machine.

These findings suggest that the inclusion of VOCs significantly enhanced the predictive accuracy of the models in identifying individuals with NAFLD. The confusion matrices for Models 1 and 2 are presented in Figure 2, while Figure 3 illustrates the respective AUC curves for each model. Additionally, the heatmap of the top 10 VOCs identified across the Mach-L algorithms is shown in Figure 4, highlighting their relative importance in the classification task.

FIGURE 2

Confusion matrices compare two models (Model 1 and model 2) for different algorithms (RF, C5.0, SGB, MARS, CART, Lasso, Ridge,XGBoost, CatBoost, and LightGBM). Each matrix shows true positives, false positives, true negatives, and false negatives for analysis. — The Confusion matrix of Model 1 and 2 for each machine learning methods. Model 1, without VOCs; Model 2, with VOCs; RF, random forest; C5.0, C5.0 decision trees; SGB, stochastic gradient boosting; MARS, multivariate adaptive regression splines; CART, classification and regression tree; Lasso, least absolute shrinkage and selection operator; Ridge, ridge regression; XGBoost, extreme gradient boosting; CatBoost, gradient boosting with categorical features support; LightGBM, light gradient boosting machine. **(a)** Model 1 o RF. **(b)** Model 2 o RF. **(c)** Model 1 of C5.0. **(d)** Model 2 of C5.0. **(e)** Model 1 of SGB. **(f)** Model 2 of SGB. **(g)** Model 1 of MARS. **(h)** Model 2 of MARS. **(i)** Model 1 of CART. **(j)** Model 2 of CART. **(k)** Model 1 of Lasso. **(l)** Model 2 of Lasso. **(m)** Model 1 of Ridge. **(n)** Model 2 of Ridge. **(o)** Model 1 of XGBoost. **(p)** Model 2 of XGBoost. **(q)** Model 1 of CatBoost. **(r)** Model 2 of CatBoost. **(s)** Model 1 of LightGBM. **(t)** Model 2 of LightGBM.

FIGURE 3

Two ROC curve plots comparing model performance are shown. Model 1 includes several algorithms with their AUC values, such as RF (0.777), C5.0 (0.683), and XGBoost (0.784). Model 2 displays similar algorithms with AUC values, including RF (0.848), C5.0 (0.765), and XGBoost (0.861). Each plot assesses sensitivity versus one minus specificity. — The area under receiver operation curve in model 1 and 2 for all the machine learning methods. Model 1, without VOCs; Model 2, with VOCs; RF, random forest; C5.0, C5.0 decision trees; SGB, stochastic gradient boosting; MARS, multivariate adaptive regression splines; CART, classification and regression tree; Lasso, least absolute shrinkage and selection operator; Ridge, ridge regression; XGBoost, extreme gradient boosting; CatBoost, gradient boosting with categorical features support; LightGBM, light gradient boosting machine.

FIGURE 4

Heatmap displaying hierarchical clustering of various biomarkers such as albumin, uric acid, and body mass index, among others. Color gradients represent Row Z-Scores, ranging from 0 to 0.8. Additional categories include sleeping hours, gender, and NAFLD status, with corresponding color codes. — Heatmap of the top 10 volatile organic compounds identified across the machine learning methods.

Table 4 presents the pairwise comparisons of AUC values for the 10 Mach-L methods, evaluating the improvement in predictive performance with the inclusion of VOCs compared to models without VOCs. The results indicate that for all methods, the inclusion of VOCs led to a statistically significant improvement in model performance, as evidenced by p-values less than 0.05 across all comparisons. These findings suggest that incorporating VOC data into the Mach-L models for NAFLD diagnosis results in significantly enhanced predictive accuracy compared to models that exclude VOCs.

TABLE 4

Methods	Model 1	Model 2	Difference Model 1 and Model 2	p-value
RF	0.777	0.848	7.10%	−2.050 (0.040)
C5.0	0.683	0.765	8.20%	−1.988 (0.047)
SGB	0.773	0.854	8.10%	−2.384 (0.017)
MARS	0.768	0.846	7.80%	−2.159 (0.031)
CART	0.696	0.781	8.50%	−2.304 (0.021)
Lasso	0.768	0.866	9.80%	−2.846 (0.004)
Ridge	0.780	0.832	5.20%	−1.471 (0.041)
XGBoost	0.784	0.861	7.70%	−2.278 (0.023)
CatBoost	0.779	0.860	8.10%	−2.403 (0.016)
LightGBM	0.792	0.860	6.80%	−2.014 (0.044)

Pairwise comparisons of the area under curve values between in Model 1 (without VOCs) and Model 2 (with VOCs) using DeLong’s test.

VOC, volatile organic compound; Model 1, without VOCs; Model 2, with VOCs. The numbers in table are the corresponding p-values. p < 0.05 was considered statistically significant.

Table 5 displays the most important predictive factors identified by the Mach-L methods, encompassing demographic, biochemical, lifestyle, and VOC-related variables. In total, 25 factors were selected, including the top 10 VOCs. Among the non-VOC variables, BMI emerged as the most influential predictor, followed by triglycerides (TG), uric acid (UA), fasting plasma glucose (FPG), γ -glutamyl transferase (γ -GT), gender, GPT, LDL-cholesterol, sleep duration, albumin, total bilirubin, alkaline phosphatase, GOT, HDL-cholesterol, and diastolic blood pressure (DBP). Notably, beginning from the 10th rank in overall importance, 2-propanol was the first VOC to appear. The complete list of VOCs identified is detailed in the Methods section.

TABLE 5

Variables	RF	C5.0	SGB	MARS	CART	Lasso	Ridge	XGBoost	Catboost	LightGBM	Mean	ROI
Body mass index (kg/m²)	100	100	100	100	100	100	100	100	100	100	100.00	1
Triglyceride (mg/dL)	53.56	100	46.48	52.75	87.96	3.34	2.8	52.45	64.16	33.9	49.74	2
Uric acid (mg/dL)	7.06	32.42	2.27	30.95	39.63	22.12	80	0	1.43	4.42	22.03	3
FPG (mg/dL)	5.61	51.5	3.3	0	0	3.76	8	2.31	7.43	1.18	8.31	4
γ-GT (U/L)	10.34	0	6.35	0	47.7	0	1	6.5	6.72	2.7	8.13	5
Gender	0.07	1.17	0	0	0	0	67	0	0	0	6.82	6
GPT (U/L)	7.62	0	4.22	0	38.8	1.44	4.3	1.57	4.16	1.69	6.38	7
LDL-cholesterol (mg/dL)	5.4	19.75	3.95	14.61	8.33	1.33	2.8	1.31	2.63	2.64	6.28	8
Sleep time	0.74	11.17	0	0	0	0	39	0	0.36	0	5.13	9
2-propanol	1.63	35.92	2.04	0	7.95	0	0	0.43	2.75	0	5.07	10
Albumin (g/dL)	0.97	4.67	0	0	0	0	42	0	0	0	4.76	11
Acetone	1.4	0	0	35.85	0	0	0	0	0.33	1.29	3.89	12
Butyl 2-methylbutanoate	0.26	0	0	0	0	0	32	0	4.31	0	3.66	13
Total bilirubin (mg/dL)	3.35	17.33	2.09	8.83	0	0	0	0	1.86	1.99	3.55	14
AP (U/L)	2.96	22.92	0	0	0	0.08	2.1	0.47	1.07	1.38	3.10	15
Diethylethanolamine	1.21	0	3.73	21.85	0	0	0	0	2.19	1.26	3.02	16
urethane	0.75	4.08	0	24.32	0	0	0	0	0	0	2.92	17
GOT (U/L)	1.3	0.67	0	21.85	0	0	0.11	0	4.42	0.24	2.86	18
HDL-cholesterol (mg/dL)	2.56	17.08	2.09	2.37	0	0	0.93	0	0	0.6	2.56	19
β-caryophyllene	0	0	0	0	0	0	25	0	0	0	2.50	20
DBP (mmHg)	3.19	13.58	0	0	0	1.33	4.8	0	1.33	0.53	2.48	21
Furfural	0.87	22.42	0	0	0	0	0	0	0	0	2.33	22
Tridecane	0.58	15.08	0	0	0	0	4.7	0	2.05	0.07	2.25	23
4-methyloctanoic acid	0.34	21.67	0	0	0	0	0	0	0	0	2.20	24
(S)-2-methyl-1-butanol	0.62	21	0	0	0	0	0	0	0.04	0	2.17	25
…	…	…	…	…	…	…	…	…	…	…	…	…

The variables selected and the mean and rank of important values by ten machine learning methods.

Abbreviations: ROI, rank of importance; FPG, fasting plasma glucose; γ-GT, γ-glutamyl transferase; GPT, serum glutamic pyruvic transaminase; AP, alkaline phosphatase; GOT, serum glutamic oxaloacetic transaminase; DBP, diastolic blood pressure.

The Bee Swarm plot derived from the XGBoost SHAP was shown in Figure 5. From top to the bottom listed the features selected and the higher horizontal feature indicates it is more important. Each circle represents a participant’s value impact of that feature. The red color has stronger impact whilst the blue one has less. Thus, this figure shows the direction of impact of each participants. Finally, Figure 6 shows the absolute strengths each feature from the highest to the lowest.

FIGURE 5

Dot plot showing SHAP values for various features impacting a model's output. Features include BMI, TG, and Age. Blue to pink gradient indicates low to high feature values. BMI shows the widest range of impact. — The Bee Swarm plot derived from Shapley addictive explanation of eXtreme Gradient Boosting. Note: BMI: body mass index; TG, Triglycerides; LDL-C, Low density lipoprotein cholesterol; GPT, Serum glutamic pyruvic transaminase; FPG, Fasting plasma glucose; T-Bili, Total bilirubin; GOT, Serum glutamic oxaloacetic transaminase; UA, Uric acid; SBP, Systolic blood pressure; HDL-C, High density lipoprotein cholesterol; AFP, Alpha-fetoprotein; eGFR, estimated Glomerular filtration rate; γ-GT, Gamma glutamyl transpeptidase.

FIGURE 6

Bar chart showing mean SHAP values indicating the impact of features on model output. BMI has the highest impact, followed by TG, 2-propanol, LDL-C, and Age. Other features have progressively smaller impacts. — The absolute Shapley addictive explanation values of each feature. Note: BMI, body mass index; TG, Triglycerides; LDL-C, Low density lipoprotein cholesterol; GPT, Serum glutamic pyruvic transaminase; FPG, Fasting plasma glucose; T-Bili, Total bilirubin; GOT, Serum glutamic oxaloacetic transaminase; UA, Uric acid; SBP, Systolic blood pressure; HDL-C, High density lipoprotein cholesterol; AFP, Alpha-fetoprotein; eGFR, estimated Glomerular filtration rate; γ-GT, Gamma glutamyl transpeptid.

Discussion

To the best of our knowledge, this study represents the largest cohort to date in the field of breath-based diagnostics for NAFLD, with 1,501 participants included. Previous related studies typically involved fewer than 100 subjects (Grewal and Mahmood, 2009; Chen et al., 2015), thereby limiting the generalizability and statistical power of their findings. Additionally, those studies primarily employed traditional statistical methods, which often fail to capture non-linear relationships among complex variables. In contrast, our study applied 10 different Mach-L algorithms, demonstrating that the inclusion of 341 VOCs in Model 1 led to a notable improvement in AUC, ranging from 5.20% to 9.80%, across different modeling approaches.

Volatile organic compounds (VOCs)—produced through endogenous metabolism, microbiota activity, and various cellular processes—hold significant potential as non-invasive biomarkers for disease detection. One of the greatest strengths of VOC-based diagnostics lies in their non-invasive nature, making them ideal for monitoring chronic conditions, tracking disease progression, and conducting large-scale population screening where invasive procedures are impractical. Furthermore, VOCs may offer early indicators of disease, enabling prompt diagnosis and intervention. This is particularly crucial for diseases like cancer and metabolic disorders, where early detection significantly enhances treatment outcomes.

By constructing a quantitative VOCs library from both sub-healthy and diseased individuals, predictive models and diagnostic algorithms can be refined to detect diseases at earlier stages. Importantly, the combination of VOC data with traditional clinical parameters and biomarkers enhances the accuracy and robustness of predictive models. Thus, the integration of quantitative VOC analysis has great potential to advance preventive medicine and revolutionize early disease detection.

Nevertheless, for VOCs to be effectively implemented in clinical practice, further research and validation are essential. Our present study lays a solid foundation for enhancing non-invasive prediction of NAFLD, and additional studies based on these findings are currently underway. Compared to previous research utilizing breathomics in NAFLD patients (Table 6) (Akesson, 1977; Alkhouri et al., 2014; Calabrese et al., 2023), our study offers a more comprehensive investigation, not only due to its larger sample size, but also through the simultaneous consideration of VOCs and clinical data, and the application of multiple machine learning algorithms for predictive modeling—an approach not previously explored in this field. However, it should be noted that, at this stage, due to two reasons the application of VOC in clinical practice is not practical; first, the sensitivity and specificity are not high enough; second, the cost of VOCs is still high.

TABLE 6

Technique	Sample size	# Of VOCs	Methods	References
GC-MS	46	127	Partial least square discriminant analysis	Calabrese et al., 2023
SIFT-MS	60	14	Canonical discriminant analysis	Alkhouri et al., 2014
GC-MS	60	220	Univariate analysis	Raman et al., 2013

Analysis pipelines of studies using breath-based VOCs towards non-alcoholic fatty liver prediction.

In the present study we did use XGBoost in order to examine the directions and impacts of each variable. The interpretation was given in the results section and, out of the 20 features, 6 different VOCs were selected and 2-propanol was the third important VOCS.

From the initial 341 VOCs analyzed, the top 10 most relevant compounds were identified using machine learning algorithms. Ranked by importance, these VOCs were: 2-propanol, acetone, butyl 2-methylbutanoate, diethylethanolamine, urethane, β-caryophyllene, furfural, tridecane, 4-methyloctanoic acid, and (S)-2-methyl-1-butanol.

The gold standard for diagnosing NAFLD remains liver biopsy (Brunt et al., 1999; Kleiner et al., 2005), yet this approach is invasive and carries a complication risk of approximately 0.5% (Bravo et al., 2001; Piccinino et al., 1986). Alternative, less invasive methods such as the Fibrosis-4 Index, which incorporates age, liver enzymes, and platelet count, have shown a positive predictive value (PPV) of around 80% (Author Anonymous, 2025). Likewise, ultrasound has demonstrated high sensitivity and specificity (84.8% and 93.6%, respectively) in detecting moderate to severe steatosis. Given this, one might argue that VOC analysis is more labor-intensive and costly. However, the primary value of VOCs lies in their potential to uncover novel insights into NAFLD pathophysiology, which could eventually lead to new therapeutic targets. Importantly, this study did not establish a causal relationship between VOCs and NAFLD.

As expected, the top-ranking variables in our analysis were traditional risk factors, including BMI, triglycerides (TG), fasting plasma glucose (FPG), among others. The first VOC to appear in the list was 2-propanol. The well-established impact of traditional clinical variables should not be overlooked (Huh et al., 2022), but it must also be recognized that these factors may confound the identification of VOCs truly associated with NAFLD. To address this, our machine learning models adjusted for the effects of conventional predictors, allowing for a more accurate evaluation of VOC contributions.

Below is a brief discussion of the top 10 VOCs identified:

2-Propanol

Lu et al. demonstrated that subchronic exposure to 2-propanol in mice induced NAFLD through dysregulation of the AMPK signaling pathway (Lu et al., 2015). Interestingly, in our study, 2- propanol levels were lower in NAFLD subjects, suggesting possible upregulation of AMPK as a protective or compensatory modality (Fang et al., 2022). However, the specific mechanistic studies on 2-propanol and fatty liver are limited, its known hepatotoxicity and the findings from animal studies support the possibility that 2-propanol exposure can contribute to fatty liver development, especially with high or prolonged exposure (Satapathy et al., 2015; World HealthOrganizatiom-INTERNATIONAL P ROGRAMME ON CHEMICALSAFETY, 1990).

Acetone

Solga et al. previously reported that breath acetone was associated with NAFLD in morbidly obese patients undergoing bariatric surgery (Solga et al., 2006). This may reflect decreased d-3- hydroxybutyrate dehydrogenase activity or altered NADH levels, leading to acetone accumulation. There might be two mechanisms behind this relationship. First, enhancing ketogenesis reduces hepatic lipid accumulation in preclinical models. Exogenous ketones (e.g., β-hydroxybutyrate) show anti-inflammatory and antifibrotic effects, suggesting protective roles (Kwon et al., 2024). Second, while elevated acetone may indicate metabolic stress in early NAFLD, targeted ketone supplementation or ketogenic diets could mitigate steatosis and inflammation in specific contexts (Kwon et al., 2024). Our findings were consistent, with higher acetone levels in NAFLD subjects, though the difference was not statistically significant.

Butyl 2-methylbutanoate

This fatty acid ester, found naturally in apricots (Prunus armeniaca), has been associated with celiac disease and IBS (pubchem, 2025). Only one prior study has investigated its link to NAFLD, finding higher prevalence in NAFLD patients (Raman et al., 2013). It might have influences on NAFLD due to the gut microbiota alterations which correlates with elevated 2-butanone (a structurally related ketone), hinting at broader metabolic disruptions involving ester-like compounds (Del et al., 2016). In the same time, methyl tert-butyl ether, another ether compound, shows epidemiological links to NAFLD risk in humans, suggesting potential shared mechanisms for ester/ether-induced metabolic dysfunction (Cui et al., 2024). Additional research is needed to elucidate its pathophysiological role.

Diethylethanolamine

Akesson reported that diethylethanolamine promotes the conversion of phosphatidylethanolamine to phosphatidylcholine, a hepatoprotective compound (Akesson, 1977). However, at present, there is no direct evidence linking it to NAFLD. It is well known that alcohol metabolism produces acetaldehyde and reactive oxygen species that cause fatty liver. While diethylethanolamine is not an alcohol, its metabolism might theoretically produce reactive intermediates that could similarly affect hepatic cells (Liu, 2014). Our findings—lower levels in NAFLD subjects—support this mechanism and suggest its potential protective role.

Urethane

Studies in rats demonstrate that administration of carcinogenic doses of urethane leads to liver microsomal damage, including degranulation of liver microsomes, which impairs liver cell function and contributes to hepatic injury (Dani, 1983). In human, it is found that liver injury in workers exposed to N,N-dimethylformamide (DMF) (Nakasone et al., 2011; Nomiyama et al., 2001; Redlich et al., 1988), urethane levels were higher in the control group in our study. While this may indicate resistance to hepatic injury, further evidence is required to substantiate this hypothesis.

β-caryophyllene

This anti-inflammatory, plant-derived compound activates CB2 receptors, reducing oxidative stress and hepatic injury in mice models (Varga et al., 2018). In the same time, it could reduce intracellular lipid accumulation, primarily by lower saturated fatty acids and modifying the lipid profile toward less harmful species (Scandiffio et al., 2023). Our findings align with this, supporting its potential therapeutic use.

Furfural

Interestingly, furfural has a complex relationship with liver health. In low dose, it could improve mitochondrial function, reduced reactive oxygen species, and restoration of the NAD⁺/NADH redox balance, which is crucial for lipid metabolism and preventing fatty liver progression (Cheng et al., 2022). But when it is in high dose, a Maillard reaction product with antioxidant properties, furfural has demonstrated hepatocyte-protective effects in animal studies (Powell et al., 2014). This may explain its relevance in our NAFLD model.

Tridecane

There is currently no direct evidence or well-established research linking tridecane specifically to NAFLD or its progression. Tridecane is a hydrocarbon (alkane) commonly found in petroleum products and some environmental pollutants, but its direct impact on liver fat accumulation or liver metabolism has not been clearly documented. The only evidence is that tridecane has been associated with inflammation and lipid peroxidation, particularly in distinguishing NASH from non-NASH (The good scents company, 2025).

4-methyloctanoic acid

Although largely known for its use in food flavoring, this compound is a fatty acid and may reflect metabolic changes. The possible mechanisms include it is a BCFA involved in lipid metabolism; it can modulate gene expression related to fatty acid metabolism; and there is an indirect link to the NAVLD (Zhao et al., 2022; Liu et al., 2018; Pooya et al., 2012). We observed higher levels in non-NAFLD subjects, potentially due to better hepatic metabolism in healthier individuals (Yamaguchi et al., 2007).

(S)-2-methyl-1-butanol

Produced by Saccharomyces cerevisiae, this compound has antioxidant properties (Wilson et al., 2022; Agarwal et al., 2020; Landolfo et al., 2008), which may be linked to liver protection. In the same time, it is a fatty alcohol lipid molecule involved in metabolic pathways related to fatty acid and alcohol metabolism. It can be oxidized to 2-methylbutyrate, which then enters beta-oxidation to produce acetyl-CoA and propionyl-CoA, key intermediates in energy metabolism (Thompson et al., 2020). This indicates that (S)-2-methyl-1-butanol is metabolically linked to fatty acid catabolism through its conversion to fatty acid derivatives that feed into mitochondrial energy pathways. While no prior study has evaluated its relationship with NAFLD, our findings suggest it may play a role in hepatic defense mechanisms.

Limitations

Despite the strengths of our study—including a large sample size and comprehensive VOC profiling—there are important limitations. In the present study, we do have limitations. First, this is a cross-sectional study which is less persuasive than a longitudinal one. There is no conclusion of cause-effect relationship could be drawn. However, since some of these participants will continue to have a health check up in our clinic, in the future, we believe that we will have a longitudinal study. Second, the participants were only limited to Taiwanese. It should be exercised to another ethnic group with cautious. In the future, since these participants will remain to be followed up in the MJ clinic, longitudinal studies could be done by using the present results of VOCs to predict future diseases. In the same time, we will separate these participants into two groups; one for selecting the VOCs, the rest will be treated as a validation group.

Conclusion

Using 10 different Mach-L algorithms, we identified the relative importance of both clinical parameters and volatile organic compounds (VOCs) in predicting non-alcoholic fatty liver disease (NAFLD) within a cohort of 1,501 participants. Among clinical variables, the most influential features included body mass index (BMI), triglycerides (TG), uric acid (UA), fasting plasma glucose (FPG), γ-glutamyltransferase (γ-GT), gender, low-density lipoprotein cholesterol (LDL-C), and sleep duration.

In addition to traditional clinical predictors, 2-propanol emerged as the most influential VOC, followed in descending order by acetone, butyl 2-methylbutanoate, diethylethanolamine, urethane, β-caryophyllene, furfural, tridecane, 4-methyloctanoic acid, and (S)-2-methyl-1- butanol. The potential biological relevance and mechanisms of action of these VOCs were discussed in relation to liver metabolism and disease pathology.

While this study offers new insights into the role of VOCs as non-invasive biomarkers for NAFLD, its cross-sectional design limits the ability to determine causality. Future research should aim to conduct longitudinal studies to further elucidate the cause-effect relationships between VOCs and the development or progression of NAFLD.

Statements

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, upon reasonable request.

Ethics statement

The studies involving humans were approved by Institutional Review Board of National Yang Ming Chiao Tung University, Taiwan (IRB No. NCTU-REC-109-074E). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

C-HS: Conceptualization, Investigation, Writing – original draft. R-HH: Resources, Validation, Writing – original draft. Y-KL: Conceptualization, Project administration, Writing – review and editing. T-WC: Resources, Writing – review and editing. DP: Data curation, Supervision, Writing – review and editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was financially supported by the project MOST 109-2634-F-009-028 and the Center for Emergent Functional Matter Science of National Yang Ming Chiao Tung University from The Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education (MOE) in Taiwan. This work was also financially supported by grants from the Tri-Service General Hospital (Award numbers: TSGH-E-113257, TSGH-E-114259), Ministry of National Defense, Medical Affairs Bureau (Award number: MND-MAB-D-113083, MND-MAB-D-114107), Teh-Tzer Study Group for Human Medical Research Foundation (Award number: B1141005). The funding sources had no role in the study design, in the collection, analysis, and interpretation of data, in the writing of the report, and in the decision to submit the article for publication.

Acknowledgments

The authors acknowledge the research collaboration and technical service supported by National Human Microbiome Core Facility, Taiwan (NSTC 112-2740-B-A49-002).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1
Agarwal P. Patel K. More P. Sapara K. K. Singh V. K. Agarwal P. K. (2020). The AlRabring7 E3-Ub-ligase mediates AlRab7 ubiquitination and improves ionic and oxidative stress tolerance in Saccharomyces cerevisiae. Plant Physiol. biochem.151, 689–704. 10.1016/j.plaphy.2020.03.030
- CrossRef
- Google Scholar
2
Akesson B. (1977). Effects of analogles of ethanolamine and choline on phospholipid metabolism in rat hepatocytes. Biochem. J.168 (3), 401–408. 10.1042/bj1680401
- CrossRef
- Google Scholar
3
Alkhouri N. Cikach F. Eng K. Moses J. Patel N. Yan C. et al (2014). Analysis of breath volatile organic compounds as a noninvasive tool to diagnose nonalcoholic fatty liver disease in children. Eur. J. Gastroenterol. Hepatol.26 (1), 82–87. 10.1097/MEG.0b013e3283650669
- CrossRef
- Google Scholar
4
Arkoudis N. A. Papadakos S. P. (2025). Machine learning applications in healthcare clinical practice and research. World J. Clin. Cases13 (1), 99744. 10.12998/wjcc.v13.i1.99744
- CrossRef
- Google Scholar
5
Author Anonymous (2025). Hepatology. Available online at: http://gihep.com/calculators/hepatology/fbrosis-4-score/.
- Google Scholar
6
Bravo A. A. Sheth S. G. Chopra S. (2001). Liver biopsy. N. Engl. J. Med.344 (7), 495–500. 10.1056/NEJM200102153440706
- CrossRef
- Google Scholar
7
Breiman L. Olshen (2001). Random forests. Mach. Learn45 (1), 5–32. 10.1023/A:1010933404324
- CrossRef
- Google Scholar
8
Breiman L. Friedman J. Olshen R. Stone C. (1984). Classification and regression trees. Wadsworth Int. Group.237–251. 10.1201/9781315139470
- CrossRef
- Google Scholar
9
Browne M. W. (2000). Cross-validation methods. J. Math. Psychol.44 (1), 108–132. 10.1006/jmps.1999.1279
- CrossRef
- Google Scholar
10
Brunt E. M. Janney C. G. Di Bisceglie A. M. Neuschwander-Tetri B. A. Bacon B. R. (1999). Nonalcoholic steatohepatitis: a proposal for grading and staging the histological lesions. Am. J. Gastroenterol.94 (9), 2467–2474. 10.1111/j.1572-0241.1999.01377.x
- CrossRef
- Google Scholar
11
Calabrese F. M. Celano G. Bonfiglio C. Campanella A. Franco I. Annunziato A. et al (2023). Synergistic effect of diet and physical activity on a NAFLD cohort: metabolomics profile and clinical variable evaluation. Nutrients15 (11), 2457. 10.3390/nu15112457
- CrossRef
- Google Scholar
12
Chen C. H. Huang M. H. Yang J. C. Nien C. K. Yang C. C. Yeh Y. H. et al (2006). Prevalence and risk factors of nonalcoholic fatty liver disease in an adult population of Taiwan: metabolic significance of nonalcoholic fatty liver disease in nonobese adults. J. Clin. Gastroenterol.40 (8), 745–752. 10.1097/00004836-200609000-00016
- CrossRef
- Google Scholar
13
Chen P. Torralba M. Tan J. Embree M. Zengler K. Stärkel P. et al (2015). Supplementation of saturated long-chain fatty acids maintains intestinal eubiosis and reduces ethanol-induced liver injury in mice. Gastroenterology148 (1), 203–214.e16. 10.1053/j.gastro.2014.09.014
- CrossRef
- Google Scholar
14
Cheng Z. Luo X. Zhu Z. Huang Y. Yan X. (2022). Furfural produces dose-dependent attenuating effects on ethanol-induced toxicity in the liver. Front. Pharmacol.13, 906933. 10.3389/fphar.2022.906933
- CrossRef
- Google Scholar
15
Chung J. Akter S. Han S. Shin Y. Choi T. G. Kang I. et al (2022). Diagnosis by volatile organic compounds in exhaled breath in exhaled breath from patients with gastric and colorectal cancers. Int. J. Mol. Sci.24 (1), 129. 10.3390/ijms24010129
- CrossRef
- Google Scholar
16
Cui F. Wang H. Guo M. Sun Y. Xin Y. Gao W. et al (2024). Methyl tert-Butyl ether may be a potential environmental pathogenic factor for nonalcoholic fatty liver disease: results from NHANES 2017–2020. Washington, DC: ACS publication. 10.1021/envhealth.4c00140
- CrossRef
- Google Scholar
17
Dani H. M. (1983). Effects of toxic doses of urethane on rat liver and lung microsomes. Toxicol. Lett.15 (1), 61–64. 10.1016/0378-4274(83)90170-4
- CrossRef
- Google Scholar
18
Dasarathy S. Dasarathy J. Khiyami A. Joseph R. Lopez R. McCullough A. J. (2009). Validity of real time ultrasound in the diagnosis of hepatic steatosis: a prospective study. J. Hepatol.51 (6), 1061–1067. 10.1016/j.jhep.2009.09.001
- CrossRef
- Google Scholar
19
Del C. F. Nobili V. Vernocchi P. Russo A. De S. C. Gnani D. et al (2016). Gut microbiota profiling of pediatric nonalcoholic fatty liver disease and obese patients unveiled by an integrated meta‐omics‐based approach. Hepatology65 (2), 451–464. 10.1002/hep.28572
- CrossRef
- Google Scholar
20
DeLong E. R. DeLong D. M. Clarke-Pearson D. L. (1988). Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics44 (3), 837–845. 10.2307/2531595
- CrossRef
- Google Scholar
21
Dias Canedo E. Cordeiro Mendes B. (2020). Software requirements classification using machine learning algorithms. Entropy22 (9), 1057. 10.3390/e22091057
- CrossRef
- Google Scholar
22
di Biase L. Di Santo A. Caminiti M. L. De Liso A. Shah S. A. Ricci L. et al (2020). Gait analysis in parkinson's disease: an overview of the Most accurate markers for diagnosis and symptoms monitoring. Sensors20 (12), 3529. 10.3390/s20123529
- CrossRef
- Google Scholar
23
Dorogush A. V. Ershov V. Gulin A. (2018). CatBoost: gradient boosting with categorical features support. arXiv. 10.48550/arXiv.1810.11363
- CrossRef
- Google Scholar
24
Duan X. Chen Z. Liao J. Wen M. Yue Y. Liu L. et al (2025). The association analysis between exposure to volatile organic compounds and fatty liver disease in US adults. Dig. Liver Dis.57 (2), 535–541. 10.1016/j.dld.2024.09.027
- CrossRef
- Google Scholar
25
Dusanter S. Holzinger R. Klein F. Salameh T. Jamar M. (2025). Measurement guidelines for VOC analysis by PTR-MS. ACTRIS. Available online at: https://www.actris.eu/sites/default/files/inline-files/PTRMS%20SOP%20(April2025).pdf.
- Google Scholar
26
Fang C. Pan J. Qu N. Lei Y. Han J. Zhang J. et al (2022). The AMPK pathway in fatty liver disease. Front. Physiol.13, 970292. 10.3389/fphys.2022.970292
- CrossRef
- Google Scholar
27
Final Report (2012). Analytical methodology development and evaluation. Sydney NSW Australia: University of New South Wales. Available online at: https://water360.com.au/wp-content/uploads/2022/02/Analytical-Methodology-Development-Report-Final.pdf.
- Google Scholar
28
Friedman J. H. (1991). Multivariate adaptive regression splines. Ann. Stat.19 (1), 1–67. 10.1214/aos/1176347963
- CrossRef
- Google Scholar
29
Friedman J. H. (2001). Greedy function approximation: a gradient boosting machine. Ann. Stat.29 (5), 1189–1232. 10.1214/aos/1013203451
- CrossRef
- Google Scholar
30
Ghevariya V. Sandar N. Patel K. Ghevariya N. Shah R. Aron J. et al (2014). Knowing what's out there: awareness of non-alcoholic fatty liver disease. Front. Med.1, 4. 10.3389/fmed.2014.00004
- CrossRef
- Google Scholar
31
Grewal R. K. Mahmood A. (2009). Ethanol induced changes in glycosylation of mucins in rat intestine. Ann. Gastroenterol.22, 178–183. Available online at: http://www.annalsgastro.gr/index.php/annalsgastro/article/view/746.
- Google Scholar
32
Gu Z. (2022). Make complex heatmaps. R. Package Version2 (6.2). Available online at: https://github.com/jokergoo/ComplexHeatmap (Accessed on May 11, 2024).
- Google Scholar
33
Hashim I. C. Shariff A. R. M. Bejo S. K. Muharam F. M. Ahmad K. (2021). Machine-learning approach using SAR data for the classification of oil palm trees that are Non- infected and infected with the basal stem rot disease. Agronomy11, 532. 10.3390/agronomy11030532
- CrossRef
- Google Scholar
34
Hastie T. Tibshirani R. Wainwright M. (2015). Statistical learning with sparsity: the lasso and generalizations. Boca Raton, FL, USA: CRC Press.
- Google Scholar
35
Hoerl A. E. Kennard R. W. (2000). Ridge regression: biased estimation for nonorthogonal problems. Technometrics42 (1), 80–86. 10.2307/1271436
- CrossRef
- Google Scholar
36
Huh Y. Cho Y. J. Nam G. E. (2022). Recent epidemiology and risk factors of nonalcoholic fatty liver disease. J. Obes. Metab. Syndr.31 (1), 17–27. 10.7570/jomes22021
- CrossRef
- Google Scholar
37
Hussain A. Choi H. E. Kim H. J. Aich S. Saqlain M. Kim H. C. (2021). Forecast the exacerbation in patients of chronic obstructive pulmonary disease with clinical indicators using machine learning techniques. Diagnostics11 (5), 829. 10.3390/diagnostics11050829
- CrossRef
- Google Scholar
38
Ke G. Meng Q. Finley T. Wang T. Chen W. Ma W. et al (2017). “Lightgbm: a highly efficient gradient boosting decision tree,” in Advances in neural information processing systems. Editors GuyonI.LuxburgU. V.BengioS.WallachH.FergusR.VishwanathanS.et al (New York: Curran Associates), 3146–3154.
- Google Scholar
39
Keogh R. J. Riches J. C. (2022). The use of breath analysis in the management of lung cancer: is it ready for primetime?Curr. Oncol.29 (10), 7355–7378. 10.3390/curroncol29100578
- CrossRef
- Google Scholar
40
Kleiner D. E. Brunt E. M. Van Natta M. Behling C. Contos M. J. Cummings O. W. et al (2005). Design and validation of a histological scoring system for nonalcoholic fatty liver disease. Hepatology41 (6), 1313–1321. 10.1002/hep.20701
- CrossRef
- Google Scholar
41
Kuhn M. (2022). Caret: classification and regression training. R. Package Version6, 93. 10.32614/CRAN.package.caret
- CrossRef
- Google Scholar
42
Kwon S. Jeyaratnam R. Kim K. H. (2024). Targeting ketone body metabolism to treat fatty liver disease. J. Pharm. and Pharm. Sci.27, 13375. 10.3389/jpps.2024.13375
- CrossRef
- Google Scholar
43
Landolfo S. Politi H. Angelozzi D. Mannazzu I. (2008). ROS accumulation and oxidative damage to cell structures in Saccharomyces cerevisiae wine strains during fermentation of high-sugar-containing medium. Biochim. Biophys. Acta1780 (6), 892–898. 10.1016/j.bbagen.2008.03.008
- CrossRef
- Google Scholar
44
Lang A. L. Beier J. I. (2018). Interaction of volatile organic compounds and underlying liver disease: a new paradigm for risk. Biol. Chem.399 (11), 1237–1248. 10.1515/hsz-2017-0324
- CrossRef
- Google Scholar
45
Lin T. J. Lin C. L. Wang C. S. Liu S. O. Liao L. Y. (2005). Prevalence of HFE mutations and relation to serum iron status in patients with chronic hepatitis C and patients with nonalcoholic fatty liver disease in Taiwan in review fatty liver diseasein Taiwan. World J. Gastroenterol.11 (25), 3905–3908. 10.3748/wjg.v11.i25.3905
- CrossRef
- Google Scholar
46
Liu J. (2014). Ethanol and liver: recent insights into the mechanisms of ethanol-induced fatty liver. World J. gastroenterology WJG20 (40), 14672–14685. 10.3748/wjg.v20.i40.14672
- CrossRef
- Google Scholar
47
Liu W. Cao S. Shi D. Yu L. Qiu W. Chen W. et al (2023). Single-chemical and mixture effects of multiple volatile organic compounds exposure on liver injury and risk of non-alcoholic fatty liver disease in a representative general adult population. Chemosphere339, 139753. 10.1016/j.chemosphere.2023.139753
- CrossRef
- Google Scholar
48
Liu W. Ding H. Erdene K. Chen R. Mu Q. Ao C. (2018). Effects of flavonoids from Allium mongolicum regel as a dietary additive on meat quality and composition of fatty acids related to flavor in lambs. Can. J. Animal Sci.99 (1), 15–23. 10.1139/cjas-2018-0008
- CrossRef
- Google Scholar
49
Lu J. Fang B. Ren M. Huang G. Zhang S. Wang Y. et al (2015). Nonalcoholic fatty liver disease induced by 13-week oral administration of 1,3-dichloro-2- propanol in C57BL/6J mice. Environ. Toxicol. Pharmacol.39 (3), 1115–1121. 10.1016/j.etap.2015.04.007
- CrossRef
- Google Scholar
50
Machado M. V. Cortez-Pinto H. (2014). Non-alcoholic fatty liver disease: what the clinician needs to know. World J. Gastroenterol.20 (36), 12956–12980. 10.3748/wjg.v20.i36.12956
- CrossRef
- Google Scholar
51
Mahale A. R. Prabhu S. D. Nachiappan M. Fernandes M. Ullal S. (2018). Clinical relevance of reporting fatty liver on ultrasound in asymptomatic patients during routine health checkups. J. Int. Med. Res.46 (11), 4447–4454. 10.1177/0300060518793039
- CrossRef
- Google Scholar
52
Marateb H. R. Mansourian M. Faghihimani E. Amini M. Farina D. (2014). A hybrid intelligent system for diagnosing microalbuminuria in type 2 diabetes patients without having to measure urinary albumin. Comput. Biol. Med.45, 34–42. 10.1016/j.compbiomed.2013.11.006
- CrossRef
- Google Scholar
53
Markar S. R. Chin S. T. Romano A. Wiggins T. Antonowicz S. Paraskeva P. et al (2019). Breath volatile organic compound profiling of colorectal cancer using selected ion flow-tube mass spectrometry. Ann. Surg.269 (5), 903–910. 10.1097/SLA.0000000000002539
- CrossRef
- Google Scholar
54
Maurice J. Manousou P. (2018). Non-alcoholic fatty liver disease. Clin. Med.18 (3), 245–250. 10.7861/clinmedicine.18-3-245
- CrossRef
- Google Scholar
55
Meng Q. Ke G. Wang T. Chen W. Ye Q. Ma Z. M. et al (2016). A Communication- efficient parallel algorithm for decision tree, arXiv preprint arXiv:1611.01276. Adv. 10.48550/arXiv.1611.01276
- CrossRef
- Google Scholar
56
Miller D. D. Brown E. W. (2018). Artificial intelligence in medical practice: the question to the answer?Am. J. Med.131 (2), 129–133. 10.1016/j.amjmed.2017.10.035
- CrossRef
- Google Scholar
57
Moghimi A. Yang C. Marchetto P. M. (2018). Ensemble feature selection for plant phenotyping: a journey from hyperspectral to multispectral imaging. IEEE Access6, 56870–56884. 10.1109/access.2018.2872801
- CrossRef
- Google Scholar
58
Nakasone H. Binh P. N. Yamazaki R. Tanaka Y. Sakamoto K. Ashizawa M. et al (2011). Association between serum high-molecular-weight adiponectin level and the severity of chronic graft-versus-host disease in allogeneic stem cell transplantation recipients. Blood117 (12), 3469–3472. 10.1182/blood-2010-10-316109
- CrossRef
- Google Scholar
59
Nomiyama T. Uehara M. Miyauchi H. Imamiya S . Tanaka S. Seki Y. (2001). Causal relationship between a case of severe hepatic dysfunction and low exposure concentrations of N,N- dimethylformamide in the synthetics industry. Ind. Health39 (1), 33–36. 10.2486/indhealth.39.33
- CrossRef
- Google Scholar
60
Nusinovici S. Tham Y. C. Chak Yan M. Y. Wei Ting D. S. Li J. Sabanayagam C. et al (2020). Logistic regression was as good as machine learning for predicting major chronic diseases. J. Clin. Epidemiol.122, 56–69. 10.1016/j.jclinepi.2020.03.002
- CrossRef
- Google Scholar
61
Pes B. (2020). Ensemble feature selection for high-dimensional data: a stability analysis across multiple domains. Neural Comput. Appl.32, 5951–5973. 10.1007/s00521-019-04082-3
- CrossRef
- Google Scholar
62
Piccinino F. Sagnelli E. Pasquale G. Giusti G. (1986). Complications following percutaneous liver biopsy. A multicentre retrospective study on 68,276 biopsies. J. Hepatol.2 (2), 165–173. 10.1016/s0168-8278(86)80075-7
- CrossRef
- Google Scholar
63
Pooya S. Blaise S. Garcia M. M. Giudicelli J. Alberto J. M. Guéant-Rodriguez R. M. et al (2012). Methyl donor deficiency impairs fatty acid oxidation through PGC-1α hypomethylation and decreased ER-α, ERR-α, and HNF-4α in the rat liver. J. hepatology57 (2), 344–351. 10.1016/j.jhep.2012.03.028
- CrossRef
- Google Scholar
64
Powell R. D. Swet J. H. Kennedy K. L. Huynh T. T. McKillop I. H. Evans S. L. (2014). Resveratrol attenuates hypoxic injury in a primary hepatocyte model of hemorrhagic shock and resuscitation. J. Trauma Acute Care Surg.76 (2), 409–417. 10.1097/TA.0000000000000096
- CrossRef
- Google Scholar
65
pubchem (2025). Butyl 2-methylbutyrate. Available online at: https://pubchem.ncbi.nlm.nih.gov/compound/Butyl-2-methylbutyrate#section=Associated-Disorders-and-Diseases.
- Google Scholar
66
Quek J. Chan K. E. Wong Z. Y. Tan C. Tan B. Lim W. H. et al (2023). Global prevalence of non-alcoholic fatty liver disease and non-alcoholic steatohepatitis in the overweight and Obese population: a systematic review and meta-analysis. Lancet Gastroenterol. Hepatol.8 (1), 20–30. 10.1016/S2468-1253(22)00317-X
- CrossRef
- Google Scholar
67
Quinlan J. R. (2004). Data mining tools See5 and C5.0. Available online at: http://www.rulequest.com/see5-info.htML (Accessed on August 3, 2020).
- Google Scholar
68
Raman M. Ahmed I. Gillevet P. M. Probert C. S. Ratcliffe N. M. Smith S. et al (2013). Fecal microbiome and volatile organic compound metabolome in obese humans with nonalcoholic fatty liver disease. Clin. Gastroenterol. Hepatol.11 (7), 868–75.e753. 10.1016/j.cgh.2013.02.015
- CrossRef
- Google Scholar
69
Ratiu I. A. Ligor T. Bocos-Bintintan V. Mayhew C. A. Buszewski B. (2020). Volatile organic compounds in exhaled breath as fingerprints of lung cancer, asthma and COPD. J. Clin. Med.10 (1), 32. 10.3390/jcm10010032
- CrossRef
- Google Scholar
70
R Core Team (2017). R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. Available online at: http://www.R-project.org (Accessed on January 21, 2023).
- Google Scholar
71
Redlich C. A. Beckett W. S. Sparer J. Barwick K. W. Riely C. A. Miller H. et al (1988). Liver disease associated with occupational exposure to the solvent dimethylformamide. Ann. Intern Med.108 (5), 680–686. 10.7326/0003-4819-108-5-680
- CrossRef
- Google Scholar
72
RI-URBANS (2024). “Research infrastructures services reinforcing air quality monitoring capacities in European Urban and industrial AreaS”. European Union: RI-URBANS. Available online at: https://riurbans.eu/wp-content/uploads/2024/07/ST5_VOCs.pdf.
- Google Scholar
73
RStudio Team (2015). RStudio: integrated development environment for R; RStudio: boston, MA, USA. Available online at:: http://www.rstudio.com/ (Accessed on January 21, 2023).
- Google Scholar
74
Samudrala D. Lammers G. Mandon J. Blanchet L. Schreuder T. H. Hopman M. T. et al (2014). Breath acetone to monitor life style interventions in field conditions: an exploratory study. Obesity22 (4), 980–983. 10.1002/oby.20696
- CrossRef
- Google Scholar
75
Satapathy S. K. Kuwajima V. Nadelson J. Atiq O. Sanyal A. J. (2015). Drug-induced fatty liver disease: an overview of pathogenesis and management. Ann. hepatology14 (6), 789–806. 10.5604/16652681.1171749
- CrossRef
- Google Scholar
76
Scandiffio R. Bonzano S. Cottone E. Shrestha S. Bossi S. De Marchis S. et al (2023). Beta-caryophyllene modifies intracellular lipid composition in a cell model of hepatic steatosis by acting through CB2 and PPAR receptors. Int. J. Mol. Sci.24 (7), 6060. 10.3390/ijms24076060
- CrossRef
- Google Scholar
77
Shaffie A. Soliman A. Eledkawy A. Fu X. A. Nantz M. H. Giridharan G. et al (2022). Lung cancer diagnosis system based on volatile organic compounds (VOCs) profile measured in exhaled breath in review profile measured in exhaled breath. Appl. Sci.12 (14), 7165. 10.3390/app12147165
- CrossRef
- Google Scholar
78
Solga S. F. Alkhuraishe A. Cope K. Tabesh A. Clark J. M. Torbenson M. et al (2006). Breath biomarkers and non-alcoholic fatty liver disease: preliminary observations. Biomarkers11 (2), 174–183. 10.1080/13547500500421070
- CrossRef
- Google Scholar
79
Španěl P. Smith D. (2020). Quantification of volatile metabolites in exhaled breath by selected ion flow tube mass spectrometry, SIFT-MS. Clin. Mass Spectrom.16, 18–24. 10.1016/j.clinms.2020.02.001
- CrossRef
- Google Scholar
80
Sukaram T. Apiparakoon T. Tiyarattanachai T. Ariyaskul D. Kulkraisri K. Marukatat S. et al (2023). VOCs from exhaled breath for the diagnosis of hepatocellular carcinoma. Diagnostics13 (2), 257. 10.3390/diagnostics13020257
- CrossRef
- Google Scholar
81
Tharwat A. (2021). Classification assessment methods. Appl. Comput. Inf.17, 168–192. 10.1016/j.aci.2018.08.003
- CrossRef
- Google Scholar
82
The good scents company (2025). 4-methyloctanoic acid. Available online at: http://www.thegoodscentscompany.com/data/rw1036901.htML.
- Google Scholar
83
Thompson M. G. Incha M. R. Pearson A. N. Schmidt M. Sharpless W. A. Eiben C. B. et al (2020). Fatty acid and alcohol metabolism in Pseudomonas putida: functional analysis using random barcode transposon sequencing. Applied and Environmental Microbiology86 (21), e01665–20. 10.1128/AEM.01665-20
- CrossRef
- Google Scholar
84
Tomer V. Sharma S. (2022). Detecting IoT attacks using an ensemble machine learning model. Future Internet14 (4), 102. 10.3390/fi14040102
- CrossRef
- Google Scholar
85
Tsou P. H. Lin Z. L. Pan Y. C. Yang H. C. Chang C. J. Liang S. K. et al (2021). Exploring volatile organic compounds in breath for high-accuracy prediction of lung cancer. Cancers13 (6), 1431. 10.3390/cancers13061431
- CrossRef
- Google Scholar
86
Tuli S. Basumatary N. Gill S. S. Kahani M. Arya R. C. Wander G. S. et al (2019). HealthFog: an ensemble deep learning based smart healthcare system for automatic diagnosis of heart diseases in integrated IoT and fog computing environments. Future Gener. Comput. Syst.104, 187–200. 10.1016/j.future.2019.10.043
- CrossRef
- Google Scholar
87
U.S. Environmental Protection Agency (2014). Volatile organic compounds in various sample matrices using equilibrium headspace analysis. Pennsylvania Avenue, NW: U.S. Environmental Protection Agency. Available online at: https://www.epa.gov/sites/default/files/2015-12/documents/5021a.pdf.
- Google Scholar
88
Varga Z. V. Matyas C. Erdelyi K. Cinar R. Nieri D. Chicca A. et al (2018). β-Caryophyllene protects against alcoholic steatohepatitis by attenuating inflammation and metabolic dysregulation in mice. Br. J. Pharmacol.175 (2), 320–334. 10.1111/bph.13722
- CrossRef
- Google Scholar
89
Wei Y. T. Lee P. Y. Lin C. Y. Chen H. J. Lin C. C. Wu J. S. et al (2020). Non-alcoholic fatty liver disease among patients with sleep disorders: a nationwide study of Taiwan. BMC Gastroenterol.20 (1), 32. 10.1186/s12876-020-1178-7
- CrossRef
- Google Scholar
90
Wilson S. M. Oba P. M. Koziol S. A. Applegate C. C. Soto-Diaz K. Steelman A. J. et al (2022). Effects of a Saccharomyces cerevisiae fermentation product-supplemented diet on circulating immune cells and oxidative stress markers of dogs. J. Anim. Sci.100 (9), skac245. 10.1093/jas/skac245
- CrossRef
- Google Scholar
91
World Health Organizatiom-INTERNATIONAL PROGRAMME ON CHEMICAL SAFETY (1990). Environmental health criteria 103 2-Propanol. Available online at: https://www.inchem.org/documents/ehc/ehc/ehc103.htm.
- Google Scholar
92
Wu X. Tsai S. P. Tsao C. K. Chiu M. L. Tsai M. K. Lu P. J. et al (2017). Cohort profile: the Taiwan MJ cohort: half a million Chinese with repeated health surveillance data. Int. J. Epidemiol.46 (6), 1744–1744g. 10.1093/ije/dyw282
- CrossRef
- Google Scholar
93
Yamaguchi K. Yang L. McCall S. Huang J. Yu X. X. Pandey S. K. et al (2007). Inhibiting triglyceride synthesis improves hepatic steatosis but exacerbates liver damage and fibrosis in Obese mice with nonalcoholic steatohepatitis. Hepatology45 (6), 1366–1374. 10.1002/hep.21655
- CrossRef
- Google Scholar
94
Ye Y. Xiong Y. Zhou Q. Wu J. Li X. Xiao X. (2020). Comparison of machine learning methods and conventional logistic regressions for predicting gestational diabetes using routine clinical data: a retrospective cohort study. J. Diabetes Res.2020, 4168340. 10.1155/2020/4168340
- CrossRef
- Google Scholar
95
Younossi Z. M. Koenig A. B. Abdelatif D. Fazel Y. Henry L. Wymer M. (2016). Global epidemiology of nonalcoholic fatty liver disease-meta-analytic assessment of prevalence, incidence, and outcomes. Hepatology64 (1), 73–84. 10.1002/hep.28431
- CrossRef
- Google Scholar
96
Zhao Y. Zhang Y. Khas E. Bai C. Cao Q. Ao C. (2022). Transcriptome analysis reveals candidate genes of the synthesis of branched-chain fatty acids related to mutton flavor in the lamb liver using Allium mongolicum regel extract. J. Animal Sci.100 (9), skac256. 10.1093/jas/skac256
- CrossRef
- Google Scholar

Summary

Keywords

Volatile Organic Compounds, non-alcoholic fatty liver, machine learning, AI, cohort

Citation

Shen C-H, Huang R-H, Li Y-K, Chu T-W and Pei D (2025) Using machine learning methods to investigate the role of volatile organic compounds in non-alcoholic fatty liver disease. Front. Mol. Biosci. 12:1631265. doi: 10.3389/fmolb.2025.1631265

Received

19 May 2025

Accepted

18 June 2025

Published

06 August 2025

Volume

12 - 2025

Edited by

Xiaoyan Xing, Chinese Academy of Medical Sciences and Peking Union Medical College, China

Reviewed by

Ashwin Dhakal, The University of Missouri, United States

Youfu He, Guizhou Provincial People’s Hospital, China

Updates

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yaw-Kuen Li, ykl@nycu.edu.tw; Ta-Wei Chu, taweichu@gmail.com; Dee Pei, peidee@gmail.com

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

ORIGINAL RESEARCH article

Using machine learning methods to investigate the role of volatile organic compounds in non-alcoholic fatty liver disease

Abstract

Introduction

Materials and methods

Clinical assessments and biochemical analyses

Fatty liver diagnosis

Variable selection

Protocol for breath sample collection

VOCs analysis using SIFT-MS

Machine learning-based analysis technology

Performance evaluation metrics

Results

Model performance evaluation

Discussion

2-Propanol

Acetone

Butyl 2-methylbutanoate

Diethylethanolamine

Urethane

β-caryophyllene

Furfural

Tridecane

4-methyloctanoic acid

(S)-2-methyl-1-butanol

Limitations

Conclusion

Statements

Data availability statement

Ethics statement

Author contributions

Funding

Acknowledgments

Conflict of interest

Generative AI statement

Publisher’s note

References

Summary

Outline

Figures

Cite article

Share article

Article metrics