Machine learning assisted breathomic approach for early-stage thoracic cancer detection

Chen, Zhenguang; Peng, Minhua; Fan, Pengnan; Chen, Sai; Cheng, Xinxin; Xu, Bo; Chen, Ruiping; Hu, Xiao; Wei, Wei; Zhao, Tingting; Kong, Jun; Liang, Weiliang; Qiu, Xiangcheng; Chen, Sitong; Wang, Junqi

doi:10.3389/fonc.2025.1635280

ORIGINAL RESEARCH article

Front. Oncol., 17 September 2025

Sec. Thoracic Oncology

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1635280

This article is part of the Research TopicAdvancing Diagnostic Excellence in Early Lung Cancer DetectionView all 14 articles

Machine learning assisted breathomic approach for early-stage thoracic cancer detection

Zhenguang Chen^1*†

Minhua Peng^2†

Pengnan Fan²

Sai Chen³

Xinxin Cheng⁴

Bo Xu¹

Ruiping Chen¹

Xiao Hu⁵

Wei Wei⁵

Tingting Zhao⁵

Jun Kong²

Weiliang Liang²

Xiangcheng Qiu²

Sitong Chen²

Junqi Wang^2,6*

¹Department of Thoracic Surgery, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, Guangdong, China
²ChromX Health Co., Ltd., Guangzhou, Guangdong, China
³Center for Private Medical Service & Healthcare, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, Guangdong, China
⁴State Key Laboratory of Oncology in South China, Sun Yat-sen University Cancer Center, Guangzhou, Guangdong, China
⁵Department of Thoracic Surgery, Guizhou Hospital of the First Affiliated Hospital of Sun Yat-sen University, Guiyang, Guizhou, China
⁶Jingjinji National Center of Technology Innovation, Beijing, China

Objective: This study explores the feasibility of using breathomic biomarkers analyzed by machine learning as a non-invasive diagnostic tool to differentiate between benign and malignant thoracic lesions, aiming to enhance early detection of thoracic cancers and inform clinical decision-making.

Methods: This study enrolled 132 participants with confirmed diagnosis of lung cancer, esophageal cancer, thymoma, and benign diseases. Exhaled breath samples were analyzed by thermal desorption-gas chromatography-mass spectrometry. A logistic regression algorithm was employed to construct a classification model for benign and malignant thoracic lesions. This model was trained on a subset of 80 cases and subsequently validated in a separate set comprising 52 samples.

Results: A logistic regression model based on thirteen exhaled volatile organic compounds (VOCs) was developed to differentiate benign and malignant thoracic lesions. The 13-VOC model achieved an AUC of 0.85 (0.72, 0.96), accuracy of 0.79 (0.66, 0.88), sensitivity of 0.82 (0.67, 0.91), and a specificity of 0.71 (0.45, 0.88). It correctly classified 80% of lung cancer, 80% of thymoma, and 100% of esophageal cancer cases, distinguishing 71.4% of benign lesions. For lung cancer, the model achieved an AUC of 0.79 (0.57, 0.98), sensitivity of 0.80 (0.63, 0.91), and specificity of 0.63 (0.31, 0.86), with 81.8% accuracy in detecting early-stage (Stage 0 + I + II) disease. The model outperformed a 4-serum tumor marker panel in sensitivity (0.90 vs. 0.39, p < 0.001). Additionally, in a cohort of 58 cancer patients, model-predicted risk significantly decreased post-surgery (p < 0.01), indicating a strong correlation with disease burden reduction.

Conclusion: This study demonstrates the feasibility of utilizing breathomics biomarkers for developing a non-invasive machine learning model for the early diagnosis of thoracic malignancies. These findings provide a foundation for breath analysis as a promising tool for early cancer detection, potentially facilitating improved clinical decision-making and enhancing patient outcomes.

GRAPHICAL ABSTRACT

Graphical Abstract. This ML-based breathomics study employs TD-GC-MS to analyze 13 exhaled VOCs, achieving superior sensitivity to 4 serum markers in distinguishing lung/thymoma/esophageal cancers from benign conditions. Post-surgical risk reduction confirmed VOC biomarkers' correlation with disease burden.

Introduction

Thoracic malignancies, particularly lung and esophageal cancers, represent a significant global health burden. Lung cancer is the leading cause of cancer-related deaths worldwide, with nearly 2.5 million new cases and over 1.8 million fatalities in 2022 (1). Despite treatment advancements, the five-year survival rate remains below 20%, primarily due to diagnoses at advanced stages (2). Similarly, the prognosis for esophageal cancer is also bleak, with a five-year survival rate below 20% (3), mirroring the situation for lung cancer, as evidenced by 511,000 new cases and 445,000 deaths attributed to the disease worldwide in 2022 (1). Thymomas, though rare, can lead to serious complications like myasthenia gravis. While their global incidence is between 0.13 and 0.26 per 100,000 individuals (4), their impact on patients’ quality of life is significant. These factors underscore the urgent need for improved diagnostic modalities for thoracic malignancies. Current diagnostic approaches, including imaging and invasive procedures like biopsy, the gold standard for thoracic tumor diagnosis, face limitations such as the imprecision to reliably differentiate benign from malignant lesions and the risk of complications associated with invasive procedures (5, 6). Particularly, the limited sensitivity of conventional blood-based tumor marker assays is underscored by the fact that 60-70% of cases are diagnosed at late stages, primarily due to these methods’ inability to detect early biological changes and distinguish between overlapping clinical features, thereby delaying timely intervention (4, 7, 8). Therefore, there is an urgent need to develop more precise, non-invasive, and highly sensitive tools to improve the early detection and diagnostic accuracy of thoracic tumors.

Exhaled breath volatile organic compounds (VOCs) are carbon-based molecules, primarily derived from endogenous metabolic processes and systemic circulation. Over 3,000 VOCs have been identified, reflecting the complex metabolic activity within the human body (9–12). Disease processes (e.g., oxidative stress, inflammation) or pathogens (e.g., bacteria and viruses), can perturb normal metabolic pathways, including lipid peroxidation, amino acid metabolism, and carbohydrate metabolism, leading to unique alterations in the VOC profile, creating disease-specific signatures (13). These VOCs, diffusing from blood into breath, serve as dynamic biomarkers, enabling the detection of subtle changes associated with disease onset and progression. Breath analysis of these VOC profiles thus offers a non-invasive, real-time method for early disease detection (14). Previous studies have demonstrated the potential of breath VOCs as biomarkers in identification of a variety of cancers, including lung cancer (15–18), breast cancer (19), and gastrointestinal malignancies (20–22). Gordon et al. were pioneers in using gas chromatography-mass spectrometry (GC-MS) to identify alkenes in the breath of lung cancer patients (23). Kumar et al. reported that a panel of 12 VOCs detected using a profile-3 selected ion flow tube mass spectrometry instrument could distinguish esophageal cancer from normal controls, achieving an AUC of 0.97 in the initial analysis and 0.92 ± 0.01 in the validation set (24). However, research efforts have predominantly focused on distinguishing cancer patients from healthy controls, with limited emphasis on differentiating between benign diseases and cancer patients. Furthermore, to the best of our knowledge, no studies have investigated the use of VOCs as biomarkers for identifying thymomas so far. This distinction is particularly important in thoracic tumors, where benign diseases such as granulomas or hamartomas may mimic malignancies on medical imaging, leading to diagnostic uncertainty and potentially unnecessary invasive procedures.

This study introduces a novel machine learning model that employs a comprehensive panel of breath-derived VOC biomarkers analyzed using GC-MS to achieve simultaneous early detection of lung cancer, esophageal cancer, and thymoma—the first breathomics-based strategy for multi-thoracic cancer diagnosis. By evaluating pre/postoperative predictions, we will assess its potential for real-time postoperative monitoring. Notably, we will also compare the sensitivity of this breath-based approach with conventional blood-based tumor markers, with the goal of providing a non-invasive solution for early detection and postoperative monitoring of thoracic cancers.

Methods

Study design and participants

This cross-sectional study, conducted from November 2021 to January 2022 at the East Division of the First Affiliated Hospital of Sun Yat-sen University in Guangzhou, China, received approval from the Ethics Committee of the First Affiliated Hospital of Sun Yat-sen University (No. 2022-016). All subjects provided signed informed consent. Inclusion criteria were adult participants aged 18–80 years with clinical suspicion of malignant thoracic tumors, supported by imaging evidence (CT/PET-CT) and a multidisciplinary team (MDT) assessment prior to histological confirmation. Eligible participants encompassed treatment-naïve, newly diagnosed thoracic cancer patients scheduled for surgical resection for diagnostic evaluation, and patients with a history of treated or recurrent thoracic malignancy, provided comprehensive treatment records were available. Exclusion criteria encompassed individuals who were unwilling or unable to provide in-person informed consent, those with unqualified breath samples, patients with relapsed diseases and incomplete treatment histories, individuals suffering from other malignant tumors, those with severe bronchial asthma or confirmed tuberculosis, and those with severe liver damage or kidney diseases. Each participant had undergone resection surgery and was pathologically confirmed to be categorized into one of the following groups: lung cancer, thymoma, esophageal cancer, and benign disease controls. Demographic and clinical information were meticulously recorded and collected. This study was registered in the Chinese Clinical Trial Registry (Registration No.: ChiCTR2200061264).

Exhale breath collection

All samples were collected following the same standardized procedure. Prior to collection, subjects were asked to rinse their mouths with purified water and rest for 15 minutes to stabilize their respiratory patterns. All subjects were required to abstain from food and beverages except water and smoking for at least 12 hours before the collection. To minimize the influence of diurnal metabolic variations, all collections were scheduled between 7:00 AM and 9:00 AM.

Subjects were instructed to remain seated and breathe normally through a mask for 3 minutes. During exhalation, breath samples were concurrently drawn through a breath sampler (CXBC-Alpha, ChromX Health Co., Ltd) containing an internal sampling pump and a flow control module (Figure 1). 900 mL of breath samples were collected at a rate of 300 mL/min and directed into thermal desorption tubes. These tubes, pre-conditioned with 99.9% nitrogen gas to ensure a clean and inert environment, contained Carbopack X and Carbopack B for sample enrichment, concentrating the target compounds for later analysis. All collected samples were sealed with inert end caps immediately and stored at -20°C to maintain their integrity and analyzed by thermal desorption-gas chromatography-mass spectrometry (TD-GC-MS) within 7 days to ensure timely and accurate results.

Figure 1

Medical device illustration. Panel A shows the exterior with a touch screen and handle. Panel B displays the interior, labeled with a start button, TD tube, main control board, and Luer taper along with a disposable face mask connected by a tube.

Figure 1. Exhaled breath collection device. The images depicted the schematic diagram (A) and sectional view (B) of the exhaled breath collection device.

TD-GC-MS analysis

Breath samples were analyzed by TD-GC-MS using a system incorporating a high-throughput autosampler, a thermal desorber (TD100-xr, MARKES), and an 7890B-5977A GC/MSD (Agilent Technologies). Separation was performed on an HP-5MS capillary column with nitrogen carrier gas. The mass spectrometer operated in electron ionization (EI) mode at 70 eV, acquiring data in full scan mode (m/z 33-450). Detailed instrument parameters are provided in Supplementary Materials.

GC-MS quantification and pre-analysis quality control

Raw GC-MS data were processed using MSDial v5.4 for peak detection, quantification, and alignment. The software generated matrices of peak area (VOC area matrix) and signal-to-noise ratio (SNR matrix). Prior to statistical analysis, a data preprocessing and filtering protocol was implemented in Python 3.9.18 to ensure data robustness. Firstly, the signal-to-noise ratio (SNR) matrix was used to assess response reliability. A VOC measurement was classified as valid if its SNR value exceeded 10; measurements below this threshold were excluded due to significant noise interference. For individual samples, the response rate was calculated as the percentage of valid VOC measurements relative to the total measurements in the sample. Samples with a response rate ≥80% were retained for further analysis. Similarly, compound-specific response rates were determined for each VOC by calculating the proportion of valid measurements across all samples. To ensure analytical robustness, only VOCs with a response rate ≥50% were included in the validated dataset, which was designated as the “valid VOC area matrix”. Secondly, the valid VOC area matrix was then log10-transformed to address heteroscedasticity, followed by normalization to account for variations in instrument response and sample loading. These steps enabled meaningful comparison of VOC abundances across samples.

Dataset partition

A dataset of 132 participants with malignant or benign thoracic lesions was used in this study, comprising 97 malignant and 35 benign samples. For biomarker discovery and model development, the dataset was randomly split into a discovery set (60%, n = 79; 59 malignant, 20 benign) and a testing set (40%, n = 53; 38 malignant, 15 benign). The discovery set (training set) was used for feature selection and model training, while the testing set served for independent model evaluation.

Biomarker screening

To identify VOCs differentially expressed between malignant and benign thoracic lesions, two complementary approaches were employed. First, the Wilcoxon rank-sum test was used to assess the distribution of individual VOCs across the two groups, generating corresponding p-values. Second, orthogonal partial least squares-discriminant analysis (OPLS-DA) was performed to evaluate the collective contribution of VOCs to group classification and to calculate variable importance in projection (VIP) scores (25). VOCs meeting both criteria of a p - value < 0.05 and a VIP score > 1 were selected as candidate biomarkers.

Putative biomarker identification was subsequently conducted using Agilent MassHunter Qualitative Analysis 10.0 software and the NIST 17 mass spectral library. Finally, metabolic pathway-associated VOCs reported in the literature were selected as candidate biomarkers for inclusion in diagnostic model development.

Machine learning algorithms selection and evaluation

Given the complexity inherent in omics data, it is essential to identify the most suitable model for the dataset at hand. To this end, five commonly used machine learning algorithms were systematically evaluated: logistic regression (LR) (26), random forest (RF) (27), k-nearest neighbors (KNN) (28), eXtreme Gradient Boosting (XGBoost) (29), and support vector machine (SVM) (30). Among these, logistic regression algorithm demonstrated the highest robustness and effectiveness, based on its superior performance across both the discovery and testing datasets. Consequently, logistic regression model was deployed for diagnostic prediction.

Feature selection

To minimize overfitting, a progressive feature selection approach was employed. Biomarkers were ranked by their area under the receiver operating characteristic curve (ROC-AUC) scores. A logistic regression model was trained using 5-fold cross-validation with stratified sampling, iteratively adding one feature at a time, starting with the highest-ranked biomarker. This process continued until no further significant improvement in model performance was observed.

Hyperparameter optimization

With the optimal feature subset identified, logistic regression hyperparameters were tuned using grid search with stratified sampling. The following hyperparameters were considered: regularization method, regularization strength, early stopping criteria, and class weights. The parameter combination that yielded the highest AUC score was selected for final model training.

Final model evaluation

The final logistic regression model, incorporating the optimized feature subset and hyperparameters, was trained on the training dataset. The model was then finalized, and a classification threshold was determined using the Youden index. Subsequently, the model’s performance was evaluated independently on the validation dataset. Performance was assessed using five metrics: F1-score, accuracy, sensitivity, specificity, and AUC, along with their respective confidence intervals. Further analyses were performed using this finalized model.

Statistical analysis

Statistical analyses were performed using Python (version 3.9.18). Continuous variables are presented as mean ± standard deviation or median [min, max], as appropriate. Categorical variables are presented as counts and percentages. The Wilcoxon rank-sum test was used to compare continuous variables between independent groups (e.g., malignant vs. benign). The Chi-square test was used to compare categorical variables. ROC analysis was performed using scikit-learn python (v1.5.1). 95% confidence intervals (95% CI) for AUC, F1-score, sensitivity, specificity, and accuracy were calculated using a binomial distribution. All statistical tests were two-sided, with a significant level of α = 0.05, unless otherwise stated.

Results

Study population

145 participants were enrolled in this study. Exclusion criteria were applied to exclude individuals outside the age range of 18 to 80 years, those who declined participation, and those who provided invalid breath samples, resulting in a final cohort of 132 eligible participants for analysis. Among these, 77 were diagnosed with lung cancer, 13 with thymoma, 7 with esophageal cancer, and 35 had benign diseases, as confirmed by pathological results (Figure 2). The demographic and clinical data of these participants are presented in Table 1. Statistical comparisons between the case and control groups were conducted on basic demographic characteristics, including age, gender, body mass index (BMI), smoking and alcohol consumption status, and family cancer history. As detailed in Table 1, no significant difference was observed in these factors.

Figure 2

Flowchart showing a study process. Participants from the East Division of the First Affiliated Hospital of Sun Yat-sen University (n=145) provided breath samples. GCMS data was collected from 138 participants. After surgery, six were excluded due to GC-MS quality control failure. Final participants were 132, divided into a model training set (LC=47, Thy=8, EC=4, Benign=21) and a model validation set (LC=30, Thy=5, EC=3, Benign=14).

Figure 2. Schematic representations of the research framework.

Table 1

Table 1. Patient demographic and clinical characteristics.

VOC identification and feature selection

Initial statistical screening using the Wilcoxon rank-sum test and OPLS-DA revealed twenty-seven VOCs that exhibited differential abundance (p < 0.05) and high VIP scores (VIP > 1) when comparing exhaled breath samples from malignant and benign groups. These candidate VOCs then underwent compound identification and further refinement to exclude those associated with drug metabolism, environmental contaminants, or unrelated to the disease pathology. This rigorous filtering process ultimately yielded a final set of 18 potentially disease-relevant VOCs (Supplementary Table S1).

Diagnostic model selection and feature optimization

To identify the optimal diagnostic model for differentiating benign from malignant thoracic lesions, five machine learning algorithms including logistic regression, SVM, random forest, KNN, and XGBoost were trained using the pre-selected panel of 18 VOCs. Comparison of the models revealed that logistic regression demonstrated robust performance in both the training and validation sets, achieving AUCs of 0.85 (95% CI: 0.82, 0.89) and 0.83 (0.80, 0.89), respectively (Figure 3A; Supplementary Table S2). The DeLong test indicated that logistic regression significantly outperformed the KNN, XGBoost, and SVM models in both datasets (p < 0.05). Furthermore, when compared to the random forest model, logistic regression demonstrated superior performance in the validation set (p < 0.01). Consequently, the logistic regression model was selected for further analysis and performance evaluation.

Figure 3

Panel A shows a ROC curve for different machine learning models, including LR, RF, KNN, XGBoost, and SVM, with varying AUC values. Panel B depicts a line graph of ROC-AUC versus the number of top VOCs, showing an upward trend. Panel C presents a series of box plots comparing log-transformed and Z-score peak area for benign and malignant cases across various VOCs. Panel D displays a chromatogram of scaled TIC over retention time for benign and malignant samples, highlighting specific chemical peaks.

Figure 3. VOC identification and feature selection. (A) ROC curves display the classification performance of the VOC-models using five machine learning algorithms including LR, RF, KNN, XGBoost, and SVM. (B) the graph shows AUC values (y-axis) against the number of features (x-axis) in the VOC-model training set. (C) box plots comparing the scaled peak area of 13 VOCs in benign and malignant patients. y-axis: scaled VOC peak area by log-transformation and z-score normalization. Significance levels are denoted as follows: * p < 0.05, ** p < 0.01 (Rank-sum test). (D) representative chromatograph of 13 selected VOCs in malignant vs. benign patients. Scaled Total Ion Chromatogram TIC (Real TIC/1M).

Final feature selection was conducted using the logistic regression algorithm to optimize model performance. Analysis of the AUC as a function of the number of top features revealed diminishing returns beyond 13 features. As incorporating additional features did not substantially improve the AUC, the top 13 features were selected for model development (Figure 3B). These identified compounds represent a diverse range of hydrocarbons, including methyl-cyclohexane, camphene, and d-limonene, as well as oxygenated species such as butanal, 1-butanol, propanoic acid, and p-cresol. Table 2 provides a comprehensive list of these compounds and their corresponding discriminant values. Analysis of the scaled VOC peak area (Figure 3C) demonstrated that all 13 VOCs were present at elevated levels in the malignant group (p < 0.05). A representative chromatogram of the 13 VOCs in malignant and benign samples was shown in Figure 3D.

Table 2

Table 2. 13 VOCs identified for model development.

Model performance in distinguishing benign from malignant thoracic lesions

In the training set (n = 80), the 13-VOC model demonstrated excellent performance with an AUC of 0.86 (0.83, 0.90), an accuracy of 0.83 (0.73, 0.89), a sensitivity of 0.86 (0.76, 0.93), and a specificity of 0.71 (0.50, 0.86). In the validation set (n = 52), the 13-VOC model achieved an AUC of 0.85 (0.81, 0.90), an accuracy of 0.79 (0.66, 0.88), a sensitivity of 0.82 (0.67, 0.91), and a specificity of 0.71 (0.45, 0.88), confirming its generalizability and clinical applicability (Figure 4A; Supplementary Table S3).

Figure 4

Graphs A, B, C, D, and G show receiver operating characteristic (ROC) curves with sensitivity versus 1-specificity for different cancer comparisons. Graphs E and F are scatter plots of volatile organic compound (VOC) model prediction scores versus principal component scores, highlighting cancer stages. Graphs H and I depict VOC model prediction scores against sample ID, with cancer stages indicated. Each graph includes additional details on areas under the curve and confidence intervals.

Figure 4. Model performance in distinguishing benign from malignant thoracic lesions. (A–D, G), ROC curves for the 13-VOCs model in distinguishing malignant (A) vs. benign, lung cancer (B) vs. benign all, thymoma (C) vs. benign all, esophageal cancer (D) vs. benign all, and lung cancer (G) vs. benign nodules. (E, F), Scatter plot depicting the relationship between VOC model prediction scores and the first principal component (PC1) in the training (E) and validation (F) sets, respectively. Each point represents an individual sample. The vertical red dashed line indicates the VOC model cutoff score of 0.64 used to discriminate between malignant and benign groups. (H, I), VOC model prediction scores for individual samples (SampleID) across lung cancer and benign nodule groups in the training (H) and validation (I) sets, respectively. Samples are color-coded by category. The red dashed horizontal line represents the VOC model cutoff score of 0.64 used to distinguish between groups. Thymoma (n=13) and esophageal cancer (n=7) analyses are exploratory (limited sample size) and serve as hypothesis-generating observations.

To further evaluate the performance of the detection model for individual cancer types, a subgroup analysis was conducted across various malignant thoracic lesions. Thymoma (n=13) and esophageal cancer (n=7) analyses are exploratory due to limited sample size and serve as hypothesis-generating observations. In the training set, the AUCs for lung cancer, thymoma, and esophageal cancer were 0.88 (0.85, 0.90), 0.81 (0.75, 0.88), and 0.80 (0.70, 0.96), respectively. In the validation set, corresponding AUCs were 0.84 (0.80, 0.90) for lung cancer, 0.86 (0.79, 1.00) for thymoma, and 0.91 (0.83, 0.95) for esophageal cancer (Figures 4B–D). To further visualize the model’s performance, prediction values for each participant were plotted against their actual disease status (lung cancer/thymoma/esophageal cancer vs. benign). Using a classification threshold of 0.64, the model achieved a high accuracy in the training set, correctly identifying 87.2% (75-94%) of lung cancer, 87.5% (53-98%) of thymoma, and 75% (30-95%) of esophageal cancer cases (Figure 4E). In the validation set, the model maintained high accuracy, correctly classifying 80% (63-91%) of lung cancer, 80% (38-96%) of thymoma, and 100% (44-100%) of esophageal cancer cases (Figure 4F). Additionally, the model demonstrated good specificity for benign lesions, correctly identifying 71.4% (50-86%, 45-88%) of such cases in both the training and validation sets (Figures 4E–F). These findings emphasize the model’s robust performance and generalized applicability in detecting various malignant thoracic lesions. Importantly, its ability to differentiate benign lesions underscores its potential to minimize unnecessary interventions and overtreatment, supporting its use in clinical practice.

Model performance in differentiating pulmonary lesions and across different lung cancer AJCC stages

Building upon previous findings, we further investigated the model’s ability to differentiate malignant and benign pulmonary lesions. In the training set (n = 59), the 13-VOC model achieved an AUC of 0.82 (0.68, 0.95), sensitivity of 0.89 (0.77, 0.95), and specificity of 0.58 (0.32, 0.81). In the validation set (n = 38), the model exhibited an AUC of 0.79 (0.57, 0.98), sensitivity of 0.80 (0.63, 0.91), and specificity of 0.63 (0.31, 0.86) (Figure 4G; Supplementary Table S3).

Early detection of lung cancer is critical in clinical practice, allowing for timely interventions and curative resections that substantially increase patient survival rates. To assess our model’s efficacy in diagnosing early lung cancer, we used the model to differentiate between various lung cancer stages and benign nodules. The predictive performance of the model was graphically demonstrated by plotting individual participant predictions against their corresponding ground truth classifications (lung cancer stages or benign nodule). With a predetermined classification cut-off at 0.64, the 13-VOC model demonstrated strong performance in identifying early-stage lung cancer, achieving high accuracy for stage 0 + I + II lung cancer (85.7% [70.6-93.7%]) and stage III + IV lung cancer (88.9% [56.5-98%]) in the training set, though the accuracy for benign nodules was comparatively lower at 58.3% (32-80.7%) (Figure 4H). In the validation set, the model maintained robust performance for stage 0 + I + II lung cancer (81.8% [61.5-92.7%]) and improved accuracy for benign nodules (66.7% [35.4-87.9%]), though there was a slight decrease in accuracy for stage III+IV lung cancer (71.4% [35.9-91.8%]) (Figure 4I). These findings highlight the model’s potential for early detection and timely treatment of lung cancer, as well as its capacity to reduce unnecessary interventions and overtreatment—an essential consideration in clinical decision-making. However, further optimization is necessary to enhance its ability to accurately differentiate benign nodules and address variability in diagnosing advanced-stage lung cancer.

Comparison of the diagnostic performance of the VOC model with traditional methods using serum tumor biomarkers

To determine whether the model represents an advancement in tumor diagnosis, we compared its predictive accuracy against that of four established clinical tumor biomarkers: CA125, ProGRP, CEA, and CFRA21-1. Among the 36 lung cancers patients, the discriminative sensitivities of CA125, ProGRP, CEA, and CFRA21–1 were 0.061, 0.121, 0.152 and 0.242 respectively, while the 13-VOCs model showed a paired discriminative sensitivity of 0.895 (p < 0.001) (Figures 5A–D, F). Given the common clinical practice of combining these four biomarkers to enhance specificity, we hypothesized that classifying individuals as positive for lung cancer if any of the serum tumor biomarkers fell outside the normal range (i.e., CA125: 0–35 KU/L, ProGRP: 0–46 ng/L, CEA: 0-5μg/L, CFRA21-1: 0–3 ng/L) could yield improved sensitivity. Of particular note, our 13-VOCs model significantly outperformed the 4-serum tumor marker panel, achieving a sensitivity of 0.895 compared to 0.394 (p < 0.001) (Figures 5E, F). Importantly, this superior performance was not attributable to an elevated false positive rate (Figure 4). These findings suggested that the 13-VOCs model represents a more robust diagnostic tool, potentially offering particular advantages for early detection, an area where traditional serum biomarkers have limited utility.

Figure 5

Graphs A to E compare VOC model prediction scores against clinical cutoffs for CA125, ProGRP, CEA, CFRA21-1, and a 4-serum tumor marker panel. Dashed lines represent cutoffs. Graph F displays sensitivity comparisons for CA125, ProGRP, CEA, CFRA21-1, the 4-serum marker panel, and the VOC model, showing the highest sensitivity for the VOC model at 0.895.

Figure 5. Comparison of the diagnostic performance of the 13-VOCs model with clinical serum tumor biomarkers among lung cancer patients. (A–E), diagnostic prediction of the lung cancer patients using the 13-VOCs model and clinical markers CA125, ProGRP, CEA, CFRA21-1, and the 4-serum tumor marker panel respectively. The blue vertical dashed line represents the clinical serum tumor biomarkers at various cutoffs, while the red horizontal dashed line indicates the VOC model cutoff score at 0.64. Each point corresponds to an individual lung cancer sample (n=36). (F), comparison of the four individual serum tumor marker, 4-serum tumor marker panel and 13-VOCs models’ detection sensitivity in predicting lung cancer patients. Sensitivity differences between the 13-VOC model, each individual serum tumor marker, and the 4-serum tumor marker panel were evaluated using McNemar’s test. Significance is denoted as follows: ***, p < 0.001.

Model performance for postoperative monitoring and follow-up

To assess whether the model accurately reflects dynamic changes in disease status and further validate that the features it captures are closely associated with disease activity or burden, we analyzed and compared the model’s score changes between preoperative assessments and postoperative timepoints, 7 days to 1 month after surgery. Among the cancer patients (n = 54), postoperative predicted probabilities were significantly lower than preoperative probabilities (p < 0.01), indicating a measurable decrease in predicted disease burden following surgical intervention (Figure 6A). Subgroup analysis confirmed this trend in both lung cancer (p < 0.05) (Figure 6C) and thymoma (p < 0.05) (Figure 6D), with postoperative scores remaining consistently lower across these malignancies. The postoperative reduction in esophageal cancer was not statistically significant (p > 0.05) (Figure 6E), possibly due to the limited sample size. Notably, there was no significant difference in predicted probabilities between the postoperative and preoperative groups in cases of benign disease (p > 0.05) (Figure 6B). Collectively, these findings demonstrate that the model effectively reflects the reduction in disease burden following lung cancer surgery, highlighting its potential utility for assessing the completeness of resection and detecting early signs of postoperative recurrence.

Figure 6

Box plots labeled A to E show VOC model prediction scores for different conditions, comparing preoperative and postoperative scores. A: “All malignant” shows a significant decrease (p<0.01). B: “All benign” shows no significant change. C: “Lung cancer” shows significant decrease (p<0.05). D: “Thymoma” shows significant decrease (p<0.05). E: “Esophageal cancer” shows no significant change. Lines indicate individual changes, color-coded for magnitude of change.

Figure 6. Model performance for postoperative monitoring and follow-up. (A–E), Comparison of VOC model prediction scores before (PreOp) and after (PostOp) surgery in the overall malignant group (A), benign group (B), lung cancer group (C), thymoma group (D), and esophageal cancer group (E), respectively. Each line represents an individual patient’s change in score, color-coded by the magnitude of change. Boxplots show the median and interquartile range. Comparisons in VOC prediction by scores postoperatively were performed using a paired t-test. Significance is denoted as follows: *p < 0.05, **p < 0.01, ns = not significant (p ≥ 0.05).

Discussion

This study aimed to develop and validate a novel machine learning model for the early diagnosis of thoracic malignancies using exhaled VOCs as biomarkers. To our knowledge, our findings demonstrate for the first time the feasibility of employing a single panel of VOC profiles to differentiate between benign and malignant thoracic lesions, particularly lung, esophageal, and thymic tumors.

To determine the optimal classifier for metabolomics data analysis, we evaluated five machine learning algorithms: logistic regression, random forest, k-nearest neighbor, XGBoost, and support vector machine, on training and validation datasets. Logistic regression demonstrated robust performance on both sets, making it the ideal choice for the baseline model. While algorithms like Random Forest and XGBoost showed some promise, they were not pursued due to their increased complexity and computational demands, without significant improvement in testing set performance. Given its simplicity, interpretability, efficiency, and strong generalization, logistic regression emerged as the optimal model for metabolomics data analysis. The 13-VOC model constructed by logistic regression algorithm achieved high accuracy in classifying thoracic tumors, with an AUC of 0.85, sensitivity of 82%, and specificity of 71%, representing a clinically significant advancement over existing clinical markers, which only achieved a sensitivity of 39.4%. Notably, the model exhibited robust performance in distinguishing early-stage lung cancer, suggesting its potential as a non-invasive screening tool.

Thirteen VOCs were identified as potential biomarkers for distinguishing malignant from benign thoracic lesions, many of which have established or emerging links to various cancer metabolism and pathogenesis. Several VOCs, including 1-butanol (31), propanoic acid (32), methyl-Cyclohexane (33, 34), sec-Butyl acetate (33), camphene (35), D-Limonene (35, 36), 6-methyl-5-Hepten-2-one (37), and p-cresol (38), have been previously reported as elevated in lung cancer and other malignancies.

Emerging evidence suggests that these VOCs may reflect key metabolic alterations characteristic of cancer pathogenesis. Trans-2-Decenal (39, 40), an alkenal mutagen found in cooking oil fumes, has been shown to promote oxidative DNA damage through reactive oxygen species formation, a well-established mechanism implicated in lung carcinogenesis, suggesting increased risk for individuals with frequent exposure (39). Methyl-cyclohexane, which has also been implicated in distinguishing colorectal cancer from healthy controls, may indicate broader metabolic reprogramming in malignancy (41). 6-methyl-5-Hepten-2-one, potentially linked to increased fatty acid oxidation (42), a hallmark of cancer cell metabolism, was reported to be elevated in various gastrointestinal cancers, including colorectal (42) and gastric cancer (43–45).

Butanal, elevated in esophagogastric cancer (46), may accumulate due to genetic dysregulation of its metabolic pathways or as a byproduct of lipid peroxidation—a process often amplified by chronic inflammation in the tumor microenvironment. This aligns with the recognized role of oxidative stress in cancer progression (47, 48). Furthermore, alterations in the gut microbiome, frequently observed in esophageal cancer, can modulate butanal production and metabolism (49), highlighting the interplay between host metabolism and microbial communities in cancer.

p-Cresol, with its complex metabolism influenced by gut and oral microbiota, hepatic processes, and disease state, has been identified as a potential breath biomarker in various cancers, including esophageal, gastric, thyroid, breast, oral, and lung cancers, and even in some non-malignant conditions (50, 51). This broad association suggests p-cresol and other VOCs may serve as general indicators of metabolic dysregulation or malignancy.

In contrast, isobutyl acetate, trans-1,2-Cyclopentanediol, and cis-2-Hexen-1-ol currently lack well-established links to cancer pathogenesis. Isobutyl acetate has been primarily reported as a marker for microbial (specifically Candida albicans) activity, particularly in respiratory infections (52). It may also indirectly contribute to metabolic disorders like obesity and diabetes through oxidative stress and neuroinflammation (53), and potentially to cardiovascular disease via ROS-mediated metabolic dysregulation (54). Further studies should include metabolomic and pathway enrichment analyses, such as KEGG, to elucidate their metabolic origins and explore potential sources like lipid peroxidation or microbial dysbiosis.

The observation that several VOCs are associated with multiple cancer types suggests they may serve as general indicators of malignancy or reflect shared metabolic pathways. Combining these VOCs into a diagnostic model is justified by their diverse origins and links to various cancer-related pathways, including genetic dysregulation, oxidative stress, lipid peroxidation, and microbiome alterations, enabling the capture of a more comprehensive metabolic fingerprint of thoracic malignancies, potentially improving diagnostic accuracy. Nevertheless, further mechanistic studies are needed to elucidate how these VOCs specifically relate to cancer pathogenesis and to validate their clinical utility as biomarkers.

Furthermore, the model’s ability to track changes in VOC profiles over time, as demonstrated by the significant decrease in predicted risk following surgery, highlights its potential for monitoring disease progression and treatment response. Wang et al. demonstrated the feasibility of using perioperative dynamic breathomics to identify a panel of VOCs as potential biomarkers for lung cancer (55). By comparing VOC profiles before and after surgery, they identified 16 VOCs that were significantly altered in lung cancer patients, and a machine learning model based on these VOCs achieved high accuracy of 86.9% in differentiating between lung cancer patients and healthy controls. Nardi-Agmon et al. explored the potential of breath analysis for monitoring the response to anticancer treatment in patients with advanced lung cancer (56). By utilizing a panel of three VOCs identified as significant indicators of treatment outcomes, this approach may provide a rapid and non-invasive method for assessing treatment response, potentially enabling earlier detection of treatment failure compared to conventional imaging techniques. These findings highlight the growing evidences supporting breath analysis as a valuable tool for lung cancer management, with the ability to detect dynamic changes in VOC profiles pre- and post-treatment suggesting its potential as a complementary approach to existing diagnostic and monitoring strategies.

However, several limitations of this study should be acknowledged. Firstly, although our cohort was prospectively enrolled, the sample size (n=132) and subtype distribution (lung cancer 79.4%, thymoma 13.4%, esophageal cancer 7.2%) reflect the underlying epidemiology of thoracic malignancies (1, 3, 4). This distribution enabled robust differentiation between malignant and benign lesions, but the small numbers for rarer subtypes limit the strength of conclusions for thymoma and esophageal cancer. These analyses are exploratory and serve as preliminary, hypothesis-generating observations. External validation in larger, multi-center cohorts, particularly through collaboration with international consortia for rare thoracic tumors, is essential to confirm these findings and support broader clinical application. Secondly, the reliance on GC-MS for VOC analysis presents challenges for clinical implementation. GC-MS is a complex, time-consuming, and expensive technique that requires specialized equipment and expertise, making it less feasible for routine clinical use (57). To address this, future research should focus on validating these findings using point-of-care testing (POCT) devices, such as micro-GC systems (58–65), electronic noses (66), or wearable VOC sensors (67), which offer real-time, bedside breath analysis for rapid clinical decision-making. However, challenges remain, including achieving sufficient sensor sensitivity and selectivity, minimizing sensor drift and environmental interference, and standardizing protocols. Progress in materials science and AI-driven data analysis, along with interdisciplinary collaboration, will be crucial to address these issues. Pilot studies in clinical settings are also needed to assess practicality, cost-effectiveness, and user acceptance, ultimately supporting the adoption of breath analysis in routine healthcare. Large-scale validation studies using such technologies could pave the way for the widespread adoption of breath analysis in clinical practice (58–63). Thirdly, this study focused only on lung cancer, thymoma, and esophageal cancer, limiting its applicability to other thoracic malignancies. Future research should include mesothelioma, mediastinal tumors, and other rare thoracic cancers to develop a more comprehensive model, which could improve breath analysis for diagnosis and monitoring across the entire spectrum of thoracic oncology. Fourthly, while the study demonstrates the potential of VOC analysis, further research is needed to elucidate the underlying biological mechanisms and to address the technical challenges associated with breath sample collection and analysis. Finally, although LDCT is widely used for lung cancer screening, it carries high costs and a notable false-positive rate, which can lead to unnecessary follow-up tests and increased patient anxiety (68). In contrast, breath-based VOC analysis offers a non-invasive, radiation-free, and potentially more cost-effective screening approach. However, our current methodology relies on GC-MS, which is not yet feasible for large-scale screening due to its expense and complexity. The development of portable, point-of-care VOC detection platforms may help overcome these limitations, enabling broader clinical implementation and possibly reducing the economic and logistical burden associated with current screening methods. Future studies should directly compare the clinical and economic outcomes of VOC-based POCT and existing modalities such as LDCT to determine the most effective and sustainable strategies for early cancer detection. Despite these limitations, this study provides a strong foundation for the development of breath analysis as a valuable tool for the early detection, diagnosis, and monitoring of thoracic cancers.

Conclusion

This study establishes the effectiveness of a breath-derived VOC model in distinguishing malignant and benign thoracic lesions, demonstrating its capability for multi-cancer detection and early-stage diagnosis. By pioneering breathomics for simultaneous identification of multiple thoracic malignancies and exploring its potential for postoperative monitoring, this work introduces a novel integration of non-invasive diagnostics with therapeutic surveillance. Compared to traditional serum biomarkers, the approach demonstrates superior sensitivity while eliminating invasive sampling, offering a patient-friendly alternative with clinical scalability. The methodology holds promise for improving early cancer detection and real-time postoperative evaluation, potentially enhancing clinical decision-making and personalized patient management. Future efforts should prioritize validation in broader populations, refinement of the predictive model, and development of point-of-care devices to facilitate clinical translation and improve patient outcomes.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by Ethics Committee of the First Affiliated Hospital of Sun Yat-sen University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

ZC: Writing – review & editing, Conceptualization, Supervision. MP: Writing – original draft, Writing – review & editing, Conceptualization. PF: Writing – review & editing, Formal Analysis, Data curation, Methodology. SC: Investigation, Writing – review & editing, Methodology. XC: Methodology, Formal Analysis, Writing – review & editing, Data curation. BX: Writing – review & editing, Formal Analysis, Validation. RC: Formal Analysis, Data curation, Resources, Writing – review & editing. XH: Methodology, Writing – review & editing, Visualization. WW: Validation, Writing – review & editing, Data curation, Investigation. TZ: Methodology, Data curation, Investigation, Writing – review & editing. JK: Writing – review & editing, Methodology, Data curation. WL: Software, Methodology, Writing – review & editing, Investigation. XQ: Writing – review & editing, Data curation, Project administration. STC: Methodology, Data curation, Supervision, Writing – review & editing. JW: Writing – review & editing, Project administration, Funding acquisition.

Funding

The author(s) declare financial support was received for the research and/or publication of this article. This study was supported by the Disruptive Technology Innovation Project of the National Key Research and Development Program of China (2024YFF1501500).

Conflict of interest

Authors MP, PF, JK, WL, XQ, STC, JW were employed by the company ChromX Health Co., Ltd.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2025.1635280/full#supplementary-material

References

1. Bray F, Laversanne M, Sung H, Ferlay J, Siegel RL, Soerjomataram I, et al. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. (2024) 74:229–63. doi: 10.3322/caac.21834

PubMed Abstract | Crossref Full Text | Google Scholar

2. Allemani C, Matsuda T, Di Carlo V, Harewood R, Matz M, Niksic M, et al. Global surveillance of trends in cancer survival 2000-14 (CONCORD-3): analysis of individual records for 37 513–025 patients diagnosed with one of 18 cancers from 322 population-based registries in 71 countries. Lancet. (2018) 391:1023–75. doi: 10.1016/S0140-6736(17)33326-3

PubMed Abstract | Crossref Full Text | Google Scholar

3. Sheikh M, Roshandel G, McCormack V, and Malekzadeh R. Current status and future prospects for esophageal cancer. Cancers (Basel). (2023) 15:765. doi: 10.3390/cancers15030765

PubMed Abstract | Crossref Full Text | Google Scholar

4. Alqaidy D. Thymoma: an overview. Diagnostics (Basel). (2023) 13:2982. doi: 10.3390/diagnostics13182982

PubMed Abstract | Crossref Full Text | Google Scholar

5. Lopes R and Betrouni N. Fractal and multifractal analysis: a review. Med Image Anal. (2009) 13:634–49. doi: 10.1016/j.media.2009.05.003

PubMed Abstract | Crossref Full Text | Google Scholar

6. Sun Y, Ge X, Niu R, Gao J, Shi Y, Shao X, et al. PET/CT radiomics and deep learning in the diagnosis of benign and Malignant pulmonary nodules: progress and challenges. Front Oncol. (2024) 14:1491762. doi: 10.3389/fonc.2024.1491762

PubMed Abstract | Crossref Full Text | Google Scholar

7. Nasim F, Sabath BF, and Eapen GA. Lung cancer. Med Clin North Am. (2019) 103:463–73. doi: 10.1016/j.mcna.2018.12.006

PubMed Abstract | Crossref Full Text | Google Scholar

8. Huang FL and Yu SJ. Esophageal cancer: Risk factors, genetic association, and treatment. Asian J Surg. (2018) 41:210–5. doi: 10.1016/j.asjsur.2016.10.005

PubMed Abstract | Crossref Full Text | Google Scholar

9. Phillips M, Herrera J, Krishnan S, Zain M, Greenberg J, and Cataneo RN. Variation in volatile organic compounds in the breath of normal humans. J Chromatogr B BioMed Sci Appl. (1999) 729:75–88. doi: 10.1016/s0378-4347(99)00127-9

PubMed Abstract | Crossref Full Text | Google Scholar

10. Minh Tdo C and Blake DR and Galassetti PR. The clinical potential of exhaled breath analysis for diabetes mellitus. Diabetes Res Clin Pract. (2012) 97:195–205. doi: 10.1016/j.diabres.2012.02.006

PubMed Abstract | Crossref Full Text | Google Scholar

11. Bake B, Larsson P, Ljungkvist G, Ljungstrom E, and Olin AC. Exhaled particles and small airways. Respir Res. (2019) 20:8. doi: 10.1186/s12931-019-0970-9

PubMed Abstract | Crossref Full Text | Google Scholar

12. Brusselmans L, Arnouts L, Millevert C, Vandersnickt J, van Meerbeeck JP, and Lamote K. Breath analysis as a diagnostic and screening tool for Malignant pleural mesothelioma: a systematic review. Transl Lung Cancer Res. (2018) 7:520–36. doi: 10.21037/tlcr.2018.04.09

PubMed Abstract | Crossref Full Text | Google Scholar

13. Phillips M, Greenberg J, and Awad J. Metabolic and environmental origins of volatile organic compounds in breath. J Clin Pathol. (1994) 47:1052–3. doi: 10.1136/jcp.47.11.1052

PubMed Abstract | Crossref Full Text | Google Scholar

14. Haworth JJ, Pitcher CK, Ferrandino G, Hobson AR, Pappan KL, and Lawson JLD. Breathing new life into clinical testing and diagnostics: perspectives on volatile biomarkers from breath. Crit Rev Clin Lab Sci. (2022) 59:353–72. doi: 10.1080/10408363.2022.2038075

PubMed Abstract | Crossref Full Text | Google Scholar

15. Fan X, Zhong R, Liang H, Zhong Q, Huang H, He J, et al. Exhaled VOC detection in lung cancer screening: a comprehensive meta-analysis. BMC Cancer. (2024) 24:775. doi: 10.1186/s12885-024-12537-7

PubMed Abstract | Crossref Full Text | Google Scholar

16. Saalberg Y and Wolff M. VOC breath biomarkers in lung cancer. Clin Chim Acta. (2016) 459:5–9. doi: 10.1016/j.cca.2016.05.013

PubMed Abstract | Crossref Full Text | Google Scholar

17. Peled N, Fuchs V, Kestenbaum EH, Oscar E, and Bitran R. An update on the use of exhaled breath analysis for the early detection of lung cancer. Lung Cancer (Auckl). (2021) 12:81–92. doi: 10.2147/LCTT.S320493

PubMed Abstract | Crossref Full Text | Google Scholar

18. Su Z, Yu X, He Y, Sha T, Guo H, Tao Y, et al. Inconsistencies in predictive models based on exhaled volatile organic compounds for distinguishing between benign pulmonary nodules and lung cancer: a systematic review. BMC Pulm Med. (2024) 24:551. doi: 10.1186/s12890-024-03374-2

PubMed Abstract | Crossref Full Text | Google Scholar

19. Leemans M, Bauer P, Cuzuel V, Audureau E, and Fromantin I. Volatile organic compounds analysis as a potential novel screening tool for breast cancer: A systematic review. biomark Insights. (2022) 17:11772719221100709. doi: 10.1177/11772719221100709

PubMed Abstract | Crossref Full Text | Google Scholar

20. Chung J, Akter S, Han S, Shin Y, Choi TG, Kang I, et al. Diagnosis by volatile organic compounds in exhaled breath in exhaled breath from patients with gastric and colorectal cancers. Int J Mol Sci. (2022) 24:129. doi: 10.3390/ijms24010129

PubMed Abstract | Crossref Full Text | Google Scholar

21. Van Liere E, van Dijk LJ, Bosch S, Vermeulen L, Heymans MW, Burchell GL, et al. Urinary volatile organic compounds for colorectal cancer screening: A systematic review and meta-analysis. Eur J Cancer. (2023) 186:69–82. doi: 10.1016/j.ejca.2023.03.002

PubMed Abstract | Crossref Full Text | Google Scholar

22. Hintzen KFH, Grote J, Wintjens A, Lubbers T, Eussen MMM, van Schooten FJ, et al. Breath analysis for the detection of digestive tract Malignancies: systematic review. BJS Open. (2021) 5:zrab013. doi: 10.1093/bjsopen/zrab013

PubMed Abstract | Crossref Full Text | Google Scholar

23. Gordon SM, Szidon JP, Krotoszynski BK, Gibbons RD, and O'Neill HJ. Volatile organic compounds in exhaled air from patients with lung cancer. Clin Chem. (1985) 31:1278–82. doi: 10.1093/clinchem/31.8.1278

PubMed Abstract | Crossref Full Text | Google Scholar

24. Kumar S, Huang J, Abbassi-Ghadi N, Mackenzie HA, Veselkov KA, Hoare JM, et al. Mass spectrometric analysis of exhaled breath for the identification of volatile organic compound biomarkers in esophageal and gastric adenocarcinoma. Ann Surg. (2015) 262:981–90. doi: 10.1097/SLA.0000000000001101

PubMed Abstract | Crossref Full Text | Google Scholar

25. Boccard J and Rutledge DN. A consensus orthogonal partial least squares discriminant analysis (OPLS-DA) strategy for multiblock Omics data fusion. Anal Chim Acta. (2013) 769:30–9. doi: 10.1016/j.aca.2013.01.022

PubMed Abstract | Crossref Full Text | Google Scholar

26. Zhu H, Wang Y, Wei X, Shi W, Yin H, and Gao C. Logistic regression analysis of risk factors for Respiratory Distress Syndrome in Late Preterm Infants. Pak J Med Sci. (2025) 41:461–5. doi: 10.12669/pjms.41.2.9796

PubMed Abstract | Crossref Full Text | Google Scholar

27. Jin Y, Lan A, Dai Y, Jiang L, and Liu S. Development and testing of a random forest-based machine learning model for predicting events among breast cancer patients with a poor response to neoadjuvant chemotherapy. Eur J Med Res. (2023) 28:394. doi: 10.1186/s40001-023-01361-7

PubMed Abstract | Crossref Full Text | Google Scholar

28. Florimbi G, Fabelo H, Torti E, Lazcano R, Madronal D, Ortega S, et al. Accelerating the K-Nearest Neighbors Filtering Algorithm to Optimize the Real-Time Classification of Human Brain Tumor in Hyperspectral Images. Sensors (Basel). (2018) 18:2314. doi: 10.3390/s18072314

PubMed Abstract | Crossref Full Text | Google Scholar

29. Ogunleye A and Wang QG. XGBoost model for chronic kidney disease diagnosis. IEEE/ACM Trans Comput Biol Bioinform. (2020) 17:2131–40. doi: 10.1109/TCBB.2019.2911071

PubMed Abstract | Crossref Full Text | Google Scholar

30. Polat H, Danaei Mehr H, and Cetin A. Diagnosis of chronic kidney disease based on support vector machine by feature selection methods. J Med Syst. (2017) 41:55. doi: 10.1007/s10916-017-0703-x

PubMed Abstract | Crossref Full Text | Google Scholar

31. Koureas M, Kirgou P, Amoutzias G, Hadjichristodoulou C, Gourgoulianis K, and Tsakalof A. Target analysis of volatile organic compounds in exhaled breath for lung cancer discrimination from other pulmonary diseases and healthy persons. Metabolites. (2020) 10:317. doi: 10.3390/metabo10080317

PubMed Abstract | Crossref Full Text | Google Scholar

32. Callol-Sanchez L, Munoz-Lucas MA, Gomez-Martin O, Maldonado-Sanz JA, Civera-Tejuca C, Gutierrez-Ortega C, et al. Observation of nonanoic acid and aldehydes in exhaled breath of patients with lung cancer. J Breath Res. (2017) 11:026004. doi: 10.1088/1752-7163/aa6485

PubMed Abstract | Crossref Full Text | Google Scholar

33. Chen X, Muhammad KG, Madeeha C, Fu W, Xu L, Hu Y, et al. Calculated indices of volatile organic compounds (VOCs) in exhalation for lung cancer screening and early detection. Lung Cancer. (2021) 154:197–205. doi: 10.1016/j.lungcan.2021.02.006

PubMed Abstract | Crossref Full Text | Google Scholar

34. Wang M, Sheng J, Wu Q, Zou Y, Hu Y, Ying K, et al. Confounding effect of benign pulmonary diseases in selecting volatile organic compounds as markers of lung cancer. J Breath Res. (2018) 12:046013. doi: 10.1088/1752-7163/aad9cc

PubMed Abstract | Crossref Full Text | Google Scholar

35. Zou Y, Hu Y, Jiang Z, Chen Y, Zhou Y, Wang Z, et al. Exhaled metabolic markers and relevant dysregulated pathways of lung cancer: a pilot study. Ann Med. (2022) 54:790–802. doi: 10.1080/07853890.2022.2048064

PubMed Abstract | Crossref Full Text | Google Scholar

36. Fernandez Del Rio R, O'Hara ME, Holt A, Pemberton P, Shah T, Whitehouse T, et al. Volatile biomarkers in breath associated with liver cirrhosis - comparisons of pre- and post-liver transplant breath samples. EBioMedicine. (2015) 2:1243–50. doi: 10.1016/j.ebiom.2015.07.027

PubMed Abstract | Crossref Full Text | Google Scholar

37. O'Neill HJ, Gordon SM, Krotoszynski B, Kavin H, and Szidon JP. Identification of isoprenoid-type components in human expired air: a possible shunt pathway in sterol metabolism. BioMed Chromatogr. (1987) 2:66–70. doi: 10.1002/bmc.1130020206

PubMed Abstract | Crossref Full Text | Google Scholar

38. Peralbo-Molina A, Calderon-Santiago M, Priego-Capote F, Jurado-Gamez B, and Luque de Castro MD. Identification of metabolomics panels for potential lung cancer screening by analysis of exhaled breath condensate. J Breath Res. (2016) 10:26002. doi: 10.1088/1752-7155/10/2/026002

PubMed Abstract | Crossref Full Text | Google Scholar

39. Dung CH and Wu SC and Yen GC. Genotoxicity and oxidative stress of the mutagenic compounds formed in fumes of heated soybean oil, sunflower oil and lard. Toxicol In Vitro. (2006) 20:439–47. doi: 10.1016/j.tiv.2005.08.019

PubMed Abstract | Crossref Full Text | Google Scholar

40. Wu SC and Yen GC and Sheu F. Mutagenicity and identification of mutagenic compounds of fumes obtained from heating peanut oil. J Food Prot. (2001) 64:240–5. doi: 10.4315/0362-028x-64.2.240

PubMed Abstract | Crossref Full Text | Google Scholar

41. Zbar AP. Exhaled volatile organic compounds identify patients with colorectal cancer (Br J Surg 2013; 100: 144-150). Br J Surg. (2013) 100:151. doi: 10.1002/bjs.8970

PubMed Abstract | Crossref Full Text | Google Scholar

42. Kononova E, Mezmale L, Polaka I, Veliks V, Anarkulova L, Vilkoite I, et al. Breath fingerprint of colorectal cancer patients based on the gas chromatography-mass spectrometry analysis. Int J Mol Sci. (2024) 25:1632. doi: 10.3390/ijms25031632

PubMed Abstract | Crossref Full Text | Google Scholar

43. Amal H, Leja M, Broza YY, Tisch U, Funka K, Liepniece-Karele I, et al. Geographical variation in the exhaled volatile organic compounds. J Breath Res. (2013) 7:47102. doi: 10.1088/1752-7155/7/4/047102

PubMed Abstract | Crossref Full Text | Google Scholar

44. Xu ZQ, Broza YY, Ionsecu R, Tisch U, Ding L, Liu H, et al. A nanomaterial-based breath test for distinguishing gastric cancer from benign gastric conditions. Br J Cancer. (2013) 108:941–50. doi: 10.1038/bjc.2013.44

PubMed Abstract | Crossref Full Text | Google Scholar

45. Shehada N, Bronstrup G, Funka K, Christiansen S, Leja M, and Haick H. Ultrasensitive silicon nanowire for real-world gas sensing: noninvasive diagnosis of cancer from breath volatolome. Nano Lett. (2015) 15:1288–95. doi: 10.1021/nl504482t

PubMed Abstract | Crossref Full Text | Google Scholar

46. Markar SR, Wiggins T, Antonowicz S, Chin ST, Romano A, Nikolic K, et al. Assessment of a noninvasive exhaled breath test for the diagnosis of oesophagogastric cancer. JAMA Oncol. (2018) 4:970–6. doi: 10.1001/jamaoncol.2018.0991

PubMed Abstract | Crossref Full Text | Google Scholar

47. Abbassi-Ghadi N, Kumar S, Huang J, Goldin R, Takats Z, and Hanna GB. Metabolomic profiling of oesophago-gastric cancer: a systematic review. Eur J Cancer. (2013) 49:3625–37. doi: 10.1016/j.ejca.2013.07.004

PubMed Abstract | Crossref Full Text | Google Scholar

48. Altomare DF, Di Lena M, Porcelli F, Trizio L, Travaglio E, Tutino M, et al. Exhaled volatile organic compounds identify patients with colorectal cancer. Br J Surg. (2013) 100:144–50. doi: 10.1002/bjs.8942

PubMed Abstract | Crossref Full Text | Google Scholar

49. Wroblewski LE, Peek RM Jr., and Coburn LA. The role of the microbiome in gastrointestinal cancer. Gastroenterol Clin North Am. (2016) 45:543–56. doi: 10.1016/j.gtc.2016.04.010

PubMed Abstract | Crossref Full Text | Google Scholar

50. Ge D, Zhou J, Chu Y, Lu Y, Zou X, Xia L, et al. Distinguish oral-source VOCs and control their potential impact on breath biomarkers. Anal Bioanal Chem. (2022) 414:2275–84. doi: 10.1007/s00216-021-03866-8

PubMed Abstract | Crossref Full Text | Google Scholar

51. Shi Y, Zhang J, He J, Liu D, Meng X, Huang T, et al. A method of detecting two tumor markers (p-hydroxybenzoic acid and p-cresol) in human urine using a porous magnetic -cyclodextrine polymer as solid phase extractant, an alternative for early gastric cancer diagnosis. Talanta. (2019) 191:133–40. doi: 10.1016/j.talanta.2018.08.036

PubMed Abstract | Crossref Full Text | Google Scholar

52. Filipiak W, Wenzel M, Ager C, Mayhew CA, Bogiel T, Wlodarski R, et al. Molecular Analysis of Volatile Metabolites Synthesized by Candida albicans and Staphylococcus aureus in In Vitro Cultures and Bronchoalveolar Lavage Specimens Reflecting Single- or Duo-Factor Pneumonia. Biomolecules. (2024) 14:788. doi: 10.3390/biom14070788

PubMed Abstract | Crossref Full Text | Google Scholar

53. Debarba LK, Jayarathne HSM, Stilgenbauer L, Terra Dos Santos AL, Koshko L, Scofield S, et al. Microglial NF-kappaB signaling deficiency protects against metabolic disruptions caused by volatile organic compound via modulating the hypothalamic transcriptome. bioRxiv. (2023). doi: 10.1101/2023.11.08.566279

PubMed Abstract | Crossref Full Text | Google Scholar

54. Snieckute G, Ryder L, Vind AC, Wu Z, Arendrup FS, Stoneley M, et al. ROS-induced ribosome impairment underlies ZAKalpha-mediated metabolic decline in obesity and aging. Science. (2023) 382:eadf3208. doi: 10.1126/science.adf3208

PubMed Abstract | Crossref Full Text | Google Scholar

55. Wang P, Huang Q, Meng S, Mu T, Liu Z, He M, et al. Identification of lung cancer breath biomarkers based on perioperative breathomics testing: A prospective observational study. EClinicalMedicine. (2022) 47:101384. doi: 10.1016/j.eclinm.2022.101384

PubMed Abstract | Crossref Full Text | Google Scholar

56. Nardi-Agmon I, Abud-Hawa M, Liran O, Gai-Mor N, Ilouze M, Onn A, et al. Exhaled breath analysis for monitoring response to treatment in advanced lung cancer. J Thorac Oncol. (2016) 11:827–37. doi: 10.1016/j.jtho.2016.02.017

PubMed Abstract | Crossref Full Text | Google Scholar

57. Hanna GB, Boshier PR, Markar SR, and Romano A. Accuracy and methodologic challenges of volatile organic compound-based exhaled breath tests for cancer diagnosis: A systematic review and meta-analysis. JAMA Oncol. (2019) 5:e182815. doi: 10.1001/jamaoncol.2018.2815

PubMed Abstract | Crossref Full Text | Google Scholar

58. Wang J, Nunovero N, Nidetz R, Peterson SJ, Brookover BM, Steinecker WH, et al. Belt-mounted micro-gas-chromatograph prototype for determining personal exposures to volatile-organic-compound mixture components. Anal Chem. (2019) 91:4747–54. doi: 10.1021/acs.analchem.9b00263

PubMed Abstract | Crossref Full Text | Google Scholar

59. Zang W, Liu Z, Kulkarni GS, Zhu H, Wu Y, Lee K, et al. A microcolumn DC graphene sensor for rapid, sensitive, and universal chemical vapor detection. Nano Lett. (2021) 21:10301–8. doi: 10.1021/acs.nanolett.1c03416

PubMed Abstract | Crossref Full Text | Google Scholar

60. Huang X, Sharma R, Sivakumar AD, Yang S, and Fan X. Ultrathin silica integration for enhancing reliability of microfluidic photoionization detectors. Anal Chem. (2023) 95:8496–504. doi: 10.1021/acs.analchem.3c00354

PubMed Abstract | Crossref Full Text | Google Scholar

61. Wei-Hao Li M, Ghosh A, Venkatasubramanian A, Sharma R, Huang X, and Fan X. High-sensitivity micro-gas chromatograph-photoionization detector for trace vapor detection. ACS Sens. (2021) 6:2348–55. doi: 10.1021/acssensors.1c00482

PubMed Abstract | Crossref Full Text | Google Scholar

62. Sharma R, Zang W, Zhou M, Schafer N, Begley LA, Huang YJ, et al. Real time breath analysis using portable gas chromatography for adult asthma phenotypes. Metabolites. (2021) 11:265. doi: 10.3390/metabo11050265

PubMed Abstract | Crossref Full Text | Google Scholar

63. Sharma R, Zang W, Tabartehfarahani A, Lam A, Huang X, Sivakumar AD, et al. Portable breath-based volatile organic compound monitoring for the detection of COVID-19 during the circulation of the SARS-coV-2 delta variant and the transition to the SARS-coV-2 omicron variant. JAMA Netw Open. (2023) 6:e230982. doi: 10.1001/jamanetworkopen.2023.0982

PubMed Abstract | Crossref Full Text | Google Scholar

64. Tian J, Zhang Q, Peng M, Guo L, Zhao Q, Lin W, et al. Exhaled volatile organic compounds as novel biomarkers for early detection of COPD, asthma, and PRISm: a cross-sectional study. Respir Res. (2025) 26:173. doi: 10.1186/s12931-025-03242-5

PubMed Abstract | Crossref Full Text | Google Scholar

65. Lai W, Li D, Wang J, Geng Q, Xia Y, Fu Y, et al. Exhaled breath is feasible for mild cognitive impairment detection: A diagnostic study with portable micro-gas chromatography. J Alzheimers Dis. (2025) 104:751–62. doi: 10.1177/13872877251319553

PubMed Abstract | Crossref Full Text | Google Scholar

66. Buma AIG, Muntinghe-Wagenaar MB, van der Noort V, de Vries R, Schuurbiers MMF, Sterk PJ, et al. Lung cancer detection by electronic nose analysis of exhaled breath: a multicentre prospective external validation study. Ann Oncol. (2025) 36:786–95. doi: 10.1016/j.annonc.2025.03.013

PubMed Abstract | Crossref Full Text | Google Scholar

67. Vishinkin R, Busool R, Mansour E, Fish F, Esmail A, Kumar P, et al. Profiles of volatile biomarkers detect tuberculosis from skin. Adv Sci (Weinh). (2021) 8:e2100235. doi: 10.1002/advs.202100235

PubMed Abstract | Crossref Full Text | Google Scholar

68. National Lung Screening Trial Research T, Aberle DR, Adams AM, Berg CD, Black WC, Clapp JD, et al. Reduced lung-cancer mortality with low-dose computed tomographic screening. N Engl J Med. (2011) 365:395–409. doi: 10.1056/NEJMoa1102873

PubMed Abstract | Crossref Full Text | Google Scholar

69. Tan J, Taitz J, Nanan R, Grau G, and Macia L. Dysbiotic Gut Microbiota-Derived Metabolites and Their Role in Non-Communicable Diseases. Int J Mol Sci. (2023) 24:15256. doi: 10.3390/ijms242015256

PubMed Abstract | Crossref Full Text | Google Scholar

70. Fryc J and Naumnik B. Thrombolome and its emerging role in chronic kidney diseases. Toxins (Basel). (2021) 13:223. doi: 10.3390/toxins13030223

PubMed Abstract | Crossref Full Text | Google Scholar

71. Gryp T, Vanholder R, Vaneechoutte M, and Glorieux G. p-cresyl sulfate. Toxins (Basel). (2017) 9:52. doi: 10.3390/toxins9020052

PubMed Abstract | Crossref Full Text | Google Scholar

72. Toft PB, Vanslette AM, Trost K, Moritz T, Gillum MP, Backhed F, et al. Microbial metabolite p-cresol inhibits gut hormone expression and regulates small intestinal transit in mice. Front Endocrinol (Lausanne). (2023) 14:1200391. doi: 10.3389/fendo.2023.1200391

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: breathomics, volatile organic compounds, exhaled breath, thoracic cancer, machine learning, early diagnosis, thermal desorption-gas chromatography-mass spectrometry, postoperative monitoring

Citation: Chen Z, Peng M, Fan P, Chen S, Cheng X, Xu B, Chen R, Hu X, Wei W, Zhao T, Kong J, Liang W, Qiu X, Chen S and Wang J (2025) Machine learning assisted breathomic approach for early-stage thoracic cancer detection. Front. Oncol. 15:1635280. doi: 10.3389/fonc.2025.1635280

Received: 26 May 2025; Accepted: 25 August 2025;
Published: 17 September 2025.

Edited by:

Michael N. Kammer, Université Toulouse 1 Capitole, France

Reviewed by:

Yuanpin Zhou, Zhejiang University, China
Qing-Qing Yu, Jining First People’s Hospital, China

Copyright © 2025 Chen, Peng, Fan, Chen, Cheng, Xu, Chen, Hu, Wei, Zhao, Kong, Liang, Qiu, Chen and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zhenguang Chen, Y2h6aGVuZ0BtYWlsLnN5c3UuZWR1LmNu; Junqi Wang, anVucWkud2FuZ0BjaHJvbXhoZWFsdGguY29t

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.