ORIGINAL RESEARCH article
Front. Oncol.
Sec. Thoracic Oncology
Volume 15 - 2025 | doi: 10.3389/fonc.2025.1635280
This article is part of the Research TopicAdvancing Diagnostic Excellence in Early Lung Cancer DetectionView all 3 articles
Machine Learning Assisted Breathomic Approach for Early-Stage Thoracic Cancer Detection
Provisionally accepted- 1The First Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
- 2ChromX Health Co., Ltd., Guangzhou, China
- 3Sun Yat-sen University Cancer Center, Guangzhou, China
- 4Guizhou Hospital of The First Affiliated Hospital of Sun Yat-sen University, Guizhou, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Objective: This study explores the feasibility of using breathomic biomarkers analyzed by machine learning as a non-invasive diagnostic tool to differentiate between benign and malignant thoracic lesions, aiming to enhance early detection of thoracic cancers and inform clinical decision-making.: This study enrolled 132 participants with confirmed diagnosis of lung cancer, esophageal cancer, thymoma, and benign diseases. Exhaled breath samples were analyzed by thermal desorption-gas chromatography-mass spectrometry. A logistic regression algorithm was employed to construct a classification model for benign and malignant thoracic lesions. This model was trained on a subset of 80 cases and subsequently validated in a separate set comprising 52 samples. Results: A logistic regression model based on thirteen exhaled volatile organic compounds (VOCs) was developed to differentiate benign and malignant thoracic lesions. The 13-VOC model achieved an AUC of 0.85 (0.72, 0.96), accuracy of 0.79 (0.66, 0.88), sensitivity of 0.82 (0.67, 0.91), and a specificity of 0.71 (0.45, 0.88). It correctly classified 80% of lung cancer, 80% of thymoma, and 100% of esophageal cancer cases, distinguishing 71.4% of benign lesions. For lung cancer, the model achieved an AUC of 0.79 (0.57, 0.98), sensitivity of 0.80 (0.63, 0.91), and specificity 4 / 49 of 0.63 (0.31, 0.86), with 81.8 % accuracy in detecting early-stage (Stage 0 + I + II) disease. The model outperformed a 4-serum tumor marker panel in sensitivity (0.90 vs. 0.39, p < 0.001). Additionally, in a cohort of 58 cancer patients, model-predicted risk significantly decreased post-surgery (p < 0.01), indicating a strong correlation with disease burden reduction.This study demonstrates the feasibility of utilizing breathomics biomarkers for developing a non-invasive machine learning model for the early diagnosis of thoracic malignancies. These findings provide a foundation for breath analysis as a promising tool for early cancer detection, potentially facilitating improved clinical decision-making and enhancing patient outcomes.
Keywords: Trial registration: Chinese Clinical Trial Registry, ChiCTR2200061264 Breathomics, Volatile Organic Compounds, exhaled breath, Thoracic cancer, machine learning, early diagnosis, thermal desorption-gas chromatography-mass spectrometry, Postoperative monitoring
Received: 26 May 2025; Accepted: 25 Aug 2025.
Copyright: © 2025 Chen, Peng, Fan, Chen, Cheng, Xu, Chen, Hu, Wei, Zhao, Kong, Liang, Qiu, Chen and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Zhenguang Chen, The First Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.