Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Oncol.

Sec. Thoracic Oncology

Volume 15 - 2025 | doi: 10.3389/fonc.2025.1635280

This article is part of the Research TopicAdvancing Diagnostic Excellence in Early Lung Cancer DetectionView all 3 articles

Machine Learning Assisted Breathomic Approach for Early-Stage Thoracic Cancer Detection

Provisionally accepted
Zhenguang  ChenZhenguang Chen1*Minhua  PengMinhua Peng2Pengnan  FanPengnan Fan2Sai  ChenSai Chen1Xinxin  ChengXinxin Cheng3Bo  XuBo Xu1Ruiping  ChenRuiping Chen1Xiao  HuXiao Hu4Wei  WeiWei Wei4Tingting  ZhaoTingting Zhao4Jun  KongJun Kong2Weiliang  LiangWeiliang Liang2Xiangcheng  QiuXiangcheng Qiu2Sitong  ChenSitong Chen2Junqi  WangJunqi Wang2
  • 1The First Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
  • 2ChromX Health Co., Ltd., Guangzhou, China
  • 3Sun Yat-sen University Cancer Center, Guangzhou, China
  • 4Guizhou Hospital of The First Affiliated Hospital of Sun Yat-sen University, Guizhou, China

The final, formatted version of the article will be published soon.

Objective: This study explores the feasibility of using breathomic biomarkers analyzed by machine learning as a non-invasive diagnostic tool to differentiate between benign and malignant thoracic lesions, aiming to enhance early detection of thoracic cancers and inform clinical decision-making.: This study enrolled 132 participants with confirmed diagnosis of lung cancer, esophageal cancer, thymoma, and benign diseases. Exhaled breath samples were analyzed by thermal desorption-gas chromatography-mass spectrometry. A logistic regression algorithm was employed to construct a classification model for benign and malignant thoracic lesions. This model was trained on a subset of 80 cases and subsequently validated in a separate set comprising 52 samples. Results: A logistic regression model based on thirteen exhaled volatile organic compounds (VOCs) was developed to differentiate benign and malignant thoracic lesions. The 13-VOC model achieved an AUC of 0.85 (0.72, 0.96), accuracy of 0.79 (0.66, 0.88), sensitivity of 0.82 (0.67, 0.91), and a specificity of 0.71 (0.45, 0.88). It correctly classified 80% of lung cancer, 80% of thymoma, and 100% of esophageal cancer cases, distinguishing 71.4% of benign lesions. For lung cancer, the model achieved an AUC of 0.79 (0.57, 0.98), sensitivity of 0.80 (0.63, 0.91), and specificity 4 / 49 of 0.63 (0.31, 0.86), with 81.8 % accuracy in detecting early-stage (Stage 0 + I + II) disease. The model outperformed a 4-serum tumor marker panel in sensitivity (0.90 vs. 0.39, p < 0.001). Additionally, in a cohort of 58 cancer patients, model-predicted risk significantly decreased post-surgery (p < 0.01), indicating a strong correlation with disease burden reduction.This study demonstrates the feasibility of utilizing breathomics biomarkers for developing a non-invasive machine learning model for the early diagnosis of thoracic malignancies. These findings provide a foundation for breath analysis as a promising tool for early cancer detection, potentially facilitating improved clinical decision-making and enhancing patient outcomes.

Keywords: Trial registration: Chinese Clinical Trial Registry, ChiCTR2200061264 Breathomics, Volatile Organic Compounds, exhaled breath, Thoracic cancer, machine learning, early diagnosis, thermal desorption-gas chromatography-mass spectrometry, Postoperative monitoring

Received: 26 May 2025; Accepted: 25 Aug 2025.

Copyright: © 2025 Chen, Peng, Fan, Chen, Cheng, Xu, Chen, Hu, Wei, Zhao, Kong, Liang, Qiu, Chen and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Zhenguang Chen, The First Affiliated Hospital of Sun Yat-sen University, Guangzhou, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.