ORIGINAL RESEARCH article

Front. Oncol.

Sec. Thoracic Oncology

Volume 15 - 2025 | doi: 10.3389/fonc.2025.1567673

This article is part of the Research TopicInnovations in Biomarker-Based Lung Cancer ScreeningView all 8 articles

Screening and identification of novel protein markers of early-stage lung cancer and construction and application of screening models

Provisionally accepted
Huijie  YuanHuijie Yuan1Shuyin  DuanShuyin Duan2Clement  Yaw EffahClement Yaw Effah1Sitian  HeSitian He1Yaru  ChaiYaru Chai1Xia  LiuXia Liu1Lihua  DingLihua Ding1*Yongjun  WuYongjun Wu1*
  • 1College of Public Health, Zhengzhou University, Zhengzhou, China
  • 2School of Public Health, Shandong First Medical University & Shandong Academy of Medical Sciences, Jinan, China

The final, formatted version of the article will be published soon.

Molecular biomarkers have the potential to improve the current state of early screening of lung cancer. This investigation aimed to identify novel protein markers for earlystage lung cancer and combine them with traditional tumor markers to develop machine learning models for lung cancer screening.The protein alters of peripheral blood (5 patients with early-stage lung adenocarcinoma, 5 patients with early-stage lung squamous cell carcinoma, and 8 healthy controls) were detected by label-free quantitative proteomics. The novel candidate protein markers were preferentially selected by multi-omics technology. Then, the malignant transformation of BEAS-2B cells and lung carcinogenesis in C57BL/6 mice were induced by coal tar pitch extracts (CTPE) so that the expressions of these markers at different stages of lung carcinogenesis could be dynamically tracked and validated.These markers in human plasma were detected and further confirmed by ELISA.Machine learning models were established to screen high-risk individuals of lung cancer.The C-type lectin domain family 3 member B (CLEC3B), membrane primary amine oxidase (AOC3), hemoglobin subunit beta (HBB), catalase (CAT), and selenoprotein P (SEPP1) were screened as candidate protein markers for early-stage lung cancer. The expressions of CLEC3B, AOC3, CAT, and SEPP1 were statistically significant in various passages of cells cultured with exposure to CTPE compared to the saline group (P<0.05). In addition, the expressions of these 5 proteins were statistically significant in lung tissues, plasma, and alveolar lavage fluid of mice exposed to CTPE for 3, 6, 9 and 12 months compared to normal controls (P<0.05). There were notable variations in AOC3, CAT, CLEC3B, SEPP1, HBB, CEA, CYFRA21-1, and NSE among the healthy control group, lung cancer group and coke oven workers (P<0.05). The decision tree C5.0 (AUC=0.868) and artificial neural network (AUC=0.844) which combined these 8 markers showed better performance.The differential changes of AOC3, CAT, CLEC3B, SEPP1, and HBB protein were proven as early molecular events in lung tumorigenesis. The screening models of lung cancer based on the novel protein markers and traditional tumor markers might be applied for the screening of high-risk individuals.

Keywords: lung cancer, screening, Protein markers, machine learning, High-risk individuals

Received: 28 Jan 2025; Accepted: 07 May 2025.

Copyright: © 2025 Yuan, Duan, Effah, He, Chai, Liu, Ding and Wu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Lihua Ding, College of Public Health, Zhengzhou University, Zhengzhou, China
Yongjun Wu, College of Public Health, Zhengzhou University, Zhengzhou, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.