SYSTEMATIC REVIEW article

Front. Oncol.

Sec. Radiation Oncology

Volume 15 - 2025 | doi: 10.3389/fonc.2025.1576461

Machine Learning Approaches for EGFR Mutation Status Prediction in NSCLC: An Updated Systematic Review

Provisionally accepted
Haixian  LiuHaixian Liu1Shu  PangShu Pang1Zhao  LiZhao Li1Chunfeng  LuChunfeng Lu1Lun  LiLun Li2*
  • 1Weifang People's Hospital, Weifang, Shandong Province, China
  • 2Weifang University, Weifang, China

The final, formatted version of the article will be published soon.

Background: With the rapid advances in artificial intelligence-particularly convolutional neural networks-researchers now exploit CT, PET/CT and other imaging modalities to predict epidermal growth factor receptor (EGFR) mutation status in non-small-cell lung cancer (NSCLC) non-invasively, rapidly and repeatably. End-to-end deep-learning models simultaneously perform feature extraction and classification, capturing not only traditional radiomic signatures such as tumour density and texture but also peri-tumoural micro-environmental cues, thereby offering a higher theoretical performance ceiling than hand-crafted radiomics coupled with classical machine learning. Nevertheless, the need for large, well-annotated datasets, the domain shifts introduced by heterogeneous scanning protocols and preprocessing pipelines, and the "black-box" nature of neural networks all hinder clinical adoption. To address fragmented evidence and scarce external validation, we conducted a systematic review to appraise the true performance of deep-learning and radiomics models for EGFR prediction and to identify barriers to clinical translation, thereby establishing a baseline for forthcoming multicentre prospective studies.Methods: Following PRISMA 2020, we searched PubMed, Web of Science and IEEE Xplore for studies published between 2018 and 2024. Fifty-nine original articles met the inclusion criteria. QUADAS-2 was applied to the eight studies that developed models using real-world clinical data, and details of external validation strategies and performance metrics were extracted systematically.The pooled internal area under the curve (AUC) was 0.78 for radiomics-machine-learning models and 0.84 for deep-learning models. Only 17 studies (29 %) reported independent external validation, where the mean AUC fell to 0.77, indicating a marked domain-shift effect. QUADAS-2 showed that 31 % of studies had high risk of bias in at least one domain, most frequently in Index Test and Patient Selection.Although deep-learning models achieved the best internal performance, their reliance on single-centre data, the paucity of external validation and limited code availability preclude their use as stand-alone clinical decision tools. Future work should involve multicentre prospective designs, federated learning, decision-curve analysis and open sharing of models and data to verify generalisability and facilitate clinical integration.

Keywords: artificial intelligence, Non-small cell lung cancer (NSCLC), EGFR mutation, deep learning, medical imaging

Received: 17 Feb 2025; Accepted: 19 Jun 2025.

Copyright: © 2025 Liu, Pang, Li, Lu and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Lun Li, Weifang University, Weifang, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.