SYSTEMATIC REVIEW article
Front. Oncol.
Sec. Thoracic Oncology
Volume 15 - 2025 | doi: 10.3389/fonc.2025.1634694
This article is part of the Research TopicRadiomics and Artificial Intelligence in Oncology ImagingView all 20 articles
Artificial Intelligence Versus Radiologists in Predicting Lung Cancer Treatment Response: A Systematic Review and Meta-Analysis
Provisionally accepted- 1Saint Louis University, St Louis, United States
- 2Deparment of Medicine, Division of Hematology Oncology and Bone Marrow Transplant, St. Louis, United States
- 3Universidad de El Salvador, San Salvador, El Salvador
- 4Departmetn of OBGYN Oncology, Social Security System of El Salvador, San Salvador, El Salvador
- 5Johns Hopkins University, Baltimore, United States
- 6Johns Hopkins University Department of Medicine, Baltimore, United States
- 7Department of Medicine, Division of hematology Oncology and Bone Marrow Transplant, St. Louis University, Saint Louis, United States
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Background: Artificial intelligence (AI) has emerged as a promising adjunct to radiologist interpretation in oncology imaging. This systematic review and meta-analysis compares the diagnostic performance of AI systems versus radiologists in predicting lung cancer treatment response, focusing solely on treatment response rather than diagnosis. Methods: We systematically searched PubMed, Embase, Scopus, Web of Science, and the Cochrane Library from inception to March 31, 2025; Google Scholar and CINAHL were used for citation chasing/grey literature. The review protocol was prospectively registered in PROSPERO (CRD420251048243). Studies directly comparing AI-based imaging analysis with radiologist interpretation for predicting treatment response in lung cancer were included. Two reviewers extracted data independently (Cohen's κ = 0.87). We pooled sensitivity, specificity, accuracy, and risk differences using DerSimonian–Laird random-effects models. Heterogeneity (I²), threshold effects (Spearman correlation), and publication bias (funnel plots, Egger's test) were assessed. Subgroups were prespecified by imaging modality and therapy class. Results: Eleven retrospective studies (n = 6,615) were included. Pooled sensitivity for AI was 0.9 (95% CI: 0.8–0.9; I² = 58%), specificity 0.8 (95% CI: 0.8–0.9; I² = 52%), and accuracy 0.9 (95% CI: 0.8–0.9; pooled OR = 1.4, 95% CI: 1.2–1.7). Risk difference favored AI by 0.06 for sensitivity and 0.04 for specificity. AI's advantage was most apparent in CT and PET/CT, with smaller/non-significant gains in MRI. Egger's test suggested no significant publication bias (p = 0.21). Conclusion: AI demonstrates modest but statistically significant superiority over radiologists in predicting lung cancer treatment response, particularly in CT and PET/CT imaging. However, generalizability is limited by retrospective study dominance, incomplete demographic reporting, lack of regulatory clearance, and minimal cost-effectiveness evaluation. Prospective, multicenter trials incorporating explainable AI (e.g., SHAP, Grad-CAM), equity assessments, and formal economic analyses are needed.
Keywords: artificial intelligence, lung cancer, treatment response, Radiomics, Diagnostic accuracy
Received: 25 May 2025; Accepted: 12 Sep 2025.
Copyright: © 2025 Guevara Rodriguez, Coreas, Kumar Panjiyar and Kunwor. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Nehemias Guevara Rodriguez, Saint Louis University, St Louis, United States
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.