AUTHOR=Han Peng , Zhang Dai , Yao Wenjun , Lv MengYu , Qian YunHong , Zhao Hong TITLE=Noninvasive prediction of EGFR 19Del and 21L858R subtypes in lung adenocarcinoma: a comparative study of logistic regression and decision tree models JOURNAL=Frontiers in Oncology VOLUME=Volume 15 - 2025 YEAR=2025 URL=https://www.frontiersin.org/journals/oncology/articles/10.3389/fonc.2025.1642253 DOI=10.3389/fonc.2025.1642253 ISSN=2234-943X ABSTRACT=ObjectiveDespite the increasing interest in radiogenomic prediction, few studies have directly compared the performance of logistic regression and decision tree models in distinguishing epidermal growth factor receptor (EGFR) mutation subtypes. This study provides the first systematic comparison of the predictive performance of these two models in identifying exon 19 deletions (19Del) and exon 21 L858R point mutations (21L858R) in patients with lung adenocarcinoma. By leveraging imaging and clinical parameters, we aimed to address a critical gap in the literature by establishing an optimal prediction model and providing a noninvasive tool to support personalized treatment strategies for patients with unknown EGFR mutation status.Materials and methodsWe retrospectively collected clinical and radiological data from 193 patients with histologically confirmed lung adenocarcinoma who were admitted to the Second Affiliated Hospital of Anhui Medical University between May 2018 and June 2024. Based on EGFR genotyping results, patients were stratified into two groups: the EGFR 19Del mutation group and the EGFR 21L858R mutation group. Comparative statistical analyses—including Student’s t-test, Mann–Whitney U test, chi-square test, or Fisher’s exact test—were performed to evaluate differences in clinical and CT imaging characteristics between groups. Variables with P < 0.05 in the univariate analysis were subsequently included in both logistic regression and decision tree models to identify independent predictors of EGFR mutation subtype. Model performance was assessed using ROC curve analysis. The area under the curve (AUC) was calculated for each model, and their predictive accuracy was further compared using DeLong’s test, net reclassification improvement (NRI), and integrated discrimination improvement (IDI).ResultsIn the decision tree model, age and brain metastasis emerged as key decision nodes for differentiating 19Del and 21L858R mutations, with an AUC of 0.712 (95% CI: 0.639–0.785). In contrast, the logistic regression model identified age, pleural thickening, lymphadenopathy, and brain metastasis as independent predictors, achieving a higher AUC of 0.740 (95% CI: 0.671–0.810). The NRI and IDI values were 0.498 (P < 0.001, 95% CI: 0.238–0.758) and 0.043 (P = 0.004, 95% CI: 0.013–0.072), respectively, suggesting improved reclassification and discrimination by the logistic model. However, DeLong’s test revealed no statistically significant difference between the AUCs of the two models (Z = 1.314, P = 0.189).ConclusionBoth logistic regression and decision tree models demonstrated value in predicting EGFR 19Del and 21L858R mutations in lung adenocarcinoma, each offering distinct methodological advantages. The logistic regression model exhibited higher interpretability and statistical robustness, making it well-suited for clinical decision-making. Meanwhile, the decision tree model offered superior visual clarity and intuitive structure, which may enhance practical utility. A combined modeling approach that harnesses the strengths of both methods may provide a more accurate and comprehensive tool for early mutation identification and individualized treatment planning in patients with lung adenocarcinoma.