Deep learning on brain metastasis for predicting EGFR genotype and EGFR-TKI therapy response in metastatic NSCLC: a multicenter study

You, Shuailin; Fan, Ying; Yang, Zhiguang; Yang, Chunna; Sun, Yiyao; Luo, Yahong; Wang, Zekun; Sun, Bo; Jiang, Wenyan

doi:10.3389/fbioe.2025.1637095

ORIGINAL RESEARCH article

Front. Bioeng. Biotechnol., 02 October 2025

Sec. Biosensors and Biomolecular Electronics

Volume 13 - 2025 | https://doi.org/10.3389/fbioe.2025.1637095

This article is part of the Research TopicAdvances in Artificial Intelligence for Early Cancer Detection and Precision OncologyView all 7 articles

Deep learning on brain metastasis for predicting EGFR genotype and EGFR-TKI therapy response in metastatic NSCLC: a multicenter study

Shuailin You^1,2^†

Ying Fan³^†

Zhiguang Yang⁴^†

Chunna Yang²

Yiyao Sun²

Yahong Luo⁵

Zekun Wang⁵*

Bo Sun⁶*

Wenyan Jiang⁷*

¹College of Technology and Data, Yantai Nanshan University, Yantai, China
²School of Intelligent Medicine, China Medical University, Shenyang, Liaoning, China
³College of Biomedical Engineering, Fudan University, Shanghai, China
⁴Department of Radiology, Shengjing Hospital, Shenyang, China
⁵Department of Medical Imaging, Cancer Hospital of China Medical University, Liaoning Cancer Hospital and Institute, Shenyang, Liaoning, China
⁶Department of Radiology, The First Affiliated Hospital of Dalian Medical University, Dalian, Liaoning, China
⁷Department of Scientific Research and Academic, Cancer Hospital of China Medical University, Liaoning Cancer Hospital and Institute, Shenyang, Liaoning, China

Background: Brain metastases are common in patients with advanced non-small cell lung cancer (NSCLC), particularly those harboring EGFR mutations, and accurate prediction of EGFR mutation status and therapeutic response is crucial for guiding targeted therapy. This study aims to conduct a deep learning (DL) approach to automatically predict epidermal growth factor receptor (EGFR) genotype and response to EGFR-tyrosine kinase inhibitor (TKI) therapy in NSCLC patients with brain metastatic tumor (BM).

Methods: For training and validating the DL models, 388 patients were enrolled from three centers between Jul. 2014 and Dec.2022 (230 from center 1, 80 from center 2 and 78 from center 3). Contrast-enhanced T1-weighted (T1CE) and T2-weighted (T2W) brain MRI images before treatment for each patient were obtained for analyses. We developed an EGFR-TKI system (ETS) for automated detection of brain metastatic (BM) lesions and to differentiate EGFR mutation status and predict response to EGFR-TKI therapy. The models underwent rigorous evaluation through receiver operating characteristic (ROC) curve analyses, where metrics such as area under the curve (AUC), sensitivity, and specificity were examined.

Results: For prediction of EGFR mutation status, the ETS integrating radiological-based features and clinical factors achieved AUCs of 0.842, 0.833 and 0.832 on the internal validation, external validation 1 and external validation 2 cohort, respectively. For forecasting response to EGFR-TKI therapy, the fusion model created by amalgamating MRI with clinical factors generated AUCs of 0.747, 0.726 and 0.728 on the internal validation, external validation 1, and external validation 2 cohort, respectively.

Conclusion: The ETS may have the potential to work as a non-invasive tool for predicting EGFR mutation status and response to EGFR-TKI therapy, which holds promise as a non-invasive tool to assist clinicians in making decisions about personalized treatment strategies.

1 Introduction

Lung cancer has been a devastating disease and one of the most frequently diagnosed cancers around the world (Sculier, 2013). Lung cancer primarily begins in the lung and may spread to other organs (Boire et al., 2020). The survival statistics of patients with lung cancer are grim, which is often due to the development of distant metastasis (Arbour and Riely, 2019; Schuchert and Luketich, 2003). The brain metastasis (BM) is a major cause of morbidity in lung cancer and frequently results in poor survival rates of less than 1 year (Boire et al., 2020; Niu et al., 2016). And it was reported that approximately half of the lung cancer patients would develop BM (Arbour and Riely, 2019).

Epidermal growth factor receptor (EGFR)-tyrosine kinase inhibitors (TKIs) have been considered as one of the most effective therapeutic strategies for lung cancers (Lynch et al., 2004). Once the patient is diagnosed as an EGFR mutant, EGFR-TKI therapy can be the first-line choice (Yang et al., 2017). However, the effect of the EGFR-TKI is not always satisfactory, and many cases would suffer from tumour progression after receiving the EGFR-TKIs (Rebuzzi et al., 2020). To date, there is still a lack of accurate and reliable methods for the early detection of the EGFR mutation and evaluating therapeutic response to EGFR-TKI before treatment. Although biopsy sampling is routinely used in clinical settings, the biopsy is invasive and may introduce high risks of tissue damage and tumor cell spread (Thompson et al., 2016). In addition, intratumoral heterogeneities can influence the biopsy analysis results because the biopsy can only reflect a limited region in the tumor (Huang W-L. et al., 2017). Therefore, biopsy-based assessment of EGFR mutation status or response to EGFR-TKI is not suggested. Medical imaging-based assessments, on the other hand, are usually subjective and unreliable (Chetan and Gleeson, 2021). Radiologists can hardly evaluate the EGFR mutation status or therapeutic response because of the absence of a specific marker. There is a great need for an effective and non-invasive method to assist in preoperatively determining which patients can benefit from EGFR-TKI therapy.

Radiomics has demonstrated the relationship between underlying biological mechanisms and clinical significance by computing quantitative features directly from medical images (Lambin et al., 2017). While, traditional handcrafted-based radiomics has limitations (Sculier, 2013): handcrafted features are manually calculated based on previously proposed formulas, which can cover only limited types of features (e.g., shape-based, first-order and textural features), and hence result in limited capabilities of digging valuable information from imaging data (Lambin et al., 2017); and (Boire et al., 2020) the process of feature selection and modeling is laborious and time-consuming (Lambin et al., 2017), which cannot be performed as the end-to-end training and testing. In contrast to machine learning-based approaches, deep learning algorithms have been shown to automatically learn representative information from raw data (Pan et al., 2019; Magadza and Viriri, 2021). Deep learning-based models have been proposed for detecting the EGFR mutation, but all focused on thoracic imaging of the primary lung cancer (Wang S. et al., 2019; Yin et al., 2021; Wang et al., 2022). While, clinical evidences have shown that patients with EGFR mutant NSCLC have a high incidence of BM, which is also known as an important indicator to reflect the therapeutic efficacy (Boire et al., 2020; De Cos et al., 2009). Recent handcrafted radiomics studies proved that information highly associated with response to EGFR-TKI can be captured from the NSCLC originated BM (Fan et al., 2023a; Fan et al., 2023b; Fan et al., 2022), but all simply applied conventional machine learning methods on a limited sample size. To our knowledge, there is still no report investigating the value of deep learning in predicting therapeutic efficacy of EGFR-TKI therapy based on BM. In this study, we proposed an automated artificial intelligence EGFR-TKI system (ETS) to predict EGFR genotype and response to EGFR-TKI treatment, aiming to assist clinicians in making appropriate therapeutic plans based on the ETS predicted possibility of obtaining the benefit from EGFR-TKI treatment.

2 Methods

2.1 Patients

This study was approved by the ethics committee of our hospital. A total of 230 patients were enrolled from center 1 between January 2017 and December 2021 and served as the primary cohort. 80 patients were enrolled from center 2 (between Jul. 2014 and Feb. 2022), and 78 patients were enrolled from center 3 (between Jan. 2020 and Dec. 2022), and served as the external validation cohort 1 and 2, respectively. The Response Evaluation Criteria in Solid Tumors (RECIST) 1.1 (Eisenhauer et al., 2009) was used to determine treatment response to EGFR-TKI therapy. The inclusion criteria include (Sculier, 2013): underwent complete T1CE and T2W brain MRI scans before treatment, and (Boire et al., 2020) had complete gene test results. The exclusion criteria include (Sculier, 2013): with poor MRI image quality (Boire et al., 2020); age less than 18, and (Arbour and Riely, 2019) carrying a primary brain tumor or other tumor diseases. Patients from center 1 were divided into training and internal validation cohorts by random stratified sampling in a ratio of 8:2. Patients from centers 2 and 3 were used as independent sets to validate our DL methods. Figure 1 shows the screening process for patients from all three centers.

Figure 1

Flowchart illustrating the enrollment of 420 patients with metastatic NSCLC for a study, resulting in 388 participants. Inclusion criteria require MRI scans and gene test results; exclusion involves poor MRI quality, age under eighteen, and other brain tumor diseases. Participants are divided into three centers: Center 1 (230), Center 2 (80), and Center 3 (78). Center 1 participants are further categorized into EGFR and EGFR-TKI groups for training and validation, with subgroups based on EGFR status and responders. Centers 2 and 3 are used for external validation, also classified by EGFR status and responders.

Figure 1. Flowchart of patient recruitment in three centers.

2.2 MRI acquisition and region of interest (ROI) segmentation

Patients from center 1 were scanned by a 3.0-T MRI scanner (Siemens Verio, Erlangen, Germany), patients from center 2 were scanned by a 3.0-T MRI scanner (Siemens Magnetom Skyra, Erlangen, Germany), and patients from center 3 were scanned by a 3.0-T MRI scanner (Philips, Ingenia). In center 1, the T1CE MRI scanning parameters were as follows: Repeat time (TR) = 270 ms; Echo time (TE) = 2.48 ms; slice thickness = 5 mm, FOV = 194 × 230 mm, and matrix size = 320 × 216. The T2W MRI scanning parameters were as follows: TR = 3630 ms, TE = 87 ms; slice thickness = 5 mm; FOV 194 × 230 mm, and matrix size = 384 × 227 mm. In center 2, the T1CE MRI scanning parameters were as follows: TR = 1400 ms; TE = 9 ms; slice thickness = 6 mm, FOV = 179 × 230 mm and matrix size = 320 × 187. The T2W MRI scanning parameters were as follows: TR = 3500 ms, TE = 99 ms; slice thickness = 6 mm; FOV = 194 × 230 mm and matrix size = 320 × 270 mm. T1CE MRI images were taken 5 min after Gd-DTPA injection. In center 3, the parameters of T1CE and T2W MRI were as follows: T1CE: TR = 180 ms; TE = 2.3 ms; slice thickness = 6 mm, and matrix size = 256 × 256. T2W: TR = 2000 ms; TE = 80 ms; slice thickness = 6 mm, and matrix size = 256 × 256. The dose was 0.2 mL/kg, and the injection speed was 3 mL/s. The segmentation of regions of interest (ROIs) of the brain metastasis (BM) was performed using the ITK-SNAP (version 3.6.1). A radiologist with 5 years’ experience was invited to manually segment the ROI of BM, who was blinded to the clinicopathological information of the patients, except for the tumor location. And a senior radiologist with 15 years’ experience was invited to validate all manual delineations. Volume of peritumoral edema (VPE) was calculated using ITK-SNAP.

2.3 Development and validation of the ETS

The proposed automated artificial intelligence EGFR-TKI system (ETS) consists of two main components: (i) automatic tumor region segmentation and (ii) EGFR genotype prediction. The EGFR-Model of ETS can automatically recognize the region of interest (ROI), and directly predict the EGFR mutation status. For patients with EGFR mutation, the TKI-Model of ETS predicts response to EGFR-TKI therapy. The architecture of the ETS is shown in Figure 2.

Figure 2

Diagram of a neural network model flow for analyzing medical images. On the left, the process begins with convolution, leaky ReLU, and max pooling, followed by dense blocks (DB) and external attention (EA). Skip connections and convolutions are highlighted. On the right, decision trees combine manual features with deep learning features, indicating non-response and response outcomes. A DNA strand represents EGFR genotype probability.

Figure 2. Architecture of the proposed ETS.

The proposed automated artificial intelligence EGFR-TKI system (ETS) consists of two main components: (i) automatic tumor region segmentation and (ii) EGFR genotype prediction. Specifically, ETS first segments the brain metastasis region using a modified FC-DenseNet with LeakyReLU and external attention (EA), and then predicts EGFR mutation status using a DenseNet-121–based classifier. For patients with EGFR mutation, the system further predicts the response to EGFR-TKI therapy. The architecture of ETS is shown in Figure 2.

The segmentation subnetwork for the ETS is based on the FC-Densenet (Jégou et al., 2017) backbone and uses the LeakyRelu nonlinear activation function to replace the ReLU nonlinear activation function. In addition, an EA is added to the network’s downsampling and upsampling process (Guo et al., 2023). To train the segmentation network, we first performed data augmentation to increase the diversity of training samples and improve the robust performance of the training model. Each MRI image is randomly rotated by 90 degrees, and in addition, each image is randomly selected for data enhancement by one of three non-rigid body transformations: Elastic transform, Grid distortion, and Optical distortion. In the training process, the model is optimally trained by adaptive moment estimation (Adam) (Kinga and Adam, 2015) with a learning rate of 0.0001, the total number of iterations of the training model is 100, and the input size of the model is 128 × 128 × 3.

The classification subnetwork uses the Densenet-121 (Huang G. et al., 2017) as the backbone network. The fully connected layer of the Densenet-121 was replaced with the global average pooling (GAP) (Lin et al., 2025) for discriminating the EGFR mutation status. We applied the ideology of transfer learning, where the classification network was pre-trained on the ImageNet-1k dataset to increase the learning efficiency of the network. We evaluated four model variants, No Seg–VPE, No VPE, No Seg, and Seg–VPE—to isolate the contributions of the segmentation network and the volumetric peritumoral edema (VPE) feature.

To predict EGFR-TKI therapy response, we extracted DL features and handcrafted features from patients with EGFR mutation. The analysis of variance (ANOVA) and principal component analysis (PCA) (Witten et al., 2013) were applied to dimensionality reduction and screen features. Finally, we used a decision tree model to predict the response to EGFR-tyrosine TKI therapy. To enhance interpretability and reveal spatial correlations between image regions and prediction results, we applied Grad-CAM (Selvaraju et al., 2017) to the final convolutional layer of the DenseNet-121 classifier. This allowed us to visualize the discriminative regions that most influenced the EGFR mutation prediction. Since the classifier receives input features extracted from the segmented tumor region, the resulting attention maps reflect localized regions within the BM that are most relevant to the model’s decision-making process. In the training process, the model is optimally trained by adaptive moment estimation (Adam) (Kinga and Adam, 2015) with a learning rate of 0.0001; the epoch of the training model was set to 100. All DL experiments were performed in Python (v.3.6) using Keras (version 2.3) on a single GPU (Nvidia GeForce 3090) workstation.

To validate the predictive performance of the ETS for both EGFR-mutation status and EGFR-TKI response, we conducted independent evaluations on three datasets: an internal hold-out set (20% of the development data) and two external validation cohorts. The fully trained ETS was applied to each dataset. For each task and each cohort (Internal Validation, External Validation 1, External Validation 2), we generated receiver operating characteristic (ROC) curves and calculated the area under the curve (AUC), accuracy, F1 score, precision, and recall. Optimal decision thresholds were selected by maximizing Youden’s index.

2.4 Statistical analysis

All statistical analysis was performed in R software (version 3.6.0). ANOVA was performed for continuous variables, and the chi-square test was used for discrete (categorical) variables. Factors with a p-value less than 0.05 were considered statistically significant. The performance of the ETS was evaluated using area under the curve (AUC), accuracy, F1 score, precision, and recall. All evaluation metrics were implemented in Python (v.3.6) using the scikit-learn library. The Gradient Weighted Class Activation Map (Grad-CAM) was implemented on PyTorch (Version 1.12.0). Figure 3 depicts the workflow of our study.

Figure 3

Diagram illustrating brain metastasis model development and application. Model construction involves clinical problem identification, manual feature extraction from brain images, clinical data processing, and neural network training. Model application includes heatmap analysis for brain images and ROC analysis for evaluating model performance, shown in graphs depicting true positive rates versus false positive rates.

Figure 3. Study design of our work for predicting response to EGFR-TKI treatment. (A) Model construction. (B) Model application.

3 Results

3.1 Clinical characteristics

Table 1 listed demographic and clinical characteristics of the patients with BM originated from primary NSCLC. From Table 1, there was no statistical significance in terms of age, gender, and smoking history.

Table 1

Table 1. Clinical characteristic of patients from three centers.

3.2 Performance for predicting EGFR mutation status

Table 2 compared the performance of the proposed EGFR-Model^{No Seg−VPE}, EGFR-Model^{No VPE}, EGFR-Model^{No Seg} and EGFR-Model^Seg-VPE for predicting the EGFR mutation status. Without the subnetwork for segmentating the BM, the EGFR-Model^{No Seg−VPE} yielded lower AUCs, accuracy, F1-score, precision, and recall compared with EGFR-Model^{No VPE} in primary and external cohorts. The decreased predictive performance in EGFR-Model^{No Seg−VPE} suggested the necessity of the segmentation subnetwork. By integrating VPE, the EGFR-Model^{No Seg} showed better performance than EGFR-Model^{No Seg−VPE} in terms of AUC, accuracy, F1-score, precision, and recall. This indicated that the VPE can provide additional information to improve the capability of predicting the EGFR mutation status. The EGFR-Model^Seg-VPE, integrating both VPE and segmentation subnetworks, performed the best among all models for predicting the EGFR mutation status. ROC curves of all models on primary and external sets were shown in Figure 4. As shown in Figure 5, the Grad-CAM heatmaps highlight high-response areas within the segmented tumor region, indicating that the prediction of EGFR mutation status is driven by biologically relevant features. These results illustrate a link between the model architecture, particularly the segmentation-guided feature extraction, and the spatial mapping of predictive regions.

Table 2

Table 2. Performance of the ETS for predicting the EGFR mutation status.

Figure 4

Three ROC curve plots labeled A, B, and C display the true positive rate against the false positive rate for different EGFR models. Each plot includes four colored lines representing various models: EGFR-Model\(^\text{No Seg-VPE}\) (blue), EGFR-Model\(^\text{No Seg}\) (orange), EGFR-Model\(^\text{No VPE}\) (green), and EGFR-Model\(^\text{Seg-VPE}\) (red). The AUC values vary across plots and models, with model A having AUC values from 0.700 to 0.842, model B from 0.684 to 0.833, and model C from 0.675 to 0.832. A diagonal line indicates random performance.

Figure 4. ROC curves of the ETS for predicting the EGFR mutation status in the internal validation (A), external validation 1 (B), and external validation 2 (C) set.

Figure 5

MRI scans showcase different brain images using T1CE and T2W sequences. Each panel highlights a specific area with a zoomed-in view that includes a color-coded representation of data alongside a grayscale image segment. The top row contains T1CE images, and the bottom row contains T2W images, with distinct details and patterns visible in each section.

Figure 5. Attention heatmaps on the brain metastasis (BM) visualized by Grad-CAM. The first row shows heatmaps in T1CE MRI. The second row shows heatmaps in T2W MRI.

3.3 Performance for predicting response to EGFR-TKI therapy

Table 3 compared the performance of the proposed TKI-Model^{No Seg−VPE}, TKI-Model^{No VPE}, TKI-Model^{No Seg} and TKI-Model^Seg-VPE for predicting response to EGFR-TKI. The TKI-Model^{No Seg−VPE} without the subnetwork for segmenting the BM genarated lower AUC and ACC compared with TKI-Model^{No VPE} that has the segmentation subnetwork. The result indicates the necessity of the segmentation subnetwork. The TKI-Model^{No Seg} integrating VPE outperformed the TKI-Model^{No Seg−VPE} that is without VPE in terms of AUC and ACC in primary and external cohorts. This suggested that the VPE holds additional information correlated to the efficacy of EGFR-TKI. The TKI-Model^Seg-VPE integrating both VPE and the segmentation subnetwork achieved the best predictive performance with AUCs of 0.747, 0.726, and 0.728 in the internal validation, external validation 1 and external validation 2 cohort, respectively. Figure 6 depicted the ROC curves of the TKI-Model for predicting response to EGFR-TKI.

Table 3

Table 3. Performance of the TKI-Model for predicting response to EGFR-TKI.

Figure 6

Three ROC curve plots labeled A, B, and C compare the performance of four TKI-Models in terms of true positive rate versus false positive rate. Each plot shows curves representing models with variations of segmentation and VPE adjustments. Models are color-coded: blue, orange, green, and red with respective AUC values shown in the legend. AUC values range from approximately 0.599 to 0.747, demonstrating varying levels of predictive performance.

Figure 6. ROC curves of the ETS for predicting response to EGFR-TKI in the internal validation (A), external validation 1 (B), and external validation 2 (C) set.

4 Discussion

Current guidelines for clinical assessment of EGFR genotype and therapeutic response to EGFR-TKI rely on visual radiologic assessment, which is subjectively biased and unreliable (Lowery and Yu, 2017). Previous works have shown the power of deep learning in evaluating the efficiency of EGFR-TKI treatment in lung cancer (Song et al., 2021; Deng et al., 2022; Lu et al., 2023), but all have been based on the primary lesion. To our knowledge, deep learning has not been applied to lung cancer-originated brain metastasis (BM) for determining the presence of EGFR mutation and the efficiency of EGFR-TKI therapy.

This study constructed an ETS integrating a segmentation subnetwork and a classification subnetwork. Considering the BM only occupies a small percentage of the brain area, and thus using the whole brain MRI image as input to the network may introduce numerous noise features, we extracted the BM as an upstream task to determine the EGFR genotype. Prior research has indicated that lesion size plays a pivotal role in segmentation accuracy (Wang F. et al., 2019). To enhance the efficiency and expediency of brain tumor extraction, we expanded the region of interest (ROI) by 5 pixels to create a mask patch, thereby increasing the area of the segmentation region. Meanwhile, the external attention (Guo et al., 2023) was introduced into our segmentation subnetwork, which implicitly considers the relationship between different brain MRI feature maps and weights, and sums the different feature maps to realize the effective fusion of information, thus improving the segmentation performance.

Our classification network conducts feature extraction on the patch, including BM. Concurrently, handcrafted features are introduced to augment the comprehensiveness of the features, thereby enhancing the accuracy of EGFR prediction. This approach aligns, in part, with the findings by Nanni et al. (2017), which underscored the contribution of manual features in improving classification accuracy. Our model underwent a more detailed analysis based on both deep learning features and handcrafted features. The developed EGFR-Model generated AUCs of 0.832, 0.833, and 0.842 for predicting the EGFR mutation in the internal validation, external validation 1, and external validation 2 sets, respectively. This was much higher than previous works based on the primary lesion that obtained AUCs ranging from 0.575 to 0.762 (Tu et al., 2019; Mei et al., 2018; Digumarthy et al., 2019; Liu et al., 2016; Zhang et al., 2018; Gevaert et al., 2017; Yuan et al., 2017; Pinheiro et al., 2020). Our TKI-Model also outperformed the recent handcrafted-based radiomics study based on BM that generated AUCs ranging from 0.671 to 0.780 (Wang et al., 2021). The model’s effectiveness was further validated using a decision tree applied to both deep learning and handcrafted features. This dual-pronged approach showcased the model’s robust performance in predicting EGFR genotypes and treatment efficacy. The concurrent demonstration of efficacy on the internal validation set and two external test sets attests to the strong generalization ability of our model, as presented in Table 2, 3. This underscores its potential as a versatile tool for clinical decision-making in the context of personalized treatment for NSCLC patients with BM.

We identified the volume of peritumoral edema (VPE) as an independent clinical factor that is highly associated with the EGFR mutation status and response to EGFR-TKI. Integration of the VPE to the ETS can improve the system’s performance. The finding is consistent with previous histopathological reports that indicated that the peritumoral edema is causally linked to compressive ischemia, vascular shunting attributable to membranous microvascular parasitism, and secretory-excretory phenomena within tumor cells (Tamiya et al., 2001; Nakasu et al., 2005). Moreover, the cortical blood supply emerges as a critical factor influencing the development of peritumoral edema (Tamiya et al., 2001; Nakasu et al., 2005). This insight underscores the multifaceted nature of peritumoral edema and its relevance as a clinically significant factor in predicting EGFR mutation and response to EGFR-TKI. Our finding was supported by recent radiomics studies focusing on primary brain tumors that showed the peritumoral edema holds additional information associated with tumor diagnoses beyond the primary lesion (Kim et al., 2018; Prasanna et al., 2017; Joo et al., 2021), and the VPE and imaging-based radiomics can provide complementary information (Fan et al., 2023a).

First, the current study was retrospective, and the developed models therefore need to be further validated with prospective data. Second, the study only evaluated T1CE and T2W MRI, and the performance of the models may be potentially improved by incorporating more MRI sequences, e.g., diffusion-weighted imaging and fluid-attenuated inversion recovery MRI. Third, it is pivotal to recognize that the segmentation network used in this study operates at a patch level. For a more meticulous delineation of tumor boundaries, there exists a need for a segmentation approach that offers greater precision.Fourth, this study focused on predicting the presence of EGFR mutation, without differentiating specific subtypes such as exon 19 deletion or L858R. This may limit the model’s utility for precise therapeutic decision-making. Future work will explore subtype-level prediction for improved clinical relevance. Finally, this study only evaluated the EGFR gene mutation; other important genes that may also influence the effect of targeted therapy should be included in future studies.

5 Conclusion

In this study, we developed an automated EGFR-TKI system (ETS) to detect brain metastases and predict EGFR mutation status and therapy response.The system has been validated in both internal and external cohorts, demonstrating consistent performance. As a non-invasive method for detecting EGFR mutations, it holds potential to assist clinical decision-making and provide valuable support for non-small cell lung cancer (NSCLC) patients undergoing EGFR-TKI treatment.

Data availability statement

The datasets presented in this article are not readily available due to ethical restrictions involving patient privacy and hospital regulations. Requests to access the datasets should be directed to Wenyan Jiang eGlhb3lhODM5MjFAMTYzLmNvbQ==.

Author contributions

SY: Writing – original draft, Visualization, Conceptualization, Methodology. YF: Methodology, Validation, Conceptualization, Writing – original draft. ZY: Visualization, Conceptualization, Resources, Writing – review and editing. CY: Data curation, Methodology, Writing – review and editing, Writing – original draft. YS: Methodology, Visualization, Writing – review and editing. YL: Writing – review and editing, Supervision. ZW: Supervision, Resources, Writing – review and editing. BS: Conceptualization, Writing – review and editing, Data curation. WJ: Writing – review and editing, Conceptualization, Supervision, Investigation.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. The study was supported by the National Key Research and Development Program of China: BTIT (Grant 2022YFF1202803 and 2022YFF1202800), and Science and Technology Joint Program Fund Project of Liaoning (2023JH2/101700175).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Arbour, K. C., and Riely, G. J. (2019). Systemic therapy for locally advanced and metastatic non-small cell lung cancer A review. Jama-J Am. Med. Assoc. 322 (8), 764–774. doi:10.1001/jama.2019.11058

PubMed Abstract | CrossRef Full Text | Google Scholar

Boire, A., Brastianos, P. K., Garzia, L., and Valiente, M. (2020). Brain metastasis. Nat. Rev. Cancer 20 (1), 4–11. doi:10.1038/s41568-019-0220-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Chetan, M. R., and Gleeson, F. V. (2021). Radiomics in predicting treatment response in non-small-cell lung cancer: current status, challenges and future perspectives. Eur. Radiol. 31, 1049–1058. doi:10.1007/s00330-020-07141-9

PubMed Abstract | CrossRef Full Text | Google Scholar

de Cos, J. S., González, M. A. S., Montero, M. V., Calvo, M. C. P., Vicente, M. J. M., and Valle, M. H. (2009). Non-small cell lung cancer and silent brain metastasis. Lung Cancer 63 (1), 140–145. doi:10.1016/j.lungcan.2008.04.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Deng, K., Wang, L., Liu, Y., Li, X., Hou, Q., Cao, M., et al. (2022). A deep learning-based system for survival benefit prediction of tyrosine kinase inhibitors and immune checkpoint inhibitors in stage IV non-small cell lung cancer patients: a multicenter, prognostic study. EClinicalMedicine 51, 101541. doi:10.1016/j.eclinm.2022.101541

PubMed Abstract | CrossRef Full Text | Google Scholar

Digumarthy, S. R., Padole, A. M., Lo Gullo, R., Sequist, L. V., and Kalra, M. K. (2019). Can CT radiomic analysis in NSCLC predict histology and EGFR mutation status? Medicine 98 (1), e13963. doi:10.1097/md.0000000000013963

PubMed Abstract | CrossRef Full Text | Google Scholar

Eisenhauer, E. A., Therasse, P., Bogaerts, J., Schwartz, L., Sargent, D., Ford, R., et al. (2009). New response evaluation criteria in solid tumours: revised RECIST guideline (version 1.1). Eur. J. cancer 45 (2), 228–247. doi:10.1016/j.ejca.2008.10.026

PubMed Abstract | CrossRef Full Text | Google Scholar

Fan, Y., Zhao, Z. L., Wang, X. L., Ai, H., Yang, C., Luo, Y., et al. (2022). Radiomics for prediction of response to EGFR-TKI based on metastasis/brain parenchyma (M/BP)-interface. Radiol. Med. 127 (12), 1342–1354. doi:10.1007/s11547-022-01569-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Fan, Y., Wang, X., Yang, C., Chen, H., Wang, H., Wang, X., et al. (2023a). Brain-tumor interface-based MRI radiomics models to determine EGFR mutation, response to EGFR-TKI and T790M resistance mutation in non-small cell lung carcinoma brain metastasis. J. Magnetic Reson. Imaging 58, 1838–1847. doi:10.1002/jmri.28751

PubMed Abstract | CrossRef Full Text | Google Scholar

Fan, Y., Wang, X. T., Dong, Y., Cui, E., Wang, H., Sun, X., et al. (2023b). Multiregional radiomics of brain metastasis can predict response to EGFR-TKI in metastatic NSCLC. Eur. Radiol. 33, 7902–7912. doi:10.1007/s00330-023-09709-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Gevaert, O., Echegaray, S., Khuong, A., Hoang, C. D., Shrager, J. B., Jensen, K. C., et al. (2017). Predictive radiogenomics modeling of EGFR mutation status in lung cancer. Sci. Rep-Uk 7, 41674. doi:10.1038/srep41674

PubMed Abstract | CrossRef Full Text | Google Scholar

Guo, M. H., Liu, Z. N., Mu, T. J., and Hu, S. M. (2023). Beyond self-attention: external attention using two linear layers for visual tasks. Ieee Trans. Pattern Analysis Mach. Intell. 45 (5), 5436–5447. doi:10.1109/TPAMI.2022.3211006

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, W.-L., Chen, Y.-L., Yang, S.-C., Ho, C. L., Wei, F., Wong, D. T., et al. (2017). Liquid biopsy genotyping in lung cancer: ready for clinical utility? Oncotarget 8 (11), 18590–18608. doi:10.18632/oncotarget.14613

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, G., Liu, Z., Van Der Maaten, L., and Weinberger, K. Q. (2017). “Densely connected convolutional networks,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 4700–4708.

Google Scholar

Jégou, S., Drozdzal, M., Vazquez, D., Romero, A., and Bengio, Y. (2017). “The one hundred layers tiramisu: fully convolutional densenets for semantic segmentation,” in Proceedings of the IEEE conference on computer vision and pattern recognition workshops, 11–19.

Google Scholar

Joo, L., Park, J. E., Park, S. Y., Nam, S. J., Kim, Y. H., Kim, J. H., et al. (2021). Extensive peritumoral edema and brain-to-tumor interface MRI features enable prediction of brain invasion in meningioma: development and validation. Neuro-Oncology 23 (2), 324–333. doi:10.1093/neuonc/noaa190

PubMed Abstract | CrossRef Full Text | Google Scholar

Kinga, D., and Adam, J. B. (2015). “A method for stochastic optimization,”Int. Conf. Learn. Represent. (ICLR) 5 6. Available online at: https://arxiv.org/pdf/1412.6980

Google Scholar

Kim, Y., Cho, H.-h., Kim, S. T., Park, H., Nam, D., and Kong, D.-S. (2018). Radiomics features to distinguish glioblastoma from primary central nervous system lymphoma on multi-parametric MRI. Neuroradiology 60, 1297–1305. doi:10.1007/s00234-018-2091-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Lambin, P., Leijenaar, R. T. H., Deist, T. M., Peerlings, J., de Jong, E. E., van Timmeren, J., et al. (2017). Radiomics: the bridge between medical imaging and personalized medicine. Nat. Rev. Clin. Oncol. 14 (12), 749–762. doi:10.1038/nrclinonc.2017.141

PubMed Abstract | CrossRef Full Text | Google Scholar

Lin, M., Chen, Q., and Yan, S. (2025). Network in network. arXiv preprint arXiv:13124400 2013.

Google Scholar

Liu, Y., Kim, J., Balagurunathan, Y., Li, Q., Garcia, A. L., Stringfield, O., et al. (2016). Radiomic features are associated with EGFR mutation status in lung adenocarcinomas. Clin. Lung Cancer 17 (5), 441–448.e6. doi:10.1016/j.cllc.2016.02.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Lowery, F. J., and Yu, D. H. (2017). Brain metastasis: unique challenges and open opportunities. Bba-Rev Cancer 1867 (1), 49–57. doi:10.1016/j.bbcan.2016.12.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Lu, C. F., Liao, C. Y., Chao, H. S., Chiu, H. Y., Wang, T. W., Lee, Y., et al. (2023). A radiomics-based deep learning approach to predict progression free-survival after tyrosine kinase inhibitor therapy in non-small cell lung cancer. Cancer Imaging 23 (1), 9. doi:10.1186/s40644-023-00522-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Lynch, T. J., Bell, D. W., Sordella, R., Gurubhagavatula, S., Okimoto, R. A., Brannigan, B. W., et al. (2004). Activating mutations in the epidermal growth factor receptor underlying responsiveness of non-small-cell lung cancer to gefitinib. New Engl. J. Med. 350 (21), 2129–2139. doi:10.1056/nejmoa040938

PubMed Abstract | CrossRef Full Text | Google Scholar

Magadza, T., and Viriri, S. (2021). Deep learning for brain tumor segmentation: a survey of state-of-the-art. J. Imaging 7 (2), 19. doi:10.3390/jimaging7020019

PubMed Abstract | CrossRef Full Text | Google Scholar

Mei, D. D., Luo, Y., Wang, Y., and Gong, J. S. (2018). CT texture analysis of lung adenocarcinoma: can Radiomic features be surrogate biomarkers for EGFR mutation statuses. Cancer Imaging 18, 52. doi:10.1186/s40644-018-0184-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Nakasu, S., Fukami, T., Jito, J., and Matsuda, M. (2005). Microscopic anatomy of the brain–meningioma interface. Brain Tumor Pathol. 22, 53–57. doi:10.1007/s10014-005-0187-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Nanni, L., Ghidoni, S., and Brahnam, S. (2017). Handcrafted vs. non-handcrafted features for computer vision classification. Pattern Recogn. 71, 158–172. doi:10.1016/j.patcog.2017.05.025

CrossRef Full Text | Google Scholar

Niu, F.-Y., Zhou, Q., Yang, J.-J., Zhong, W. Z., Chen, Z. H., Deng, W., et al. (2016). Distribution and prognosis of uncommon metastases from non-small cell lung cancer. BMC cancer 16, 149–6. doi:10.1186/s12885-016-2169-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Pan, C. C., Schoppe, O., Parra-Damas, A., Cai, R., Todorov, M. I., Gondi, G., et al. (2019). Deep learning reveals cancer metastasis and therapeutic antibody targeting in the entire body. Cell 179 (7), 1661–1676.e19. doi:10.1016/j.cell.2019.11.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Pinheiro, G., Pereira, T., Dias, C., Freitas, C., Hespanhol, V., Costa, J. L., et al. (2020). Identifying relationships between imaging phenotypes and lung cancer-related mutation status: EGFR and KRAS. Sci. Rep-Uk 10 (1), 3625. doi:10.1038/s41598-020-60202-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Prasanna, P., Patel, J., Partovi, S., Madabhushi, A., and Tiwari, P. (2017). Radiomic features from the peritumoral brain parenchyma on treatment-naive multi-parametric MR imaging predict long versus short-term survival in glioblastoma multiforme: preliminary findings. Eur. Radiol. 27, 4188–4197. doi:10.1007/s00330-016-4637-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Rebuzzi, S. E., Alfieri, R., La Monica, S., Minari, R., Petronini, P. G., and Tiseo, M. (2020). Combination of EGFR-TKIs and chemotherapy in advanced EGFR mutated NSCLC: review of the literature and future perspectives. Crit. Rev. Oncology/Hematology 146, 102820. doi:10.1016/j.critrevonc.2019.102820

PubMed Abstract | CrossRef Full Text | Google Scholar

Schuchert, M. J., and Luketich, J. D. (2003). Solitary sites of metastatic disease in non-small cell lung cancer. Curr. Treat. options Oncol. 4, 65–79. doi:10.1007/s11864-003-0033-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Sculier, J.-P. (2013). Nonsmall cell lung cancer. Eur. Respir. Rev. 22 (127), 33–36. doi:10.1183/09059180.00007012

PubMed Abstract | CrossRef Full Text | Google Scholar

Selvaraju, R. R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., and Batra, D. (2017). “Grad-cam: visual explanations from deep networks via gradient-based localization,” in Proceedings of the IEEE international conference on computer vision, 618–626. doi:10.1109/iccv.2017.74

CrossRef Full Text | Google Scholar

Song, J., Wang, L., and Ng, N. N. (2021). An opportunity to reduce disparities in lung cancer screening. Jama Netw. Open 4 (2), e2129126. doi:10.1001/jamanetworkopen.2021.29126

PubMed Abstract | CrossRef Full Text | Google Scholar

Tamiya, T., Ono, Y., Matsumoto, K., and Ohmoto, T. (2001). Peritumoral brain edema in intracranial meningiomas: effects of radiological and histological factors. Neurosurgery 49 (5), 1046–1052. doi:10.1227/00006123-200111000-00003

PubMed Abstract | CrossRef Full Text | Google Scholar

Thompson, J. C., Yee, S. S., Troxel, A. B., Savitch, S. L., Fan, R., Balli, D., et al. (2016). Detection of therapeutically targetable driver and resistance mutations in lung cancer patients by next-generation sequencing of cell-free circulating tumor DNA. Clin. Cancer Res. 22 (23), 5772–5782. doi:10.1158/1078-0432.ccr-16-1231

PubMed Abstract | CrossRef Full Text | Google Scholar

Tu, W. T., Sun, G. Y., Fan, L., Wang, Y., Xia, Y., Guan, Y., et al. (2019). Radiomics signature: a potential and incremental predictor for EGFR mutation status in NSCLC patients, comparison with CT morphology. Lung Cancer 132, 28–35. doi:10.1016/j.lungcan.2019.03.025

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, S., Shi, J. Y., Ye, Z. X., Dong, D., Yu, D., Zhou, M., et al. (2019). Predicting EGFR mutation status in lung adenocarcinoma on computed tomography image using deep learning. Eur. Respir. J. 53 (3), 1800986. doi:10.1183/13993003.00986-2018

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, F., Jiang, R., Zheng, L., Meng, C., and Biswal, B. (2019). “3d u-net based brain tumor segmentation and survival days prediction,” in International MICCAI brainlesion workshop (Springer), 131–141.

Google Scholar

Wang, G. Y., Wang, B. M., Wang, Z., Li, W., Xiu, J., Liu, Z., et al. (2021). Radiomics signature of brain metastasis: prediction of EGFR mutation status. Eur. Radiol. 31 (7), 4538–4547. doi:10.1007/s00330-020-07614-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, S., Yu, H., Gan, Y. C., Wu, Z., Li, E., Li, X., et al. (2022). Mining whole-lung information by artificial intelligence for predicting EGFR genotype and targeted therapy response in lung cancer: a multicohort study. Lancet Digit. Health 4 (5), E309–E319. doi:10.1016/s2589-7500(22)00024-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Witten, D., James, G., Hastie, T., and Tibshirani, R. (2013). An introduction to statistical learning with applications in R. New York, NY: springer publication.

Google Scholar

Yang, J. J., Zhou, C. C., Huang, Y. S., Feng, J., Lu, S., Song, Y., et al. (2017). Icotinib versus whole-brain irradiation in patients with EGFR-mutant non-small-cell lung cancer and multiple brain metastases (BRAIN): a multicentre, phase 3, open-label, parallel, randomised controlled trial. Lancet Resp. Med. 5 (9), 707–716. doi:10.1016/s2213-2600(17)30262-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Yin, G., Wang, Z., Song, Y., Li, X., Chen, Y., Zhu, L., et al. (2021). Prediction of EGFR mutation status based on 18F-FDG PET/CT imaging using deep learning-based model in lung adenocarcinoma. Front. Oncol. 11, 709137. doi:10.3389/fonc.2021.709137

PubMed Abstract | CrossRef Full Text | Google Scholar

Yuan, M., Pu, X. H., Xu, X. Q., Zhang, Y. D., Zhong, Y., Li, H., et al. (2017). Lung adenocarcinoma: assessment of epidermal growth factor receptor mutation status based on extended models of diffusion-weighted image. J. Magnetic Reson. Imaging 46 (1), 281–289. doi:10.1002/jmri.25572

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, L. W., Chen, B. J., Liu, X., Song, J., Fang, M., Hu, C., et al. (2018). Quantitative biomarkers for prediction of epidermal growth factor receptor mutation in non-small cell lung cancer. Transl. Oncol. 11 (1), 94–101. doi:10.1016/j.tranon.2017.10.012

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: NSCLC, EGFR, TKI, brain metastasis, deep learning

Citation: You S, Fan Y, Yang Z, Yang C, Sun Y, Luo Y, Wang Z, Sun B and Jiang W (2025) Deep learning on brain metastasis for predicting EGFR genotype and EGFR-TKI therapy response in metastatic NSCLC: a multicenter study. Front. Bioeng. Biotechnol. 13:1637095. doi: 10.3389/fbioe.2025.1637095

Received: 28 May 2025; Accepted: 17 September 2025;
Published: 02 October 2025.

Edited by:

Andreas Kanavos, Ionian University, Greece

Reviewed by:

Venkatachalam Deepa Parvathi, Sri Ramachandra Institute of Higher Education and Research, India
Hesong Wang, Fourth Hospital of Hebei Medical University, China

Copyright © 2025 You, Fan, Yang, Yang, Sun, Luo, Wang, Sun and Jiang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zekun Wang, d2FuZ3prODdAMTYzLmNvbQ==; Bo Sun, c3VuYm95Y211QDE2My5jb20=; Wenyan Jiang, eGlhb3lhODM5MjFAMTYzLmNvbQ==

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.