Predicting Survival Duration With MRI Radiomics of Brain Metastases From Non-small Cell Lung Cancer

Chen, Bihong T.; Jin, Taihao; Ye, Ningrong; Mambetsariev, Isa; Wang, Tao; Wong, Chi Wah; Chen, Zikuan; Rockne, Russell C.; Colen, Rivka R.; Holodny, Andrei I.; Sampath, Sagus; Salgia, Ravi

doi:10.3389/fonc.2021.621088

ORIGINAL RESEARCH article

Front. Oncol., 05 March 2021

Sec. Neuro-Oncology and Neurosurgical Oncology

Volume 11 - 2021 | https://doi.org/10.3389/fonc.2021.621088

This article is part of the Research TopicAdvances of Radiomics and Artificial Intelligence in the Management of Patients with Central Nervous System TumorsView all 12 articles

Predicting Survival Duration With MRI Radiomics of Brain Metastases From Non-small Cell Lung Cancer

Bihong T. Chen¹^*

Isa Mambetsariev²

Zikuan Chen¹

Russell C. Rockne⁵

Rivka R. Colen⁶

Andrei I. Holodny⁷

Sagus Sampath⁸

Ravi Salgia²

¹Department of Diagnostic Radiology, City of Hope National Medical Center, Duarte, CA, United States
²Department of Medical Oncology and Therapeutics Research, City of Hope Comprehensive Cancer Center and Beckman Research Institute, Duarte, CA, United States
³Departments of Interventional Radiology, Nanjing First Hospital, Nanjing Medical University, Nanjing, China
⁴Applied AI and Data Science, City of Hope National Medical Center, Duarte, CA, United States
⁵Division of Mathematical Oncology, City of Hope National Medical Center, Duarte, CA, United States
⁶Department of Radiology, Hillman Cancer Center, University of Pittsburgh Medical Center, Pittsburgh, PA, United States
⁷Department of Radiology, Memorial Sloan-Kettering Cancer Center, New York, NY, United States
⁸Department of Radiation Oncology, City of Hope National Medical Center, Duarte, CA, United States

Background: Brain metastases are associated with poor survival. Molecular genetic testing informs on targeted therapy and survival. The purpose of this study was to perform a MR imaging-based radiomic analysis of brain metastases from non-small cell lung cancer (NSCLC) to identify radiomic features that were important for predicting survival duration.

Methods: We retrospectively identified our study cohort via an institutional database search for patients with brain metastases from EGFR, ALK, and/or KRAS mutation-positive NSCLC. We segmented the brain metastatic tumors on the brain MR images, extracted radiomic features, constructed radiomic scores from significant radiomic features based on multivariate Cox regression analysis (p < 0.05), and built predictive models for survival duration.

Result: Of the 110 patients in the cohort (mean age 57.51 ± 12.32 years; range: 22–85 years, M:F = 37:73), 75, 26, and 15 had NSCLC with EGFR, ALK, and KRAS mutations, respectively. Predictive modeling of survival duration using both clinical and radiomic features yielded areas under the receiver operative characteristic curve of 0.977, 0.905, and 0.947 for the EGFR, ALK, and KRAS mutation-positive groups, respectively. Radiomic scores enabled the separation of each mutation-positive group into two subgroups with significantly different survival durations, i.e., shorter vs. longer duration when comparing to the median survival duration of the group.

Conclusion: Our data supports the use of radiomic scores, based on MR imaging of brain metastases from NSCLC, as non-invasive biomarkers for survival duration. Future research with a larger sample size and external cohorts is needed to validate our results.

Introduction

Lung cancer is the second most commonly diagnosed cancer (1). Non-small cell lung cancer (NSCLC) makes up ~85–90% of all lung cancer cases, and 30–50% of patients with NSCLC develop brain metastases (2, 3). Despite advancements in treatment, the survival duration of patients with lung cancer brain metastases remains short, with a poor median survival of 4–8 months after diagnosis (4). Molecular characteristics help to determine whether patients with cancer will respond to targeted therapies thus prolong survival (5). The molecular testing of lung cancer usually screens for genes encoding epidermal growth factor receptor (EGFR), anaplastic lymphoma kinase (ALK) and Kirsten rat sarcoma viral oncogene homolog (KRAS) (6–8). Molecularly targeted medications that can penetrate the central nervous system have improved outcomes in patients with brain metastases from lung cancers with actionable mutations. For example, tyrosine kinase inhibitors, such as erlotinib, have been effective in treating brain metastases in NSCLC patients with EGFR mutations (9). Therefore, the knowledge of molecular mutation status is essential for planning individualized treatments and for predicting survival.

Pathological tissue confirmation and molecular characterization of brain metastases through invasive biopsy or surgical resection are not always possible or practical. In contrast, neuroimaging methods, such as brain magnetic resonance imaging (MRI), are commonly used to non-invasively assess the entire brain to diagnose and to plan treatments for patients with brain metastases. In addition, brain metastases may present with various imaging features depending on the mutation status of the primary NSCLC (10). However, little is known about the relationship between the neuroimaging features of brain metastases and the NSCLC mutation subtypes for survival prediction. There is an unmet need to identify non-invasive neuroimaging biomarkers to predict survival duration for NSCLC patients with brain metastases who may have one of the three most common mutations, i.e., EGFR, ALK, or KRAS.

Radiomics is a computerized method to extract high-dimensional data from non-invasive standard-of-care medical images (11). It can provide a detailed characterization of tumors, in terms of tumor heterogeneity in relation to aggressiveness, which are not perceptible to the human eye (12, 13). In addition, linking imaging features with molecular and immune characteristics will contribute valuable information that is critical for cancer treatment and prognosis (14). Furthermore, the radiomic approach allows the non-invasive analysis of treatment response and prognosis at multiple time points, which is not feasible or practical using invasive biopsies. Radiomic scores, which incorporate information about key imaging features, have shown potential as biomarkers for predicting survival in patients with lung cancer and breast cancers (13, 15, 16). However, to the best of our knowledge, no published studies have used radiomic analysis of brain metastases to predict survival duration of patients with NSCLC according to their mutation status.

Here, we performed a MRI radiomic analysis of brain metastases for survival duration in patients with NSCLC. We hypothesize that MRI radiomics of brain metastases could be used to predict survival duration in patients with NSCLC. Our objective was to use radiomic features extracted from MR images of the brain metastases to build machine learning models for predicting survival durations of patients with NSCLC according to the specific mutation status of their primary NSCLC, i.e., EGFR, ALK, or KRAS. In addition, we constructed a radiomic score for each mutation-positive group to predict whether the patients survived longer or shorter than the median survival duration for each group.

Methods

Patient Selection and Imaging Acquisition

We retrospectively identified consecutive patients for this study by searching the Thoracic Oncology Registry for all lung cancer patients treated at City of Hope National Medical Center (Duarte, CA, USA) between 2009 and 2017. Eligibility criteria included the following: diagnosis of NSCLC; confirmation via genotype testing of an EGFR, ALK, and/or KRAS mutation in the primary NSCLC tumors; and having brain MRI scans performed to diagnose brain metastases but before initiating treatment for the brain metastases. Patient demographic data, survival information including date of death or last follow-up, and mutation status were abstracted from electronic medical records (Table 1). The Institutional Review Board at City of Hope National Medical Center approved this study and waived informed consent due to its retrospective nature. The study was conducted in accordance with the Declaration of Helsinki.

TABLE 1

Table 1. Demographic information for the study cohort.

Brain MR images including both the T1-weighted contrast-enhanced (T1C) and T2-weighted fluid-attenuated inversion recovery (FLAIR) sequences were retrieved from our Picture Archiving and Communication System. Brain MR scans were obtained from the same in-house 3T VERIO Siemens scanner (Siemens, Erlangen, Germany). T1C sequence was acquired with axial T1-weighted three-dimensional (3D) magnetization prepared rapid gradient echo (MPRAGE) imaging after intravenous administration of MultiHance^® (gadobenate dimeglumine) at 0.1 mmol/Kg. The FLAIR sequence for the peritumoral edema was acquired with routine imaging protocol. Detailed scanning parameters have been reported in our previous study (10).

Brain Tumor Segmentation

For image segmentation, we co-registered T1C and FLAIR images into the same geometric space under an affine transformation as established by the elastix toolbox (17). We segmented the T1C and FLAIR images for enhancing tumor and peritumoral edema, respectively. We performed image transformation and re-slicing with FSL scripts (https://fsl.fmrib.ox.ac.uk/fsl/fslwiki/).

Subsequently, we used ITK-SNAP, an open-source 3D image analysis software (www.itksnap.org) to contour the tumor boundaries of both T1C (for the enhancing tumor) and FLAIR (for the peritumoral edema) images in a semi-automated fashion on a slice-by-slice basis (18). This semi-automated method consisted of the two steps. First, the ITK-SNAP software automatically placed a region of interest box around the tumors. Second, the tumor boundaries were manually drawn slice-by-slice by our trained research personnel (NY, TW, and BC). One researcher (NY) was a neuroimaging researcher with 2 years of experience in tracing tumors for radiomic research. The other two researchers (TW and BC) were neuroradiologists with a combined 20 years of experience in neuroimaging. Discrepancy during tumor segmentation was resolved by consensus of the research group. We have reported the details of brain tumor segmentation previously (10). The imaging delineation (mask) of the two segmented phenotypes (enhancing tumor and peritumoral edema) were exported for radiomic analysis. Our analysis included up to 10 of the largest tumors from each patient, limited to tumors >5 mm in diameter because smaller tumors could not be reliably segmented for 3D analysis. Our dataset consisted of 452 lesions from 110 patients. Figure 1 presents the schema for brain tumor segmentation, radiomic feature extraction, and predictive modeling for survival duration.

FIGURE 1

Figure 1. Schema for brain tumor segmentation, radiomic feature extraction, and predictive modeling. (A) Representative tumor segmentation images from post-contrast T1-weighted (T1C) and T2-weighted fluid-attenuated inversion recovery (FLAIR) data. (B) Illustrations of radiomic features extracted from the brain tumor images, including texture, shape, and intensity. GLCM, Gray Level Co-occurrence Matrix; GLRLM, Gray Level Run Length Matrix; GLSZM, Gray Level Size Zone Matrix; NGTDM, Neighboring Gray Tone Difference Matrix. (C) Receiver operating characteristic (ROC) curves for the models predicting the survival durations of patients in each of the three mutation-positive groups (EGFR, ALK, and KRAS mutation-positive groups) and representative survival duration analysis.

To assess the consistency of image segmentation and the stability of radiomic features extracted for modeling, two researchers (NY and TW) independently performed tumor segmentation on the brain images from 20 randomly selected patients with the results being blinded to each other. We then used their segmentation results to test the inter-observer variability. In addition, one researcher (NY) repeated the brain tumor segmentation twice with 1 month apart for testing the intra-observer variability. We used the interclass correlation coefficient (ICC) test to assess the consistency of the radiomic features for both inter-observer and intra-observer variability. An inter-observer and intra-observer ICC > 0.80 was considered stable for tumor segmentation and radiomic feature extraction. The inter-observer ICC between the two researchers (NY and TW) for tumor segmentation achieved at 0.96 ± 0.04 in a range from 0.87 to 0.99 and for edema segmentation achieved at 0.95 ± 0.05 in a range from 0.80 to 0.99. The intra-observer ICC between the two measurements by the same researcher (NY) achieved 0.99 ± 0.006 (range from 0.97 to 1.00), and 0.99 ± 0.007 (range from 0.97 to 1.00) for segmentation of tumor and edema, respectively. The results indicated favorable inter- and intra-observer reproducibility and stability for tumor segmentation and subsequent radiomic feature extraction.

Radiomic Feature Extraction and Selection

The image preprocessing and radiomic feature extraction have been previously reported by our group (10). Briefly, we preprocessed each of the T1C or FLAIR images using a pipeline consisting of three steps: (i) skull-stripping using the Brain Extraction Tool (BET; http://fsl.fmrib.ox.ac.uk/fsl/fslwiki/BET) and Free Surfer (https://surfer.nmr.mgh.harvard.edu/); (ii) bias field correction using the routine N4ITKBiasFieldCorrection of nipype (https://nipype.readthedocs.io/en/0.12.0/users/index.html); (iii) image intensity normalization using an algorithm to standardize the intensity scales across MR images of the same contrast (19). Subsequently, we applied six different filters (Wavelet, Laplacian of Gaussian, Square, Square Root, Logarithm, or Exponential) to each of the preprocessed images, generating six derived images. Therefore, there were 12 derived images associated with each brain lesion, 6 for each of the two original (T1C and FLAIR) images. Finally, we performed radiomic feature extraction using an open-source python package PyRadiomics (https://pyradiomics.readthedocs.io/en/latest/) (20) on each derived image by applying a tumor or edema mask based on the modality of the original image, i.e., applying the tumor mask on the six images derived from the original T1C image, and applying the edema mask on the six images derived from the original FLAIR image. We extracted three types of radiomic features from each image including: (i) textural features, including Gray Level Co-occurrence Matrix (GLCM), Gray Level Run Length Matrix (GLRLM), Gray Level Size Zone Matrix (GLSZM), Neighboring Gray Tone Difference Matrix (NGTDM), and Gray Level Dependence Matrix (GLDM); (ii) shape-based features, including Volume, Surface Area, and Sphericity; and (iii) intensity-based features, such as Minimum, Maximum, and Mean. We extracted a total of 2,786 radiomic features from the 12 derived images for each lesion.

We performed feature selection in two steps. First, we selected 2,520 stable features from the total of 2,786 features based on the inter-observer ICC test with a threshold of 0.8 (corrected p < 0.05). Second, from those 2,520, the 50 most relevant features for model building were selected using a minimum redundancy and maximum relevance (MRMR) algorithm (21).

Building Predictive Models for Survival Duration

We dichotomized the patients in each mutation-positive group into two subgroups, i.e., shorter and longer survival subgroups, by assigning the patients with survival duration shorter than the median of the mutation-positive group to the shorter survival subgroup and the remaining patients to the longer survival subgroup. Subsequently we built independent machine learning models for each mutation-positive group to predict whether a patient survived longer than the median survival duration of the group. We evaluated the predictive performance of the machine learning models through leave one out cross validation (LOOCV) using four commonly used performance metrics including the area under the curve (AUC) of the receiver operating characteristic curves (ROC), the specificity, sensitivity and the prediction accuracy (22). We used an open source software scikit-learn for the machine learning model training and evaluation (23). Model training and prediction were tumor-based rather than patient-based, meaning each tumor was treated as an independent instance. The synthetic minority over-sampling technique (SMOTE) was used to improve learning using imbalanced datasets (24).

We built the predictive models using the 50 radiomic features alone or together with 18 additional features including demographic, clinical, and tumor information. Demographic information included gender (male, female), race (Caucasian, Asian, and other), and smoking history (yes, no). Clinical information included the presence or absence of extracranial metastases at 11 sites (bone, lymph, liver, lung, kidney, pancreas, breast, spinal cord, mediastinum, pericardium, and pleura). Tumor information included the number of tumors, the volume of the enhancing tumor core, and the edema/tumor volume ratio. The MRMR-based feature selection was performed in each round of LOOCV process, i.e., 50 most relevant radiomic features were selected using the MRMR algorithm using the training dataset (sample size equals N−1 for a N sample dataset) after leaving one sample out as the test dataset.

Selection of Machine Learning Algorithm

We used the gradient boosting classifier to build the machine learning models for predicting the survival durations of all three mutation groups. We selected this algorithm using a model selection process that has been previously described (10). Briefly, (a) we tested 30 classifiers implemented in Scikit-Learn software (23) and evaluated their performance using leave-one-out cross validation (LOOCV), (b) we subsequently ranked their performances according to the area under the curve (AUC) of the receiver operating characteristic curve (ROC) of each model, and (c) we selected the algorithm, Gradient boosting classifier, because it was the only one ranked among top three algorithms for modeling each of the three patient groups.

Table 2 presents the performance data for the top three algorithms for each of the three mutation groups. The performance metrics include accuracy, AUC, sensitivity, and specificity. A total of five classifiers (ada boosting, random forest, extra tree, bagging, and gradient boosting) ranked among the top three classifiers for modeling at least one of the three mutation groups. Gradient boosting classifier was the only classifier ranked among top three for all three mutation-positive groups, therefore, we used this algorithm to build the predictive models for all three mutation groups.

TABLE 2

Table 2. Performance metrics for the top three machine learning algorithms for predicting whether patients survive longer than the group median in the EGFR, ALK, and KRAS mutation-positive groups using radiomic features only.

Statistical Analysis and Radiomic Score

Demographic Data

We used analysis of variance (ANOVA) to determine the statistical significance of group differences in age. The normality of the distribution was tested using the Shapiro-Wilk test, and the homoscedasticity (the three groups have equal variance) was tested using Bartlett's test implemented in SciPy. We used Fisher's exact test to determine the statistical significance of group differences in the distributions of the categorical variables, including gender, race, smoking history, histology, and other metastatic sites. P < 0.05 were considered statistically significant. We used the statistical analysis package in the SciPy: open source scientific tools for Python library (https://www.scipy.org/) for the analysis described above.

Survival Analysis and Radiomic Score

We selected radiomic and clinical features that were important for patients' survival duration and subsequently computed radiomic score for each patient by sequentially performing univariate and multivariate Cox proportional hazard regression through the following steps (Figure 2): (A) Selecting 20 radiomic features potentially associated with patients' survival duration. In this step, we computed the feature importance of the 50 radiomic features used in the machine learning models using scikit learn software as described in the Section: Building Predictive Models for Survival Duration) and selected the top 20 radiomic features according to the feature importance value (Supplementary Table 1, Supplementary Material); (B) Performing univariate Cox regression using each of the selected top 20 radiomic features (one by one) and selected those with p < 0.05 in the analysis; (C) Performing multivariate Cox regression using the above selected radiomic features together with the 18 clinical feature (described in Section Building Predictive Models for Survival Duration) and chose those with p < 0.05 in the analysis as the final selected radiomic and clinical features; (D) Computing radiomic score for each patient in each mutation-positive group using a linear combination of the features selected in step C weighted by the coefficients determined by the multivariate Cox regression. We divided each mutation group into two subgroups according to the radiomic scores. In each mutation group, those patients with higher radiomic scores than the group median were assigned into the high radiomic score subgroup, and the rest of the patients in the mutation-positive group were assigned into the subgroup with lower radiomic score. We tested the statistical significance of the differences in the median survival durations between the two subgroups in each mutation-positive group using log rank test. We used log rank test to compare the median survival durations of patients in the EGFR, ALK, and KRAS mutation-positive groups. We used Lifelines, an open source software in Python (https://lifelines.readthedocs.io/en/latest/), for the survival analysis and presentation described in this section.

FIGURE 2

Figure 2. Major steps of Cox proportional hazard regression analysis for determining the effects of radiomic features on survival durations of patients for each of the three mutation-positive group (EGFR, ALK, and KRAS mutation-positive groups). The top 20 radiomic features (step A) were selected based on the feature importance as determined by the multivariate Cox regression during classifier training.

Results

Patient Information

The 110 patients in this study cohort [mean age: 57.51 ± 12.32 years (range: 22 to 85 years), M:F = 37:73] were separated into three groups according to mutation status of the three oncogenes EGFR, ALK, and KRAS. In this cohort, 75 patients had EGFR mutation, 21 had ALK mutation, and 15 had KRAS mutation in their primary NSCLC, respectively (Table 1). There was one patient who was positive for both ALK and EGFR mutations. A detailed summary of the demographic and clinical information for the cohort has been reported previously focusing on classification of mutation status from lung cancer brain metastases (10). Briefly, there were statistically significant group differences for the two categorical variables, race (p < 0.05) and smoking history (p < 0.001). There was a significant difference in the racial distribution of the EGFR and KRAS groups (p = 0.005), and the KRAS group had a higher percentage of smokers than the EGFR (p = 0.0002) and ALK (p = 0.0036) groups.

We also compared the demographic data between the mutation-positive group and the mutation-negative groups for each gene mutation, i.e., EGFR (+) vs. EGFR (–), ALK (+) vs. ALK (–), and KRAS (+) vs. KRAS (–). There was a significantly greater percentage of Asian patients in the EGFR (+) group than the EGFR (–) group (p = 0.042). The KRAS (+) group was significantly older than the KRAS (–) group (p = 0.002). There was a higher percentage of smokers in the KRAS (+) group than the KRAS (–) group (p = 0.0001).

The median survival durations for EGFR, ALK, and KRAS mutation-positive groups were 12.7, 20.9, and 17.0 months, respectively. The pair-wise log-rank test indicated that the median survival duration of the ALK mutation-positive group was significantly longer than that of the EGFR mutation-positive group (p = 0.011), whereas the difference between the ALK and KRAS mutation-positive groups was not significant (p > 0.05).

Prediction of Survival Duration

For all mutation-positive groups, the predictive performance of models built with radiomic features alone was better than that of models built with clinical data alone. Combining radiomic features and clinical data resulted in the most accurate prediction results (Figure 3). When using both clinical data and radiomic features in the modeling, the AUCs for predicting whether patients survived longer than the median survival duration of the group was 0.977, 0.905, and 0.947 for EGFR, ALK, and KRAS, respectively. Table 3 shows the accuracy, AUC, sensitivity, and specificity of the survival duration predictions for the patients in EGFR, ALK, or KRAS mutation-positive group, respectively. Both radiomic features and clinical data were combined to generate the performance data in Table 3. The accuracy was 94.9%, 84.1%, and 83.0% for the survival duration predictions for EGFR, ALK, and KRAS mutation-positive group, respectively. The sensitivity was 96.0, 88.0, and 83.0% for the survival duration predictions of EGFR, ALK, and KRAS mutation-positive group, respectively. The specificity was 94.0, 81.0, and 83.0% for the survival duration predictions of the patients in the EGFR, ALK, and KRAS mutation-positive groups, respectively.

FIGURE 3

Figure 3. Receiver operating characteristic (ROC) curves for models predicting whether patients with mutations of (A) EGFR, (B) ALK, and (C) KRAS survived longer than the median survival duration of the mutation-positive group. Curves are shown for models using clinical data only (green), radiomics features only (blue), and a combination of both clinical data and radiomic features (red). The areas under the receiver operating characteristic curves (AUCs) are indicated in each panel. KRAS mutation—positive group has too small a sample size to build the predictive model using clinical data alone.

TABLE 3

Table 3. Performance metrics for predicting whether patients survive longer than the group median in EGFR, ALK, and KRAS mutation-positive groups.

Cox Regression Analysis and Radiomic Score Calculation

Table 4 presents multivariate Cox regression results for the three mutation-positive groups. The demographic and radiomic features that were statistically significantly associated with survival duration (p < 0.05) are listed in Table 4. The features with positive coefficients were associated with shorter survival duration while those with negative coefficients were associated with longer survival duration. For the EGFR mutation-positive group, the radiomic score consisted of age {[Coefficient (coef): 2.76]}, Caucasian race (coef: 0.961), male sex (coef: 0.89), edema/tumor volume ratio (coef: −3.71), tumor number (coef: 1.78), an intensity feature exacted from edema area (coef: 1.37) and a textual feature exacted from tumor area (coef: −1.41). For the ALK mutation-positive group, the radiomic score consisted of the tumor number (coef: 3.05), and an intensity feature exacted from edema area (coef: −1.76). For the KRAS mutation-positive group, the radiomic score consisted of the edema/tumor volume ratio (coef: −16.8) and the tumor number (coef: −1.06). The feature names and the z score listed in Table 4 are graphically presented in Figure 4.

TABLE 4

Table 4. Demographic and radiomic features significantly associated with survival duration for each mutation-positive group as determined by multivariate Cox regression analysis.

FIGURE 4

Figure 4. Radiomic scores of survival durations for EGFR, ALK, and KRAS mutation-positive groups. Each column represents the components of the radiomic score for survival prediction for each mutation-positive group, as indicated on the left end. The color indicates the z-score for each feature, based on multivariate Cox regression analysis, according to the scale shown on the right end. The numerical value of each Wald statistics is indicated with imbedded texts. Features with positive values (red) are associated with shorter survival duration, while those with negative values are associated with longer survival duration. The corresponding Cox regression coefficients of the features are shown in Table 4. *Edema Median Intensity: Edema_Intensity_squareroot_Intensity_Median. **Tumor Texture: Tumor Texture log-sigma-3-mm-3D GLRLM LongRunHighGrayLevelEmphasis.

To assess the collective prognostic power of the features that were statistically significantly associated with the patients' survival, we constructed radiomic scores through a linear combination of the significant radiomic features listed in Table 4 which were weighted by the coefficients. We then divided each of the three patient groups into two subgroups based on the radiomic scores, i.e., assigning those patients with the radiomic scores lower than the median radiomic score of the group into a lower score subgroup and assigning the rest of the patients in the group into a higher score group. Figure 5 shows Kaplan–Meier plots of the two subgroups within each mutation-positive group based on radiomic scores. In each of the three mutation-positive groups, the subgroup with lower radiomic score had longer median survival duration than that of the subgroup with higher radiomic score.

FIGURE 5

Figure 5. Kaplan–Meier plots for each mutation-positive group (A–C) separated into two subgroups by their radiomic scores (higher than or lower than the median radiomic score for each mutation-positive group). The subgroup with radiomic score values higher than the median radiomic score of each mutation-positive group had significantly shorter survival duration than the subgroup with values lower than the median radiomic score. The radiomic scores were computed as the weighted average of the features shown in Table 4 (weighted by Cox regression coefficients).

Discussion

In this study, we built machine learning models to predict whether patients with EGFR, ALK, or KRAS mutation-positive primary NSCLC survived longer than the median survival duration for each specific mutation group. The final models of our study used 50 radiomic features together with 18 clinical features and achieved AUC of 0.977, 0.905, and 0.947 for the three mutation-positive groups, i.e., EGFR, ALK, and KRAS groups, respectively. Subsequently, we identified radiomic and clinical features significantly associated with survival duration for the patients in the three mutation-positive groups. Finally, we constructed radiomic scores using linear combinations of these features weighted with their coefficients in the multivariate regression. After dividing each of the three mutation groups into two subgroups according to radiomic scores, our study showed that the subgroup with lower radiomic scores had statistically significant longer median survival duration, indicating strong association between radiomic scores and the patients' survival duration.

The performance of our predictive models compared favorably to those of published predictive models based on the computed tomography (CT) images of primary lung cancer (25–28). Hosny et al. (29) used a 3D convolutional neural network (CNN) to study prognostic stratification in a multi-cohort radiomic study using the lung CT images of 1,194 patients with NSCLC. Their models predicted whether patients could survive longer than 2 years after treated either with radiotherapy or surgery, and achieved AUC of 0.70 and 0.71, respectively. It is challenging to compare our results, which were based on the MRI radiomics of brain metastases, to the results of the deep learning study which was based on lung CT images. Nevertheless, judging by AUC values alone, the performance of our predictive models was comparable to the work performed by deep learning networks (29).

Our predictive models achieved reasonable performance as compared to other studies using radiomic features from MR images of brain metastases (30–33). For example, Béresová et al. (33) demonstrated that using MR image-based textural radiomic analysis could distinguish brain metastases originating from lung cancer vs. breast cancer, achieving AUC of 0.70. In another study, Ortiz-Ramon et al. (32) used radiomic features extracted from MR images of brain metastases to predict whether the primary cancer being lung cancer or melanoma, achieving AUC of 0.95. Recently, Kniep et al., build predictive models using radiomic features from MR images to predict whether brain metastases originated from primary breast cancer, small cell lung cancer, NSCLC, gastrointestinal cancer, or melanoma. The AUC of their predictive models were between 0.64 for NSCLC and 0.82 for breast cancer (34).

Our approach using radiomic scores to predict survival duration of NSCLC patients with brain metastases was novel. We constructed radiomic scores with linear combinations of 2–7 significant radiomic features for each mutation-positive group, weighted by their Cox coefficients. Our radiomic score calculations indicated that different sets of radiomic features were significantly associated with survival duration in different mutation groups. For example, an edema feature, the Edema_Intensity_squareroot_Intensity_Median, was significantly associated with survival duration of patients in the EGFR and ALK mutation-positive groups, but not in the KRAS mutation-positive group. Edema Tumor Volume ratio on the other hand, was significantly associated with survival duration in the EGFR and KRAS mutation-positive groups, but not in ALK mutation-positive group. Our findings indicated the potential mutation-specific association between the radiomic features and survival durations. These results were not unexpected since our radiomic scores were consisted of features reflecting tumor heterogeneity such as edema intensity and tumor texture which have been known to affect survival (35).

Our findings regarding the relationship between peritumoral edema of brain metastases and the survival durations is generally in line with published literature (35, 36). Spanberger et al. studied the prognostic value of the extent of peritumoral brain edema in the patients operated for single brain metastasis. They reported a strong correlation between the extent of peritumoral edema on brain MRI scans and overall survival, i.e., patients with small peritumoral edema have longer survival than patients with large peritumoral edema (35). Our current study showed similar findings, i.e., lower edema/tumor ratio in our radiomic scores indicated longer survival duration. In addition, Berghoff et al. studied the role of tumor-infiltrating lymphocytes (TIFs) in the immune microenvironment of 116 specimen of brain metastases originating from different primary cancers including lung cancer, breast cancer, melanoma, and renal cell carcinoma. They found that dense TIFs correlated with peritumoral brain edema and the overall survival (36). A recent study by Nardone et al. (37) has also shown that the peritumoral edema and tumor volume of brain metastases were correlated with overall survival in patients with NSCLC undergoing radiosurgery. Taken together of the prior published reports and our current study, there is supporting evidence for incorporating brain tumor characteristics such as edema and tumor volume into survival analysis of patients with brain metastases.

The multivariate Cox regression in our study showed that age at diagnosis, Caucasian race, and male gender, were highly correlated with survival duration in the EGFR mutation-positive group. This result was consistent with literature indicating that age, active extracranial disease, and EGFR mutation are independently associated with survival (9). However, it is challenging to compare our analysis of survival duration with others because of differences in study cohorts, systemic disease status and treatment regimen for both the primary cancers and brain metastases. Nevertheless, it is reasonable to evaluate survival in terms of mutation status since molecular targeted therapy based on mutation information may improve prognosis and survival (38). For instance, the progression-free and overall survival of patients with EGFR and ALK mutations may be improved by treatment with tyrosine kinase inhibitors and ALK inhibitors specifically targeting these two mutations (38). Our study results provide the pilot data supporting radiomic scores as non-invasive biomarkers for assessment of survival duration in lung cancer brain metastases according to the mutation status. Nevertheless, independent validation is needed to substantiate our results.

There were several limitations to this study. First, this was a retrospective study focusing on NSCLC patients with brain metastases who were treated at a single institution over a 9-year interval. Our study design was inherently limited by various confounding variables, such as patient characteristics, imaging parameters, and treatment regimens for the primary NSCLC. Second, our sample size was modest, which might have limited our ability to build more robust predictive models with radiomic features. Third, the mutation status for this cohort was obtained from the primary NSCLC. Since most patients in our cohort did not undergo invasive biopsy or surgery of the brain metastases, the brain metastases could not be directly genotyped and we therefore assumed that brain metastases having the same mutation status as the primary NSCLC. We recognize this limitation with the understanding that mutation status in the primary NSCLC and distant metastases may not always be concordant (39). Lastly, this pilot study did not evaluate or control for all the potential confounding factors that might have contributed to survival duration, such as primary tumor status, systematic disease status, neurological deficits, and treatment regimen for the primary NSCLC and brain metastases. This was because we did not have the statistical power in this retrospective study with a modest sample size to control for all the highly variable confounding factors affecting survival. We recognize our approach for building predictive models with the potential uncontrolled variables may have affected our model performance. We will consider those confounding factors in our future large-scale multicenter research.

Despite these limitations, our study had strengths. First, to the best of our knowledge, our study was the first to use MRI radiomics of brain metastases and machine learning algorithms to predict the survival durations of patients with NSCLC, accounting for their mutation status. Second, we used a 3D slice-by-slice approach to segment brain metastases in their entirety, which we believe should have provided a more detailed characterization of tumor heterogeneity than what could be achieved using a 2D method (32). Third, we constructed radiomic scores using both radiomic features and clinical data, which improved predictive power compared to the scores constructed using either clinical data or radiomic data alone. Therefore, our study has merit as an exploratory, proof-of-concept pilot study from which to generate hypotheses for future large-scale, multicenter studies using imaging biomarkers to predict survival durations of patients with brain metastases from NSCLC and other primary cancers.

In summary, our study showed that a MRI radiomic approach capturing the critical radiological features of brain metastases in patients with primary NSCLC may be used to predict survival durations according to mutation status. Our data supports the concept of using radiomic scores as non-invasive imaging biomarkers for survival analysis, which is important for personalized treatment and prognostic assessment for cancer patients with metastatic disease.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors to qualified researchers, without undue reservation.

Ethics Statement

The studies involving human participants were reviewed and approved by the Institutional Review Board at City of Hope National Medical Center which approved this study and waived informed consent due to its retrospective nature. Written informed consent for participation was not required for this study in accordance with the institutional requirements.

Author Contributions

BC and RS designed and conducted the study. NY, TJ, IM, TW, BC, and RS analyzed the brain MR imaging data. NY, TW, and BC performed tumor segmentation and reviewed the segmented images for consistency. TJ developed the pipeline for predictive modeling and machine learning. TJ and NY performed statistical analysis. BC, TJ, NY, IM, CW, TW, ZC, RR, RC, AH, SS, and RS contributed to data interpretation. BC, TJ, NY, IM, and RS contributed to the manuscript writing process and BTC prepared the first draft of the entire manuscript. All authors approved the final manuscript.

Funding

This work was supported by the National Cancer Institute of the National Institutes of Health under Grants No. P30CA033572 and 1U54CA209978-01A1. TJ was partially supported by the Center for Cancer and Aging Pilot Project Award at City of Hope to BC. This work was also supported by the City of Hope Research Initiative Health Equity Pilot Grant (Awarded to BC and RS).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The authors thank Kerin K. Higa, Ph.D. for editing this manuscript.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2021.621088/full#supplementary-material

References

1. Fenske DC, Price GL, Hess LM, John WJ, Kim ES. Systematic review of brain metastases in patients with non-small-cell lung cancer in the United States, European Union, and Japan. Clin Lung Cancer. (2017) 18:607–14. doi: 10.1016/j.cllc.2017.04.011

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Hu C, Chang EL, Hassenbusch SJ 3rd, Allen PK, Woo SY, Mahajan A, et al. (2006). Nonsmall cell lung cancer presenting with synchronous solitary brain metastasis. Cancer. 106, 1998–2004. doi: 10.1002/cncr.21818

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Chen Z, Fillmore CM, Hammerman PS, Kim CF, Wong KK. Non-small-cell lung cancers: a heterogeneous set of diseases. Nat Rev Cancer. (2014) 14:535–46. doi: 10.1038/nrc3775

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Mak KS, Gainor JF, Niemierko A, Oh KS, Willers H, Choi NC, et al. Significance of targeted therapy and genetic alterations in EGFR, ALK, or KRAS on survival in patients with non-small cell lung cancer treated with radiotherapy for brain metastases. Neuro Oncol. (2015) 17:296–302. doi: 10.1093/neuonc/nou146

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Ellison G, Zhu G, Moulis A, Dearden S, Speake G, Mccormack R. EGFR mutation testing in lung cancer: a review of available methods and their use for analysis of tumour tissue and cytology samples. J Clin Pathol. (2013) 66:79–89. doi: 10.1136/jclinpath-2012-201194

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Riely GJ, Marks J, Pao W. KRAS mutations in non-small cell lung cancer. Proc Am Thorac Soc. (2009) 6:201–5. doi: 10.1513/pats.200809-107LC

CrossRef Full Text | Google Scholar

7. Kwak EL, Bang YJ, Camidge DR, Shaw AT, Solomon B, Maki RG, et al. Anaplastic lymphoma kinase inhibition in non-small-cell lung cancer. N Engl J Med. (2010) 363:1693–703. doi: 10.1056/NEJMoa1006448

CrossRef Full Text | Google Scholar

8. Siegelin MD, Borczuk AC. Epidermal growth factor receptor mutations in lung adenocarcinoma. Lab Invest. (2014) 94:129–37. doi: 10.1038/labinvest.2013.147

CrossRef Full Text | Google Scholar

9. Porta R, Sanchez-Torres JM, Paz-Ares L, Massuti B, Reguart N, Mayo C, et al. Brain metastases from lung cancer responding to erlotinib: the importance of EGFR mutation. Eur Respir J. (2011) 37:624–31. doi: 10.1183/09031936.00195609

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Chen BT, Jin T, Ye N, Mambetsariev I, Daniel E, Wang T, et al. Radiomic prediction of mutation status based on MR imaging of lung cancer brain metastases. Magn Reson Imaging. (2020) 69:49–56. doi: 10.1016/j.mri.2020.03.002

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Kuo MD, Jamshidi N. Behind the numbers: Decoding molecular phenotypes with radiogenomics–guiding principles and technical considerations. Radiology. (2014) 270:320–5. doi: 10.1148/radiol.13132195

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, Van Stiphout RG, Granton P, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer. (2012) 48:441–6. doi: 10.1016/j.ejca.2011.11.036

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Aerts HJ, Velazquez ER, Leijenaar RT, Parmar C, Grossmann P, Carvalho S, et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat Commun. (2014) 5:4006. doi: 10.1038/ncomms5006

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Sun R, Limkin EJ, Vakalopoulou M, Dercle L, Champiat S, Han SR, et al. A radiomics approach to assess tumour-infiltrating CD8 cells and response to anti-PD-1 or anti-PD-L1 immunotherapy: an imaging biomarker, retrospective multicohort study. Lancet Oncol. (2018) 19:1180–91. doi: 10.1016/S1470-2045(18)30413-3

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Shen C, Liu Z, Guan M, Song J, Lian Y, Wang S, et al. 2D and 3D CT radiomics features prognostic performance comparison in non-small cell lung cancer. Transl Oncol. (2017) 10:886–94. doi: 10.1016/j.tranon.2017.08.007

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Park H, Lim Y, Ko ES, Cho HH, Lee JE, Han BK, et al. Radiomics signature on magnetic resonance imaging: association with disease-free survival in patients with invasive breast cancer. Clin Cancer Res. (2018) 24:4705–14. doi: 10.1158/1078-0432.CCR-17-3783

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Klein S, Staring M, Murphy K, Viergever MA, Pluim JP. elastix: a toolbox for intensity-based medical image registration. IEEE Trans Med Imaging. (2010) 29:196–205. doi: 10.1109/TMI.2009.2035616

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Yushkevich PA, Piven J, Hazlett HC, Smith RG, Ho S, Gee JC, et al. User-guided 3D active contour segmentation of anatomical structures: significantly improved efficiency and reliability. Neuroimage. (2006) 31:1116–28. doi: 10.1016/j.neuroimage.2006.01.015

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Shinohara RT, Sweeney EM, Goldsmith J, Shiee N, Mateen FJ, Calabresi PA, et al. Statistical normalization techniques for magnetic resonance imaging. Neuroimage Clin. (2014) 6:9–19. doi: 10.1016/j.nicl.2014.08.008

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Van Griethuysen JJM, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. (2017) 77:e104–e107. doi: 10.1158/0008-5472.CAN-17-0339

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Peng H, Long F, Ding C. Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans Pattern Anal Mach Intell. (2005) 27:1226–38. doi: 10.1109/TPAMI.2005.159

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Arlot S, Celisse A. A survey of cross-validation procedures for model selection. Statist Surv. (2010) 4:40–79. doi: 10.1214/09-SS054

CrossRef Full Text | Google Scholar

23. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: machine learning in Python. J Mach Learn Res. (2011) 12:2825–30. doi: 10.1016/j.patcog.2011.04.006

CrossRef Full Text | Google Scholar

24. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res. (2002) 16:321–57. doi: 10.1613/jair.953

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Liu Y, Kim J, Balagurunathan Y, Li Q, Garcia AL, Stringfield O, et al. (2016). Radiomic features are associated with EGFR mutation status in lung adenocarcinomas. Clin Lung Cancer. 17:441–8.e446. doi: 10.1016/j.cllc.2016.02.001

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Rizzo S, Petrella F, Buscarino V, De Maria F, Raimondi S, Barberis M, et al. CT radiogenomic characterization of EGFR, K-RAS, and ALK mutations in non-small cell lung cancer. Eur Radiol. (2016) 26:32–42. doi: 10.1007/s00330-015-3814-0

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Gevaert O, Echegaray S, Khuong A, Hoang CD, Shrager JB, Jensen KC, et al. Predictive radiogenomics modeling of EGFR mutation status in lung cancer. Sci Rep. (2017) 7:41674. doi: 10.1038/srep41674

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Zhang L, Chen B, Liu X, Song J, Fang M, Hu C, et al. Quantitative biomarkers for prediction of epidermal growth factor receptor mutation in non-small cell lung cancer. Transl Oncol. (2018) 11:94–101. doi: 10.1016/j.tranon.2017.10.012

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Hosny A, Parmar C, Coroller TP, Grossmann P, Zeleznik R, Kumar A, et al. Deep learning for lung cancer prognostication: a retrospective multi-cohort radiomics study. PLoS Med. (2018) 15:e1002711. doi: 10.1371/journal.pmed.1002711

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Li Z, Mao Y, Li H, Yu G, Wan H, Li B. Differentiating brain metastases from different pathological types of lung cancers using texture analysis of T1 postcontrast MR. Magn Reson Med. (2016) 76:1410–9. doi: 10.1002/mrm.26029

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Nardone V, Tini P, Biondi M, Sebaste L, Vanzi E, De Otto G, et al. Prognostic value of MR imaging texture analysis in brain non-small cell lung cancer oligo-metastases undergoing stereotactic irradiation. Cureus. (2016) 8:e584. doi: 10.7759/cureus.584

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Ortiz-Ramon R, Larroza A, Arana E, Moratal D. A radiomics evaluation of 2D and 3D MRI texture features to classify brain metastases from lung cancer and melanoma. Conf Proc IEEE Eng Med Biol Soc. (2017) 2017:493–6. doi: 10.1109/EMBC.2017.8036869

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Béresová M, Larroza A, Arana E, Varga J, Balkay L, Moratal D. 2D and 3D texture analysis to differentiate brain metastases on MR images: proceed with caution. Magn Reson Mater Phys Biol Med. (2018) 31:285–94. doi: 10.1007/s10334-017-0653-9

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Kniep HC, Madesta F, Schneider T, Hanning U, Schonfeld MH, Schon G, et al. Radiomics of brain MRI: utility in prediction of metastatic tumor type. Radiology. (2019) 290:479–87. doi: 10.1148/radiol.2018180946

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Spanberger T, Berghoff AS, Dinhof C, Ilhan-Mutlu A, Magerle M, Hutterer M, et al. Extent of peritumoral brain edema correlates with prognosis, tumoral growth pattern, HIF1a expression and angiogenic activity in patients with single brain metastases. Clin Exp Metastasis. (2013) 30:357–68. doi: 10.1007/s10585-012-9542-9

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Berghoff AS, Fuchs E, Ricken G, Mlecnik B, Bindea G, Spanberger T, et al. Density of tumor-infiltrating lymphocytes correlates with extent of brain edema and overall survival time in patients with brain metastases. Oncoimmunology. (2016) 5:e1057388. doi: 10.1080/2162402X.2015.1057388

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Nardone V, Nanni S, Pastina P, Vinciguerra C, Cerase A, Correale P, et al. Role of perilesional edema and tumor volume in the prognosis of non-small cell lung cancer (NSCLC) undergoing radiosurgery (SRS) for brain metastases. Strahlenther Onkol. (2019) 195:734–44. doi: 10.1007/s00066-019-01475-0

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Di Lorenzo R, Ahluwalia MS. Targeted therapy of brain metastases: latest evidence and clinical implications. Ther Adv Med Oncol. (2017) 9:781–96. doi: 10.1177/1758834017736252

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Bozzetti C, Tiseo M, Lagrasta C, Nizzoli R, Guazzi A, Leonardi F, et al. Comparison between epidermal growth factor receptor (EGFR) gene expression in primary non-small cell lung cancer (NSCLC) and in fine-needle aspirates from distant metastatic sites. J Thorac Oncol. (2008) 3:18–22. doi: 10.1097/JTO.0b013e31815e8ba2

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: radiomics, machine learning, survival, lung cancer, brain metastases, brain MRI, artificial intelligence

Citation: Chen BT, Jin T, Ye N, Mambetsariev I, Wang T, Wong CW, Chen Z, Rockne RC, Colen RR, Holodny AI, Sampath S and Salgia R (2021) Predicting Survival Duration With MRI Radiomics of Brain Metastases From Non-small Cell Lung Cancer. Front. Oncol. 11:621088. doi: 10.3389/fonc.2021.621088

Received: 25 October 2020; Accepted: 08 February 2021;
Published: 05 March 2021.

Edited by:

Xuejun Li, Central South University, China

Reviewed by:

Bo Gao, Affiliated Hospital of Guizhou Medical University, China
Minghao Dong, Xidian University, China

Copyright © 2021 Chen, Jin, Ye, Mambetsariev, Wang, Wong, Chen, Rockne, Colen, Holodny, Sampath and Salgia. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Bihong T. Chen, QmVjaGVuQGNvaC5vcmc=; orcid.org/0000-0002-3127-0711

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.