Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Oncol., 19 June 2025

Sec. Thoracic Oncology

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1599882

This article is part of the Research TopicRadiomics and Artificial Intelligence in Oncology ImagingView all 7 articles

Machine learning-based radiomics for differentiating lung cancer subtypes in brain metastases using CE-T1WI

  • 1Division of Head & Neck Tumor Multimodality Treatment, Cancer Center, West China Hospital, Sichuan University, Chengdu, China
  • 2Department of Targeting Therapy & Immunology, Cancer Center, West China Hospital, Sichuan University, Chengdu, China

Objectives: The purpose of this research was to create and validate radiomic models based on machine learning that can effectively discriminate between primary non-small cell lung cancer (NSCLC) and small cell lung cancer (SCLC) in individuals with brain metastases (BMs) by utilizing high-dimensional radiomic characteristics derived from contrast-enhanced T1-weighted imaging (CE-T1WI).

Methods: A cohort of 260 individuals were chosen as participants. Among them, 173 individuals had NSCLC with 228 BMs, while 87 patients were diagnosed with SCLC with 142 BMs. Patients were allocated to a training dataset with a total of 259 BMs and an independent test dataset with a total of 111 BMs. Tumor tissues in axial CE-T1WI were manually outlined to delineate regions of interest (ROIs). Radiomic features were obtained from the ROIs using PyRadiomics, which were then chosen through a multistep selection process, including least absolute shrinkage and selection operator (LASSO) regression. Ten machine learning models, including Light Gradient Boosting Machine (LightGBM), RandomForest, and eXtreme Gradient Boosting (XGBoost), were built using selected features. The models’ performance was evaluated using receiver operating characteristic (ROC) analysis and area under the curve (AUC) calculations, complemented by additional metrics such as accuracy, specificity, sensitivity, positive predictive value (PPV), and negative predictive value (NPV).

Results: A total of 833 radiomic features were extracted from the ROIs. Through a multistep selection process, a refined subset of 15 optimal radiomic features was identified for model training. Ten classifier models were built based on features extracted from CE-T1WI. In the training dataset, the top-performing classifiers were the XGBoost, LightGBM, support vector machine (SVM) and random forest models, which achieved AUC of 0.963, 0.881, 0.876 and 0.855, respectively, with 5-fold cross-validation. Among the ten models tested, the LightGBM algorithm exhibited superior performance, with an AUC of 0.853 in the test cohort. This performance was superior to that of other models, such as RandomForest (AUC 0.843) and ExtraTrees (AUC 0.835). Radiomic features significantly contributed to the differentiation between NSCLC and SCLC.

Conclusion: Machine learning-based radiomics using CE-T1WI data is highly effective in distinguishing between NSCLC and SCLC in patients with BMs. The LightGBM model showed the best performance, suggesting that this approach shows promise as a supportive, non-invasive diagnostic tool, pending further validation in prospective clinical settings.

Introduction

Brain metastases (BMs) are the most common form of malignant neoplasms affecting the brain, representing a significant complication in cancer progression and management (1). Nearly 40% of cancer patients are estimated to develop BMs during the course of their illness, which markedly increases mortality rates and severely compromises quality of life (2). The growing incidence of BMs in recent years is largely attributable to advances in systemic anticancer therapies that prolong patient survival, the unique immunological microenvironment of the brain, and enhanced detection capabilities through magnetic resonance imaging (MRI) (3). Among all primary tumors, advanced lung cancer (LC)—comprising non-small cell lung cancer (NSCLC) and small cell lung cancer (SCLC)—accounts for approximately 50% of all BM cases (4). Notably, about 25% to 50% of NSCLC patients and nearly half of SCLC patients develop BMs during the disease course (5). In particular, NSCLC patients with specific genetic mutations or elevated programmed cell death ligand 1 (PD-L1) expression benefit significantly from targeted therapy and immunotherapy, which have enhanced overall survival in recent years (6, 7). Consequently, early and accurate differentiation between NSCLC and SCLC patients with BMs is essential for tailoring appropriate treatment strategies and improving clinical outcomes.

Biopsy, while an effective traditional diagnostic method, often requires invasive procedures and carries the risk of severe complications, especially when involving brain lesions. Moreover, brain lesion biopsies have limited ability to accurately differentiate between the two subtypes of LC (8). This challenge is particularly evident in patients who present with both a lung mass and brain metastases at initial diagnosis and are in poor general condition, making them unsuitable candidates for biopsy. In such cases, a non-invasive diagnostic approach is critically important. MRI, with its exceptional soft-tissue contrast and detailed anatomical information, is a cornerstone for evaluating BMs, and contrast-enhanced T1-weighted imaging (CE-T1WI) is crucial for detecting them (9). BMs from NSCLC and SCLC exhibit distinct, though occasionally overlapping, MRI features (10). NSCLC-related BMs typically present as solitary or fewer, larger lesions with well-defined margins, marked contrast enhancement, and significant peritumoral edema (11). Certain NSCLC subtypes, such as adenocarcinoma, may exhibit distinctive imaging features, though considerable variability exists. In contrast, SCLC-related BMs usually appear as multiple small, densely clustered lesions with intense enhancement and surrounding edema, often with poorly defined boundaries due to rapid tumor growth (12). However, overlapping features—particularly in enhancement patterns and edema—can make it challenging to reliably distinguish between NSCLC and SCLC metastases based solely on MRI.

Recently, advancements in radiomics and machine learning (ML) have shown promise in overcoming this limitation. Radiomics, which involves deriving quantitative features from imaging data, has gained significant attention for its ability to uncover latent patterns and provide valuable diagnostic and prognostic insights (13). ML algorithms can analyze these complex features, enabling the differentiation between SCLC and NSCLC by identifying subtle image characteristics that may not be detectable through conventional imaging analysis alone. Currently, various studies have aimed to develop non-invasive ML or deep learning (DL) techniques that utilize features extracted from MRI to assess the characteristics of BMs and determine their origins. Using a DL model, Sui L et al. reported an AUC ranging from 0.8024 to 0.8019 for subtype classification of lung cancer patients with BMs (14). Egashira M et al. found that the AUC of the ML model was 0.744 for NSCLC and 0.861 for SCLC (15). However, previous studies have encountered several limitations. First, these radiomic models generally exhibited low AUCs and demonstrated relatively poor diagnostic performance. Second, although radiomics-based classification models have shown efficacy in tumor type estimation, there has been limited focus on predicting the origins of BMs.

This study addresses a critical clinical need by investigating a non-invasive machine learning-based approach to accurately differentiate NSCLC from SCLC in patients with BMs. By enhancing diagnostic precision without the need for traditional biopsy, this method holds promise as a supportive tool in facilitating personalized treatment planning and improving patient outcomes.

Materials and methods

Patient and MRI protocol

This research involved a group of patients confirmed with BMs originating from the LC at our institution between January 2021 and December 2023. The research was carried out following the guidelines of the Declaration of Helsinki. The institutional research ethics committee reviewed the study and waived the requirement for formal approval, as it involved retrospective analysis of anonymized patient data. The requirements for participating in the study included the following: 1) histopathologically confirmed diagnosis of LC; 2) diagnosis of BMs by pathology or MRI; and 3) availability of pretreatment contrast-enhanced MRI scans of the head. Three factors were excluded: 1) pre-MRI treatment with antitumor therapy, such as radiotherapy, targeted therapy or immunotherapy; 2) BMs less than one centimeter in diameter; and 3) substantial artifacts in the images. A total of 260 patients were retrospectively identified, with 87 patients with SCLC and 173 patients with NSCLC.

All participants were examined using Siemens Trio scanners (3.0T) at the institution’s radiology department. Gadopentetate dimeglumine (0.1 mmol/kg), which was calculated based on weight, was injected intravenously into patients. The imaging parameters for CE-T1WI acquisition were as follows: TR/TE/TI = 1900/2.26/900 ms, flip angle = 9°, slice thickness = 1 mm, axial field of view (FOV) = 25.6 × 25.6 cm², and data matrix = 256 × 256. CE-T1WI were acquired in multidirectional mode within a 90–250 second interval.

Data preprocessing and image segmentation

The dataset was split 7:3 between training and test sets for model evaluation. BMs from the same patient were not split between the training and validation cohorts. Each patient’s lesions were exclusively assigned to either the training or the validation set to ensure independence between cohorts and to avoid any potential data leakage that could artificially inflate model performance. Medical imaging datasets often present heterogeneous voxel spacing due to differences in scanner models and acquisition protocols. To address this variability, a fixed-resolution resampling approach was employed in this study, standardizing all images to an isotropic voxel size of 1×1×1 mm. This ensured uniform spatial resolution across the dataset. Subsequently, z-score normalization (zero-mean, unit-variance) was applied to harmonize feature scaling. In addition, bias field correction using the N4ITK algorithm was performed to mitigate intensity inhomogeneities resulting from magnetic field non-uniformities.

In this research, the 3D Slicer open-source software application (Version 4.11), accessible at https://download.slicer.org, was utilized to assist in the three-dimensional manual segmentation of MRI. Two seasoned radiologists, who were unaware of the histopathological information, independently performed the image segmentation. Radiologist A, who has seven years of expertise in brain imaging, manually outlined the tumor tissues in the axial CE-T1WI. The radiologist meticulously outlined each image layer to precisely identify and segment the tumors following the software’s designated protocol. The delineation of regions of interests (ROIs) in each slice was based on the boundaries of the tumor tissue, encompassing necrotic areas and tumor blood vessels while excluding peritumoral edema. Radiologist B, who has a decade of expertise in neuroimaging, thoroughly examined all the ROIs delineated by Radiologist A to ensure precision and uniformity. Conflicts were settled through discussion until a consensus was reached. Since the segmentation was performed using a single set of delineations validated by expert consensus, inter- and intra-observer variability metrics—such as the Dice similarity coefficient and intraclass correlation coefficient (ICC)—were not evaluated. This approach aligns with common practices in radiomics studies when multi-reader segmentations are unavailable, aiming to reduce variability through stringent expert validation.

Radiomic feature extraction and selection

A total of 833 radiomic features were obtained from the ROIs on the CE-T1WI using PyRadiomics (Version 3.0.1) in Python 3.9 (16). These features can be classified into three groups: geometry, intensity, and texture, which describe the shape, statistical distribution, and spatial patterns of the tumor, respectively.

The Mann–Whitney U test and feature screening were carried out, and only characteristics with a p value below the 0.05 threshold were retained. Spearman’s rank correlation coefficient was used to analyze the relationships between characteristics with considerable reproducibility. Characteristics with a correlation coefficient exceeding 0.9 between any two features were selected by using a greedy recursive deletion method. The least absolute shrinkage and selection operator (LASSO) regression model was employed to build a signature by shrinking regression coefficients toward zero and setting many unrelated features to zero with reference to the regulation weight λ (17). The optimal value of λ was found through cross-validation with ten folds, resulting in a radiomic signature with retained features having nonzero coefficients.

Radiomic model building

The selected features were utilized as input variables for model development (18). A total of ten machine learning algorithms were evaluated to identify the most effective classifier. The dataset was partitioned into training and testing sets in a 7:3 ratio. Cross-validation was employed during the training phase to optimize hyperparameters and evaluate model stability. The entire training set was used to build the predictive models, while the test set served as an independent dataset to assess model performance. Hyperparameter tuning was conducted via grid search to determine the optimal configuration for each algorithm. The final classifier was selected based on its performance metrics, including accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve (AUC), as evaluated on the validation set. To further assess clinical utility, decision curve analysis (DCA) was performed, and the standardized net benefit (sNB)—ranging from 0 to 1—was calculated.

Statistical analysis

Python (Version 3.9) was used for the statistical analysis (19). The threshold for significance was established at a p value of less than 0.05. Group differences were measured by employing t tests or Mann–Whitney U tests for quantitative metrics and chi–square or Fisher’s exact tests for qualitative metrics. To compare the performance of different machine learning models, DeLong’s test was applied to assess the statistical significance of differences between correlated receiver operating characteristic (ROC) curves.

Results

Patient characteristics

Figure 1 illustrates the flowchart for the choice of BMs for LC patients. A cohort of 260 individuals who received at our institution from January 2021 to December 2023 were chosen as participants for this investigation in accordance with the predefined inclusion and exclusion criteria. Among them, 173 individuals had NSCLC with 228 BMs, while 87 patients were diagnosed with SCLC with 142 BMs. Patients were allocated to a training dataset with a total of 259 BMs and an independent test dataset with a total of 111 BMs. A pathologist reviewed the pathological data. Table 1 provides a comprehensive overview of the patient traits. Notably, no meaningful clinical differences were detected between the training and validation groups.

Figure 1
Flowchart illustrating selection criteria and division of patients with pathologically confirmed brain metastases and lung cancer from January 2021 to December 2023. Initially, 458 patients are identified. Exclusions include pre-MRI anti-tumor therapy (34 patients), brain metastases less than one centimeter (41 patients), and substantial image artifacts (13 patients), resulting in a final cohort of 370. This cohort is divided into a training cohort of 259 (160 NSCLC, 99 SCLC) and a validation cohort of 111 (68 NSCLC, 43 SCLC).

Figure 1. Flow diagram of the study enrollment BMs (n) with lung cancer. Of 458 patients with pathologically confirmed brain metastases from lung cancer (Jan 2021–Dec 2023), 88 were excluded. The final cohort (n = 370) was split into a training set (n = 259) and a validation set (n = 111), with subgroups of NSCLC and SCLC.

Table 1
www.frontiersin.org

Table 1. Characteristics of the included patients with BMs (n).

Feature selection and model construction

A total of 833 handcrafted features were extracted, with 538 features chosen through statistical tests. A total of 117 features were then selected based on correlation and a recursive deletion strategy. Fifteen optimal radiomic features were selected using the LASSO logistic model and regularization parameter λ with coefficients and mean standard errors shown in Figures 2, 3. A Rad signature was established by utilizing the non-zero coefficients features selected by LASSO regression and features coefficients are displayed in Figure 4. The multistep selection process resulted in a final subset of 15 radiomic features, which were subsequently used for model training, as shown in Table 2. These selected features demonstrated a strong discriminatory ability between the two lung cancer subtypes in the training group as well as the testing group. Ten machine learning algorithms were evaluated to determine the most effective classifier.

Figure 2
Plot showing Lasso regression coefficients on the y-axis against varying lambda values on the x-axis. Lines representing different predictors converge towards zero as lambda increases, depicted on a logarithmic scale.

Figure 2. LASSO coefficient profiles of features generated from 10-fold cross-validation. Each colored line represents the trajectory of a feature’s coefficient as the regularization parameter lambda changes. The vertical dashed line indicates the optimal lambda value (λ = 0.0339), at which the model achieves the best balance between sparsity and performance.

Figure 3
A plot showing the mean squared error (MSE) on the y-axis against Lambda on the x-axis, ranging from 10^-3 to 10^0. The graph includes blue vertical lines representing error bars and red points indicating the MSE values. The MSE decreases to a minimum around 10^-2 and then increases. A vertical dashed line marks Lambda at 0.0339.

Figure 3. Mean squared error (MSE) plot from 10-fold cross-validation used to select the optimal regularization parameter (lambda) in LASSO regression. The red dots represent the average MSE for each lambda, and the blue bars show ±1 standard error. The vertical dashed line indicates the optimal lambda value (λ = 0.0339), which minimizes the prediction error.

Figure 4
Horizontal bar chart showing various feature names on the vertical axis and their corresponding coefficients on the horizontal axis. The features are listed from wavelet_LLH_gldm_DependenceNonUniformity at the top to wavelet_LLL_firstorder_10Percentile at the bottom. Coefficients range from approximately -0.04 to 0.03, with the highest positive coefficient for wavelet_LHH_glcm_InverseVariance and the most negative for wavelet_LLH_gldm_DependenceNonUniformity. The chart legend indicates the bars represent coefficients.

Figure 4. Bar plot showing the coefficients of the 15 most predictive radiomic features selected by the LASSO model. Positive coefficients indicate a direct association with the outcome, while negative coefficients suggest an inverse relationship. Feature names reflect the original or wavelet-transformed image filters and statistical classes used in radiomic extraction.

Table 2
www.frontiersin.org

Table 2. Radiomics features obtained by LASSO regression analysis.

Performance of the models

Ten classifier models were built based on features extracted from MRI. The classifiers exhibited robust efficiency in relation to the AUC and accuracy. In the training dataset, the top-performing classifiers were the XGBoost, LightGBM, SVM and RandomForest models, which achieved AUCs of 0.963, 0.881, 0.876 and 0.855, respectively, with 5-fold cross-validation. In addition, these classifiers also demonstrated favorable performance in the independent test. In the testing dataset, LightGBM achieved the highest AUC of 0.853, significantly outperforming random forest (0.843), extra trees (0.835), and naive Bayes (0.827), with differences confirmed as statistically significant by DeLong’s test (p < 0.05). The ROC contours for these models were displayed in Figure 5, and a thorough contrast was presented in Table 3. DCA of LightGBM in the training and test cohorts is shown in Figure 6.

Figure 5
Three graphs show ROC curves for different models. Graph (a) displays the training model AUC for various algorithms, including LR, Naive Bayes, and others. Graph (b) shows the test model AUC with similar algorithms. Graph (c) focuses on the LightGBM model, highlighting a train AUC of 0.881 and test AUC of 0.853. Each graph plots sensitivity against 1-specificity, with a diagonal line indicating random performance.

Figure 5. Receiver operating characteristic (ROC) curves for ten machine learning models in the (a) training and (b) testing datasets. (c) ROC curves for the LightGBM model, which achieved the highest performance, with an AUC of 0.881 (95% CI: 0.841–0.922) in the training set and 0.853 (95% CI: 0.781–0.924) in the testing set.

Table 3
www.frontiersin.org

Table 3. Performance of ten machine learning models in the training and validation sets.

Figure 6
Two net benefit graphs comparing LightGBM decision curve analysis. Graph (a) shows training model results, while graph (b) displays test model results. Both plots depict net benefit against threshold probability, with lines for the model, treat all, and treat none options. A shaded area highlights the effective range of the model.

Figure 6. Decision curve analysis (DCA) of the LightGBM model in the training (a) and test cohorts (b). The x-axis represents the threshold probability, and the y-axis indicates the net benefit. The LightGBM model demonstrates a higher standardized net benefit (sNB) across a range of clinically relevant threshold probabilities, indicating superior clinical utility.

Discussion

This research intended to evaluate the viability and precision of employing ML techniques to differentiate between NSCLC patients and SCLC patients with BMs by utilizing high-dimensional radiomic features derived fromCE-T1WI. Ten distinct machine learning prediction models were constructed and evaluated. Notably, the LightGBM algorithm exhibited superior performance, attaining an AUC of 0.853 in the test cohort. Consequently, the findings suggest promising prospects for ML models in predicting tumor pathology types.

Lung cancer subtype classification is critical for determining treatment strategies, as NSCLC and SCLC have different treatment protocols and prognoses (20). SCLC is an aggressive disease with a lack of predictive biomarkers, and treatment primarily involves chemotherapy and radiotherapy (21). NSCLC can be treated with surgery combined with radiotherapy and chemotherapy. Additionally, targeted therapies based on driver genes and immunotherapy with immune checkpoint inhibitors have significantly improved NSCLC patient survival (22). Histopathological examination is the best way to diagnose brain tumors, but brain biopsies are associated with risks such as bleeding and damage to critical brain tissue (23). This is a concern for patients who are not suitable for biopsy or those with severe comorbidities. CE-T1WI is crucial for detecting BMs, but its ability to differentiate the origin of BMs is limited. Non-invasive diagnostic methods using radiomics and ML show promise for improving clinical decision-making and identifying candidates for targeted therapy and immunotherapy. Pathological transformation between SCLC and NSCLC during treatment may impact treatment strategies (24). A study by Thai et al. reviewed the evolving landscape of lung cancer subtypes and the corresponding imaging and treatment strategies, reinforcing the necessity of precise subtype differentiation for optimized therapeutic outcomes (20). Finally, Vogelbaum et al. provided guidelines for treating BMs, reinforcing the importance of precise diagnostic tools to guide treatment strategies (25). Some studies have demonstrated that the ML technique can offer additional dynamic diagnostic capabilities and valuable diagnostic insights that traditional imaging might miss (26, 27).

Previous research has extensively investigated the differentiation of BMs originating from NSCLC and SCLC utilizing a variety of methodologies, with a primary focus on DL and ML techniques. Sui et al. employed DL models on CE-T1WI, achieving AUCs of 0.8024 for NSCLC and 0.8019 for SCLC, highlighting the possibility of DL within this field (14). Similarly, Egashira et al. showed an AUC of 0.744 for NSCLC and 0.861 for SCLC using ML models, emphasizing the significance of radiomics features in predictive modeling (15). Notably, a related study indicated that a model using individual radiomic features derived from CE-T1WI with the Xgboost algorithm achieved optimal performance, with an AUC of 0.85 in inner testing and 0.80 in exterior testing (28). In another study, the proposed radiomics-based classifier achieved a sensitivity of 94.44% and specificity of 95.33%, outperforming DL-based classifiers and demonstrating that the radiomic approach is a valuable diagnostic tool for differentiating lung cancer subtypes in BMs with small datasets (29).

Our findings align with the aforementioned research by demonstrating the efficacy of machine learning-based radiomics on CE-T1WI in differentiating between NSCLC and SCLC in patients with BMs. CE-T1WI is capable of providing detailed insights into the pathological structure of cancers (30, 31). The LightGBM algorithm in our study achieved an AUC of 0.853 in the validation cohort, which is similar to or exceeds that of models reported in previous research (28, 32). This consistency underscores the robustness of ML approaches in handling high-dimensional radiomic features for accurate tumor classification. However, our study diverges from previous research in several key aspects. Unlike Sui et al. and Egashira et al., who reported lower AUCs, our study achieved higher predictive accuracy, potentially due to the incorporation of a larger and more diverse dataset, as well as the use of advanced feature selection techniques such as LASSO regression. Moreover, our focus on comprehensive ML model evaluation, including DCA, provided a further rigorous examination of model capability, highlighting the clinical utility of our approach.

In the present study, ten distinct ML models were developed and evaluated, and their performances were assessed using various metrics. Of these models, LightGBM exhibited the best performance, achieving an AUC of 0.853 in the validation cohort. This was followed by the random forest, extra trees, and naive Bayes models, which achieved AUCs of 0.843, 0.835, and 0.827, respectively. Table 3 and Figure 5 show the performances of the models, highlighting the superior diagnostic capabilities of the LightGBM model on both the training and test datasets. The efficiency of the LightGBM model in dealing with extensive data and multidimensional features contributes to its strong performance. The algorithm effectively prevents overfitting with techniques such as gradient-based one-sided sampling (GOSS) and exclusive feature bundling (EFB), ensuring high accuracy and robustness in predictions. Advanced feature selection methods such as LASSO regression focus on relevant features, improving the model’s ability to differentiate between SCLC and NSCLC. The observed discrepancy between the XGBoost model’s AUC of 0.963 in the training dataset and 0.816 in the validation dataset suggests potential overfitting. This significant drop in performance indicates that while XGBoost effectively captures patterns in the training data, its generalizability to unseen data is limited. Overfitting may arise due to the model’s complexity or insufficient regularization, causing it to memorize training data rather than learn generalizable features. This highlights that advanced techniques like GOSS and EFB used by LightGBM help mitigate overfitting, suggesting that similar strategies might improve XGBoost’s performance. Additionally, the absence of precision-recall curves in the study could obscure the model’s performance on imbalanced datasets, which may contribute to the observed AUC drop in the validation set. To address this, future work could incorporate regularization techniques, automated feature selection like LASSO regression, or cross-validation adjustments to enhance XGBoost’s robustness and generalizability.

We extracted 833 radiomic features from axial CE-T1WI using the PyRadiomics package (33) and identified 15 key features that effectively differentiated SCLC from NSCLC. These included intensity, texture, and geometric descriptors such as original_ngtdm_Contrast, wavelet_HHH_firstorder_Skewness, and wavelet_LLL_glcm_MaximumProbability. These features have biological relevance (34, 35). For example, original_ngtdm_Contrast reflects tumor heterogeneity, which is typically higher in SCLC due to its rapid proliferation, necrosis, and disorganized cellular architecture. The wavelet_HHH_firstorder_Skewness captures asymmetrical intensity distributions, potentially linked to irregular tumor growth in SCLC. In contrast, NSCLC often exhibits more structured, homogeneous patterns. Geometric features provide insight into tumor shape and invasive potential. SCLC frequently presents with irregular, infiltrative borders, while NSCLC tends to show more defined and organized structures. These differences in morphology and biological behavior are well represented by the selected radiomic features and enhance the model’s ability to distinguish between subtypes (36). These features are crucial for accurate classification and, when combined with advanced ML algorithms, improve the ability to differentiate between LC subtypes for better diagnosis and treatment planning (37).

This study has several limitations that should be acknowledged. First, the retrospective design and limited sample size may affect the generalizability of our findings. Future multicenter prospective studies incorporating deep learning techniques could enhance the model’s robustness and clinical applicability. Second, our analysis relied solely on CE-T1WI sequences; incorporating additional sequences (e.g., T2-FLAIR) might improve discriminatory performance. Third, although SHAP (SHapley Additive exPlanations) analysis was not performed in this study, we acknowledge its value in improving model transparency. SHAP could help visualize the impact of each feature on model predictions, offering deeper insights into the decision-making process (38, 39). We consider this an important direction for future work to enhance interpretability and clinical trust. Fourth, class imbalance in the training dataset and the absence of precision-recall curves may obscure the model’s true performance on minority classes. These curves are essential for evaluating imbalanced datasets and should be included in future work. Fifth, manual tumor segmentation may introduce inter-observer variability; automated or semi-automated methods could improve reproducibility. Finally, while our top-performing models demonstrated strong AUCs, clinical translation would further require validating the model across diverse, real-world datasets to ensure practical utility.

Conclusion

Our research illustrates that the application of machine learning-based radiomics, utilizing high-dimensional features extracted from CE-T1WI, is a highly effective method for distinguishing between primary NSCLC and SCLC in patients with BMs. The LightGBM model displayed superior capability with an AUC of 0.853 in the test cohort, suggesting considerable promise for enhancing diagnostic precision and facilitating the refinement of customized protocol. This machine learning approach shows promise as a supportive, non-invasive diagnostic tool, pending further validation in prospective clinical settings.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding author.

Ethics statement

The research was carried out following the guidelines of the Declaration of Helsinki. This study was conducted in accordance with institutional guidelines and regulations. As it involved only retrospective data or non-identifiable patient information, formal ethical approval was not required. However, the study was reviewed and conducted with oversight in accordance with the west China hospital’s policies. Patient informed consent was obtained during this research.

Author contributions

XX: Writing – original draft, Conceptualization, Data curation. WD: Writing – original draft, Formal Analysis, Software. QG: Resources, Writing – review & editing, Funding acquisition, Conceptualization, Supervision.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work received support from Sichuan Provincial Science and Technology Program Project (2024YFHZ0051) and Sichuan University’s Innovation Project from 0 to 1 (2022SCUH0032).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Ostrom QT, Price M, Neff C, Cioffi G, Waite KA, Kruchko C, et al. CBTRUS statistical report: primary brain and other central nervous system tumors diagnosed in the United States in 2015–2019. Neuro-Oncology. (2022) 24:v1–v95. doi: 10.1093/neuonc/noac202

PubMed Abstract | Crossref Full Text | Google Scholar

2. Sacks P and Rahman M Epidemiology of brain metastases. Neurosurg Clinics North America. (2020) 31:481–8. doi: 10.1016/j.nec.2020.06.001

PubMed Abstract | Crossref Full Text | Google Scholar

3. Aizer AA, Lamba N, Ahluwalia MS, Aldape K, Boire A, Brastianos PK, et al. Brain metastases: A Society for Neuro-Oncology (SNO) consensus review on current management and future directions. Neuro-Oncology. (2022) 24:1613–46. doi: 10.1093/neuonc/noac118

PubMed Abstract | Crossref Full Text | Google Scholar

4. Fecci PE, Champion CD, Hoj J, McKernan CM, Goodwin CR, Kirkpatrick JP, et al. The evolving modern management of brain metastasis. Clin Cancer Res. (2019) 25:6570–80. doi: 10.1158/1078-0432.Ccr-18-1624

PubMed Abstract | Crossref Full Text | Google Scholar

5. Ernani V and Stinchcombe TE. Management of brain metastases in non–small-cell lung cancer. J Oncol Pract. (2019) 15:563–70. doi: 10.1200/jop.19.00357

PubMed Abstract | Crossref Full Text | Google Scholar

6. Sperduto PW, De B, Li J, Carpenter D, Kirkpatrick J, Milligan M, et al. Graded prognostic assessment (GPA) for patients with lung cancer and brain metastases: initial report of the small cell lung cancer GPA and update of the non-small cell lung cancer GPA including the effect of programmed death ligand 1 and other prognostic factors. Int J Radiat OncologyBiologyPhysics. (2022) 114:60–74. doi: 10.1016/j.ijrobp.2022.03.020

PubMed Abstract | Crossref Full Text | Google Scholar

7. Sereno M, Hernandez de Córdoba I, Gutiérrez-Gutiérrez G, and Casado E. Brain metastases and lung cancer: molecular biology, natural history, prediction of response and efficacy of immunotherapy. Front Immunol. (2024) 14:1297988. doi: 10.3389/fimmu.2023.1297988

PubMed Abstract | Crossref Full Text | Google Scholar

8. Herbst RS, Morgensztern D, and Boshoff C. The biology and management of non-small cell lung cancer. Nature. (2018) 553:446–54. doi: 10.1038/nature25183

PubMed Abstract | Crossref Full Text | Google Scholar

9. Derks SHAE, van der Veldt AAM, and Smits M. Brain metastases: the role of clinical imaging. Br J Radiol. (2022) 95:20210944. doi: 10.1259/bjr.20210944

PubMed Abstract | Crossref Full Text | Google Scholar

10. Brockelsby C, Maconachie R, Navani N, Prendecki R, Randles V, King J, et al. Brain imaging in lung cancer staging: A real-world, multi-centre study of prevalence of brain metastases, impact on treatment and re-modelling of the NICE health economic analysis. Lung Cancer. (2023) 178:145–50. doi: 10.1016/j.lungcan.2023.02.012

PubMed Abstract | Crossref Full Text | Google Scholar

11. Mayer N, Boschetti L, Scarci M, Cioffi U, De Simone M, Schnider M, et al. Brain imaging in patients with non-small cell lung cancer—A systematic review. J Clin Med. (2025) 14:708. doi: 10.3390/jcm14030708

PubMed Abstract | Crossref Full Text | Google Scholar

12. Zhu D, Shao Y, Yang Z, Cheng A, Xi Q, Liang X, et al. Magnetic resonance imaging characteristics of brain metastases in small cell lung cancer. Cancer Med. (2023) 12:15199–206. doi: 10.1002/cam4.6206

PubMed Abstract | Crossref Full Text | Google Scholar

13. Zhang L, Wang Y, Peng Z, Weng Y, Fang Z, Xiao F, et al. The progress of multimodal imaging combination and subregion based radiomics research of cancers. Int J Biol Sci. (2022) 18:3458–69. doi: 10.7150/ijbs.71046

PubMed Abstract | Crossref Full Text | Google Scholar

14. Sui L, Chang S, Xue L, Wang J, Zhang Y, Yang K, et al. Deep learning based on enhanced MRI T1 imaging to differentiate small-cell and non-small-cell primary lung cancers in patients with brain metastases. Curr Med Imaging Form Curr Med Imaging Rev. (2023) 19:1541–8. doi: 10.2174/1573405619666230130124408

PubMed Abstract | Crossref Full Text | Google Scholar

15. Egashira M, Arimura H, Kobayashi K, Moriyama K, Kodama T, Tokuda T, et al. Magnetic resonance-based imaging biopsy with signatures including topological Betti number features for prediction of primary brain metastatic sites. Phys Eng Sci Med. (2023) 46:1411–26. doi: 10.1007/s13246-023-01308-6

PubMed Abstract | Crossref Full Text | Google Scholar

16. Thomas HMT, Wang HYC, Varghese AJ, Donovan EM, South CP, Saxby H, et al. Reproducibility in radiomics: A comparison of feature extraction methods and two independent datasets. Appl Sci. (2023) 13:s00701-024-05977-4. doi: 10.3390/app13127291

PubMed Abstract | Crossref Full Text | Google Scholar

17. Staartjes VE, Kernbach JM, Stumpo V, van Niftrik CHB, Serra C, Regli L, et al. Foundations of feature selection in clinical prediction modeling. Acta Neurochir Suppl. (2022) 134:51–7. doi: 10.1007/978-3-030-85292-4_7

PubMed Abstract | Crossref Full Text | Google Scholar

18. Zhang YP, Zhang XY, Cheng YT, Li B, Teng XZ, Zhang J, et al. Artificial intelligence-driven radiomics study in cancer: the role of feature engineering and modeling. Milit Med Res. (2023) 10:22. doi: 10.1186/s40779-023-00458-8

PubMed Abstract | Crossref Full Text | Google Scholar

19. Jin Y, Kodama T, and Arimura H. Applications 10. Radiomics researches for cancer treatment using python. Japan J Radiol Technol. (2024) 80:549–57. doi: 10.6009/jjrt.2024-2354

PubMed Abstract | Crossref Full Text | Google Scholar

20. Thai AA, Solomon BJ, Sequist LV, Gainor JF, and Heist RS. Lung cancer. Lancet. (2021) 398:535–54. doi: 10.1016/s0140-6736(21)00312-3

PubMed Abstract | Crossref Full Text | Google Scholar

21. Petty WJ and Paz-Ares L. Emerging strategies for the treatment of small cell lung cancer. JAMA Oncol. (2023) 9:419–29. doi: 10.1001/jamaoncol.2022.5631

PubMed Abstract | Crossref Full Text | Google Scholar

22. Ettinger DS, Wood DE, Aisner DL, Akerley W, Bauman JR, Bharat A, et al. NCCN guidelines® Insights: non–small cell lung cancer, version 2.2023. J Natl Compr Cancer Netw. (2023) 21:340–50. doi: 10.6004/jnccn.2023.0020

PubMed Abstract | Crossref Full Text | Google Scholar

23. Schaff LR and Mellinghoff IK. Glioblastoma and other primary brain Malignancies in adults. Jama. (2023) 329:574–87. doi: 10.1001/jama.2023.0023

PubMed Abstract | Crossref Full Text | Google Scholar

24. Giaccone G and He Y. Current knowledge of small cell lung cancer transformation from non-small cell lung cancer. Semin Cancer Biol. (2023) 94:1–10. doi: 10.1016/j.semcancer.2023.05.006

PubMed Abstract | Crossref Full Text | Google Scholar

25. Vogelbaum MA, Brown PD, Messersmith H, Brastianos PK, Burri S, Cahill D, et al. Treatment for brain metastases: ASCO-SNO-ASTRO guideline. J Clin Oncol. (2022) 40:492–516. doi: 10.1200/jco.21.02314

PubMed Abstract | Crossref Full Text | Google Scholar

26. Lohmann P, Franceschi E, Vollmuth P, Dhermain F, Weller M, Preusser M, et al. Radiomics in neuro-oncological clinical trials. Lancet Digit Health. (2022) 4:e841–9. doi: 10.1016/s2589-7500(22)00144-3

PubMed Abstract | Crossref Full Text | Google Scholar

27. Priya S, Liu Y, Ward C, Le NH, Soni N, Pillenahalli Maheshwarappa R, et al. Radiomic based machine learning performance for a three class problem in neuro-oncology: time to test the waters? Cancers. (2021) 13:2568. doi: 10.3390/cancers13112568

PubMed Abstract | Crossref Full Text | Google Scholar

28. Deng F, Liu Z, Fang W, Niu L, Chu X, Cheng Q, et al. MRI radiomics for brain metastasis sub-pathology classification from non-small cell lung cancer: a machine learning, multicenter study. Phys Eng Sci Med. (2023) 46:1309–20. doi: 10.1007/s13246-023-01300-0

PubMed Abstract | Crossref Full Text | Google Scholar

29. Tulum G. Novel radiomic features versus deep learning: differentiating brain metastases from pathological lung cancer types in small datasets. Br J Radiol. (2023) 96:20220841. doi: 10.1259/bjr.20220841

PubMed Abstract | Crossref Full Text | Google Scholar

30. Li Z, Mao Y, Li H, Yu G, Wan H, Li B, et al. Differentiating brain metastases from different pathological types of lung cancers using texture analysis of T1 postcontrast MR. Magnet Reson Med. (2016) 76:1410–9. doi: 10.1002/mrm.26029

PubMed Abstract | Crossref Full Text | Google Scholar

31. Amemiya S, Takao H, Kato S, Yamashita H, Sakamoto N, and Abe O. Feature-fusion improves MRI single-shot deep learning detection of small brain metastases. J Neuroimaging. (2021) 32:111–9. doi: 10.1111/jon.12916

PubMed Abstract | Crossref Full Text | Google Scholar

32. Zhang J, Jin J, Ai Y, Zhu K, Xiao C, Xie C, et al. Differentiating the pathological subtypes of primary lung cancer for patients with brain metastases based on radiomics features from brain CT images. Eur Radiol. (2020) 31:1022–8. doi: 10.1007/s00330-020-07183-z

PubMed Abstract | Crossref Full Text | Google Scholar

33. van Griethuysen JJM, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. (2017) 77:e104–7. doi: 10.1158/0008-5472.Can-17-0339

PubMed Abstract | Crossref Full Text | Google Scholar

34. Ranjbarzadeh R, Caputo A, Tirkolaee EB, Jafarzadeh Ghoushchi S, and Bendechache M. Brain tumor segmentation of MRI images: A comprehensive review on the application of artificial intelligence tools. Comput Biol Med. (2023) 152:106405. doi: 10.1016/j.compbiomed.2022.106405

PubMed Abstract | Crossref Full Text | Google Scholar

35. Wadhwa A, Bhardwaj A, and Singh VV. A review on brain tumor segmentation of MRI images. Magnet Reson Imaging. (2019) 61:247–59. doi: 10.1016/j.mri.2019.05.043

PubMed Abstract | Crossref Full Text | Google Scholar

36. Tabassum M, Suman AA, Suero Molina E, Pan E, Di Ieva A, and Liu S. Radiomics and machine learning in brain tumors and their habitat: A systematic review. Cancers. (2023) 15:3845. doi: 10.3390/cancers15153845

PubMed Abstract | Crossref Full Text | Google Scholar

37. Ehret F, Kaul D, Clusmann H, Delev D, and Kernbach JM. Machine learning-based radiomics in neuro-oncology. Acta Neurochir Suppl. (2022) 134:139–51. doi: 10.1007/978-3-030-85292-4_18

PubMed Abstract | Crossref Full Text | Google Scholar

38. Yang YR, Yuan Y, Han Z, and Liu G. Interpretability analysis for thermal sensation machine learning models: An exploration based on the SHAP approach. Indoor Air. (2022) 32:e12984. doi: 10.1111/ina.12984

PubMed Abstract | Crossref Full Text | Google Scholar

39. Karim MR, Islam T, Shajalal M, Beyan O, Lange C, Cochez M, et al. Explainable AI for bioinformatics: methods, tools and applications. Briefings Bioinf. (2023) 24:bbad236. doi: 10.1093/bib/bbad236

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: radiomics, magnetic resonance imaging, lung cancer, brain metastases, machine learning

Citation: Xia X, Du W and Gou Q (2025) Machine learning-based radiomics for differentiating lung cancer subtypes in brain metastases using CE-T1WI. Front. Oncol. 15:1599882. doi: 10.3389/fonc.2025.1599882

Received: 25 March 2025; Accepted: 05 June 2025;
Published: 19 June 2025.

Edited by:

Morgan Michalet, Institut du Cancer de Montpellier (ICM), France

Reviewed by:

Zhenyu Gong, Technical University of Munich, Germany
Suhrud Panchawagh, Mayo Clinic, United States

Copyright © 2025 Xia, Du and Gou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Qiheng Gou, Z291cWloZW5nNTEzQHdjaHNjdS5jbg==

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.