Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Oncol., 05 January 2026

Sec. Neuro-Oncology and Neurosurgical Oncology

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1679634

This article is part of the Research TopicImaging to Guide Treatment in Brain TumorsView all 8 articles

A comprehensive evaluation of MRI-based radiogenomics and prognosis prediction in glioma

Mehdi Astaraki,,*Mehdi Astaraki1,2,3*Marta Lazzeroni,Marta Lazzeroni1,2Iuliana Toma-Dasu,Iuliana Toma-Dasu1,2
  • 1Division of Medical Radiation Physics, Department of Physics, Stockholm University, Stockholm, Sweden
  • 2Department of Oncology-Pathology, Karolinska Institutet, Stockholm, Sweden
  • 3Department of Clinical Science, Intervention and Technology, Karolinska Institutet, Stockholm, Sweden

Background and purpose: In gliomas, characterization of the molecular landscape plays a critical role in determining prognosis and guiding treatment regimens. Imaging biomarker models hold promise for non-invasive characterization of glioma subtypes. We, comprehensively, assessed the potential of magnetic resonance imaging (MRI) data for predicting the survival status and molecular subtypes of glioma.

Methods: We introduce a novel method for quantifying the spatial distribution of gliomas within brain anatomy. The method measures the volumetric ratio of 32 brain anatomical structures affected by the tumor. This novel feature set was combined with established radiomics to build models for predicting O6-methylguanine-DNA methyltransferase (MGMT) methylation status, isocitrate dehydrogenase (IDH) mutation status, and Overall Survival (OS) time of glioma patients. The performance of these models was evaluated on preoperative MRIs of 1788 subjects from four independent datasets, employing both cross-validation (CV) and cross-dataset evaluation strategies.

Results: The proposed feature set revealed no regular patterns in tumor locations across the brain. Integration of these features with radiomics improved model performance for the three tasks. The best performance, in terms of AUROC, respectively, for CV and cross-data tests were: 0.685 and 0.628 for MGMT status, 0.972 and 0.764 for IDH status, and 0.748 and 0.719 for OS time status.

Conclusions: Our experiments demonstrate the potential of imaging biomarkers for IDH prediction, highlighting the challenges associated with predicting MGMT and OS only from image data. This underscores the need for additional information beyond MRI, for accurate prediction of these prognostic markers.

1 Introduction

Gliomas are the most common malignant primary brain tumors in adults. Based on the 2021 World Health Organization (WHO) classification, three main types of gliomas are astrocytoma, oligodendroglioma, and glioblastoma (GBM) (1). Despite advances in treatment regimens, including surgical resection to the maximal safe extent followed by radiation and chemotherapy, the survival rate of GBM patients, the most aggressive type of glioma, remains poor, e.g., the median overall survival (OS) time is less than 15 months (2, 3). Treatment options for glioma subtypes are different (4). In this context, there are two important biomarkers representing different molecular attributes of glioma: (1) O6-methylguanine-DNA methyltransferase (MGMT) promoter methylation status and (2) isocitrate dehydrogenase (IDH) mutation status. Specifically, MGMT methylation status is accepted as one of the leading determinants of prognosis and a potential predictor of response to temozolomide therapy and, therefore, would influence neoadjuvant options before surgery (5). In general, patients with methylated MGMT promoters have longer survival rates than those with unmethylated MGMT promoters. In addition, IDH status indicates the tumor grade, including low-grade diffuse gliomas (IDH-mutated) and GBM (IDH-wildetype) with IDH-mutated tumors have better prognosis. Therefore, inaccurate classification of tumor subtypes negatively impacts patient’s treatment options and can result in limited access to neoadjuvant therapies (6). Stratifications of the glioma subtypes require invasive biopsy procedures with a risk of sampling bias due to the inherent heterogeneity of genetic features (7). Statistics from the cancer genome atlas pilot project show that only 35% of all sampled tissues via biopsy had sufficient tumoral contents for accurate glioma molecular profiling (8). Hence, developing accurate and reliable non-invasive alternative methods would be beneficial for the diagnosis and treatment of glioma patients.

Radiogenomics refers to the study of the correlation between imaging phenotypes and underlying gene expression of cancerous tissues. The ability of Magnetic Resonance Imaging (MRI) to, non-invasively, provide spatial heterogeneity within the tumors on one side, and advances in the field of radiogenomics on the other side, resulted in the development of a variety of imaging biomarkers models for glioma prognosis and subtype classification (9). Several studies showcased the effectiveness of radiomics and/or deep learning (DL) techniques in predicting IDH mutation, MGMT promoter methylation, and survival status of glioma patients (1012). In a recent comprehensive study, Lost et al. (4), reviewed 85 articles that met their requirements from 1323 preliminary selected articles. They summarized that the reported prediction power, in terms of area under a receiver operating characteristic curve (AUROC) for IDH mutation status lies around 0.88, and for MGMT methylation status, this metric lies around 0.77. However, there exist other studies showing the inability of imaging biomarkers for accurate molecular profiling and prognosis predictions of gliomas (13). As an example, the 2021 RSNA-MICCAI Brain Tumor Segmentation (BraTS) challenge (14) aggregated imaging data from five public repositories as well as a number of private institutions and aimed to, objectively, evaluate the performance of radiomics/DL models for the prediction of MGMT methylation status. Nevertheless, the top-performing models resulted in AUROC values below 0.6 on the test set (15).

In this study, we introduce a new feature set that is capable of capturing the spatial distribution of gliomas within the normal brain anatomy. Then, we conduct a comprehensive analysis to answer the question of whether MRI data can be used for accurate prediction of IDH mutation status, MGMT methylation status, and survival status of gliomas by employing several large-scale MRI datasets.

2 Materials and methods

2.1 Datasets

Four datasets were examined to investigate the potential of imaging biomarkers models for glioma molecular profiling and OS time prediction.

2.1.1 MICCAI-RSNA 2021 radiogenomics dataset

The Brain Tumor Segmentation (BraTS) community in collaboration with the Medical Image Computing and Computer Assisted Intervention Society (MICCAI) and the Radiological Society of North America (RSNA) held a challenge in 2021 targeting the evaluation of computational algorithms for two independent tasks: (i) glioma segmentation and (ii) glioma molecular characterization in terms of MGMT methylation status (14). The MGMT classification task contains multiparametric MRI (mpMRI) scans for 585 glioma patients categorized into two groups: methylated and unmethylated MGMT.

2.1.2 UPenn-GBM dataset

The University of Pennsylvania Glioblastoma Imaging, Genomics, and Radiomics (UPenn-GBM) dataset incorporates the currently largest publicly available preoperative dataset of 611 patients diagnosed with GBM. This collection includes advanced mpMRIs (structural, diffusion, and perfusion images), the molecular status of IDH mutation and MGMT methylation, as well as OS time (16).

2.1.3 UCSF Glioma MRI dataset

The University of California San Francisco Preoperative Diffuse Glioma MRI (UCSF-PDGM) dataset contains 501 patients diagnosed with diffuse gliomas. The preoperative scans were acquired with 3-T MRI machines following standard protocols for brain tumor imaging and included structural, diffusion, and perfusion volumes. The dataset also includes IDH mutation and MGMT promotor methylation status for WHO grade 3 and 4 gliomas as well as OS time (17).

2.1.4 LUMIERE GBM dataset

LUMIERE is an MRI dataset of 91 GBM patients, which includes both preoperative and selected follow-up studies according to the response assessment of neuro-oncology criteria (RANO). The provided structural mpMRIs were accompanied by supplementary clinical information including IDH mutation and MGMT methylation status, along with OS time (18).

The imaging data for each subject in all the studied datasets incorporate structural mpMRI sequences, including T1-weighted (T1), T1-weighted post-contrast (T1CE), T2-weighted (T2), and T2 Fluid Attenuated Inversion Recovery (T2-FLAIR). The UPenn and UCSF datasets contain additional MRI sequences, such as diffusion and perfusion derivatives. Therefore, to avoid inconsistency in the analysis steps, we employed only the structural MRIs. In addition to the imaging data, the studied datasets already consist of segmentation masks of Gross Tumor Volumes (GTVs) which include enhancing tumor core, necrotic/cystic core, and peritumoral edematous/infiltrated tissue. A summary of the studied datasets is presented in Table 1.

Table 1
www.frontiersin.org

Table 1. An overview of the studied datasets. The number of available subjects for each dataset, task, and target label is presented inside the parenthesis.

2.2 Data preparation and preprocessing

The provided MICCAI-RSNA, UPenn, and UCSF datasets were already preprocessed, i.e., coregistered, resampled, and aligned to the SRI24 atlas (19), as well as skull-stripped. No further preprocessing steps were applied to these images. Accordingly, the same protocols were applied to the preoperative scans in the LUMIERE dataset. Specifically, the highest resolution scan among MRI sequences of each subject was determined and used as a reference to resample other sequences in order to bring the patient’s MRI sequences into a unique space. Then, the T1 sequence was employed as the moving image in registration to the SRI24 atlas. The obtained deformation field was utilized to warp the remaining sequences.

2.2.1 Definition of class labels

The following strategies were applied to the provided clinical information to define the target labels for model development. For the task of OS time prediction, the median survival time among each dataset was chosen as a threshold to categorize the OS status into a binary class (short-term vs. long-term survivors). The MGMT status prediction is defined as a binary task (methylated vs. unmethylated). Finally, the IDH status prediction was determined as a binary problem to classify IDH-wildtype gliomas from the rest.

2.3 Spatial distribution pattern

In this section, we introduce a new feature set, spatial distribution pattern, which aims at quantifying the anatomical distribution of gliomas in brain structures by segmenting the MRI data into constituent sub-structures.

Parcellation of brain anatomy in MRIs is a fundamental technique in neuroscience communities to study brain organization and functionality (20). However, the presence of major anomalies such as brain tumors changes the appearance of healthy structures and deforms the surrounding tissues. To delineate the anatomical regions of the brain despite the presence of gliomas, we propose the following strategy.

First, we utilized the SynthSeg model to segment the brain structures in T1 images into 32 constituent anatomical regions (maskSynth). Supplementary Table 1 shows the names of anatomical structures with corresponding class labels. SynthSeg is an advanced out-of-the-box DL model developed for brain segmentation in MRIs (21, 22). This model is trained on synthetic image data, which were randomly sampled from a generative model that was conditioned on the segmentation masks and exposed to a different combination of contrast, resolution, morphology, artifacts, and noise. Specifically, the generation process begins with a 3D anatomical label map, which is first deformed using a combination of random affine and non-linear transformations to simulate morphological variability. To generate the image intensities, a Gaussian Mixture Model (GMM) conditioned on the deformed labels is sampled at every voxel. The means and variances of the GMM are drawn from uniform distributions at each training step, fully randomizing the contrast and noise levels. The resulting image is further corrupted by a simulated bias field and a random Gamma transform to augment intensity distributions. Finally, to ensure robustness against resolution differences, the model simulates slice thickness via Gaussian blurring and slice spacing via downsampling, before resampling the image to the target resolution. Therefore, the model can robustly segment brain structures in real MRIs of any contrast and resolution. While this model can accurately segment anatomical subregions even in the presence of small lesions or small-field deformations, it cannot robustly determine the anatomical structures centered/around the gliomas. Therefore, the main source of failure in the segmentations resulting from SythSeg is concentrated on the glioma regions. We, temporarily, masked out these tumoral regions from the obtained segmentations by using the Gross Tumor Volume (GTV) masks (maskSynth-NoTumor).

In the second step, we adopted an atlas-based registration strategy to fill the masked-out regions with meaningful labels. To this end, we executed the SynthSeg model on the SRI24 atlas and segmented it into 32 sub-anatomies (atlasSynth) since all the studied images were already registered to the SRI24 space. Then, we partially aligned the obtained parcellated atlas into patient anatomy by rigidly registering the atlasSynth into maskSynth for each subject. Similar to the first step, the resulting registered atlas was then multiplied by the GTVs to mask out the tumoral regions (atlasSynth-NoTumor). A deformable registration step was executed to register the maskSynth-NoTumor into atlasSynth-NoTumor and correct their misalignments. This step ensures that these two masks are maximally aligned, excluding the tumoral regions. Finally, to determine the anatomical regions within the masked-out areas, the inverse of obtained deformation fields was adopted to warp the atlasSynth. This process registers the parcellated atlas into patient anatomy despite the presence of gliomas and mass effects. Therefore, the final registered segmentation masks (maskFinal) incorporate 32 constituent anatomical structures covering healthy anatomical regions as well as those occupied by the gliomas (see Figure 1; Supplementary Figure 1).

Figure 1
MRI brain scan series showing different stages and sections labeled A to I. Panel A is a normal brain. Panel B shows a darker region indicating a potential abnormality. Panel C highlights a bright, circular anomaly. Panels D to I illustrate colored illustrations of the brain with variations in shading and highlighted areas, depicting changes or treatments over time, with panel E having red arrows pointing to specific regions.

Figure 1. A step-by-step illustration of brain tissue segmentation in the presence of glioma. (A) central axial slice of SRI24 atlas in T1 sequence; (B) an example of glioma appeared in T1 sequence; (C) corresponding T1CE sequence of the same subject depicts tumoral subregions with better contrast, note that all the analyses were done on T1 images; (D) the result of parcellated atlas with SynthSeg model (atlasSynth); (E) the result of SynthSeg model on tumoral data (maskSynth) which was failed on/around glioma regions, red arrows highlighting over-segmentation of CSF, and wrongly labeled right cerebral cortex and right cerebral white matter regions; (F) masking out the glioma regions (maskSynth-NoTumor); (G) the result of rigidly registering the parcellated atlas into patient space; (H) masking out the glioma regions from the partially aligned atlas allows running a deformable registration between atlasSynth-NoTumor and maskSynth-NoTumor; (I) warping the atlasSynth with the inverse of obtained deformation fields leads to correct labels within the glioma regions (maskFinal).

To quantify how gliomas are distributed in the brain and form a quantitative feature map, we measure the volume of each of the 32 sub-anatomies. Then, by determining the overlapping areas between the GTV masks and the computed maskFinal, we calculated the ratio of the sub-anatomical volumes affected by the gliomas. In other words, the proposed feature map contains 32 values that vary in the range of 0 to 1, representing the ratio of anatomical structures that are affected by gliomas. As a result, with the proposed spatial distribution pattern, the MRI scans of each patient are transformed into a feature vector of length 32. The underlying reason to quantify the anatomical topographic distribution of gliomas is the fact that brain tumors appear in different sizes and shapes at different anatomical locations, and such heterogeneity might affect prognosis and treatment options (23, 24). We hypothesized that the proposed spatial distribution pattern would depict how the invasion of gliomas into different sub-anatomies is associated with the underlying genetic attributes and prognostic factors.

2.4 Radiomics descriptors

We adopted the radiomic features to quantify the intensity, morphology, and textural attributes of the gliomas by extracting the reproducible features according to the image biomarker standardization initiative on the employed MRI sequences (25, 26). The radiomics pool includes 14 shape-based, 18 first-order statistics, and 70 second-order statistics. Given that morphological (shape-based) features are invariant across imaging sequences, these features were exclusively extracted from the T1ce images. This approach yielded a total of 366 3D radiomic descriptors. It should be mentioned that we followed the guidelines for radiomics studies recommended by CLEAR (27).

2.5 Model training, evaluation and validation

A standard machine learning (ML) pipeline was developed to predict the desired class labels for each of the three tasks (MGMT, IDH, OS) based on the extracted handcrafted features. This pipeline includes the following steps: training and validating the performance of learning algorithms with repeated cross-validation (CV) resampling technique, feature selection, model ensembling, and cross-data evaluation. Note that, in the rest of this paper, the ‘combined’ feature set refers to the combination of the proposed Spatial distribution pattern and radiomics sets.

The first step involves training a learning algorithm on each of the datasets and for each of the tasks independently. In this study, two learning algorithms, including extreme gradient boosting (XGB), and random forest (RF) were examined as they are recognized as robust and powerful classifiers (2830). The hyperparameters of these models were tuned independently by running a grid search strategy (31). The prediction power of these models was ranked and the parameters of the top-performing models were stored to instantiate the final learning algorithms and further model ensembling. To reduce the risk of overfitting and removing the irrelevant or partially relevant features, forward feature selection (FFS) was embedded in the training procedure of each cross-validation iteration. Subsets of the most prognostic features selected from the training folds were validated on the validation sets. The described procedure was executed by repeating the 5-fold CV resampling strategy 10 times, with different random seeds for data splitting. Accordingly, the reported metrics are averaged over 250 observed values (10repetition × 5folds × 5models). It is worth mentioning that, to rescale the feature values, min-max normalization was applied to the training subsets and inferred into the validation and test sets. Last but not least, to address class imbalance, the SVM-SMOTE technique was integrated into the training phase. This variant was specifically selected for its superior robustness in high-dimensional spaces compared to standard KNN-based interpolation (32). It is worth highlighting that to prevent data leakage, all feature engineering steps including normalization, class balancing, and feature selection were strictly fitted on training fold data. The resulting transformation parameters were then applied directly to the validation folds and the testing dataset without any further optimization.

To further assess the generalization power of the developed models, we conducted cross-data evaluations. In other words, the models that were developed and validated on each dataset internally were, then, tested on other cohorts as external sets, given the availability of the same class labels. It should be noted that the whole pipeline was tested, independently, on the introduced spatial distribution pattern, radiomic set, and their combinations.

In all the experiments, the performance of the models was quantified by employing the following metrics: AUROC, accuracy, sensitivity, and specificity. Finally, the pairwise AUROC comparison method proposed by Delong et al. (33) was employed to assess the statistical significance between the observed values of different feature sets. All the implementations were done in relevant python libraries such as numpy, scikit-learn, and ANTsPy.

3 Results

In this section, we first present the quantified statistics resulting from the proposed spatial distribution pattern. Then, the prediction power of the employed feature set for each task will be presented in terms of CV and cross-dataset analyses.

3.1 Statistics of spatial distribution pattern

Applying the proposed spatial distribution pattern resulted in 32 quantitative values representing the percentage of volumes of anatomical structures affected by gliomas. Table 2 shows the average values of these ratios over the studied datasets. In addition, Supplementary Figures 2a-d illustrate detailed statistics of how gliomas were distributed in brain anatomies in the examined datasets.

Table 2
www.frontiersin.org

Table 2. Average distribution of glioma in the brain across four datasets, considering the ratio of tumor volume to anatomical structures; L and R stand for left and right.

The proposed feature set measures the relative size of tumors w.r.t the anatomical structures. To better understand how large the tumors are, compared to the size of anatomical structures, Supplementary Tables 3, 4 indicate the absolute volumes of tumoral regions as well as measured brain structures in the 4 studied datasets. For instance, the volume of the left cerebellum cortex which is presented with label 8 varies in the range of 0.056 to 0.058 liter in the examined subjects. Then, the reported value in Table 2 means that 0.2% of the volume of this structure is affected by gliomas.

The largest segmented anatomical structures are left/right cerebral white matter and left/right cerebral cortex. From Table 2, the pattern of glioma distribution is rather similar for these structures. In fact, 9.3% of left cerebral white matter, 10% of right cerebral white matter, 4.8% of left cerebral cortex, and 5% of right cerebral cortex were affected by the presence of tumors. Figure 2 depicts the results of the proposed method on some challenging cases in which the large-size gliomas significantly changed the appearance of normal anatomies.

Figure 2
Fourteen images showing axial brain scans. The top and third rows display grayscale MRI images, while the second and bottom rows show corresponding segmentations with different colors representing various brain regions. Each column presents slices from similar axial levels.

Figure 2. Brain tissue segmentation in the presence of gliomas. An example of 8 challenging cases with large-size tumors that were segmented by the proposed method into 32 constituent anatomical structures.

3.2 Results of CV analysis

Extensive ablation studies were conducted to determine the optimal parameters of the employed learning algorithms. To this end, we first mixed all the datasets, then, randomly picked out 20 percent of the data to apply the grid search experiments. Supplementary Table 2 demonstrates the examined hyperparameters and their investigated range of values for model tuning. For each task, the performance of the algorithms, in terms of AUROC, was sorted and the top 5 performing models were selected for final training in which the full datasets were employed, independently, for CV analysis.

Experimental analysis determined that the most predictive subsets were comprised of 5 features from the spatial distribution pattern, 10 from radiomics, and 10 from the combined feature sets. Extending the feature selection beyond these thresholds did not enhance predictive performance, but rather increased model complexity without providing benefits. Supplementary Figure 4, Supplementary Table 5 depict the names and contributions of the selected features to the prediction power for each task. Tables 35 represent the performance of CV analysis over the validation sets.

Table 3
www.frontiersin.org

Table 3. The results of repeated CV analysis over ensembled models for MGMT methylation status prediction.

Table 4
www.frontiersin.org

Table 4. The results of repeated CV analysis over ensembled models for IDH status prediction.

Table 5
www.frontiersin.org

Table 5. The results of repeated CV analysis over ensembled models for OS time prediction.

The reported metrics indicate that the radiomics feature set carries more predictive power than the proposed spatial distribution pattern; however, combining these two feature sets yields the highest prediction powers for the conducted experiments. On average, the combined feature sets, in terms of AUROC, improved the prediction power of radiomics by 0.8% in MGMT analysis, 1.7% in IDH analysis, and 1.3% in OS time analysis.

For the MGMT status prediction, the highest prediction power was achieved from the UCSF dataset with an AUROC value of 0.685. This metric remains relatively within the same range (0.622 to 0.667) for the other three datasets. However, one can observe the presence of large variations in the prediction scores of IDH status. In particular, while the UCSF dataset carries the highest prediction power with a value of 0.972, the other two datasets resulted in AUROC values of 0.758 and 0.789. Nevertheless, OS time analysis resulted in rather similar results in which the highest prognostic power comes from the LUMIERE dataset with an AUROC score of 0.748. Supplementary Table 6 demonstrates the results of the statistical comparison between the examined feature sets.

3.3 Results on cross-dataset evaluation

To evaluate the generalization power of the models on external datasets, the best-performing models from CV analyses were selected to be used in the inference mode for predicting the class labels of other datasets. Therefore, the model developed on the UCSF dataset for the MGMT task was applied to the UPenn and LUMIERE datasets. However, since the UCSF dataset is a constituent of the MICCAI-RSNA dataset, this cross-data evaluation was skipped. Likewise, for the IDH task, the UCSF model was applied to the UPenn and LUMIERE datasets as well. Regarding the OS task, the LUMIERE dataset with only 86 subjects outweighed the other sets in CV analyses. For this task, the UPenn dataset resulted in a close performance (0.748 vs. 0.736), nevertheless, it comprises 366 more subjects than LUMIERE. To minimize the risk of overfitting, we picked the UPenn model to be applied to other cohorts. Tables 68 show the results of cross-data evaluations. Furthermore, Supplementary Figures 3 and 4 illustrate the ROC curves, calibration plots, and SHapley Additive exPlanations (SHAP) interpretability analyses for the best-performing models.

Table 6
www.frontiersin.org

Table 6. Prediction performance of the MGMT methylation status measured over testing datasets.

Table 7
www.frontiersin.org

Table 7. Prediction performance of the IDH status measured over testing datasets.

Table 8
www.frontiersin.org

Table 8. Prediction performance of the OS time status measured over testing datasets.

Similar to CV analyses, radiomic feature set outperformed the proposed spatial distribution pattern with rather remarkable margins. Nonetheless, the combined feature sets resulted in the highest prediction scores. The combined feature sets consistently outperformed the radiomic sets. Specifically, the AUROC scores improved by 0.85% for MGMT, 1.65% for IDH, 1.50% for OS. However, generalization remains a challenge. When applying the models to external datasets, performance dropped across all tasks: MGMT: -4.3%, IDH: -3.2%, and OS: -2.9%.

3.4 Results summary

In this study, we conducted a comprehensive radiomics analysis for the radiogenomic characterization and survival prediction of gliomas. Additionally, we proposed a novel feature set designed to quantify the spatial distribution of tumors within the brain anatomy and assessed its predictive value when integrated with standard radiomic features. Three distinct clinical tasks include MGMT promoter methylation, IDH mutation status, and OS time prediction, were investigated using three feature strategies (Radiomics, Spatial, and Combined). Detailed experiments were conducted to evaluate model performance under both internal CV and external cross-dataset evaluation settings. To facilitate a comparative analysis of these findings, Table 9 provides a consolidated summary of the discrimination powers in terms of AUROC achieved by the best-performing models across all experiments and evaluation protocols.

Table 9
www.frontiersin.org

Table 9. Summary of the best prediction performance achieved for each clinical task across CV and Cross-Dataset evaluation settings.

4 Discussion

While diagnosis and treatment of brain tumors mainly depend on the pathologic findings, the remarkable degree of inter/intra-tumor heterogeneity of gliomas significantly impacts the intervention options and even the clinical outcomes. Pathology-based evaluation is an inherently localized procedure and, therefore, may not be capable of fully capturing the heterogeneities in genetic patterns of gliomas. On the other hand, preoperative MRIs can provide quantitative global information that might be used for the prediction of glioma subtype and, therefore, influence the choice of treatment options. In recent years, large numbers of studies have been conducted, showing promising results for glioma stratification and prognostic prediction by using MRI data (4). In this work, we assessed the capabilities of radiomics and ML models for predicting the molecular subtypes and treatment outcomes of glioma patients. Additionally, we introduced a novel approach that can quantify the neuroanatomical location of gliomas. We thoroughly evaluated the accuracy and reliability of the radiomics and proposed feature sets for the prediction of MGMT methylation status, IDH status, and OS status.

4.1 Spatial distribution pattern

Segmenting the brain anatomy into 32 constituent structures in the presence of brain tumors with the proposed method provides a feature set that quantitatively presents how gliomas are distributed in the brain. Our results show a rather irregular pattern in the distribution of gliomas which were scattered across various brain locations. These findings align with previous research (23, 24). For instance, cerebral white and gray matters in the left and right hemispheres were equally affected by the gliomas. In addition, the right putamen, a small structure involved in movement control, has the highest tumor concentration, with over 16% of its volume affected. Conversely, the right cerebellum cortex, responsible for coordination, shows the least impact, with only 0.1% affected. However, the interpretation needs to consider the relative size of these structures. Large regions like the cerebral cortex (around 0.27 liters) can accommodate tumors with a smaller relative impact compared to tiny structures like the right accumbens area (0.001 liters), where even a small tumor can significantly affect a large portion of the region. Accordingly, brain tumors, in terms of relative size, can fully affect tiny regions while causing less impact on large structures. One surprising finding is that 0.5% of the cerebrospinal fluid (CSF), was affected by the tumor. While gliomas typically do not develop in CSF due to the lack of glial cells, it should be noted that we defined the tumoral regions as a combination of enhancing part, necrosis/non-enhancing as well as peritumoral edema. The edematous regions can contain infiltrative tumor cells and these cells can be presented in the CSF regions. Hence, the impact of tumoral regions on the CSF structures can signify the presence of infiltrative cells.

Employing the spatial distribution pattern for predicting the MGMT methylation, IDH mutation, and OS time status resulted in inferior performance compared to radiomic features in all the experiments. In fact, the proposed feature set aims at quantifying only the spatial distribution of the gliomas and cannot be directly compared to radiomics which are designed to capture intensity, morphological, and textural attributes. However, integrating the proposed feature set into radiomic pools consistently improved the discrimination power in different experiments. Indeed, the proposed spatial distribution pattern can provide global information regarding the impacts of gliomas on brain anatomies in contrast to radiomics which only quantifies the localized characteristics of tumoral regions. Therefore, the proposed method can be considered as a complementary feature set to radiomic pools. The selected features from the spatial distribution pattern were different for the three studied tasks. This observation implies that the impacted anatomical structures are not equally correlated with each of the MGMT methylation, IDH mutation, and OS time status. Additionally, in the combined feature set, textural radiomics and spatial distribution patterns frequently contributed to the build of most prognostic feature subsets when the FFS method was applied. The proposed feature set presents a valuable opportunity for quantifying the spatial distribution of infiltrative glioma cells. Recent findings indicate that the distribution of these cells, particularly those extending beyond the visible margins of the GTV, is correlated with OS outcomes (34, 35). Accordingly, the described spatial distribution pattern can be leveraged to estimate the precise anatomical spread of infiltrative cells within the brain.

Beyond improved prediction, the proposed method offers several advantages. It relies on simple, size-based measurements, making the results easily interpretable by clinicians. Additionally, the segmentation technique used to analyze brain structures can be applied to other brain tumor studies, such as investigating recurrence patterns and tracking changes in brain structures over time.

4.2 MGMT methylation status prediction

Our analysis of 1398 subjects across 4 datasets revealed a concerning limitation in predicting MGMT status. The highest achieved prediction power in terms of AUROC was only 0.685. This performance remained consistently poor across datasets and even dropped by 5.7% when the best-trained model was applied to other datasets during the cross-data testing phase, which resulted in the highest AUROC score of 0.628. To technically evaluate such poor results, it should be emphasized that the applied grid searches for tuning the model hyperparameters assured us that the developed models were not underfitted. Moreover, integrating the feature selection technique to keep only a few prognostic features, training the models with repeated CV strategy, and further ensembling the top-performing models increased our confidence that the developed models were not overfitted either. In this context, it should be noted that the MGMT labels were almost equally distributed between the binary classes; thus, the models were not biased toward the majority class. In particular, the percentage of methylated and unmethylated MGMTs was 0.52 vs. 0.48 for the RSNA dataset, 0.44 vs. 0.56 for the Upenn dataset, and 0.46 vs. 0.54 for the LUMIERE dataset. We adopted the SMOTE technique to deal with the imbalance distribution in the UCSF dataset (0.72 vs. 0.28). Accordingly, it can be derived that the observed poor prediction powers were not due to methodological issues. In other words, prediction scores of MGMT status remained poor regardless of the studied data, examined feature set, or training strategies. This finding is further substantiated by the calibration curves (Supplementary Figure 3). Although the combined feature set demonstrates slight adherence to the ideal diagonal compared to individual feature sets in the mid-probability range, all models exhibit suboptimal performance in this interval. The observed behavior of the curves suggests a limited discriminative capacity, indicating that the classifiers struggle to reliably stratify moderate-risk and high-risk cases.

Our observations are in agreement with the findings of recent studies investigating the potential correlations between MRI data and MGMT status.

As detailed in Supplementary Table 7, our results (AUROC 0.645) are competitive with recent literature using the RSNA dataset. While some studies on small-scale or private datasets report higher accuracy (41, 42), studies using the public RSNA benchmark challenge (39) consistently report AUROCs below 0.65 (3638, 40). While these discrepancies in the observed performance can be caused by a variety of reasons such as the inclusion of lower-grade tumors in the examined cohorts, and different techniques for determining the MGMT status in the laboratory, lack of external validation steps, and over-reliance on certain variables could be other important factors. Accordingly, it is strongly recommended to develop reproducible models on heterogeneous datasets and further evaluate their performance on external testing sets.

Our comprehensive experiments yielded poor prediction power, suggesting that current imaging biomarker models, particularly those utilizing structural MRIs, are inadequate for capturing meaningful correlations between imaging data and MGMT methylation status. Consequently, future research should explore alternative MRI sequences, such as perfusion and diffusion-weighted imaging, instead of using more advanced ML/DL models. More importantly, integrating imaging biomarkers with genetic and clinical data for multi-omics analyses holds promise for improved prediction.

4.3 IDH status prediction

In contrast to MGMT analysis, promising results were achieved for the prediction of IDH mutation status, demonstrating the potential of imaging data as a non-invasive measurement for tumor stratification purposes. After analyzing 1180 subjects from 3 different datasets, the most accurate model was developed over the UCSF dataset, which resulted in an average AUROC value of 0.972. It should be noted that the UCSF dataset contains a relatively good balance between the wildtype group with 398 subjects and the mutant group with 103 subjects. On the other hand, the distributions of class labels in the Upenn and LUMIERE cohorts were strongly skewed toward the wildtype IDH group. Specifically, 98% of subjects in the Upenn dataset and 96% of the LUMIERE cohort belonged to the wildtype class. Such severe imbalanced distributions cannot be compensated with resampling or augmentation techniques. Therefore, we reformulated the binary classification problem for these two cohorts as wildtype IDH class against the rest of the classes, which includes mutant, negatives, and NOS/NEC. Accordingly, the percentage of wildtype IDHs against the other class became 0.81 vs. 0.19 for the Upenn dataset and 0.83 vs. 0.17 for the LUMIERE cohort. While this label reformulation slightly improved the imbalanced issues, it essentially led to presenting multiple subclasses in one single group. Therefore, the highest prediction power over the Upenn and LUMIERE datasets (0.758 to 0.789) is not comparable to those of the UCSF cohort. We believe that the lower yet promising prediction powers achieved for these two cohorts can be justified by (i) highly imbalanced distributions of class labels, and (ii) classification complexity caused by label reformulation. Subsequently, applying the UCSF model to the Upenn and LUMIERE datasets in the cross-data testing step resulted in a significant reduction of the classification power of the UCSF dataset alone. Nonetheless, the AUROC metrics in the test phase were in the range of 0.726 to 0.764, which is relatively similar to the range of their CV analysis step. Additionally, no significant changes in the quantified values of the accuracy, sensitivity, and specificity metrics imply that the learned models are not biased toward either of the binarized classes.

Our proposed pipeline yields competitive classification power compared to other studies that developed and tested their models on the UCSF dataset. In this domain, the reported AUROC metrics on CV analysis with a variety of radiomics and end-to-end DL models lie in the range of 0.87 to 0.964 (30, 36, 43). Besides the UCSF dataset, the reported metrics from studies that predicted IDH status by running cross-data analysis show robustness in model performance with only limited drops of accuracy when tested on external datasets, resulting in the classification power of 0.84 to 0.86 (4446).

The IDH status prediction task yields the most stable calibration curves. While the spatial features alone are prone to error in high-probability regions, the radiomics and combined sets closely approximate the ideal diagonal, indicating trustworthy predictive capacity. Despite a slight systemic over-confidence, these models show superior alignment with reality compared to the other tasks. With the promising results we achieved for IDH mutation status and the lack of inconsistency in the reported results by other studies, it can be inferred that structural MRI data incorporate quantitative characteristics of glioma that are correlated with IDH mutation state. Nevertheless, relying on the models that were tested on limited datasets could be misleading; therefore, the reliability of imaging biomarker models for IDH status prediction should be further investigated by testing these models on multiple large-scale and heterogeneous datasets.

4.4 Overall survival prediction

Our experiments for predicting the OS time in terms of a binary status (short-term vs. long-term survivors) over a total number of 1308 subjects from 3 datasets resulted in relatively stable models. The highest discrimination scores for the CV analysis lie in the range of 0.729 to 0.748. Similar to the MGMT and IDH experiments, classification power was dropped when the best model achieved from the CV analysis evaluated on the testing datasets, which resulted in the highest value of 0.719. Different factors, including imaging vendors, acquisition settings, and patient populations, can cause such domain shifts between the training and testing cohorts. Our observations are in agreement with other relevant studies using the same type of image data for OS time predictions. In this domain, the developed radiomics and DL models resulted in robust prediction powers of up to 0.78 and dropped to 0.67 when testing on external datasets (12, 4749). It is worth mentioning that applying the median survival time as a hard threshold to binarize the OS status addresses the problem of class imbalance issues. Even for this simplified binary survival analysis, calibration curves display significant noise evidenced by oscillatory patterns. These deviations point to potential data sparsity within the Lumiere dataset and more importantly the models’ inability to converge on a stable signal, highlighting the complexity of modeling survival outcomes. While such a binary problem facilitates the development of ML/DL pipelines, in many applications, other types of predictions, including three-class classification and continuous-time prediction for survival analysis, are of interest. Nevertheless, the prediction powers of such scenarios remain limited despite the advances in the field of DL techniques (50, 51). Accordingly, it can be deduced that structural MRI data can provide a relatively meaningful correlation with the binarized OS time status of glioma patients; however, these image data are not sufficient to derive solid models for other survival analysis applications. Hence, more investigations are required to examine the potential of alternative MRI sequences as well as the combination of image data with laboratory and clinical factors for developing more robust and reliable models for survival analysis.

While imaging biomarker models show promise for various glioma-related tasks, their robustness and reliability require extensive validation and reproducibility testing. International benchmarking competitions (52), like those established by the BraTS community, offer an objective benchmarking framework through glioma diagnosis and prognosis prediction challenges. In this study, we compared MGMT methylation and OS time prediction against the reported results by related challenges, while we evaluated the IDH mutation prediction on 3 independent public datasets due to the absence of a dedicated challenge.

This study acknowledges some limitations which can be addressed in future studies. First, the potential bias due to imbalanced structure size in the proposed segmentation method can be addressed by incorporating alternative structural or functional atlases for more uniform segmentation. Second, the restricted use of structural MRI sequences calls for further investigation into the potential performance improvements gained by incorporating perfusion and diffusion MRI sequences. Finally, future studies should explore the impact of preprocessing steps like bias field correction and intensity normalization on the generalizability of the models.

5 Conclusion

Our experiments indicate that the imaging signature, derived by combining the proposed feature set with radiomics, exhibits a strong correlation with IDH mutation status (AUROC: CV = 0.972, Cross-data = 0.764). In contrast, we observed a moderate correlation with OS time (AUROC: CV = 0.748, Cross-data = 0.719) and a weak correlation with MGMT methylation status (AUROC: CV = 0.685, Cross-data = 0.628). These findings suggest the promising utility of imaging biomarkers for IDH prediction but highlight limitations in using solely image data for the other two tasks.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Author contributions

MA: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. ML: Writing – original draft, Writing – review & editing. IT: Funding acquisition, Supervision, Writing – original draft, Writing – review & editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This study was supported by the Swedish Cancer Society. The Cancer Research Funds of Radiumhemmet and The Swedish Research Council (2020-04618) are gratefully acknowledged.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2025.1679634/full#supplementary-material

Footnotes

  1. ^ Not otherwise specified (NOS), Not elsewhere classified (NEC).

References

1. Louis DN, Perry A, Wesseling P, Brat DJ, Cree IA, Figarella-Branger D, et al. The 2021 WHO classification of tumors of the central nervous system: a summary. Neuro Oncol. (2021) 23:1231–51. doi: 10.1093/NEUONC/NOAB106

PubMed Abstract | Crossref Full Text | Google Scholar

2. Martin P, Holloway L, Metcalfe P, Koh ES, and Brighi C. Challenges in glioblastoma radiomics and the path to clinical implementation. Cancers. (2022) 14:3897. doi: 10.3390/CANCERS14163897

PubMed Abstract | Crossref Full Text | Google Scholar

3. Guo J, Fathi Kazerooni A, Toorens E, Akbari H, Yu F, Sako C, et al. Integrating imaging and genomic data for the discovery of distinct glioblastoma subtypes: a joint learning approach. Sci Rep. (2024) 14:1–11. doi: 10.1038/s41598-024-55072-y

PubMed Abstract | Crossref Full Text | Google Scholar

4. Lost J, Verma T, Jekel L, von Reppert M, Tillmanns N, Merkaj S, et al. Systematic literature review of machine learning algorithms using pretherapy radiologic imaging for glioma molecular subtype prediction. Am J Neuroradiol. (2023) 44:1126–34. doi: 10.3174/AJNR.A8000

PubMed Abstract | Crossref Full Text | Google Scholar

5. Butler M, Pongor L, Su YT, Xi L, Raffeld M, Quezado M, et al. MGMT status as a clinical biomarker in glioblastoma. Trends Cancer. (2020) 6:380–91. doi: 10.1016/j.trecan.2020.02.010

PubMed Abstract | Crossref Full Text | Google Scholar

6. Olar A and Aldape KD. Using the molecular classification of glioblastoma to inform personalized treatment. J Pathol. (2014) 232:165–77. doi: 10.1002/PATH.4282

PubMed Abstract | Crossref Full Text | Google Scholar

7. Salle G, Tumminello L, Laino ME, Shalaby S, Aghakhanyan G, Fanni SC, et al. Accuracy of radiomics in predicting IDH mutation status in diffuse gliomas: A bivariate meta-analysis. Radiol Artif Intell. (2024) 6. doi: 10.1148/RYAI.220257/ASSET/IMAGES/LARGE/RYAI.220257.FIG4.JPEG

PubMed Abstract | Crossref Full Text | Google Scholar

8. McLendon R, Friedman A, Bigner D, Van Meir EG, Brat DJ, Mastrogianakis GM, et al. Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature. (2008) 455:1061–8. doi: 10.1038/nature07385

PubMed Abstract | Crossref Full Text | Google Scholar

9. Menze B, Isensee F, Wiest R, Wiestler B, Maier-Hein K, Reyes M, et al. Analyzing magnetic resonance imaging data from glioma patients using deep learning. Comput Med Imaging Graph. (2021) 88:101828. doi: 10.1016/J.COMPMEDIMAG.2020.101828

PubMed Abstract | Crossref Full Text | Google Scholar

10. Flies CM, Friedrich M, Lohmann P, van Garderen KA, Smits M, Tonn J-C, et al. Treatment-associated imaging changes in newly diagnosed MGMT promoter-methylated glioblastoma undergoing chemoradiation with or without cilengitide. Neuro Oncol. (2024) 26(5):902–10. doi: 10.1093/NEUONC/NOAD247

PubMed Abstract | Crossref Full Text | Google Scholar

11. Rathore S, Mohan S, Bakas S, Sako C, Badve C, Pati S, et al. Multi-institutional noninvasive in vivo characterization of IDH, 1p/19q, and EGFRvIII in glioma using neuro-Cancer Imaging Phenomics Toolkit (neuro-CaPTk). Neurooncol Adv. (2020) 2:iv22–34. doi: 10.1093/NOAJNL/VDAA128

PubMed Abstract | Crossref Full Text | Google Scholar

12. Lee JO, Ahn SS, Choi KS, Lee J, Jang J, Park JH, et al. Added prognostic value of 3D deep learning-derived features from preoperative MRI for adult-type diffuse gliomas. Neuro Oncol. (2024) 26:571–80. doi: 10.1093/NEUONC/NOAD202

PubMed Abstract | Crossref Full Text | Google Scholar

13. Saeed N, Hardan S, Abutalip K, and Yaqub M. Is it Possible to Predict MGMT Promoter Methylation from Brain Tumor MRI Scans using Deep Learning Models? Proc Mach Learn Res. (2022) 172:1005–18. Available online at: https://arxiv.org/abs/2201.06086v2.

Google Scholar

14. Baid U, Ghodasara S, Mohan S, Bilello M, Calabrese E, Colak E, et al. The RSNA-ASNR-MICCAI braTS 2021 benchmark on brain tumor segmentation and radiogenomic classification (2021). Available online at: https://arxiv.org/abs/2107.02314v2 (Accessed March 28, 2024).

Google Scholar

15. Abler D, Andrearczyk V, Oreiller V, Garcia JB, Vuong D, Tanadini-Lang S, et al. Comparison of MR preprocessing strategies and sequences for radiomics-based MGMT prediction. In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Springer (2022). p. 367–80. 12963 LNCS. doi: 10.1007/978-3-031-09002-8_33/COVER

Crossref Full Text | Google Scholar

16. Bakas S, Sako C, Akbari H, Bilello M, Sotiras A, Shukla G, et al. The University of Pennsylvania glioblastoma (UPenn-GBM) cohort: advanced MRI, clinical, genomics, & radiomics. Sci Data. (2022) 9:1–12. doi: 10.1038/s41597-022-01560-7

PubMed Abstract | Crossref Full Text | Google Scholar

17. Calabrese E, Villanueva-Meyer JE, Rudie JD, Rauschecker AM, Baid U, Bakas S, et al. The university of California San Francisco preoperative diffuse glioma MRI dataset. Radiol Artif Intell. (2022) 4. doi: 10.1148/RYAI.220058/ASSET/IMAGES/LARGE/RYAI.220058.TBL2.JPEG

PubMed Abstract | Crossref Full Text | Google Scholar

18. Suter Y, Knecht U, Valenzuela W, Notter M, Hewer E, Schucht P, et al. The LUMIERE dataset: Longitudinal Glioblastoma MRI with expert RANO evaluation. Sci Data. (2022) 9:1–8. doi: 10.1038/s41597-022-01881-7

PubMed Abstract | Crossref Full Text | Google Scholar

19. Rohlfing T, Zahr NM, Sullivan EV, and Pfefferbaum A. The SRI24 multichannel atlas of normal adult human brain structure. Hum Brain Mapp. (2010) 31:798–819. doi: 10.1002/HBM.20906

PubMed Abstract | Crossref Full Text | Google Scholar

20. Eickhoff SB, Yeo BTT, and Genon S. Imaging-based parcellations of the human brain. Nat Rev Neurosci. (2018) 19:672–86. doi: 10.1038/s41583-018-0071-7

PubMed Abstract | Crossref Full Text | Google Scholar

21. Billot B, Magdamo C, Cheng Y, Arnold SE, Das S, and Iglesias JE. Robust machine learning segmentation for large-scale analysis of heterogeneous clinical brain MRI datasets. Proc Natl Acad Sci U.S.A. (2023) 120:e2216399120. doi: 10.1073/PNAS.2216399120/SUPPL_FILE/PNAS.2216399120.SAPP.PDF

PubMed Abstract | Crossref Full Text | Google Scholar

22. Billot B, Greve DN, Puonti O, Thielscher A, Van Leemput K, Fischl B, et al. SynthSeg: Segmentation of brain MRI scans of any contrast and resolution without retraining. Med Image Anal. (2023) 86:102789. doi: 10.1016/J.MEDIA.2023.102789

PubMed Abstract | Crossref Full Text | Google Scholar

23. Larjavaara S, Mäntylä R, Salminen T, Haapasalo H, Raitanen J, Jääskeläinen J, et al. Incidence of gliomas by anatomic location. Neuro Oncol. (2007) 9:319. doi: 10.1215/15228517-2007-016

PubMed Abstract | Crossref Full Text | Google Scholar

24. MacKintosh C, Butterfield R, Zhang N, Lorence J, Zlomanczuk P, Bendok B, et al. Does location matter? Characterisation of the anatomic locations, molecular profiles, and clinical features of gliomas. Neurol Neurochir Pol. (2020) 54:456–65. doi: 10.5603/PJNNS.A2020.0067

PubMed Abstract | Crossref Full Text | Google Scholar

25. Van Griethuysen JJM, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. (2017) 77:e104–7. doi: 10.1158/0008-5472.CAN-17-0339

PubMed Abstract | Crossref Full Text | Google Scholar

26. Zwanenburg A, Vallières M, Abdalah MA, Aerts HJWL, Andrearczyk V, Apte A, et al. The image biomarker standardization initiative: Standardized quantitative radiomics for high-throughput image-based phenotyping. Radiology. (2020) 295:328–38. doi: 10.1148/RADIOL.2020191145/ASSET/IMAGES/LARGE/RADIOL.2020191145.FIG5.JPEG

PubMed Abstract | Crossref Full Text | Google Scholar

27. Kocak B, Baessler B, Bakas S, Cuocolo R, Fedorov A, Maier-Hein L, et al. CheckList for EvaluAtion of Radiomics research (CLEAR): a step-by-step reporting guideline for authors and reviewers endorsed by ESR and EuSoMII. Insights Imaging. (2023) 14:1–13. doi: 10.1186/S13244-023-01415-8/TABLES/1

PubMed Abstract | Crossref Full Text | Google Scholar

28. Chen T and Guestrin C. (2016). XGBoost: A scalable tree boosting system, in: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (San Francisco, California, USA: Association for Computing Machinery), 13-17-August-2016. pp. 785–94. doi: 10.1145/2939672.2939785

Crossref Full Text | Google Scholar

29. Astaraki M, Yang G, Zakko Y, Toma-Dasu I, Smedby Ö, and Wang C. A comparative study of radiomics and deep-learning based methods for pulmonary nodule Malignancy prediction in low dose CT images. Front Oncol. (2021) 11:3389/fonc.2021.737368. doi: 10.3389/fonc.2021.737368

PubMed Abstract | Crossref Full Text | Google Scholar

30. Foltyn-Dumitru M, Schell M, Rastogi A, Sahm F, Kessler T, Wick W, et al. Impact of signal intensity normalization of MRI on the generalizability of radiomic-based prediction of molecular glioma subtypes. Eur Radiol. (2023) 34:2782–90. doi: 10.1007/S00330-023-10034-2/FIGURES/3

PubMed Abstract | Crossref Full Text | Google Scholar

31. Mantovani RG, Rossi ALD, Vanschoren J, Bischl B, and De Carvalho ACPLF. (2015). Effectiveness of Random Search in SVM hyper-parameter tuning, in: Proceedings of the International Joint Conference on Neural Networks, 2015-September. doi: 10.1109/IJCNN.2015.7280664

Crossref Full Text | Google Scholar

32. Chawla NV, Bowyer KW, Hall LO, and Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res. (2002) 16:321–57. doi: 10.1613/jair.953

Crossref Full Text | Google Scholar

33. DeLong ER, DeLong DM, and Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics. (1988) 44:837. doi: 10.2307/2531595

Crossref Full Text | Google Scholar

34. Häger W, Toma-Daşu I, Astaraki M, and Lazzeroni M. Overall survival prediction for high-grade glioma patients using mathematical modeling of tumor cell infiltration. Physica Med. (2023) 113. doi: 10.1016/J.EJMP.2023.102669/ASSET/40B2E7E8-AC3B-4BFB-A7F5-002038F97887/MAIN.ASSETS/GR5.SML

PubMed Abstract | Crossref Full Text | Google Scholar

35. Astaraki M, Häger W, Lazzeroni M, and Toma-Dasu I. Radiomics and deep learning models for glioblastoma treatment outcome prediction based on tumor invasion modeling. Physica Med. (2025) 129. doi: 10.1016/J.EJMP.2024.104881/ATTACHMENT/2A2EA84B-3B13-440D-BD6E-56C4A9A3F0FC/MMC1.DOCX

PubMed Abstract | Crossref Full Text | Google Scholar

36. Calabrese E, Villanueva-Meyer JE, and Cha S. A fully automated artificial intelligence method for non-invasive, imaging-based identification of genetic alterations in glioblastomas. Sci Rep. (2020) 10:1–11. doi: 10.1038/s41598-020-68857-8

PubMed Abstract | Crossref Full Text | Google Scholar

37. Saeed N, Ridzuan M, Alasmawi H, Sobirov I, and Yaqub M. MGMT promoter methylation status prediction using MRI scans? An extensive experimental evaluation of deep learning models. Med Image Anal. (2023) 90:102989. doi: 10.1016/J.MEDIA.2023.102989

PubMed Abstract | Crossref Full Text | Google Scholar

38. Capuozzo S, Gravina M, Gatta G, and Marrone S. Sansone C. A multimodal knowledge-based deep learning approach for MGMT promoter methylation identification. J Imaging. (2022) 8:321. doi: 10.3390/JIMAGING8120321

PubMed Abstract | Crossref Full Text | Google Scholar

39. Leaderboard of RSNA-MICCAI brain tumor radiogenomic classification challenge. Available online at: https://www.kaggle.com/competitions/rsna-miccai-brain-tumor-radiogenomic-classification/leaderboard (Accessed April 27, 2024).

Google Scholar

40. Emchinov A. A deep learning approach to glioblastoma radiogenomic classification using brain MRI. In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Springer (2022). p. 345–56. 12963 LNCS. doi: 10.1007/978-3-031-09002-8_31

Crossref Full Text | Google Scholar

41. Korkmaz S and Demirel E. The success of volumetric means ADC in predicting MGMT promoter hypermethylation in glioblastomas. J Health Sci Med. (2023) 6:767–71. doi: 10.32322/JHSM.1311867

Crossref Full Text | Google Scholar

42. Chen S, Xu Y, Ye M, Li Y, Sun Y, Liang J, et al. Predicting MGMT promoter methylation in diffuse gliomas using deep learning with radiomics. J Clin Med. (2022) 11:3445. doi: 10.3390/JCM11123445/S1

PubMed Abstract | Crossref Full Text | Google Scholar

43. Bangalore Yogananda CG, Wagner BC, Truong NCD, Holcomb JM, Reddy DD, Saadat N, et al. MRI-based deep learning method for classification of IDH mutation status. Bioengineering. (2023) 10:1045. doi: 10.3390/BIOENGINEERING10091045/S1

Crossref Full Text | Google Scholar

44. Du N, Zhou X, Mao R, Shu W, Xiao L, Ye Y, et al. Preoperative and noninvasive prediction of gliomas histopathological grades and IDH molecular types using multiple MRI characteristics. Front Oncol. (2022) 12:873839/BIBTEX. doi: 10.3389/FONC.2022.873839/BIBTEX

PubMed Abstract | Crossref Full Text | Google Scholar

45. Wu J, Xu Q, Shen Y, Chen W, Xu K, and Qi XR. Swin transformer improves the IDH mutation status prediction of gliomas free of MRI-based tumor segmentation. J Clin Med. (2022) 11:4625. doi: 10.3390/JCM11154625/S1

PubMed Abstract | Crossref Full Text | Google Scholar

46. Wei Y, Li Y, Chen X, Schönlieb CB, Li C, and Price SJ. Predicting isocitrate dehydrogenase mutation status in glioma using structural brain networks and graph neural networks. In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Springer (2022). p. 140–50. 12962 LNCS. doi: 10.1007/978-3-031-08999-2_11

Crossref Full Text | Google Scholar

47. Pálsson S, Cerri S, Poulsen HS, Urup T, Law I, and Van Leemput K. Predicting survival of glioblastoma from automatic whole-brain and tumor segmentation of MR images. Sci Rep. (2022) 12:1–12. doi: 10.1038/s41598-022-19223-3

PubMed Abstract | Crossref Full Text | Google Scholar

48. Fathi Kazerooni A, Saxena S, Toorens E, Tu D, Bashyam V, Akbari H, et al. Clinical measures, radiomics, and genomics offer synergistic value in AI-based prediction of overall survival in patients with glioblastoma. Sci Rep. (2022) 12:1–13. doi: 10.1038/s41598-022-12699-z

PubMed Abstract | Crossref Full Text | Google Scholar

49. Bakas S, Shukla G, Akbari H, Erus G, Sotiras A, Rathore S, et al. Overall survival prediction in glioblastoma patients using structural magnetic resonance imaging (MRI): advanced radiomic features may compensate for lack of advanced MRI modalities. J Med Imaging. (2020) 7:1. doi: 10.1117/1.JMI.7.3.031505

PubMed Abstract | Crossref Full Text | Google Scholar

50. Bakas S, Reyes M, Jakab A, Bauer S, Rempfler M, Crimi A, et al. Identifying the best machine learning algorithms for brain tumor segmentation, progression assessment, and overall survival prediction in the BRATS challenge. (2019). doi: 10.48550/arXiv.1811.02629

Crossref Full Text | Google Scholar

51. Huang H, Zhang W, Fang Y, Hong J, Su S, and Lai X. Overall survival prediction for gliomas using a novel compound approach. Front Oncol. (2021) 11:724191/BIBTEX. doi: 10.3389/FONC.2021.724191/BIBTEX

PubMed Abstract | Crossref Full Text | Google Scholar

52. Eisenmann M, Reinke A, Weru V, Tizabi MD, Isensee F, Adler TJ, et al. (2023). Why is the winner the best?, in: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2023-June. pp. 19955–66. doi: 10.1109/CVPR52729.2023.01911

Crossref Full Text | Google Scholar

Keywords: brain segmentation, glioma, radiogenomics, radiomics, survival status

Citation: Astaraki M, Lazzeroni M and Toma-Dasu I (2026) A comprehensive evaluation of MRI-based radiogenomics and prognosis prediction in glioma. Front. Oncol. 15:1679634. doi: 10.3389/fonc.2025.1679634

Received: 04 August 2025; Accepted: 08 December 2025; Revised: 27 November 2025;
Published: 05 January 2026.

Edited by:

Jan Petr, Helmholtz Association of German Research Centres (HZ), Germany

Reviewed by:

Marco Diego Dominietto, Gate To Brain SA, Switzerland
Paola Feraco, University of Trento, Italy

Copyright © 2026 Astaraki, Lazzeroni and Toma-Dasu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Mehdi Astaraki, bWVoZGkuYXN0YXJha2lAa2kuc2U=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.