Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Oncol., 12 January 2026

Sec. Cancer Imaging and Image-directed Interventions

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1743451

This article is part of the Research TopicTracing the Evolution of Diagnostic Imaging in Glioma ManagementView all articles

Evaluating sequence contributions to MRI radiomics for glioblastoma survival: single vs fusion models

Ruirui GuoRuirui Guo1Ya Gao*Ya Gao2*
  • 1Department of Radiology, Shenmu Hospital, Shenmu, Shaanxi, China
  • 2Department of Functional Sciences, Shenmu Hospital, Shenmu, Shaanxi, China

Background: Glioblastoma is the most aggressive primary brain tumor with a poor prognosis. Multiparametric MRI-based radiomics shows promise for prognosis, yet most studies fuse all sequences without quantifying their individual contributions. We systematically compared the prognostic value of features from single sequences and comprehensive fusions for survival classification.

Methods: This retrospective study included glioblastoma patients from TCIA. Quantitative features were extracted from T1, T1-GD, T2, and T2-FLAIR images. Univariate ROC analyses with Benjamini-Hochberg false discovery rate correction assessed each feature’s prognostic value. For multivariate modeling, combining univariate analysis with the maximum relevance minimum redundancy algorithm, was used to select the top predictors for each model. Logistic regression models were then built using features from single sequences, dual-sequence fusions, and a comprehensive four-sequence fusion. Performance was compared using repeated random subsampling validation with AUC as the metric.

Results: Univariate analysis identified 249 features with significant prognostic power (T1-GD: 79; T2: 58; T2-FLAIR: 57; T1: 55). In the multivariate analysis, the four-sequence fusion model achieved the highest performance on the training cohort (AUC: 0.8467), but this advantage did not generalize to the validation cohorts. On the validation set, single-sequence models achieved AUCs ranging from 0.6599 to 0.7010, with the T2 model performing best. The dual-sequence model combining features from T1-GD and T2 sequences yielded the highest overall performance, achieving a mean validation AUC of 0.7066. Notably, this targeted two-sequence model outperformed the more complex four-sequence model (AUC: 0.7030).

Conclusions: For glioblastoma survival prediction, a strategic selection of complementary imaging sequences might be more effective than an indiscriminate aggregation of all available structural MRI data. However, this finding is currently specific to this particular dataset and clinical task. Future research using large-scale data with multiple tumor types and clinical objectives is necessary to explore the broader generalizability of these results.

Introduction

Glioblastoma is the most common and aggressive primary brain tumor in adults, with a prognosis that remains exceptionally poor despite multimodal treatment (1, 2). The median survival time has seen little improvement over the past decade, highlighting an urgent need for reliable prognostic biomarkers (3, 4). Such biomarkers are critical for patient counseling, clinical trial stratification, and guiding personalized therapeutic strategies (5). While molecular markers are valuable, they require invasive tissue sampling (6). This limitation has fueled interest in non-invasive methods, positioning medical imaging at the forefront of prognostic research.

Medical imaging, particularly multiparametric Magnetic Resonance Imaging (MRI), has emerged as a cornerstone in the management of glioblastoma, providing critical information on tumor location, size, and relationship with surrounding structures (7). The emerging field of radiomics, which extracts high-throughput quantitative features from these images, shows significant promise for developing powerful prediction models (813). The standard glioblastoma protocol includes several MRI sequences, each offering unique biological insights. For instance, post-contrast T1-weighted (T1-Gd) images delineate the active tumor core, while T2-weighted (T2) and T2 Fluid-Attenuated Inversion Recovery (T2-FLAIR) sequences excel at visualizing surrounding edema and non-enhancing tumor (14, 15).

Given this multiparametric approach, most radiomic studies operate on a prevailing assumption: that fusing features from all available sequences will yield the most powerful and comprehensive model (16, 17). However, this “more is better” strategy has a critical, often unexamined, drawback. Indiscriminately aggregating data can introduce information redundancy and noise, increasing model complexity without a proportional gain in prognostic value. More importantly, this complexity raises the risk of overfitting, where a model performs exceptionally well on training data but fails to generalize to new, unseen cases. To date, few studies have systematically dismantled this fusion-based approach to quantify the actual prognostic contribution of each individual sequence.

Despite the rapid growth of GBM radiomics using multiparametric MRI, the optimal scope of sequence inclusion remains uncertain. Many studies report improved performance with multi sequence inputs. However, fewer works have performed a controlled comparison of reduced sequence sets versus full structural protocols for survival related endpoints. Such comparisons are often confounded by differences in classifiers, feature selection strategies, or feature counts. As a result, it remains unclear whether broader fusion truly adds task specific value beyond a targeted subset of sequences.

Therefore, this study challenges the conventional fusion paradigm. Our primary objective was to systematically evaluate the independent and combined prognostic value of features from four standard MRI sequences for glioblastoma survival classification. We hypothesized that a strategic combination of complementary sequences would produce a more robust and generalizable model than an indiscriminate, all-inclusive fusion. To test this, we constructed and rigorously compared the performance of models built from single sequences, dual-sequence fusions, and a comprehensive four-sequence fusion. This work seeks to clarify the contribution of each MRI sequence, providing crucial insights for designing more efficient and effective radiomics models in neuro-oncology.

Materials and methods

Study design and patient dataset

Our study workflow consisted of two main stages: univariate and multivariate analysis (Figure 1). We obtained a retrospective dataset of 227 glioblastoma patients from The Cancer Imaging Archive (TCIA) (18, 19). For each patient, we used multiparametric MRI data, including T1-weighted (T1), post-contrast T1-weighted (T1-Gd), T2-weighted (T2), and T2 fluid-attenuated inversion recovery (T2-FLAIR) sequences. All images underwent a uniform preprocessing pipeline before feature extraction. After de identification and de facing, scans were maintained in their original resolution and orientation. Brain extraction and tumor sub region segmentation were then applied. The regions of interest included the necrotic tumor core, the enhancing tumor, and the peritumoral edematous or infiltrated tissue. The enhancing tumor and necrotic core were identified based on contrast related intensity differences on T1 Gd relative to T1 and in reference to normal appearing white matter. The necrotic core encompassed non enhancing or faintly enhancing tumor components along with transitional and necrotic regions that are commonly resected together with the enhancing component. The edema region was delineated by the hyperintense signal envelope on T2 FLAIR images.

Figure 1
Flowchart detailing the analysis process for MRI images. It starts with TCIA and UPENN-GBM data leading to T1, T1-Gd, T2, and FLAIR MR images. These images yield respective features, which undergo univariate analysis and comparison. Feature selection follows, resulting in T1, T1-Gd, T2, and FLAIR signatures, culminating in multivariate analysis and comparison.

Figure 1. The workflow of this study. The workflow begins with data acquisition from the TCIA database. Subsequent stages include a univariate analysis for initial feature screening, and a multivariate analysis involving feature selection and the construction of single-sequence and multi-sequence prognostic models for comparison.

We defined the survival endpoint as one-year overall survival and performed binary classification based on whether patients were alive at 12 months after diagnosis. The demographic details of these patients were summarized in Table 1.

Table 1
www.frontiersin.org

Table 1. Demographics of the dataset for this study.

Tumor segmentations and the corresponding pre computed radiomic feature tables were obtained from the previously published resource associated with the UPenn GBM collection on TCIA. The original resource provided standardized tumor subregion masks for the cohort. We used these segmentations as provided. We did not perform additional manual editing or re segmentation. The radiomic features for each MRI sequence were computed in the original work using a consistent extraction configuration. We adopted the same feature tables for all single sequence and fusion analyses. This design ensures that our comparisons across sequences were based on a shared upstream segmentation and feature extraction framework.

To enhance reproducibility, we utilized pre-calculated radiomic features from a prior study. These features were originally extracted using the PyRadiomics library in compliance with the Image Biomarker Standardisation Initiative (IBSI) guidelines. A total of 432 quantitative features were available for each MRI sequence. Detailed descriptions of the image preprocessing and feature extraction are available in the previous studies (12, 18).

Univariate analysis

First, we assessed the prognostic value of each individual radiomic feature using univariate receiver operating characteristic (ROC) analysis (20). The area under the ROC curve (AUC) served as the primary metric to quantify discriminatory performance, where an AUC of 1.0 represents a perfect classifier and 0.5 indicates performance equivalent to random chance. We determined the statistical significance of each feature by testing its AUC against a null value of 0.5 with the DeLong test (21).

To account for multiple comparisons arising from the large feature set, the raw p-values were adjusted using the Benjamini-Hochberg procedure to control the false discovery rate (FDR) (22). A feature was considered to possess statistically significant discriminatory power if its corresponding FDR-adjusted P-value was less than 0.05.

Multivariate analysis

To build robust models, we implemented a repeated random sub-sampling validation strategy. We repeated the entire model development and evaluation process 10 times. In each iteration, we randomly partitioned the patients into a training cohort and a validation cohort at a ratio of 3:1. All feature values were normalized prior to feature selection.

Our multivariate analysis employed a two-stage feature selection process (23, 24). First, we screened features based on the univariate analysis, retaining only those with significant prognostic value (FDR-adjusted p < 0.05). This step reduced feature space dimensionality and filtered out potential noise. Second, we applied the maximum relevance minimum redundancy (mRMR) algorithm to this reduced set. The mRMR algorithm selects an optimal feature subset by maximizing relevance to the outcome while minimizing redundancy among features (25). The mRMR feature selection approach was implemented in R software using the “mRMRe” package (26). Univariate screening and mRMR were performed independently within each repeated split using only the training subset, and the selected features were used to train the classifier before evaluation on the held-out validation subset.

To ensure a fair and standardized comparison between models, we fixed the feature subset size at 10 for all models. We then used a logistic regression algorithm to construct the classifiers. This consistent framework allowed for an objective evaluation of three model categories:

(1) single-sequence models, which were constructed using significant features derived exclusively from one of the four sequences (i.e., T1-only, T1-GD-only, T2-only, and T2-FLAIR-only); (2) dual-sequence fusion models, which combined significant features from pairs of complementary sequences; and (3) a comprehensive multi-sequence fusion model, which incorporated significant features from all four sequences simultaneously.

It was pertinent to clarify that the primary goal of this study was not to build the single best prognostic model, but rather to systematically compare the predictive performance of these different MRI feature sets. Our methodology was therefore designed to support this specific comparative objective.

Results

Univariate analysis and comparison

From an initial set of 1728 radiomic features, with 432 extracted from each of the four MRI sequences (T1, T1-GD, T2, and T2-FLAIR), the univariate ROC analysis was performed to identify individual predictors with significant prognostic value. After statistical analysis using the DeLong test and subsequent correction for multiple comparisons with the Benjamini-Hochberg procedure, 249 of the 1728 features were identified as having statistically significant discriminatory power (all FDR-adjusted P-values < 0.05).

The prognostic performance and distribution of these 249 significant features across the different sequences are summarized in Table 2. Specifically, the T1-GD sequence yielded the highest number of significant predictors (n=79), followed by T2 (n=58), T2-FLAIR (n=57), and T1 (n=55) (Figure 1a). The retained features from all sequences demonstrated modest but consistently significant predictive capabilities. The mean AUC values were comparable across the modalities, with features from the T2-FLAIR sequence exhibiting the highest average performance (Mean AUC = 0.6483 ± 0.0266; Median AUC = 0.6530) (Figure 2b). Notably, the single feature with the highest individual prognostic value originated from the T1-GD sequence, achieving a maximum AUC of 0.7119. Overall, the AUC values for all significant features ranged from a minimum of 0.5978 to a maximum of 0.7119.

Table 2
www.frontiersin.org

Table 2. Univariate analysis for individual imaging feature.

Figure 2
Two-part chart shows MRI sequence data. Chart (a) is a bar graph displaying counts for T1, T1-GD, T2, and T2-FLAIR sequences with values 55, 79, 58, and 57, respectively. Chart (b) is a box plot showing AUC distributions for T1, T1GD, T2, and FLAIR sequences.

Figure 2. Univariate analysis of significant prognostic features from different MRI sequences. (a) Bar chart illustrating the number of statistically significant features identified from each of the four MRI sequences (T1, T1-GD, T2, and T2-FLAIR). (b) Boxplot comparing the distribution of the AUC values for the significant features from each sequence.

Multivariate analysis and comparison

In line with our initial hypothesis, the multivariate analysis revealed distinct prognostic capabilities among the models derived from different MRI sequences. Following the feature selection protocol, we constructed and evaluated three categories of multivariate models: single-sequence, dual-sequence fusion, and a comprehensive four-sequence model. The performance of the single-sequence models is visually summarized in Figure 3 and detailed in Table 3. For the single-sequence models, the mean AUC values on the validation cohorts ranged from 0.6599 to 0.7010 (Figure 3b). Among the four individual MRI sequences, the model derived from T2-weighted images demonstrated the highest prognostic performance, achieving a mean validation AUC of 0.7010. The performance of the T1-GD model was highly comparable, with a mean AUC of 0.6939, and there was no significant statistical difference between the T2 and T1-GD based models (P-value = 0.8789). In contrast, the T1 and T2-FLAIR models yielded slightly lower performances, with mean AUCs of 0.6605 and 0.6599, respectively. Although the T2-based model showed a trend towards better performance compared to the T1 and T2-FLAIR models, these differences did not reach statistical significance (P-value = 0.0753 and P-value = 0.0892, respectively).

Figure 3
Box plots comparing AUC values for different MRI sequences. Plot (a) shows higher AUC values ranging from approximately 0.76 to 0.84 across T1, T1-GD, T2, and T2-FLAIR sequences. Plot (b) shows lower AUC values ranging from about 0.65 to 0.8 for the same sequences.

Figure 3. Performance of single-sequence prognostic models. Boxplots comparing the distribution of AUC values for models constructed from each of the four individual MRI sequences across 10 independent experiments. (a) Performance on the training cohorts. (b) Performance on the corresponding validation cohorts.

Table 3
www.frontiersin.org

Table 3. Detailed prognosis performance of single-sequence images using multivariate analysis.

Next, we assessed the performance of the fusion models that combined features from multiple sequences. The results are visualized in Figure 4 and summarized in detail in Table 4. On the training cohorts, fusing features consistently provided more effective information and yielded higher predictive performance (Figure 4a). For example, the comprehensive four-sequence model achieved the highest training AUC of 0.8467, and the best dual-sequence model (T1-GD + T2) reached a training AUC of 0.8384, both of which were higher than the best single-sequence model’s training performance (T1-GD, AUC = 0.8176). However, this performance enhancement was less consistent on the validation cohorts (Figure 4b). The dual-sequence model combining T1-GD and T2 features yielded the highest prognostic performance among all constructed models, with a mean validation AUC of 0.7066. The comprehensive model that integrated all four sequences also showed robust performance with a mean AUC of 0.7030. Despite the success of these top-performing fusion models, not all combinations resulted in improved accuracy. Several dual-sequence models, such as the one combining T1 and T2 features (mean AUC = 0.6836), resulted in lower validation performance than the best single-sequence models. These findings suggest that while feature fusion can be beneficial, the specific combination of T1-GD and T2 sequences provides the most synergistic and effective prognostic information for the validation set.

Figure 4
Two boxplots compare AUC values for MRI sequences. Plot a shows higher AUC values, ranging from 0.78 to 0.86. Plot b shows lower values, from 0.6 to 0.75. Both plots evaluate sequences like T1, T2, FLAIR, and others, with 'ALL' generally having the highest AUC.

Figure 4. Performance comparison of dual-sequence and multi-sequence fusion models. Boxplots comparing the AUC distributions for models built using features from pairs of sequences and a comprehensive model using features from all four sequences (“ALL”). (a) Performance on the training cohorts. (b) Performance on the validation cohorts.

Table 4
www.frontiersin.org

Table 4. Detailed prognosis performance of dual-sequence images using multivariate analysis.

Feature comparison of the multi-sequence models

To investigate the contribution and stability of individual features in the multivariate models, we analyzed the selection frequency of each feature by the mRMR algorithm across the 10 independent iterations. The most frequently selected features from each of the four MRI sequences are detailed in Tables 5, 6. The analysis revealed that certain features served as highly stable and robust predictors. From the T1-GD sequence, the feature ET_Histogram_Bins-16_Bins-16_Bin-13_Probability was exceptionally dominant, being selected in 9 out of the 10 iterations. Similarly, for the T2-FLAIR sequence, ET_GLCM_Bins-16_Radius-1_Correlation was a highly stable predictor, selected in 8 of the 10 iterations. For the T1 sequence, the most frequently selected feature was NC_Histogram_Bins-16_Bins-16_Bin-9_Probability, selected 6 times. In the T2 sequence, two features, ED_Histogram_Bins-16_Bins-16_TenthPercentile and ED_NGTDM_Complexity, shared the highest selection frequency, each chosen 4 times. Regarding the diversity of selected features, the T1-GD and T2 sequences each had 12 unique features that were selected at least once during the modeling process. In contrast, the T2-FLAIR sequence provided the most concentrated set of predictors, with only 6 unique features being selected in total across all iterations.

Table 5
www.frontiersin.org

Table 5. The selected T1 and TA-GD features for the construction of multi-sequence models.

Table 6
www.frontiersin.org

Table 6. The selected T2 and T2-FLAIR features for the construction of multi-sequence models.

Discussion

In this study, we conducted a systematic comparison of quantitative radiomic features derived from four structural MRI sequences to evaluate their independent and combined prognostic value for survival classification in glioblastoma. Our analysis yielded two principal findings. First, among the models built from individual sequences, the T2-weighted sequence demonstrated the highest prognostic performance. Second, and most importantly, our results revealed a notable divergence in model performance between the training and validation cohorts. While aggregating data from all four sequences achieved the highest performance on the training data, this advantage did not generalize to the validation set. In the validation cohorts, the dual-sequence model that strategically combined features from T1-Gd and T2 sequences yielded the highest prognostic accuracy among all constructed models, even outperforming the more complex four-sequence model.

The strong performance of the T1-Gd and T2 dual-sequence model may suggest a potential synergistic relationship between the biological information these sequences capture. Glioblastoma has two key pathological hallmarks. One is a highly proliferative core with a compromised blood-brain barrier. The other is an aggressively infiltrative margin extending into the brain parenchyma (1). The T1-Gd sequence is effective at delineating the enhancing tumor core, which reflects active neovascularity. In contrast, the T2-weighted sequence is highly sensitive to edema and non-enhancing tumor infiltration (7). Therefore, combining T1-Gd and T2 could provide a more complete radiomic signature. This signature may capture both the tumor’s aggressive core and its invasive periphery, which are critical factors for patient outcome.

Our finding that the four-sequence model was not optimal in the validation set highlights a crucial concept in radiomics. Adding more data does not always lead to better results due to the risk of information redundancy. For instance, sequences like T2 and T2-FLAIR both assess edema and may offer overlapping information. This could introduce noise and increase model complexity without adding significant prognostic value. This observation is consistent with other studies. These studies have also reported that models from a limited set of sequences can perform as well as, or even better than, those using a full multiparametric protocol (27). This emphasizes that strategic sequence selection may be more critical than simply aggregating all available data.

Our findings extend the existing GBM radiomics literature by focusing on sequence contribution under a controlled analytic framework. Prior studies have frequently adopted multi sequence protocols. Yet reduced set versus full protocol comparisons for survival stratification have not always been evaluated systematically within the same modeling constraints. In this study, we fixed the classifier and the number of selected features across settings. We also applied repeated splits. This design reduces methodological confounding and allows a clearer assessment of the incremental information provided by each sequence and fusion strategy. The observation that T1 Gd plus T2 can match or exceed four sequence fusion therefore supports task specific and purpose-oriented sequence selection rather than default comprehensive fusion.

A key strength of this study is its systematic and rigorous comparative framework. Many radiomics studies adopt a fusion-based approach, often assuming that combining all available imaging data will yield the best results. Our work challenges this assumption. Instead of only presenting a final fused model, we meticulously dissected the prognostic contribution of each individual MRI sequence. This head-to-head comparison provides a more granular understanding of which sequences offer the most valuable or potentially redundant information. Methodologically, the robustness of our findings is enhanced by several factors. First, the use of a publicly available dataset ensures transparency and allows for greater reproducibility of our results. Second, we implemented a repeated random sampling validation strategy, iterating the entire modeling process 10 times to ensure that our performance estimates are stable and not due to a single favorable data split. Finally, by fixing the feature subset size to 10 for all multivariate models, we established an equitable framework that allows for a fair and direct comparison of the predictive power inherent to different MRI feature sets. Together, these strengths ensure that our conclusions are based on a robust and unbiased analytical foundation.

By the feature comparison of the multi-sequence models, the most stable feature was ET_Histogram_Bins-16_Bins-16_Bin-13_Probability, which was selected in 9 of 10 splits. As a first order histogram descriptor within the enhancing tumor, this feature may capture the distribution of enhancement intensities. This may reflect vascular permeability and necrotic or viable tumor admixture. These processes are closely linked to more aggressive tumor behavior. In addition, the most stable T2 FLAIR feature was ET_GLCM_Bins-16_Radius-1_Correlation, selected in 8 of 10 splits. GLCM correlation is a texture measure that reflects local spatial dependency of signal. In this context, it may serve as a proxy for microstructural heterogeneity within or adjacent to the enhancing component as depicted on FLAIR. Nevertheless, we would like to emphasize that these interpretations remain theoretical and hypothesis driven in the context of the present study. We did not perform direct biological validation or imaging pathology correlation. Although prior studies have begun to explore associations between radiomic signatures and underlying tumor biology, such evidence remains limited in GBM and is still insufficient to establish robust, generalizable mechanistic explanations (28, 29). We also outline that future work will prioritize biological correlation studies, ideally integrating radiomics with molecular markers and pathology to validate and refine the proposed interpretations.

Several of the features that were repeatedly selected across splits belonged to first order and histogram-based categories. These features summarize the global distribution of voxel intensities within the tumor related regions. Their stable selection suggests that coarse intensity patterns may carry consistent prognostic signals in this cohort. On T1 Gd, histogram features may reflect the degree and burden of contrast enhancement. This can be linked to vascular permeability and blood brain barrier disruption. On T2, similar features may capture the extent and heterogeneity of hyperintense components. This may relate to edema, infiltration, or necrotic and cystic changes. Nevertheless, these interpretations are indirect. They should be considered hypothesis generating. Future studies that integrate radiomics with molecular markers and pathology correlations will be important to refine the biological meaning of these imaging signatures.

Despite the promising findings, this study has several limitations that must be acknowledged. First, the analysis was conducted on a retrospective dataset of limited size from a single public archive. As we noted, the performance gap between our training and validation cohorts suggests a risk of model overfitting, which is a common challenge when working with small and potentially heterogeneous datasets. While we used repeated sub-sampling to mitigate this, the generalizability of our findings must be confirmed through external validation on a larger, prospective, multi-center cohort. A further limitation relates to the use of external segmentations and pre computed features. We did not independently evaluate segmentation accuracy. We also could not quantify inter rater variability within this study. Any systematic bias in the original segmentation strategy may influence the absolute performance of our models. This should be considered when interpreting the reported AUC values. At the same time, our main objective was a controlled comparison of sequence specific and fusion based feature sets. All models in our framework relied on the same upstream segmentations. This consistency helps reduce confounding in the relative comparisons across sequences. Future studies should assess segmentation sensitivity more directly. Further, we also acknowledge the use of a binary one-year survival endpoint. The main aim of this study was to examine the prognostic contribution of different MRI sequences under a controlled framework. It was not to build the best possible survival model. This simplified endpoint allowed fair comparisons across sequence combinations with fixed feature numbers and the same classifier. Nevertheless, time to event models may provide richer prognostic information and should be evaluated in future work.

In terms of clinical translation, the proposed dual sequence strategy using T1 Gd and T2 may offer a practical pathway for radiomics based survival stratification in GBM. Both sequences are routinely included in standard preoperative MRI protocols. This reduces barriers to implementation. It also avoids the need for additional scanning. If validated prospectively, this approach could be integrated into preoperative planning. It could serve as a decision support component. The model could provide an interpretable risk category. This may inform multidisciplinary discussions. It may also support patient counseling and trial stratification. Our findings suggest that a targeted two sequence fusion can achieve comparable generalization performance relative to full four sequence fusion in the validation setting. The performance may be slightly improved. This observation supports clinically streamlined and purpose-oriented sequence selection for specific predictive tasks. Future studies should prioritize prospective and ideally multicenter evaluation. These efforts should focus on workflow standardization and real-world feasibility. They should also assess robustness and interpretability in routine preoperative decision making. Nevertheless, we also emphasize that the present study is retrospective. It is based on publicly available multicenter data. Prospective validation will be essential before clinical deployment. This validation should use harmonized acquisition, segmentation, and feature harmonization pipelines. Future work will focus on establishing a standardized prospective workflow. We will also evaluate performance across centers in real world settings.

The findings from this study have several important implications for future research and potential clinical applications. Our results, which highlight the superior prognostic performance of a T1-Gd and T2 feature set, suggest that it may be possible to streamline MRI acquisition protocols for radiomic analyses in glioblastoma. If validated, a more focused imaging protocol could reduce patient scan times, lower costs, and improve the efficiency of data collection for clinical trials without sacrificing prognostic power. Furthermore, it is crucial to recognize that our conclusions are specific to this particular dataset and clinical objective. The applicability of these findings to other tumor types or different clinical tasks remains uncertain. Therefore, future research should incorporate more diverse datasets to investigate whether the superiority of the T1-Gd and T2 combination holds true across a broader range of oncological applications.

Our conclusions are based on glioblastoma and conventional structural MRI. The observed ranking of sequences and fusion strategies may not generalize directly to other glioma subtypes. Direct extrapolation may be insufficiently rigorous without dedicated validation. Lower grade gliomas differ in enhancement, infiltration, and molecular context. These factors may alter which sequences are most informative. Even so, our findings support a purpose oriented and streamlined approach to sequence selection. They suggest that adding more sequences is not automatically beneficial. Diffusion and perfusion imaging may provide complementary microstructural and hemodynamic information. Future work should test standardized integration of these modalities with structural radiomics across glioma subtypes.

Conclusions

In this study, we systematically investigated the independent and combined prognostic value of radiomic features from four standard MRI sequences for survival classification in glioblastoma. Our results demonstrated that while the T2-weighted sequence provided the most performant single-sequence model, a dual-sequence fusion of features from T1-Gd and T2 images yielded the highest predictive accuracy overall. Notably, this targeted two-sequence combination outperformed the more complex model that aggregated features from all four sequences. In conclusion, our findings suggest that for glioblastoma survival prediction, the strategic selection of complementary imaging sequences, specifically the combination of T1-Gd and T2, is more effective than an indiscriminate aggregation of all available MRI data.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding author.

Ethics statement

The dataset was public via TCIA. The original protocol was approved by the Institutional Review Board at the UPHS, and informed consent was obtained from all participants. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.

Author contributions

RG: Data curation, Formal Analysis, Investigation, Writing – original draft, Writing – review & editing. YG: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declared that financial support was not received for this work and/or its publication.

Conflict of interest

The authors declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Louis DN, Perry A, Wesseling P, Brat DJ, Cree IA, Figarella-Branger D, et al. The 2021 WHO classification of tumors of the central nervous system: a summary. Neuro-oncology. (2021) 23:1231–51. doi: 10.1093/neuonc/noab106

PubMed Abstract | Crossref Full Text | Google Scholar

2. Ostrom QT, Price M, Neff C, Cioffi G, Waite KA, Kruchko C, et al. CBTRUS statistical report: primary brain and other central nervous system tumors diagnosed in the United States in 2015–2019. Neuro-oncology. (2022) 24:v1–v95. doi: 10.1093/neuonc/noac202

PubMed Abstract | Crossref Full Text | Google Scholar

3. Stupp R, Mason WP, Van Den Bent MJ, Weller M, Fisher B, Taphoorn MJ, et al. Radiotherapy plus concomitant and adjuvant temozolomide for glioblastoma. New Engl J Med. (2005) 352:987–96. doi: 10.1056/NEJMoa043330

PubMed Abstract | Crossref Full Text | Google Scholar

4. Thakkar JP, Dolecek TA, Horbinski C, Ostrom QT, Lightner DD, Barnholtz-Sloan JS, et al. Epidemiologic and molecular prognostic review of glioblastoma. Cancer Epidemiol Biomarkers Prev. (2014) 23:1985–96. doi: 10.1158/1055-9965.EPI-14-0275

PubMed Abstract | Crossref Full Text | Google Scholar

5. Ellingson BM, Bendszus M, Boxerman J, Barboriak D, Erickson BJ, Smits M, et al. Consensus recommendations for a standardized Brain Tumor Imaging Protocol in clinical trials. Neuro-oncology. (2015) 17:1188–98. doi: 10.1093/neuonc/nov095

PubMed Abstract | Crossref Full Text | Google Scholar

6. Eckel-Passow JE, Lachance DH, Molinaro AM, Walsh KM, Decker PA, Sicotte H, et al. Glioma groups based on 1p/19q, IDH, and TERT promoter mutations in tumors. New Engl J Med. (2015) 372:2499–508. doi: 10.1056/NEJMoa1407279

PubMed Abstract | Crossref Full Text | Google Scholar

7. Pope WB. Brain metastases: neuroimaging. Handb Clin Neurol. (2018) 149:89–112. doi: 10.1016/B978-0-12-811161-1.00007-4

PubMed Abstract | Crossref Full Text | Google Scholar

8. Choi YH, Kim J-E, Lee RW, Kim B, Shin HC, Choe M, et al. Histopathological correlations of CT-based radiomics imaging biomarkers in native kidney biopsy. BMC Med Imaging. (2024) 24:256. doi: 10.1186/s12880-024-01434-x

PubMed Abstract | Crossref Full Text | Google Scholar

9. Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, Van Stiphout RGPM, Granton P, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer. (2012) 48:441–6. doi: 10.1016/j.ejca.2011.11.036

PubMed Abstract | Crossref Full Text | Google Scholar

10. Xu L, Yang P, Liang W, Liu W, Wang W, Luo C, et al. A radiomics approach based on support vector machine using MR images for preoperative lymph node status evaluation in intrahepatic cholangiocarcinoma. Theranostics. (2019) 9:5374. doi: 10.7150/thno.34149

PubMed Abstract | Crossref Full Text | Google Scholar

11. Kickingereder P, Burth S, Wick A, Götz M, Eidel O, Schlemmer H-P, et al. Radiomic profiling of glioblastoma: identifying an imaging predictor of patient survival with improved performance over established clinical and radiologic risk models. Radiology. (2016) 280:880–9. doi: 10.1148/radiol.2016160845

PubMed Abstract | Crossref Full Text | Google Scholar

12. Xu L, Zhao W, Guo R, and Huang X. Exploring the associations between features from multi-parametric MR images in Glioblastoma using radiomics. BMC Med Imaging. (2025) 25:253. doi: 10.1186/s12880-025-01788-w

PubMed Abstract | Crossref Full Text | Google Scholar

13. Yoo JJ, Namdar K, Carey S, Fischer SE, McIntosh C, Khalvati F, et al. Non-invasive liver fibrosis screening on CT images using radiomics. BMC Med Imaging. (2025) 25:285. doi: 10.1186/s12880-025-01823-w

PubMed Abstract | Crossref Full Text | Google Scholar

14. Kaufmann TJ, Smits M, Boxerman J, Huang R, Barboriak DP, Weller M, et al. Consensus recommendations for a standardized brain tumor imaging protocol for clinical trials in brain metastases. Neuro-oncology. (2020) 22:757–72. doi: 10.1093/neuonc/noaa030

PubMed Abstract | Crossref Full Text | Google Scholar

15. Wen PY, Macdonald DR, Reardon DA, Cloughesy TF, Sorensen AG, Galanis E, et al. Updated response assessment criteria for high-grade gliomas: response assessment in neuro-oncology working group. J Clin Oncol. (2010) 28:1963–72. doi: 10.1200/JCO.2009.26.3541

PubMed Abstract | Crossref Full Text | Google Scholar

16. Tian Q, Yan LF, Zhang X, Zhang X, Hu YC, Han Y, et al. Radiomics strategy for glioma grading using texture features from multiparametric MRI. J Magn Reson Imaging. (2018) 48:1518–28. doi: 10.1002/jmri.26010

PubMed Abstract | Crossref Full Text | Google Scholar

17. Zhang X, Lu H, Tian Q, Feng N, Yin L, Xu X, et al. A radiomics nomogram based on multiparametric MRI might stratify glioblastoma patients according to survival. Eur Radiol. (2019) 29:5528–38. doi: 10.1007/s00330-019-06069-z

PubMed Abstract | Crossref Full Text | Google Scholar

18. Bakas S, Sako C, Akbari H, Bilello M, Sotiras A, Shukla G, et al. The University of Pennsylvania glioblastoma (UPenn-GBM) cohort: advanced MRI, clinical, genomics, & radiomics. Sci Data. (2022) 9:453. doi: 10.1038/s41597-022-01560-7

PubMed Abstract | Crossref Full Text | Google Scholar

19. Clark K, Vendt B, Smith K, Freymann J, Kirby J, Koppel P, et al. The Cancer Imaging Archive (TCIA): maintaining and operating a public information repository. J Digital Imaging. (2013) 26:1045–57. doi: 10.1007/s10278-013-9622-7

PubMed Abstract | Crossref Full Text | Google Scholar

20. Hajian-Tilaki K. Receiver operating characteristic (ROC) curve analysis for medical diagnostic test evaluation. Caspian J Internal Med. (2013) 4:627.

Google Scholar

21. DeLong ER, DeLong DM, and Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. (1988) 44:837–45. doi: 10.2307/2531595

Crossref Full Text | Google Scholar

22. Benjamini Y and Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Society: Ser B (Methodol). (1995) 57:289–300. doi: 10.1111/j.2517-6161.1995.tb02031.x

Crossref Full Text | Google Scholar

23. Xu L, Yang P, Hu K, Wu Y, Xu-Welliver M, Wan Y, et al. Prediction of neoadjuvant chemotherapy response in high-grade osteosarcoma: added value of non-tumorous bone radiomics using CT images. Quant Imaging Med Surg. (2021) 11:1184. doi: 10.21037/qims-20-681

PubMed Abstract | Crossref Full Text | Google Scholar

24. Yang J, Xu L, Yang P, Wan Y, Luo C, Yen EA, et al. Generalized methodology for radiomic feature selection and modeling in predicting clinical outcomes. Phys Med Biol. (2021) 66:215005. doi: 10.1088/1361-6560/ac2ea5

PubMed Abstract | Crossref Full Text | Google Scholar

25. Peng H, Long F, and Ding C. Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans Pattern Anal Mach Intell. (2005) 27:1226–38. doi: 10.1109/TPAMI.2005.159

PubMed Abstract | Crossref Full Text | Google Scholar

26. De Jay N, Papillon-Cavanagh S, Olsen C, El-Hachem N, Bontempi G, and Haibe-Kains B. mRMRe: an R package for parallelized mRMR ensemble feature selection. Bioinformatics. (2013) 29:2365–8. doi: 10.1093/bioinformatics/btt383

PubMed Abstract | Crossref Full Text | Google Scholar

27. Bathla G, Priya S, Liu Y, Ward C, Le NH, Soni N, et al. Radiomics-based differentiation between glioblastoma and primary central nervous system lymphoma: a comparison of diagnostic performance across different MRI sequences and machine learning techniques. Eur Radiol. (2021) 31:8703–13. doi: 10.1007/s00330-021-07845-6

PubMed Abstract | Crossref Full Text | Google Scholar

28. Grossmann P, Stringfield O, El-Hachem N, Bui MM, Rios Velazquez E, Parmar C, et al. Defining the biological basis of radiomic phenotypes in lung cancer. elife. (2017) 6:e23421. doi: 10.7554/eLife.23421

PubMed Abstract | Crossref Full Text | Google Scholar

29. Xu L, Wu Y, Shen X, Zhou L, Lu Y, Teng Z, et al. Exploring the biological basis of CT imaging features in pancreatic neuroendocrine tumors: a two-center study. Phys Med Biol. (2024) 69:125013. doi: 10.1088/1361-6560/ad51c7

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: classification comparison, quantitative imaging features, radiomic, structural MRI, survival

Citation: Guo R and Gao Y (2026) Evaluating sequence contributions to MRI radiomics for glioblastoma survival: single vs fusion models. Front. Oncol. 15:1743451. doi: 10.3389/fonc.2025.1743451

Received: 11 November 2025; Accepted: 18 December 2025; Revised: 15 December 2025;
Published: 12 January 2026.

Edited by:

Morteza Esmaeili, University of Stavanger, Norway

Reviewed by:

Zhenyu Gong, Sun Yat-sen University, China
Yunus Soleymani, Tehran University of Medical Sciences, Iran

Copyright © 2026 Guo and Gao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ya Gao, Z2FveWFzbWhAMTYzLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.