Predictive performance of MRI and CT radiomics in predicting the response to induction chemotherapy in nasopharyngeal carcinoma: a network meta-analysis

Jian, Yongjie; Peng, Jiaxuan; Yang, Gan; He, Xiaojuan; Wang, Jing; Yin, Jun; Shi, Hui; Tao, Di; Lan, Qiyu; Yang, Zuogang; Shu, Zhenyu

doi:10.3389/fonc.2025.1590420

SYSTEMATIC REVIEW article

Front. Oncol., 13 November 2025

Sec. Head and Neck Cancer

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1590420

This article is part of the Research TopicBased Models and Machine Learning on CT, MRI and PET-CT in Head and Neck Cancer Diagnosis, Staging and Outcome PredictionView all 5 articles

Predictive performance of MRI and CT radiomics in predicting the response to induction chemotherapy in nasopharyngeal carcinoma: a network meta-analysis

Yongjie Jian^1,2†

Jiaxuan Peng^3†

Gan Yang^4†

Xiaojuan He¹

Jing Wang⁵

Jun Yin⁶

Hui Shi¹

Di Tao¹

Qiyu Lan⁷

Zuogang Yang⁸

Zhenyu Shu^3*

¹Department of Radiology, Affiliated Hospital of Sichuan Nursing Vocational College, The Third People’s Hospital of Sichuan Province, Chengdu, Sichuan, China
²Jinzhou Medical University Postgraduate Training Base (Zhejiang Provincial People’s Hospital, People’s Hospital of Hangzhou Medical College), Hangzhou, Zhejiang, China
³Center for Rehabilitation Medicine, Department of Radiology, Zhejiang Provincial People’s Hospital, Affiliated People’s Hospital, Hangzhou Medical College, Hangzhou, Zhejiang, China
⁴Department of General Surgery, Shehong People’s Hospital, Suining, Sichuan, China
⁵Department of Medical Technology, Sichuan Nursing Vocational College, Chengdu, Sichuan, China
⁶Department of Oncology, Affiliated Hospital of Sichuan Nursing Vocational College, The Third People’s Hospital of Sichuan Province, Chengdu, Sichuan, China
⁷Department of Ultrasound, Affiliated Hospital of Sichuan Nursing Vocational College, The Third People’s Hospital of Sichuan Province, Chengdu, Sichuan, China
⁸Department of Radiology, Rangtang Country People’s Hospital, Aba Tibetan and Qiang Autonomous Prefecture, Sichuan, China

Objectives: To evaluate the accuracy of different radiomics methods in predicting the response of nasopharyngeal carcinoma (NPC) to induction chemotherapy (IC).

Methods: A systematic search was conducted in PubMed, Embase, Web of Science, and Cochrane Library. Radiomics studies utilizing CT and MRI were included in this network meta-analysis. The quality of the studies was appraised via the PROBAST, RQS, and IBSI guidelines. The sensitivity, specificity, and accuracy of different radiomics models were analyzed.

Results: Ten eligible studies involving 1550 subjects were included. The pooled sensitivity and specificity of the radiomics models were 0.86 (95% CI: 0.78-0.91) and 0.69 (95% CI: 0.62-0.75), respectively. The AUC based on the SROC curve was 0.83 (95% CI: 0.70-0.91). The predictive performance of each model was rated using SUCRA values. The MRI-based support vector machine radiomics model had the highest specificity, and accuracy, at 80.7% and 73.2%, respectively. The MRI-based SVM radiomics combined with clinical features model had the highest sensitivity (82.0%). Among the CT methods, the deep learning (DL)-based convolutional neural network model had the highest sensitivity, and accuracy, at 51.0% and 44.9%, respectively. The PROBAST showed that 7 studies were at risk for bias.

Conclusion: This study synthesized existing evidence to confirm that radiomics serves as a viable exploratory tool for predicting IC efficacy in NPC. MRI-based nonlinear models and clinical-radiomics fusion models exhibit considerable promise, whereas clinical translation necessitates three critical steps: (1) standardized protocols following IBSI/METRICS/RQS guidelines; (2) prospective multicenter validation; and (3) investigating tumor microenvironment mechanisms. These measures will facilitate the transition of radiomics from technical exploration to clinical utility.

Systematic Review Registration: https://www.crd.york.ac.uk/prospero/, identifier CRD42024509331.

1 Introduction

Nasopharyngeal carcinoma (NPC), found mostly in Asia, is a form of head and neck cancer originating from the epithelium of the nasopharynx (1), with approximately 75% of cases being diagnosed at the locally advanced stage (2). Induction chemotherapy (IC) is also called neoadjuvant chemotherapy (3). According to the 2022 National Comprehensive Cancer Network (NCCN) guidelines, IC plus concurrent chemoradiotherapy (CCRT) has been listed as a level 1 recommendation for locoregionally advanced NPC (LA-NPC) patients (4). Several retrospective studies have investigated the effectiveness of IC plus intensity-modulated radiotherapy (IMRT) vs. IC plus CCRT for LA-NPC, but the conclusions have been mixed (5–8). Despite the use of standard radiotherapy and chemotherapy following the guidelines on tumor-node-metastasis (TNM) stage, the 5-year treatment failure rate of LA-NPC is still up to 30% (9). This may be because the TNM staging system is only based on anatomical information provided by imaging and ignores intratumoral heterogeneity, resulting in an inability to accurately stratify the risk of LA-NPC (10, 11). Therefore, in clinical practice, identifying the heterogeneity of characteristics within tumors and screening patients who are sensitive to IC are essential.

Radiomics noninvasively characterizes tumor heterogeneity by transforming medical images into high-dimensional quantitative features (12–15). Relevant studies have shown that radiomics can be used to noninvasively evaluate the IC response and provide additional benefits for NPC patients (16–21). Machine learning (ML) serves as the foundational analytical framework, among which the support vector machine (SVM) is a typical method of ML (22). Fully connected neural network (FCNN) model complex nonlinear relationships through simulated biological propagation (23, 24). Deep learning (DL) (a subset of ML) (25), where convolutional neural networks (CNN) excel in medical image pattern recognition. These approaches (SVM/FCNN/CNN) have been successfully implemented for NPC staging, treatment response prediction, and outcome prognostication (26–31), underscoring the translational potential of ML-driven radiomics in precision NPC oncology.

Systematic reviews/meta-analyses of radiomics-based prognostic studies in NPC have validated the utility of radiomics, yet their limitations warrant emphasis. Deng et al. (32) reported that the combined AUC value of radiomics for predicting NPC prognosis was 0.8265. Wang et al. (33) evaluated MRI-based radiomics for predicting local recurrence-free survival (LRFS), distant metastasis-free survival (DMFS), progression-free survival (PFS), and overall survival (OS). Meanwhile, Lee et al. (34) focused on the performance of MRI-based radiomics in PFS prediction, yielding a pooled c-index of 0.762. Yang et al. (31) focus on evaluating radiomics for assessing induction chemotherapy (IC) efficacy in NPC, the overall AUC was 0.91. Collectively, existing research is limited by high heterogeneity, potential biases, data gaps in key subgroups, and retrospective designs, which may undermine the generalizability of conclusions.

This study rigorously assessed the methodological quality of included studies and conducted the first network meta-analysis (NMA) to compare the predictive efficacy of diverse radiomics algorithms for IC efficacy in NPC, thereby establishing a hierarchy of evidence to support clinical decision-making for response-adaptive therapy.

2 Materials and methods

2.1 Program and registration

This study was conducted according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines (35). The original study protocol was registered with PROSPERO before initiating the systematic search as a priori study design (CRD42024509331). The study utilized existing published data, eliminating the need for additional ethical approval and informed consent procedures.

2.2 Literature search

The PubMed, Embase, Web of Science, and Cochrane Library databases were searched from inception until May 25, 2024. The search was conducted independently by two researchers (H.S. and Q.L.) following the Cochrane Handbook for Systematic Reviews of Interventions, and discrepancies or uncertainties in the articles were resolved through discussion or consultation with a third reviewer (J.P.). Literature was searched manually via different combinations of free words and MeSH terms. The search terms included “nasopharyngeal carcinoma”, “machine learning”, “deep learning”, “radiomics”, and their variations. See Supplementary Material Table S1 for a detailed search strategy for MeSH terms.

2.3 Inclusion and exclusion criteria

Inclusion criteria: Patients who were diagnosed with NPC and who received IC, regardless of age, sex, race, or country. Research type: Imaging analysis using radiomics to predict the response to IC in NPC. Results: Sensitivity, specificity, and accuracy. Exclusion criteria: Insufficient data integrity to extract two-by-two data tables, conference reports, systematic reviews, summary articles, non-English articles, or conference proceedings.

2.4 Data extraction

The following data were extracted from the included articles in a standardized format: (1) study characteristics (first author, year of publication, nationality, affiliation, study period, and study design); (2) cohort characteristics (mean age, patient numbers, including the numbers of patients in the training and validation cohorts, sex, cancer stages, number of patients with effective treatment, and examination methods); and (3) image feature analysis (segmentation software, segmentation method, radiomics software, feature selection method, type of features, number of image features selected, type of validation, and type of algorithm).

For each study, true positive (TP), false positive (FP), false negative (FN), and true negative (TN) values were extracted. TP was defined as no response to treatment, and TN was defined as effective treatment. If there were multiple imaging models of different types in a study, among the models of the same type, the model with the highest AUC value that could extract data was selected. The performance metrics of the external validation cohort (or internal validation cohort, if the former was absent) were recorded, and a two-by-two table was constructed. If the study did not report these values, a two-by-two table from the diagnostic estimates presented in the article for each index test was constructed.

2.5 Quality assessment

The Prediction Model Risk of Bias Assessment Tool (PROBAST) checklist was used to assess the risk of bias and applicability (36). Two authors (H.S. and Q.L.) independently assessed the presence of bias and concerns regarding the applicability of the studies, and any differences were resolved by consensus or with the participation of a third reviewer (J.P.) if necessary. The radiomics quality score (RQS) (37) was also applied by two authors (H.S. and Q.L.) to gauge the methodological soundness of the radiomics studies, encompassing image acquisition to validation, ensuring the dependability of the findings. The IBSI guideline provides a comprehensive and unified reporting checklist for radiomics studies (38). Since many items in the IBSI checklist overlap with those in the RQS checklist, two authors (H.S. and Q.L.) only evaluated items relevant to image pre-processing steps.

2.6 Statistical analysis

This NMA was conducted within a Bayesian framework via Markov chain Monte Carlo subset simulations (39) according to the PRISMA NMA guidelines (40), and the data of the different models included in the study were directly and indirectly compared. First, traditional meta-analysis was conducted via a bivariate model of pooled sensitivity and specificity, which was visualized via forest plots and summary receiver operating characteristic (SROC) curves. Heterogeneity between the included studies was assessed by Cochrane’s Q test and I², with I² > 75% indicating greater heterogeneity (41). To further investigate the sources of heterogeneity, meta-regression and subgroup analyses were performed. The following factors were considered as potential sources of heterogeneity: device type (MRI vs. CT), events per variable (EPV) (≥10 vs. <10), ROI structure (2D vs. 3D), model validation (external validation vs. others), algorithms (linear vs. nonlinear), feature quantity after feature dimensionality reduction (≥10 vs. <10), RQS score (≥16.2 vs. <16.2), and sample size (≥200 vs. <200). The network was next drawn, where nodes represent each method and lines represent direct comparisons. The node size was correlated with the number of studies. The line thickness represents the number of comparisons between two studies. The control group was established as the clinical gold standard based on RECIST 1.1 criteria, primarily serving dual purposes: (1) evaluating lesion remission status following IC and (2) Providing a benchmark for comparative validation of predictive models. The data likelihood was specified using a binomial distribution to model the TP and TN counts for each study; a logit link function was employed to transform the accuracy metrics; and the random effects covariance structure was modeled via a multivariate normal distribution to characterize the correlation between sensitivity and specificity. We implemented the node-splitting method to estimate the consistency between the direct and indirect evidence for the whole study and then chose the consistency model or the inconsistency model based on the results (42). To estimate the predictive performance and ranking probability for each model, we used the surface under the cumulative ranking curve (SUCRA). The SUCRA percentages range from 0 to 1, with higher values indicating a greater likelihood of being the best prediction model (43). The predictive performance of the different models was judged by analyzing the sensitivity, specificity, and accuracy indicators. We tested the robustness of the results by sensitivity analysis. In addition, we constructed comparison-adjusted funnel plots and Deek’s test to assess potential publication bias.

Traditional meta-analysis was performed using the “midas” and “metan” package in Stata (version 17.0; Stata Corporation, College Station, Texas, USA). Bayesian NMA and associated graphical analyses were implemented via the “network” package within the same Stata framework.

2.7 Evaluation criteria for IC

The response evaluation of IC in all included studies was based on the RECIST version 1.1 (44). Complete response (CR) and partial response (PR) were defined as effective treatment, whereas stable disease (SD) and progressive disease (PD) were defined as no response to treatment. According to this standard, a control group (the gold standard) was established to evaluate whether the lesion was in remission and to compare and verify different models.

3 Results

3.1 Literature selection

The initial search for this study yielded a total of 1245 index records. After removing 633 duplicate articles, further review focused on titles and abstracts and excluded an additional 584 unrelated articles. The remaining 28 articles were subsequently evaluated more rigorously, including the accessibility of the full-text version and the feasibility of data extraction, and 10 full-text studies that met the predetermined inclusion criteria were identified for evaluation. The PRISMA flowchart of the selection process is shown in Figure 1.

Figure 1

Flowchart illustrating the selection process for a systematic review. Initial database search identified 1,245 records: PubMed (317), Embase (499), Web of Science (400), Cochrane Library (29). After removing 633 duplicates, 612 records were screened; 584 were excluded after title and abstract screening. Full-text eligibility was assessed for 28 articles; 18 were excluded (reasons: unavailable full-text, different outcome indicators, missing data). Finally, 10 articles were included in the network meta-analysis.

Figure 1. Flow chart of the study selection process.

3.2 Baseline characteristics of the included studies

These 10 articles included data from 1550 subjects. The baseline characteristics of the included studies are summarized in Table 1. Imaging methods included MRI and CT. The models were divided into the following seven types according to different features: (1) the radiomics combined with linear algorithms (logistic regression, LASSO) was used to construct a Radiomics model; (2) the SVM-based radiomics method was used to construct an SVM model; (3) the FCNN-based radiomics method was used to construct a FCNN model; (4) the pretrained CNN of DL was used to construct a CNN model; (5) the radiomics combined with linear algorithms and clinical features was used to construct a Radiomics-Clinical model; (6) the SVM-based radiomics combined with clinical features was used to construct an SVM-Clinical model; and (7) the SVM-based radiomics combined with CNN was used to construct an SVM-CNN model. According to the imaging methods and model types, the following 9 prediction models were included in this study: the MRI Radiomics model, the MRI Radiomics-Clinical model, the MRI SVM model, the MRI SVM-Clinical model, the MRI FCNN model, the CT SVM model, the CT SVM-Clinical model, the CT CNN model, and the CT SVM-CNN model.

Table 1

Table 1. Baseline characteristics of the included studies.

3.3 Image analyses

Detailed information on the image analyses included in the study and the predictive performance measures of the models are summarized in Table 2 and Supplementary Table S2. With respect to the selection of the region of interest (ROI), the primary and lymph node gross tumor volume (GTV) was segmented in one study (17), whole tumors were segmented in seven studies (16, 18, 21, 27, 29, 31, 45), and only the largest axial slice was segmented in two studies (19, 20). The number of image features selected for the prediction models analyzed in this study ranged from 2 to 24. Except for one study that did not report model validation details (20), internal validation was performed in the remaining studies.

Table 2

Table 2. Methodological characteristics and predictive performance of included radiomics models.

3.4 Quality assessment

The PROBAST assessment revealed that the overall risk of bias (ROB) was low in three studies (17, 20, 21). In the analysis, the number of EPV was less than 10 in six studies (16, 18, 19, 21, 29, 45), and one study (31) selected predictors based on univariate analysis, which had a high risk of bias; thus, the overall risk of bias was high. Another study (27) revealed that, among participants, the inclusion and exclusion criteria for all participants were vague; thus, the risk of bias was unclear, and the EPV in the analysis was less than 10, resulting in an overall high risk of bias. The overall applicability of all the studies’ concerns was low (Figure 2). The average RQS for the 10 studies was 16.2 (45.0%). The pre-processing steps were carried out following the IBSI guidelines, with an overall adherence rate of 60.0% (42/70). Detailed evaluations are available in Supplementary Material (Supplementary Tables S3, S4, S5).

Figure 2

Assessment table of ten studies, evaluating risk of bias and applicability concerns across categories. Most cells are green with a negative sign, indicating low concern. Zhao 2020 shows yellow with a question mark under “Participants” for risk of bias. Wang 2018, Zhang 2020, Yang 2022, Liao 2021, and Hu 2021 have red cells with a positive sign under “Analysis” for risk of bias and ROB for concerns.

Figure 2. PROBAST results of the included studies. +, high; -, low;?, unclear.

3.5 Traditional meta-analysis

In the 18 cohorts of 10 studies in which radiomics was used to predict the efficacy of IC in NPC, the pooled sensitivity and specificity were 0.86 (95% CI: 0.78-0.91) and 0.69 (95% CI: 0.62-0.75), respectively. From the plotted SROC curve, an AUC of 0.83 (95% CI: 0.70-0.91) was obtained. The forest plot of the sensitivity and specificity of the predictive performance of the radiomics model is shown in Figure 3A, and the SROC curve is shown in Figure 3B.

Figure 3

Panel A shows forest plots for sensitivity and specificity from multiple studies on MRI and CT modalities, with calculated random effects models. Panel B displays an SROC curve with observed data points, a summary operating point, and both 95% confidence and prediction contours, illustrating the diagnostic accuracy of these modalities with AUC, sensitivity, and specificity values.

Figure 3. (A) Forest plots of the pooled sensitivity and specificity. (B) SROC curve for the ability of radiomics to predict IC efficacy in NPC patients.

Methodologically, this study computed pooled HSROC AUC for individual models only when two or more studies were available. The MRI radiomics model (8 studies (16–21, 29, 45)) demonstrated pooled sensitivity of 0.88 (95% CI: 0.80–0.90), specificity of 0.70 (95% CI: 0.59–0.80), and HSROC AUC of 0.88 (95% CI: 0.84–0.90). Similarly, the MRI radiomics-clinical model (3 studies (21, 29, 45)) showed pooled sensitivity of 0.83 (95% CI: 0.71–0.90), specificity of 0.74 (95% CI: 0.47–0.90), and HSROC AUC of 0.84 (95% CI: 0.81–0.87). Detailed performance metrics are presented in Supplementary Material (Supplementary Figure S1). Models with fewer than two studies were excluded due to insufficient meta-analytic feasibility.

3.6 Heterogeneity exploration and meta-regression

The I² statistic reveals significant heterogeneity in sensitivity (I² = 78.31%) and moderate heterogeneity in specificity (I² = 74.55%) (Supplementary Table S6). As shown in Supplementary Tables S7, eight covariates were used to explore potential sources of heterogeneity. Meta-regression and joint model analysis indicated the following factors as contributors to significant heterogeneity in the meta-analysis: device type (MRI vs. CT) (P < 0.01), sample (≥200 vs. <200) (P = 0.02). The device type was a highly heterogeneous source (I² = 82.00%), while the CT equipment came from the same study. We therefore performed a subgroup analysis of multiple studies of MRI devices.

Results were analyzed in subgroups according to MRI device (Table 3). Compared to studies with an EPV <10, those with EPV≥10 showed lower pooled sensitivity (0.85 vs. 0.90, P = 0.01) but higher specificity (0.77 vs. 0.67, P = 0.68, nonsignificant). The sensitivity of 3D ROI was lower than that of 2D ROI (0.87 vs. 0.93, P = 0.03), and the specificity of 3D ROI was similar (0.71 vs. 0.70, P = 0.59, nonsignificant). Studies that validated the predictive performance of the model on external validation cohorts manifested lower sensitivity (0.87 vs. 0.93, P = 0.03) and similar specificity (0.71 vs. 0.70, P = 0.59, nonsignificant). Linear models demonstrated lower sensitivity than nonlinear models (0.87 vs. 0.97, P = 0.03) and similar specificity (0.71 vs. 0.70, P = 0.58, nonsignificant). Compared to using ≥10 features, those with < 10 features showed lower pooled sensitivity (0.85 vs. 0.90, P = 0.28, nonsignificant) but higher specificity (0.77 vs. 0.67, P = 0.04). Higher RQS correlated with decreased sensitivity (0.88 vs. 0.91, P = 0.02) and specificity (0.69 vs. 0.74, P = 0.15, nonsignificant). Notably, studies with larger sample sizes (≥200) showed higher pooled sensitivity (0.90 vs. 0.88, P = 0.15, nonsignificant) but lower specificity(0.61 vs. 0.76, P = 0.01) compared to smaller cohorts.

Table 3

Table 3. Investigation of heterogeneity using meta-regression and subgroup analysis of MRI devices.

3.7 Network evidence diagram

Figure 4 quantifies the density of direct comparative evidence within the NMA, demonstrating that both the MRI Radiomics model and MRI Radiomics-Clinical model have undergone extensive validation across multiple studies.

Figure 4

Network diagram illustrating nodes labeled A to J connected by edges of varying thickness. The thickness represents the weight of the connections, with thicker lines indicating higher weights. Node A has the largest size, followed by G. Most connections have a weight of one, while the link between nodes G and B has a weight of three. Blue nodes are prominently featured on a white background.

Figure 4. Diagram of the evidence network included in this study. (A), Control group (clinical gold standard for IC efficacy assessment based on RECIST 1.1); (B), MRI Radiomics-Clinical model; (C), CT CNN model; (D), CT SVM model; (E), CT SVM-Clinical model; (F), MRI FCNN model; (G), MRI Radiomics model; (H), MRI SVM model; (I), MRI SVM-Clinical model; (J), CT SVM-CNN model.

3.8 Consistency and inconsistency analysis

The sensitivity, specificity, and accuracy of all included studies were analyzed via inconsistency analysis employing the node-splitting method, and the results indicated consistency among the direct and indirect evidence of all outcomes (all p > 0.05). Therefore, the consistency model was applied in the current study.

3.9 Network meta-analysis

NMA revealed that the CT CNN model was superior to the CT SVM model (OR = 3.47, 95% CI: 1.16-10.38), the CT SVM-Clinical model (OR = 5.15, 95% CI: 1.75-15.15), and the CT SVM-CNN model (OR = 6.21, 95% CI: 2.12-18.23) in sensitivity. The MRI FCNN model was superior to the MRI Radiomics-Clinical model (OR = 2.71, 95% CI: 1.34-5.48) and the MRI Radiomics model (OR = 3.14, 95% CI: 1.55-6.33) in specificity. The MRI FCNN model was superior to the MRI Radiomics-Clinical model (OR = 2.72, 95% CI: 1.13-6.60) and the MRI Radiomics model (OR = 2.90, 95% CI: 1.20-6.99) in accuracy. The league table of the outcome indicators is shown in Supplementary Material Table S6.

3.10 SUCRA values

The SUCRA values for the 9 prediction models are summarized in Figure 5. The MRI SVM model had the highest specificity, and accuracy (SUCRA) at 80.7%, and 73.2%, respectively. The MRI SVM-Clinical model had the highest sensitivity (82.0%). The sensitivity, specificity, and accuracy of the MRI FCNN model ranked the top two, which were 76.7%, 68.6%, and 68.6%, respectively. Among the CT methods, the CNN model of DL had the highest SUCRA values for sensitivity, and accuracy at 51.0%, and 44.9%, respectively. The SUCRA curves are presented in Figure 6.

Figure 5

Heatmap comparing the performance of various models using sensitivity, specificity, and accuracy metrics. Models include MRI SVM-Clinical, MRI SVM, MRI FCNN, MRI Radiomics-Clinical, MRI Radiomics, CT CNN, CT SVM, CT SVM-Clinical, and CT SVM-CNN. Color gradient represents performance from worse to better with corresponding percentages in each cell.

Figure 5. The SUCRA values for the 9 prediction models with 3 endpoints.

Figure 6

Three line graphs illustrating the cumulative probability against rank for different models in terms of sensitivity, specificity, and accuracy. Each graph shows multiple models, differentiated by colored dashed lines. Models include MRI SVM-Clinical, CT CNN, CT SVM-Clinical, MRI Radiomics, among others, as detailed in the legend. Each model's performance varies across the graphs, reflecting differences in ranking for each metric.

Figure 6. Cumulative ranking probability plots for the 9 prediction models with 3 endpoints.

3.11 Sensitivity analyses

No significant changes were observed when each included study was eliminated from the analysis one by one. The results of sensitivity analyses for each study are shown in Figure 7.

Figure 7

Forest plot showing meta-analysis estimates for various studies related to MRI and CT radiomics. Each entry lists a study with the publication year and method, along with points representing estimates and lines indicating confidence intervals. Ranges are marked from approximately 6.11 to 22.85.

Figure 7. Results of sensitivity analyses.

3.12 Assessment of publication bias

The comparison-adjusted funnel plots (Figure 8) show roughly symmetrical scatter points of the same color, indicating the negligible presence of publication bias or other forms of bias within the studies, as confirmed by Deek’s test (P = 0.427 > 0.05). This symmetry bolstered the reliability of the findings.

Figure 8

Three funnel plots display the standard error of effect size versus effect size centered at comparison-specific pooled effect, labeled as Sensitivity, Specificity, and Accuracy. Each plot shows data points of various colors representing different comparisons between letters (e.g., A vs J). Dashed lines indicate reference limits, with a central vertical line at zero. A red line represents the line of best fit for each plot. The legend details the color-coded comparisons.

Figure 8. Comparison-adjusted funnel plot for 9 prediction models. Each data point represents a single study. (A), Control group (clinical gold standard for IC efficacy assessment based on RECIST 1.1). (B), MRI Radiomics-Clinical model; (C), CT CNN model; (D), CT SVM model; (E), CT SVM-Clinical model; (F), MRI FCNN model; (G), MRI Radiomics model; (H), MRI SVM model; (I), MRI SVM-Clinical model; (J), CT SVM-CNN model.

4 Discussion

The efficacy of IC serves as a robust predictor of survival outcomes in NPC patients post-IC (31). Prior evidence indicates a CR/PR rate of 76.9% following IC in NPC cohorts (46), underscoring that not all patients derive clinical benefit from IC. Our study demonstrated that the MRI SVM model achieved the highest specificity (80.7%), effectively reducing FP and ensuring aggressive therapies like IC are reserved for high-confidence responders. Conversely, the MRI SVM-Clinical model exhibited peak sensitivity (82.0%), capturing more true responders at the cost of increased over-treatment and potential exposure to IC-associated toxicities without survival benefit. In resource-limited settings, model thresholds can be calibrated to prioritize cost-effectiveness (e.g., avoiding IC in low-response-probability subgroups). For high-toxicity regimens such as cisplatin-based IC (47), stringent thresholds may be preferred to mitigate harm. Iterative adjustments based on real-world outcome monitoring (e.g., post-IC surveillance) further enable dynamic refinement of decision boundaries as evidence evolves.

Both our study and Yang et al. (31) focused on assessing radiomics to evaluate IC efficacy in NPC. Notably, compared with Yang et al. (31), our research included more primary studies and models (6 vs. 10 primary studies and 6 vs. 18 models), compared multiple radiomics algorithms, standardized the definitions of TP/FP cases to clarify model performance, and applied meta-regression and subgroup analyses to address heterogeneity. We performed the first NMA to compare the value of 9 different models in predicting IC efficacy in NPC. Based on the SUCRA values of different models, the following conclusions are drawn: Radiomics based on MRI and CT serves as a viable exploratory tool for predicting IC efficacy in NPC. Among them, the MRI FCNN model ranked in the top two for sensitivity, specificity, and accuracy, indicating superior overall predictive performance. The MRI SVM model had better specificity and accuracy, while the MRI SVM-Clinical model had better sensitivity. The above nonlinear ML combined with radiomics had a good predictive performance for the noninvasive identification of IC treatment response in NPC. Moreover, Li et al. (48) reported that the FCNN model exhibited optimal performance in evaluating HER2-low breast cancer undergoing neoadjuvant therapy. Therefore, the combination of FCNN and radiomics may be a promising method for predicting NPC treatment response.

The three studies (16, 29, 45) in this NMA showed that the MRI Radiomics-Clinical model could improve the predictive performance over that of the MRI Radiomics model. However, Wang et al. (21) showed that the MRI Radiomics-Clinical model could not improve predictive performance. Our research revealed that, compared with the MRI Radiomics model, the MRI Radiomics-Clinical model had greater specificity, and a lower sensitivity in the surca ranking. This is consistent with the pooled sensitivity and specificity ranking, which confirms the surca value accuracy of NMA. Owing to the lack of data, the clinical characteristics (such as EBV-DNA, LDH, etc.) that were independent predictors in other studies were not included in the study of Wang et al. (21), which might explain why the performance of the MRI Radiomics-Clinical model was lower than that of the MRI Radiomics model. In future studies, more relevant clinical data should be included to analyze the correlation between tumor microenvironment and radiomics features.

Notably, the I² values of specificity, and accuracy of the 9 models ranged were 74.55% and 69.51%, respectively, indicating moderate heterogeneity, whereas the I² value of sensitivity was 78.31%, greater than 75%, indicating high heterogeneity. Device type (MRI vs. CT)(P < 0.01) was the source of high heterogeneity (Supplementary Table S7). MRI led the studies (9/10), while CT was the only study (Table 1). The prediction efficiency of the MRI models was generally greater than that of the CT models. The possible reason is that MRI has significantly better resolution of soft tissue than CT does and effectively shows the range of the parapharyngeal space, skull base, and intracranial tumors (49), thus providing more realistic internal characteristics of the tumors. In the model based on CT images, we found that the SUCRA values of sensitivity, and accuracy of the CNN model were the highest, and sensitivity, and accuracy were even higher than those of the linear algorithms MRI models (MRI Radiomics model and MRI Radiomics-Clinical model) (Figure 5). The CNN has multiple layers of neuron-like computational connections, which sets a target-size bounding box on the lesion area, evaluates the malignant probability of the identified lesion, and achieves end-to-end output of entire image sequences (50). It reduces the need for manual intervention and traditional preprocessing steps by automating the extraction of complex features. Comes et al. (51) showed that MRI-based radiomics combined with a CNN model could predict the pathological CR of patients with breast cancer to neoadjuvant chemotherapy early, with an AUC of 0.82 (95% CI: 0.75-0.88). Therefore, the combination of CNN and MRI in future studies of NPC can provide a new way to predict the efficacy of IC.

In MRI-based radiomics subgroup analyses, studies with EPV ≥ 10 demonstrated significantly lower sensitivity than EPV <10 cohorts (0.85 vs. 0.90, P = 0.01). This inverse relationship indicates that low EPV (<10) may induce model overfitting to training data noise, thereby inflating sensitivity estimates in smaller cohorts (52). For ROI segmentation, 3D showed reduced sensitivity versus 2D (0.87 vs. 0.93, P = 0.03). While 2D segmentation risks missing heterogeneous tumor regions (increasing FN), 3D segmentation introduces non-target biological signals (e.g., necrosis/edema) (53). This sensitivity reduction reflects technical-biological complexity coupling, not methodological inferiority. The future requires dynamically optimizing ROI and adaptive segmentation based on necrosis proportion (54), coupled with analyzing how tumor microenvironment components (e.g., necrotic core vs. active margins) influence IC sensitivity, to unlock the potential of 3D imaging in resolving spatial heterogeneity of treatment effects.

External validation cohorts exhibited lower sensitivity than internal cohorts (0.87 vs. 0.93, P = 0.03), indicating overfitting risks. Large-sample studies (≥200 patients) showed poorer specificity than small-sample cohorts (0.61 vs. 0.76, P = 0.01), likely due to increased patient diversity compromising model discriminability. Conversely, small samples risk overfitting (e.g., Zhao et al. (27), validation n = 23, sensitivity = 100%). Therefore, on the basis of expanding the sample size, punitive modeling should be promoted simultaneously to solve the risk of overfitting in small samples. Nonlinear algorithms outperformed linear algorithms in sensitivity (0.97 vs. 0.87, P = 0.03), aligning with NPC’s complex response dynamics (55) and SUCRA rankings (Figure 5). Models with <10 features achieved higher specificity than those with ≥10 features (0.77 vs. 0.67, P = 0.04). This finding demonstrates that reduced feature quantity can mitigate model complexity, thereby decreasing false-positive predictions and enhancing both the robustness and diagnostic accuracy of radiomics models in predicting IC efficacy in NPC. While a previous study reported an average RQS of 31.0% (56), our analysis showed that the average RQS increased to 45.0%, which indicates a methodological advance. However, Low RQS studies correlated with inflated sensitivity (0.88 vs. 0.91, P = 0.02), highlighting uncorrected optimism bias. Therefore, we advocate that the methodological radiomics score (METRICS) be followed in future studies, a new scoring tool for assessing the methodological quality of the radiomics research (57). The “feature stability” and “model development” domains of METRICS systematically identify optimism bias sources overlooked by RQS, while its core domains of “clinical utility” and “reproducibility” address clinically detached optimism bias at its root. According to Supplementary Table S5, only 40% of studies (4/10) adhered to IBSI preprocessing protocols, with critical gaps in gray-level discretization (40% implementation) and image interpolation (40%), collectively contributing to diminished feature stability.

There were several limitations in this study. First, studies with two arms or more were relatively rare, and the extractable data were limited, without forming a closed loop. Second, the ROB ratings of some studies included in this study were high, mainly because the EPV was less than 10, which may have led to an increased risk of overfitting the prediction model. Future model development studies must ensure EPV ≥10 via a priori sample size estimation based on radiomics-specific guidelines. Third, the universal absence of external validation fundamentally limits the generalizability of current radiomics models, which prevents robust validation across institutionally heterogeneous cohorts.

5 Conclusions

This study provides a clinical reference for IC efficacy prediction in NPC by synthesizing radiomics evidence. As an exploratory tool, radiomics demonstrates potential; nevertheless, its performance generalizability requires cautious interpretation owing to technical heterogeneity. MRI-based nonlinear models and clinical-integrated frameworks demonstrate significant potential for clinical translation; however, achieving reliable deployment necessitates three critical steps: (1) standardized protocols adhering to IBSI/METRICS/RQS guidelines to reduce heterogeneity, (2) rigorous validation frameworks employing prospective multicenter designs to address generalization gaps, and (3) biological mechanism exploration linking imaging features to tumor microenvironment dynamics. Collectively, these strategies will facilitate the transition of radiomics from technical exploration to clinical utility in NPC precision oncology.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Author contributions

YJ: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Project administration, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. JP: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Project administration, Software, Validation, Visualization, Writing – original draft. GY: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Writing – review & editing. XH: Data curation, Formal Analysis, Investigation, Methodology, Project administration, Resources, Validation, Writing – original draft. JW: Data curation, Formal Analysis, Investigation, Methodology, Project administration, Resources, Validation, Writing – original draft. JY: Resources, Supervision, Writing – review & editing.. HS: Data curation, Formal Analysis, Investigation, Methodology, Project administration, Writing – original draft. DT: Data curation, Visualization, Writing – review & editing. QL: Data curation, Formal Analysis, Investigation, Methodology, Writing – review & editing. ZY: Data curation, Formal Analysis, Investigation, Methodology, Validation, Visualization, Writing – review & editing. ZS: Data curation, Formal Analysis, Investigation, Methodology, Project administration, Resources, Supervision, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research, and/or publication of this article.

Acknowledgments

We would like to thank Dr. Hao Xu from the Affiliated Cancer Hospital of University of Electronic Science and Technology of China for providing methodological assistance and guidance.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2025.1590420/full#supplementary-material

Abbreviations

CCRT, Concurrent chemoradiotherapy; CNN, Convolutional neural network; CR, Complete response; DL, Deep learning; Events per variable, EPV; FCNN, Fully connected neural network; FN, False negative; FP, False positive; GTV, Gross tumor volume; IBSI, Imaging biomarker standardization initiative; IC, Induction chemotherapy; IMRT, Intensity-modulated radiotherapy; LA-NPC, Locoregionally advanced nasopharyngeal carcinoma; METhodological RadiomICs Score, METRICS; ML, Machine learning; NCCN, National Comprehensive Cancer Network; NMA, Network meta-analysis; NPC, Nasopharyngeal carcinoma; PD, Progressive disease; PR, Partial response; PRISMA, Systematic Reviews and Meta-Analyses; PROBAST, Prediction Model Risk of Bias Assessment Tool; ROI, Region of interest; RQS, Radiomics quality score; SD, Stable disease; SUCRA, Surface under the cumulative ranking curve; SVM, Support vector machine; TN, True negative; TNM, Tumor-node-metastasis; TP, True positive.

References

1. Liu H, Tang L, Li Y, Xie W, Zhang L, Tang H, et al. Nasopharyngeal carcinoma: current views on the tumor microenvironment’s impact on drug resistance and clinical outcomes. Mol Cancer. (2024) 23:20. doi: 10.1186/s12943-023-01928-2

PubMed Abstract | Crossref Full Text | Google Scholar

2. Pan JJ, Ng WT, Zong JF, Lee SW, Choi HC, Chan LL, et al. Prognostic nomogram for refining the prognostication of the proposed 8th edition of the AJCC/UICC staging system for nasopharyngeal cancer in the era of intensity-modulated radiotherapy. Cancer. (2016) 122:3307–15. doi: 10.1002/cncr.30198

PubMed Abstract | Crossref Full Text | Google Scholar

3. Liu Y, Yang L, Zhang S, and Lin B. The efficacy and safety of concurrent chemoradiotherapy with induction chemotherapy vs. concurrent chemoradiotherapy alone for locally advanced nasopharyngeal carcinoma: a systematic-review and meta-analysis. Transl Cancer Res. (2022) 11:1207–18. doi: 10.21037/tcr-22-604

PubMed Abstract | Crossref Full Text | Google Scholar

4. Caudell JJ, Gillison ML, Maghami E, Spencer S, Pfister DG, Adkins D, et al. NCCN guidelines^® Insights: head and neck cancers, version 1.2022. J Natl Compr Canc Netw. (2022) 20:224–34. doi: 10.6004/jnccn.2022.0016

PubMed Abstract | Crossref Full Text | Google Scholar

5. Liu L, Fei Z, Chen M, Zhao L, Su H, Gu D, et al. Induction chemotherapy plus concurrent chemoradiotherapy versus induction chemotherapy plus volumetric modulated arc therapy alone in the treatment of stage II-IVB nasopharyngeal carcinoma patients: a retrospective controlled study. Radiat Oncol. (2018) 13:148. doi: 10.1186/s13014-018-1092-0

PubMed Abstract | Crossref Full Text | Google Scholar

6. Wang F, Jiang C, Wang L, Yan F, Sun Q, Ye Z, et al. Infuence of concurrent chemo- therapy on locoregionally advanced nasopharyngeal carcinoma treated with neoadjuvant chemotherapy plus intensity-modulated radiotherapy: a retrospective matched analysis. Sci Rep. (2020) 10:2489. doi: 10.1038/s41598-020-59470-w

PubMed Abstract | Crossref Full Text | Google Scholar

7. Wang Q, Xu G, Xia Y, Zuo J, Zeng G, Xue Z, et al. Comparison of induction chemotherapy plus concurrent chemoradiotherapy and induction chemotherapy plus radiotherapy in locally advanced nasopharyngeal carcinoma. Oral Oncol. (2020) 111:104925. doi: 10.1016/j.oraloncology.2020.104925

PubMed Abstract | Crossref Full Text | Google Scholar

8. Wei Z, Zhang Z, Luo J, Li N, and Peng X. Induction chemotherapy plus IMRT alone versus induction chemotherapy plus IMRT-based concurrent chemoradiotherapy in locoregionally advanced nasopharyngeal carcinoma: a retrospective cohort study. J Cancer Res Clin Oncol. (2019) 145:1857–64. doi: 10.1007/s00432-019-02925-z

PubMed Abstract | Crossref Full Text | Google Scholar

9. Chen YP, Liu X, Zhou Q, Yang KY, Jin F, Zhu XD, et al. Metronomic capecitabine as adjuvant therapy in locoregionally advanced nasopharyngeal carcinoma: a multicentre, open-label, parallel-group, randomised, controlled, phase 3 trial. Lancet. (2021) 398:303–13. doi: 10.1016/S0140-6736(21)01123-5

PubMed Abstract | Crossref Full Text | Google Scholar

10. Xi Y, Ge X, Ji H, Wang L, Duan S, Chen H, et al. Prediction of response to induction chemotherapy plus concurrent chemoradiotherapy for nasopharyngeal carcinoma based on MRI radiomics and delta radiomics: A two-center retrospective study. Front Oncol. (2022) 12:824509. doi: 10.3389/fonc.2022.824509

PubMed Abstract | Crossref Full Text | Google Scholar

11. Guo R, Tang LL, Mao YP, Du XJ, Chen L, Zhang ZC, et al. Proposed modifications and incorporation of plasma Epstein-Barr virus DNA improve the TNM staging system for Epstein-Barr virus-related nasopharyngeal carcinoma. Cancer. (2019) 125:79–89. doi: 10.1002/cncr.31741

PubMed Abstract | Crossref Full Text | Google Scholar

12. Limkin EJ, Sun R, Dercle L, Zacharaki EI, Robert C, Reuzé S, et al. Promises and challenges for the implementation of computational medical imaging (radiomics) in oncology. Ann Oncol. (2017) 28:1191–206. doi: 10.1093/annonc/mdx034

PubMed Abstract | Crossref Full Text | Google Scholar

13. Keek SA, Leijenaar RT, Jochems A, and Woodruf HC. A review on radiomics and the future of theranostics for patient selection in precision medicine. Br J Radiol. (2018) 91:20170926. doi: 10.1259/bjr.20170926

PubMed Abstract | Crossref Full Text | Google Scholar

14. Rogers W, Thulasi Seetha S, Refaee TAG, Lieverse RIY, Granzier RWY, Ibrahim A, et al. Radiomics: from qualitative to quantitative imaging. Br J Radiol. (2020) 93:20190948. doi: 10.1259/bjr.20190948

PubMed Abstract | Crossref Full Text | Google Scholar

15. Jiang Y, Zhou K, Sun Z, Wang H, Xie J, Zhang T, et al. Non-invasive tumor microenvironment evaluation and treatment response prediction in gastric cancer using deep learning radiomics. Cell Rep Med. (2023) 4:101146. doi: 10.1016/j.xcrm.2023.101146

PubMed Abstract | Crossref Full Text | Google Scholar

16. Hu C, Zheng D, Cao X, Pang P, Fang Y, Lu T, et al. Application value of magnetic resonance radiomics and clinical nomograms in evaluating the sensitivity of neoadjuvant chemotherapy for nasopharyngeal carcinoma. Front Oncol. (2021) 11:740776. doi: 10.3389/fonc.2021.740776

PubMed Abstract | Crossref Full Text | Google Scholar

17. Wang Y, Li C, Yin G, Wang J, Li J, Wang P, et al. Extraction parameter optimized radiomics for neoadjuvant chemotherapy response prognosis in advanced nasopharyngeal carcinoma. Clin Transl Radiat Oncol. (2021) 33:37–44. doi: 10.1016/j.ctro.2021.12.005

PubMed Abstract | Crossref Full Text | Google Scholar

18. Zhang L, Ye Z, Ruan L, and Jiang M. Pretreatment MRI-derived radiomics may evaluate the response of different induction chemotherapy regimens in locally advanced nasopharyngeal carcinoma. Acad Radiol. (2020) 27:1655–64. doi: 10.1016/j.acra.2020.09.002

PubMed Abstract | Crossref Full Text | Google Scholar

19. Wang G, He L, Yuan C, Huang Y, Liu Z, and Liang C. Pretreatment MR imaging radiomics signatures for response prediction to induction chemotherapy in patients with nasopharyngeal carcinoma. Eur J Radiol. (2018) 98:100–6. doi: 10.1016/j.ejrad.2017.11.007

PubMed Abstract | Crossref Full Text | Google Scholar

20. Yongfeng P, Chuner J, Lei W, Fengqin Y, Zhimin Y, Zhenfu F, et al. The usefulness of pretreatment MR-based radiomics on early response of neoadjuvant chemotherapy in patients with locally advanced nasopharyngeal carcinoma. Oncol Res. (2021) 28:605–13. doi: 10.3727/096504020X16022401878096

PubMed Abstract | Crossref Full Text | Google Scholar

21. Wang A, Xu H, Zhang C, Ren J, Liu J, and Zhou P. Radiomic analysis of MRI for prediction of response to induction chemotherapy in nasopharyngeal carcinoma patients. Clin Radiol. (2023) 78:e644–53. doi: 10.1016/j.crad.2023.05.012

PubMed Abstract | Crossref Full Text | Google Scholar

22. Esteva A, Robicquet A, Ramsundar B, Kuleshov V, DePristo M, Chou K, et al. A guide to deep learning in healthcare. Nat Med. (2019) 25:24–9. doi: 10.1038/s41591-018-0316-z

PubMed Abstract | Crossref Full Text | Google Scholar

23. Salem M, Ryan MA, Oliver A, Hussain KF, and Lladó X. Improving the detection of new lesions in multiple sclerosis with a cascaded 3D fully convolutional neural network approach. Front Neurosci. (2022) 16:1007619–1007619. doi: 10.3389/fnins.2022.1007619

PubMed Abstract | Crossref Full Text | Google Scholar

24. Gabr RE, Coronado I, Robinson M, Sujit SJ, Datta S, Sun X, et al. Brain and lesion segmentation in multiple sclerosis using fully convolutional neural networks: A large-scale study. Mult Scler. (2020) 26:1217–26. doi: 10.1177/1352458519856843

PubMed Abstract | Crossref Full Text | Google Scholar

25. Lee JG, Jun S, Cho YW, Lee H, Kim GB, Seo JB, et al. Deep learning in medical imaging: general overview. Korean J Radiol. (2017) 18:570–84. doi: 10.3348/kjr.2017.18.4.570

PubMed Abstract | Crossref Full Text | Google Scholar

26. Wang Z, Fang M, Zhang J, Tang L, Zhong L, Li H, et al. Radiomics and deep learning in nasopharyngeal carcinoma: A review. IEEE Rev BioMed Eng. (2024) 17:118–35. doi: 10.1109/RBME.2023.3269776

PubMed Abstract | Crossref Full Text | Google Scholar

27. Zhao L, Gong J, Xi Y, Xu M, Li C, Kang X, et al. MRI-based radiomics nomogram may predict the response to induction chemotherapy and survival in locally advanced nasopharyngeal carcinoma. Eur Radiol. (2020) 30:537–46. doi: 10.1007/s00330-019-06211-x

PubMed Abstract | Crossref Full Text | Google Scholar

28. Selvaraj G, Kaliamurthi S, Kaushik AC, Khan A, Wei YK, Cho WC, et al. Identification of target gene and prognostic evaluation for lung adenocarcinoma using gene expression meta-analysis, network analysis and neural network algorithms. J BioMed Inform. (2018) 86:120–34. doi: 10.1016/j.jbi.2018.09.004

PubMed Abstract | Crossref Full Text | Google Scholar

29. Liao H, Chen X, Lu S, Jin G, Pei W, Li Y, et al. MRI-based back propagation neural network model as a powerful tool for predicting the response to induction chemotherapy in locoregionally advanced nasopharyngeal carcinoma. J Magn Reson Imaging. (2022) 56:547–59. doi: 10.1002/jmri.28047

PubMed Abstract | Crossref Full Text | Google Scholar

30. Esteva A, Kuprel B, Novoa RA, Ko J, Swetter SM, Blau HM, et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature. (2017) 542:115–8. doi: 10.1038/nature21056

PubMed Abstract | Crossref Full Text | Google Scholar

31. Yang Y, Wang M, Qiu K, Wang Y, and Ma X. Computed tomography-based deep-learning prediction of induction chemotherapy treatment response in locally advanced nasopharyngeal carcinoma. Strahlenther Onkol. (2022) 198:183–93. doi: 10.1007/s00066-021-01874-2

PubMed Abstract | Crossref Full Text | Google Scholar

32. Deng Q, Hou Y, Zhang X, and Zan H. A systematic review of the predictive value of radiomics for nasopharyngeal carcinoma prognosis. Med (Baltimore). (2024) 103:e39302. doi: 10.1097/MD.0000000000039302

PubMed Abstract | Crossref Full Text | Google Scholar

33. Wang CK, Wang TW, Lu CF, Wu YT, and Hua MW. Deciphering the prognostic efficacy of MRI radiomics in nasopharyngeal carcinoma: A comprehensive meta-analysis. Diagnostics (Basel). (2024) 14:924. doi: 10.3390/diagnostics14090924

PubMed Abstract | Crossref Full Text | Google Scholar

34. Lee S, Choi Y, Seo MK, Jang J, Shin NY, Ahn KJ, et al. Magnetic resonance imaging-based radiomics for the prediction of progression-free survival in patients with nasopharyngeal carcinoma: A systematic review and meta-analysis. Cancers (Basel). (2022) 14:653. doi: 10.3390/cancers14030653

PubMed Abstract | Crossref Full Text | Google Scholar

35. O’Donnell JPM, Gasior SA, Davey MG, O’Malley E, Lowery AJ, McGarry J, et al. The accuracy of breast MRI radiomic methodologies in predicting pathological complete response to neoadjuvant chemotherapy: A systematic review and network meta-analysis. Eur J Radiol. (2022) 157:110561. doi: 10.1016/j.ejrad.2022.110561

PubMed Abstract | Crossref Full Text | Google Scholar

36. Wolff RF, Moons KGM, Riley RD, Whiting PF, Westwood M, Collins GS, et al. PROBAST: A tool to assess the risk of bias and applicability of prediction model studies. Ann Intern Med. (2019) 170:51–8. doi: 10.7326/M18-1376

PubMed Abstract | Crossref Full Text | Google Scholar

37. Lambin P, Leijenaar RTH, Deist TM, Peerlings J, de Jong EEC, van Timmeren J, et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol. (2017) 14:749–62. doi: 10.1038/nrclinonc.2017.141

PubMed Abstract | Crossref Full Text | Google Scholar

38. Zwanenburg A, Vallières M, Abdalah MA, Aerts HJWL, Andrearczyk V, Apte A, et al. The image biomarker standardization initiative: standardized quantitative radiomics for high-throughput image-based phenotyping. Radiology. (2020) 295:328–38. doi: 10.1148/radiol.2020191145

PubMed Abstract | Crossref Full Text | Google Scholar

39. Heinecke A, Tallarita M, and De Iorio M. Bayesian splines versus fractional polynomials in network meta-analysis. BMC Med Res Methodol. (2020) 20:261. doi: 10.1186/s12874-020-01113-9

PubMed Abstract | Crossref Full Text | Google Scholar

40. Hutton B, Salanti G, Caldwell DM, Chaimani A, Schmid CH, Cameron C, et al. The PRISMA extension statement for reporting of systematic reviews incorporating network meta-analyses of health care interventions: checklist and explanations. Ann Intern Med. (2015) 162:777–84. doi: 10.7326/M14-2385

PubMed Abstract | Crossref Full Text | Google Scholar

41. Aitken M, Davidson M, Chan MV, Urzua Fresno C, Vasquez LI, Huo YR, et al. Prognostic value of cardiac MRI and FDG PET in cardiac sarcoidosis: A systematic review and meta-analysis. Radiology. (2023) 307:e222483. doi: 10.1148/radiol.222483

PubMed Abstract | Crossref Full Text | Google Scholar

42. Dias S, Welton NJ, Caldwell DM, and Ades AE. Checking consistency in mixed treatment comparison meta-analysis. Stat Med. (2010) 29:932–44. doi: 10.1002/sim.3767

PubMed Abstract | Crossref Full Text | Google Scholar

43. Chen R, Guo Y, Kuang Y, and Zhang Q. Effects of home-based exercise interventions on post-stroke depression: A systematic review and network meta-analysis. Int J Nurs Stud. (2024) 152:104698. doi: 10.1016/j.ijnurstu.2024.104698

PubMed Abstract | Crossref Full Text | Google Scholar

44. Eisenhauer EA, Therasse P, Bogaerts J, Schwartz LH, Sargent D, Ford R, et al. New response evaluation criteria in solid tumours: revised RECIST guideline (version 1.1). Eur J Cancer. (2009) 45:228–47. doi: 10.1016/j.ejca.2008.10.026

PubMed Abstract | Crossref Full Text | Google Scholar

45. Chen Z, Wang Z, Liu S, Zhang S, Zhou Y, Zhang R, et al. Nomograms based on multiparametric MRI radiomics integrated with clinical-radiological features for predicting the response to induction chemotherapy in nasopharyngeal carcinoma. Eur J Radiol. (2024) 175:111438. doi: 10.1016/j.ejrad.2024.111438

PubMed Abstract | Crossref Full Text | Google Scholar

46. Liu SL, Sun XS, Yan JJ, Chen QY, Lin HX, Wen YF, et al. Optimal cumulative cisplatin dose in nasopharyngeal carcinoma patients based on induction chemotherapy response. Radiother Oncol. (2019) 137:83–94. doi: 10.1016/j.radonc.2019.04.020

PubMed Abstract | Crossref Full Text | Google Scholar

47. Xu C, Zhou GQ, Li WF, Hu DS, Chen XZ, Lin SJ, et al. Nivolumab combined with induction chemotherapy and radiotherapy in nasopharyngeal carcinoma: A multicenter phase 2 PLATINUM trial. Cancer Cell. (2025) 43:925–36.e4. doi: 10.1016/j.ccell.2025.01.014

PubMed Abstract | Crossref Full Text | Google Scholar

48. Li XP, Lin ZQ, Yu QH, Qiu CR, Lai C, Huang H, et al. Development and validation of a prognostic model for HER2-low breast cancer to evaluate neoadjuvant therapy. Gland Surg. (2023) 12:183–96. doi: 10.21037/gs-22-729

PubMed Abstract | Crossref Full Text | Google Scholar

49. Zhang YM, Gong GZ, Qiu QT, Han YW, Lu HM, and Yin Y. Radiomics for diagnosis and radiotherapy of nasopharyngeal carcinoma. Front Oncol. (2022) 11:767134. doi: 10.3389/fonc.2021.767134

PubMed Abstract | Crossref Full Text | Google Scholar

50. Li Y, Deng J, Ma X, Li W, and Wang Z. Diagnostic accuracy of CT and PET/CT radiomics in predicting lymph node metastasis in non-small cell lung cancer. Eur Radiol. (2024) 35:1966–79. doi: 10.1007/s00330-024-11036-4

PubMed Abstract | Crossref Full Text | Google Scholar

51. Comes MC, Fanizzi A, Bove S, Didonna V, Diotiaiuti S, Fadda F, et al. Explainable 3D CNN based on baseline breast DCE-MRI to give an early prediction of pathological complete response to neoadjuvant chemotherapy. Comput Biol Med. (2024) 172:108132. doi: 10.1016/j.compbiomed.2024

PubMed Abstract | Crossref Full Text | Google Scholar

52. Cao Q, Jiang Z, Wang Z, Wee L, Dekker A, Zhang Z, et al. Minimum sample size calculation for radiomics-based binary outcome prediction models: Theoretical framework and practical example. Radiother Oncol. (2025) 212:111134. doi: 10.1016/j.radonc.2025.111134

PubMed Abstract | Crossref Full Text | Google Scholar

53. Truong NCD, Bangalore Yogananda CG, Wagner BC, Holcomb JM, Reddy D, Saadat N, et al. Two-stage training framework using multicontrast MRI radiomics for IDH mutation status prediction in glioma. Radiol Artif Intell. (2024) 6:e230218. doi: 10.1148/ryai.230218

PubMed Abstract | Crossref Full Text | Google Scholar

54. Liu Z, Mouni D, Zhang S, Du T, Li C, Grzegorzek M, et al. Predicting the early response to neoadjuvant chemotherapy in high-grade serous ovarian cancer by intratumoral habitat heterogeneity based on 18 F-FDG PET/CT. Eur J Nucl Med Mol Imaging. (2025). doi: 10.1007/s00259-025-07480-z

PubMed Abstract | Crossref Full Text | Google Scholar

55. Chen X, Li Y, Li X, Cao X, Xiang Y, Xia W, et al. An interpretable machine learning prognostic system for locoregionally advanced nasopharyngeal carcinoma based on tumor burden features. Oral Oncol. (2021) 118:105335. doi: 10.1016/j.oraloncology.2021.105335

PubMed Abstract | Crossref Full Text | Google Scholar

56. Kocak B, Keles A, Kose F, and Sendur A. Quality of radiomics research: comprehensive analysis of 1574 unique publications from 89 reviews. Eur Radiol. (2024) 35:1980–92. doi: 10.1007/s00330-024-11057-z

PubMed Abstract | Crossref Full Text | Google Scholar

57. Deng K, Chen T, Leng Z, Yang F, Lu T, Cao J, et al. Radiomics as a tool for prognostic prediction in transarterial chemoembolization for hepatocellular carcinoma: a systematic review and meta-analysis. Radiol Med. (2024) 129:1099–117. doi: 10.1007/s11547-024-01840-9

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: nasopharyngeal carcinoma, induction chemotherapy, radiomics, deep learning, network meta-analysis, machine learning, Bayesian

Citation: Jian Y, Peng J, Yang G, He X, Wang J, Yin J, Shi H, Tao D, Lan Q, Yang Z and Shu Z (2025) Predictive performance of MRI and CT radiomics in predicting the response to induction chemotherapy in nasopharyngeal carcinoma: a network meta-analysis. Front. Oncol. 15:1590420. doi: 10.3389/fonc.2025.1590420

Received: 26 March 2025; Accepted: 20 October 2025;
Published: 13 November 2025.

Edited by:

Emma Gangemi, Hospital Physiotherapy Institutes (IRCCS), Italy

Reviewed by:

Cheng-Wei Tie, Affiliated Hospital of Qinghai University, China
Parya Valizadeh, Tehran University of Medical Sciences, Iran
Yen Cho Huang, Keelung Chang Gung Memorial Hospital, Taiwan

Copyright © 2025 Jian, Peng, Yang, He, Wang, Yin, Shi, Tao, Lan, Yang and Shu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zhenyu Shu, Y29vbGp1dHlAaG90bWFpbC5jb20=

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.