Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Oncol., 08 August 2025

Sec. Radiation Oncology

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1644053

This article is part of the Research TopicOptimizing Radiotherapy for Cervical Cancer Efficacy Toxicity and Brachytherapy IntegrationView all 8 articles

Combined clinical and MRI-based radiomics model for predicting acute hematologic toxicity in gynecologic cancer radiotherapy

Lumeng Luo,Lumeng Luo1,2Jiahao Wang,Jiahao Wang1,2Hongling Xie,Hongling Xie1,2Bingxin Chen,Bingxin Chen2,3Hui Wang,Hui Wang2,3Qiu Tang,*Qiu Tang1,2*
  • 1Department of Radiation Oncology, Women’s Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
  • 2Zhejiang Provincial Key Laboratory of Precision Diagnosis and Therapy for Major Gynecological Diseases, Women’s Hospital, Zhejiang University School of Medicine, Hangzhou, China
  • 3Department of Gynecologic Oncology, Women’s Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China

Acute hematologic toxicity (HT) remains a critical dose-limiting complication in gynecologic cancer patients undergoing pelvic radiotherapy, particularly when combined with chemotherapy. Early prediction of severe HT could inform personalized management and minimize toxicity. We developed and validated a predictive model integrating clinical parameters and radiomic features, evaluating five machine learning approaches. Clinical data, dosimetric parameters, and pelvic bone marrow radiomic features extracted from MRI and CT images were analyzed. Feature selection was performed using LASSO and random forest algorithms, followed by comparison across multiple classification models. In the independent test set, the combined clinical and MRI-radiomics model showed superior predictive performance (AUC=0.927, accuracy=85.5%, sensitivity=92.3%, specificity=66.7%) compared to clinical-only (AUC=0.703), MRI-only (AUC=0.925, but low specificity of 38.1%), and CT-only models (AUC=0.54). The model performed notably better in patients receiving concurrent chemoradiotherapy. Key predictors included baseline hemoglobin, white blood cell count, bone marrow dosimetry, and MRI-derived texture and fat fraction features. Integrating clinical data with MRI-based radiomics provides a robust approach for predicting acute HT, potentially guiding personalized management strategies and improving safety during gynecologic cancer radiotherapy.

1 Introduction

Acute hematologic toxicity (HT) remains a significant limitation in pelvic radiotherapy (RT), particularly in patients undergoing concurrent chemoradiotherapy (CRT) for gynecologic and other pelvic malignancies. Severe (grade ≥3) HT occurs in approximately 20–25% of patients receiving pelvic CRT, often causing treatment delays or chemotherapy dose reductions, potentially compromising oncologic outcomes (1, 2). Hematopoietic marrow within pelvic bones represents a substantial fraction of the body’s marrow reserve, making its sparing essential for maintaining blood cell counts during treatment (3, 4). Thus, accurate pre-treatment prediction of severe HT is critical to enable tailored interventions and avoid treatment interruptions.

Currently, clinical and dosimetric factors serve as the primary basis for predicting HT risk, including concurrent chemotherapy, patient characteristics (e.g., baseline hematologic counts, BMI), and bone marrow radiation dose-volume metrics (e.g., pelvic marrow volume receiving ≥20 Gy) (58). Despite these correlations, the predictive performance remains modest due to patient heterogeneity and complexity of marrow radiosensitivity (5). Consequently, individual risk prediction remains challenging, highlighting the need for more precise and personalized predictive biomarkers.

Radiomics, an emerging approach that extracts quantitative imaging features from routine medical images, offers promise for improving toxicity prediction (9, 10). Initial efforts utilizing computed tomography (CT)-based radiomics have demonstrated predictive value for hematologic toxicities in pelvic radiotherapy (11, 12). However, CT images primarily reflect bone density and trabecular structure, providing indirect and limited insight into marrow composition or functional status. Prior studies indicate that CT-derived radiomic features alone may not robustly capture marrow radiosensitivity due to their inability to differentiate hematopoietically active marrow from fatty marrow (12, 13).

Magnetic resonance imaging (MRI), with its superior soft-tissue contrast and capability to quantify marrow fat content and cellularity, provides a stronger rationale for radiomics-based marrow characterization. MRI can reliably distinguish active (red) marrow from inactive (fatty) marrow (14, 15). These findings highlight the potential of MRI-based radiomics to capture clinically relevant information about marrow status that is not discernible on CT. To date, however, no study has employed pre-treatment MRI-based radiomics to predict chemoradiotherapy hematologic toxicity; most existing models rely solely on CT images.

Given these considerations, this study aims to develop and validate a combined predictive model integrating clinical parameters and MRI-derived radiomics for predicting acute hematologic toxicity in patients undergoing pelvic radiotherapy for gynecologic cancers and other pelvic tumors. We hypothesize that combining clinical factors with MRI radiomics will significantly improve the accuracy and specificity of HT risk stratification, with the ultimate aim of enabling proactive identification of high-risk patients and informing bone marrow–sparing or supportive strategies in pelvic radiotherapy.

2 Materials and methods

2.1 Data collection

This study retrospectively analyzed 534 patients treated at Women’s Hospital, School of Medicine, Zhejiang University, between April 2023 and February 2025 for cervical or endometrial cancer with pelvic radiotherapy (RT). The requirement for informed consent was waived by the Ethics Committee of the Women’s Hospital, School of Medicine, Zhejiang University because this was a retrospective study using de-identified data. Eligible patients included: (1) cervical cancer patients receiving definitive radiotherapy, cervical or endometrial cancer receiving postoperative adjuvant radiotherapy; (2) treated with IMRT to the pelvic field (with or without extended-field to para-aortic nodes); (3) had weekly complete blood count from one week before RT through the end of RT; and (4) underwent pelvic contrast-enhanced magnetic resonance imaging (CE-MRI) at 3.0 Tesla and planning CT imaging before RT for radiomics analysis. Patients were excluded if they had prior pelvic radiation or extensive chemotherapy, pre-existing hematologic disorders, or significant bone lesions (e.g. pelvic bone metastases or replacement) that could affect marrow function. A summary of the study design is presented in Figure 1.

Figure 1
Diagram showing feature selection and model development for gynecologic cancer patients undergoing radiotherapy or chemoradiotherapy. Patients are divided into training (375) and test sets (159). Features include clinical, CT radiomics (2675), and MRI radiomics (2553). Models used are Logistic Regression, Random Forest, Support Vector Machine, Extreme Gradient Boosting, and Neural Network. The process involves performance evaluation, model comparison, and validation.

Figure 1. Workflow charts. HT, hematologic toxicity; RT, Radiotherapy; CRT, Chemoradiotherapy.

2.2 CT simulation and radiotherapy planning procedures

CT simulation and radiotherapy were performed with patients in the supine position and with a relatively full bladder. CT images covered the region extending from the lower lumbar spine to the entire pelvic cavity and were reconstructed using a Philips Brilliance Big Bore CT scanner system (Philips Healthcare, Best, the Netherlands) with a matrix size of 512 × 512 and a slice thickness of 5 mm. Clinical target volumes (CTVs) were delineated manually by experienced radiation oncologists according to guidelines established by the Radiation Therapy Oncology Group (RTOG). Pelvic bone marrow was delineated and included among the organs at risk (OARs) for external beam radiation therapy (EBRT) planning. EBRT planning and structural delineation were performed using the Pinnacle Treatment Planning System (Version 9.16.2, Philips Corp., Fitchburg, WI, USA). All manual contours were subsequently reviewed and validated by senior radiation oncologists specialized in gynecologic oncology to ensure standardization. EBRT was delivered utilizing IMRT on Elekta Infinity medical linear accelerator (Elekta, Stockholm, Sweden). Planning CT scan (non-contrast CT images acquired during CT simulation) was used for feature extraction.

2.3 MRI

All patients underwent a pelvic MRI scan (diagnostic MRI prior to RT). MRI was performed using a GE SIGNA Premier 3.0T magnetic resonance imaging system(General Electric Company HealthCare, USA)with gadobutrol as the contrast agent for enhanced imaging. Three sequences that highlighted bone marrow composition were acquired: (1) Axial contrast-enhanced Liver Acquisition with Volume Acceleration (Axial LAVA+C), repetition time (TR)=3.9 ms, echo time (TE)=1.7 ms, slice thickness=5.6 mm; (2) Axial Water LAVA-Flex; (3) Axial Fat LAVA-Flex, TR=6.1ms, TE =2ms, slice thickness=5.6mm.

2.4 Radiomics feature extraction (MRI and CT)

For radiomics analysis, we defined the region of interest (ROI) as the pelvic bone marrow within the radiation field. The pelvic bone marrow ROI was delineated on the planning CT and MRI, encompassing the internal volumes of the pelvic bones (sacrum, ilium, ischium, pubis, proximal femora) from the pelvic brim to the ischial tuberosities.

Radiomic feature extraction was performed using 3D Slicer (version 5.8.1) with the PyRadiomics extension (16). Prior to feature extraction, voxel intensity discretization was performed using a fixed bin width (25.0). For CT images, spatial filters including Haar wavelet decomposition and Laplacian of Gaussian (LoG) were applied to enhance textural representation (17, 18). For MRI images, only Haar wavelet decomposition was used, due to the inclusion of three distinct MRI sequences per patient, which would otherwise have generated excessive feature dimensionality. Finally, from CT images, 2,675 radiomic features were extracted per patient, and from MRI images, 2,553 radiomic features (851 features per MRI sequence) were obtained. These radiomic features comprised shape features (volumetric indices), first-order intensity statistics, and texture features computed from Gray-Level Co-occurrence Matrix (GLCM), Gray-Level Run-Length Matrix (GLRLM), Gray-Level Size Zone Matrix (GLSZM), Gray-Level Dependence Matrix (GLDM), and Neighboring Gray-Tone Difference Matrix (NGTDM) (19).

2.5 Hematologic toxicity assessment

All patients had baseline blood counts within one week prior to RT and weekly during treatment up to completion. We focused on four hematologic parameters: white blood cell count (leukocytes), absolute neutrophil count (ANC), hemoglobin, and platelet count. Acute hematologic toxicity (HT) was graded according to CTCAE v5.0 criteria (20) for leukopenia, neutropenia, anemia, and thrombocytopenia, using the nadir values during RT or within 4 weeks post-RT. For each patient, we determined the maximum grade of toxicity in any of the four categories. We defined “severe hematologic toxicity” as grade ≥3 toxicity (i.e. grade 3 or 4 in at least one parameter). This severe HT endpoint (yes/no) was used for predictive modeling.

2.6 Feature selection

Prior to feature selection, all continuous numerical features (including clinical and radiomic) were standardized using z-score normalization to ensure consistent feature scales. Features with near-zero variance or high inter-feature correlation (Pearson’s r > 0.9) were removed. Given the large dimensionality of radiomic features relative to the sample size, a two-step feature selection approach was used. Two supervised feature selection methods were used: (1) LASSO logistic regression, a regularization technique that imposes an L1 penalty to promote sparsity in feature selection, tuned via cross-validation (21); (2) Random Forest feature importance ranking using mean decrease in Gini impurity (18). The final feature set was determined by combining non-zero LASSO features and the top 30 Random Forest features, followed by cross-validation to optimize the number of features included in each model.

2.7 Model development

All models were trained using 5-fold cross-validation (k = 5), with AUC used as the primary evaluation metric during feature selection and hyperparameter tuning. Five classifiers were trained and evaluated using the selected features: 1) Logistic Regression (LR): a generalized linear model used for binary classification (22). 2) Random Forest (RF): an ensemble classifier using bootstrapped decision trees (18). 3) Support Vector Machine (SVM): implemented with a radial basis function (RBF) kernel and cross-validated tuning (23). 4) Extreme Gradient Boosting (XGBoost): a tree-based boosting algorithm optimized with hyperparameter tuning (24). 5)Neural Network (NN): a multi-layer perceptron with one hidden layer and L2 regularization (25). Performance was evaluated on the test set using area under the ROC curve (AUC), accuracy, sensitivity, specificity, and F1-score.

Finally, the tuned models were applied to the unseen test set (n=159) to evaluate performance. We calculated the AUC for predicting severe HT in the test set for each classifier, as well as accuracy, sensitivity (recall of severe HT cases), specificity, and F1-score. The primary metric of interest was AUC.

All model development was conducted in R (v4.4) and using packages glmnet, randomForest, e1071, xgboost, caret, pROC, ggplot2, MASS, rpart and tidyr.

3 Results

3.1 Patient characteristics

Of the 534 patients, 480 had cervical cancer (380 received postoperative RT after hysterectomy; 100 received definitive RT), and 54 had endometrial cancer (postoperative RT). All patients were treated with EBRT to the pelvis using IMRT (typically 45–50.4 Gy in 25–28 fractions). A subset (17% in training, 21% in test) also received prophylactic para-aortic nodal irradiation (extended-field IMRT). The training (n=375) and test (n=159) sets were well-balanced in clinical characteristics (Table 1). Concurrent chemotherapy (weekly cisplatin, 40 mg/m²) was administered in 68% of patients (65.1% of training cohort vs 73.6% of test cohort; p=0.068), primarily those with cervical cancer. After EBRT, intracavitary brachytherapy was delivered for definitive cervical cancer cases and postoperative cervical cancer cases with positive vaginal surgical margins (32.4% of training, 30.8% of test patients; p=0.435). Baseline patient characteristics were comparable between the training (n=375) and independent test set (n=159), with no significant differences in age (median ~58 years), disease type distribution, doses to bone marrow, baseline blood counts and plasma protein level (all p>0.05).

Table 1
www.frontiersin.org

Table 1. Clinical patient characteristics (Training set vs. Test set).

In the entire cohort, 259 patients (48.5%) experienced at least one Grade 2 hematologic toxicity during RT, and 142 patients (26.6%) experienced Grade ≥3 (severe) hematologic toxicity. The distribution of HT grades in each subset is detailed in Table 2. Leukopenia was the most common toxicity: over 68% of patients had developed acute leukopenia (grade ≥2), with a few grade 4 cases in training set. Neutropenia often accompanied leukopenia; around 8% had grade 3 or 4 neutropenia. Anemia was frequently observed at low grades (over one-third had grade 1), but only ~5% had grade 3 anemia (1% grade 4). Thrombocytopenia was mild in most cases (grade 0–1 in >90%); grade 3 thrombocytopenia occurred in ~2–3% of patients. Overall, only ~6–8% of patients had no hematologic toxicity at all (grade 0 for all indices), reflecting that the vast majority experienced at least transient cytopenias during pelvic RT.

Table 2
www.frontiersin.org

Table 2. Hematologic toxicity summary.

3.2 Clinical predictors of hematologic toxicity

On univariate analysis, several clinical factors were associated with increased HT: concurrent chemotherapy, larger irradiated volume (para-aortic extended-field), lower baseline white blood cell count/hemoglobin, and higher Bone marrow dose metrics. In the multivariate logistic regression (Figure 2), five factors remained independently significant. Concurrent chemotherapy had the strongest association with severe HT (OR ~4, p=0.001). Baseline bone marrow function (as reflected by baseline white blood cell count and hemoglobin) was also critical, highlighting that patients with stronger baseline counts better tolerated radiotherapy without severe cytopenias. As expected, para-aortic RT was associated with higher HT risk in our data (OR ~2.37, p<0.05). Finally, a higher maximum bone marrow dose predicted higher HT risk (OR ~1.003, p<0.05). While the effect per unit was small, it was statistically significant, reinforcing that avoiding very high point doses to active marrow (hotspots, e.g. in lumbosacral bone) could be beneficial.

Figure 2
Forest plot illustrating the multivariate analysis of five variables with their odds ratios and confidence intervals. Baseline white blood cell count shows an odds ratio of 0.67 with a p-value less than 0.001. Baseline hemoglobin has an odds ratio of 0.97 and a p-value of 0.009. Marrow max dose shows an odds ratio of 1.00 with a p-value of 0.013. Para-aortic radiotherapy has an odds ratio of 2.37 and a p-value of 0.016. CDDP chemotherapy has an odds ratio of 4.08 with a p-value of 0.001. Significance is indicated by different colors for p-values.

Figure 2. Forest plot showing multivariate logistic regression analysis for predictors of severe acute hematologic toxicity (HT). Odds ratios (OR) are indicated by colored squares according to statistical significance levels, and horizontal lines represent the 95% confidence intervals (CI).

3.3 Clinical model performance

Model performance on the independent test set was notably poorer than in the training set. The clinical models showed suboptimal generalizability, with test set AUCs ranging from 0.57 to 0.70 and notably low specificity (<20%) despite high sensitivity (>90%) (Figure 3A, E, Supplementary Table 1). This discrepancy highlights considerable overfitting: although clinical features especially chemotherapy status allowed models to correctly identify nearly all patients who developed severe hematologic toxicity, they also excessively flagged patients as high-risk, resulting in numerous false-positive predictions and low practical specificity despite excellent initial training performance.

Figure 3
Graphs depict performance comparisons of various models in training and test sets for clinical, CT-radiomic, MRI-radiomic, and combined models. Panels A-H show ROC curves with AUC values for each model type. Panels I-M display bar charts comparing AUC, accuracy, sensitivity, specificity, and F1 score across different machine learning techniques: logistic regression, random forest, XGBoost, SVM, and neural network. Each bar chart uses color coding to distinguish between model types, showcasing performance metrics.

Figure 3. Performance evaluation and comparison of predictive models for acute hematologic toxicity using clinical factors, CT radiomics, MRI radiomics, and combined clinical-MRI radiomics features. (A–D) Receiver operating characteristic (ROC) curves illustrating model performance on the training set. (E–H) ROC curves for the independent test set. (I–M) Bar charts comparing performance metrics (AUC, Accuracy, Sensitivity, Specificity, F1 Score) across the four modeling strategies in the test dataset.

3.4 CT-radiomics model performance

The predictive performances of the five machine learning models using the top 30 selected CT-based radiomic features are presented in Figure 3B, F, Supplementary Table 1. On the independent test set, accuracy ranged from approximately 63% (neural network) to 74% (SVM), with modest AUC values (0.50–0.54). Although sensitivity was high, specificity was uniformly low (0–19%), indicating frequent false-positive predictions. These results suggest that, despite rigorous feature selection, CT-based radiomic features alone provided limited predictive value for acute hematologic toxicity. The selected features were predominantly texture- and intensity-based metrics extracted from filtered images (e.g., wavelet, exponential, gradient), reflecting marrow heterogeneity that was insufficient to support robust prediction compared to MRI radiomics.

3.5 MRI-radiomics model performance

We next evaluated the predictive performance of models based on MRI radiomic features of the bone marrow. A total of 50 MRI-based radiomic features were selected by our feature selection process to feed into the classifiers. These included features primarily included first-order intensity (mean, median, skewness) and advanced texture metrics (e.g., GLCM correlation, GLSZM Large Area High Gray Level Emphasis, GLDM entropy and uniformity measures). On the independent test set, models achieved good discrimination, notably the XGBoost (AUC=0.925, accuracy=81.1%) and neural network models (AUC=0.915, accuracy=84.3%). These models exhibited high sensitivity (~97%) and moderate specificity (38–50%), indicating effective identification of patients at high risk of severe hematologic toxicity (Figures 3C, G, Supplementary Table 1). In conclusion, MRI features likely reflect marrow composition and heterogeneity, contributing to their superior predictive power compared to CT-radiomics.

3.6 Combined clinical and MRI-radiomics model performance

Given that both clinical factors and MRI radiomic features demonstrated strong predictive value individually, we developed a combined model integrating the features from both domains. The combined model integrated a total of 55 rigorously selected features, encompassing both key clinical parameters and MRI-derived radiomic metrics (Figure 4). The clinical features primarily included baseline hematologic, radiation dosage metrics and disease type. MRI-radiomic features primarily included intensity and texture metrics derived from three sequences (LAVA+C, Water LAVA-Flex, and Fat LAVA-Flex), encompassing first-order statistics (e.g., mean, median, skewness, kurtosis, 10 Percentile), texture features (GLCM, GLSZM, GLDM, NGTDM) and shape metrics.

Figure 4
Bar chart titled “Selected Features After Cross-validation” showing the mean decrease in Gini impurity for various features. The x-axis represents the mean decrease, while the y-axis lists features. Bars are color-coded by feature group: Clinical (red), FAT LAVA FLEX (blue), LAVA C (green), and WATER LAVA FLEX (purple). The top feature is “FLF w-HLL fo- Skewness” with the highest impurity decrease, followed by “Baseline hemoglobin” and others.

Figure 4. Radiomic and clinical features selected after cross-validation ranked by their relative importance. Features are color-coded according to their source: clinical parameters (red), and MRI-derived radiomic features from Fat LAVA-Flex (blue), Water LAVA-Flex (green), and LAVA+C (purple) imaging sequences.

On the test set, this combined model outperformed all single-modality models. The XGBoost classifier achieved the highest AUC (0.927), with accuracy of 85.5%, sensitivity of 92.3%, and specificity of 66.7%. The neural network also performed well (AUC = 0.891, accuracy = 86.8%, specificity = 73.8%). Random Forest and SVM models yielded comparable AUCs (0.929 and 0.917), while logistic regression showed slightly lower specificity (61.9%).

Figures 3I–M compares the predictive performance of clinical, CT-radiomics, MRI-radiomics, and combined models. The combined model consistently achieved the best performance, with improved accuracy (up to 86.8%) and notably higher specificity (up to 73.8%) compared to individual models. Although sensitivity remained high across all models, the combined model’s enhanced specificity suggests more precise risk stratification and reduced false positives.

Overall, integrating clinical data with MRI-derived radiomics markedly improved prediction accuracy, specificity, and overall discriminative power compared to either clinical or imaging features alone.

3.7 Subtype-specific predictive performance of the combined clinical and MRI-radiomic model

The predictive performances of the combined clinical and MRI-radiomic model for identifying severe (Grade ≥3) hematologic toxicities across four different subtypes (anemia, leukopenia, neutropenia, and thrombocytopenia) are summarized in Figure 5, Supplementary Table 2. Among these hematologic toxicities, the predictive accuracy varied notably. Random Forest achieved the highest AUCs for anemia (0.868), leukopenia (0.910), and neutropenia (0.828). However, prediction for thrombocytopenia was limited by severe class imbalance (positive:negative ≈ 1:32), resulting in low specificity (0.0) despite a moderate AUC (0.829), reflecting difficulty in correctly identifying negative cases. This was likely influenced by the class imbalance. Thrombocytopenia was the most severely imbalanced (positive: negative ratio approximately 1:32), which significantly limited the model’s ability to reliably predict true negatives, resulting in notably lower specificity and overall predictive stability.

Figure 5
Eight ROC curves compare the performance of machine learning models for diagnosing Anemia, Leukopenia, Neutropenia, and Thrombocytopenia in training (A-D) and test (E-H) sets. Each plot shows the sensitivity versus 1-specificity, with models including Logistic Regression, Neural Network, Random Forest, Support Vector Machine, and XGBoost, and associated AUC scores. Random Forest shows high AUC in all conditions.

Figure 5. Receiver operating characteristic (ROC) curves illustrating model performance in predicting specific subtypes of severe acute hematologic toxicity (anemia, leukopenia, neutropenia, thrombocytopenia) for both training (A–D) and independent test sets (E–H).

3.8 Predictive performance of the combined clinical and MRI-radiomic model stratified by treatment modality

The combined clinical and MRI-radiomic model exhibited treatment-dependent performance differences (Figure 6, Supplementary Table 3). In the test set, predictive accuracy was higher in the CRT subgroup (AUC up to 0.912) than in the RT-alone subgroup (AUC up to 0.809). Although RT-alone models showed high accuracy, specificity was consistently low (≤0.25), suggesting overfitting. In contrast, the CRT group achieved better balance between sensitivity and specificity, indicating robust discriminative capacity. These results highlight treatment-dependent differences in model performance, emphasizing that the combined MRI radiomics and clinical model is particularly beneficial for predicting hematologic toxicity risk in patients receiving concurrent chemoradiotherapy.

Figure 6
Comparison of ROC curves for various models in two groups: Radiotherapy Alone and Chemoradiotherapy. Panels A and B show training set results, with high AUC scores for models like Random Forest and Neural Network. Panels C and D display test set results, indicating model performance variations.

Figure 6. Model performance comparison stratified by treatment regimen. (A, B) ROC curves demonstrating predictive capability of five machine learning models in the training set for Radiotherapy Alone and Chemoradiotherapy groups, respectively. (C, D) ROC curves for the independent test set. (E–I) Bar charts showing detailed performance metrics (AUC, Accuracy, Sensitivity, Specificity, F1 Score) across Radiotherapy Alone group, Chemoradiotherapy group, and all patients group.

4 Discussion

In this study, we developed predictive models for acute hematologic toxicity (AHT) in gynecologic cancer patients receiving pelvic radiotherapy, leveraging both clinical factors and radiomic features of the pelvic bone marrow. Our MRI‐based radiomic model demonstrated superior predictive performance for AHT compared to the CT‐based model. For comparison, Le et al. reported that a CT radiomics model combined with clinical features achieved an AUC of approximately 0.80–0.85 in predicting severe AHT (11). In our study, further integrating MRI radiomic features with clinical parameters led to the highest overall accuracy among all models, highlighting the added value of MRI radiomic features.

These findings highlight the value of MRI in capturing biologically relevant information that CT cannot provide. MRI offers better soft-tissue contrast and enables differentiation between active and fatty marrow through specialized sequences (e.g., LAVA+C, Water and Fat LAVA-Flex), which together provide detailed information on marrow composition, vascularity, and fat content. Radiomic features extracted from these sequences—such as intensity statistics and texture metrics—capture spatial heterogeneity and microstructural variations within the marrow, reflecting its hematopoietic capacity more precisely than CT-based density features (1315, 26). Prior studies support this advantage: Carmona et al. showed that MRI fat fraction quantification is highly sensitive to bone marrow compositional changes during chemoradiotherapy and that these changes correlate strongly with declines in peripheral blood counts (13). While these MRI sequences can quantify marrow fat content directly, radiomic analysis adds value by capturing spatial heterogeneity, texture, and signal distribution within the marrow. Instead of relying solely on global fat fraction values, radiomic features—including first-order statistics (e.g., entropy, uniformity) and higher-order texture metrics (e.g., GLCM, GLRLM)—detect subtle microregional variations in marrow architecture that may reflect differential hematopoietic capacity. This complementary information enables a more nuanced assessment of radiosensitivity and improves predictive modeling. Prior studies support this advantage: Qin et al. demonstrated that functional MRI radiomic features can detect microscopic marrow changes across various dose levels, providing an objective basis for bone-marrow sparing strategies (15). In contrast, CT-based radiomics primarily capture density and structural variations that do not directly reflect bone marrow reserve or function. Thus, MRI’s ability to probe the marrow’s fat content and microarchitecture gives it a biologic edge in predicting hematologic toxicity. Notably, our feature selection identified several MRI-derived texture and intensity features (from fat and water images) related to marrow fat distribution, supporting the notion that the spatial arrangement of active marrow is a critical predictor of hematologic tolerance.

Our findings also concord with the emerging evidence that MRI-defined active bone marrow is the key subvolume driving hematologic toxicity. For example, Ke et al. delineated “red marrow” on pelvic MRI and found that higher radiation doses to this active marrow (such as bone marrow V15) were significantly associated with increased risk of acute HT (27). This suggests that MRI can localize the functionally important marrow regions that should be preferentially spared. In our study, even without explicit sparing, the MRI radiomic model may inherently focus on these active regions (through its features), explaining its stronger predictive power. Together, these considerations provide a compelling technical and biological rationale for why the MRI-based model outperforms the CT-based model in forecasting hematologic toxicity.

The predictive utility of our models was more pronounced in patients receiving CRT than in those undergoing RT alone. The MRI model maintained robust performance in the CRT subgroup, correctly stratifying high-risk patients, whereas prediction in the RT-alone subgroup was more modest. This trend aligns with recent findings from a large cervical cancer cohort, where AHT risk models performed significantly better in CRT patients than in RT-only patients (28). Since MRI scans were acquired prior to treatment, the radiomic features likely reflect preexisting differences in bone marrow reserve or function. These baseline differences may have a greater clinical impact in patients receiving CRT, where the hematologic stress is more severe, thereby amplifying the consequences of underlying marrow vulnerability. In contrast, RT alone exerts milder marrow suppression, and preexisting differences may not translate into significant toxicity. This may explain the stronger predictive power observed in the CRT subgroup, and underscores the potential of MRI radiomics as a pre-treatment biomarker of marrow resilience under intensive therapy.

The superior performance of the predict model has practical significance. Early identification of patients at high risk for severe hematologic toxicity could enable personalized interventions to maintain treatment intensity. For instance, patients flagged by the combined model as high-risk might benefit from prophylactic measures such as bone marrow–sparing radiation techniques, dose reduction of concurrent chemotherapy, or growth factor support (e.g., Granulocyte Colony-Stimulating Factor), in order to prevent interruptions in therapy. AHT is known to compromise treatment efficacy and prolong the overall treatment time (29); therefore, a reliable prediction model is clinically valuable to trigger timely supportive care. Our combined model – which incorporated MRI radiomic features along with key clinical factors (notably baseline hemoglobin and other blood counts) – achieved the highest discrimination. This reinforces that a multifactorial approach is optimal: integrating radiomic signatures with clinical variables improves predictive accuracy relative to radiomics or clinical data alone (26). In practice, such a model could be deployed as a nomogram or decision tool before treatment to stratify patients by toxicity risk. High-risk patients could then be closely monitored and managed proactively, whereas low-risk patients may proceed with standard treatment protocols. Ultimately, this strategy could reduce unplanned dose delays and ensure patients complete chemoradiation on schedule, potentially improving oncologic outcomes.

We acknowledge several limitations in our study. First, the sample size is moderate and drawn from a single institution; larger multi-center validations are warranted to ensure the model’s generalizability. Radiomics models can be prone to overfitting, especially with high-dimensional feature spaces – we mitigated this via feature selection and cross-validation, but external confirmation remains essential. Subtype prediction, particularly for rare toxicities like grade ≥3 thrombocytopenia, was challenged by severe class imbalance and limited events, affecting specificity. Larger, balanced datasets and rebalancing strategies are needed for improved modeling of these subgroups. Additionally, our models did not explicitly include advanced dose metrics beyond the basic dose-volume variables – incorporating “dosiomic” features (radiomics of the 3D dose distribution) alongside imaging features is an intriguing approach that could further improve prediction.

Going forward, prospective trials should be considered to test whether acting on model predictions can indeed reduce hematologic toxicity. For example, a trial could stratify patients by a baseline MRI radiomics risk score and assign high-risk patients to an intensified supportive care or marrow-sparing plan. Such prospective validation would establish the clinical utility of our combined model beyond a retrospective proof-of-concept. In parallel, exploration of AI explainability methods might help illuminate which imaging phenotypes drive toxicity risk, enhancing clinician trust in the model’s predictions.

Our study demonstrates that a combined clinical and MRI-radiomic model accurately predicts acute hematologic toxicity in patients undergoing pelvic radiotherapy. MRI radiomics outperformed CT-based models, especially in chemoradiotherapy-treated patients, highlighting MRI’s potential as a noninvasive imaging biomarker. Future validation and integration into personalized treatment planning may significantly reduce treatment-related hematologic complications.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by Ethics Committee of the Women’s Hospital, School of Medicine, Zhejiang University. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because this was a retrospective study using de-identified data. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author contributions

LL: Project administration, Visualization, Validation, Formal analysis, Writing – original draft, Conceptualization, Investigation, Methodology. JW: Project administration, Writing – review & editing, Funding acquisition, Visualization, Investigation, Validation, Resources. HX: Investigation, Writing – review & editing, Project administration. BC: Writing – review & editing, Funding acquisition, Project administration. QT: Investigation, Validation, Writing – review & editing, Supervision. HW: Conceptualization, Validation, Supervision, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research and/or publication of this article. This study was funded by Natural Science Foundation of Zhejiang Province (No. ZCLTGY24H1602 and LQ23H160033).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2025.1644053/full#supplementary-material

Abbreviations

HT, Hematologic Toxicity; AHT, Acute Hematologic Toxicity; RT, Radiotherapy; CRT, Chemoradiotherapy; IMRT, Intensity-Modulated Radiotherapy; MRI, Magnetic Resonance Imaging; CT, Computed Tomography; ROI, Region of Interest; GLCM, Gray-Level Co-occurrence Matrix; GLRLM, Gray-Level Run-Length Matrix; GLSZM, Gray-Level Size Zone Matrix; GLDM, Gray-Level Dependence Matrix; NGTDM, Neighboring Gray-Tone Difference Matrix; CTCAE, Common Terminology Criteria for Adverse Events; LASSO, Least Absolute Shrinkage and Selection Operator; LR, Logistic Regression; RF, Random Forest; SVM, Support Vector Machine; XGB (XGBoost), Extreme Gradient Boosting; NN, Neural Network; ROC, Receiver Operating Characteristic; AUC, Area Under the Curve; WBC, White Blood Cell; CTV, Clinical Target Volume; OARs, Organs at Risk; EBRT, External Beam Radiation Therapy; FIGO, International Federation of Gynecology and Obstetrics; RTOG, Radiation Therapy Oncology Group.

References

1. Konnerth D, Gaasch A, Zinn A, Rogowski P, Rottler M, Walter F, et al. Hematologic toxicity and bone marrow-sparing strategies in chemoradiation for locally advanced cervical cancer: a systematic review. Cancers (Basel). (2024) 16:1842. doi: 10.3390/cancers16101842

PubMed Abstract | Crossref Full Text | Google Scholar

2. Huang J, Gu F, Ji T, Zhao J, and Li G. Pelvic bone marrow sparing intensity-modulated radiotherapy reduces the incidence of hematologic toxicity in cervical cancer patients receiving concurrent chemoradiotherapy: a single-center randomized trial. Radiat Oncol. (2020) 15:180. doi: 10.1186/s13014-020-01606-3

PubMed Abstract | Crossref Full Text | Google Scholar

3. Kurotaki D, Uede T, and Tamura T. Functions and development of red pulp macrophages. Microbiol Immunol. (2015) 59:55–62. doi: 10.1111/1348-0421.12228

PubMed Abstract | Crossref Full Text | Google Scholar

4. Mauch P, Constine L, Greenberger J, Knospe W, Sullivan J, Liesveld JL, et al. Hematopoietic stem cell compartment: acute and late effects of radiation therapy and chemotherapy. Int J Radiat Oncol Biol Phys. (1995) 31:1319–39. doi: 10.1016/0360-3016(94)00430-S

PubMed Abstract | Crossref Full Text | Google Scholar

5. Hara JHL, Jutzy JMS, Arya R, Kothari R, McCall AR, Howard AR, et al. Predictors of acute hematologic toxicity in women receiving extended-field chemoradiation for cervical cancer: do known pelvic radiation bone marrow constraints apply? Adv Radiat Oncol. (2022) 7:100998. doi: 10.1016/j.adro.2022.100998

PubMed Abstract | Crossref Full Text | Google Scholar

6. Kumar T, Schernberg A, Busato F, Magne N, Mazeron R, Chargari C, et al. Correlation between pelvic bone marrow radiation dose and acute hematological toxicity in cervical cancer patients treated with concurrent chemoradiation. Cancer Manag Res. (2019) 11:6285–97. doi: 10.2147/CMAR.S195989

PubMed Abstract | Crossref Full Text | Google Scholar

7. Liang Y, Messer K, Rose BS, Lewis JH, Jiang SB, Yashar CM, et al. Impact of bone marrow radiation dose on acute hematologic toxicity in cervical cancer: principal component analysis on high-dimensional data. Int J Radiat Oncol Biol Phys. (2010) 78:912–9. doi: 10.1016/j.ijrobp.2009.11.062

PubMed Abstract | Crossref Full Text | Google Scholar

8. Yan K, Ramirez E, Xie XJ, Gu X, Xi Y, and Albuquerque K. Predicting severe hematologic toxicity from extended-field chemoradiation of para-aortic nodal metastases from cervical cancer. Pract Radiat Oncol. (2018) 8:13–9. doi: 10.1016/j.prro.2017.07.001

PubMed Abstract | Crossref Full Text | Google Scholar

9. Lambin P, Leijenaar RTH, Deist TM, Peerlings J, de Jong EEC, van Timmeren J, et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol. (2017) 14:749–62. doi: 10.1038/nrclinonc.2017.141

PubMed Abstract | Crossref Full Text | Google Scholar

10. Gillies RJ, Kinahan PE, and Hricak H. Radiomics: images are more than pictures, they are data. Radiology. (2016) 278:563–77. doi: 10.1148/radiol.2015151169

PubMed Abstract | Crossref Full Text | Google Scholar

11. Le Z, Wu D, Chen X, Wang L, Xu Y, Zhao G, et al. A radiomics approach for predicting acute hematologic toxicity in patients with cervical or endometrial cancer undergoing external-beam radiotherapy. Radiother Oncol. (2023) 182:109489. doi: 10.1016/j.radonc.2023.109489

PubMed Abstract | Crossref Full Text | Google Scholar

12. Ren K, Shen L, Qiu J, Sun K, Chen T, Xuan L, et al. Treatment planning computed tomography radiomics for predicting treatment outcomes and hematologic toxicities in locally advanced cervical cancer treated with radiotherapy: a retrospective cohort study. BJOG. (2023) 130:222–30. doi: 10.1111/1471-0528.17285

PubMed Abstract | Crossref Full Text | Google Scholar

13. Carmona R, Pritz J, Bydder M, Gulaya S, Zhu H, Williamson CW, et al. Fat composition changes in bone marrow during chemotherapy and radiation therapy. Int J Radiat Oncol Biol Phys. (2014) 90:155–63. doi: 10.1016/j.ijrobp.2014.05.041

PubMed Abstract | Crossref Full Text | Google Scholar

14. Lawless M, Byrns K, Bednarz BP, Meudt J, Shanmuganayagam D, Shah J, et al. Feasibility of identifying proliferative active bone marrow with fat fraction MRI and multi-energy CT. Phys Med Biol. (2024) 69:135013. doi: 10.1088/1361-6560/ad58a0

PubMed Abstract | Crossref Full Text | Google Scholar

15. Qin X, Wang C, Gong G, Wang L, Su Y, Yin Y, et al. Functional MRI radiomics-based assessment of pelvic bone marrow changes after concurrent chemoradiotherapy for cervical cancer. BMC Cancer. (2022) 22:1149. doi: 10.1186/s12885-022-10254-7

PubMed Abstract | Crossref Full Text | Google Scholar

16. Fedorov A, Beichel R, Kalpathy-Cramer J, Finet J, Fillion-Robin JC, Pujol S, et al. 3D Slicer as an image computing platform for the Quantitative Imaging Network. Magn Reson Imaging. (2012) 30:1323–41. doi: 10.1016/j.mri.2012.05.001

PubMed Abstract | Crossref Full Text | Google Scholar

17. Van Griethuysen JJ, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. (2017) 77:e104–7. doi: 10.1158/0008-5472.CAN-17-0339

PubMed Abstract | Crossref Full Text | Google Scholar

18. Breiman L. Random forests. Mach Learn. (2001) 45:5–32. doi: 10.1023/A:1010933404324

Crossref Full Text | Google Scholar

19. Zwanenburg A, Vallières M, Abdalah MA, Aerts HJWL, Andrearczyk V, Apte A, et al. The Image Biomarker Standardization Initiative: standardized quantitative radiomics for high-throughput image-based phenotyping. Radiology. (2020) 295:328–38. doi: 10.1148/radiol.2020191145

PubMed Abstract | Crossref Full Text | Google Scholar

20. National Cancer Institute. Common terminology criteria for adverse events (CTCAE) version 5.0. Bethesda, MD: NIH (2017).

Google Scholar

21. Friedman J, Hastie T, and Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J Stat Software. (2010) 33:1–22. doi: 10.18637/jss.v033.i01

PubMed Abstract | Crossref Full Text | Google Scholar

22. Bewick V, Cheek L, and Ball J. Statistics review 14: logistic regression. Crit Care. (2005) 9:112–8. doi: 10.1186/cc3045

PubMed Abstract | Crossref Full Text | Google Scholar

23. Noble WS. What is a support vector machine? Nat Biotechnol. (2006) 24:1565–7. doi: 10.1038/nbt1206-1565

PubMed Abstract | Crossref Full Text | Google Scholar

24. Chen T and Guestrin C. XGBoost: a scalable tree boosting system. In: Proc 22nd ACM SIGKDD int conf knowl discov data min. New York, NY, USA: Association for Computing Machinery (2016). p. 785–94. doi: 10.1145/2939672.2939785

Crossref Full Text | Google Scholar

25. Jiang J, Trundle P, and Ren J. Medical image analysis with artificial neural networks. Comput Med Imaging Graph. (2010) 34:617–31. doi: 10.1016/j.compmedimag.2010.07.003

PubMed Abstract | Crossref Full Text | Google Scholar

26. Bainbridge A, Bray TJP, Sengupta R, and Hall-Craggs MA. Practical approaches to bone marrow fat fraction quantification across magnetic resonance imaging platforms. J Magn Reson Imaging. (2020) 52:298–306. doi: 10.1002/jmri.27039

PubMed Abstract | Crossref Full Text | Google Scholar

27. Ke X, Zang C, Wu A, and Liu Y. Predicting the incidence of hematologic toxicity in concurrent chemoradiotherapy for cervical cancer using pelvic MRI imaging. J Cancer Res Ther. (2023) 19:1453–60. doi: 10.4103/jcrt.jcrt_205_24

PubMed Abstract | Crossref Full Text | Google Scholar

28. Yue H, Li X, You J, Li R, Ren J, Wang M, et al. Acute hematologic toxicity prediction using dosimetric and radiomics features in patients with cervical cancer: does the treatment regimen matter? Front Oncol. (2024) 14:1365897. doi: 10.3389/fonc.2024.1365897

PubMed Abstract | Crossref Full Text | Google Scholar

29. Zou D, Guo M, and Zhou Q. A clinical study of pegylated recombinant human granulocyte colony stimulating factor (PEG-rhG-CSF) in preventing neutropenia during concurrent chemoradiotherapy of cervical cancer. BMC Cancer. (2016) 16:658. doi: 10.1186/s12885-021-08364-9

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: hematologic toxicity, radiomics, gynecologic cancer, magnetic resonance imaging, radiotherapy, machine learning

Citation: Luo L, Wang J, Xie H, Chen B, Wang H and Tang Q (2025) Combined clinical and MRI-based radiomics model for predicting acute hematologic toxicity in gynecologic cancer radiotherapy. Front. Oncol. 15:1644053. doi: 10.3389/fonc.2025.1644053

Received: 09 June 2025; Accepted: 21 July 2025;
Published: 08 August 2025.

Edited by:

Dorothy Lombe, Treatment and Support Services MidCentral, New Zealand

Reviewed by:

Mirek Fatyga, Mayo Clinic Arizona, United States
Yoichi Watanabe, University of Minnesota Twin Cities, United States

Copyright © 2025 Luo, Wang, Xie, Chen, Wang and Tang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Qiu Tang, cWl1dGFuZ0B6anUuZWR1LmNu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.