Development and validation of machine learning-based MRI radiomics models for preoperative lymph node staging in T3 rectal cancer

Qubie, Xuelei; Chen, Weijuan; Chen, Jun; Ma, Jiangqin; Wei, Xin; Gu, Xiling; Zhang, Wei; He, Xiaojing

doi:10.3389/fonc.2025.1610892

ORIGINAL RESEARCH article

Front. Oncol., 08 September 2025

Sec. Gastrointestinal Cancers: Colorectal Cancer

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1610892

This article is part of the Research TopicRadiomics and AI-Driven Deep Learning for Cancer Diagnosis and TreatmentView all 20 articles

Development and validation of machine learning-based MRI radiomics models for preoperative lymph node staging in T3 rectal cancer

Xuelei Qubie^1,2†

Weijuan Chen^1†

Jun Chen³

Jiangqin Ma¹

Xin Wei¹

Xiling Gu⁴

Wei Zhang^2*

Xiaojing He^1*

¹Department of Radiology, The Second Affiliated Hospital of Chongqing Medical University, Chongqing, China
²Department of Radiology, Sichuan Provincial Corps Hospital, Chinese People’s Armed Police Forces, Leshan, China
³Optical Engineering, Beijing Institute of Technology, Beijing, China
⁴Department of Pathology, The Second Affiliated Hospital of Chongqing Medical University, Chongqing, China

Objective: The present research aimed to evaluate the diagnostic performance of a magnetic resonance imaging (MRI)-based radiomics model for predicting lymph node staging in patients with stage T3 rectal cancer (RC).

Methods: This retrospective study included 225 patients with RC who underwent surgical resection without neoadjuvant therapy treatment. Radiomics features were extracted from high-resolution T2-weighted imaging (T2WI) of primary tumor. Feature selection was performed using the least absolute shrinkage and selection operator (LASSO) algorithm. Five machine learning classifiers were employed to construct radiomics signatures differentiating between N0/N1 (low nodal burden) and N2 (high nodal burden) stages prediction in the training cohort. The predictive performance of each classifier was evaluated using receiver operating characteristic curve analysis, with area under the curve (AUC) comparisons conducted via DeLong’s test. Decision curve analysis (DCA) and calibration curves were utilized to assess the clinical utility and calibration performance of the developed models, respectively.

Results: A total of 1,746 radiomics features were extracted from the imaging data, of which 16 features were selected to construct a radiomics signature for lymph node staging in RC. The logistic regression classifier demonstrated the best predictive performance, achieving an AUC of 0.900 [95% confidence interval (CI), 0.848–0.952] in the training cohort. The model’s robustness was further validated in the test cohort, with an AUC of 0.876 (95% CI, 0.765–0.986). DCA confirmed the clinical utility of the model.

Conclusions: The radiomics model based on high-resolution T2WI provided an effective and noninvasive approach for preoperatively differentiating between N0/1 and N2 stages in stage T3 RC.

1 Introduction

Rectal cancer (RC) is one of the most prevalent malignant tumors of the digestive tract. It ranks as the third leading cause of cancer-related mortality worldwide (1). A substantial proportion of patients present with locally advanced RC (LARC) at initial diagnosis (2). According to the National Comprehensive Cancer Network guidelines (3), the standard treatment paradigm for LARC includes neoadjuvant chemoradiotherapy followed by total mesorectal excision (TME) after a 5–12-week interval, with optional postoperative adjuvant chemotherapy. However, a recent multicenter randomized trial demonstrated that a neoadjuvant chemotherapy-only regimen in patients with LARC at a relatively low risk of recurrence (T2N1/2, T3N0/N1) is not inferior to preoperative chemoradiotherapy in terms of disease-free survival and local recurrence rates (4). This finding underscores the clinical imperative of accurately identifying T3N0/N1 RC preoperatively, as it enables tailored, risk-adapted treatment strategies and potentially mitigates unnecessary treatment-related toxicity.

Magnetic resonance imaging (MRI) serves as the preferred method for RC staging, achieving exceptional T-stage classification accuracy of 88–99% through high-resolution soft tissue characterization (5). Nevertheless, its diagnostic performance in classifying lymph node metastasis (LNM) remains suboptimal (accuracy: <80%) and is constrained by its reliance on size-morphology criteria with inherent limitations (6). Pathological analyses revealed that 28% of metastatic nodes measure ≤3 mm in short-axis diameter (7), fundamentally challenging conventional size thresholds. While functional MRI techniques, particularly diffusion-weighted imaging (DWI), show potential for improved nodal characterization through cellularity assessment via apparent diffusion coefficient (ADC) mapping (8), their diagnostic performance remains moderate (accuracy: 66%, sensitivity: 53%, and specificity: 82%), Significant ADC value overlap between reactive and malignant nodes necessitates complementary diagnostic strategies (9).

Radiomics enables the extraction of high-dimensional quantitative features from medical images. It has emerged as a powerful tool for overcoming conventional imaging limitations (10) and may serve as a valuable adjunct in assessing LNM in RC (11). However, most existing studies have primarily focused on presence vs. absence of LNM (12), with limited exploration of predictive models in terms of specific lymph node staging (e.g., N0/N1 vs. N2). The present study aimed to develop and validate a radiomics model based on high-resolution T2-weighted MRI to differentiate between N0/N1 and N2 stages in stage T3 RC patients using multiple machine learning algorithms. It also sought to establish the clinical utility of radiomics-based nodal staging as a noninvasive diagnostic tool by systematically evaluating the predictive performance of various models. A more precise lymph node staging framework may enable risk-adapted treatment strategies, such as chemotherapy-based regimens for low-risk patients (e.g., T3N0/N1), reduce unnecessary radiotherapy exposure, minimize the risk of overtreatment, and ultimately improve patient outcomes and quality of life.

2 Materials and methods

2.1 Patients

The present study initially enrolled 287 patients who underwent radical RC resection between September 2019 and March 2024. A retrospective analysis of their preoperative clinical and imaging data was carried out. The inclusion criteria were as follows: (1) postoperative pathology confirmed stage T3 RC and (2) completion of pelvic MRI within 2 weeks preceding surgery with confirmed negative circumferential resection margin. Exclusion criteria comprised: (1) incomplete MRI sequences or suboptimal image quality, (2) concurrent presence of other malignancies, (3) neoadjuvant chemoradiotherapy or other preoperative treatment regimens, and (4) incomplete clinicopathologic data. A total of 225 eligible patients were included in the final analysis after completing the screening process. The study population was randomly stratified into training (n = 157) and testing (n = 68) cohorts at a 7:3 ratio, with the detailed screening process illustrated in Figure 1.

Figure 1

Flowchart depicting patient enrollment for a study from September 2019 to March 2024. Out of 287 initial patients with rectal cancer, exclusions were made for reasons such as poor HRMRI quality, prior chemoradiotherapy, other tumors, or incomplete data. This resulted in 225 patients enrolled. They were split into a training cohort of 157 patients (126 with N0/1 and 31 with N2) and a test cohort of 68 patients (55 with N0/1 and 13 with N2).

Figure 1. Flow chart of inclusion and exclusion criteria.

The retrospective analysis was approved by the ethics committee of the Second Affiliated Hospital of Chongqing Medical University(Approval No.: 2024-79). The requirement for informed consent was waived.

2.2 Clinicopathological characteristics

Patient clinical characteristics were extracted from electronic medical records and comprised demographic data, such as age and sex. Tumor marker levels, including those of carcinoembryonic antigen (CEA) and carbohydrate antigen 19-9 (CA19-9), were also recorded. The normal reference ranges were defined as 0–5 ng/mL for CEA and 0–37 U/mL for CA19-9 (13).

All patients in the study underwent TME. Histopathological evaluation of stage T3 tissue specimens was performed by a pathologist with 15 years of experience. Tumor staging was conducted according to the Tumor-Node-Metastasis classification system outlined in the eighth edition of the American Joint Committee on Cancer Staging Manual (14). Specifically, LNM was categorized as follows: N0 indicated no regional LNM, N1 denoted metastasis in 1–3 regional lymph nodes, and N2 represented metastasis in four or more regional lymph nodes. Patients were stratified into the following two groups based on pathological criteria: N0–1 (low nodal burden) and N2 (high nodal burden) stages, reflecting distinct histological grades.

2.3 MRI data acquisition

All patients underwent rectal MRI examinations using a 3.0-T scanner (Magnetom Prisma, Siemens Healthineers, Erlangen, Germany) equipped with an 18-channel surface phased array coil. The patients fasted for 4 h prior to the examination and performed bowel preparation with a glycerol enema (20 mL). The standard rectal MRI protocols, including sagittal T2-weighted imaging (T2WI), oblique axial T2WI, coronal T2WI, and DWI with two b-factor (0 and 1,000 s/mm²) sequences, were conducted. The oblique axial T2WI sequence was determined in the sagittal position, which was perpendicular to the long axis of the rectal tumor according to the following parameters: field of view of 250 mm×250 mm, repetition time of 1,700 ms, echo time of 92 ms, slice thickness of 1.2 mm, flip angle of 90°, and acquisition matrix of 320×320.

2.4 Image evaluation

To explore the radiologic markers, two radiologists with 15 and 20 years of experience in abdominal imaging diagnosis evaluated the circumferential resection margin (CRM), extramural vascular invasion (EMVI), and distance from the anal verge. Specifically, CRM was considered to be the distance between the tumor, lymph nodes, or other lesions and the mesorectal fascia ≤1 mm (15). EMVI status was assessed using the 0–4-point grading system proposed by Jhaveri et al. (16). Patients with scores of 0–2 were classified as EMVI-negative, while others were EMVI-positive. Tumor location was measured on the approximate luminal center of the rectum on the sagittal T2WI sequence and categorized as low (0–5 cm), middle (5.1–10 cm), or high (10.1–15 cm) according to the distance from the anal verge to the lowest edge of the tumor (17).

2.5 Tumor segmentation

The MRI datasets were anonymized and transferred from the Picture Archiving and Communication System to a dedicated offline workstation for segmentation and subsequent analysis. Regions of interest (ROIs) were manually delineated using ITK-SNAP software (version 4.1; http://www.itksnap.org). A radiologist with 15 years of experience in abdominal imaging diagnosis outlined an ROI along the periphery of the primary rectal tumor on sequential images in oblique axial high-resolution T2WI, excluding obvious necrosis, gas, and lumen content areas. The corresponding volumetric regions of interest (VOIs) were subsequently automatically generated. The segmented VOIs were then reviewed and modified by another radiologist with 20 years of experience in abdominal imaging diagnosis in order to ensure accuracy. The radiologists were unaware of both the clinical outcomes and histopathological results. Any discrepancies in their interpretations were resolved through collaborative discussion.

2.6 Radiomics feature extraction

PyRadiomics software (https://github.com/Radiomics/pyradiomics) was utilized to extract a total of 1,746 radiomics features from MRI results. The radiomics features were divided into seven groups as follows: shape, first-order, gray-level co-occurrence matrix (GLCM), gray-level dependence matrix (GLDM), gray-level run length matrix (GLRLM), gray-level size zone matrix (GLSZM), and neighborhood gray-tone difference matrix (NGTDM). These quantitative radiomics features were extracted from the original, Laplacian of Gaussian (LoG), and wavelet images, which were obtained from eight decompositions after wavelet filtering. High (H)- or low (L)-pass filter application in three dimensions resulted in eight combinations as follows: LHL, HHL, HLL, HHH, HLH, LHH, LLH, and LLL. LoG images were generated by a LoG filter with a sequence of sigma values. Low and high sigma values emphasized fine and coarse textures in LoG images, respectively. Sigma values of 2, 3, 4, and 5 were utilized in the study.

2.7 Feature selection and model construction

A three-step procedure was performed for dimensionality reduction of radiomics features. First, radiomics features with a variance of >1.0 were selected. Second, analysis of variance was carried out in order to select the statistical influence feature. The radiomics features were available after applying the least absolute shrinkage and selection operation (LASSO) regression method, which was used to select the N-stage classification-related features with non-zero coefficients from the training cohort. The radiomics score (rad-score) was computed for each patient after feature selection utilizing the LASSO regression with a combination of selected features weighted by their respective coefficients. Five machine learning models [logistic regression (LR), support vector machine, (SVM), Bernoulli Naïve Bayes, ridge, and stochastic gradient descent (SGD)] were developed to fully exploit the potential of the remaining radiomics features. The grid search and five-fold cross-validation algorithm were used in the training dataset to select the optimal model hyperparameters. The model with the best cross-validation performance was used for further analysis. Both feature selection and radiomics signature development were performed in the training cohort. The performance of the obtained radiomics signature was evaluated using an inter-validation cohort, which was not employed for model development. Stratified cross-validation was implemented, employing a stratified sampling approach to preserve consistent class distribution across all data subsets.

The processes of tumor segmentation, feature extraction, feature selection, and model validation are shown in Figure 2.

Figure 2

Flowchart detailing a three-step process for medical image analysis. On the left, “Image segmentation” shows MRI scans with highlighted areas. In the center, “Feature Extraction and Selection” lists statistical features used, such as GLRLM and GLCM. On the right, “Model Construction and Validation” lists machine learning techniques like logistic regression and SVM, accompanied by various performance graphs and ROC curves.

Figure 2. The framework for the radiomics workflow.

2.8 Statistical analysis

R software (version 3.5.3, http://www.R-project.org) and Python software (version 3.7.12, http://www.Python.org); were used to perform statistical analyses and model construction. Categorical variables were expressed as frequencies (percentages), and continuous variables were presented as mean ± standard deviation (SD) for normalization distribution and medians (25% quantile, 75% quantile) for other variables. Categorical variables were analyzed using a χ2 or Fisher’s exact test. The Kolmogorov-Smirnov method was used to test the normality of all measurement data. An independent sample t-test or Mann-Whitney U test was used to measure statistical differences. Receiver operating characteristic (ROC) curve, sensitivity, and specificity analyses were performed to compare the performance of five machine learning models. To evaluate model performance under class imbalance, the F1-score was employed. The Matthews correlation coefficient (MCC), calculated from the four categories of the confusion matrix (true positives, false positives, true negatives, false negatives), was used for binary outcome assessment. The DeLong’s test was used to compare the models’ discrimination abilities. Calibration analysis and the Hosmer-Lemeshow test were utilized to examine the agreement between the observed N stage and prediction probabilities. Decision curve analysis (DCA) was performed to determine the net benefits in clinical application of the constructed models. A p-value of < 0.05 was considered statistically significant.

3 Results

3.1 Clinical baseline characteristics

A total of 225 patients with stage T3 RC were included in the study. The cohort comprised 90 female and 135 male patients with an age range of 24–85 years. Among them, 181 cases were in N0/N1 stage, and 44 cases were in N2 stage. No statistically significant difference was found between the two groups in terms of clinical characteristics. Detailed patient characteristics and statistical results are shown in Table 1.

Table 1

Table 1. Patients’ clinical characteristics.

3.2 Models construction and validation

3.2.1 Model construction

A total of 1,746 radiomics features were successfully extracted from T2W images for each patient. LASSO regression analysis was utilized to select radiomics features with coefficients of >0, resulting in a final retention of 16 features, as shown in Figure 3. The detailed feature names and their corresponding rad-score values are listed in Supplementary Figure 1,including 15 texture (two GLRLM, two GLCM, three NGTDM, three GLDM, and six GLSZM) and one first-order features. The features were significantly different between the N0/1 and N2 groups (all p < 0.05), except for feature F9 (wavelet-HHH_firstorder_Mean) (Supplementary Figure 2).

Figure 3

Three-panel graph showing coordinated descent results with varying alpha values: A) Mean square error across folds, decreasing as log(alpha) increases. Solid line indicates average. B) Coefficients on each fold, with different lines showing diverse trends as alpha changes. C) MSE deviation with decreasing trend, red line indicates CV estimate for alpha. Dashed vertical lines highlight specific alpha values.

Figure 3. (A) Five-fold cross-validation results for alpha selection. The optimal alpha is marked by the dashed line. (B) Coefficient values corresponding to the optimal α value and selected features with non-zero coefficients. (C) MSE deviation on each fold using coordinate descent in five-fold cross-validation.

Five different radiomics signature models for predicting lymph node staging were then constructed using the above selected features based on LR, SVM, Bernoulli Naïve Bayes, ridge, and SGD classifiers in the training dataset.

3.2.2 Predictive performance and validation of the model

Table 2 summarizes the five models’ sensitivity, specificity, F1-Score, MCC, PPV, NPV, accuracy, and AUC data, with the corresponding ROC curves depicted in Figure 4. Among the five machine learning classifiers, the LR model performed the best in both the training and test sets, with respective AUC values of 0.900 [95% confidence interval (CI), 0.848–0.952] and 0.876 (95% CI, 0.765–0.986). The corresponding accuracy values across the two cohorts were 0.847 (95% CI, 0.843–0.852) and 0.882 (95% CI, 0.873–0.892).

Table 2

Table 2. Radiomics signature model performance.

Figure 4

ROC curves for logistic models. Panel A shows the training cohort with the logistic model achieving an AUC of 0.9, followed by Ridge 0.887, SVM 0.875, BernoulliNB 0.845, and SGD 0.82. Panel B displays the test cohort with the logistic model at AUC 0.876, followed by SVM 0.859, Ridge 0.835, BernoulliNB 0.78, and SGD 0.764. Sensitivity is plotted against 1-Specificity.

Figure 4. ROC curves based on five machine learning models in the training and testing cohorts.

Differences in the AUCs among the five models were compared using the DeLong’s test. The LR model significantly outperformed the Bernoulli Naïve Bayes and SGD models in the training cohort (p < 0.05). There was a significant difference in AUC values between the LR and Bernoulli Naïve Bayes models in the test cohort (p < 0.05). Further details are provided in Supplementary Table 1.

The rad-scores in the training cohort were significantly higher in the N2 group compared to those in the N0/1 group, which was consistently validated in the test cohort, as shown in Figure 5. The Radscores derived from five classifier for each patient in the training and test cohort datasets are depicted in Supplementary Figure 3. LR and SVM demonstrated robust discriminative performance in both the training and test cohorts.

Figure 5

Box plots comparing scaled values for different machine learning models: Logistic, SVM, Bernoulli Naïve Bayes, Ridge, and SGD. Panel A shows significant differences with stars indicating statistical significance; blue boxes represent class 0, orange boxes class 1. Panel B displays a similar pattern with slight variations in model performance.

Figure 5. (A, B) Boxplots of corresponding radiomics scores in the training and testing cohorts. 0 (blue) represents N0/1 group, 1 (orange) represents N2 group. Asterisk (*) indicates level of statistical significance between categories, with more asterisks representing a higher level of significance. ("***p < 0.001, **p < 0.01, *p < 0.05").

3.2.3 Apparent performance and clinical use of the radiomics signature model

The calibration curve of the radiomics models for the predicted lymph node staging in stage T3 RC showed good agreement between the observed outcomes and predicted probabilities in all datasets (Supplementary Figure 4). The p-values obtained from the Hosmer-Lemeshow test were all >0.05 and not statistically significant. The DCA of the radiomics signature models is presented in Figure 6. The DCA showed satisfactory positive benefits of the nomogram on most of the threshold probabilities, indicating a favorable potential clinical effect of the models.

Figure 6

Two graphs labeled A and B compare the standardized net benefit against high-risk thresholds for different machine learning models. Each line represents a model: Logistic, SVM, Bernoulli Naïve Bayes, Ridge, SGD, All, and None, distinguished by color. Both graphs show similar trends with lines generally decreasing as thresholds increase, reflecting changes in net benefit across models.

Figure 6. DCA for five models in the training and testing cohorts. (A) DCA of five models in the training cohort. (B) DCA of five models in the testing cohort.

3.2.4 Diagnostic performance of initial imaging interpretations

Among the 225 patients evaluated, initial imaging interpretations classified 157 as N0/N1 and 68 as N2. This approach achieved a sensitivity of 54.55%, specificity of 75.69%, and an overall accuracy of 71.56%. The corresponding AUC was 0.651, indicating modest discriminative performance.

4 Discussion

The lymph node status in RC is a critical factor in determining the necessity of adjuvant therapy and surgical resection (18). However, Initial MRI radiology report for lymph node (LN) staging in rectal cancer relies on subjective size-morphology criteria, often showing poor interobserver agreement (κ = 0.416) and limited accuracy (AUC: 0.60–0.62), remains challenging (19). In contrast, Dong et al. (20)demonstrated that a radiomics model achieved higher diagnostic precision (PPV: 75.9%) by quantifying tumor heterogeneity, overcoming the limitations of traditional MRI. Building upon the recently published novel perspectives and conclusions (4), an exploratory investigation was conducted in the present study. Five radiomics models were developed based on high-resolution T2WI features using five different machine learning algorithms in order to noninvasively differentiate between N0/N1 and N2 stages in stage T3 RC. The performance and clinical utility of these models were systematically compared. The results showed that the LR model exhibited the best performance in predicting lymph node staging in stage T3 RC. The clinical applicability of this model was further validated via calibration curves and DCA, indicating positive clinical benefits at most threshold probabilities.

Radiomics features can reveal subtle changes in rectal tumor lesions that are difficult to discern with the naked eye (21, 22). In the present study, 1,746 radiomics features were extracted from oblique axial T2W images. LASSO regression was subsequently utilized to select 16 key features for in-depth analysis. These included one first-order and 15 texture features, with texture features being the most abundant, accounting for 93.75% of the total. This result is consistent with those described by Li and Yin et al. (23), who demonstrated that T2WI-based texture features have high accuracy in predicting LNM in RC (AUC of 0.805), further validating the importance of texture features in tumor assessment.

GLSZM features were predominant in the present study. GLSZM quantifies the size distribution of continuous image regions, finely characterizing the distribution patterns of homogeneous regions of different sizes within tumors. Smaller regions may correspond to densely packed tumor cells or micro-nodules, while larger continuous regions may reflect necrotic areas or fibrotic changes. (24). This spatial heterogeneity is closely related to tumor proliferation activity and invasiveness. Studies have shown that the short-zone emphasis in GLSZM features is significantly negatively correlated with tumor microvascular density, while large-zone emphasis is positively correlated with stromal fibrosis (25). This ability to quantify spatial heterogeneity in tumor microstructures makes GLSZM features an important indicator for assessing tumor invasiveness and metastatic potential. Therefore, texture features can capture the microscopic heterogeneity within tumors, reflecting the spatial arrangement and gray-level distribution patterns of tumor cells (26–28). These features showed significant differences in distinguishing between N0/N1 and N2 stages (p < 0.05) in the present study, indicating that LNM is closely related to microscopic structural changes within the tumor. Additionally, the predictive model based on T2WI radiomics features was constructed using wavelet features (8/16). Wavelet transform is a multi-scale analysis method with perfect reconstruction capability, ensuring no information loss or redundancy during signal decomposition. It can decompose images into high-frequency (heterogeneity) and low-frequency (homogeneity) components, facilitating the extraction of structural information and details from the original images (29). Previous studies (30) have also reported the effectiveness of wavelet features in predicting lymph node status in T2WI. Furthermore, He et al. (31) found that wavelet features in T2WI performed well in RC tumor grading, further demonstrating that they can represent the biological behavior and heterogeneity of tumors.

A growing body of evidence underscores the value of clinical parameters as complementary predictors in radiomics-based models, with integrated frameworks often demonstrating superior discrimination compared with radiomics alone (12). For example, (32) reported that a combined clinical–radiomic model improved sensitivity (82.6% vs. 78.3%) and specificity (88.9% vs. 57.9%) relative to a purely radiomic approach, while also mitigating the subjectivity of conventional MRI interpretation through visualized risk maps. Such clinical metrics may encode systemic or tumor-related biological processes that radiomics, which predominantly captures spatial and textural heterogeneity, cannot fully represent. In our cohort, however, none of the routinely collected clinical variables demonstrated statistically significant intergroup differences across outcome categories (all P > 0.05). This absence of discriminative signal suggests that, in this specific population, these variables may have limited incremental value for the prediction task. From a modeling perspective, the inclusion of non-informative covariates in high-dimensional feature spaces risks diluting true signal, inflating model variance, and impairing generalizability—particularly in datasets of modest size. Given these considerations, and in pursuit of parsimony, we elected to exclude clinical parameters from the final model. Several factors may underlie this discrepancy with prior studies. First, differences in patient demographics, disease stage distribution, and treatment patterns between our cohort and other trials could attenuate the predictive contribution of clinical variables. Second, sample size constraints may have limited statistical power to detect subtle effects, especially for variables with low intergroup variability. Third, the endpoints examined—derived from imaging-based nodal staging—may be more tightly coupled to local morphologic features than to systemic clinical markers. These hypotheses warrant systematic evaluation in larger, multi-institutional datasets, ideally with harmonized variable definitions and broader biological characterization, to clarify the true translational potential of integrating clinical and radiomic predictors.

The widespread application of machine learning algorithms in the field of radiomics has significantly improved diagnostic performance (33). Selecting the appropriate classifier is crucial for building high-performance predictive models. The present study systematically evaluated five supervised learning models commonly used for binary classification tasks, including LR, SVM, Bernoulli Naive Bayes, ridge regression, and SGD. The results showed that the LR model performed the best, with AUCs of 0.900 and 0.876 in the training and testing sets, respectively. Additionally, the model demonstrated excellent accuracy (training set: 0.847, testing set: 0.882) and specificity (training set: 0.873, testing set: 0.927). The superior performance of the LR model could be attributed to its ability to effectively handle linearly separable data, low complexity after feature selection, and resistance to overfitting. Moreover, the LR model has strong interpretability, allowing for the quantification of each feature’s contribution to the prediction results (34), which provides important references for clinical decision-making. Similarly, Wei et al. (35) developed and validated a clinical radiomics model by combining T2W and amide proton transfer-weighted MRI radiomics features, achieving efficient LNM prediction in rectal adenocarcinoma using an LR classifier (AUCs of 0.983, 0.864, and 0.851 in the training, validation, and testing sets, respectively). Cui et al. (36) used an LR model to predict a complete pathological response in LARC and achieved an AUC of 0.90. However, the LR model is not suitable for all scenarios, as classifier performance highly depends on the distribution characteristics of the training and testing sets. Qu et al. (37) found that the SVM classifier performed best when constructing a predictive model using T2W images, with AUCs of 0.892 and 0.71 in the training and validation sets, respectively. This is because SVM can handle nonlinear relationships through kernel functions and exhibits strong classification capabilities (38).

This study has several limitations. First, this study focused on evaluating MRI-based radiomics for lymph node staging in patients undergoing upfront surgical resection for stage T3 rectal adenocarcinoma. However, select patients with low-risk, distally located tumors (e.g., T3N0/N1) achieving a clinical complete response after neoadjuvant therapy may be eligible for a watch-and-wait (W&W) organ-preserving strategy. Excluding such cases may limit the model’s applicability in settings where nonoperative management is considered. Second, the model was developed solely on T2-weighted MRI data, potentially limiting sensitivity for detecting small or morphologically subtle lesions. Future work should explore the integration of functional imaging sequences—such as diffusion-weighted imaging or contrast-enhanced MRI-within a multiparametric framework to enhance lesion conspicuity and improve diagnostic accuracy. Third, manual ROI delineation inevitably introduces interobserver and intraobserver variability, which may bias feature extraction and subsequent model performance. The adoption of automated or semi-automated segmentation algorithms could reduce subjectivity, standardize feature generation, and improve reproducibility across centers. Finally, the modest sample size and absence of external validation restrict the model’s generalizability. Rigorous validation using large, multicenter datasets with diverse patient populations is essential to confirm robustness, refine calibration, and establish the clinical utility of the proposed model.

5 Conclusion

The present study confirmed the effectiveness of a machine learning-based high-resolution T2WI radiomics model in predicting lymph node staging in stage T3 RC. Radiomics is a noninvasive assessment method that can provide a valuable alternative for lymph node staging, supporting personalized treatment decisions. It showed broad application prospects in optimizing treatment pathways, avoiding overtreatment, and improving the prognosis of patients with LARC. Future research may further advance the clinical application of this technology via multimodal imaging and large-scale validation.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Ethics statement

The studies involving humans were approved by The Second Affiliated Hospital of Chongqing Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.

Author contributions

XQ: Conceptualization, Data curation, Formal analysis, Methodology, Writing – original draft. WC: Data curation, Formal analysis, Investigation, Methodology, Writing – original draft. JC: Data curation, Formal analysis, Methodology, Writing – original draft. JM: Data curation, Methodology, Software, Writing – original draft. XW: Data curation, Methodology, Resources, Writing – original draft. XG: Data curation, Investigation, Resources, Writing – original draft. WZ: Conceptualization, Methodology, Resources, Supervision, Writing – review & editing. XH: Conceptualization, Methodology, Resources, Supervision, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2025.1610892/full#supplementary-material

Supplementary Figure 1 | Histogram of radiomics scores based on selected features.

Supplementary Figure 2 | Boxplots of 16 radiomics features in N0/1 and N2 groups.

Supplementary Figure 3 | Radiomics scores derived from five models for each patient. 0 (blue) represents N0/1 group, 1 (orange) represents N2 group.

Supplementary Figure 4 | Calibration performance of five predictive models in two independent cohorts.

References

1. Bray F, Laversanne M, Sung H, Ferlay J, Siegel RL, Soerjomataram I, et al. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. (2024) 74:229–63. doi: 10.3322/caac.21834

PubMed Abstract | Crossref Full Text | Google Scholar

2. Meldolesi E, Chiloiro G, Giannini R, Menghi R, Persiani R, Corvari B, et al. The role of simultaneous integrated boost in locally advanced rectal cancer patients with positive lateral pelvic lymph nodes. Cancers (Basel). (2022) 14(7):1643. doi: 10.3390/cancers14071643

PubMed Abstract | Crossref Full Text | Google Scholar

3. Benson AB, Venook AP, Al-Hawary MM, Azad N, Chen YJ, Ciombor KK, et al. Rectal cancer, version 2.2022, NCCN clinical practice guidelines in oncology. J Natl Compr Canc Netw. (2022) 20:1139–67. doi: 10.6004/jnccn.2022.0051

PubMed Abstract | Crossref Full Text | Google Scholar

4. Schrag D. Preoperative treatment of locally advanced rectal cancer. Reply. N Engl J Med. (2023) 389:1631–2. doi: 10.1056/NEJMc2309857

PubMed Abstract | Crossref Full Text | Google Scholar

5. Jia X, Zhang Y, Wang Y, Feng C, Shen D, Ye Y, et al. MRI for restaging locally advanced rectal cancer: detailed analysis of discrepancies with the pathologic reference standard. AJR Am J Roentgenol. (2019) 213:1081–90. doi: 10.2214/ajr.19.21383

PubMed Abstract | Crossref Full Text | Google Scholar

6. Bates DDB, Homsi ME, Chang KJ, Lalwani N, Horvat N, and Sheedy SP. MRI for rectal cancer: staging, mrCRM, EMVI, lymph node staging and post-treatment response. Clin Colorectal Cancer. (2022) 21:10–8. doi: 10.1016/j.clcc.2021.10.007

PubMed Abstract | Crossref Full Text | Google Scholar

7. Yamaoka Y, Kinugasa Y, Shiomi A, Yamaguchi T, Kagawa H, Yamakawa Y, et al. The distribution of lymph node metastases and their size in colon cancer. Langenbecks Arch Surg. (2017) 402:1213–21. doi: 10.1007/s00423-017-1628-z

PubMed Abstract | Crossref Full Text | Google Scholar

8. Chen L, Shen F, Li Z, Lu H, Chen Y, Wang Z, et al. Diffusion-weighted imaging of rectal cancer on repeatability and cancer characterization: an effect of b-value distribution study. Cancer Imaging. (2018) 18:43. doi: 10.1186/s40644-018-0177-1

PubMed Abstract | Crossref Full Text | Google Scholar

9. Ge YX, Hu SD, Wang Z, Guan RP, Zhou XY, Gao QZ, et al. Feasibility and reproducibility of T2 mapping and DWI for identifying Malignant lymph nodes in rectal cancer. Eur Radiol. (2021) 31:3347–54. doi: 10.1007/s00330-020-07359-7

PubMed Abstract | Crossref Full Text | Google Scholar

10. Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, van Stiphout RG, Granton P, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer. (2012) 48:441–6. doi: 10.1016/j.ejca.2011.11.036

PubMed Abstract | Crossref Full Text | Google Scholar

11. Yang YS, Feng F, Qiu YJ, Zheng GH, Ge YQ, and Wang YT. High-resolution MRI-based radiomics analysis to predict lymph node metastasis and tumor deposits respectively in rectal cancer. Abdom Radiol (NY). (2021) 46:873–84. doi: 10.1007/s00261-020-02733-x

PubMed Abstract | Crossref Full Text | Google Scholar

12. Li J, Zhou Y, Wang X, Zhou M, Chen X, and Luan K. An MRI-based multi-objective radiomics model predicts lymph node status in patients with rectal cancer. Abdom Radiol (NY). (2021) 46:1816–24. doi: 10.1007/s00261-020-02863-2

PubMed Abstract | Crossref Full Text | Google Scholar

13. Lakemeyer L, Sander S, Wittau M, Henne-Bruns D, Kornmann M, and Lemke J. Diagnostic and prognostic value of CEA and CA19–9 in colorectal cancer. Diseases. (2021) 9(1):21. doi: 10.3390/diseases9010021

PubMed Abstract | Crossref Full Text | Google Scholar

14. Amin MB, Greene FL, Edge SB, Compton CC, Gershenwald JE, Brookland RK, et al. The Eighth Edition AJCC Cancer Staging Manual: Continuing to build a bridge from a population-based to a more “personalized” approach to cancer staging. CA Cancer J Clin. (2017) 67:93–9. doi: 10.3322/caac.21388

PubMed Abstract | Crossref Full Text | Google Scholar

15. Taylor FG, Quirke P, Heald RJ, Moran BJ, Blomqvist L, Swift IR, et al. Preoperative magnetic resonance imaging assessment of circumferential resection margin predicts disease-free survival and local recurrence: 5-year follow-up results of the MERCURY study. J Clin Oncol. (2014) 32:34–43. doi: 10.1200/jco.2012.45.3258

PubMed Abstract | Crossref Full Text | Google Scholar

16. Jhaveri KS, Hosseini-Nik H, Thipphavong S, Assarzadegan N, Menezes RJ, Kennedy ED, et al. MRI detection of extramural venous invasion in rectal cancer: correlation with histopathology using elastin stain. AJR Am J Roentgenol. (2016) 206:747–55. doi: 10.2214/ajr.15.15568

PubMed Abstract | Crossref Full Text | Google Scholar

17. Horvat N, Carlos Tavares Rocha C, Clemente Oliveira B, Petkovska I, and Gollub MJ. MRI of rectal cancer: tumor staging, imaging techniques, and management. Radiographics. (2019) 39:367–87. doi: 10.1148/rg.2019180114

PubMed Abstract | Crossref Full Text | Google Scholar

18. Cervantes A, Adam R, Roselló S, Arnold D, Normanno N, Taïeb J, et al. Metastatic colorectal cancer: ESMO Clinical Practice Guideline for diagnosis, treatment and follow-up. Ann Oncol. (2023) 34:10–32. doi: 10.1016/j.annonc.2022.10.003

PubMed Abstract | Crossref Full Text | Google Scholar

19. Gröne J, Loch FN, Taupitz M, Schmidt C, and Kreis ME. Accuracy of various lymph node staging criteria in rectal cancer with magnetic resonance imaging. J Gastrointest Surg. (2018) 22:146–53. doi: 10.1007/s11605-017-3568-x

PubMed Abstract | Crossref Full Text | Google Scholar

20. Dong X, Ren G, Chen Y, Yong H, Zhang T, Yin Q, et al. Effects of MRI radiomics combined with clinical data in evaluating lymph node metastasis in mrT1-3a staging rectal cancer. Front Oncol. (2023) 13:1194120. doi: 10.3389/fonc.2023.1194120

PubMed Abstract | Crossref Full Text | Google Scholar

21. Pinker K, Chin J, Melsaether AN, Morris EA, and Moy L. Precision medicine and radiogenomics in breast cancer: new approaches toward diagnosis and treatment. Radiology. (2018) 287:732–47. doi: 10.1148/radiol.2018172171

PubMed Abstract | Crossref Full Text | Google Scholar

22. Ulrich EJ, Menda Y, Boles Ponto LL, Anderson CM, Smith BJ, Sunderland JJ, et al. FLT PET radiomics for response prediction to chemoradiation therapy in head and neck squamous cell cancer. Tomography. (2019) 5:161–9. doi: 10.18383/j.tom.2018.00038

PubMed Abstract | Crossref Full Text | Google Scholar

23. Li C and Yin J. Radiomics based on T2-weighted imaging and apparent diffusion coefficient images for preoperative evaluation of lymph node metastasis in rectal cancer patients. Front Oncol. (2021) 11:671354. doi: 10.3389/fonc.2021.671354

PubMed Abstract | Crossref Full Text | Google Scholar

24. Han Y, Wang W, Yang Y, Sun YZ, Xiao G, Tian Q, et al. Amide proton transfer imaging in predicting isocitrate dehydrogenase 1 mutation status of grade II/III gliomas based on support vector machine. Front Neurosci. (2020) 14:144. doi: 10.3389/fnins.2020.00144

PubMed Abstract | Crossref Full Text | Google Scholar

25. Niu Y, Yu X, Wen L, Bi F, Jian L, Liu S, et al. Comparison of preoperative CT- and MRI-based multiparametric radiomics in the prediction of lymph node metastasis in rectal cancer. Front Oncol. (2023) 13:1230698. doi: 10.3389/fonc.2023.1230698

PubMed Abstract | Crossref Full Text | Google Scholar

26. Gillies RJ, Kinahan PE, and Hricak H. Radiomics: images are more than pictures, they are data. Radiology. (2016) 278:563–77. doi: 10.1148/radiol.2015151169

PubMed Abstract | Crossref Full Text | Google Scholar

27. Li M, Sun K, Dai W, Xiang W, Zhang Z, Zhang R, et al. Preoperative prediction of peritoneal metastasis in colorectal cancer using a clinical-radiomics model. Eur J Radiol. (2020) 132:109326. doi: 10.1016/j.ejrad.2020.109326

PubMed Abstract | Crossref Full Text | Google Scholar

28. Li Y, Eresen A, Lu Y, Yang J, Shangguan J, Velichko Y, et al. Radiomics signature for the preoperative assessment of stage in advanced colon cancer. Am J Cancer Res. (2019) 9:1429–38.

Google Scholar

29. Tang VH, Duong STM, Nguyen CDT, Huynh TM, Duc VT, Phan C, et al. Wavelet radiomics features from multiphase CT images for screening hepatocellular carcinoma: analysis and comparison. Sci Rep. (2023) 13:19559. doi: 10.1038/s41598-023-46695-8

PubMed Abstract | Crossref Full Text | Google Scholar

30. Ma X, Shen F, Jia Y, Xia Y, Li Q, and Lu J. MRI-based radiomics of rectal cancer: preoperative assessment of the pathological features. BMC Med Imaging. (2019) 19:86. doi: 10.1186/s12880-019-0392-7

PubMed Abstract | Crossref Full Text | Google Scholar

31. He B, Ji T, Zhang H, Zhu Y, Shu R, Zhao W, et al. MRI-based radiomics signature for tumor grading of rectal carcinoma using random forest model. J Cell Physiol. (2019) 234:20501–9. doi: 10.1002/jcp.28650

PubMed Abstract | Crossref Full Text | Google Scholar

32. Yan H, Yang H, Jiang P, Dong L, Zhang Z, Zhou Y, et al. A radiomics model based on T2WI and clinical indexes for prediction of lateral lymph node metastasis in rectal cancer. Asian J Surg. (2024) 47:450–8. doi: 10.1016/j.asjsur.2023.09.156

PubMed Abstract | Crossref Full Text | Google Scholar

33. Samala RK, Drukker K, Shukla-Dave A, Chan HP, Sahiner B, Petrick N, et al. AI and machine learning in medical imaging: key points from development to translation. BJR Artif Intell. (2024) 1:ubae006. doi: 10.1093/bjrai/ubae006

PubMed Abstract | Crossref Full Text | Google Scholar

34. Wu Q, Wang S, Chen X, Wang Y, Dong L, Liu Z, et al. Radiomics analysis of magnetic resonance imaging improves diagnostic performance of lymph node metastasis in patients with cervical cancer. Radiother Oncol. (2019) 138:141–8. doi: 10.1016/j.radonc.2019.04.035

PubMed Abstract | Crossref Full Text | Google Scholar

35. Wei Q, Yuan W, Jia Z, Chen J, Li L, Yan Z, et al. Preoperative MR radiomics based on high-resolution T2-weighted images and amide proton transfer-weighted imaging for predicting lymph node metastasis in rectal adenocarcinoma. Abdom Radiol (NY). (2023) 48:458–70. doi: 10.1007/s00261-022-03731-x

PubMed Abstract | Crossref Full Text | Google Scholar

36. Cui Y, Yang X, Shi Z, Yang Z, Du X, Zhao Z, et al. Radiomics analysis of multiparametric MRI for prediction of pathological complete response to neoadjuvant chemoradiotherapy in locally advanced rectal cancer. Eur Radiol. (2019) 29:1211–20. doi: 10.1007/s00330-018-5683-9

PubMed Abstract | Crossref Full Text | Google Scholar

37. Qu X, Zhang L, Ji W, Lin J, and Wang G. Preoperative prediction of tumor budding in rectal cancer using multiple machine learning algorithms based on MRI T2WI radiomics. Front Oncol. (2023) 13:1267838. doi: 10.3389/fonc.2023.1267838

PubMed Abstract | Crossref Full Text | Google Scholar

38. Yimit Y, Yasin P, Tuersun A, Abulizi A, Jia W, Wang Y, et al. Differentiation between cerebral alveolar echinococcosis and brain metastases with radiomics combined machine learning approach. Eur J Med Res. (2023) 28:577. doi: 10.1186/s40001-023-01550-4

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: lymph node staging, machine learning, MRI, radiomics, rectal cancer

Citation: Qubie X, Chen W, Chen J, Ma J, Wei X, Gu X, Zhang W and He X (2025) Development and validation of machine learning-based MRI radiomics models for preoperative lymph node staging in T3 rectal cancer. Front. Oncol. 15:1610892. doi: 10.3389/fonc.2025.1610892

Received: 13 April 2025; Accepted: 26 August 2025;
Published: 08 September 2025.

Edited by:

Sunitha B Thakur, Memorial Sloan Kettering Cancer Center, United States

Reviewed by:

Rui Li, Affiliated Hospital of North Sichuan Medical College, China
Mladen Marinkovic, Institute of Oncology and Radiology of Serbia, Serbia

Copyright © 2025 Qubie, Chen, Chen, Ma, Wei, Gu, Zhang and He. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xiaojing He, aGVfeGlhb2ppbmdAaG9zcGl0YWwuY3FtdS5lZHUuY24=; Wei Zhang, d2lsbF96aGFuZ18xMTExQDE2My5jb20=

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.