ORIGINAL RESEARCH article
Front. Oncol.
Sec. Gastrointestinal Cancers: Colorectal Cancer
This article is part of the Research TopicApplications and Advances of Artificial Intelligence in Medical Image Analysis: PET, SPECT/ CT, MRI, and Pathology ImagingView all 11 articles
Deep Learning Analysis of MRI to assess Rectal Cancer Treatment
Provisionally accepted- 1S-SPIRE Center, Department of Surgery, Stanford University, Stanford, United States
- 2Department of Radiology, Stanford University, Stanford, United States
- 3VA Palo Alto Health Care System, Palo Alto, United States
- 4Department of Radiation Oncology, Stanford University, Stanford, United States
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Introduction: Traditional neoadjuvant therapy for locally advanced rectal cancer (LARC) results in pathologic complete response (pCR) in approximately 15% of patients, supporting non-operative strategies for those with clinical complete response (cCR). The subjectivity and variability in MRI-based cCR assessments highlight the need for objective, quantitative tools. Objective: To develop deep learning models for automated rectal tumor segmentation on pre- and post-treatment MRIs, and to identify radiomic features differentiating cCR from non-cCR patients. Materials and Methods: We retrospectively analyzed pre- and post-treatment MRIs from 37 LARC patients enrolled in a Phase 2 TNT trial (NCT04380337). Rectal tumors were segmented on T2-weighted images by two data scientists, refined by a radiologist (reference standard), and independently segmented by a fellow. For pre-treatment segmentation, Model 1 (baseline; n=37) was trained on reference cases, then used to generate pseudo-labels for 81 additional cases. Model 2 (semi-supervised; n=118) was trained on the combined dataset. Model 3 (baseline; n=37) was trained on post-treatment cases. Radiomic features were extracted from post-treatment ADC maps, filtered by reproducibility (ICC ≥0.8) and redundancy (Spearman ρ≤0.95), then analyzed using unsupervised hierarchical clustering. Results: For pre-treatment segmentation, radiologist-fellow inter-rater agreement was DSC =0.748±0.092. Model 1 achieved mean DSC =0.682±0.254 versus the radiologist, significantly lower than inter-rater agreement. Model 2 improved performance to mean DSC =0.769±0.214 (mean gain =0.087; 12.8% relative improvement; p<0.001), slightly outperforming inter-rater agreement. For post-treatment segmentation, inter-rater agreement declined to mean DSC =0.362±0.256, while Model 3 achieved mean DSC =0.175±0.231 versus the radiologist, reflecting challenges from treatment-induced tissue changes affecting both automated models and human raters. Radiomic clustering revealed two distinct patient groups aligned with cCR and non-cCR status. Conclusion: This study demonstrates the feasibility of deep learning-based automated segmentation and radiomic profiling for differentiating treatment response in rectal cancer. Semi-supervised learning with pseudo-labeled data significantly improved segmentation performance, offering a practical approach to overcome limited annotations. Radiomic features warrant validation in larger multi-center studies for clinical translation.
Keywords: rectal cancer, MRI, clinical complete response, deep learning, segmentation, and nnU-Net
Received: 23 Jun 2025; Accepted: 18 Nov 2025.
Copyright: © 2025 Selby, Son, Sheth, Wagner, Pollom and Morris. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Heather M. Selby, selbyh@stanford.edu
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
