ORIGINAL RESEARCH article
Front. Radiol.
Sec. Artificial Intelligence in Radiology
This article is part of the Research TopicEfficient AI for Radiology Imaging AnalysisView all articles
A Vision Transformer-Radiomics Approach for Enhanced Chemotherapy Outcome Prediction in Ovarian Cancer
Provisionally accepted- 1The University of Oklahoma, Norman, United States
- 2The University of Oklahoma Health Sciences, Oklahoma City, United States
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Early prediction of chemotherapy response in ovarian cancer patients is essential for enabling personalized treatment strategies and improving clinical outcomes. However, this prediction remains challenging due to the high heterogeneity of tumor biology, patient-specific factors, and treatment regimens. Recent advances in imaging biomarkers derived from both radiomics and advanced deep learning methods offer promising tools for characterizing tumor phenotypes and predicting treatment outcomes. In this study, we present a predictive methodology that integrates handcrafted radiomics features and transformer-based embeddings. The deep features come from two models: a pretrained vision transformer (ViT) and MedSAM, a foundation model adapted for medical image segmentation. We retrospectively collected CT scans from 182 ovarian cancer patients acquired prior to chemotherapy. From these images, we extracted three groups of features: radiomics descriptors, ViT-based embeddings, and MedSAM-based embeddings. These features were standardized and then processed using the least absolute shrinkage and selection operator (LASSO) regression for optimal feature selection. Support vector machines (SVMs) were used to train classification models for predicting 6-month progression-free survival (PFS). Our results show that the model combining ViT and MedSAM image embeddings achieved the highest AUC of 0.924 ± 0.032. The model integrating all three feature sets attained a comparable AUC of 0.924 ± 0.037 but reached the highest classification accuracy of 0.831 ± 0.042. These findings highlight the effectiveness of combining single-modality multi-method imaging biomarkers and demonstrate the potential of advanced models to generate rich, task-specific tumor representations from CT scans in support of precision oncology.
Keywords: chemotherapy response prediction, Foundation models, ovarian cancer, Radiomics, Transfer Learning, vision transformers
Received: 10 Sep 2025; Accepted: 16 Feb 2026.
Copyright: © 2026 Abdoli, Gilley, Zhang, Sadri, Thai, Chen, Dockery, Moore, Mannel and Qiu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Yuchen Qiu
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
