Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Radiol.

Sec. Artificial Intelligence in Radiology

This article is part of the Research TopicEfficient AI for Radiology Imaging AnalysisView all articles

A Vision Transformer-Radiomics Approach for Enhanced Chemotherapy Outcome Prediction in Ovarian Cancer

Provisionally accepted
Neman  AbdoliNeman Abdoli1Patrik  GilleyPatrik Gilley1Ke  ZhangKe Zhang1Youkabed  SadriYoukabed Sadri1Theresa  ThaiTheresa Thai2Yong  ChenYong Chen2Lauren  DockeryLauren Dockery2Kathleen  MooreKathleen Moore2Robert  MannelRobert Mannel2Yuchen  QiuYuchen Qiu1*
  • 1The University of Oklahoma, Norman, United States
  • 2The University of Oklahoma Health Sciences, Oklahoma City, United States

The final, formatted version of the article will be published soon.

Early prediction of chemotherapy response in ovarian cancer patients is essential for enabling personalized treatment strategies and improving clinical outcomes. However, this prediction remains challenging due to the high heterogeneity of tumor biology, patient-specific factors, and treatment regimens. Recent advances in imaging biomarkers derived from both radiomics and advanced deep learning methods offer promising tools for characterizing tumor phenotypes and predicting treatment outcomes. In this study, we present a predictive methodology that integrates handcrafted radiomics features and transformer-based embeddings. The deep features come from two models: a pretrained vision transformer (ViT) and MedSAM, a foundation model adapted for medical image segmentation. We retrospectively collected CT scans from 182 ovarian cancer patients acquired prior to chemotherapy. From these images, we extracted three groups of features: radiomics descriptors, ViT-based embeddings, and MedSAM-based embeddings. These features were standardized and then processed using the least absolute shrinkage and selection operator (LASSO) regression for optimal feature selection. Support vector machines (SVMs) were used to train classification models for predicting 6-month progression-free survival (PFS). Our results show that the model combining ViT and MedSAM image embeddings achieved the highest AUC of 0.924 ± 0.032. The model integrating all three feature sets attained a comparable AUC of 0.924 ± 0.037 but reached the highest classification accuracy of 0.831 ± 0.042. These findings highlight the effectiveness of combining single-modality multi-method imaging biomarkers and demonstrate the potential of advanced models to generate rich, task-specific tumor representations from CT scans in support of precision oncology.

Keywords: chemotherapy response prediction, Foundation models, ovarian cancer, Radiomics, Transfer Learning, vision transformers

Received: 10 Sep 2025; Accepted: 16 Feb 2026.

Copyright: © 2026 Abdoli, Gilley, Zhang, Sadri, Thai, Chen, Dockery, Moore, Mannel and Qiu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Yuchen Qiu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.