Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Oncol.

Sec. Cancer Imaging and Image-directed Interventions

Volume 15 - 2025 | doi: 10.3389/fonc.2025.1657604

Development and Validation of a Deep Learning Model Using MR Imaging for Predicting Brain Metastases: An Accuracy-Focused Study

Provisionally accepted
Dan  ShiDan Shi1Meng  YangMeng Yang2Min  DongMin Dong3Ning  XuanNing Xuan4Yinsu  ZhuYinsu Zhu1Xiaoqiong  LvXiaoqiong Lv1Chao  XieChao Xie1Fei  XiaFei Xia1Lingchun  XuLingchun Xu1Qinglei  ZhangQinglei Zhang2Na  YinNa Yin1*
  • 1Jiangsu Cancer Hospital, Nanjing Medical University, Nanjing, China
  • 2Nanjing University Medical School Affiliated Nanjing Drum Tower Hospital Department of Medical Imaging, Nanjing, China
  • 3Oncology Department, Danyang People's Hospital of Jiangsu Province, Danyang, China
  • 4Columbia University, New York, United States

The final, formatted version of the article will be published soon.

Background: Brain metastases (BM), originating from extracranial malignancies, significantly threaten patient health. Accurate BM identification is crucial but labor-intensive manually. This study developed and validated a system for BM diagnosis, assessing its performance and stability Methods:470 patients diagnosed with BM were divided into an 80% training set (n=379) and a 20% internal test set (n=91) using systematic sampling. An additional 172 patients were retrospectively enrolled for external validation. A comprehensive preprocessing pipeline was implemented. We developed a 3D U-Net model with a ResNet-34 backbone for BM prediction. MRI scans were resampled to 0.833 mm³ isotropic voxels, underwent skull stripping using SynthStrip, and were intensity-normalized via Z-score normalization. The model was trained on MRI scans paired with segmentation masks, utilizing ImageNet-pretrained encoder weights and a patch-based strategy (128×128×128 voxels). Results: The model maintained perfect specificity and AUCs across gender and age groups, with no significant differences in other metrics, confirming false positive exclusion unaffected by demographics. By cancer type: Internal testing showed significant difference of AUC (p<0.001) between lung cancer (n=74) and other cancers (n=17). The differences of other performance metrics were not statistically significant (p>0.13), though other cancers showed higher median F1/IoU/MCC. External validation showed other cancers (n=79) had significantly higher precision than lung cancer (n=93) (p<0.05). Lung cancer AUC (0.82) was significantly lower than other cancers (0.89) (p<0.001), suggesting need for sensitivity optimization; both maintained specificity=1.0000. Model time was significantly shorter than manual annotation (internal: 69s vs 113s; external: 66s vs 96s; both p<0.001), with high agreement. Conclusion: The model demonstrated strong robustness and perfect specificity across demographics. While showing cancer type dependency (requiring improved lung cancer sensitivity), its high efficiency (40%-50% time reduction) and generalization provide a solid foundation for clinical translation.

Keywords: brain metastases, deep learning, artificial intelligence, Diagnostic accuracy, Magneticresonance imaging

Received: 01 Jul 2025; Accepted: 01 Sep 2025.

Copyright: © 2025 Shi, Yang, Dong, Xuan, Zhu, Lv, Xie, Xia, Xu, Zhang and Yin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Na Yin, Jiangsu Cancer Hospital, Nanjing Medical University, Nanjing, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.