Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Oncol.

Sec. Genitourinary Oncology

Volume 15 - 2025 | doi: 10.3389/fonc.2025.1590396

This article is part of the Research TopicEnhancing Prostate Cancer Diagnosis: Biomarkers and Imaging for Improved Patient OutcomesView all 16 articles

Evaluation of a PSA and transrectal prostate ultrasound video-based machine learning model as a tool for prostate cancer diagnosis

Provisionally accepted
Yanhong  DuYanhong Du*Anli  ZhaoAnli ZhaoMaoliang  ZhangMaoliang ZhangXiaoyang  QiXiaoyang Qi*Zhengping  WangZhengping WangLiyan  HuLiyan Hu*
  • Department of Ultrasound, Affiliated Dongyang Hospital of Wenzhou Medical University (Dongyang People’s Hospital), Dongyang, Zhejiang, China

The final, formatted version of the article will be published soon.

Objective: To develop a machine learning-based model incorporating prostate-specific antigen (PSA) levels and prostate ultrasound video clips for diagnosing prostate cancer.The study enrolled 928 participants, of whom 429 had prostate cancer and 499 other non-prostate cancers. Univariate and multivariate analyses of serological indices were conducted to detect significant variables. From this cohort, 742 patients were randomly chosen for model validation, while the other 186 were employed to evaluate the accuracy and reliability of the model. Seven features were extracted from ultrasound video clips and combined with PSA and other clinical indicators.Predictive models were established using six machine learning algorithms and receiver operating characteristic (ROC) curves were used to determine the optimal model. SHapley Additive exPlanations (SHAP) was utilized to visualize feature importance in the best-performing model.All six of the evaluated machine learning models performed favorably, with area under the ROC curve (AUC) values in the test set ranging from 0.800 to 0.881. Of these models, the XGBoost model achieved the most promising performance, significantly surpassing that of the other models (P < 0.05). SHAP visualization revealed that PSA, prostatic volume(PV), age, wavelet.LHL.firstorder.Median, wavelet.HLH.glszm.ZoneEntropy, and original.shape.MinorAxisLength were the most influential features in the XGBoost model.The developed machine learning models demonstrated significant potential for prostate cancer diagnosis. Among them, the XGBoost model outperformed the others, highlighting its superior predictive capability.

Keywords: prostate cancer, PSA, machine learning, prostate ultrasound video, SHAP 1.Introduction

Received: 09 Mar 2025; Accepted: 19 Aug 2025.

Copyright: © 2025 Du, Zhao, Zhang, Qi, Wang and Hu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Yanhong Du, Department of Ultrasound, Affiliated Dongyang Hospital of Wenzhou Medical University (Dongyang People’s Hospital), Dongyang, Zhejiang, China
Xiaoyang Qi, Department of Ultrasound, Affiliated Dongyang Hospital of Wenzhou Medical University (Dongyang People’s Hospital), Dongyang, Zhejiang, China
Liyan Hu, Department of Ultrasound, Affiliated Dongyang Hospital of Wenzhou Medical University (Dongyang People’s Hospital), Dongyang, Zhejiang, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.