- 1Department of Urology, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, Jiangsu, China
- 2Department of Andrology, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, China
- 3Institute of Urology, Nanjing University, Nanjing, China
- 4Department of Radiology, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, China
Background: Human Epidermal Growth Factor Receptor 2 (HER2), a component of the epidermal growth factor receptor family, is thought to be related to advanced prostate cancer (PCa) when overexpressed. Currently, most research on HER2 is limited to molecular pathology, with relatively few studies focused on imaging aspects.
Objectives: To develop a predictive model by extracting high-throughput radiomics features from magnetic resonance imaging and combining them with clinical characteristics for predicting HER2 overexpression.
Materials and methods: A total of 201 patients who underwent radical prostatectomy and HER2 immunohistochemistry were retrospectively enrolled. These patients were randomly divided into a training set (n=160) and a test set (n=41). Multimodal radiomics features extracted from T2-weighted imaging (T2WI) and apparent diffusion coefficient maps (ADC) were selected using Mann-Whitney U test and least absolute shrinkage and selection operator (LASSO) with ten-fold cross-validation. Predictive models were developed and evaluated based on discrimination and clinical utility.
Results: The combined model integrating ISUP Grade, PSA and Radscore achieved an area under the curve (AUC) of 0.841 (95% CI: 0.697-0.955) in the test set, significantly outperforming the clinical model (AUC = 0.580; p = 0.02, DeLong test) and demonstrating a modest improvement over the Radscore model (AUC = 0.838 (0.693-0.951). Evaluation results showed consistent discriminatory power: 0.78 accuracy, 0.77 sensitivity, and 0.79 specificity, indicating well-balanced performance between positive and negative classes. Decision curve analysis and Waterfall plot demonstrated strong clinical applicability.
Conclusion: The combined model effectively predicts HER2 overexpression in prostate cancer, with potential to inform more personalized treatment strategies for HER2-overexpressing PCa patients.
Introduction
Prostate cancer (PCa) remains one of the most common malignancies in men globally, presenting a substantial public health challenge due to its highly variable clinical behavior and outcomes (1). Although many cases follow an indolent course, a clinically significant subset—characterized by high-grade disease, molecular alterations, or advanced staging—is associated with elevated risks of disease progression, therapeutic resistance, and mortality (2, 3). Among the emerging molecular drivers of aggressive prostate cancer, human epidermal growth factor receptor 2 (HER2) has gained increasing attention for its established roles in tumorigenesis, metastasis, and treatment resistance, as evidenced in cancers such as breast and gastric malignancies (4–6).
In prostate cancer, HER2 overexpression activates key signaling pathways, including PI3K/AKT and MAPK/ERK, which promote tumor cell proliferation, survival, and resistance to androgen deprivation therapy (ADT) (7, 8). Accumulating clinical evidence further associates HER2 overexpression with adverse prognostic outcomes, including shortened progression-free and overall survival, particularly in patients with castration-resistant prostate cancer (CRPC) (9–11). Current assessment of HER2 status relies primarily on immunohistochemistry (IHC)—an invasive, time-consuming approach constrained by tissue heterogeneity, sampling variability, and interobserver disagreement (12, 13). These limitations highlight the pressing need for reproducible, non-invasive methods for HER2 phenotyping.
Recent advances in radiomics and artificial intelligence (AI) have opened promising avenues for non-invasive biomarker discovery. As demonstrated by Piras et al., AI-integrated predictive models are increasingly being applied in prostate cancer to optimize treatment personalization, such as predicting radiotherapy-related toxicity using dosimetric and imaging data (14). Similarly, Sekhoacha et al. and Varaprasad et al. have outlined the expanding role of genetic profiling and novel diagnostic modalities—including serum, urine, and tissue biomarkers—in refining risk stratification and therapeutic decision-making for prostate cancer patients (13, 15). These developments reflect a broader shift toward precision oncology, in which imaging-derived biomarkers can effectively complement molecular assays. In this context, HER2 has re-emerged as an actionable therapeutic target. The recent U.S. FDA approval of trastuzumab deruxtecan (T-DXd) for HER2-positive solid tumors represents a pivotal advancement, creating new opportunities for targeted therapy in HER2-overexpressing prostate cancer (16–19). Despite this therapeutic potential, the non-invasive detection of HER2 status via radiomics remains underexplored. Therefore, this study aims to develop and validate an interpretable machine learning model for the non-invasive prediction of HER2 overexpression in prostate cancer by integrating radiomic features derived from biparametric MRI with clinical variables. Our work seeks to contribute to the growing paradigm of image-based molecular phenotyping and support personalized treatment planning in oncology.
Materials and methods
Study design and participants
This study enrolled patients who underwent radical prostatectomy and had HER2 immunohistochemistry performed on the resected specimens between January 1, 2018, and July 31, 2024, at our hospital. All preoperative clinical and imaging data for the patients were collected within three months. Patients who had undergone other treatment modalities were selectively excluded. The pathological results are assessed based on the immunohistochemical findings of the postoperative radical prostatectomy specimen. The detailed inclusion and exclusion criteria are listed as follows (Figure 1). Then, 201 selected patients were randomly divided into two groups: a training set (n=160) and a test set (n=41). To prevent data imbalance, the random allocation ensured consistency in the ratio of positive to negative cases between the training and test set.
Inclusion criteria
Patients diagnosed with prostate cancer and underwent radical prostatectomy.
Immunohistochemical analysis for HER2 was performed on the resected specimens.
Prostate MR images and clinical indicators obtained within three months prior to surgery.
Exclusion criteria
Patients who received endocrine therapy or surgical treatment before the MRI examination.
Volume of interest (VOI) was too small to extract viable features from the MR image.
Patients with incomplete clinical or imaging data.
Preoperative clinical parameters
Comprehensive clinical data were prospectively collected prior to surgical intervention. The following parameters were included:
● Age: At the time of surgery.
● Prostate-Specific Antigen (PSA): Serum level measured within one month before surgery.
● Prostate Volume: Determined by transrectal ultrasound.
● PSA Density (PSAD): Calculated as PSA divided by prostate volume.
● Lesion Dimension On MRI: Maximum tumor diameter on preoperative MRI.
● Tumor Focality: Assessed on MRI as unifocal or multifocal.
● Positive Biopsy Cores/Total Cores: Number of positive cores divided by the total number of cores.
● Biopsy ISUP Grade Group: Based on preoperative biopsy pathology.
● PI-RADS Score (v2.1): Assessed by an expert radiologist with over 10 years of experience.
Postoperative pathological assessment
Pathological evaluation was performed on whole-mount sections of radical prostatectomy specimens. The following key parameters were recorded:
● ISUP Grade Group (RP): The final grade group assigned after surgical pathology review.
● Pathological T Stage (pT Stage): Extent of primary tumor invasion.
● Pathological N Stage (pN Stage): Lymph node involvement status.
HER2 overexpression evaluation
Currently there is no universally accepted international standard for determining HER2 overexpression in prostate cancer. Based on existing IHC criteria and reports from relevant literature, only 20-25% of prostate cancer cases demonstrate moderate HER2 expression (2+), while strong expression (3+) is observed in approximately 5% of cases. A majority (70%-75%) of prostate cancers lack or have low expression of HER2. In normal prostate tissue, HER2 expression is detected in fewer than 5% of cases (12, 13). To address this, we classified HER2 overexpression based on IHC results, defining cases with scores of 2+ to 3+ as overexpression, scores of 0 or 1+ were considered normal expression (no or low expression). All HER2 expression statuses were assessed using specific antibodies on postoperative pathological specimens (Figure 2).
Image segmentation and radiomics feature extraction
The non-enhanced and axial MRI images at 3.0T, with a thickness of 3mm, were acquired from two sequences (T2WI and ADC) for image segmentation and feature extraction, including a GE Signa HD, a GE 750 Discovery, and a Siemens Skyra, with the following shared parameters: slice thickness: 3 mm; pixel spacing: 0.49 mm; magnetic field strength: 3.0 T; and number of slices: 24. A radiologist and a urologist manually segmented the volume of interest (VOI) along the tumor contour on the axial MRI slices (Figure 3). Before feature extraction, all the mask images were normalized with a bin width of 25, followed by N4 bias field correction, and resampled to a pixel spacing of (3, 3, 3).
Figure 3. The figures above illustrate axial magnetic resonance imaging (T2WI, ADC) of two cases, along with their corresponding HE staining and immunohistochemical staining. Scanning electron microscopy (SEM) examination of pathological sections was performed at 100x magnification following standard conductive preparation (10mm-100um). Case1: An 81-year-old male presented with an initial PSA level of 6.08 ng/mL. Magnetic resonance imaging (MRI) revealed an abnormal signal in the left peripheral zone of the prostatic mid-gland, with a PI-RADS score of 3. Both preoperative needle biopsy and postoperative pathological sections indicated a Gleason score of 3 + 3 = 6. (a, b) Tumor region outlined for feature extraction. (c, d) Schematic representation of the postoperative pathological section. (e) Immunohistochemistry (IHC) showed HER2 (0). Case 2: A 79-year-old male had a PSA level of 8.77 ng/mL. MRI demonstrated an abnormal signal in the right peripheral zone of the prostatic mid-gland, with a PI-RADS score of 5. Both preoperative needle biopsy and postoperative pathological sections indicated a Gleason score of 4 + 5 = 9. (f, g) Tumor region outlined for feature extraction. (h–i) Schematic representation of the postoperative pathological section. (j) IHC results showed HER2 (3+).
Then, we got 1,694 high-throughput imaging features from each MRI sequence (847 from T2WI and 847 from ADC), including the original images, Gaussian Laplacian filtering (LoG), and wavelet transform (wavelet) images: shape-based features (n=10), first-order features (n=18), texture features (n=75), LoG and wavelet features (n=744). Texture features included gray-level co-occurrence matrix (GLCM), gray-level size zone matrix (GLSZM), gray-level run length matrix (GLRLM), neighboring gray tone difference matrix (NGTDM), and gray-level dependence matrix (GLDM) features. To ensure good consistency between measurements from different observers, we set an intraclass correlation coefficient (ICC) threshold of > 0.75 for feature filtering.
Supplements: For VOI segmentation, ITK-SNAP was used to mask the tumor area in each sequence. PyRadiomics v3.8 was employed to help radiomics feature extraction. The overall extraction process was conducted in accordance with the guidelines of the Image Biomarker Standardization Initiative.
Feature preprocessing and feature selection
To mitigate the influence of varying scales across imaging features, all such data were standardized using z-score normalization. Feature selection proceeded in a two-stage approach exclusively on the training cohort. Initially, univariate screening was conducted with the Mann–Whitney test (p < 0.05) to retain features exhibiting significant intergroup differences. These selected features were subsequently subjected to the least absolute shrinkage and selection operator (LASSO) regression with 10-fold cross-validation to further refine the feature set and mitigate multicollinearity. In parallel, univariate logistic regression was applied to clinical variables to identify those significantly associated with the outcome. This integrated process yielded a final set of features most predictive of HER2 overexpression from both imaging and clinical domains. Crucially, the test set remained entirely isolated from all steps of feature selection to prevent data leakage and ensure unbiased model evaluation.
Model construction and development
Three machine learning classifiers—Random Forest (RF), Support Vector Machine (SVM), and Logistic Regression (LR)—were implemented following feature selection to develop prediction models. Each model underwent training followed by evaluation on a held-out test set. Performance was quantified across multiple metrics: the area under the receiver operating characteristic curve (AUC), accuracy, precision, recall, F1-score, sensitivity, and specificity. The clinical utility of the models was further compared using decision curve analysis (DCA). Additionally, during the model training process on the training set, we employed the SMOTE function to balance the disparity between positive and negative samples, thereby enhancing the stability of the model.
Statistical analysis
Statistical analyses were conducted using SPSS version 27.0 and Python 3.0. In the analysis of baseline data comparison. For continuous variables, the Mann-Whitney U test was employed; for categorical variables, the Chi-square test or Fisher’s exact test was used, as appropriate. Feature selection and modeling are primarily handled by Python.
Ethical considerations
This retrospective study was approved by the Hospital Ethics Committee. Given the retrospective nature of the study and the use of de-identified patient data, the requirement for informed consent was waived by the IRB. Informed consent was obtained from all participants prior to their inclusion in the study. Furthermore, anonymization was performed on all patient data. All procedures were conducted in accordance with the ethical standards of the institutional and national research committees, as well as the 1964 Helsinki Declaration and its later amendments, or other comparable ethical guidelines.
In summary, the comprehensive collection and rigorous assessment of clinical data in this study provide a robust foundation for analyzing the prognostic value of radiomics and clinical factors in predicting HER2 overexpression in prostate cancer. The methodology ensures high-quality data and reliable results, contributing to the advancement of precision medicine in prostate cancer management.
Results
Baseline characteristics
A total of 201 patients diagnosed with prostate cancer who underwent systematic biopsy after surgical treatment were included in the study. Immunohistochemistry results showed that 33.3% of specimens (n=66) were HER2 overexpression, while 66.7% of specimens (n=135) were HER2 no or low expression. Baseline characteristics of the total cohort are summarized in Table 1.
Twenty percent of the patients were randomly assigned to test set. In the training set (n=160), 53 (33.1%) patients were HER2 overexpression, and 107 (66.9%) patients were HER2 no or low expression. In the test set (n=41), 13 (31.7%) patients were HER2 overexpression, and 28 (68.3%) patients were HER2 no or low expression. Table 2 summarizes the baseline characteristics of patients in the training and test cohorts.
Feature selection and model construction
Radiomic feature stability was assessed using the intraclass correlation coefficient. Based on an ICC threshold of > 0.75, 1257 robust features were selected from the original 1694 for subsequent analysis, ensuring reproducibility. Feature selection was then performed on the training cohort in a two-stage approach. First, the Mann-Whitney U test (p < 0.05) was applied to identify features with significant differences between groups. Second, the LASSO regression with 10-fold cross-validation was employed to further reduce redundancy and select the most predictive features. This process resulted in the final selection of 23 non-redundant and informative features, as visualized in the selection pathway (Figure 4). A radiomics score (Radscore) was computed for each patient as a linear combination of the selected features weighted by their respective LASSO coefficients. The detailed calculation formula is provided in the Supplementary Materials. Finally, the Radscore was incorporated into a logistic regression model to predict HER2 overexpression. The model demonstrated strong discriminatory performance, achieving an AUC of 0.838 in the test set.
Figure 4. Radiomics feature screening engineering flowchart. (a) Lasso coefficient path plot. (b) Lasso path plot mean squared error plot. (c) Variable selection trace.
Concurrently, univariable logistic regression analysis was performed on preoperative clinical variables within the training set to identify potential predictors, using a liberal significance threshold of p < 0.10 to avoid overlooking marginally significant features (Table 3). Two clinical variables were subsequently retained for model construction: PSA level and ISUP Grade Group (dichotomized as 3–5 versus 1-2). A clinical model based solely on these two predictors was developed using logistic regression, yielding an area under the curve (AUC) of 0.58 in the test set. To develop a more powerful predictive tool, a combined model was built by integrating the Radscore with the two selected clinical features. This multimodal dataset was used to train and evaluate three distinct machine learning classifiers: Random Forest, Support Vector Machine, and Logistic Regression. The performance of each classifier was rigorously assessed to identify the optimal integrated model for predicting HER2 overexpression.
Model performance and observations
In this study, the Logistic Regression-based combined model (Combined LR) demonstrated superior discriminative performance, achieving the highest AUC of 0.841 (95% CI: 0.697–0.955) alongside an accuracy of 0.780, a sensitivity of 0.769, and a specificity of 0.786 on the test set. Its well-balanced performance across sensitivity and specificity underscores its robustness as a classifier. In contrast, the Clinical model exhibited limited predictive utility, with an AUC of only 0.580 (95% CI: 0.382–0.773), significantly lower than all other models. The Radscore model alone showed strong performance with an AUC of 0.838 (95% CI: 0.693–0.951), markedly outperforming the Clinical model (ΔAUC = 0.258), which highlights the substantial value of radiomic features beyond conventional clinical variables (Figure 5). Integration of clinical features with the Rad-score further enhanced predictive ability, as evidenced by the Combined LR model’s slight but meaningful improvement over the Radscore-only model (ΔAUC = 0.003). This suggests complementary information between imaging-derived and clinical predictors. Table 4 summarizes the performance metrics for all models developed in this study.
Figure 5. (A) Summary graph of AUC curves for the clinical model, radscore model, and combined model constructed by logistic regression on the test set. (B) Waterfall plot visualization of the combined model on the training and test set, with green boxes representing no or low HER2 expression and yellow boxes representing HER2 overexpression.
Among the other combined models, the SVM classifier achieved competitive accuracy and specificity but did not surpass the LR model in overall discriminative power (AUC = 0.764). The RF-based combined model yielded more moderate results (AUC = 0.723), indicating possible underfitting or suboptimal hyperparameter tuning (Figure 6). To minimize the variability associated with random data partitioning, we conducted 100 iterations of repeated random sub-sampling validation (Monte Carlo cross-validation). The average performance metrics for the final model over the 1000 random 80/20 splits are as follows: Average AUC: 0.84 ± 0.03; Average Accuracy: 0.77 ± 0.04; Average Sensitivity: 0.60 ± 0.13; Average Specificity: 0.85 ± 0.07. The distribution of the AUC values is presented in Supplementary Figure S1, and the results (mean ± standard deviation) are reported in Supplementary Table T1. These results confirm that the predictive performance of our modeling strategy is stable and not dependent on a single, potentially favorable data split. In summary, the Combined LR model emerged as the optimal predictor in this task, successfully integrating radiomic and clinical information to achieve the highest diagnostic accuracy. Future work will focus on external validation and interpretability-driven analysis to elucidate feature contributions and facilitate clinical translation.
Figure 6. Validation of predictive models using an independent test cohort. The performance of Random Forest (RF), Support Vector Machine (SVM), and Logistic Regression (LR) is shown. (a) Receiver Operating Characteristic (ROC) curves with Area Under the Curve (AUC) values (with 95% confidence intervals estimated via bootstrap). (b) Decision curve analysis (DCA) comparing the net benefit of the models against default strategies. (c–d) Calibration curves for the training and test sets, demonstrating model generalizability.
Discussion
Our study validates the feasibility of predicting HER2 overexpression in prostate cancer using radiomic features derived from bp-MRI, supporting the broader potential of imaging-based molecular phenotyping in oncology.
Current radiomic research in prostate cancer has primarily targeted the prediction of well-established biomarkers such as TP53 and Ki-67, in addition to conventional clinical endpoints like biochemical recurrence (BCR) and extracapsular extension (EPE), which have been extensively investigated (20, 21). For instance, Ruchuan Chen et al. utilized Random Forest and SVM algorithms to predict TP53 mutations based on peritumoral texture features, achieving AUCs of 0.84 and 0.83 in internal and external validation, respectively. In a similar vein, Xiaofeng Qiao et al. developed a machine learning model incorporating T2-weighted and ADC images to predict both Ki-67 expression and Gleason grade groups (GGG), with reported AUCs of 0.88 and 0.92—significantly outperforming a prior deep learning model for Ki-67 prediction (AUC = 0.75) (22–24). Based on the existing literature, radiomics approaches have demonstrated promising utility in discriminating HER2 expression status in breast cancer, as evidenced by studies from Ramtohul et al. and Zheng et al. (25, 26). These studies successfully developed multiparametric MRI-based radiomics models to differentiate HER2-zero, -low, and -positive breast cancers, achieving AUCs of 0.80 and 0.725-0.889 across different validation cohorts, respectively. Their methodologies, incorporating feature selection techniques such as LASSO regression, align with the analytical framework employed in our current study.
To the best of our knowledge, this study represents the first systematic effort to develop a radiomics-based model for predicting HER2 overexpression specifically in prostate cancer. Against this backdrop, our model demonstrates competitive performance, with an AUC of 0.841 in the test cohort, thereby filling a significant gap in the field of prostate cancer molecular imaging.
Beyond prostate-specific applications, deep learning approaches have demonstrated transformative potential in surgical oncology and molecular imaging. For instance, Sibilano et al. developed a semantic segmentation model for robot-assisted prostatectomy, illustrating how AI can enhance anatomic localization and procedural precision (27). In a related domain, Ali et al. showcased the utility of radiolabeled molecular imaging for rheumatoid arthritis, underscoring the cross-disciplinary potential of quantitative imaging biomarkers for disease characterization and treatment monitoring (28). These studies collectively affirm that imaging-based AI models can capture subtleties beyond human visual assessment. Nevertheless, while deep learning approaches have shown promise in certain domains, their performance in specific radiomic classification tasks—such as EPE prediction—has been modest. A recent meta-analysis reported an AUC of 0.72 for deep learning models compared to 0.82 for leading radiomic models in this context (29, 30).
The successful radiomic prediction of HER2 status holds particular clinical significance in the context of castration-resistant prostate cancer (CRPC), where HER2 overexpression has been implicated in disease progression and resistance to conventional androgen receptor pathway inhibitors. Accumulating evidence suggests that HER2 amplification and overexpression may activate alternative signaling cascades that bypass androgen dependency, thereby sustaining tumor growth in the castration-resistant setting. Our non-invasive prediction approach may thus facilitate improved patient stratification and help guide targeted therapeutic interventions—akin to the established role of HER2-directed therapy in breast cancer. The recent FDA approval of trastuzumab deruxtecan (T-DXd) for HER2-positive solid tumors further underscores the translational relevance of HER2 as a actionable target in advanced prostate cancer. Together with advances in predicting TP53 and Ki-67, our findings underscore the expanding utility of radiomics in elucidating tumor biology and argue for the integration of AI-derived imaging biomarkers into future clinical prognostication frameworks.
Limitations and future work
However, our study has several limitations that should be considered. First, HER2 overexpression remains relatively uncommon in prostate cancer and has not yet been integrated into routine clinical biomarker assessment. Further validation is required to establish its prognostic and predictive utility, which would enable more accurate risk stratification and personalized treatment selection. Second, this was a retrospective analysis with a limited sample size from a single institution. Future work should prioritize external validation using multi-institutional datasets acquired from different scanner vendors to evaluate model generalizability. Additionally, prospective study designs are needed to minimize selection bias and verify clinical applicability. Importantly, subsequent research should also investigate the correlation between imaging-based HER2 predictions and treatment response in castration-resistant prostate cancer (CRPC) patients receiving HER2-targeted therapies, which would help validate the clinical relevance of our radiomic biomarker. Through these efforts, we hope this line of research will ultimately contribute to more refined, image-guided personalized treatment strategies for prostate cancer patients.
Conclusion
In this study, we successfully developed and validated an integrated radiomics-clinical model for predicting HER2 overexpression in prostate cancer, demonstrating robust predictive performance. This approach provides a non-invasive framework for HER2 status assessment, paving the way for image-guided molecular phenotyping in oncology. By enabling pre-treatment identification of HER2-overexpressing subtypes, our model holds significant potential to inform personalized therapeutic strategies, particularly in the era of antibody-drug conjugates and other HER2-targeted agents. These findings underscore the evolving role of radiomics as a decision-support tool in precision oncology and highlight its capacity to help tailor treatment selection toward more individualized patient care.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by The full name of the committee: The Ethics Committee of Nanjing Drum Tower Hospital Affiliation: Nanjing Drum Tower Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
XH: Data curation, Investigation, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. ZJ: Data curation, Formal analysis, Writing – original draft, Writing – review & editing. XW: Data curation, Methodology, Software, Writing – review & editing. JG: Investigation, Resources, Software, Writing – review & editing. DL: Data curation, Methodology, Resources, Software, Writing – review & editing. QZ: Resources, Supervision, Validation, Writing – review & editing. XZ: Investigation, Methodology, Resources, Writing – review & editing. HG: Funding acquisition, Resources, Supervision, Validation, Writing – review & editing.
Funding
The author(s) declared that financial support was not received for this work and/or its publication.
Conflict of interest
The authors declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2025.1707946/full#supplementary-material
Supplementary Figure 1 | (A) Distribution of AUC across 100 random splits. (B) Box plot of AUC across 100 random splits.
Abbreviations
EPE, Extracapsular extension; GLCM, Gray level co-occurrence matrix; GLRLM, Gray level run length matrix; GLSZM, Gray level size zone matrix; GLDM, Gray level dependence matrix; ISUP, International Society of Urological Pathology Grade; LASSO, Least Absolute Shrinkage and Selection Operator; LoG, Gaussian Laplacian filtering; LR, Logistics regression; MRI, Magnetic Resonance Imaging; NCCN, National Comprehensive Cancer Network; NGTDM, Neighborhood gray tone difference matrix; OS, Overall survival; PCa, Prostate cancer; PFS, Progression-free survival; PI-RADS, Prostate Imaging Reporting and Data System; PSA, Prostate-specific antigen; PSAD, Prostate- specific antigen density; RP, Radical prostatectomy; SVM, Support Vector Machine; T-DXd, Trastuzumab deruxtecan; VOI, Volume of interest; Wavelet, Wavelet transform.
References
1. Litwin MS and Tan HJ. The diagnosis and treatment of prostate cancer: A review. JAMA. (2017) 317:2532–42. doi: 10.1001/jama.2017.7248
2. Caffo O and Veccia A. Prostate cancer: screening and early detection. Asian J Androl. (2024) 26:559–61. doi: 10.4103/aja202417
3. Lange J, Remmers S, Gulati R, Bill-Axelson A, Johansson JE, Kwiatkowski M, et al. Impact of cancer screening on metastasis: A prostate cancer case study. J Med Screen. (2021) 28:480–7. doi: 10.1177/0969141321989738
4. Oh DY and Bang YJ. HER2-targeted therapies - a role beyond breast cancer. Nat Rev Clin Oncol. (2020) 17:33–48. doi: 10.1038/s41571-019-0268-3
5. Kumagai S, Koyama S, and Nishikawa H. Antitumour immunity regulated by aberrant ERBB family signalling. Nat Rev Cancer. (2021) 21:181–97. doi: 10.1038/s41568-020-00322-0
6. Friedlaender A, Subbiah V, Russo A, Banna GL, Malapelle U, Rolfo C, et al. EGFR and HER2 exon 20 insertions in solid tumours: from biology to treatment. Nat Rev Clin Oncol. (2022) 19:51–69. doi: 10.1038/s41571-021-00558-1
7. Estephan F, Lap CJ, Banagan J, Antonio M, Liu S, Diao G, et al. The prevalence and clinical significance of HER2 expression in prostate adenocarcinoma. Ann Diagn Pathol. (2023) 67:152219. doi: 10.1016/j.anndiagpath.2023.152219
8. Clairefond S, Ouellet V, Peant B, Barres V, Karakiewicz PI, Mes-Masson AM, et al. Expression of ERBB family members as predictive markers of prostate cancer progression and mortality. Cancers (Basel). (2021) 13:1688. doi: 10.3390/cancers13071688
9. Day KC, Lorenzatti Hiles G, Kozminsky M, Dawsey SJ, Paul A, Broses LJ, et al. HER2 and EGFR overexpression support metastatic progression of prostate cancer to bone. Cancer Res. (2017) 77:74–85. doi: 10.1158/0008-5472.CAN-16-1656
10. Maillet D, Allioli N, Peron J, Plesa A, Decaussin-Petrucci M, Tartas S, et al. Her2 expression in circulating tumor cells is associated with poor outcomes in patients with metastatic castration-resistant prostate cancer. Cancers (Basel). (2021) 13:6014. doi: 10.3390/cancers13236014
11. Minner S, Jessen B, Stiedenroth L, Burandt E, Kollermann J, Mirlacher M, et al. Low level HER2 overexpression is associated with rapid tumor cell proliferation and poor prognosis in prostate cancer. Clin Cancer Res. (2010) 16:1553–60. doi: 10.1158/1078-0432.CCR-09-2546
12. Sharifi N, Salmaninejad A, Ferdosi S, Bajestani AN, Khaleghiyan M, Estiar MA, et al. HER2 gene amplification in patients with prostate cancer: Evaluating a CISH-based method. Oncol Lett. (2016) 12:4651–8. doi: 10.3892/ol.2016.5235
13. Sekhoacha M, Riet K, Motloung P, Gumenku L, Adegoke A, and Mashele S. Prostate cancer review: genetics, diagnosis, treatment options, and alternative approaches. Molecules. (2022) 27:5730. doi: 10.3390/molecules27175730
14. Piras A, Corso R, Benfante V, Ali M, Laudicella R, Alongi P, et al. Artificial intelligence and statistical models for the prediction of radiotherapy toxicity in prostate cancer: A systematic review. Appl Sci. (2024) 14:10947. doi: 10.3390/app142310947
15. Varaprasad GL, Gupta VK, Prasad K, Kim E, Tej MB, Mohanty P, et al. Recent advances and future perspectives in the therapeutics of prostate cancer. Exp Hematol Oncol. (2023) 12:80. doi: 10.1186/s40164-023-00444-9
16. Lap CJ, Rajendran R, Martin JM, Uppal M, Escobar A, Heiraty AM, et al. Response of human epidermal growth factor receptor 2-expressing prostate cancer to trastuzumab deruxtecan. Ann Intern Med. (2024) 177:1738–41. doi: 10.7326/ANNALS-24-01409
17. Meric-Bernstam F, Makker V, Oaknin A, Oh DY, Banerjee S, Gonzalez-Martin A, et al. Efficacy and safety of trastuzumab deruxtecan in patients with HER2-expressing solid tumors: primary results from the DESTINY-panTumor02 phase II trial. J Clin Oncol. (2024) 42:47–58. doi: 10.1200/JCO.23.02005
18. Wilkinson S, Ku AT, Lis RT, King IM, Low D, Trostel SY, et al. Localized high-risk prostate cancer harbors an androgen receptor low subpopulation susceptible to HER2 inhibition. medRxiv. (2024) e189900 https. doi: 10.1101/2024.02.09.24302395
19. Dhritlahre RK and Saneja A. Recent advances in HER2-targeted delivery for cancer therapy. Drug Discov Today. (2021) 26:1319–29. doi: 10.1016/j.drudis.2020.12.014
20. Huynh LM, Bonebrake B, Tran J, Marasco JT, Ahlering TE, Wang S, et al. Multi-institutional development and validation of a radiomic model to predict prostate cancer recurrence following radical prostatectomy. J Clin Med. (2023) 12:7322. doi: 10.3390/jcm12237322
21. Dutta A, Chan J, Haworth A, Dubowitz DJ, Kneebone A, and Reynolds HM. Robustness of magnetic resonance imaging and positron emission tomography radiomic features in prostate cancer: Impact on recurrence prediction after radiation therapy. Phys Imaging Radiat Oncol. (2024) 29:100530. doi: 10.1016/j.phro.2023.100530
22. Qiao X, Gu X, Liu Y, Shu X, Ai G, Qian S, et al. MRI radiomics-based machine learning models for ki67 expression and gleason grade group prediction in prostate cancer. Cancers (Basel). (2023) 15:4536. doi: 10.3390/cancers15184536
23. Deng S, Ding J, Wang H, Mao G, Sun J, Hu J, et al. Deep learning-based radiomic nomograms for predicting Ki67 expression in prostate cancer. BMC Cancer. (2023) 23:638. doi: 10.1186/s12885-023-11130-8
24. Chen R, Zhou B, Liu W, Gan H, Liu X, and Zhou L. Association of pathological features and multiparametric MRI-based radiomics with TP53-mutated prostate cancer. J Magn Reson Imaging. (2024) 60:1134–45. doi: 10.1002/jmri.29186
25. Ramtohul T, Djerroudi L, Lissavalid E, Nhy C, Redon L, Ikni L, et al. Multiparametric MRI and radiomics for the prediction of HER2-zero, -low, and -positive breast cancers. Radiology. (2023) 308:e222646. doi: 10.1148/radiol.222646
26. Zheng S, Yang Z, Du G, Zhang Y, Jiang C, Xu T, et al. Discrimination between HER2-overexpressing, -low-expressing, and -zero-expressing statuses in breast cancer using multiparametric MRI-based radiomics. Eur Radiol. (2024) 34:6132–44. doi: 10.1007/s00330-024-10641-7
27. Sibilano E, Delprete C, Marvulli PM, Brunetti A, Marino F, Lucarelli G, et al. Deep learning strategies for semantic segmentation in robot-assisted radical prostatectomy. Appl Sci. (2025) 15:10665. doi: 10.3390/app151910665
28. Ali M, Benfante V, Di Raimondo D, Laudicella R, Tuttolomondo A, and Comelli A. A review of advances in molecular imaging of rheumatoid arthritis: from in vitro to clinic applications using radiolabeled targeting vectors with technetium-99m. Life (Basel). (2024) 14:751. doi: 10.3390/life14060751
29. van den Berg I, Soeterik TFW, van der Hoeven E, Claassen B, Brink WM, Baas DJH, et al. The development and external validation of artificial intelligence-driven MRI-based models to improve prediction of lesion-specific extraprostatic extension in patients with prostate cancer. Cancers (Basel). (2023) 15:5452. doi: 10.3390/cancers15225452
Keywords: HER2, machine learning, magnetic resonance imaging, prostate cancer, radiomics
Citation: Huang X, Jiang Z, Wang X, Gao J, Li D, Zhang Q, Zhao X and Guo H (2026) Predicting HER2 overexpression in prostate cancer using machine learning: implications for personalized therapy. Front. Oncol. 15:1707946. doi: 10.3389/fonc.2025.1707946
Received: 18 September 2025; Accepted: 17 December 2025; Revised: 29 November 2025;
Published: 13 January 2026.
Edited by:
Abhishek Mahajan, The Clatterbridge Cancer Centre, United KingdomReviewed by:
Muhammad Ali, Ri.MED Foundation, ItalyMarko Raseta, Helmholtz Association of German Research Centers (HZ), Germany
Copyright © 2026 Huang, Jiang, Wang, Gao, Li, Zhang, Zhao and Guo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Qing Zhang, ZHJ6aGFuZ3FAbmp1LmVkdS5jbg==; Xiaozhi Zhao, emhhb3h6QG5qdS5lZHUuY24=; Hongqian Guo, ZHIuZ2hxQG5qdS5lZHUuY24=
†These authors have contributed equally to this work and share first authorship
Zhen Jiang1,3†