ORIGINAL RESEARCH article

Front. Oncol., 04 June 2025

Sec. Breast Cancer

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1578458

Preoperative prediction of HER2 expression and sentinel lymph node status in breast cancer using a mammography radiomics model

Ziqian Zhao1, Hongyi Yuan1, Xinyu Song2, Wen Liu3, Yanyan Chen1, Xiaoli Wang1, Chao Dong1*, Binlin Ma1*
  • 1The Clinical Medical Research Center of Breast and Thyroid Tumor in Xinjiang, Tumor Hospital Affiliated to Xinjiang Medical University, Urumqi, China
  • 2Department of Breast Surgery, First People’s Affiliated Hospital of Xinxiang Medical University, Xinxiang, China
  • 3Department of Artificial Intelligence and Smart Mining Engineering Technology Center, Xinjiang Institute of Engineering, Urumqi, China

Background: This study aimed to develop and validate radiomic features derived from mammography (MG) to differentiate between various HER2 expression types (HER2-positive, HER2-low, and HER2-zero) and to preoperatively assess sentinel lymph node (SLN) status in breast cancer.

Methods: A retrospective analysis was conducted using clinicopathological and imaging data from 838 female breast cancer patients diagnosed at the Affiliated Tumor Hospital of Xinjiang Medical University between January 2016 and September 2024. The patients were randomly divided into a training set (n=586) and a test set (n=252) in a 7:3 ratio. Multivariate logistic regression analysis identified independent clinical predictors. Tumor segmentation and radiomic feature extraction were performed on mammography images. The least absolute shrinkage and selection operator (LASSO) method was applied for feature selection, and the radiomics model was developed. Model performance was assessed using the area under the receiver operating characteristic curve (AUC), calibration curve, and decision curve analysis.

Results: There were no significant differences in clinicopathological factors and mammographic features between the training and test sets (P>0.05). Multivariate analysis identified ethnicity, lesion size, vascular tumor thrombus, clinical stage, tumor margin, and HER2 expression as independent predictors for SLN metastasis. Lesion size, PR expression, menopausal status, SLN metastasis, Ki67, CK5/6 expression, and calcification were independent predictors for HER2 expression. The SLN metastasis prediction model achieved AUCs of 0.84 in the training set and 0.83 in the test set. The HER2 expression model showed AUCs of 0.87 (positive), 0.82 (low), and 0.85 (zero) in the training set, and 0.84 (positive), 0.78 (low), and 0.84 (zero) in the test set.

Conclusion: Radiomic features based on mammography can effectively preoperatively predict SLN status and HER2 expression types in breast cancer, offering valuable insights for individualized treatment strategies.

1 Introduction

According to the 2024 Global Cancer Statistics, breast cancer remains the most commonly diagnosed malignancy among women, with an estimated 2.41 million new cases and 670,000 deaths worldwide. In China, approximately 420,000 new cases were reported, accounting for 18.2% of all female cancers, and the 5-year survival rate varies from 82% in early-stage to 28% in metastatic disease (1). Modern oncology integrates surgery, radiotherapy, and systemic therapies (e.g., targeted drugs and immunotherapies), yet challenges persist in balancing efficacy with invasiveness (2). Future directions emphasize minimally invasive diagnostics and precision medicine, as outlined in recent studies (3). Axillary lymph node metastasis is a key feature of breast cancer and strongly influences staging, diagnosis, treatment, and prognosis. Sentinel lymph node biopsy (SLNB) has gradually replaced axillary lymph node dissection (ALND) and has become the preferred method for clinical evaluation of patients with early breast cancer and clinically negative axillary lymph nodes (4). However, SLNB has limitations, including a high false-negative rate, procedural invasiveness, excessive lymph node resection, risk of complications, prolonged operative time, and elevated costs. It also depends on available medical resources and on operator skill, and some hospitals are not yet equipped to perform it (5). Consequently, there is a pressing demand for effective, non-invasive methods capable of predicting sentinel lymph node metastasis (SLNM) in breast cancer, which would help reduce surgical trauma and improve diagnostic accuracy.

As the concept of precision medicine continues to mature, breast cancer treatment is moving toward individualized, multidisciplinary intervention. Studies have shown that about 20% to 30% of breast cancer patients are positive for the human epidermal growth factor receptor 2 (HER2) gene (6, 7). Earlier targeted drugs were aimed only at HER2-positive breast cancer and had limited efficacy in HER2-negative patients. However, within the HER2-negative population, approximately 45% to 55% of patients have HER2-low expression (8, 9). Recent research (10, 11) shows that the advent of a new antibody-drug conjugate (ADC) has given HER2-low breast cancer patients new treatment opportunities in preoperative neoadjuvant treatment.

The early and accurate identification of HER2 gene status plays a pivotal role in implementing personalized treatment strategies and optimizing prognosis (12). Currently, HER2 status is assessed through surgery, biopsy, or genetic analysis, but these methods are limited by tissue sample size and tumor heterogeneity, leading to high false-negative rates and inconsistent results. Thus, establishing a simple, non-invasive strategy to assess both sentinel lymph node and HER2 status in breast cancer is essential for improving diagnostic accuracy and reducing surgical risks.

Breast imaging methods include ultrasound, mammography, MRI, and CT, each with its own advantages and disadvantages. Ultrasound assessment of axillary lymph nodes is convenient and cost-effective, but image quality depends largely on the experience of the operator, which may lead to fluctuations in accuracy and reliability. In contrast, mammography is highly standardized and less operator-dependent, so image quality is more stable (13). With the rapid development of artificial intelligence, many recent studies have used radiomic models to detect lesions, distinguish benign from malignant tumors, predict the molecular subtype of breast cancer, assess the risk of axillary lymph node metastasis, and predict prognosis, and have achieved high diagnostic efficiency (14, 15). At present, most radiomics studies that predict HER2 status in breast cancer use MRI images and mainly distinguish HER2-positive from HER2-negative disease. Only a few studies involve HER2-low status, and mammography-based radiomic studies that can accurately distinguish the different HER2 states are still lacking (16–18).

This study aims to combine preoperative mammography images with clinical and pathological data to create a radiomics model that predicts sentinel lymph node metastasis (SLNM) and HER2 status. The goal is to provide reliable, non-invasive diagnostic evidence for preoperative breast cancer evaluation, axillary lymph node metastasis risk assessment, and personalized treatment planning.

2 Materials and methods

2.1 Study subjects

This retrospective study included 838 female breast cancer patients who received treatment at Xinjiang Medical University Cancer Hospital between January 2016 and September 2024. The patients met the following inclusion criteria: (1) aged 24 to 88 years; (2) pathologically confirmed breast cancer; (3) sentinel lymph node biopsy (SLNB) or axillary lymph node dissection (ALND), with complete ALND if SLN was positive; (4) complete immunohistochemistry (IHC) data (ER, PR, HER2); (5) preoperative mammography; (6) unilateral breast cancer diagnosis in women aged 20–88 years; (7) complete clinical, pathological, and mammographic data; (8) no prior endocrine therapy, radiotherapy, or chemotherapy; (9) signed informed consent. Exclusion criteria included: (1) incomplete clinical data or poor-quality mammography; (2) no postoperative HER2 IHC or fluorescence in situ hybridization (FISH) testing, or an IHC score of 2+ without FISH; (3) distant metastasis; (4) prior breast cancer or other malignancies; (5) male breast cancer.

2.2 Clinical data collection

The 838 patients were randomly divided into a training set (586 cases) and a validation set (252 cases) in a 7:3 ratio. Data collected included age, ethnicity, menopausal status, lesion size, histological grade, TNM stage, vascular tumor thrombus, ER, PR, HER2, Ki-67, CK5/6, nerve invasion, SLN metastasis, and mammographic features (e.g., breast density, mass shape, margin, density, architectural distortion, calcification, skin changes). Ethnicity was classified into four categories: Han Chinese, Uyghur, Kazakh, and Others (including Hui and Mongolian). This classification reflects the predominant ethnic groups in Xinjiang and aligns with prior epidemiological studies in this region. HER2 status was classified according to the 2018 ASCO/CAP guidelines as HER2-zero (IHC score of 0), HER2-low (IHC score of 1+ or 2+ with negative FISH), or HER2-positive (IHC score of 3+ or 2+ with positive FISH) (19).
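The three-way HER2 grouping described above can be written as a small lookup rule. The following is a minimal sketch, not the authors' code; the function name `classify_her2` and the string encoding of IHC scores are assumptions made for illustration.

```python
from typing import Optional


def classify_her2(ihc_score: str, fish_positive: Optional[bool] = None) -> str:
    """Map an IHC score (plus a FISH result when needed) to a HER2 category."""
    if ihc_score == "0":
        return "HER2-zero"
    if ihc_score == "1+":
        return "HER2-low"
    if ihc_score == "2+":
        if fish_positive is None:
            # IHC 2+ without FISH was an exclusion criterion in this study.
            raise ValueError("IHC 2+ requires a FISH result")
        return "HER2-positive" if fish_positive else "HER2-low"
    if ihc_score == "3+":
        return "HER2-positive"
    raise ValueError(f"unknown IHC score: {ihc_score!r}")


# Example: an IHC 2+ tumor is split by its FISH result.
low_case = classify_her2("2+", fish_positive=False)
positive_case = classify_her2("2+", fish_positive=True)
```

Raising an error for IHC 2+ without FISH mirrors the exclusion criterion in Section 2.1 rather than silently assigning a category.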

2.3 Instruments and methods

2.3.1 Mammography image acquisition

IHC or FISH serves as the gold standard for HER2 assessment in breast cancer. Experienced breast X-ray specialists analyzed craniocaudal (CC) and mediolateral oblique (MLO) images using standard imaging techniques, ensuring maximum compression and automatic exposure control. Particular attention was given to lesions, axillary lymph nodes, and skin conditions during image acquisition.

Radiomics feature extraction and analysis

All image data were processed using Unet software for segmentation. The region of interest (ROI) was manually outlined by a radiologist with over five years of experience who was blinded to the pathological results. Using the Python-based pyradiomics toolkit, 1,409 features were initially extracted. After stability screening with the intraclass correlation coefficient (ICC > 0.75), 1,302 highly stable features were retained for subsequent analysis. Features included first-order statistics, 2D shape descriptors, texture features, and high-order features (e.g., GLCM, GLRLM, GLSZM, GLDM, NGTDM). The Synthetic Minority Over-sampling Technique (SMOTE) was used to balance the data. Z-score normalization was applied to standardize feature values, using the mean and standard deviation of each feature derived from the training set; these parameters were then applied to both the training and test sets to avoid data leakage. Feature selection proceeded in three steps: ICC screening (features with ICC > 0.75 retained), independent-sample t-tests, and LASSO. The data were split into training and test cohorts (7:3), with t-tests used to identify statistically significant features, followed by LASSO screening.
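The leakage-free normalization step can be illustrated with a short sketch. It uses only the Python standard library and toy numbers; the helper names `fit_zscore` and `apply_zscore`, and the use of the population standard deviation, are assumptions for illustration rather than details reported in the paper.

```python
from statistics import mean, pstdev


def fit_zscore(train_rows):
    """Return per-feature (mean, std) computed on the training rows only."""
    columns = list(zip(*train_rows))
    return [(mean(col), pstdev(col)) for col in columns]


def apply_zscore(rows, params):
    """Standardize rows with previously fitted training-set parameters."""
    standardized = []
    for row in rows:
        standardized.append([
            (value - mu) / sd if sd > 0 else 0.0  # constant features map to 0
            for value, (mu, sd) in zip(row, params)
        ])
    return standardized


# Toy data: three training cases and one test case, two features each.
train = [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]]
test = [[2.0, 40.0]]

params = fit_zscore(train)             # statistics come from training data only
train_z = apply_zscore(train, params)
test_z = apply_zscore(test, params)    # test data reuse the training statistics
```

The key point is that the test cases never contribute to the mean or standard deviation, so no information from the test set reaches the model during fitting.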

2.3.2 Model construction

A support vector machine (SVM) algorithm was used to model the features selected by LASSO. For the SVM implementation, we employed a radial basis function (RBF) kernel with γ parameter scaling, while reserving 99% of the dataset for training through a test_size parameter of 0.01. For LASSO regression, we implemented 10-fold cross-validation with α values spanning 4 logarithmic scales (10^-4 to 10^1), a maximum of 100,000 iterations, and a fixed random state (seed = 15). Class balancing was achieved through weighted samples, with LASSO regularization strength optimized across three orders of magnitude using 10-fold cross-validation and a convergence tolerance of 1e-4. Model performance was evaluated using receiver operating characteristic (ROC) curves and calibration curves, and clinical value was assessed using decision curve analysis (DCA).
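A pipeline of this kind could be sketched with scikit-learn as follows. This is an illustration under stated assumptions, not the authors' implementation: the data are synthetic, the 50-point α grid is an arbitrary choice, and the 7:3 split follows Section 2.2 rather than the test_size value quoted above. The hyperparameters that do come from the text are the RBF kernel with scaled γ, 10-fold cross-validation, 100,000 maximum iterations, a tolerance of 1e-4, and seed 15.

```python
import numpy as np
from sklearn.linear_model import LassoCV
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in data (hypothetical): 200 cases, 30 features,
# of which only the first three carry signal.
rng = np.random.default_rng(15)
X = rng.normal(size=(200, 30))
signal = X[:, 0] + 0.8 * X[:, 1] - 0.6 * X[:, 2]
y = (signal + rng.normal(scale=0.5, size=200) > 0).astype(int)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=15
)

# Standardize with training-set statistics only.
scaler = StandardScaler().fit(X_train)
X_train_z = scaler.transform(X_train)
X_test_z = scaler.transform(X_test)

# LASSO with 10-fold cross-validation over a logarithmic alpha grid.
lasso = LassoCV(
    alphas=np.logspace(-4, 1, 50),
    cv=10,
    max_iter=100_000,
    tol=1e-4,
    random_state=15,
).fit(X_train_z, y_train)
selected = np.flatnonzero(lasso.coef_)  # indices of features with nonzero weight

# RBF-kernel SVM on the selected features, with class weighting.
svm = SVC(kernel="rbf", gamma="scale", class_weight="balanced")
svm.fit(X_train_z[:, selected], y_train)

test_auc = roc_auc_score(y_test, svm.decision_function(X_test_z[:, selected]))
print(f"{selected.size} features kept, test AUC = {test_auc:.2f}")
```

Using `decision_function` for the AUC avoids the extra internal cross-validation that `probability=True` would trigger; calibrated probabilities would be needed for the calibration and decision curves.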

2.4 Statistical analysis

Statistical analysis was performed using SPSS 21, R (version 3.4.1), and Python (version 3.1). All tests were two-sided, with P < 0.05 considered significant. Quantitative data are presented as mean ± standard deviation (x̄ ± s); normally distributed data were compared with independent t-tests, while non-parametric Mann-Whitney U tests were applied to ranked data. Chi-square tests were used for categorical data comparisons. Ethnicity was treated as a categorical variable: chi-square tests compared its distribution between groups, and multivariate logistic regression included ethnicity as dummy variables (with Han Chinese as the reference group). Univariate and multivariate logistic regression analyses identified clinical and imaging features associated with HER2 expression and SLN metastasis. ROC and DCA curves were generated using Python, with 95% confidence intervals (CI). The model’s performance was evaluated by area under the curve (AUC), sensitivity, and specificity.
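For readers less familiar with these metrics, the sketch below computes AUC, sensitivity, and specificity from scratch on toy data. It is illustrative only: the function names, the six example cases, and the 0.5 cut-off are assumptions, and the study itself would have used library routines.

```python
def auc_score(labels, scores):
    """AUC as the probability that a positive case outranks a negative one."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # ties count as half a win
    return wins / (len(positives) * len(negatives))


def sensitivity_specificity(labels, scores, threshold):
    """Sensitivity and specificity when scores >= threshold are called positive."""
    pairs = list(zip(labels, scores))
    tp = sum(1 for y, s in pairs if y == 1 and s >= threshold)
    fn = sum(1 for y, s in pairs if y == 1 and s < threshold)
    tn = sum(1 for y, s in pairs if y == 0 and s < threshold)
    fp = sum(1 for y, s in pairs if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)


# Toy example: three metastasis-positive and three negative cases.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2]

auc = auc_score(labels, scores)  # 8 of 9 positive-negative pairs ranked correctly
sens, spec = sensitivity_specificity(labels, scores, threshold=0.5)
```

AUC is threshold-free, whereas sensitivity and specificity depend on the chosen cut-off, which is why both kinds of metric are reported.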

3 Results

3.1 Analysis of clinical characteristics of sentinel lymph node metastasis

This study included 838 patients (mean age 52.4 ± 10.6 years), 413 of whom had sentinel lymph node metastasis and 425 did not, all with complete clinical and pathological data. No significant differences were found between the training and test groups for any clinical or pathological factors (P > 0.05) (Table 1). In the training cohort, univariate analysis identified 12 clinical and pathological factors associated with sentinel lymph node status, including ethnicity, lesion size, histological grade, vascular tumor thrombus, nerve invasion, menopausal status, Ki67, clinical stage, mass margin, mass density, skin changes, and HER2 expression. These 12 factors were analyzed using multivariate logistic regression, which identified six independent risk factors for sentinel lymph node metastasis: ethnicity, lesion size, vascular tumor thrombus, clinical stage, tumor margin, and HER2 expression (Table 2).

Table 1. Baseline characteristics of the study sample.

Table 2. Univariate and multivariate logistic regression analysis for predicting SLNM.

3.2 Analysis of clinical characteristics of HER2 expression

Of the 838 patients, 246 (29.4%) had HER2-zero (0), 297 (35.4%) had HER2 1+ or 2+ (FISH-), and 295 (35.2%) had HER2 2+ (FISH+) or HER2 3+. No significant differences were found between the training and test groups (P > 0.05). In the training cohort, after incorporating 20 variables into univariate and multivariate logistic regression, seven factors were identified as independent risk factors for HER2 expression: lesion size, PR expression, menopausal status, SLN metastasis, Ki67, CK5/6 expression, and calcification (Table 3).

Table 3. Univariate and multivariate logistic regression analysis for predicting HER2 positive, low, and zero expression.

3.3 Construction of radiomics prediction model for sentinel lymph node metastasis

A total of 1,302 stable radiomic features (ICC ≥ 0.75) were analyzed for each patient, with 1,060 showing high stability (ICC ≥ 0.75). After preliminary screening with an independent sample t-test, the LASSO algorithm selected the 20 best features (Figure 1)(Table 4). These features include: First Order Statistics (3 features), GLCM features (4 features), GLSZM features (3 features), GLRLM features (5 features), GLDM features (2 features), Wavelet Transform Features (4 features), Multi-scale Filtering Features (5 features), LBP Features (2 features). Based on these features and their weighting coefficients, a radiomics score (radscore) was calculated for each patient. The support vector machine (SVM) algorithm was used to construct a prediction model. The model’s performance was evaluated using the receiver operating characteristic (ROC) curve, achieving an AUC of 0.84 in the training set and 0.83 in the validation set (Figure 2) (Table 5).
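A radscore of this kind is conventionally a linear combination of the selected feature values and their LASSO coefficients. The sketch below assumes that form; the feature names, coefficients, and intercept are hypothetical and are not taken from Table 4.

```python
def radscore(features, coefficients, intercept=0.0):
    """Linear combination of standardized feature values and LASSO coefficients."""
    return intercept + sum(
        coefficients[name] * features[name] for name in coefficients
    )


# Hypothetical coefficients and standardized feature values, for illustration only.
coefficients = {
    "glcm_contrast": 0.40,
    "firstorder_entropy": -0.25,
    "glrlm_run_variance": 0.10,
}
patient = {
    "glcm_contrast": 1.2,
    "firstorder_entropy": 0.4,
    "glrlm_run_variance": -0.5,
}

score = radscore(patient, coefficients, intercept=0.05)
```

Features whose LASSO coefficient is zero drop out of the sum, which is how the selection step reduces the feature set to a compact signature.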

Figure 1. Independent sample t-test and LASSO regression analysis were used to screen the significant features for predicting sentinel lymph nodes.

Table 4. Optimal characteristics for predicting SLNM.

Figure 2. Sentinel lymph node status prediction model. (A) Training set ROC curve. (B) Test set ROC curve. (C) Calibration curve analysis of the prediction model. (D) Decision curve analysis of the prediction model. Training cohort AUC (95% CI): 0.84 (0.79–0.87); test cohort AUC (95% CI): 0.83 (0.71–0.84).

Table 5. Performance evaluation of sentinel lymph node status model.

3.4 Construction of radiomics prediction model for HER2 expression

From the craniocaudal (CC) and mediolateral oblique (MLO) mammographic images, 1,302 radiomic features were extracted. After dimensionality reduction using ICC and t-tests, 54 optimal features were selected by LASSO regression (Figure 3) (Table 6). These features include: First Order Statistics (9 features), GLCM features (5 features), GLSZM features (10 features), GLRLM features (8 features), GLDM features (6 features), Wavelet Transform Features (12 features), Multi-scale Filtering Features (10 features), LBP Features (4 features). A radiomics model based on these features showed strong prediction performance for HER2 status. ROC analysis revealed AUCs of 0.85 (training) and 0.84 (validation) for HER2-zero (0); 0.82 (training) and 0.78 (validation) for HER2 1+ or 2+ (FISH-); and 0.87 (training) and 0.84 (validation) for HER2 2+ (FISH+) or HER2 3+ (Figure 4) (Table 7).

Figure 3. Independent sample t-test and LASSO regression analysis were used to screen significant features for predicting HER2.

Table 6. Optimal characteristics for predicting HER2 positive, low, and zero expression.

Figure 4. HER2 expression prediction model. (A) Training set ROC curve. (B) Test set ROC curve. (C) Calibration curve analysis of the prediction model. (D) Decision curve analysis of the prediction model. Training cohort AUC (95% CI): HER2-zero 0.85 (0.74–0.87), HER2-low 0.82 (0.72–0.86), HER2-positive 0.87 (0.73–0.93); test cohort AUC (95% CI): HER2-zero 0.84 (0.75–0.89), HER2-low 0.78 (0.61–0.86), HER2-positive 0.84 (0.76–0.89).

Table 7. Performance evaluation of HER2 expression model.

4 Discussion

The evolving approach to breast cancer surgery emphasizes minimizing trauma and enhancing patient quality of life. Accurate preoperative assessment of HER2 status and sentinel lymph node metastasis is crucial for tailoring treatment plans, evaluating prognosis, and predicting recurrence risk (20, 21). This study demonstrates that radiomic features derived from mammography effectively predict HER2 expression subtypes and sentinel lymph node (SLN) metastasis. Our model achieved AUCs of 0.82–0.87 in the training set, outperforming traditional clinical-pathological assessments. These findings provide a non-invasive tool to guide personalized treatment strategies and reduce unnecessary surgical interventions. The decision curve analysis (DCA) further confirmed the clinical utility of the model, showing a net benefit across a wide range of threshold probabilities (10–60%). For example, in high-risk patients (predicted SLN metastasis probability >60%), clinicians may prioritize SLNB to confirm metastasis and plan axillary dissection, in line with current guidelines. Conversely, in low-risk patients (predicted probability <20%), the model supports avoiding unnecessary SLNB procedures in favor of watchful waiting or non-invasive monitoring. This stratification could reduce surgical complications by 30–40% in low-risk cohorts, as observed in breast cancer risk management studies (21). This dual-threshold approach highlights the utility of DCA in translating model outputs into actionable decisions, as similarly demonstrated in glioma biomarker research (22). Unlike ROC analysis, which evaluates diagnostic accuracy (AUC), DCA quantifies clinical net benefit by balancing true-positive gains against false-positive harms. For instance, in our HER2 expression model, the AUC of 0.87 reflects high discriminative power, while DCA shows that applying the model at a 15–50% threshold range would prevent 35–50% of unnecessary biopsies without compromising sensitivity.
While AUC-ROC quantifies diagnostic accuracy, DCA evaluates clinical net benefit. Both metrics were analyzed (Figure 4D), but AUC remains the gold standard for direct comparison with prior radiomics studies, such as those investigating RAD51, SCN3B, and CDK2 in cancer biomarker discovery (22–24).
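The net benefit that DCA plots at each threshold probability p_t is commonly defined as TP/n − (FP/n) × p_t/(1 − p_t). The sketch below applies that standard definition to toy data; the eight example cases and the function names are assumptions for illustration, not study results.

```python
def net_benefit(labels, probabilities, threshold):
    """Net benefit of intervening on cases whose predicted risk >= threshold."""
    n = len(labels)
    pairs = list(zip(labels, probabilities))
    tp = sum(1 for y, p in pairs if p >= threshold and y == 1)
    fp = sum(1 for y, p in pairs if p >= threshold and y == 0)
    weight = threshold / (1.0 - threshold)  # harm of a false positive
    return tp / n - (fp / n) * weight


def net_benefit_treat_all(labels, threshold):
    """Reference strategy: intervene on every patient."""
    prevalence = sum(labels) / len(labels)
    return prevalence - (1.0 - prevalence) * threshold / (1.0 - threshold)


# Toy example: eight cases, three of them truly positive.
labels = [1, 1, 0, 0, 1, 0, 0, 0]
probs = [0.9, 0.6, 0.55, 0.2, 0.3, 0.1, 0.4, 0.05]

nb_model = net_benefit(labels, probs, threshold=0.5)
nb_all = net_benefit_treat_all(labels, threshold=0.5)
```

A model is clinically useful at a given threshold when its net benefit exceeds both the treat-all reference and zero (the treat-none strategy); a decision curve repeats this comparison across the whole threshold range.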

Recent studies have focused on the relationship between breast cancer lymph node metastasis and primary lesion imaging features (25). While significant progress has been made, most radiomics research has focused on predicting non-sentinel lymph node (non-SLN) status (26) or axillary lymph node (ALN) metastasis (27, 28) to reduce postoperative complications. Research on SLN status, however, remains limited. This study found that ethnicity, lesion size, vascular tumor thrombus, clinical stage, mammography-based tumor margin, and HER2 expression were independent predictors of SLN metastasis, aligning with previous studies (4, 29). Previous research has highlighted a strong correlation between tumor margin characteristics and ALN metastasis (ALNM) risk (30). Spiculated margins, in particular, increase ALNM risk by approximately sixfold compared to clear margins, a finding supported by this study. This may be attributed to cancer cell infiltration inducing fibrosis, which accelerates the formation of blood and lymphatic vessels, thereby facilitating tumor spread. However, some studies suggest that fibroplasia might slightly delay tumor spread (31). HER2 is a transmembrane receptor protein with tyrosine kinase activity, typically in an inactive state, playing a role in cell growth and differentiation. HER2 positivity is associated with tumor development and metastasis. This study confirmed the association between high HER2 expression and SLN metastasis, consistent with the findings of Ding J et al. (31). Other studies have also identified vascular invasion and tumor size as strongly correlated with SLN metastasis. In this study, vascular tumor thrombus was a key predictor; when present, the risk of SLN metastasis was 4.27 times higher than in patients without vascular tumor thrombus, and its impact exceeded that of the other indicators.
This suggests that the tumor may have broken through the local confines of the breast and has a higher potential for spread and metastasis, further highlighting the value of clinicopathological factors in predicting SLN status and providing a strong basis for individualized treatment strategies. To explore a non-invasive and efficient method for identifying SLN status before surgery, this study constructed a mammography-based predictive model that achieved an area under the receiver operating characteristic (ROC) curve (AUC) of 0.84 in the training set. It is expected to serve as a digital biomarker that conveys information similar to SLN biopsy or lymph node dissection, providing an important reference for clinical treatment decisions. In comparison, Dong et al. (32) predicted lymph node status using radiomics based on T2WI-FS and DWI sequences, with an AUC of 0.805, while Ding et al. (33) used DCE-MRI intratumoral and combined intratumoral and peritumoral radiomics models, with AUCs of 0.704 and 0.796. These results are broadly consistent with this study’s findings. This study also found that multiple mammography features and clinicopathological factors are independently related to SLN status, highlighting the potential value of mammography imaging as a non-invasive tool to identify SLN status in breast cancer patients. With the growing use of neoadjuvant systemic therapy (NAST), SLN biopsy is frequently performed after neoadjuvant therapy. In these cases, radiomics evaluation can assist in subsequent treatment decisions, particularly when therapy leads to downstaging.

The precise stratification of HER2 expression subtypes (HER2-zero, -low, -positive) is pivotal for tailoring ADC therapies. For example, HER2-low patients, once considered ineligible for HER2-targeted drugs, now represent a population with emerging therapeutic options. The DESTINY-Breast04 trial (10) demonstrated that DS-8201 significantly improves survival in HER2-low metastatic breast cancer. Our radiomics model, with AUCs of 0.82 (training) and 0.78 (test) for HER2-low identification, provides a non-invasive tool to preoperatively pinpoint these candidates, thereby avoiding undertreatment due to misclassification. Moreover, integrating radiomics with clinical factors can optimize treatment strategies and minimize surgical overtreatment. This study revealed that several clinicopathological factors were independently associated with changes in HER2 expression patterns. These factors, including lesion size, PR expression, menopausal status, axillary lymph node status, Ki67, CK5/6, and mammography calcification, predicted HER2 zero, low, and positive expression, in line with previous studies (20, 34–36). The observed association between calcification features and HER2 expression may reflect underlying biological processes. Calcification, caused by calcium deposition in breast tissue, often develops due to tissue ischemia and necrosis resulting from hypoxia and nutrient deficiency in rapidly growing tumors (37). HER2 positivity is known to drive tumor proliferation and metabolic reprogramming, potentially leading to hypoxia-induced necrosis and subsequent calcium deposition in the tumor microenvironment (38). Additionally, HER2 signaling activates pathways such as PI3K/AKT and MAPK, which promote cellular stress and apoptosis, further contributing to dystrophic calcifications (39). Radiomic features capturing clustered or linear calcifications on mammography may thus serve as non-invasive indicators of HER2-driven tumor aggressiveness.
This hypothesis aligns with prior studies demonstrating that the presence of microcalcifications strongly increased the likelihood of HER2 positivity (36). Similarly, our study found that calcification is strongly associated with HER2 expression, with the risk for calcified lesions being 1.51 times higher than for non-calcified lesions. Clustered or linearly distributed calcifications should raise particular concern among clinicians. Integrating these biological insights with radiomic models could enhance their utility in guiding targeted therapies. Additionally, CK5/6, a basal cell marker, reflects the differentiation status of tumor cells and plays a crucial role in classifying breast cancer subtypes and evaluating invasiveness. This study found that CK5/6 positivity is more common in less differentiated and HER2-low expressing breast cancers, especially in basal-like subtypes, which show greater invasiveness and metastatic ability. These findings further emphasize the potential of combining mammography imaging with clinicopathological factors in improving HER2 expression models and provide a valuable basis for developing personalized treatment strategies. This study also showed that a predictive model based on mammography can effectively distinguish the three HER2 expression states in breast cancer. In the training set, the model’s AUCs for distinguishing between HER2 positive, HER2 low expression, and HER2 zero expression were 0.87, 0.82, and 0.85, respectively, which was superior to the previously reported single-parameter MRI radiomics method (40). For instance, Bian et al.’s (41) multi-parameter MRI-based imaging study had an AUC of 0.76 in distinguishing HER2 positive from HER2 negative, but when identifying tumors with low HER2 expression and zero HER2 expression, the AUC was only 0.71.
In contrast, the mammography imaging model in this study can more accurately distinguish different expression states of HER2, showing greater diagnostic efficiency. Although IHC and FISH are standard methods for assessing HER2 expression, their limitations include the limited representativeness of a single sample and tumor heterogeneity. This study suggests that incorporating radiomics features into diagnostics can assist pathologists in achieving more comprehensive HER2 identification and enhancing the precision of biopsy target selection (42, 43). Additionally, during neoadjuvant chemotherapy, radiomics can dynamically track HER2 expression changes, enabling timely adjustments to treatment strategies. For patients with drug-resistant or triple-negative breast cancer, imaging-guided re-detection of low HER2 expression in clinical trials may become a critical strategy for optimizing treatment. In addition, future research may incorporate advanced nanomaterials to enhance imaging resolution and therapeutic monitoring, thereby refining radiomic feature extraction and clinical applicability (44).

Although our model demonstrated high diagnostic accuracy and potential clinical application value, its clinical translation requires validation in multicenter cohorts. We recognize that variations in mammography equipment and regional differences in HER2 testing protocols (e.g., IHC/FISH criteria) may impact model generalizability. To address this, we are attempting to initiate partnerships with institutions in geographically diverse regions of China, aiming to collect heterogeneous data for external validation.

5 Conclusion

In conclusion, breast cancer mammography radiomics demonstrated high accuracy in identifying HER2 subtypes and predicting sentinel lymph node (SLN) metastasis. This has significant implications for developing personalized treatment plans, assessing prognosis, and guiding clinical decision-making. However, the use of radiomics is still in its early stages. As data sharing expands and machine learning technology advances, its potential value in the medical field requires further exploration.

6 Limitations

This study has several limitations: (1) This study is limited by its single-center retrospective design, which may restrict the generalizability of the model to other populations. While we employed rigorous internal validation, future multicenter studies are imperative to assess performance across diverse ethnic groups, imaging devices, and clinical protocols. Challenges such as inter-institutional data harmonization and ethical approvals currently hinder immediate expansion, but collaborative efforts are underway; (2) ROI delineation was performed using a two-dimensional approach, which may be influenced by the volume effect. Future studies could consider using three-dimensional imaging to enhance accuracy; (3) Some clinical characteristics were assessed semi-qualitatively, and the results could be influenced by evaluator subjectivity.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Ethics Committee of the Affiliated Tumor Hospital of Xinjiang Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author contributions

ZZ: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Software, Supervision, Writing – original draft, Writing – review & editing. HY: Data curation, Formal Analysis, Methodology, Software, Supervision, Writing – original draft, Writing – review & editing. XS: Conceptualization, Formal Analysis, Investigation, Supervision, Writing – original draft, Writing – review & editing. WL: Conceptualization, Data curation, Formal Analysis, Investigation, Software, Writing – original draft, Writing – review & editing. YC: Conceptualization, Investigation, Software, Writing – original draft, Writing – review & editing. XW: Conceptualization, Investigation, Software, Writing – original draft, Writing – review & editing. CD: Funding acquisition, Project administration, Resources, Validation, Visualization, Writing – original draft, Writing – review & editing. BM: Funding acquisition, Project administration, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by the Tianshan Innovation Team Project of the Xinjiang Uygur Autonomous Region Science and Technology Department (Grant Number: 2022TSYCTD0017).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Abbreviations

ROI, Region of interest; LASSO, Least absolute shrinkage and selection operator; SVM, Support vector machine; ROC, Receiver operating characteristic; AUC, Area under the curve; HER2, Human epidermal growth factor receptor 2; SLN, Sentinel lymph node; SLNB, Sentinel lymph node biopsy; SLNM, Sentinel lymph node metastasis; ALND, Axillary lymph node dissection; ER, Estrogen receptor; PR, Progesterone receptor; IARC, International Agency for Research on Cancer; IHC, Immunohistochemistry; FISH, Fluorescence in situ hybridization; MRI, Magnetic resonance imaging; MG, Mammography; CC, Craniocaudal; MLO, Mediolateral oblique; GLCM, Gray-level co-occurrence matrix; GLRLM, Gray-level run-length matrix; GLSZM, Gray-level size zone matrix; GLDM, Gray-level dependence matrix; NGTDM, Neighborhood gray tone difference matrix; ICC, Intra-class correlation coefficient; SMOTE, Synthetic minority oversampling technique; CI, Confidence interval; 2D, Two-dimensional.

References

1. Siegel RL, Giaquinto AN, and Jemal A. Cancer statistics, 2024. CA Cancer J Clin. (2024) 74:12–49. doi: 10.3322/caac.21820

2. Sonkin D, Thomas A, and Teicher BA. Cancer treatments: Past, present, and future. Cancer Genet. (2024) 286-287:18–24. doi: 10.1016/j.cancergen.2024.06.002

3. Joshi RM, Telang B, Soni G, and Khalife A. Overview of perspectives on cancer, newer therapies, and future directions. Endoscopic Ultrasound. (2024) 10:105–9. doi: 10.1097/ot9.0000000000000039

4. Marino MA, Avendano D, Zapata P, Riedl CC, and Pinker K. Lymph node imaging in patients with primary breast cancer: concurrent diagnostic tools. Oncologist. (2020) 25:e231–42. doi: 10.1634/theoncologist.2019-0427

5. Zha HL, Zong M, Liu XP, Pan JZ, Wang H, Gong HY, et al. Preoperative ultrasound-based radiomics score can improve the accuracy of the Memorial Sloan Kettering Cancer Center nomogram for predicting sentinel lymph node metastasis in breast cancer. Eur J Radiol. (2021) 135:109512. doi: 10.1016/j.ejrad.2020.109512

6. Harbeck N, Penault-Llorca F, Cortes J, Gnant M, Houssami N, Poortmans P, et al. Breast cancer. Nat Rev Dis Primers. (2019) 5:66. doi: 10.1038/s41572-019-0111-2

7. Zou Y, Zheng S, Xie X, Ye F, Hu X, Tian Z, et al. N6-methyladenosine regulated FGFR4 attenuates ferroptotic cell death in recalcitrant HER2-positive breast cancer. Nat Commun. (2022) 13:2672. doi: 10.1038/s41467-022-30217-7

8. Agostinetto E, Rediti M, Fimereli D, Debien V, Piccart M, Aftimos P, et al. HER2-low breast cancer: molecular characteristics and prognosis. Cancers (Basel). (2021) 13:2824. doi: 10.3390/cancers13112824

9. Shao X, Xie N, Chen Z, Wang X, Cao W, Zheng Y, et al. Inetetamab for injection in combination with vinorelbine weekly or every three weeks in HER2-positive metastatic breast cancer: A multicenter, randomized, phase II clinical trial. J Transl Int Med. (2024) 12:466–77. doi: 10.1515/jtim-2024-0022

10. Modi S, Jacot W, Yamashita T, Sohn J, Vidal M, Tokunaga E, et al. Trastuzumab deruxtecan in previously treated HER2-low advanced breast cancer. N Engl J Med. (2022) 387:9–20. doi: 10.1056/NEJMoa2203690

11. Eiger D, Agostinetto E, Saúde-Conde R, and de Azambuja E. The exciting new field of HER2-low breast cancer treatment. Cancers (Basel). (2021) 13:1015. doi: 10.3390/cancers13051015

12. Eun NL, Kang D, Son EJ, Park JS, Youk JH, Kim JA, et al. Texture analysis with 3.0-T MRI for association of response to neoadjuvant chemotherapy in breast cancer. Radiology. (2020) 294:31–41. doi: 10.1148/radiol.2019182718

13. Chang JM, Leung JWT, Moy L, Ha SM, and Moon WK. Axillary nodal evaluation in breast cancer: state of the art. Radiology. (2020) 295:500–15. doi: 10.1148/radiol.2020192534

14. Afrin H, Larson NB, Fatemi M, and Alizad A. Deep learning in different ultrasound methods for breast cancer, from diagnosis to prognosis: current trends, challenges, and an analysis. Cancers (Basel). (2023) 15:3139. doi: 10.3390/cancers15123139

15. Song X, Xu H, Wang X, Liu W, Leng X, Hu Y, et al. Use of ultrasound imaging Omics in predicting molecular typing and assessing the risk of postoperative recurrence in breast cancer. BMC Womens Health. (2024) 24:380. doi: 10.1186/s12905-024-03231-8

16. Xu Z, Yang Q, Li M, Gu J, Du C, Chen Y, et al. Predicting HER2 status in breast cancer on ultrasound images using deep learning method. Front Oncol. (2022) 12:829041. doi: 10.3389/fonc.2022.829041

17. Quan MY, Huang YX, Wang CY, Zhang Q, Chang C, and Zhou SC. Deep learning radiomics model based on breast ultrasound video to predict HER2 expression status. Front Endocrinol (Lausanne). (2023) 14:1144812. doi: 10.3389/fendo.2023.1144812

18. Huang Y, Wei L, Hu Y, Shao N, Lin Y, He S, et al. Multi-parametric MRI-based radiomics models for predicting molecular subtype and androgen receptor expression in breast cancer. Front Oncol. (2021) 11:706733. doi: 10.3389/fonc.2021.706733

19. Wolff AC, Hammond MEH, Allison KH, Harvey BE, Mangu PB, Bartlett JMS, et al. Human epidermal growth factor receptor 2 testing in breast cancer: American society of clinical oncology/college of American pathologists clinical practice guideline focused update. J Clin Oncol. (2018) 36:2105–22. doi: 10.1200/jco.2018.77.8738

20. Ramtohul T, Djerroudi L, Lissavalid E, Nhy C, Redon L, Ikni L, et al. Multiparametric MRI and radiomics for the prediction of HER2-zero, -low, and -positive breast cancers. Radiology. (2023) 308:e222646. doi: 10.1148/radiol.222646

21. Chen M, Kong C, Lin G, Chen W, Guo X, Chen Y, et al. Development and validation of convolutional neural network-based model to predict the risk of sentinel or non-sentinel lymph node metastasis in patients with breast cancer: a machine learning study. EClinicalMedicine. (2023) 63:102176. doi: 10.1016/j.eclinm.2023.102176

22. Liu H, Weng J, Huang CL, and Jackson AP. Is the voltage-gated sodium channel β3 subunit (SCN3B) a biomarker for glioma? Funct Integr Genomics. (2024) 24:162. doi: 10.1007/s10142-024-01443-7

23. Liu H and Weng J. A pan-cancer bioinformatic analysis of RAD51 regarding the values for diagnosis, prognosis, and therapeutic prediction. Front Oncol. (2022) 12:858756. doi: 10.3389/fonc.2022.858756

24. Liu H and Weng J. A comprehensive bioinformatic analysis of cyclin-dependent kinase 2 (CDK2) in glioma. Gene. (2022) 822:146325. doi: 10.1016/j.gene.2022.146325

25. Wang C, Chen X, Luo H, Liu Y, Meng R, Wang M, et al. Development and internal validation of a preoperative prediction model for sentinel lymph node status in breast cancer: combining radiomics signature and clinical factors. Front Oncol. (2021) 11:754843. doi: 10.3389/fonc.2021.754843

26. Qiu Y, Zhang X, Wu Z, Wu S, Yang Z, Wang D, et al. MRI-based radiomics nomogram: prediction of axillary non-sentinel lymph node metastasis in patients with sentinel lymph node-positive breast cancer. Front Oncol. (2022) 12:811347. doi: 10.3389/fonc.2022.811347

27. Fong W, Tan L, Tan C, Wang H, Liu F, Tian H, et al. Predicting the risk of axillary lymph node metastasis in early breast cancer patients based on ultrasonographic-clinicopathologic features and the use of nomograms: a prospective single-center observational study. Eur Radiol. (2022) 32:8200–12. doi: 10.1007/s00330-022-08855-8

28. Yu Y, Tan Y, Xie C, Hu Q, Ouyang J, Chen Y, et al. Development and validation of a preoperative magnetic resonance imaging radiomics-based signature to predict axillary lymph node metastasis and disease-free survival in patients with early-stage breast cancer. JAMA Netw Open. (2020) 3:e2028086. doi: 10.1001/jamanetworkopen.2020.28086

29. Liu L, Lin Y, Li G, Zhang L, Zhang X, Wu J, et al. A novel nomogram for decision-making assistance on exemption of axillary lymph node dissection in T1–2 breast cancer with only one sentinel lymph node metastasis. Front Oncol. (2022) 12:924298. doi: 10.3389/fonc.2022.924298

30. Zong Q, Deng J, Ge W, Chen J, and Xu D. Establishment of simple nomograms for predicting axillary lymph node involvement in early breast cancer. Cancer Manag Res. (2020) 12:2025–35. doi: 10.2147/cmar.S241641

31. Eun NL, Bae SJ, Youk JH, Son EJ, Ahn SG, Jeong J, et al. Tumor-infiltrating lymphocyte level consistently correlates with lower stiffness measured by shear-wave elastography: subtype-specific analysis of its implication in breast cancer. Cancers (Basel). (2024) 16:1254. doi: 10.3390/cancers16071254

32. Liu Y, Li X, Zhu L, Zhao Z, Wang T, Zhang X, et al. Preoperative prediction of axillary lymph node metastasis in breast cancer based on intratumoral and peritumoral DCE-MRI radiomics nomogram. Contrast Media Mol Imaging. (2022) 2022:6729473. doi: 10.1155/2022/6729473

33. Ding J, Chen S, Serrano Sosa M, Cattell R, Lei L, Sun J, et al. Optimizing the peritumoral region size in radiomics analysis for sentinel lymph node status prediction in breast cancer. Acad Radiol. (2022) 29 Suppl 1:S223–S228. doi: 10.1016/j.acra.2020.10.015

34. Yin Y, Mo S, Li G, Wu H, Hu J, Zheng J, et al. Ultrasound radiomics for the prediction of breast cancers with HER2-zero, -low, and -positive status: A dual-center study. Technol Cancer Res Treat. (2024) 23:15330338241292668. doi: 10.1177/15330338241292668

35. Xu A, Chu X, Zhang S, Zheng J, Shi D, Lv S, et al. Prediction breast molecular typing of invasive ductal carcinoma based on dynamic contrast enhancement magnetic resonance imaging radiomics characteristics: A feasibility study. Front Oncol. (2022) 12:799232. doi: 10.3389/fonc.2022.799232

36. Deng Y, Lu Y, Li X, Zhu Y, Zhao Y, Ruan Z, et al. Prediction of human epidermal growth factor receptor 2 (HER2) status in breast cancer by mammographic radiomics features and clinical characteristics: a multicenter study. Eur Radiol. (2024) 34:5464–76. doi: 10.1007/s00330-024-10607-9

37. Azam S, Eriksson M, Sjölander A, Gabrielson M, Hellgren R, Czene K, et al. Mammographic microcalcifications and risk of breast cancer. Br J Cancer. (2021) 125:759–65. doi: 10.1038/s41416-021-01459-x

38. O’Grady S and Morgan MP. Microcalcifications in breast cancer: From pathophysiology to diagnosis and prognosis. Biochim Biophys Acta Rev Cancer. (2018) 1869:310–20. doi: 10.1016/j.bbcan.2018.04.006

39. Miricescu D, Totan A, Stanescu-Spinu II, Badoiu SC, Stefani C, and Greabu M. PI3K/AKT/mTOR signaling pathway in breast cancer: from molecular landscape to clinical aspects. Int J Mol Sci. (2020) 22:173. doi: 10.3390/ijms22010173

40. Xu A, Chu X, Zhang S, Zheng J, Shi D, Lv S, et al. Development and validation of a clinicoradiomic nomogram to assess the HER2 status of patients with invasive ductal carcinoma. BMC Cancer. (2022) 22:872. doi: 10.1186/s12885-022-09967-6

41. Bian X, Du S, Yue Z, Gao S, Zhao R, Huang G, et al. Potential antihuman epidermal growth factor receptor 2 target therapy beneficiaries: the role of MRI-based radiomics in distinguishing human epidermal growth factor receptor 2-low status of breast cancer. J Magn Reson Imaging. (2023) 58:1603–14. doi: 10.1002/jmri.28628

42. Zheng S, Yang Z, Du G, Zhang Y, Jiang C, Xu T, et al. Discrimination between HER2-overexpressing, -low-expressing, and -zero-expressing statuses in breast cancer using multiparametric MRI-based radiomics. Eur Radiol. (2024) 34:6132–44. doi: 10.1007/s00330-024-10641-7

43. Peng Y, Zhang X, Qiu Y, Li B, Yang Z, Huang J, et al. Development and validation of MRI radiomics models to differentiate HER2-zero, -low, and -positive breast cancer. AJR Am J Roentgenol. (2024) 222:e2330603. doi: 10.2214/ajr.23.30603

44. Zhou M, Tian M, and Li C. Copper-based nanomaterials for cancer imaging and therapy. Bioconjug Chem. (2016) 27:1188–99. doi: 10.1021/acs.bioconjchem.6b00156

Keywords: breast cancer, HER2 expression, sentinel lymph node, mammography, radiomics

Citation: Zhao Z, Yuan H, Song X, Liu W, Chen Y, Wang X, Dong C and Ma B (2025) Preoperative prediction of HER2 expression and sentinel lymph node status in breast cancer using a mammography radiomics model. Front. Oncol. 15:1578458. doi: 10.3389/fonc.2025.1578458

Received: 20 February 2025; Accepted: 15 May 2025;
Published: 04 June 2025.

Edited by:

Hailin Tang, Sun Yat-sen University Cancer Center (SYSUCC), China

Reviewed by:

Hengrui Liu, University of Cambridge, United Kingdom
Min Deng, Guangzhou Medical University Cancer Hospital, China
Lei Li, University of South China, China

Copyright © 2025 Zhao, Yuan, Song, Liu, Chen, Wang, Dong and Ma. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Binlin Ma; Chao Dong
