Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Oncol., 14 January 2026

Sec. Breast Cancer

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1674922

Value of intra- and peritumoral ultrasound radiomics for predicting axillary lymph node burden in breast cancer

  • 1Department of Ultrasound Medicine, Baoding First Central Hospital, Baoding, China
  • 2Department of Postgraduate, Chengde Medical University, Chengde, China
  • 3Department of Urology, Baoding First Central Hospital, Baoding, China
  • 4Department of Radiology, Baoding First Central Hospital, Baoding, China

Objective: To evaluate ultrasound-based radiomic features, derived from both intratumoral and peritumoral regions, for noninvasive preoperative prediction of axillary lymph node(ALN) burden in breast cancer.

Methods: This retrospective study analyzed data from 300 pathologically confirmed breast cancer patients undergoing preoperative ultrasound. The cohort was randomly divided into a training set (n = 210) and a testing set (n = 90) at a 7∶3 ratio. Primary tumor regions of interest (ROIs) were manually delineated on preoperative ultrasound images using ITK-SNAP. Peritumoral ROIs were generated by radially expanding the intratumoral ROI by 2mm, 3mm, and 4mm. A comprehensive set of radiomic features was extracted from each ROI, with feature selection via LASSO based methods. Six machine-learning classifiers were trained on intratumoral features to identify the optimal algorithm. Using this algorithm, we built: (1) A radiomics model based solely on intratumoral or peritumoral features. (2) Combined models incorporating intratumoral and peritumoral features at each expansion margin (2mm, 3mm, and 4mm). The best-performing radiomics model was then integrated with significant clinical and conventional imaging variables to construct a composite nomogram. Model discrimination was evaluated by area under the receiver operating characteristic curve (AUC), calibration was assessed via calibration curves, and clinical utility was appraised using decision curve analysis (DCA). Model interpretability was facilitated through Shapley additive explanation (SHAP) values and visualized in a nomogram.

Results: A Random Forest classifier applied to combined intratumoral and 3mm peritumoral features yielded the highest AUCs (training set: 0.825; testing set: 0.746). Multivariable logistic regression identified lesion location and ultrasonographic axillary lymph node status as independent clinical predictors (p<0.05). The integrated nomogram—combining these clinical factors with the optimal radiomics signature—demonstrated superior performance (training AUC: 0.906; testing AUC: 0.818). DCA confirmed that the combined model conferred the greatest net clinical benefit across a range of threshold probabilities, and calibration curves indicated excellent agreement between predicted and observed probabilities.

Discussion: A composite model integrating intratumoral and 3mm peritumoral ultrasound radiomic features with key clinical and imaging variables enables accurate, noninvasive preoperative prediction of ALN burden in breast cancer. This approach may serve as a valuable decision support tool to guide individualized surgical planning.

1 Introduction

Breast cancer is the most frequently diagnosed malignancy among women worldwide and remains a leading cause of cancer-related mortality (1). According to GLOBOCAN 2022 estimates, both the incidence and mortality of breast cancer have continued to rise globally (2). In the diagnostic and therapeutic pathway for breast cancer, accurate preoperative assessment of axillary lymph node status is critical for clinical staging, treatment planning, and prognostic stratification, and it constitutes a key determinant of long-term survival outcomes (3).

The National Comprehensive Cancer Network (NCCN) guidelines recommend that, for patients with clinical T1–2 disease who have not received neoadjuvant chemotherapy and are candidates for breast-conserving surgery followed by whole-breast irradiation, low axillary lymph node burden (≤2 metastatic nodes (4)) may obviate the need for axillary lymph node dissection, whereas high burden (>3 metastatic nodes) typically warrants combined systemic therapy and axillary lymph node dissection (5). Consequently, imaging objectives have evolved from simply determining the presence or absence of nodal metastasis to the quantification of nodal tumor load (6). Preoperative quantification of axillary lymph node burden is therefore pivotal for tailoring surgical approach and minimizing postoperative complications (7). Notably, approximately 70% of early-stage breast cancer patients are ultimately found to have no nodal metastasis on postoperative pathology, suggesting that current invasive assessments may contribute to overtreatment (8). Hence, there is a pressing clinical need to develop noninvasive methods for accurately identifying axillary lymph node burden.

Ultrasound is routinely employed in this context due to its safety profile, cost-effectiveness, and real-time imaging capability (9). Conventional ultrasound assessment relies on subjective evaluation of primary tumor morphology and alterations in nodal architecture (6), but its diagnostic specificity remains limited and heavily dependent on the operator’s experience.

Radiomics—a high-throughput approach that extracts large numbers of quantitative, high-dimensional features from medical images—offers a noninvasive means of characterizing tumor heterogeneity (10). Prior radiomic investigations into breast cancer axillary lymph node status have predominantly focused on intratumoral features. However, emerging evidence indicates that the peritumoral region harbors critical biological information that may drive tumor progression, underscoring the tumor microenvironment as a potential therapeutic target (11). Most studies to date, however, lack a systematic comparison of different machine-learning classifiers, limiting the selection and optimization of the most effective predictive models (12).

In this study, we extracted both intratumoral and peritumoral radiomic features and constructed predictive models using multiple classification algorithms. We then identified the best-performing machine-learning algorithm and integrated its radiomic signature with clinicopathological variables to build a composite model, with the goal of enhancing the accuracy and robustness of noninvasive preoperative axillary lymph node burden assessment in breast cancer and providing potential for noninvasive preoperative assessment of breast cancer axillary lymph node burden.

2 Materials and methods

2.1 Study cohort

This retrospective study was designed to develop a preoperative prediction model. Accordingly, patient selection and outcome definition followed a two-stage process. This retrospective study included consecutive patients with primary breast cancer who underwent preoperative ultrasound and had core-needle biopsy (CNB) at the First Central Hospital of Baoding between January 2023 and December 2024. From an initial screen of 418 patients, 300 were enrolled based on the following preoperative criteria: (1) female patients aged >18 years; (2) complete histopathological confirmation of breast carcinoma by CNB; (3) an intent to undergo axillary surgery (SLNB or ALND), which ensured the future availability of the outcome data; (4) solitary breast lesion without distant metastasis; and (5) ultrasound imaging performed within two weeks before surgery with adequate image quality. Exclusion criteria comprised: (1) any neoadjuvant or other oncologic treatment administered prior to ultrasound; and (2) incomplete or unavailable clinical histopathological data, or suboptimal depiction of the lesion on ultrasound.

Following surgery, the outcome variable (axillary lymph node burden) was defined based on the final histopathology of the axillary surgical specimens. patients were stratified by postoperative nodal burden into low-load (≤2 metastatic nodes; n=154) and high-load (>2 metastatic nodes; n=146) groups. The cohort was then randomly allocated (7∶3) into a training set (n=210) and a testing set (n=90). The study subject flowchart is displayed in Figure 1.

Figure 1
Flowchart illustrating the selection and division process for a study on breast cancer patients. It starts with 418 eligible patients, followed by inclusion criteria: female patients over 18, confirmed carcinoma. Exclusions include prior treatments and incomplete data. Ultimately, 300 patients were included. These were split into a training set (210) and testing set (90) with further subdivision into low and high load.

Figure 1. Flowchart of participant enrollment based on inclusion and exclusion criteria.

2.2 Clinical data collection

Demographic and clinical variables were retrieved from the institutional pathology database for all 300 patients, with core indicators including age at diagnosis, menopausal status, family history of breast cancer, and other basic parameters.

2.3 Histopathological assessment

Histopathological data were obtained from percutaneous core-needle biopsy reports. It is noteworthy that CNB data, rather than surgical pathology, were intentionally utilized in this study to align with the objective of developing a preoperative predictive model. This approach ensures that all input variables, including histopathological characteristics, are available prior to surgery, thereby maintaining temporal consistency with the preoperative ultrasound imaging and reflecting real-world clinical decision-making scenarios. Variables included histologic grade (I, II, or III), tumor type, human epidermal growth factor receptor-2 (HER-2) status, estrogen receptor (ER) status, progesterone receptor (PR) status, Ki-67 proliferation index, and molecular subtype. Immunohistochemical assays defined ER and PR positivity as ≥1% of tumor-cell nuclei staining, and a Ki-67 cutoff of 14% was applied as per prior literature (13).

2.4 Ultrasound equipment and protocol

All patients underwent ultrasound scanning within two weeks prior to surgery using a Philips EPIQ7 color Doppler system (Philips Healthcare, Andover, MA, USA) equipped with an L12–5 linear-array transducer operating at 8–12MHz. Patients were positioned supine with both arms raised above or alongside the head to fully expose the breasts and axillae. Comprehensive sweeps of both transverse and longitudinal planes were performed. The image demonstrating the maximal lesion long-axis diameter in the standard plane was saved for subsequent radiomic analysis.

2.5 Two-dimensional ultrasound feature analysis

Ultrasound images were independently reviewed by two attending radiologists with 5 and 8 years of experience, respectively, under blinded conditions. Assessed ultrasound features included lesion size, location, and axillary ultrasound status. axillary ultrasound status was considered positive for nodal involvement if any of the following criteria were met: cortical thickness >3mm, long-to-short axis ratio <2, cortical-to-medullary thickness ratio >1, or absence of the fatty hilum (14). Discrepancies between the two readers were resolved by consensus with a third senior radiologist.

2.6 Ultrasound radiomic analysis

2.6.1 ROI delineation and feature extraction

Ultrasound images were exported in NIfTI format (.nii.gz) and imported into ITK-SNAP. Two breast-imaging specialists (each with 5 years of experience), blinded to clinical and histopathological data, manually delineated the intratumoral region of interest (ROI) on the two-dimensional ultrasound slice depicting the lesion’s maximum long-axis dimension. Based on this intratumoral ROI, peritumoral ROIs were generated by isotropically expanding the contour by 2mm, 3mm, and 4mm; the original intratumoral ROI was then subtracted to yield pure peritumoral ROIs at each margin. All ROIs were saved in NIfTI format. A senior radiologist subsequently reviewed and approved all segmentations.

Radiomic features of the intratumoral, peritumoral, and combined intratumoral + peritumoral ROIs were automatically extracted using the open-source PyRadiomics package, including shape features, first-order statistical features, texture features, and wavelet features.

2.6.2 Feature selection, dimensionality reduction, and model development

All radiomic features were standardized via Z-score normalization. Univariate analyses (independent-samples t test or Mann–Whitney U test, as appropriate) and Pearson correlation analysis were employed to preselect features associated with axillary lymph node burden. Subsequently, least absolute shrinkage and selection operator (LASSO) regression with ten-fold cross-validation was applied to identify the optimal regularization parameter (λ) and to select the subset of nonzero-coefficient features most predictive of axillary lymph node status.

Six machine-learning classifiers—Extreme Gradient Boosting (XGBoost), Random Forest, Support Vector Machine (SVM), K-nearest neighbors (KNN), Extra Trees, and Multilayer Perceptron (MLP)—were each trained using the intratumoral radiomic signature to determine the best-performing algorithm. This optimal algorithm was then used to develop seven radiomic signatures: one intratumoral model, three peritumoral models (2mm, 3mm, and 4mm), and three combined intratumoral+peritumoral models (2mm/3mm/4mm).

Clinical and conventional ultrasound features were subjected to univariate and multivariate logistic regression to identify independent predictors. A clinical model was constructed from those variables reaching statistical significance. Finally, the radiomic signature (“radiomics score”) was combined with significant clinical predictors in a multivariable logistic regression framework to construct the integrative (combined) model.

2.6.3 Feature selection, dimensionality reduction, and model development

Model discrimination was assessed by plotting receiver operating characteristic (ROC) curves and calculating the area under the curve (AUC), sensitivity, specificity, and overall accuracy for each model. Pairwise comparisons of AUCs were conducted using DeLong’s test to determine statistically significant differences in performance. Calibration of the combined model was evaluated using calibration curves to verify agreement between predicted probabilities and observed outcomes. Clinical utility was quantified via decision curve analysis (DCA), which estimates the net benefit across a range of threshold probabilities, thus demonstrating the potential of each model to guide clinical decision-making.

2.7 Statistical analysis

Statistical analyses were performed using SPSS version 26.0 and R software. Continuous variables conforming to a normal distribution are presented as mean±standard deviation (x¯±s) and compared between groups using the independent-samples t test; non-normally distributed data are expressed as median (interquartile range, M [Q1, Q3]) and compared using the Mann–Whitney U test. Categorical variables are summarized as counts (n) and compared using the chi-square test (χ²) or Fisher’s exact test, as appropriate. Ordinal variables were analyzed with the Wilcoxon rank-sum test. A two-sided P value <0.05 was considered statistically significant.

2.8 Model visualization

The optimal intratumoral+peritumoral radiomics model was coupled with Shapley Additive Explanations (SHAP) to visualize and interpret the radiomic signature. Global SHAP analyses quantified and ranked the consistent contributions of individual radiomic features to the prediction of axillary lymph node status, thereby elucidating the association patterns between features and the outcome. Additionally, SHAP force-plot visualizations for individual patients illustrated the patient-specific predictive mechanisms underlying nodal metastasis. Independently significant clinical predictors were then combined with the radiomics signature to construct a nomogram, providing an integrated, user-friendly visualization of the overall predictive model. The workflow of radiomics analysis in this study is presented in Figure 2.

Figure 2
Ultrasound images with a highlighted region of interest are shown at the top left. Adjacent are three grayscale images demonstrating ROI segmentation. On the top right, colorful violin plots depict feature extraction. Below, feature extraction graphs illustrate various data patterns and coefficients. At the bottom, model construction is presented through a ROC curve, line graphs, and dot plots showing feature impact and risk assessment.

Figure 2. The workflow of radiomics analysis in this study.

3 Results

3.1 Baseline characteristics

A total of 300 patients were included based on preoperative criteria. After surgical intervention and postoperative pathological assessment, they were stratified into low axillary lymph node burden (n=154) and high axillary lymph node burden (n=146) groups. The cohort was randomly assigned (7∶3) to a training set (n=210) and a testing set (n=90). Comparison of clinicopathological parameters and ultrasound features between the training and testing sets revealed no significant differences (all P>0.05).Refer to Table 1, confirming the comparability of the two cohorts for subsequent modeling.

Table 1
www.frontiersin.org

Table 1. Comparison of clinicopathological and imaging features between the training and testing sets.

3.2 Optimal ultrasound radiomics model development and visualization

3.2.1 Model construction

For each patient, 1–562 radiomic features were extracted from the intratumoral and peritumoral ROIs at 2 mm, 3 mm, and 4 mm. Following univariate filtering (t test and Pearson correlation) and LASSO-based dimensionality reduction, corresponding radiomic signatures were established. Six classifiers—XGBoost, Random Forest (RF), support vector machine (SVM), k-nearest neighbors (KNN), Extra Trees, and multilayer perceptron (MLP)—were trained on the intratumoral signature, yielding AUCs in the training set of 0.853, 0.755, 0.724, 0.719, 0.658, and 0.609, respectively, and validation-set AUCs of 0.570, 0.681, 0.632, 0.629, 0.579, and 0.526. Figures 3A, B shows The Random Forest classifier achieved the highest validation AUC (0.681) and was selected as the optimal algorithm. Its performance metrics were: training AUC 0.755 (95% CI: 0.687 – 0.822) and validation AUC 0.681 (95% CI: 0.570 – 0.792). This Random Forest algorithm was then applied to build all seven radiomic models (intratumoral alone; peritumoral at 2 mm, 3 mm, 4 mm; and combined intratumoral + peritumoral at 2 mm, 3 mm, 4 mm).

Figure 3
Receiver Operating Characteristic (ROC) curves comparing model performance. Panel A shows higher AUC values with XGBoost achieving 0.858. Panel B shows lower AUC values with XGBoost at 0.570. Models include RandomForest, SVM, KNN, ExtraTrees, and MLP. Sensitivity versus 1-specificity is plotted, with a diagonal reference line for random chance.

Figure 3. (A, B) Comparison of diagnostic performance of intratumoral models using different machine learning ROC curves in the training set (A) and the testing set (B).

ROC analysis demonstrated that the intratumoral + 3 mm peritumoral model achieved the highest AUC and was therefore designated the optimal radiomic model. After feature selection, 18 features remained (12 intratumoral, 6 peritumoral)(see Figures 4A-D), which were used to calculate the radiomics signature. A detailed description of these features, including their intuitive interpretation, is provided in “Supplementary Table S1”.In the training and validation cohorts, this signature yielded AUCs of 0.825 (95% CI: 0.765 – 0.885) and 0.746 (95% CI: 0.639 – 0.854)(see Figures 5A, B), sensitivities of 0.937 and 0.840, and specificities of 0.711 and 0.650, respectively. Refer to Table 2.

Figure 4
A four-panel image displays various graphs:  A: A coefficient path plot with curves showing coefficients versus log(lambda).   B: A mean square error plot with error bars indicating variability over log(lambda) values.   C: A bar chart of coefficients, displaying feature names on the y-axis and coefficient values on the x-axis.   D: A bar chart showing Random Forest prediction scores, differentiated by two labels in blue and orange.

Figure 4. (A–C) Parameter selection via LASSO regression, showing optimal regularization parameter λ (A). Coefficient profiles of radiomic features versus log(λ),highlighting nonzero coefficients (B). Feature importance weights for the radiomic signature (C). Histogram of sample prediction probabilities generated by the Random Forest model (D).

Figure 5
ROC curve charts labeled A and B show model performance with different AUC scores. Chart A has higher AUC values, indicating better performance than chart B. Each curve represents different model configurations, with sensitivity on the Y-axis and 1-specificity on the X-axis. The legend provides specific AUC values and confidence intervals for various models.

Figure 5. (A, B) Comparison of ROC Curves for Intratumoral, Peritumoral, and Combined Models in the training set (A) and the testing set (B).

Table 2
www.frontiersin.org

Table 2. Comparison of performance of intratumoral, peritumoral, and combined models in the training and testing sets.

Delong’s test indicated that, in the training set, the AUC of the intratumoral + 3 mm peritumoral model was significantly higher than that of the 4 mm peritumoral model (P = 0.006), and the intratumoral + 4 mm peritumoral model also outperformed the 4 mm peritumoral model (P = 0.029). In the testing set, no pairwise differences reached statistical significance (P > 0.05), possibly due to the smaller sample size.

3.2.2 Model visualization

The optimal radiomics model developed in this study was visualized using SHAP values to display the contribution of each feature (see Figure 6A).

Figure 6
Chart A shows a SHAP summary plot with features ranked by their impact on model output, using color to denote high (pink) and low (blue) feature values. Chart B is a SHAP decision plot indicating how features influence a specific output, transitioning from left (low) to right (high), with contributions visualized in pink. Chart C is another SHAP decision plot showing feature impact on a different output, moving from left (low) to right (high) with contributions in blue.

Figure 6. (A–C) Single-Sample SHAP Analysis (A). SHAP force-plot explaining how the radiomics model differentiates ALN status (final predicted value=0.687, above the baseline 0.447, predicting low burden) (B). SHAP force-plot explaining how the model differentiates ALN status for another patient (final predicted value=0.317, below the baseline 0.447, predicting high burden) (C).

In the swarm plot, dense scatter points map the strength of each variable’s impact on the prediction; each point represents an individual sample, and the color gradient (blue→red) reflects the continuous distribution of feature values from low to high. As shown, Intra_wavelet_LHL_glcm_JointAverage is the feature with the largest contribution to predicting axillary lymph node burden. As its value decreases, the SHAP value increases, indicating a higher predicted probability of nodal burden.

The waterfall plot decomposes the SHAP values for a single sample(see Figures 6B, C), using the model’s expected value (E[f(X)]) as the baseline and visualizing the incremental contribution of each feature in sequence. Red arrows indicate an increased risk of low axillary lymph node burden, while blue arrows indicate a decreased risk.

3.3 Nomogram construction and performance evaluation

On univariate and multivariate analyses of conventional ultrasound features and clinicopathological variables, axillary ultrasound status and lesion location emerged as independent risk factors for axillary lymph node burden (P<0.05).Refer to Table 3. These two predictors were used to develop the clinical model. ROC analysis yielded AUCs of 0.816 (95% CI: 0.752–0.880) in the training set and 0.734 (95% CI: 0.627–0.841) in the testing set(see Figures 7A, B).

Table 3
www.frontiersin.org

Table 3. Univariate and multivariate logistic regression analysis of ultrasound features.

Figure 7
Two ROC curve graphs labeled A and B show the model performance. Graph A displays Clinic Signature (AUC: 0.816), Rad Signature (AUC: 0.825), and Nomogram (AUC: 0.908). Graph B shows Clinic Signature (AUC: 0.734), Rad Signature (AUC: 0.746), and Nomogram (AUC: 0.818). Sensitivity is plotted against 1-Specificity for each model.

Figure 7. (A, B) ROC Curves Comparing the Radiomics Model, Clinical Model, and Combined Model in the training set (A) and the testing set (B).

A combined model was then constructed by integrating the independent clinical risk factors with the radiomics signature. In the training set, the combined model achieved an AUC of 0.908, accuracy of 0.852, and specificity of 0.860; in the testing set, it yielded an AUC of 0.818, accuracy of 0.789, and specificity of 0.525 (see Table 4). A nomogram was generated to visualize the combined model, and its interactive online implementation (available at: https://my-nomogram-app.shinyapps.io/my-nomogram-app/) is presented in Figure 8, which includes the original nomogram in the lower-right corner for verification and reference. Delong’s test demonstrated that, in the training set, the combined model significantly outperformed both the intratumoral+3mm peritumoral radiomics model (P=0.009) and the clinical model alone (P<0.001). In the testing set, the combined model also showed a statistically significant improvement over the clinical model (P=0.006) (see Figures 9A, B).

Table 4
www.frontiersin.org

Table 4. Comparison of performance of the radiomics model, clinical model, and combined model.

Figure 8
Breast cancer high axillary lymph node burden risk prediction tool interface showing patient input fields for lesion position, ultrasound axillary lymph node status, and radiomic signature score with a slider. Prediction results indicate a total score of fifty and a predicted risk of thirty-three percent. A histogram displays the uncertainty of predicted risk with a confidence interval. A nomogram reference is provided below, indicating scales for points, lesion position, ultrasound score, radiomic signature, total points, and associated risk on different axes.

Figure 8. An online nomogram for predicting high ALND has been established, which includes the following three parameters: axillary lymph node status, lesion position, and radiomic signature score. By accessing the link [https://my-nomogram-app.shinyapps.io/my-nomogram-app/], the demonstration shows that when the lesion is located in the outer upper quadrant, the axillary lymph node status is negative, and the radiomic signature score is 0.5, the nomogram predicts a 33.3% probability of high ALND.

Figure 9
Two heatmaps labeled A and B compare Rad_Signature and Clinic_Signature within “Cohort train Delong” and “Cohort test Delong.” Each heatmap has values: A shows 0.852 and 0.009 for Rad_Signature, 0 and 0.221 for Clinic_Signature; B shows 0.876 and 0.221 for Rad_Signature, 0.006 and 0 for Clinic_Signature. Color bars range from -0.4 to 0.8.

Figure 9. (A, B) Comparison of model performance between the training (A) and testing sets (B).

Calibration curves indicated excellent agreement between predicted probabilities and observed outcomes for axillary lymph node burden in both the training and validation cohorts(see Figures 10A, B). Decision‐curve analysis revealed that, across a wide range of threshold probabilities, the combined model provided greater net clinical benefit than the clinical or radiomics models alone. Specifically, in the training set, the combined model was superior when the threshold probability ranged from 0.07–0.25 and 0.67–0.87; in the testing set, its advantage was most pronounced at thresholds of 0.45–0.58 and 0.63–0.80 (see Figures 11A, B).

Figure 10
Two calibration plots labeled A and B. Both display “Fraction of positives” on the y-axis versus “Mean predicted probability” on the x-axis. A dotted line represents perfect calibration. Graph A shows three lines: blue for Rad_Signature, orange for Clinic_Signature, and green for Nomo, each with varying calibration performance. Graph B shows similar lines with different trends.

Figure 10. (A, B) Calibration curves for the training (A) and testing sets (B).

Figure 11
Graphs labeled A and B show models for Decision Curve Analysis (DCA) with net benefit on the y-axis and threshold probability on the x-axis. Both graphs compare Rad_Signature, Clinic_Signature, Nomo, Treat all, and Treat none models. Rad_Signature is blue, Clinic_Signature is orange, Nomo is green, Treat all is black, and Treat none is dotted black. Graph A shows varying net benefits across models with decreasing curves, while Graph B shows similar trends with slightly different net benefit levels.

Figure 11. (A, B) Decision curve analysis for the training (A) and testing sets (B).

4 Discussion

Accurate assessment of axillary lymph node status is pivotal in clinical practice for disease staging, individualized surgical decision-making, and prognostic evaluation. Patients with high nodal burden typically require axillary lymph node dissection and systemic therapy, whereas those with low burden may avoid axillary lymph node dissection and thereby reduce the risk of procedure-related complications (15). Notably, the biological underpinnings of axillary lymph node burden are closely linked to the tumor microenvironment. Breast tumors consist of malignant epithelial cells and stromal components, the latter mediating peritumoral extracellular matrix remodeling. Given these clinical imperatives and pathophysiological mechanisms, the development of a noninvasive, preoperative method to evaluate axillary lymph node burden holds substantial value for precise triage and optimized therapeutic selection. In this study, we employed six machine-learning algorithms to construct intratumoral radiomics models, identified the optimal classifier, and extended it to peritumoral (2mm, 3mm, and 4mm) and combined intratumoral+peritumoral models. We then integrated independent clinical predictors with the superior radiomics signature to build a combined model, aiming to enable noninvasive prediction of breast cancer axillary lymph node burden and provide more precise decision-support for clinicians. Furthermore, our model is built exclusively on preoperatively available data, including CNB-based pathology, which reinforces its practical utility for decision-making prior to any surgical intervention.

Previous investigations have demonstrated that tumor diameter, histologic grade, molecular subtype, and pathologic type variably influence breast cancer axillary lymph node burden (1618). In our cohort, these factors did not differ significantly between high- and low-burden groups, which may reflect the cohort’s broad clinical diversity, limitations in sample size, or population characteristics. Earlier studies have explored the predictive value of ultrasound-detected axillary lymph node positivity for nodal burden in breast cancer (19). Our current work corroborates that sonographic axillary lymph node status is an independent predictor: metastatic tumor cells infiltrate lymphatic sinuses, inducing diffuse cortical thickening, medullary invasion, and displacement or obliteration of the fatty hilum (20), with aberrant cell proliferation leading to nodal enlargement and morphological distortion (19). Consequently, suspicious ultrasound features correlate positively with the extent of tumor invasion and axillary lymph node burden.

Hwang et al. (21) reported superior survival in patients with tumors located in the upper-outer quadrant, attributing this to earlier detection and clearance of axillary metastases, whereas lesions in the inner-lower quadrant more frequently metastasize to the less accessible internal mammary nodes. Consistent with these findings, our study identified primary tumor location as an independent determinant of axillary lymph node burden. The clinical model constructed from these factors achieved an AUC of 0.816 in the training set and 0.734 in the testing set, reflecting moderate predictive performance. Importantly, compared to prior clinicopathologic models, our approach enables noninvasive preoperative evaluation of axillary lymph node status.

Ultrasound radiomics has demonstrated significant value in differentiating benign from malignant breast lesions, identifying molecular subtypes, and predicting response to neoadjuvant therapy (22).

In the field of preoperative axillary lymph node status prediction, Gao etal. developed a two-dimensional ultrasound-based radiomics model for predicting axillary lymph node metastasis, achieving an AUC of 0.723 in the validation cohort (23). Although their methodology parallels ours, their study was limited to invasive carcinoma, whereas our research encompassed a broader spectrum of pathologies—including invasive lobular carcinoma, invasive carcinoma of no special type, and ductal carcinoma in situ—thereby enhancing the model’s generalizability. Given that the progression of aggressive malignancies is often accompanied by stromal proliferation or interstitial infiltration, the imaging characteristics of the peritumoral tissue harbor critical biological information (24). Samantha etal. combined peritumoral features to construct a composite model for predicting axillary lymph node status in clinically node-negative breast cancer patients, yielding an AUC of 0.86 (25). Dong etal. found that diagnosis based solely on peritumoral tissue was insufficient (26), a conclusion our study corroborates: the combined model’s AUC consistently exceeded that of models containing only intratumoral or only peritumoral features. As the spatial pattern extends from intratumoral to peritumoral regions, these models more comprehensively capture tumor heterogeneity and invasiveness (27). Zhang etal. reported that a diagnostic model integrating intratumoral and peritumoral radiomic features with clinical variables accurately predicted axillary lymph node status, with the tumor+3mm margin model performing best (28).

Consistent with our findings, the intratumoral+3mm peritumoral model maximally assessed axillary lymph node status, whereas prior studies have noted practical limitations in defining a 5mm peritumoral margin (29). For superficially or deeply located lesions, a 5mm expansion often extends beyond the breast parenchyma boundary, introducing non-target tissue and compromising accuracy. Distinctively, our study evaluated six different machine-learning algorithms for intratumoral modeling and ultimately selected Random Forest—an algorithm known for robustness and resistance to overfitting, with documented high diagnostic performance (30)—further enhancing predictive efficacy.

To enhance clinical applicability, we integrated the radiomics signature with independent clinical predictors to develop a combined model, achieving AUCs of 0.908 and 0.818 in the training and validation cohorts, respectively, alongside high clinical utility. We employed both SHAP and a nomogram to visualize the model: SHAP provides an intuitive depiction of the Random Forest decision process by computing Shapley values from cooperative game theory, thereby quantifying each feature’s marginal contribution to the model’s output. This elucidates feature importance at both cohort and individual levels, bolstering interpretability and clinician trust (31). Prior studies have validated SHAP’s value in explicating machine-learning models (32). In our results, the SHAP-identified top contributing feature “Intra_wavelet_LHL_glcm_JointAverage”, a wavelet-based co-occurrence matrix feature. It captures subtle textural patterns on ultrasound, imperceptible in routine assessment, by measuring the average gray-level intensity of pixel pairs. Biologically, lower values of this metric indicate a more disrupted tumor microarchitecture, which is associated with a higher ALN burden. The nomogram offers a straightforward visualization of the combined model’s prediction workflow, incorporating easily obtainable preoperative variables (tumor location and sonographic axillary lymph node status), thus enabling noninvasive, individualized prediction of axillary lymph node burden. The integrated application of SHAP and the nomogram establishes complementary interpretative frameworks for both imaging and clinical features. This synergy enhances model transparency, credibility, and clinical utility while providing novel methodological insights for future research.

This study has several limitations. First, its retrospective, single-center design may introduce selection bias. Second, the sample size constitutes the primary constraint; an ‘Events Per Variable’ (EPV) analysis confirmed a suboptimal ratio, which inherently elevates overfitting risk as reflected in the performance gap between training and test sets. Although mitigation strategies were employed, this fundamental limitation persists. Although we employed mitigation strategies—such as dataset partitioning, rigorous feature selection, and the use of algorithms with anti-overfitting properties—to reduce this risk, nevertheless, this fundamental limitation remains an important consideration for the interpretation of our results. Furthermore, only two-dimensional ultrasound images were analyzed; three-dimensional, elastography, and contrast-enhanced ultrasound modalities were not included. Finally, we implemented traditional machine-learning methods without exploring the recently advanced deep-learning techniques that offer improved data processing and generalization capabilities.

It is important to note that the broad clinical diversity of our cohort stemmed from deliberate design choices to enhance generalizability. This included a pragmatic application of inclusion criteria, such as accepting borderline-quality images assessed by consensus and defining “solitary lesion” in a clinically relevant manner. Furthermore, our inclusive approach resulted in a spectrum of pathological types and molecular subtypes. To verify that this diversity did not unduly influence our core findings, a sensitivity analysis was performed wherein the optimal radiomics model was retrained on a homogeneous invasive ductal carcinoma subgroup (n=152). The model’s performance remained robust (AUC = 0.736) and was not significantly different from its performance on the full cohort (AUC = 0.746; DeLong’s p = 0.818), confirming the stability of our radiomic signature across the cohort’s clinical and pathological spectrum. (See Supplementary Figures 1A, B).

In summary, future studies using larger, prospective, multi-center datasets are essential to further validate and refine our model and to robustly assess its clinical generalizability, and evaluate deep-learning approaches for axillary lymph node burden prediction.

5 Towards clinical application

To bridge the gap between our findings and clinical application, a clear translation pathway has been initiated. We have developed an online nomogram-based calculator to facilitate initial use. Our immediate focus is to engineer a semi-automated image analysis module, eliminating the dependency on manual feature extraction. Subsequent prospective validation will then be conducted to rigorously evaluate the tool’s performance and workflow efficiency within real clinical settings, paving the way for its practical adoption.

6 Conclusion

By applying the optimal radiomics algorithm to combined intratumoral and peritumoral ultrasound radiomic signatures alongside clinical predictors, we established a noninvasive, individualized model with robust diagnostic performance for preoperative prediction of axillary lymph node status in breast cancer patients. The incorporation of SHAP and a nomogram not only offers clear, graphical model interpretation but also enables an individualized preoperative prediction framework, thereby providing more precise guidance for clinical decision making.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by Medical Ethics Committee of Baoding First Central Hospital(reference number: Fast [2024]206).Medical Ethics Committee of Baoding First Central Hospital, Baoding, Hebei Province, China. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin. Retrospective use of patient medical images did not require informed consent. Written informed consent was not obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article because retrospective use of patient medical images did not require informed consent.

Author contributions

M-HH: Conceptualization, Data curation, Methodology, Software, Writing – original draft, Writing – review & editing. FZ: Data curation, Investigation, Writing – review & editing. CZ: Data curation, Investigation, Writing – review & editing. NS: Methodology, Writing – review & editing. WM: Writing – review & editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This work was supported by the “2024 Medical Science Research Project of Hebei Province” [Grant Number: 20240865]. This work also received financial support from the “Hebei Key Laboratory of Multi-Omics Precision Diagnosis and Treatment for Breast Cancer” [Grant Number:SZX202526].

Acknowledgments

The authors sincerely appreciate the clinical experts and technical teams from the Department of Ultrasound, Radiology, Urology, Breast Surgery, and Pathology at our institution for their professional support and valuable insights. Their expertise in data acquisition, diagnostic consultation, and technical optimization was instrumental to this study. We are particularly grateful for their interdisciplinary collaboration, which ensured the methodological rigor and clinical relevance of this work. We also thank the Hebei Key Laboratory of Multi-Omics Precision Diagnosis and Treatment for Breast Cancer for their support.

Conflict of interest

The authors declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2025.1674922/full#supplementary-material

References

1. Kim J, Harper A, McCormack V, Sung H, Houssami N, Morgan E, et al. Global patterns and trends in breast cancer incidence and mortality across 185 countries. Nat Med. (2025) 31:1154–62. doi: 10.1038/s41591-025-03502-3

PubMed Abstract | Crossref Full Text | Google Scholar

2. Bray F, Laversanne M, Sung H, Ferlay J, Siegel RL, Soerjomataram I, et al. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: Cancer J Clin. (2024) 74:229–63. doi: 10.1016/j.jacasi.2022.12.005

PubMed Abstract | Crossref Full Text | Google Scholar

3. Park KU and Caudle A. Management of the axilla in the patient with breast cancer. Surg Clin North Am. (2018) 98:747–60. doi: 10.1016/j.suc.2018.03.009

PubMed Abstract | Crossref Full Text | Google Scholar

4. Duan Y, Zhu Y, Nie F, Guan L, Jia Y, Chen K, et al. Predictive value of combining clinicopathological, multimodal ultrasonic characteristics in axillary lymph nodal metastasis burden. Clin Hemorheology Microcirculation. (2022) 81:255–69. doi: 10.3233/CH-221398

PubMed Abstract | Crossref Full Text | Google Scholar

5. Bever ST, Niel BL, Baker JL, Bennett DL, Bonaccio E, Camp MS, et al. NCCN Guidelines® insights: breast cancer screening and diagnosis, version 1.2023. J Natl Compr Canc Netw. (2023) 21:900–9. doi: 10.6004/jnccn.2023.0043

PubMed Abstract | Crossref Full Text | Google Scholar

6. Li JW, Tong YY, Jiang YZ, Shui XJ, Shi ZT, and Chang C. Clinicopathologic and ultrasound variables associated with a heavy axillary nodal tumor burden in invasive breast carcinoma. J Ultrasound Med. (2019) 38:1747–55. doi: 10.1002/jum.14871

PubMed Abstract | Crossref Full Text | Google Scholar

7. Zhao HF, Yao JC, Wang LP, Fan LY, and Xu D. Development of a nomogram to predict axillary lymph node metastatic burden in T1–2 breast cancer. Chin J Ultrason Med. (2023) 39:1224–6. doi: 10.3877/cma.j.issn.1004-4477.2023.11.002

Crossref Full Text | Google Scholar

8. Wei W, Ma Q, Feng H, Wei T, Jiang F, Fan L, et al. Deep learning radiomics for prediction of axillary lymph node metastasis in patients with clinical stage T1–2 breast cancer. Quant Imaging Med Surg. (2023) 13:4995–5011. doi: 10.21037/qims-22-1363

PubMed Abstract | Crossref Full Text | Google Scholar

9. Wei W, Feng HJ, Wang Y, Wei TJ, He LY, Zhang X, et al. Nomogram based on ultrasound radiomics for predicting ipsilateral axillary lymph node metastasis in T1 breast cancer. Chin J Med Imaging. (2024) 32:796–808. doi: 10.3760/cma.j.cn131148-20230815-00042

Crossref Full Text | Google Scholar

10. Lambin P, Rios-Velázquez E, Leijenaar R, Carvalho S, van Stiphout RG, Granton P, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer. (2012) 48:441–6. doi: 10.1016/j.ejca.2011.11.036

PubMed Abstract | Crossref Full Text | Google Scholar

11. Mittal S, Brown NJ, and Holen I. The breast tumor microenvironment: role in cancer development, progression and response to therapy. Expert Rev Mol Diagn. (2018) 18:227–43. doi: 10.1080/14737159.2018.1453358

PubMed Abstract | Crossref Full Text | Google Scholar

12. Zhang CM, Ding Z, Chen P, and Liu QF. Value of machine-learning models based on intratumoral and peritumoral radiomic features from DCE-MRI for predicting axillary lymph node metastasis in breast cancer. J Thorac Imaging. (2023) 29:618–24. doi: 10.3760/cma.j.cn112217-20230210-00015

Crossref Full Text | Google Scholar

13. Han P, Yang H, Liu M, Cheng L, Wang S, Tong F, et al. Lymph node predictive model with in vitro ultrasound features for breast cancer lymph node metastasis. Ultrasound Med Biol. (2020) 46:1395–402. doi: 10.1016/j.ultrasmedbio.2020.01.020

PubMed Abstract | Crossref Full Text | Google Scholar

14. Goldhirsch A, Wood WC, Coates AS, Gelber RD, Thürlimann B, Senn HJ, et al. Strategies for subtypes–dealing with the diversity of breast cancer: highlights of the St Gallen international expert consensus on the primary therapy of early breast cancer 2011. Ann Oncol. (2011) 22:1736–47. doi: 10.1093/annonc/mdr304

PubMed Abstract | Crossref Full Text | Google Scholar

15. Wang L, Li J, Qiao J, Guo X, Bian X, Guo L, et al. Establishment of a model for predicting sentinel lymph node metastasis in early breast cancer based on contrast-enhanced ultrasound and clinicopathological features. Gland Surg. (2021) 10:1701–12. doi: 10.21037/gs-21-339

PubMed Abstract | Crossref Full Text | Google Scholar

16. Li B, Zhao X, Wang Q, Jing H, Shao H, Zhang L, et al. Prediction of high nodal burden in invasive breast cancer by quantitative shear wave elastography. Quant Imaging Med Surg. (2022) 12:1336–47. doi: 10.21037/qims-21-1061

PubMed Abstract | Crossref Full Text | Google Scholar

17. Setyawati Y, Rahmawati Y, Widodo I, Ghozali A, and Purnomosari D. The association between molecular subtypes of breast cancer with histological grade and lymph node metastases in Indonesian women. Asian Pac J Cancer Prev. (2018) 19:1263–8. doi: 10.22034/APJCP.2018.19.5.1263

PubMed Abstract | Crossref Full Text | Google Scholar

18. Shi LN, Cao CL, Sang T, Li WX, Cao YW, and Li J. Nomogram incorporating ultrasound features and clinicopathologic indicators to predict axillary lymph node metastasis in breast cancer. Chin J Med Imaging. (2024) 32:332–8. doi: 10.3760/cma.j.cn131148-20231215-00108

Crossref Full Text | Google Scholar

19. Xue JX, Yang Q, and Hou DL. Predictive value of combined breast ultrasound and peripheral blood immuno-inflammatory markers for axillary lymph node tumor burden in breast cancer. J Med Imaging. (2025) 35:60–3. doi: 10.3760/cma.j.cn131148-20250115-00003

Crossref Full Text | Google Scholar

20. Gillot L, Baudin L, Rouaud L, Kridelka F, and Noël A. The pre-metastatic niche in lymph nodes: formation and characteristics. Cell Mol Life Sci. (2021) 78:5987–6002. doi: 10.1007/s00018-021-03873-z

PubMed Abstract | Crossref Full Text | Google Scholar

21. Hwang KT, Kim J, Kim EK, Jung SH, Sohn G, Kim SI, et al. Poor prognosis of lower inner quadrant in lymph node-negative breast cancer patients who received no chemotherapy: a study based on nationwide Korean breast cancer registry database. Clin Breast Cancer. (2017) 17:169–84. doi: 10.1016/j.clbc.2017.01.005

PubMed Abstract | Crossref Full Text | Google Scholar

22. Li CZ and Chen CC. Advances in the application of ultrasound radiomics in breast tumors. Imaging Res Med Appl. (2023) 7:5–7. doi: 10.19416/j.cnki.2096-3181.2023.05.002

Crossref Full Text | Google Scholar

23. Gao Y, Luo Y, Zhao C, Xiao M, Ma L, Li W, et al. Nomogram based on radiomics analysis of primary breast cancer ultrasound images: prediction of axillary lymph node tumor burden in patients. Eur Radiol. (2021) 31:928–37. doi: 10.1007/s00330-020-07181-1

PubMed Abstract | Crossref Full Text | Google Scholar

24. Zhou J, Zhan W, Chang C, Zhang X, Jia Y, Dong Y, et al. Breast lesions: evaluation with shear wave elastography, with special emphasis on the “stiff rim” sign. Radiology. (2014) 272:63–72. doi: 10.1148/radiol.14130818

PubMed Abstract | Crossref Full Text | Google Scholar

25. Bove S, Comes MC, Lorusso V, Cristofaro C, Didonna V, Gatta G, et al. A ultrasound-based radiomic approach to predict the nodal status in clinically negative breast cancer patients. Sci Rep. (2022) 12:7914. doi: 10.1038/s41598-022-11845-x

PubMed Abstract | Crossref Full Text | Google Scholar

26. Dong F, She R, Cui C, Shi S, Hu X, Zeng J, et al. One step further into the blackbox: a pilot study of how to build more confidence around an AI-based decision system of breast nodule assessment in 2D ultrasound. Eur Radiol. (2021) 31:4991–5000. doi: 10.1007/s00330-020-07418-z

PubMed Abstract | Crossref Full Text | Google Scholar

27. Gursoy M, Sezgin G, Horoz EM, Dirim Mete B, and Erdogan N. Histopathological and tumor characteristics associated with false negative axillary ultrasonography results in breast cancer. Med Ultrason. (2019) 21:232–8. doi: 10.11152/mu-1955

PubMed Abstract | Crossref Full Text | Google Scholar

28. Zhang W, Wang S, Wang Y, Sun J, Wei H, Xue W, et al. Ultrasound-based radiomics nomogram for predicting axillary lymph node metastasis in early-stage breast cancer. La Radiologia Med. (2024) 129:211–21. doi: 10.1007/s11547-023-01699-2

PubMed Abstract | Crossref Full Text | Google Scholar

29. Zhan C, Hu Y, Wang X, Liu H, Xia L, and Ai T. Prediction of axillary lymph node metastasis in breast cancer using intra peritumoral textural transition analysis based on dynamic contrast enhanced magnetic resonance imaging. Acad Radiol. (2022) 29:S107–15. doi: 10.1016/j.acra.2021.12.021

PubMed Abstract | Crossref Full Text | Google Scholar

30. Yang J, Wu M, Feng XD, Du Yao, Cui GH, and Liu FF. Stage-wise prediction of axillary lymph node metastatic burden in breast cancer using ultrasound radiomics. Chin J Ultrason Med. (2024) 40:1217–20. doi: 10.3877/cma.j.issn.1004-4477.2024.11.001

Crossref Full Text | Google Scholar

31. Wang Y, Lang J, Zuo JZ, Dong Y, Hu Z, Xu X, et al. The radiomic-clinical model using the SHAP method for assessing the treatment response of whole-brain radiotherapy: a multicentric study. Eur Radiol. (2022) 32:8737–47. doi: 10.1007/s00330-022-08904-2

PubMed Abstract | Crossref Full Text | Google Scholar

32. Wang SR, Cao CL, Du TT, Wang JL, Li J, Li WX, et al. Machine learning model for predicting axillary lymph node metastasis in clinically node positive breast cancer based on peritumoral ultrasound radiomics and SHAP feature analysis. J Ultrasound Med. (2024) 43:1611–25. doi: 10.1002/jum.16414

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: breast cancer, axillary lymph node burden, radiomics, ultrasound, peritumoral region

Citation: Hao M-H, Zhang F, Zhang C, Shi N and Mu W (2026) Value of intra- and peritumoral ultrasound radiomics for predicting axillary lymph node burden in breast cancer. Front. Oncol. 15:1674922. doi: 10.3389/fonc.2025.1674922

Received: 28 July 2025; Accepted: 23 December 2025; Revised: 22 November 2025;
Published: 14 January 2026.

Edited by:

Sharon R Pine, University of Colorado Anschutz Medical Campus, United States

Reviewed by:

Guy Clifton, Brooke Army Medical Center, United States
Cherry Bansal, Tantia University, India

Copyright © 2026 Hao, Zhang, Zhang, Shi and Mu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Weina Mu, bXZuMjAxNkAxMjYuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.