- 1Clinical Medicine College, Graduate School of Qinghai University, Xining, Qinghai, China
- 2Department of Oncology Surgery, Qinghai University Affiliated Hospital, Xining, Qinghai, China
Background: Accurately predicting axillary lymph node metastasis (ALNM) preoperatively is crucial for optimizing management in patients with clinically node-negative (cN0) hormone receptor-positive (HR+) breast cancer (BC).
Methods: We retrospectively analyzed 816 cN0 HR+ BC patients (2016-2024). Data from 2016-2023 (n=726) were randomly assigned to a training set (n=503) or an internal test set (n=223) in a 7:3 ratio. Patients treated in the most recent year, 2024 (n=90), were reserved as a held-out temporal validation set. Following feature selection via Recursive Feature Elimination (RFE), five machine learning models—XGBoost, Random Forest, Logistic Regression, Support Vector Machine, and K-Nearest Neighbors (KNN)—were developed. Performance was assessed by the area under the receiver operating characteristic curve (AUC) and decision curve analysis (DCA). The optimal model was interpreted using SHapley Additive exPlanations (SHAP).
Results: The incidence of ALNM was 30.9%. The KNN model demonstrated optimal performance, achieving an AUC of 0.898 (95% CI: 0.857–0.939) in the test set and 0.774 (95% CI: 0.655–0.892) in the external validation set. DCA indicated that the KNN model provided the highest net clinical benefit within the 30%–65% threshold probability range. SHAP analysis identified parity as the most critical predictor, followed by age, tumor location, menopausal status, tumor diameter, lymphocyte count, platelet count, alpha-fetoprotein (AFP), neutrophil count, and carcinoembryonic antigen (CEA).
Conclusion: The KNN model, integrated with the SHAP interpretability framework, shows favorable performance, interpretability, and clinical applicability for predicting ALNM in cN0 HR+ BC, offering a valuable tool for preoperative risk assessment and individualized decision-making.
1 Introduction
Breast cancer (BC) is the most commonly diagnosed malignancy and a leading cause of cancer-related mortality among women worldwide (1). Clinical management is guided by molecular subtypes, with hormone receptor-positive (HR+) BC (encompassing Luminal A and B) being the most prevalent, accounting for approximately 70% of cases (2). Luminal A tumors tend to follow a more indolent course, whereas Luminal B tumors are associated with a higher risk of recurrence (3). This intrinsic diversity results in a broad spectrum of metastatic potential, rendering the preoperative distinction between patients at genuine low risk (who may be candidates for de-escalated axillary surgery) and those with occult high-risk disease particularly challenging yet critical within the HR+ population. Consequently, management strategies for HR+ patients require greater refinement, with accurate assessment of axillary lymph node status being paramount for prognosis and therapeutic decision-making.
Axillary lymph node metastasis is one of the most important independent prognostic factors in breast cancer, directly influencing disease staging, treatment selection, and subsequent adjuvant therapy decisions (4). For patients with negative clinical lymph nodes (cN0), that is, those whose regional lymph node metastasis is not found on preoperative palpation and imaging (5), sentinel lymph node biopsy (SLNB) is a conventional pathological staging method. However, a significant proportion (approximately 20–40%) of cN0 patients are found to have nodal metastases on SLNB, many of whom may not benefit additionally from completion axillary lymph node dissection (ALND), potentially facing overtreatment (6, 7). The development of reliable tools to preoperatively stratify ALNM risk in cN0 patients, particularly within the large HR+ subgroup, is therefore clinically relevant.
Currently, preoperative prediction of ALNM primarily relies on imaging evaluations (such as ultrasound and MRI) and prediction nomograms based on clinical and pathological characteristics (8, 9). These methods have limitations, including variable accuracy and a lack of specificity for the HR+ subtype. Machine learning (ML) offers a potential avenue for improving prediction by modeling complex relationships in multi-dimensional data (10). While several studies have applied ML to ALNM prediction in BC (11, 12), many include all subtypes or lack validation in separate cohorts. Furthermore, the interpretability of ML models remains a barrier to clinical translation (13).
To address these considerations, this study aimed to develop a HR+ cN0-stage specific prediction model for ALNM using interpretable ML methods. Our objectives were: (1) to construct and compare multiple ML algorithms based on preoperative clinicopathological data; (2) to evaluate the temporal generalizability of the optimal model using a cohort from a subsequent time period; and (3) to employ the SHAP (SHapley Additive exPlanations) framework to interpret the model’s predictions, thereby providing actionable insights for clinicians. This approach may significantly enhance preoperative risk assessment and individualized treatment planning for this patient population.
2 Materials and methods
2.1 Data split and validation strategy
Patients diagnosed between January 2016 and December 2023 were included in the model development cohort. This cohort was randomly divided into a training set (70%, n=503) and a test set (30%, n=223) for model development, hyperparameter tuning, and initial evaluation.
To assess the model’s performance over time and reduce overfitting, an independent temporal validation cohort was established. This cohort comprised all consecutive patients meeting the same inclusion criteria from the subsequent and non-overlapping period of January 2024 to December 2024 (n=90). Among these 90 patients, 28 cases (31.1%) had ALNM, a finding consistent with the ALNM incidence rate (30.9%) observed in the development cohort. Crucially, this temporal validation set was locked before model development and was used strictly for the final, one-time evaluation of the selected model. No data from this 2024 cohort were used in any phase of model training or parameter tuning.
This study complied with the ethical principles of the Declaration of Helsinki for medical research involving human subjects and was approved by the Ethics Committee of Qinghai University Affiliated Hospital (approval No. P-SL-2024-406). Due to the retrospective design of this study, the Ethics Committee waived the requirement for informed consent. This study adhered to the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) guidelines to ensure transparency and scientific rigor in study design, implementation, and reporting.
Inclusion Criteria: (1) Female patients aged ≥18 years. (2) Preoperative core needle biopsy pathology confirming: (a) invasive breast carcinoma, and (b) HR+ status (defined as estrogen receptor [ER]-positive and/or progesterone receptor [PR]-positive). (3) Preoperatively assessed as clinically node-negative (cN0 stage) by physical examination and imaging. (4) Underwent primary surgical treatment (mastectomy or breast-conserving surgery) with pathological evaluation of axillary nodal status via SLNB and/or ALND. (5) Availability of complete preoperative data required for modeling.
Exclusion Criteria: (1) Male breast cancer. (2) Inflammatory breast cancer, clinical T4-stage tumors (with chest wall/skin invasion), or evidence of distant metastasis (stage IV disease) at initial diagnosis. (3) Bilateral synchronous or multicentric BC. (4) History of any other invasive malignancy within the past 5 years prior to the current BC diagnosis. (5) Receipt of any form of neoadjuvant therapy (chemotherapy, endocrine therapy, targeted therapy, or radiotherapy) prior to surgery. (6) Preoperative biopsy pathology suggesting non-invasive carcinoma (e.g., ductal carcinoma in situ) or microinvasive carcinoma only. (7) Missing or incomplete key preoperative data necessary for model construction.
2.2 Predictors and outcome variable
1. Clinical Characteristics: Age at diagnosis, BMI, menopausal status, marital status, parity.
2. Preoperative Core Needle Biopsy Pathology: Estrogen Receptor (ER) Status, Progesterone Receptor (PR) Status, Human Epidermal Growth Factor Receptor 2 (HER2) Status, Ki-67 Index, Histological Grade.
3. Hematologic Data: Alpha-Fetoprotein (AFP), Carcinoembryonic Antigen (CEA), Neutrophil Count, Lymphocyte Count, Platelet Count.
4. Preoperative Imaging Characteristics: Tumor location and maximum diameter of the primary tumor as determined by preoperative breast ultrasound.
Primary Outcome: ALNM status, determined by postoperative histopathological examination of excised sentinel lymph nodes or axillary lymph nodes, which serves as the gold standard.
2.3 Pathology interpretation criteria
1. Histological grading: Assessed based on glandular formation proportion, nuclear pleomorphism, and mitotic count. The sum of these three scores categorizes tumors as follows: 3–5 points = Grade I (well-differentiated); 6–7 points = Grade II (moderately differentiated); 8–9 points = Grade III (poorly differentiated) (14).
2. ER and PR interpretation criteria: Nuclear staining in <1% of tumor cells is negative; ≥1% nuclear staining is positive (1–10% = low-positive expression). BC is classified as HR+ if tumor cells show ER and/or PR positivity (15).
3. HER-2 interpretation criteria: For HER-2 immunohistochemical (IHC) staining, IHC 3+ denotes positivity; 0 or 1+ denotes negativity; IHC 2+ necessitates in situ hybridization (ISH) confirmation. Fluorescence in situ hybridization (FISH)-positive results indicate gene amplification, confirming HER-2 positivity; FISH-negative results are interpreted as HER-2 negative.
4. Ki-67 interpretation criteria: Nuclear brown staining (of any intensity) is defined as Ki-67 positivity. The percentage of Ki-67-positive nuclei among tumor cells is recorded quantitatively (16).
5. P53 interpretation criteria: Nuclear brownish-yellow granular staining is defined as P53 positivity. Semi-quantitative assessment: Ten random high-power fields (×400) are selected to quantify positive cell proportion: <10% = negative (-); 11–30% = weak positive (+); 31–50% = moderate positive (++); >50% = strong positive (+++). For this study, P53 positivity is defined as ≥11% positive cells.
6. Axillary lymph node positivity: Axillary lymph nodes are considered positive if histopathological examination identifies macrometastases, micrometastases, or isolated tumor cells (ITCs).
7. cN0 definition: Preoperative clinical lymph node status was assessed in all patients. cN0 is defined as the absence of regional lymph node metastases detected by imaging or clinical examination (5).
2.4 Data preprocessing
The missing values in the dataset were handled using Multiple Imputation by Chained Equations (MICE, implemented via the mice package in R). MICE is an iterative algorithm that fills missing values for each variable by establishing a series of chained regression models. The imputation model incorporated all variables required for subsequent modeling (including clinical features, pathological indicators, laboratory tests, imaging characteristics, and outcome variables). Notably, the outcome variable had no missing values prior to imputation. The mice() function was used with default settings to generate 5 imputed datasets, and the third imputed dataset (complete(imp, 3)) was selected for subsequent analysis. Use the md.pattern(mydata) function to visualize missing data and confirm that missing values are randomly distributed among variables, with no concentration in specific subgroups. In addition, continuous variables were standardized, and categorical variables were factorized. Crucially, to prevent data leakage, all preprocessing steps strictly adhered to a sequential timeline and workflow:
1. Initial Split: The primary cohort (patients from 2016–2023, n=726) was first randomly split into a training set (70%, n=503) and a test set (30%, n=223).
2. Class Imbalance Handling: To address class imbalance, the Random Over-Sampling Examples (ROSE) technique was applied exclusively to the training set. The test set was kept completely untouched and remained in its original imbalanced state to ensure an unbiased evaluation (17).
3. Independent Validation Set: The temporal validation cohort (patients from 2024, n=90) was completely isolated and locked prior to analysis. It underwent no resampling or preprocessing based on the development cohort and was used solely for final model evaluation.
2.5 Feature selection
Recursive Feature Elimination (RFE) was utilized to identify the optimal subset of predictive features. RFE iteratively removes features with the lowest contribution to model performance, systematically quantifying each feature’s importance while mitigating the risk of overfitting. To prevent information leakage, feature selection was restricted to the training set exclusively, with five-fold cross-validation employed to ensure model robustness. Ultimately, RFE identified 18 potential predictive features; based on performance metrics and model complexity trade-offs, the top 10 were selected for model development. These included reproductive history, age, tumor location, menopausal status, tumor diameter, lymphocyte count, platelet count, AFP, neutrophil count, and CEA.
2.6 Model construction
1. Support Vector Machine (SVM): This algorithm maximizes the interclass margin by identifying an optimal hyperplane to achieve data classification, making it well-suited for high-dimensional datasets.
2. Logistic Regression (LR): A generalized linear model that transforms linear regression outputs into probability values via a logistic (sigmoid) function, primarily applied to binary classification tasks.
3. Random Forest (RF): An ensemble learning method that constructs multiple decision trees and aggregates their predictions; it exhibits resistance to overfitting and is capable of handling high-dimensional data.
4. K-Nearest Neighbors (KNN): An instance-based learning algorithm that classifies samples by computing inter-sample distances and implementing a voting mechanism, suitable for multi-class classification tasks.
5. XGBoost: An ensemble learning algorithm based on gradient-boosted decision trees, characterized by robust predictive performance and strong generalization ability.
2.7 Machine learning interpretable tools
In this study, the model is interpreted using the SHAP method. SHAP is based on the Shapley values from game theory and is able to quantify the specific contribution of each feature to the model’s predicted outcomes and analyze the interactions between the features, providing an interpretable framework for model predictions. The unique SHAP value for each sample enables in-depth interpretation of individual prediction results.
2.8 Statistical analysis
All statistical modeling and visualizations were conducted using R software (version 4.4.2). Categorical variables were analyzed using the chi-square test or Fisher’s exact test and reported as frequencies (percentages). Continuous variables with a normal distribution were summarized as mean ± standard deviation, with intergroup comparisons conducted using the t-test. Non-normally distributed continuous variables were presented as quartiles, with groupwise variability assessed using the Wilcoxon rank-sum test. Statistical significance was defined as a two-tailed p-value < 0.05. The discriminatory performance of the model was quantified using the area under the receiver operating characteristic curve (AUC). For clinical utility assessment, decision curve analysis (DCA) was performed to calculate net benefit values across a range of risk thresholds, thereby evaluating the clinical decision-making utility of the predictive model.
3 Results
3.1 Patient characteristics
A total of 816 patients treated between January 2016 and December 2024 were included in the study, stratified into a training set (n=503), a test set (n=223), and a temporal validation set (n=90). The dataset exhibited a 6.3% missing value rate, which was addressed using Multiple Imputation. Baseline characteristics were well-balanced between the training and test sets, with no statistically significant differences observed (p>0.05). Detailed patient characteristics are presented in Table 1.
Among the included patients, the Luminal B subtype accounted for 75.5%, and 97.9% were married. Regarding reproductive history, A parity of 2 (P2) was the most common (38.8%), followed by P1 (27.0%) and P3 (21.5%). Overall, 47.0% of patients were postmenopausal, and 49.9% had a BMI <24 kg/m². Most tumors were located in the upper outer quadrant (44.5%), with T2 stage being the most prevalent (50.0%), and histological grading was predominantly moderately differentiated (Grade II, 64.5%). The Ki-67 index was most frequently 14–30% (48.3%), P53 positivity was observed in 37.9% of patients, and HER2 positivity in 17.4%. The incidence of axillary lymph node metastasis (ALNM) was 30.9%. The mean patient age was 46.9 ± 10.8 years. Median laboratory parameters were as follows: neutrophil count 3.23 × 109/L, lymphocyte count 1.84 × 109/L, platelet count 208.0 × 109/L, carcinoembryonic antigen (CEA) 2.58 ng/mL, and alpha-fetoprotein (AFP) 2.54 ng/mL.
In the training set, between-group comparisons of ALNM status (Table 2) revealed that parity ≥3, postmenopausal status, tumor location in the upper outer quadrant, advanced tumor stage (T3), Luminal B subtype, poor histological grade (Grade III), high Ki67 index (>30%), older age, and elevated CEA levels were significantly associated with an increased risk of ALNM (P < 0.05). In contrast, variables including marital status and body mass index (BMI) showed no significant association with ALNM risk (P> 0.05).
Table 2. Clinical and pathological characteristics of the influence of the training set on the occurrence of ALNM.
3.2 Model construction and evaluation
Based on training set data, we developed five machine learning predictive models: XGBoost, RF, LR, SVM, and KNN. During model development, hyperparameter optimization was conducted using five-fold cross-validation combined with grid search to systematically evaluate the performance of each parameter combination. The test set and temporal validation set remained fully independent throughout the training and tuning process, and were only utilized once after model finalization to ensure an objective assessment of the model’s generalization ability. Evaluation of the test set demonstrated that all models achieved an AUC of >0.80 (Figure 1). Among these, the KNN model exhibited the strongest performance, with an AUC of 0.898 (95% confidence interval [CI]: 0.857–0.939).
Figure 1. Test set: ROC curves analyzing the prediction performance of five ML algorithms for ALNM in HR+ patients.
To evaluate the temporal generalizability of the models, we assessed their performance on the held-out temporal validation cohort (Figure 2). The KNN model exhibited robust performance on this temporal validation set, achieving an AUC of 0.774 (95% CI: 0.655–0.892). This outcome was highly consistent with its performance on the training set, confirming strong generalizability of the KNN model. To evaluate the clinical utility of the models, DCA was used to quantify the net clinical benefit of each model across a range of threshold probabilities (Figure 3). All models outperformed the reference strategies of “treating all patients” (orange line) and “treating no patients” (yellow line) on the decision curve. Notably, the KNN model yielded the highest net clinical benefit across a threshold probability range of 30–65%.
Figure 2. Temporal validation set: ROC curves analyzing the prediction performance of five ML algorithms for ALNM in HR+ patients.
Figure 3. Decision curve analysis of net returns for five models with different threshold probabilities.
Model performance was further comprehensively assessed across multiple metrics, including accuracy, sensitivity, positive predictive value (PPV), negative predictive value (NPV), and F1 score (Table 3). These analyses confirmed the superior performance of the KNN model. Accordingly, the KNN model was designated as the optimal model in this study for predicting ALNM risk in patients with HR+ BC.
3.3 Interpret the KNN model using the SHAP method
Additionally, we ranked the relative importance of clinicopathologic features within the model. As depicted in Figure 4, reproductive history was identified as the most impactful predictor of ALNM development, followed by age, tumor location, menopausal status, tumor diameter, lymphocyte count, and platelet count. Figure 5 further characterizes the directional effects of each variable on ALNM risk: positive SHAP values (orange, right) signify an elevated probability of ALNM, whereas negative values (purple, left) denote a reduced risk. Reproductive history exhibited a strong positive association with ALNM occurrence, where higher values (orange) correlated with elevated risk. Conversely, specific tumor locations (orange, left) were associated with favorable outcomes, as indicated by negative SHAP values. For instance, higher reproductive history scores (orange, right) correlated with poorer prognostic outcomes, whereas specific tumor locations (orange, left) were associated with more favorable outcomes relative to lower scores (purple, right).
3.4 SHAP waterfall diagram
Individual SHAP waterfall plots for two randomly selected HR+ BC patients are presented in Figure 6 (with ALNM) and Figure 7 (without ALNM). Starting from the baseline predicted value (E[f(x)]), the plots sequentially illustrate the contribution of each feature to the final predicted value (f(x)). The bar corresponding to each feature quantifies its specific contribution to the prediction, where color indicates the direction of contribution (positive or negative) and length denotes the magnitude. In Figure 6, the predicted value increased from the baseline of 1.32 to 2 following the cumulative contributions of individual features. Specifically, reproductive history (P4) and menopausal status (postmenopausal) were the key positive contributors elevating the predicted value, significantly increasing the ALNM risk assessment. In contrast, the predicted value in Figure 7 decreased from the baseline of 1.32 to 1, primarily driven by negative contributions from reproductive history (P1) and AFP levels (1.95 ng/mL), which reduced the ALNM risk assessment.
Figure 6. SHAP waterfall diagram for the occurrence of ALNM. Parity: none = 0, 1 = 1, 2 = 2, 3 = 3, >3 = 4; Menopause: not menopausal = 0, menopausal = 1; Tumor_Location: Upper outer quadrant = 1, Upper inner quadrant = 2, Lower inner quadrant = 3, Lower outer quadrant = 4, central = 0; T Stage: T1 = 1, T2 =2, T3 =3.
Figure 7. SHAP waterfall diagram without ALNM. Parity: none = 0, 1 = 1, 2 = 2, 3 = 3, >3 = 4; Menopause: not menopausal = 0, menopausal = 1; Tumor_Location: Upper outer quadrant = 1, Upper inner quadrant = 2, Lower inner quadrant = 3, Lower outer quadrant = 4, central = 0; T Stage: T1 = 1, T2 =2, T3 =3.
4 Discussion
For patients with cN0 BC, SLNB serves as the standard of care for evaluating axillary lymph node status. However, studies have reported that approximately 30% of cN0 patients harbor sentinel lymph node (SLN) metastases on SLNB (7). Notably, 20–60% of these patients with SLN metastases do not progress to further axillary lymph node involvement. For this subset, the clinical benefit of ALND remains uncertain, potentially constituting overtreatment (6). Thus, there is an unmet clinical need for reliable predictive tools to mitigate unnecessary axillary surgical interventions in cN0 patients—especially within the large HR+ subtype population.
In this study, the KNN model exhibited superior performance in predicting ALNM risk among cN0-stage HR+ BC patients compared to other machine learning algorithms. Leveraging the SHAP method, we quantified the relative importance of predictive features and elucidated the specific contribution of key variables to individual patient predictions via SHAP waterfall plots. By integrating the ranked feature importance with clinical expertise, we provide a data-driven foundation for individualized risk stratification and clinical decision-making in this patient population.
This research model confirmed reproductive history as the primary predictor of axillary lymph node metastasis. Multiple studies have pointed out that, after controlling for other factors, certain aspects of reproductive history remain independent predictors of lymph node metastasis. A long-term follow-up study based on the Swedish national population explicitly identified high parity (≥4 births) as an independent risk factor for axillary lymph node metastasis (OR = 1.56) (18). This confirms the universality of the association between reproductive history and lymph node metastasis from a large-scale population perspective. Multiple earlier studies provided corroborating evidence from different angles: Japanese research showed higher ALNM rates in BC patients with recent childbirth, particularly pronounced in HR+ patients (5); multivariate analysis of ALNM in invasive BC identified prior pregnancy history as a significant predictor of axillary metastasis (17); Another comparative study of Uyghur and Han Chinese women similarly confirmed multiple pregnancies as a shared risk factor (7). Zhang’s team’s large-scale Chinese cohort study confirmed a significant association between high parity and luminal B BC risk (19). The luminal B subtype itself is clinically recognized as closely associated with higher lymph node metastasis rates. A case-control study in Northeast China reached the same conclusion: multiple pregnancies increase BC risk (20).
We hypothesize that, within the cN0 HR+ population of this study, a history of high parity likely identifies a tumor subgroup with a biological predisposition toward luminal B characteristics and thus greater invasive potential. Recent cutting-edge research offers deeper mechanistic insights: a complete pregnancy-lactation cycle induces specific CD8+ T cells to persistently reside in breast tissue, thereby enduringly reshaping the immune microenvironment (21). Interestingly, this appears to contradict the epidemiological finding that “childbirth reduces BC incidence.” We propose a dual-role theory to reconcile this contradiction: at the population level, childbirth may reduce the risk of healthy cells becoming cancerous through tissue remodeling (preventive effect). However, once a tumor develops in breast tissue undergoing such remodeling, it may exhibit heightened invasiveness (promoting progression). This study focuses on the latter mechanism, revealing the significant predictive value of childbirth history during disease progression.
The current study identifies age, primary tumor location, and menopausal status as key predictors of ALNM in cN0 HR+ BC patients—findings that align with and extend prior clinical evidence. A comparative analysis of younger (<40 years) and older BC patients demonstrated that younger individuals had a higher rate of ALNM in non-metastatic disease (73.2% vs 55.6%, P < 0.001) and were more frequently diagnosed with T3/4 tumors (28.2% vs 13.8%, P < 0.001) (22). Additionally, multivariate analysis in a training cohort confirmed a significant association between younger age and increased ALNM risk (P = 0.006) (23), while Hu et al. further identified young age, together with tumor size, as independent predictors of ALNM (24). Regarding tumor location, Xiong et al. similarly reported age and upper outer quadrant tumor location as independent predictors of ALNM (11). In a study of T1-T2 N0 BC, the upper outer quadrant was specifically identified as a high-risk factor for ALNM (odds ratio [OR]=4.49, 95% CI: 1.63–12.37, P = 0.004) (25), and Xue et al. further confirmed tumor location (OR = 4.019, 95% CI: 1.304–12.383, P = 0.015) and tumor size (OR = 3.702, 95% CI: 1.517–9.034, P = 0.004) as independent risk factors (12). Furthermore, nomogram analyses consistently validated tumor location as an independent predictor of ALNM (P = 0.010) (26), with another nomogram-based study additionally recognizing preoperative tumor size (P = 0.030) and menopausal status (P = 0.017) as key prognostic factors (27). Consistent with our findings, prior studies have reported that postmenopausal women are at increased risk of axillary metastasis (28). We hypothesize that this may be attributed to the fact that breast masses in older women are often overlooked, leading to a longer disease course prior to diagnosis and thereby increasing the window of opportunity for ALNM development. Collectively, these data reinforce the clinical relevance of age, tumor location, and menopausal status in ALNM risk stratification, particularly in the cN0 HR+ population.
Tumor diameter is a core component of the TNM staging system, with tumors >5 cm in diameter typically associated with higher lymph node metastasis rates and increased distant metastasis risk (29). A study predicting ALNM using ultrasound features identified tumor diameter as a significant correlate (OR = 4.312, 95% CI: 2.933–7.364) (9). Gao et al. further demonstrated that tumor size (20–50 mm: OR = 3.682, 95% CI: 3.181–4.267, p < 0.001; >50 mm: OR = 9.725, 95% CI: 6.240–15.827, p < 0.001) and central tumor location (p < 0.001) were independent predictors of ALNM (8). Multiple other studies have consistently validated tumor size as a robust predictor of axillary lymph node status (30, 31). Consistent with these prior findings, our model confirms that tumor diameter, a key determinant of T stage, is a critical predictor of ALNM development in cN0 HR+ BC patients. This alignment underscores the biological rationale linking larger tumor burden to increased likelihood of lymphatic spread, reinforcing the clinical utility of tumor diameter in ALNM risk stratification.
Beyond clinicopathological characteristics, systemic inflammation, reflected by the preoperative neutrophil count, lymphocyte count, and platelet count, may offer specific prognostic insights in our cN0 HR+ cohort. Evidence suggests that neutrophil counts may serve as a predictor of ALNM in BC (32). This association may be underpinned by tumor-induced systemic reprogramming, where BC cells remotely alter bone marrow function via cytokines (e.g., IL-1β/G-CSF), promoting a pro-metastatic neutrophil phenotype (33). In HR+ patients, circulating neutrophils have been demonstrated to exhibit distinct molecular characteristics compared to other subtypes (34). Concurrently, a lower lymphocyte count may reflect weakened anti-tumor immune surveillance, further tipping the balance toward a pro-metastatic state. This is supported by a study in young women with HR+/HER2- breast cancer, which showed that higher stromal abundance of specific T−helper lymphocyte subsets is associated with improved survival outcomes (35). Similarly, platelets are activated by tumor-derived factors and can facilitate early metastasis by cloaking circulating tumor cells and secreting pro-angiogenic factors (36).
In summary, within the specific biological context of HR+ breast cancer, preoperative alterations in these three blood counts are not isolated phenomena. Instead, they may be interrelated components of a tumor-elicited systemic response. This accessible composite picture of host inflammation and immune status could provide a rationale for their collective predictive value in stratifying ALNM risk in cN0 patients.
Geng et al. identified the serum tumor marker CEA as an independent risk factor for ALNM (37), with patients with elevated CEA levels having a significantly higher risk of ALNM compared to those with normal levels (OR = 2.139, 95% CI: 1.261–3.630, p = 0.005) (38). It is important to note, however, that the predictive value of CEA for ALNM remains a subject of ongoing investigation with somewhat inconsistent conclusions across studies. While several studies, including those by Wu and Zhao, have supported its role as an independent predictor for nodal metastasis (38, 39), other analyses have found that CEA failed to retain independent significance in multivariate models (40). This discrepancy may stem from variations in study populations, sample sizes, or cutoff values. In the context of our cN0 HR+ cohort, the inclusion of CEA in the final model suggests a potential association, but its clinical application warrants careful interpretation and external validation.
Notably, the feature selection process in this study identified AFP as one of the predictors for ALNM. The significance of AFP may not stem from its role as a conventional diagnostic biomarker for BC, but rather from its potential as a surrogate signal indicating underlying aggressive biological behavior. This is supported by emerging research: an immunomics analysis specifically designed for BC observed differential expression of AFP, suggesting its non-incidental relevance in this context (41). Biologically, AFP is a cancer-embryonic protein with recognized immunomodulatory functions, such as suppressing NK cell and dendritic cell activity, thereby potentially fostering an immunosuppressive tumor microenvironment conducive to metastasis (42). Its re-expression in tumors may also reflect cellular dedifferentiation or stem cell-like properties, correlated with enhanced metastatic potential. Although not a standard marker, elevated serum AFP levels have been detected in a subset of BC patients and may correlate with postoperative recurrence risk (43, 44). Therefore, within our specific cohort of cN0 HR+ BC, preoperative AFP levels might provide unique supplementary information for risk stratification by hinting at both the immunosuppressive landscape and the inherent aggressiveness of tumor cells. Naturally, this association requires further validation in prospective, multicenter cohorts and basic experiments to clarify its causal mechanisms and feasibility for clinical application.
This study has several limitations. First, as a retrospective analysis from a single institution, the findings may be influenced by local clinical practices and patient demographics, affecting generalizability. Second, and most critically, our model was validated using a temporal validation cohort from the same hospital rather than a geographically independent external cohort. While this approach tests model stability over time and is stronger than a random split, it cannot assess geographic or demographic generalizability. Third, the sample size of the temporal validation cohort was relatively small, which may limit the precision of the performance estimates. Fourth, despite using multiple imputation, missing data in some variables could introduce bias. Finally, the clinical and biological implications of some predictor associations identified by the model, such as AFP, require further validation in independent cohorts and mechanistic investigation to fully establish their role in clinical decision-making.
5 Conclusion
In this study, we developed and validated a machine learning-based prediction model for ALNM in patients with cN0, HR+ BC. The KNN algorithm demonstrated the most robust performance among the models compared, with an AUC of 0.898 in the test set and 0.774 in the temporal validation cohort. Using the SHAP interpretability framework, parity was identified as the most influential preoperative predictor, followed by age, tumor location, and menopausal status. This interpretable model provides a data-driven approach for preoperative ALNM risk assessment in this specific patient population, offering insights that could inform discussions on axillary surgical management.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding author.
Ethics statement
The studies involving humans were approved by The Ethics Committee of Qinghai University Affiliated Hospital (Approval No. P-SL-2024-406). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.
Author contributions
WL: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Resources, Software, Validation, Writing – original draft, Writing – review & editing. ZM: Funding acquisition, Writing – review & editing, Supervision. YW: Supervision, Visualization, Writing – review & editing. QC: Formal analysis, Visualization, Writing – original draft. LW: Formal analysis, Software, Visualization, Writing – original draft. JC: Formal analysis, Software, Visualization, Writing – original draft.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This study was supported by the National Natural Science Foundation of China (No. 82060485).
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: Cancer J Clin. (2021) 71:209–49. doi: 10.3322/caac.21660
2. Huang H, Wei T, Zhang A, Zhang H, Kong L, Li Y, et al. Trends in the incidence and survival of women with hormone receptor-positive breast cancer from 1990 to 2019: a large population-based analysis. Sci Rep. (2024) 14:23690. doi: 10.1038/s41598-024-74746-1
3. Goldhirsch A, Winer EP, Coates AS, Gelber RD, Piccart-Gebhart M, Thürlimann B, et al. Personalizing the treatment of women with early breast cancer: highlights of the St Gallen International Expert Consensus on the Primary Therapy of Early Breast Cancer 2013. Ann Oncol. (2013) 24:2206–23. doi: 10.1093/annonc/mdt303
4. Loibl S, Poortmans P, Morrow M, Denkert C, and Curigliano G. Breast cancer. Lancet (London England). (2021) 397:1750–69. doi: 10.1016/S0140-6736(20)32381-3
5. Giuliano AE, Connolly JL, Edge SB, Mittendorf EA, Rugo HS, Solin LJ, et al. Breast Cancer-Major changes in the American Joint Committee on Cancer eighth edition cancer staging manual. CA: Cancer J Clin. (2017) 67:290–303. doi: 10.3322/caac.21393
6. Maimaitiaili A, Wu D, Liu Z, Liu H, Muyiduli X, and Fan Z. Erratum to Analysis of factors related to non-sentinel lymph node metastasis in 296 sentinel lymph node-positive Chinese breast cancer patients. Cancer Biol Med. (2018) 15:478. doi: 10.20892/j.issn.2095-3941.2018.0438
7. Chen W, Wang C, Fu F, Yang B, Chen C, and Sun Y. A model to predict the risk of lymph node metastasis in breast cancer based on clinicopathological characteristics. Cancer Manage Res. (2020) 12:10439–47. doi: 10.2147/CMAR.S272420
8. Gao X, Luo W, He L, and Yang L. Nomogram models for stratified prediction of axillary lymph node metastasis in breast cancer patients (cN0). Front endocrinol. (2022) 13:967062. doi: 10.3389/fendo.2022.967062
9. Yao J, Zhou W, Zhu Y, Zhou J, Chen X, and Zhan W. Predictive nomogram using multimodal ultrasonographic features for axillary lymph node metastasis in early−stage invasive breast cancer. Oncol letters. (2024) 27:95. doi: 10.3892/ol.2024.14228
10. Esteva A, Robicquet A, Ramsundar B, Kuleshov V, DePristo M, Chou K, et al. A guide to deep learning in healthcare. Nat Med. (2019) 25:24–9. doi: 10.1038/s41591-018-0316-z
11. Xiong J, Zuo W, Wu Y, Wang X, Li W, Wang Q, et al. Ultrasonography and clinicopathological features of breast cancer in predicting axillary lymph node metastases. BMC cancer. (2022) 22:1155. doi: 10.1186/s12885-022-10240-z
12. Xue M, Che S, Tian Y, Xie L, Huang L, Zhao L, et al. Nomogram based on breast MRI and clinicopathologic features for predicting axillary lymph node metastasis in patients with early-stage invasive breast cancer: A retrospective study. Clin Breast cancer. (2022) 22:e428–e37. doi: 10.1016/j.clbc.2021.10.014
13. Lundberg SM and Lee S-I. (2017). A unified approach to interpreting model predictions, in: Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, California, USA. Red Hook, NY, USA: Curran Associates Inc. pp. 4768–77.
14. Mei F, Liu JY, and Xue WC. Histological grading of invasive breast carcinoma: Nottingham histological grading system. Zhonghua bing li xue za zhi = Chin J pathol. (2019) 48:659–64. doi: 10.3760/cma.j.issn.0529-5807.2019.08.019
15. Allison KH, Hammond MEH, Dowsett M, McKernin SE, Carey LA, Fitzgibbons PL, et al. Estrogen and progesterone receptor testing in breast cancer: ASCO/CAP guideline update. J Clin Oncol. (2020) 38:1346–66. doi: 10.1200/JCO.19.02309
16. Nielsen TO, Leung SCY, Rimm DL, Dodson A, Acs B, Badve S, et al. Assessment of Ki67 in breast cancer: updated recommendations from the international Ki67 in breast cancer working group. J Natl Cancer Institute. (2021) 113:808–19. doi: 10.1093/jnci/djaa201
17. Budhathoki N, Bhandari R, Bashyal S, and Lee C. Predicting asthma using imbalanced data modeling techniques: Evidence from 2019 Michigan BRFSS data. PLoS One. (2023) 18:e0295427. doi: 10.1371/journal.pone.0295427
18. Manjer J, Balldin G, Zackrisson S, and Garne JP. Parity in relation to risk of axillary lymph node involvement in women with breast cancer. Results from Swedish population-based series of 3,472 consecutive cases. Eur Surg Res Europaische chirurgische Forschung Recherches chirurgicales europeennes. (2005) 37:179–84. doi: 10.1159/000085966
19. Zhang L, Huang Y, Feng Z, Wang X, Li H, Song F, et al. Comparison of breast cancer risk factors among molecular subtypes: A case-only study. Cancer Med. (2019) 8:1882–92. doi: 10.1002/cam4.2012
20. Xing P, Li J, and Jin F. A case-control study of reproductive factors associated with subtypes of breast cancer in Northeast China. Med Oncol (Northwood London England). (2010) 27:926–31. doi: 10.1007/s12032-009-9308-7
21. Virassamy B, Caramia F, Savas P, Harris MA, Pan JW, Wang J, et al. Parity and lactation induce T-cell-mediated breast cancer protection. Nature. (2025) 649:449–59. doi: 10.1038/s41586-025-09713-5
22. Abdel-Razeq H, Iweir S, Abdel-Razeq R, Rahman FA, Almasri H, Bater R, et al. Differences in clinicopathological characteristics, treatment, and survival outcomes between older and younger breast cancer patients. Sci Rep. (2021) 11:14340. doi: 10.1038/s41598-021-93676-w
23. Cheng MY, Wu CG, Lin YY, Zou JC, Wang DQ, Haffty BG, et al. Development and validation of a multivariable risk model based on clinicopathological characteristics, mammography, and MRI imaging features for predicting axillary lymph node metastasis in patients with upgraded ductal carcinoma in situ. Gland Surg. (2025) 14:738–53. doi: 10.21037/gs-2025-89
24. Hu X, Xue J, Peng S, Yang P, Yang Z, Yang L, et al. Preoperative nomogram for predicting sentinel lymph node metastasis risk in breast cancer: A potential application on omitting sentinel lymph node biopsy. Front Oncol. (2021) 11:665240. doi: 10.3389/fonc.2021.665240
25. Fu F, Zhang Y, Sun J, Zhang C, Zhang D, Xie L, et al. Predictors of sentinel lymph node metastasis in Chinese women with clinical T1-T2 N0 breast cancer and a normal axillary ultrasound. Acta radiologica (Stockholm Sweden: 1987). (2022) 63:1463–8. doi: 10.1177/02841851211054191
26. Wang S, Wang D, Wen X, Xu X, Liu D, and Tian J. Construction and validation of a nomogram prediction model for axillary lymph node metastasis of cT1 invasive breast cancer. Eur J Cancer Prev. (2024) 33:309–20. doi: 10.1097/CEJ.0000000000000860
27. Akissue de Camargo Teixeira P, Chala LF, Shimizu C, Filassi JR, Maesaka JY, and de Barros N. Axillary lymph node sonographic features and breast tumor characteristics as predictors of Malignancy: A nomogram to predict risk. Ultrasound Med Biol. (2017) 43:1837–45. doi: 10.1016/j.ultrasmedbio.2017.05.003
28. Xie F, Yang H, Wang S, Zhou B, Tong F, Yang D, et al. A logistic regression model for predicting axillary lymph node metastases in early breast carcinoma patients. Sensors (Basel Switzerland). (2012) 12:9936–50. doi: 10.3390/s120709936
29. Liu Y, He M, Zuo WJ, Hao S, Wang ZH, and Shao ZM. Tumor size still impacts prognosis in breast cancer with extensive nodal involvement. Front Oncol. (2021) 11:585613. doi: 10.3389/fonc.2021.585613
30. Ye X, Zhang X, Lin Z, Liang T, Liu G, and Zhao P. Ultrasound-based radiomics nomogram for predicting axillary lymph node metastasis in invasive breast cancer. Am J Trans Res. (2024) 16:2398–410. doi: 10.62347/KEPZ9726
31. Dong X, Meng J, Xing J, Jia S, Li X, and Wu S. Predicting axillary lymph node metastasis in young onset breast cancer: A clinical-radiomics nomogram based on DCE-MRI. Breast Cancer (Dove Med Press). (2025) 17:103–13. doi: 10.2147/BCTT.S495246
32. Bozan MB, Yazar FM, Kale IT, Topuz S, Bozan AA, and Boran OF. Immature granulocyte count and delta neutrophil index as new predictive factors for axillary metastasis of breast cancer. J Coll Physicians Surgeons--Pakistan: JCPSP. (2022) 32:220–5. doi: 10.29271/jcpsp.2022.02.220
33. Garner H, Martinovic M, Liu NQ, Bakker NAM, Velilla IQ, Hau CS, et al. Understanding and reversing mammary tumor-driven reprogramming of myelopoiesis to reduce metastatic spread. Cancer Cell. (2025) 43:1279–95.e9. doi: 10.1016/j.ccell.2025.04.007
34. Ramessur A, Ambasager B, Valle Aramburu I, Peakman F, Gleason K, Lehmann C, et al. Circulating neutrophils from patients with early breast cancer have distinct subtype-dependent phenotypes. Breast Cancer research: BCR. (2023) 25:125. doi: 10.1186/s13058-023-01707-3
35. Tesch ME, Arocho YDG, Collins LC, Heng J, Zheng Y, Tayob N, et al. Association of tumor-infiltrating lymphocyte subtypes with clinical characteristics and prognosis in young women with hormone receptor-positive breast cancer. Clinical cancer research : an official journal of the American Association for Cancer Research. (2023) 41:505–. doi: 10.1200/JCO.2023.41.16_suppl.505
36. Corbeau I, Thezenas S, Maran-Gonzalez A, Colombo PE, Jacot W, and Guiu S. Inflammatory blood markers as prognostic and predictive factors in early breast cancer patients receiving neoadjuvant chemotherapy. Cancers. (2020) 12:2666. doi: 10.3390/cancers12092666
37. Geng SK, Fu SM, Zhang HW, and Fu YP. Predictive nomogram based on serum tumor markers and clinicopathological features for stratifying lymph node metastasis in breast cancer. BMC cancer. (2022) 22:1328. doi: 10.1186/s12885-022-10436-3
38. Wu SG, He ZY, Ren HY, Yang LC, Sun JY, Li FY, et al. Use of CEA and CA15–3 to predict axillary lymph node metastasis in patients with breast cancer. J Cancer. (2016) 7:37–41. doi: 10.7150/jca.13090
39. Zhao W, Li X, Wang W, Chen B, Wang L, Zhang N, et al. Association of preoperative serum levels of CEA and CA15–3 with molecular subtypes of breast cancer. Dis markers. (2021) 2021:5529106. doi: 10.1155/2021/5529106
40. Zhang J, Ling Y, Wang T, Yan C, Huang M, Fan Z, et al. Analysis of sentinel lymph node biopsy and non-sentinel lymph node metastasis in invasive ductal and invasive lobular breast cancer: a nationwide cross-sectional study (CSBrS-001). Ann Trans Med. (2021) 9:1588. doi: 10.21037/atm-21-5169
41. Hermann N, Dressen K, Schroeder L, Debald M, Schildberg FA, Walgenbach-Bruenagel G, et al. Diagnostic relevance of a novel multiplex immunoassay panel in breast cancer. Tumour Biol. (2017) 39:1010428317711381. doi: 10.1177/1010428317711381
42. Munson PV, Adamik J, and Butterfield LH. Immunomodulatory impact of α-fetoprotein. Trends Immunol. (2022) 43:438–48. doi: 10.1016/j.it.2022.04.001
43. Sarui H, Nakayama T, Takeda N, Ishizuka T, and Yasuda K. Alpha-fetoprotein-producing male breast cancer accompanied with hepatocellular carcinoma: assessment by lectin-affinity profile. Am J Med Sci. (2001) 322:369–72. doi: 10.1097/00000441-200112000-00011
Keywords: axillary lymph node metastasis, breast cancer, hormone receptor positivity, machine learning, predictive modeling
Citation: Liu W, Ma Z, Wang Y, Chen Q, Wang L and Chi J (2026) Construction of a prediction model for axillary lymph node metastasis in stage cN0 hormone receptor-positive breast cancer: based on interpretable machine learning methods. Front. Oncol. 16:1763228. doi: 10.3389/fonc.2026.1763228
Received: 08 December 2025; Accepted: 19 January 2026; Revised: 07 January 2026;
Published: 03 February 2026.
Edited by:
Robert Wieder, Rutgers University, United StatesReviewed by:
Haowei Huang, Guangzhou Red Cross Hospital, ChinaKui Jia, The First Affiliated Hospital of Guangxi Medical University, China
Copyright © 2026 Liu, Ma, Wang, Chen, Wang and Chi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Zhijun Ma, bXpqZmFtYWlAMTYzLmNvbQ==
Zhijun Ma2*