Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Surg., 20 January 2026

Sec. Surgical Oncology

Volume 12 - 2025 | https://doi.org/10.3389/fsurg.2025.1722983

This article is part of the Research TopicImaging, Surgical and Treatment Strategies Among Young Patients Affected by Breast CancerView all 3 articles

Development and validation of a nomogram for predicting bone metastasis in breast cancer: a retrospective study


Yingnan LiYingnan LiTeng MaTeng MaXinyi SunXinyi SunChanggen LiuChanggen LiuHaibo Wang

Haibo Wang*
  • Breast Disease Diagnosis and Treatment Center, The Affiliated Hospital of Qingdao University, Qingdao, China

Background: Bone metastasis is the most common site of distant metastasis in breast cancer. Patients with bone metastasis have their quality of life and survival rate threatened. This study aims to develop a practical nomogram for predicting the risk of bone metastasis in breast cancer by integrating clinical data, assisting doctors in making more scientific clinical decisions.

Methods: We conducted a retrospective analysis of the data of newly diagnosed breast cancer patients from the database of the Affiliated Hospital of Qingdao University from January 2015 to December 2017. The cohort is divided into training set and validation set in a ratio of 7.5:2.5. Determine independent risk factors through Least Absolute Shrinkage and Selection Operator (LASSO) regression analysis and logistic regression, and develop a nomogram prediction model. The model's performance and clinical utility were evaluated by Receiver Operating Characteristic (ROC) curve analysis, Area Under the Curve (AUC), calibration curves, and Decision Curve Analysis (DCA).

Results: During the 5-year follow-up period, bone metastases developed in 48 of 421 patients (11.40%). Ultimately, six independent risk factors were identified: neoadjuvant chemotherapy, family history of cancer, distant metastasis in other locations, axillary lymph node metastasis, marital status, and primary tumor site. The nomogram demonstrated excellent predictive performance, with AUC values of 0.89 and 0.86 in the training and validation cohorts, respectively.

Conclusions: This pioneering nomogram, incorporating baseline, tumor characteristics, and therapeutic parameters, provides visual guidance for breast surgeons to assess bone metastasis risk in breast cancer patients. It enables clinicians to prioritize high-risk patients through early identification, thereby optimizing surveillance protocols and therapeutic strategies to safeguard patients' quality of life.

1 Introduction

Breast cancer, characterized by its distinctive epidemiological patterns and marked heterogeneity, remains one of the leading causes of cancer-related mortality among women worldwide. According to global statistics, over 2 million new cases are diagnosed annually, substantially compromising both the quality of life and survival outcomes for affected individuals (1). Reportedly, the 5-year overall survival rate exceeds 80% in patients with non-metastatic breast cancer, whereas the development of distant metastasis drastically reduces the 5-year survival rate to approximately 25% (2). bone metastasis accounts for over 75% of distant metastases in breast cancer, constituting the most frequent site of distant spread (3). Bone metastasis in breast cancer can induce skeletal-related events (SREs), including pain, functional impairment, hypercalcemia, pathological fractures, spinal cord and nerve root compression, and bone marrow infiltration, contributing to diminished quality of life in affected patients (4, 5). Patients suffering from bone metastases tend to demonstrate an extended overall survival (OS) in comparison to individuals with metastases affecting visceral organs. However, the presence of intense bone pain can severely affect their functional abilities. As a result, this leads to a considerable decline in their quality of life (6).

Current research indicates that the risk factors for bone metastasis in breast cancer remain controversial. This topic has been extensively investigated in both domestic and international studies. Numerous clinical investigations have established that factors such as the patient's age, tumor grade, molecular subtype, histological classification, along with the existence of concurrent metastases in the lung, liver, or brain, serve as independent risk determinants for bone metastasis in breast cancer (710). Furthermore, human epidermal growth factor receptor 2 (HER2) positivity has been identified as a risk factor for both primary and metastatic breast cancer (11). However, emerging research paradoxically suggests that human epidermal growth factor receptor 2 (HER2) positivity may act as a protective factor in metastatic breast cancer (12). The Tumor-Node-Metastasis (TNM) staging system is widely employed in clinical decision-making and prognosis prediction. However, it falls short in adequately quantifying and predicting tumor biology and the risk of distant metastasis (13, 14). Recent studies have constructed various predictive tools using machine learning, such as risk scores derived from the Surveillance, Epidemiology, and End Results (SEER) database and predictive models incorporating hematological parameters, facilitating preliminary risk predictions (7, 9). However, current models may exhibit considerable heterogeneity in the weighting of risk factors. For instance, the impact of hormone receptor status on risk shows inconsistent directions across different studies (10, 15). Current research may still fail to achieve accurate prediction of breast cancer bone metastasis. Moreover, individualized prediction is susceptible to misclassification. Therefore, we aim to construct a novel nomogram model. This model seeks to address these critical constraints.

Additionally, radiation exposure from diagnostic imaging is a frequent concern in clinical practice (16). Although nuclear medicine examinations, such as bone scans, effectively detect bone metastasis, their utility is often limited. This limitation stems from public apprehension and financial constraints (17). Therefore, the development of a straightforward nomogram to effectively predict malignant bone metastasis risk carries significant clinical implications.

Compared to traditional scoring systems, nomograms offer visual and personalized advantages. A nomogram graphically represents regression equations. It simplifies outcome prediction by integrating continuous and discrete variables through interactive axes (18). Unlike single-factor-based risk stratification, nomograms employ multivariate analysis. They integrate weighted factors for comprehensive assessment (19). This provides a visual approach to accurately predict individual bone metastasis risk in breast cancer patients. Consequently, it facilitates optimization of personalized treatment plans. Most current breast cancer bone metastasis risk prediction models are derived from public databases. Their applicability to Asian populations remains unclear (7, 20). Note that studies on breast cancer bone metastasis prediction models based on region-specific data remain scarce.

Consequently, the objective of this research is to create and validate a nomogram designed for the personalized assessment of the risk of bone metastasis in patients diagnosed with breast cancer. From comprehensively collected demographic and clinical data, we built a nomogram integrating these multifaceted features. The nomogram estimates the likelihood of bone metastasis in breast cancer patients, aiding clinicians in evidence-based therapeutic decision-making.

2 Patients and methods

2.1 Research design

This single-center retrospective study utilized anonymized clinical data from this center. A flow diagram presents the overall study design, including patient screening and variable collection processes (Figure 1).

Figure 1
Flowchart illustrating the process of data collection, variable selection, and nomogram development for a study on bone metastasis in breast cancer patients. The study period is from January 2015 to December 2017, involving 698 cases with 40 variables initially. Exclusion criteria are listed, narrowing down to 421 cases and 37 variables. The process splits into a training cohort of 317 cases and a validation cohort of 104 cases. LASSO selection reduces to 10 variables, with logistic regression further narrowing to 6 variables. The nomogram is established and validated using ROC, calibration, DCA curves, and HL test.

Figure 1. Study flowchart. ROC, receiver operating characteristic; DCA, decision curve analysis; HL, Hosmer-Lemeshow.

2.2 Patient information

This study was approved by the Ethics Committee of The Affiliated Hospital of Qingdao University. We included patients newly diagnosed with breast cancer between January 2015 and December 2017; these patients were subsequently followed for 60 months. All baseline and clinical data were retrieved from the MediCloud Database at The Affiliated Hospital of Qingdao University (ethics approval number: QYFY WZLL 29986, retrospectively registered). Informed consent, which was documented in writing, was secured from all patients participating in the study at the time of their admission. All methodologies in this study strictly adhered to the principles of the Declaration of Helsinki.

2.3 Inclusion and exclusion criteria

Patients meeting the following criteria were enrolled: (1) Histopathologically confirmed primary breast cancer; (2) Primary tumor staged as T1 (maximum tumor diameter ≤2 cm), T2 (maximum tumor diameter >2 cm and ≤5 cm), or T3 (maximum tumor diameter >5 cm), with complete TNM stage documentation (21); (3) Clearly documented interval from initial breast cancer diagnosis to bone metastasis development; (4) Complete baseline clinicopathological data, including histological grade, hormone receptor (ER/PR) status, HER2 status, Ki-67 index, and molecular subtype; (5) Traceable comprehensive treatment records, including surgical approach, neoadjuvant/adjuvant chemotherapy, radiotherapy, and endocrine therapy regimens; (6) Age ≥18 years with regular follow-up data.

Exclusion criteria were: (1) Male breast cancer patients; (2) Patients with ≥2 primary malignancies; (3) Patients not undergoing surgical treatment; (4) Patients with primary occult breast carcinoma (T0); (5) Patients lacking essential clinicopathological information, including primary tumor location, TNM stage (22), estrogen receptor (ER) and progesterone receptor (PR) status, HER2 status, histological type, and axillary lymph node metastasis (ALNM) status; (6) Patients presenting with distant metastasis at initial diagnosis; (7) Patients with concurrent bone metabolic diseases; (8) Patients lost to follow-up or whose death was unrelated to breast cancer during follow-up (e.g., accidents or other severe illnesses).

2.4 Data collection and determination of main results

All variables included in this study were categorized into baseline characteristics, tumor stage, pathological information, diagnostic testing data, and clinical treatment information (Table 1). Laboratory parameters reflect the first hematological tests obtained upon admission.

Table 1
www.frontiersin.org

Table 1. Baseline data for training and validation sets.

Diagnosis of bone metastasis required fulfillment of at least one criterion: (1) Detection of metastatic breast cancer cells in bone biopsy; (2) Positive findings on bone scintigraphy and/or positron emission tomography (PET)/PET-CT scan, with confirmation by CT, MRI, or radiography when necessary, demonstrating imaging characteristics consistent with typical bone metastasis (e.g., osteolytic/blastic destruction on CT/MRI) (23). All diagnostic procedures were performed within affiliated departments of our institution.

2.5 Sample size calculation

The sample size for this study was estimated a priori based on the Events Per Variable (EPV) criterion, a well-established method for developing multivariable prediction models to ensure reliable parameter estimates and minimize overfitting.

The expected event rate (p) for the primary outcome bone metastasis, was conservatively set at 15%. This estimate was derived from preliminary analysis of our institutional data.

We anticipated that the final logistic regression model would retain between 4 and 6 key predictor variables following clinical and statistical selection. To ensure a robust sample size, we used the upper limit of this range (6 variables) and the standard minimum EPV value of 10 for calculation.

The minimum sample size was calculated in two steps:

• The required number of events (E) was determined as: E = (Number of predictor variables) × (EPV) = 6 × 10 = 60 events.

• The total minimum sample size (N) was then derived using the formula: N = E/p = 60/0.15 = 400 participants.

This calculation indicated that a minimum of 400 patients was required to develop a stable model. Our final analyzed cohort comprised 421 patients, which exceeds this requirement and provides adequate statistical power for the objectives of this study.

2.6 Establishing and evaluating a nomogram prediction model

This study utilized R software (version 4.4.2) to develop the nomogram prediction model and generate assessment plots. To select the most relevant predictors from a high-dimensional dataset and prevent overfitting, we performed LASSO (Least Absolute Shrinkage and Selection Operator) regression using the “glmnet” package. The variables selected by LASSO were then entered into a multivariable logistic regression model (fitted with the “glm” function) to establish the final prediction model and estimate the regression coefficients (24).

The “rms” package was employed to generate the nomogram (a graphical tool for calculating individual risk) and calibration curve (which plots predicted probabilities against observed frequencies to assess accuracy). We utilized the “rmda” package to conduct decision curve analysis (Decision Curve Analysis, DCA) for assessing the net benefit of our prediction model. The model's discriminative ability was assessed by constructing a receiver operating characteristic (ROC) curve and calculating the area under the curve (AUC) using the “pROC” package.

Ultimately, model performance was comprehensively evaluated from three perspectives: discrimination (AUC), calibration (calibration curve), and clinical utility (DCA).

2.7 Statistical analysis

Continuous variables are expressed as mean ± SD; categorical variables as counts (n). Continuous variables were compared with the Student's t-test; categorical variables with the χ² test or Fisher's exact test. Statistical significance was defined as p < 0.05.

3 Results

3.1 Patient demographics

This study enrolled 421 breast cancer patients with a mean age of 49.98 ± 10.54 years. In the 5 years of follow-up, bone metastasis developed in 48 patients (11.40%); while 373 patients (88.60%) remained metastasis-free. All patients were randomly allocated to a training cohort (n = 317) and a validation cohort (n = 104) in a 75%:25% ratio using complete randomization implemented in R software. Baseline data comparison is presented in Table 1.

As presented in Table 1, the primary tumor location was left-sided in 53.31% of cases and right-sided in 46.69%. The most frequent site was the upper outer quadrant of the breast (52.68%). Luminal A subtype (HR+/HER2−) was the predominant molecular subtype, accounting for 62.46% of cases. The majority of patients exhibited positive estrogen receptor (ER; 75.39%) and progesterone receptor (PR; 67.82%) status. The most common T stage and N stage were T1 (58.04%) and N0 (58.05%), respectively. Given that prior literature reports and our preliminary analysis suggested a weak correlation between the composite AJCC TNM stage and bone metastasis risk, T stage and N stage were included as separate variables in this analysis to allow a deeper investigation into the influence of different aspects of tumor burden. Invasive ductal carcinoma represented 91.17% of breast cancer pathological types.

3.2 LASSO regression analysis summary

We performed LASSO regression on the training cohort data (Figures 2A,B). Figure 2A demonstrates that as the penalty coefficient increased, coefficients of all independent variables shrank towards zero, ultimately being reduced to zero. Figure 2B indicates that the binomial deviance reached its minimum when the predictive model incorporated 10 variables. Features with non-zero coefficients selected through this process included: age at menopause, CA125 level, neoadjuvant chemotherapy, family history of cancer, non-osseous distant metastasis, axillary lymph node metastasis (ALNM), T stage, marital status, primary tumor location, and blood type.

Figure 2
Graph A shows a plot of coefficients versus log lambda, displaying multiple coefficient paths as lambda increases. Graph B illustrates a plot of binomial deviance against log lambda, featuring a red dotted line with error bars indicating variability.

Figure 2. Characteristics of coefficient variation in LASSO regression and analysis of binomial deviance plots. (A) In LASSO regression analysis, as the penalty coefficient increases, ultimately resulting in all coefficients being reduced to zero, with the aim of eliminating irrelevant variables. (B) The position at the left dashed line indicates the point of minimal error, where the model exhibits optimal performance.

3.3 Construction of the nomogram model

Variables exhibiting non-zero coefficients in the least absolute shrinkage and selection operator (LASSO) regression analysis were selected as candidates. This process refined the initial predictor set to 10 features by applying a penalty (lambda) to shrink coefficients and prevent overfitting. These 10 variables were subsequently entered into a multivariable logistic regression model to ascertain independent predictors of breast cancer bone metastasis. The results of the multivariable logistic regression are presented in Table 2. This analysis revealed that six variables retained statistical significance and emerged as independent predictors: neoadjuvant chemotherapy, family history of cancer, non-osseous distant metastasis, axillary lymph node metastasis (ALNM), marital status, and primary tumor location. The regression coefficients (β), odds ratios (OR), and corresponding confidence intervals for each of these predictors are detailed in Table 2. Based on these results, a predictive nomogram was constructed to provide a visual and quantitative tool for individualized risk assessment (Figure 3).

Table 2
www.frontiersin.org

Table 2. Non-zero coefficient variables in multifactorial logistic regression analysis results.

Figure 3
Nomogram visualizing factors influencing bone metastasis risk. Criteria include neoadjuvant chemotherapy, family history, distant metastasis locations, axillary lymph node involvement, marital status, and primary tumor site. Total points correlate with bone metastasis risk probability.

Figure 3. The nomogram predictive model for BMBC. For instance, a patient who failed to receive the recommended neoadjuvant chemotherapy (NAC) (score: 47.5 points), presented with a family history of cancer (score: 42.5 points), exhibited no distant metastasis (DM) (score: 0 points), showed absence of axillary lymph node metastasis (ALNM) (score: 0 points), maintained married status (score: 0 points), and had a primary lesion located in the upper-inner quadrant of breast (score: 30 points), the cumulative risk score totals approximately 120 points. According to the nomogram analysis, this score corresponds to a 40% probability of developing bone metastasis from breast cancer within 5 years, as calibrated by the probability axis on the lower horizontal scale.

The nomogram graphically represents the multivariable logistic regression model. Each of the six independent predictors is assigned a separate axis (“Points” scale). The length of each axis is proportional to the contribution (regression coefficient β) of that variable to the outcome. To use the nomogram, the value for a given patient for each predictor is located on its respective axis, and a vertical line is drawn upward to the ‘Points' line to determine the corresponding score.

For example, a patient with a family history of cancer would receive a higher point contribution than one without, reflecting its positive association with bone metastasis risk. Conversely, married marital status contributes fewer points, aligning with its role as a protective factor. The points for all six variables are then summed to obtain a total points score. This total is subsequently located on the “Total Points” axis. Finally, by drawing a line straight down from the total points to the “Risk of Bone Metastasis” axis, the clinician can read the patient's estimated probability of developing bone metastasis, ranging from 0 to 1.

This nomogram effectively translates the complex statistical model into an intuitive, user-friendly scoring system. It allows for the integration of continuous and categorical variables to generate an individualized risk prediction, facilitating immediate clinical interpretation and supporting shared decision-making at the point of care, without the need for complex calculations.

3.4 Validation of the nomogram

Both the training and validation cohorts achieved high AUC values (0.89 and 0.86, respectively), demonstrating excellent model discriminative performance (Figure 4). As shown in Figure 5A, the x-axis represents the predicted probability of breast cancer bone metastasis, while the y-axis denotes its actual occurrence. Predictive accuracy increased with closer alignment of the calibration curve to the diagonal reference line. Hosmer- Lemeshow tests indicated good calibration in both cohorts (training cohort p = 0.7436; validation cohort p = 0.1142). Both p-values exceeded 0.05, indicating no significant deviation between predicted and observed outcomes and supporting good model fit. Comprehensive evaluation incorporating multiple assessment metrics demonstrated consistently high performance of the developed predictive model in the training and validation cohorts.

Figure 4
ROC curve comparing training and validation sets. The x-axis shows specificity, and the y-axis shows sensitivity. The black line represents the training set with an AUC of 0.888, and the red line represents the validation set with an AUC of 0.856.

Figure 4. Receiver operating characteristic curve of the nomogram prediction model.

Figure 5
Two graphs labeled A and B. Graph A is a Training Set Calibration Curve plotting actual vs. predicted probability, showing a line of logistic calibration with metrics like Dxy and ROC. Graph B is a Decision Curve Analysis, showing standardized net benefit versus threshold probability for the model compared to \

Figure 5. Calibration curve and Decision curve analysis for the nomogram prediction model. (A) Calibration curve for the training set data. The closer the black line is to the diagonal gray line, the higher the accuracy of the predictive model. (B) Decision curve analysis for the nomogram prediction model. The red line represents the nomogram model constructed in this study. The higher the position of this curve, the greater the net benefit of the model in predicting bone metastasis. The gray line represents the strategy of intervening in all patients (assuming all patients will develop bone metastasis), while the black line represents the strategy of not intervening in any patients (assuming no patients will develop bone metastasis). The decision curve analysis of this predictive model demonstrates that across the bone metastasis risk threshold range of 0 to 1, using this model

Decision curve analysis (DCA) of the prediction model (Figure 5B) demonstrated added net benefit across the entire risk threshold spectrum (0–1) for breast cancer bone metastasis, outperforming both “treat-all” and “treat-none” strategies. This indicates broad clinical applicability of the nomogram.

4 Discussion

Metastasis represents the primary cause of mortality in the majority of breast cancer patients (25). Bone metastasis imposes a dual burden characterized by its high incidence rate and propensity for severe complications (26). Critically, upon diagnosis of bone metastasis, patients experience significantly compromised survival outcomes, with median survival further reduced to 2.1–4 years (5). Although current imaging modalities (e.g., bone scintigraphy, PET/CT) can detect osseous metastases, their application faces significant limitations. Traditional imaging screening demonstrates low detection probability and high false-positive rates for most patients with low metastatic risk—particularly stages I-II breast cancer. Moreover, these approaches impose substantial financial burdens and heighten patient anxiety regarding ionizing radiation exposure (27, 28). However, no readily applicable prediction model currently exists that simultaneously graphically represents both risk and protective factors for delayed bone metastasis in newly diagnosed breast cancer patients.

Therefore, the development of a practical risk prediction model to identify breast cancer patients at high risk of bone metastasis and facilitate personalized interventions holds significant clinical value. The nomogram constructed in this study integrates six independent predictors—neoadjuvant chemotherapy, family history of cancer, non-osseous distant metastasis, axillary lymph node metastasis, marital status, and primary tumor location—to translate a multivariate logistic regression model into an intuitive graphical scoring system. Unlike conventional risk stratification methods, the nomogram enables clinicians to rapidly and quantitatively assess individualized risk for each patient, thereby avoiding a one-size-fits-all approach. For instance, for a married patient with no family history of cancer but with ≥4 metastatic axillary lymph nodes, the nomogram provides a precise probability of risk through cumulative points, rather than simplistically classifying the case as “high-risk” or “low-risk.” Such granular risk quantification supports the shift from population-based medicine to personalized care, aligning with the principles of precision medicine (18, 29).

Although some popular machine learning models are capable of performing such predictive tasks, their clinical utility is often limited by the “black-box” nature of complex algorithms, which compromises interpretability (17). In contrast, the decision-making logic of the nomogram developed in this study is fully transparent: the contribution of each variable is clearly reflected through axis length and point assignment, making it easy for both physicians and patients to understand the sources of risk. This transparency greatly facilitates shared decision-making between clinicians and patients, enabling more informed participation in treatment choices and improving adherence. In clinical practice, this nomogram offers a practical tool for long-term management of breast cancer patients. On one hand, for patients predicted to be at high risk (e.g., those with a bone metastasis probability >20%), clinicians may recommend intensified follow-up strategies—such as shortening the interval for bone scintigraphy or PET-CT scans—and consider prophylactic use of bisphosphonates to reduce the risk of skeletal-related events. On the other hand, for low-risk patients (e.g., those with a probability <5%), unnecessary routine bone scans can be avoided, thereby reducing medical costs, radiation exposure, and patient anxiety.

In this study, we analyzed 421 patients with 5-year follow-up data and identified six independent predictors of bone metastasis in breast cancer. Consistent with prior reports, ≥4 metastatic axillary lymph nodes constituted an independent risk factor for bone metastasis. Marco Colleoni et al. demonstrated that patients with ≥4 involved axillary lymph nodes at diagnosis exhibited the highest cumulative incidence of bone metastasis (30). Similarly, Chen Zhao et al. reported a significant correlation between higher numbers of metastatic axillary lymph nodes and breast cancer bone metastasis (16). Yann Delpech et al. further confirmed that nodal status was independently and significantly associated with subsequent bone metastasis in breast cancer. Our findings align with these studies, showing that patients with ≥4 metastatic axillary lymph nodes had significantly higher likelihood of developing bone metastasis compared to those without nodal involvement.

Notably, marital status was identified as an independent predictor of bone metastasis in breast cancer, with married status serving as a protective factor. This finding aligns with previous reports: Gao et al. identified unmarried status as a risk factor for bone metastasis in breast cancer patients (31), while Guo et al. demonstrated a positive correlation between unmarried status and both the incidence and poor prognosis of bone metastasis (32) Furthermore, in the risk prediction model developed by Li et al., non-married marital status was significantly associated with increased bone metastasis risk (33). Collectively, these studies suggest that unmarried women face higher risks of bone metastasis post-diagnosis and exhibit poorer prognosis compared to their married counterparts—a conclusion consistent with our findings.

The association between marital status and bone metastasis risk may be underpinned by several interlinked pathophysiological and psychosocial mechanisms. First, from a biological perspective, reproductive history and endogenous hormone exposure differ by marital status. Unmarried/nulliparous women may experience longer periods of uninterrupted ovulation and cumulative estrogen exposure, a known promoter of breast cancer progression. Evidence suggests that estrogen can upregulate factors like parathyroid hormone-related protein (PTHrP) and receptor activator of nuclear factor kappa-B ligand (RANKL) within the bone microenvironment, thereby facilitating the colonization and growth of metastatic cells (34, 35). Second, chronic stress associated with social isolation or lack of spousal support may dysregulate the hypothalamic-pituitary-adrenal (HPA) axis, leading to elevated cortisol levels and systemic inflammation. This pro-inflammatory state, characterized by increased levels of cytokines such as IL-6 and TNF-α, can promote tumor cell invasion, angiogenesis, and bone resorption, creating a favorable niche for metastasis (36). Third, differences in healthcare-seeking behaviors and treatment adherence are well-documented. Married patients often benefit from spousal encouragement and logistical support, leading to better compliance with adjuvant therapies, timely follow-up, and earlier management of complications—all critical for metastasis suppression (37, 38). Our results underscore the need for heightened clinical vigilance regarding treatment adherence and follow-up compliance in unmarried breast cancer patients. Enhanced awareness and tailored interventions for this demographic may improve outcomes.

Additionally, this study identified distant metastasis in other locations as an independent risk factor for bone metastasis. These include lung, brain, and liver metastases. Metastasis to one organ may accelerate cancer spread to others. Bone metastasis could substantially increase risks of additional metastases elsewhere, and vice versa (39). Prior studies confirm our findings: Brain, lung, and liver metastases significantly correlate with skeletal metastasis risk (8, 33). Thus, patients with existing non-bone metastases require enhanced surveillance for bone metastasis risk and timely implementation of targeted interventions to prevent further deterioration in quality of life.

Concurrently, our study identified completion of guideline-recommended neoadjuvant chemotherapy as an independent protective factor against bone metastasis in breast cancer. Current research primarily focuses on neoadjuvant chemotherapy's role in tumor shrinkage and surgical outcomes (40). Emerging evidence confirms its protective effect: Preoperative neoadjuvant chemotherapy improves surgical success and may reduce bone metastasis risk (4143). Mechanistically, neoadjuvant chemotherapy directly eliminates disseminated tumor cells in bone marrow niches. It particularly targets EpCAM-positive subpopulations, suppressing metastatic seeding (44). Through such actions—including micro-metastasis eradication—it confers significant protection against skeletal spread. This study pioneers the incorporation of this biological mechanism into bone metastasis prediction models. Our approach enhances clinical utility. For guideline-eligible patients, strict adherence to complete neoadjuvant chemotherapy regimens must be enforced to mitigate subsequent metastasis risk, with mandatory therapeutic response monitoring.

Studies indicate higher rates of sternal bone metastasis in patients with inner-quadrant primary tumors. This likely reflects medial breast tissue's lymphatic drainage toward parasternal nodes (45). Our study confirms Upper-inner quadrant tumor location is an independent risk factor for breast cancer bone metastasis. Its higher regression coefficient warrants enhanced vigilance: Prioritize rigorous imaging review for inner-quadrant primary tumors.

Risk factor analysis identified neoadjuvant chemotherapy, family history of cancer, distant metastasis to other sites, axillary lymph node metastasis, marital status, and primary tumor location as factors associated with bone metastasis in breast cancer patients. Notably, the baseline table comprehensively lists all parameters collected in this study, which were selected based on an extensive literature review of factors related to breast cancer bone metastasis. The systematic collection and presentation of these parameters provide a more complete informational background for clinical risk assessment. Although some parameters showed that these factors may have limited direct predictive value for bone metastasis in the specific population or context of this study, or their effects might be overshadowed by stronger predictors, they still hold significant clinical relevance. For instance, although breast cancer subtype was not included as a core predictive variable, it is important to note that breast cancer subtype is an extremely important biological parameter in assessing the risk of bone metastasis. Previous studies have confirmed that different subtypes exhibit significant heterogeneity in distant metastasis tendencies and sites due to their intrinsic molecular characteristics and tumor microenvironment differences (46). Therefore, we still recommend that clinicians consider the predictive model established in this study in conjunction with important biological characteristics such as breast cancer subtype when evaluating individual patient risk, to form a comprehensive and multidimensional judgment. Future studies could further explore the potential value or interactions of these factors in larger sample sizes.

This study offers several notable advantages: First, the model was constructed using clinical data from a Northern Chinese breast cancer cohort, providing a more accurate reflection of the biological characteristics and bone metastasis-related information specific to this regional population. Second, compared to prior bioinformatics studies reliant on public databases (e.g., SEER), our analysis incorporated more comprehensive variables, including detailed records of systemic therapies such as neoadjuvant chemotherapy regimens, endocrine therapy, and targeted agent administration. This granular classification of treatment information addresses a significant gap in previous public database studies, enabling the provision of precise guidance for individualized treatment strategies (8). Third, models developed for specific populations may exhibit limited generalizability to others. By utilizing local clinical data, the present model is likely to demonstrate greater compatibility with Chinese patients than prediction models derived from public databases primarily representing European and American populations.

However, this study has several limitations. Although the model demonstrated excellent predictive performance, as indicated by the ROC curves and calibration plots, its single-center, retrospective design inevitably led to the exclusion of some cases and may have introduced selection bias. Furthermore, the lack of external multi-center validation necessitates future multi-center and prospective studies to enhance the generalizability of our findings. We plan to expand data collection in subsequent research to further improve the model's reliability.

5 Conclusions

Our nomogram demonstrates that failure to receive appropriate neoadjuvant chemotherapy, a family history of cancer, the presence of distant metastases in other locations, involvement of four or more metastatic axillary lymph nodes, unmarried status, and primary tumor location are independent risk factors for bone metastasis in breast cancer patients. By constructing the nomogram, we aimed to identify high-risk patients with potential bone metastasis, thereby enhancing predictive accuracy for this outcome. The model exhibited robust predictive performance and reliability during both internal evaluation and external validation. Early risk stratification using this tool may optimize follow-up and therapeutic strategies, assisting clinicians in formulating personalized precision medicine approaches.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.

Ethics statement

The studies involving humans were approved by the Ethics Committee of The Affiliated Hospital of Qingdao University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

YL: Formal analysis, Writing – original draft, Methodology, Conceptualization. TM: Validation, Supervision, Software, Writing – review & editing. XS: Writing – review & editing, Visualization, Project administration. CL: Project administration, Formal analysis, Writing – review & editing. HW: Writing – review & editing, Supervision.

Funding

The author(s) declared that financial support was not received for this work and/or its publication.

Acknowledgments

The authors extend their deepest gratitude to all anonymous patients who participated in this study. We honor the trust placed in our investigative team.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Hipp LE, Hulswit BB, Milliron KJ. Clinical tools and counseling considerations for breast cancer risk assessment and evaluation for hereditary cancer risk. Best Pract Res Clin Obstet Gynaecol. (2022) 82:12–29. doi: 10.1016/j.bpobgyn.2022.02.001

PubMed Abstract | Crossref Full Text | Google Scholar

2. Liang Y, Zhang H, Song X, Yang Q. Metastatic heterogeneity of breast cancer: molecular mechanism and potential therapeutic targets. Semin Cancer Biol. (2020) 60:14–27. doi: 10.1016/j.semcancer.2019.08.012

PubMed Abstract | Crossref Full Text | Google Scholar

3. Tulotta C, Ottewell P. The role of IL-1B in breast cancer bone metastasis. Endocr Relat Cancer. (2018) 25:R421–34. doi: 10.1530/erc-17-0309

PubMed Abstract | Crossref Full Text | Google Scholar

4. Wang Y, Hu Y, Wang M, Wang M, Xu Y. The role of breast cancer cells in bone metastasis: suitable seeds for nourishing soil. Curr Osteoporos Rep. (2024) 22:28–43. doi: 10.1007/s11914-023-00849-9

PubMed Abstract | Crossref Full Text | Google Scholar

5. Zhang H, Zhu W, Biskup E, Yang W, Yang Z, Wang H, et al. Incidence, risk factors and prognostic characteristics of bone metastases and skeletal-related events (SREs) in breast cancer patients: a systematic review of the real world data. J Bone Oncol. (2018) 11:38–50. doi: 10.1016/j.jbo.2018.01.004

PubMed Abstract | Crossref Full Text | Google Scholar

6. Marie L, Braik D, Abdel-Razeq N, Abu-Fares H, Al-Thunaibat A, Abdel-Razeq H. Clinical characteristics, prognostic factors and treatment outcomes of patients with bone-only metastatic breast cancer. Cancer Manag Res. (2022) 14:2519–31. doi: 10.2147/cmar.S369910

PubMed Abstract | Crossref Full Text | Google Scholar

7. Ye L-J, Suo H-D, Liang C-Y, Zhang L, Jin Z-N, Yu C-Z, et al. Nomogram for predicting the risk of bone metastasis in breast cancer: a SEER population-based study. Transl Cancer Res. (2020) 9:6710–9. doi: 10.21037/tcr-20-2379

PubMed Abstract | Crossref Full Text | Google Scholar

8. Huang Z, Hu C, Liu K, Yuan L, Li Y, Zhao C, et al. Risk factors, prognostic factors, and nomograms for bone metastasis in patients with newly diagnosed infiltrating duct carcinoma of the breast: a population-based study. BMC Cancer. (2020) 20:1145. doi: 10.1186/s12885-020-07635-1

PubMed Abstract | Crossref Full Text | Google Scholar

9. Tian Z, Li C, Wang X, Sun H, Zhang P, Yu Z. Prediction of bone metastasis risk of early breast cancer based on nomogram of clinicopathological characteristics and hematological parameters. Front Oncol. (2023) 13:1136198. doi: 10.3389/fonc.2023.1136198

PubMed Abstract | Crossref Full Text | Google Scholar

10. Hou N, Yi J, Wang Z, Yang L, Wu Y, Huang M, et al. Development and validation of a risk stratification nomogram for predicting prognosis in bone metastatic breast cancer: a population-based study. Medicine (Baltimore). (2021) 100:e24751. doi: 10.1097/md.0000000000024751

PubMed Abstract | Crossref Full Text | Google Scholar

11. Harbeck N, Penault-Llorca F, Cortes J, Gnant M, Houssami N, Poortmans P, et al. Breast cancer. Nat Rev Dis Primers. (2019) 5:66. doi: 10.1038/s41572-019-0111-2

PubMed Abstract | Crossref Full Text | Google Scholar

12. Raghavendra AS, Ibrahim NK. Breast cancer brain metastasis: a comprehensive review. JCO Oncol Pract. (2024) 20:1348–59. doi: 10.1200/op.23.00794

PubMed Abstract | Crossref Full Text | Google Scholar

13. Zhang Z, Chen L, Chen Z, Guan S, He D, Song H, et al. Machine learning prediction of overall survival in patients with cT1b renal cell carcinoma after surgical resection using the SEER database. Sci Rep. (2025) 15:38366. doi: 10.1038/s41598-025-22342-2

PubMed Abstract | Crossref Full Text | Google Scholar

14. Zhao R-S, Liu Y-N, Dai W-G, Chen S-L, Ye J-N, Zhai E-T, et al. A substage increase in the AJCC classification system improves prognostic prediction in stage III gastric cancer with insufficient lymph nodes removed. Front Oncol. (2021) 11:624413. doi: 10.3389/fonc.2021.624413

PubMed Abstract | Crossref Full Text | Google Scholar

15. Yao YB, Zheng XE, Luo XB, Wu AM. Incidence, prognosis and nomograms of breast cancer with bone metastases at initial diagnosis: a large population-based study. Am J Transl Res. (2021) 13:10248–61. doi: 10.62347/UHTA8948

PubMed Abstract | Crossref Full Text | Google Scholar

16. Chen W-Z, Shen J-F, Zhou Y, Chen X-Y, Liu J-M, Liu Z-L. Clinical characteristics and risk factors for developing bone metastases in patients with breast cancer. Sci Rep. (2017) 7:11325. doi: 10.1038/s41598-017-11700-4

PubMed Abstract | Crossref Full Text | Google Scholar

17. Liu W-C, Li M-P, Hong W-Y, Zhong Y-X, Sun B-L, Huang S-H, et al. A practical dynamic nomogram model for predicting bone metastasis in patients with thyroid cancer. Front Endocrinol (Lausanne). (2023) 14:1142796. doi: 10.3389/fendo.2023.1142796

PubMed Abstract | Crossref Full Text | Google Scholar

18. Lu S, Xia X, Shi X, Qin X, Wang C, Wei W. A nomogram for postoperative pain relief in patients with osteoporotic vertebral compression fracture treated with polymethylmethacrylate bone cement. Sci Rep. (2025) 15:1780. doi: 10.1038/s41598-025-85820-7

PubMed Abstract | Crossref Full Text | Google Scholar

19. Zheng C, Gu X-t, Huang X-l, Wei Y-c, Chen L, Luo N-b, et al. Nomogram based on clinical and preoperative CT features for predicting the early recurrence of combined hepatocellular-cholangiocarcinoma: a multicenter study. Radiol Med. (2023) 128:1460–71. doi: 10.1007/s11547-023-01726-2

PubMed Abstract | Crossref Full Text | Google Scholar

20. Wang Z, Cheng Y, Chen S, Shao H, Chen X, Wang Z, et al. Novel prognostic nomograms for female patients with breast cancer and bone metastasis at presentation. Ann Transl Med. (2020) 8:197. doi: 10.21037/atm.2020.01.37

PubMed Abstract | Crossref Full Text | Google Scholar

21. Lee SC, Jain PA, Jethwa SC, Tripathy D, Yamashita MW. Radiologist's role in breast cancer staging: providing key information for clinicians. Radiographics. (2014) 34:330–42. doi: 10.1148/rg.342135071

PubMed Abstract | Crossref Full Text | Google Scholar

22. Giuliano AE, Connolly JL, Edge SB, Mittendorf EA, Rugo HS, Solin LJ, et al. Breast cancer-major changes in the American joint committee on cancer eighth edition cancer staging manual. CA Cancer J Clin. (2017) 67:290–303. doi: 10.3322/caac.21393

PubMed Abstract | Crossref Full Text | Google Scholar

23. Ellmann S, Seyler L, Evers J, Heinen H, Bozec A, Prante O, et al. Prediction of early metastatic disease in experimental breast cancer bone metastasis by combining PET/CT and MRI parameters to a model-averaged neural network. Bone. (2019) 120:254–61. doi: 10.1016/j.bone.2018.11.008

PubMed Abstract | Crossref Full Text | Google Scholar

24. Dutta A, Batabyal T, Basu M, Acton ST. An efficient convolutional neural network for coronary heart disease prediction. Expert Syst Appl. (2020) 159:113408. doi: 10.1016/j.eswa.2020.113408

Crossref Full Text | Google Scholar

25. Li Z, Kang Y. Emerging therapeutic targets in metastatic progression: a focus on breast cancer. Pharmacol Ther. (2016) 161:79–96. doi: 10.1016/j.pharmthera.2016.03.003

PubMed Abstract | Crossref Full Text | Google Scholar

26. Hao X, Jiang B, Wu J, Xiang D, Xiong Z, Li C, et al. Nanomaterials for bone metastasis. J Control Release. (2024) 373:640–51. doi: 10.1016/j.jconrel.2024.07.067

PubMed Abstract | Crossref Full Text | Google Scholar

27. Arnaout A, Varela NP, Allarakhia M, Grimard L, Hey A, Lau J, et al. Baseline staging imaging for distant metastasis in women with stages I, II, and III breast cancer. Curr Oncol. (2020) 27:e123–45. doi: 10.3747/co.27.6147

PubMed Abstract | Crossref Full Text | Google Scholar

28. Li J-w, Sheng D-l, Chen J-g, You C, Liu S, Xu H-x, et al. Artificial intelligence in breast imaging: potentials and challenges. Phys Med Biol. (2023) 68:23TR01. doi: 10.1088/1361-6560/acfade

Crossref Full Text | Google Scholar

29. Iasonos A, Schrag D, Raj GV, Panageas KS. How to build and interpret a nomogram for cancer prognosis. J Clin Oncol. (2008) 26:1364–70. doi: 10.1200/jco.2007.12.9791

PubMed Abstract | Crossref Full Text | Google Scholar

30. Colleoni M, O'Neill A, Goldhirsch A, Gelber RD, Bonetti M, Thürlimann B, et al. Identifying breast cancer patients at high risk for bone metastases. J Clin Oncol. (2000) 18:3925–35. doi: 10.1200/jco.2000.18.23.3925

PubMed Abstract | Crossref Full Text | Google Scholar

31. Gao T, Shao F. Risk factors and prognostic factors for inflammatory breast cancer with bone metastasis: a population-based study. J Orthop Surg. (2021) 29:23094990211000144. doi: 10.1177/23094990211000144

PubMed Abstract | Crossref Full Text | Google Scholar

32. Guo X, Zhang C, Guo Q, Xu Y, Feng G, Li L, et al. The homogeneous and heterogeneous risk factors for the morbidity and prognosis of bone metastasis in patients with prostate cancer. Cancer Manag Res. (2018) 10:1639–46. doi: 10.2147/cmar.S168579

PubMed Abstract | Crossref Full Text | Google Scholar

33. Li M, Yu W, Zhang C, Li H, Li X, Song F, et al. Reclassified the phenotypes of cancer types and construct a nomogram for predicting bone metastasis risk: a pan-cancer analysis. Cancer Med. (2024) 13:e7014. doi: 10.1002/cam4.7014

PubMed Abstract | Crossref Full Text | Google Scholar

34. Clines GA, Guise TA. Hypercalcaemia of malignancy and basic research on mechanisms responsible for osteolytic and osteoblastic metastasis to bone. Endocr Relat Cancer. (2005) 12:549–83. doi: 10.1677/erc.1.00543

PubMed Abstract | Crossref Full Text | Google Scholar

35. Infante M, Fabi A, Cognetti F, Gorini S, Caprio M, Fabbri A. RANKL/RANK/OPG system beyond bone remodeling: involvement in breast cancer and clinical perspectives. J Exp Clin Cancer Res. (2019) 38:12. doi: 10.1186/s13046-018-1001-2

PubMed Abstract | Crossref Full Text | Google Scholar

36. Cole SW, Nagaraja AS, Lutgendorf SK, Green PA, Sood AK. Sympathetic nervous system regulation of the tumour microenvironment. Nat Rev Cancer. (2015) 15:563–72. doi: 10.1038/nrc3978

PubMed Abstract | Crossref Full Text | Google Scholar

37. Wheldon CW, Boehmer U. Marital Status and quality of life in cancer survivors by sexual and gender identity. Psychooncology. (2025) 34:e70346. doi: 10.1002/pon.70346

PubMed Abstract | Crossref Full Text | Google Scholar

38. Wang L, Wilson SE, Stewart DB, Hollenbeak CS. Marital status and colon cancer outcomes in US surveillance, epidemiology and end results registries: does marriage affect cancer survival by gender and stage? Cancer Epidemiol. (2011) 35:417–22. doi: 10.1016/j.canep.2011.02.004

PubMed Abstract | Crossref Full Text | Google Scholar

39. Xie J, Ying Y-Y, Xu B, Li Y, Zhang X, Li C. Metastasis pattern and prognosis of male breast cancer patients in US: a population-based study from SEER database. Ther Adv Med Oncol. (2019) 11:1758835919889003. doi: 10.1177/1758835919889003

PubMed Abstract | Crossref Full Text | Google Scholar

40. Charfare H, Limongelli S, Purushotham AD. Neoadjuvant chemotherapy in breast cancer. Br J Surg. (2005) 92:14–23. doi: 10.1002/bjs.4840

PubMed Abstract | Crossref Full Text | Google Scholar

41. Lyu X, Luo B. Prognostic factors and survival prediction in HER2-positive breast cancer with bone metastases: a retrospective cohort study. Cancer Med. (2021) 10:8114–26. doi: 10.1002/cam4.4326

PubMed Abstract | Crossref Full Text | Google Scholar

42. Li C, Liu M, Li J, Wang W, Feng C, Cai Y, et al. Machine learning predicts the prognosis of breast cancer patients with initial bone metastases. Front Public Health. (2022) 10:1003976. doi: 10.3389/fpubh.2022.1003976

PubMed Abstract | Crossref Full Text | Google Scholar

43. Yang H, Ouyang Q-C, Yan M, Wang X-J, Hu X-C, Jiang Z-F, et al. Age at initial diagnosis and prognosis of breast cancer: a nationwide multicenter retrospective study in China. Ann Transl Med. (2022) 10:813. doi: 10.21037/atm-22-302

PubMed Abstract | Crossref Full Text | Google Scholar

44. Watson MA, Ylagan LR, Trinkaus KM, Gillanders WE, Naughton MJ, Weilbaecher KN, et al. Solation and molecular profiling of bone marrow micrometastases identifies TWIST1 as a marker of early tumor relapse in breast cancer patients. Clin Cancer Res. (2007) 13:5001–9. doi: 10.1158/1078-0432.Ccr-07-0024

PubMed Abstract | Crossref Full Text | Google Scholar

45. Xiang Y, Tie J, Zhang S, Shi C, Guo C, Peng Y, et al. Explainable machine learning model for predicting internal mammary node metastasis in breast cancer: multi-method development and cross-cohort validation. The Breast. (2025) 82:104517. doi: 10.1016/j.breast.2025.104517

PubMed Abstract | Crossref Full Text | Google Scholar

46. Yue G, Jing Z, Peng J, Hong L, Xin H, Zhi-Ming S. Incidence proportions and prognosis of breast cancer patients with bone metastases at initial diagnosis. Cancer Med. (2018) 7:4156–69. doi: 10.1002/cam4.1668

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: bone metastasis, breast cancer, cancer, nomogram, predictive model

Citation: Li Y, Ma T, Sun X, Liu C and Wang H (2026) Development and validation of a nomogram for predicting bone metastasis in breast cancer: a retrospective study. Front. Surg. 12:1722983. doi: 10.3389/fsurg.2025.1722983

Received: 11 October 2025; Revised: 19 December 2025;
Accepted: 29 December 2025;
Published: 20 January 2026.

Edited by:

Stefano Spinaci, Ospedale Villa Scassi, Italy

Reviewed by:

Zoaib Habib Tharwani, Dow University of Health Sciences, Pakistan
Cristian Iorga, Carol Davila University of Medicine and Pharmacy, Romania

Copyright: © 2026 Li, Ma, Sun, Liu and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Haibo Wang, SGFpYm9XYW5nUURAMTYzLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.