Skip to main content

ORIGINAL RESEARCH article

Front. Oncol., 12 February 2024
Sec. Breast Cancer

Comparison of machine learning models to predict the risk of breast cancer-related lymphedema among breast cancer survivors: a cross-sectional study in China

Jiali DuJiali Du1Jing Yang*Jing Yang1*Qing YangQing Yang2Xin ZhangXin Zhang1Ling YuanLing Yuan1Bing FuBing Fu1
  • 1Department of Breast Surgery, Sichuan Clinical Research Center for Cancer, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, Affiliated Cancer Hospital of University of Electronic Science and Technology of China, Chengdu, China
  • 2Department of Nursing, Sichuan Clinical Research Center for Cancer, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, Affiliated Cancer Hospital of University of Electronic Science and Technology of China, Chengdu, China

Objective: The aim of this study was to develop and validate a series of breast cancer-related lymphoedema risk prediction models using machine learning algorithms for early identification of high-risk individuals to reduce the incidence of postoperative breast cancer lymphoedema.

Methods: This was a retrospective study conducted from January 2012 to July 2022 in a tertiary oncology hospital. Subsequent to the collection of clinical data, variables with predictive capacity for breast cancer-related lymphoedema (BCRL) were subjected to scrutiny utilizing the Least Absolute Shrinkage and Selection Operator (LASSO) technique. The entire dataset underwent a randomized partition into training and test subsets, adhering to a 7:3 distribution. Nine classification models were developed, and the model performance was evaluated based on accuracy, sensitivity, specificity, recall, precision, F-score, and area under curve (AUC) of the ROC curve. Ultimately, the selection of the optimal model hinged upon the AUC value. Grid search and 10-fold cross-validation was used to determine the best parameter setting for each algorithm.

Results: A total of 670 patients were investigated, of which 469 were in the modeling group and 201 in the validation group. A total of 174 had BCRL (25.97%). The LASSO regression model screened for the 13 features most valuable in predicting BCRL. The range of each metric in the test set for the nine models was, in order: accuracy (0.75–0.84), sensitivity (0.50–0.79), specificity (0.79–0.93), recall (0.50–0.79), precision (0.51–0.70), F score (0.56–0.69), and AUC value (0.71–0.87). Overall, LR achieved the best performance in terms of accuracy (0.81), precision (0.60), sensitivity (0.79), specificity (0.82), recall (0.79), F-score (0.68), and AUC value (0.87) for predicting BCRL.

Conclusion: The study established that the constructed logistic regression (LR) model exhibits a more favorable amalgamation of accuracy, sensitivity, specificity, recall, and AUC value. This configuration adeptly discerns patients who are at an elevated risk of BCRL. Consequently, this precise identification equips nurses with the means to undertake timely and tailored interventions, thus averting the onset of BCRL.

Introduction

The most recent data derived from GLOBOCAN 2020 (1)showed that breast cancer, for the first time, has surpassed lung cancer as the most prevalent malignancy among women, with an estimated 2.3 million new cases globally, accounting for 11.7% of overall cancer incidence. Notably, advancements in early detection methods and therapeutic interventions have contributed to a notable enhancement in the 5-year relative survival rate for breast cancer patients. Over the span of the last three decades, this rate has surged from 79% to 91% (2).

As the survival prospects for breast cancer patients continue to improve, there arises an imperative to elevate the quality of life for women grappling with the complications of this ailment. Breast cancer-related lymphoedema (BCRL) emerges as a frequent chronic complication after breast cancer surgery. This condition typically arises due to the accumulation of protein-rich fluids within tissue interstitial spaces, occasioned by deficient lymphatic drainage (3). Lymphedema engenders swelling, distortion, and impaired functionality within the affected limbs, profoundly influencing both physical and psychological well-being, and the overall quality of life (4). However, investigations into the authentic prevalence of BCRL exhibit disparities, with reported incidence rates spanning from 6% to 83% (5). Despite recent trends indicating a decline in the occurrence of BCRL, the challenge of its treatment remains formidable. Effective management strategies for lymphoedema are still lacking, thereby underscoring the importance of a preventative approach to mitigate its onset.

A disease risk prediction model constitutes a statistical methodology rooted in the evaluation of various disease-associated risk factors. These factors are assigned scores in correspondence with their impact magnitude, and subsequently, the likelihood of an impending event is computed through a mathematical formulation (6). This model yields a more refined assessment of the likelihood of a particular outcome transpiring. This, in turn, facilitates the implementation of precisely tailored interventions targeting diverse risk cohorts, thus yielding a pronounced impact on augmenting patient prognosis (7).

The term “Machine Learning” (ML) was introduced by Arthur Samuel (8) in 1959. It denotes an assemblage of algorithmic techniques designed for the purpose of data representation and analysis, a domain that has found widespread application in the realm of cancer care research (9). ML methodologies demonstrate enhanced efficacy in addressing problems characterized by a profusion of potential predictors, in contrast to conventional statistical paradigms (10). While many studies within the domain of China have sought to devise predictive models for BCRL, a considerable subset of these inquiries have predominantly relied upon classical approaches such as logistic regression. These methodologies may deviate from ML algorithms with respect to key metrics such as accuracy, sensitivity, and specificity. It is pertinent to underscore that the arena of machine learning encompasses diverse algorithmic classifications, encompassing entities like support vector machines (SVM), decision trees (DT), and random forests (RF). The efficacy of predictive models forged through disparate algorithmic classifications inherently exhibits discrepancy. It is imperative to undertake intermodel comparisons for the problem of BCRL.

The objective of this study was to formulate and subsequently validate an array of predictive models with a specific focus on predicting the risk of BCRL, and to validate the model using data from a large sample retrospective cohort study. These models possess the potential to aid clinical practitioners in the identification of individuals who are predisposed to a heightened risk profile, which enable the timely implementation of targeted interventions, thereby contributing to a reduction in the occurrence of BCRL.

Materials and methods

This was a cross-sectional study of consecutive cases diagnosed with breast cancer during the period 2021–2022. Ethical approval was obtained from Sichuan Cancer Hospital (Approval No. SCCHEC-02-2022-005).

Study design

We conducted a survey-based cross-sectional study in a tertiary oncology hospital (Sichuan Cancer Hospital & Research Institute), the largest oncology hospital in Southwestern China, which guaranteed a sufficient sample size. Participants were recruited using the following criteria: (1) ≥18 years of age at the time of surgery; (2) patient diagnosed with breast cancer by histopathology and received surgical treatment for breast cancer; (3) patients with normal reading, comprehension, and expression skills; and (5) agreed to participate in the study. Exclusion criteria were as follows: (1) patients with other serious diseases, such as severe heart, cardiogenic, and renal failure; (2) surgical history and injury history of the affected upper limb before breast cancer surgery; (3) bilateral breast cancer; (4) lymphedema caused by other reasons, such as nephrogenic, cardiogenic, hypoproteinemia, deep vein thrombosis, and so on.

Data collection procedures

The demographic, clinical, disease, treatment, and behavioral particulars of the patients were procured through the questionnaires and retrieval from the electronic medical record system (Table 1). The items of the lymphedema risk-reduction behavior checklist are shown in Table 2. Prior to the distribution of questionnaires, a comprehensive explanation regarding the study’s objectives and significance was conveyed to the patients and their families. Upon securing the patients’ consent, the individuals independently completed the demographic and behavioral data questionnaires. To ensure meticulous accuracy in information collection, a diverse array of methods was employed for data collection, including the retrieval of data from electronic medical records, utilization of the WeChat (a messaging application), and telephone interview.

Table 1
www.frontiersin.org

Table 1 The study’s variable and their corresponding categories utilized in predicting BCRL.

Table 2
www.frontiersin.org

Table 2 Lymphedema risk-reduction behavior checklist.

Evaluation of outcome

Diagnostic criteria for upper limb lymphedema: objective indexes were measured by circumferential measurement, and the circumferential diameters of the transverse carpal stripe, 10 cm above the transverse carpal stripe, elbow fossa, and 10 cm above the elbow fossa of the upper limbs of the bilateral upper limbs were measured with a non-elastic soft ruler. A difference of ≥2.0 cm between any point of the affected and healthy upper limbs was determined as breast cancer-related lymphedema. A difference of 6 cm was considered severe edema.

Statistical analysis

Statistical analysis was performed using python (version: 3.7.0) and tableone package (version: 0.7.10) software. Because the data were obtained by standardized Z-scoring, they conformed to a normal distribution, where measures were described using mean ± standard deviation and counts were described using frequencies and percentages. An independent-samples T test was used to conduct the between-group comparisons. LASSO analysis was used to select the predictive variables. Before constructing the ML model, the input data are randomly divided into training and test sets. About 70% of the data is used for the training of the predictive model, and nearly 30% of the data is used for the validation of the test dataset. Recall, precision, accuracy, F-score, area under curve (AUC), sensitivity, and specificity based on the confusion matrix were used as the indicators for evaluating the predictive performance of the model.

Results

Patient participation and characteristics

On the basis of the circumferential measurements, 670 postoperative breast cancer patients were categorized into non-lymphedema and lymphedema groups, of which 496 (74.03%) were non-lymphedema patients and 174 (25.97%) were lymphedema patients. Comparisons between groups of the two groups of patients in terms of demographic, disease, and treatment information are shown in Table 3. The analytical outcomes underscored the presence of statistical significance across different categories of postoperative breast cancer patients in relation to attributes including BMI, hypertension, clinical staging, T-stage, N-stage, pathological type, age, type of surgery, type of lymph node surgery, surgical side, level of lymph node dissection, number of removed lymph nodes, number of positive lymph nodes, neoadjuvant chemotherapy, adjuvant chemotherapy, postoperative radiotherapy, endocrine therapy (P < 0.05).

Table 3
www.frontiersin.org

Table 3 Comparison of demographic, disease, and treatment information of postoperative breast cancer patients between different groups.

Using LASSO regression for feature screening and cross-validation method to select the optimal model parameters, the 13 predictive values for BCRL were finally screened out. The 13 characteristics were as follows: BMI, number of positive lymph nodes, surgical side, N stage, neoadjuvant chemotherapy (NAC), type of lymph node surgery, postoperative radiotherapy, type of surgery, clinical staging, entry 1, entry 5, entry 6, and entry 12.The ranking of the importance of the 13 features screened for the results is shown in Figure 1.

Figure 1
www.frontiersin.org

Figure 1 Importance ranking of variables.

The ML models’ prediction performance

In this study, support vector machines (SVM), stochastic gradient descent (SGD), K-nearest neighbor (KNN), decision trees (DT), random forests (RF), extra trees (ET), extreme gradient boosting (XGBoost), light gradient boosting machine (LightGBM), and logistic regression (LR) were used for modeling in the training set and internally validated with the test set. This study used a method combining grid search and 10-fold cross-validation, which is realized by calling the GridSearchCV function in sklearn library. After finding the optimal parameter through GridSearchCV, it was directly brought into the model to observe the performance of the model.

After analysis, the ROC curves for the nine models is shown in Figure 2. Internal validation and external validation were performed for each model category, followed by a comparison of the performance indicators of the nine models combined, and finally LR was found to have the best predictive performance based on the AUC values (0.87), with 81% accuracy, 82% specificity, and 79% recall for BCRL. The performance characteristics of the nine ML models for predicting BCRL are summarized in Table 4.

Figure 2
www.frontiersin.org

Figure 2 Receiver operating characteristic curve of the nine models in the validation set.

Table 4
www.frontiersin.org

Table 4 Performance results for the machine learning models in the validation set.

Discussion

Our study found that lymphedema occurs in 25.97% of breast cancer patients, with consistent results in a review by Erickson et al. (11), which reported an overall incidence of upper extremity edema of 26%. However, the incidence of BCRL was 33.82%, 19.77%, and 6.8% in the studies by Xiao Xu (12), Xie Danping et al. (13), and Card et al. (14), respectively, which may be due to the variability of results due to the diversity of the study sites, follow-up times, diagnostic criteria, and measurement methods.

Since lymphedema is a chronic progressive disease, early detection or prevention is very important for both patients and healthcare professionals. By identifying risk factors for lymphedema, the likelihood of its occurrence can be predicted and appropriate strategies can be adopted to reduce these risk factors. In this study, 13 characteristics were screened by LASSO regression.

The most important risk factor was the patient’s BMI. In Ahmed and Dominick’s study of 1,287 and 2,431 breast cancer survivors (15, 16), BMI was also identified as a significant risk factor for lymphedema. In a study by Park et al. (17), BMI ≥25 kg/m2 was an independent risk factor for lymphedema in an analysis of 406 women who received preoperative neoadjuvant chemotherapy. Although considered one of the most important risk factors for BCRL, and despite the fact that accumulation of lymphocytes is known to increase adipocyte proliferation and differentiation (18), the mechanism by which elevated BMI increases the risk of secondary lymphedema is unclear. Green (19) suggested that a possible explanation for obesity-induced lymphedema is the inability of the lymphatic system of the affected limb to transport the amount of lymph produced, which can be demonstrated by abnormal lymphatic scintigraphy. Therefore, health education aimed at promoting dietary and lifestyle changes in breast cancer patients with a high BMI is important to help reduce the risk of postsurgical lymphoedema. This can be provided by specialized breast nurses as part of a holistic and individualized care plan. Although the primary impact of early dietary intervention on weight loss is beneficial in reducing the risk of lymphedema, potential secondary outcomes include a positive association with breast cancer recurrence, self-care, and overall health (20). Individuals with comorbidities such as diabetes, cardiovascular disease, or musculoskeletal disorders may benefit from lower BMI through dietary interventions or exercise, thereby improving quality of life.

The impact of lymph node status on the presence of LE is a topic of ongoing debate. In this study, the second most significant risk factor was the number of positive lymph nodes, aligning closely with the findings of Wu et al. (21). In the multivariate analysis conducted by Kwan et al. (22), the quantity of pathological lymph nodes demonstrated statistically significant prognostic value for the severity of lymphedema. A meta-analysis report suggests that an increased number of metastatic lymph nodes elevated the incidence of LE (23). A prospective survey on lymphedema involving 627 breast cancer patients who underwent mastectomy revealed that with each additional positive lymph node, the risk of developing BCRL increased by 1.091 times (24). Huang et al. (25) assert that the status of axillary lymph nodes significantly influenced the treatment and prognosis of breast cancer patients. This may be attributed to a higher number of positive lymph nodes leading to a broader scope of axillary lymph node dissection, resulting in prolonged radiation therapy and potential damage to axillary lymph nodes, upper limbs, and chest tissues. Conversely, a study comparing LE following SLNB and ALND in lymph node-negative and lymph node-positive breast cancer suggested that only ALND was associated with LE, regardless of lymph node status (26).

This study indicated that NAC was also a significant risk factor for the occurrence of BCRL. A prospective cohort study of 276 patients found that compared to preoperative surgery, patients undergoing ALND with NAC treatment had a twofold increased risk of BCRL development (27). Another study involving 409 ALND patients also discovered that NAC increased the risk of BCRL by 3.76 times and was independently associated with BCRL (28). A recent retrospective analysis of 596 breast cancer patients who underwent ALND and chemotherapy revealed that, compared to preoperative surgery, patients undergoing NAC with ALND had a 1.5-fold increased risk of BCRL (29). This finding was consistent with another retrospective study of 848 ALND patients, where NAC patients experienced more prolonged BCRL events compared to those receiving adjuvant chemotherapy (HR 1.39; 95% CI [1.05, 1.84]) (30). Similar associations were observed in populations other than those undergoing ALND. A retrospective analysis of 3,136 patients who underwent breast excision surgery reported a BCRL incidence of 10.4%, with NAC (HR 1.42; 95% CI [1.10, 1.84]) increasing the prevalence of BCRL (31).

The prevalence of most preventive behaviors is low due to postoperative health education and patient adherence to recommendations for preventive behaviors, such as sunburn, trauma, injections, and ipsilateral blood draws. Mosquito bites on infected limbs were the most common because they are not easily avoided; however, they usually have a minimal impact on the patient and few relevant studies have shown it to be a risk factor for BCRL. Air travel ranked second in prevalence. Air travel has received relatively little attention, hence its higher prevalence. Theoretically, air travel can have a deleterious effect on lymphedema. It is hypothesized that changes in cabin pressure during ascent and descent of the aircraft and the relatively low cabin pressure at high altitudes are responsible for this problem (32). However, there are conflicting indications in the published literature (33, 34). In the present study, among these behaviors, “do not ignore upper extremity edema,” “avoid strenuous exercise of the affected extremity,” “avoid heavy lifting of the affected extremity,” and “avoiding excessive fatigue of the affected limb” were associated with lymphedema. This may be due to overuse leading to excessive muscle tension in the affected limb, which disrupts the balance of lymphatic return and in turn induces lymphedema (35). Preventive behaviors associated with the risk of lymphedema remain controversial, and this uncertainty influences decision-making in very different ways for people at risk of lymphedema or for people with lymphedema. Further research is needed to determine whether they exacerbate lymphedema in postoperative breast cancer patients.

In addition, we performed a comprehensive comparison of the nine models. In terms of sensitivity, LR has the highest sensitivity, which means that the model has a relatively high identification rate for positive patients, and the specificity of LR is 0.82, which means that the identification of negative patients is generally correct, whereas SVM, SGD, KNN, DT, RF, ET, XGBoost, and LightGBM were all generally stable and had lower overall effectiveness scores than LR. In comparison with previous studies, Fu et al. (36) used five machine learning classification algorithms: decision tree with C4.5 and decision tree with C5.0, GBM, ANN, and SVM for predicting BCRL. Out of the five trained classifiers, the artificial neural network detected lymphedema with an accuracy of 93.75%, sensitivity of 95.65%, and specificity of 91.03%. The use of real-time symptom report that allows the use of web-and-mobile-based mHealth system in detecting lymphedema status is a strength of the study, which increases the predictive performance of the model compared with the LR model in this study. Notash et al. (37) used six classification algorithms including C5.0’s decision tree, KNN, SVM, LDA, Bayesian, and MLP to construct the BCRL prediction model, of which the SVM algorithm showed the highest sensitivity and was found to be the best model for predicting lymphedema, based on the accuracy obtained, the algorithm correctly detected the presence or absence of lymphedema in newly diagnosed patients in 88% of cases, which was slightly higher than the AUC of the LR in this study (0.87).Wei et al. (38) derived and evaluated six machine learning models, and the results showed that the LR model performed the best in the early detection of lymphedema with the best performance, AUC = 0.889 (0.840–0.938), sensitivity = 0.771, specificity = 0.883, accuracy = 0.825, and Brier score = 0.141, which is slightly lower than the sensitivity of LR (0.79) in this study. In summary, among the risk prediction models for upper limb lymphedema in postoperative breast cancer patients, the model constructed by the LR algorithm has a better predictive performance, which can guide clinical medical personnel to develop targeted BCRL prevention strategies.

Conclusion

In this study, using LASSO regression modeling, the 13 features that were most valuable in predicting the occurrence of BCRL were screened and ranked in order of importance, and the results showed that BMI was the most critical factor among them. Then, the dataset was divided into training and test sets in the ratio of 7:3, and after comparing the performance indicators of the nine models combined, the LR model achieved the accuracy (0.81), precision (0.60), sensitivity (0.79), specificity (0.82), recall (0.79), F-score (0.68), and AUC value (0.87) of predicting BCRL with the optimal performance and was able to identify patients at high risk of BCRL more accurately. Overall, the findings emphasize the importance of implementing holistic and comprehensive care for patients with cancer, and nurses with expertise in lymphedema can play a key role in this regard.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by Sichuan Cancer Hospital Ethics Committee. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

JD: Conceptualization, Data curation, Investigation, Methodology, Writing – original draft. JY: Data curation, Supervision, Writing – original draft, Writing – review & editing. QY: Software, Supervision, Writing – review & editing. XZ: Investigation, Software, Writing – original draft. LY: Methodology, Resources, Writing – original draft. BF: Data curation, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin (2021) 71(3):209–49. doi: 10.3322/caac.21660

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Miller KD, Nogueira L, Mariotto AB, Rowland JH, Yabroff KR, Alfano CM, et al. Cancer treatment and survivorship statistics, 2019. CA Cancer J Clin (2019) 69(5):363–85. doi: 10.3322/caac.21565

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Cheville AL, Mcgarvey CL, Petrek JA, Russo SA, Thiadens SR, Taylor ME. The grading of lymphedema in oncology clinical trials. Semin Radiat Oncol (2003) 13(3):214–25. doi: 10.1016/S1053-4296(03)00038-9

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Kim M, Kim SW, Lee SU, Lee NK, Jung SY, Kim TH, et al. A model to estimate the risk of breast cancer-related lymphedema: combinations of treatment-related factors of the number of dissected axillary nodes, adjuvant chemotherapy, and radiation therapy. Int J Radiat Oncol Biol Phys (2013) 86(3):498–503. doi: 10.1016/j.ijrobp.2013.02.018

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Clark B, Sitzia J, Harlow W. Incidence and risk of arm oedema following treatment for breast cancer: a three-year follow-up study. Qjm-an Int J Med (2005) 98(5):343–8. doi: 10.1093/qjmed/hci053

CrossRef Full Text | Google Scholar

6. Zhang W, Tang Y, Liu H, Yuan LP, Wang CC, Chen SF, et al. Risk prediction models for intensive care unit-acquired weakness in intensive care unit patients: a systematic review. PloS one (2021) 16(9):e0257768. doi: 10.1371/journal.pone.0257768

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Yu 'an L, Xiaowen Y, Lezhi LI. Application of machine learning in disease prediction. J Nurs (2019) 28(07):30–4. doi: 10.2196/38040

CrossRef Full Text | Google Scholar

8. Samuel AL. Some studies in machine learning using the game of checkers. Ibm J Res Dev (1959) 3(3):211–29. doi: 10.1147/rd.33.0210

CrossRef Full Text | Google Scholar

9. Cheng ASK, Guan Q, Su Y, Zhou P, Zeng Y. Integration of machine learning and blockchain technology in the healthcare field: A literature review and implications for cancer care. Asia-Pacific J Oncol Nurs (2021) 8(6):720–4. doi: 10.4103/apjon.apjon-2140

CrossRef Full Text | Google Scholar

10. Mainali S, Darsie ME, Smetana KS. Machine learning in action: stroke diagnosis and outcome prediction. Front Neurol (2021) 12:734345. doi: 10.3389/fneur.2021.734345

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Erickson VS, Pearson ML, Ganz PA, Adams J, Kahn KL. Arm edema in breast cancer patients. J Natl Cancer Institute (2001) 93(2):96–111. doi: 10.1093/jnci/93.2.96

CrossRef Full Text | Google Scholar

12. Xiao X. Lymphedema predictor factors after breast cancer surgery: a survival analysis. Lymphatic Research and Biology (2021) 13(4):268–74. doi: 10.1089/lrb.2013.0042

CrossRef Full Text | Google Scholar

13. Danping X, Hongchao F. Analysis of related influencing factors of postoperative upper limb lymphedema in patients with breast cancer. J Pract Cancer (2019) 38(01):66–9. doi: 10.1016/j.apjon.2022.100101

CrossRef Full Text | Google Scholar

14. Card A, Crosby MA, Liu J, Lindstrom WA, Lucci A, Chang DW. Reduced Incidence of Breast Cancer-Related Lymphedema following Mastectomy and Breast Reconstruction versus Mastectomy Alone. Plast Reconstructive Surg (2012) 130(6):1169–78. doi: 10.1097/PRS.0b013e31826d0faa

CrossRef Full Text | Google Scholar

15. Ahmed RL, Schmitz KH, Prizment AE, Folsom AR. Risk factors for lymphedema in breast cancer survivors, the Iowa Women's Health Study. Breast Cancer Res Treat (2011) 130(3):981–91. doi: 10.1007/s10549-011-1667-z

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Dominick SA, Madlensky L, Natarajan L. Risk factors associated with breast cancer-related lymphedema in the WHEL Study. J Cancer Survivorship (2013) 7(1):115–23. doi: 10.1007/s11764-012-0251-9

CrossRef Full Text | Google Scholar

17. Park S, Lee JE, Yu J, Paik HJ, Ryu JM, Kim I, et al. Risk factors affecting breast cancer-related lymphedema: serial body weight change during neoadjuvant anthracycline plus cyclophosphamide followed by taxane. Clin Breast Cancer (2018) 18(1):e49–e54. doi: 10.1016/j.clbc.2017.06.003

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Greene AK, Slavin SA, Brorson H. Lymphedema: Presentation, diagnosis, and treatment. (2015) 148(1):91–8. doi: 10.1007/978-3-319-14493-1

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Sudduth CL, Greene AK. Current overview of obesity-induced lymphedema. Adv Wound Care (2022) 11(7):392–8. doi: 10.1089/wound.2020.1337

CrossRef Full Text | Google Scholar

20. Mahamaneerat WK, Shyu C-R, Stewart BR, Armer JM. Breast cancer treatment, BMI, post-op swelling/lymphoedema. J lymphoedema (2008) 3(2):38–44.

PubMed Abstract | Google Scholar

21. Wu XM, Guan QY, Cheng AS, K , Guan C, Su Y, Jiang J, et al. Comparison of machine learning models for predicting the risk of breast cancer- related lymphedema in Chinese women. Asia-Pacific J Oncol Nurs (2022) 9(12):100–1. doi: 10.1016/j.apjon.2022.100101

CrossRef Full Text | Google Scholar

22. Kwan JYY, Famiyeh P, Su J, Su J, Xu W, Kwan BYM. Development and validation of a risk model for breast cancer-related lymphedema. JAMA Netw Open (2020) 3(11):121–5. doi: 10.1001/jamanetworkopen.2020.24373

CrossRef Full Text | Google Scholar

23. DiSipio T, Rye S, Newman B, Hayes S. Incidence of unilateral arm lmphoedema after breast cancer: a systematic review and meta-analysis. Lancet Oncol (2013) 14(6):500–15. doi: 10.1016/S1470-2045(13)70076-7

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Kwan JYY, Famiyeh P, Su J, Xu W, Kwan BYM, Jones JM, et al. Development and validation of a risk model for breast cancer-related lymphedema. JAMA (2020) 3(11):e2024373. doi: 10.1001/jamanetworkopen.2020.24373

CrossRef Full Text | Google Scholar

25. Huang HP, Zhou JR, Zeng Q. Risk Factors Associated with Lymphedema among Postmenopausal Breast Cancer Survivors after Radical Mastectomy and Axillary Dissection in China. Breast Care (2012) 7(6):461–4. doi: 10.1159/000345459

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Sackey H, Magnuson A, Sandelin K, Liljegren G, Bergkvist L, Fülep Z, et al. Arm lymphoedema after axillary surgery in women with invasive breast cancer. Br J Surg (2014) 101(4):390–7. doi: 10.1002/bjs.9401

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Montagna G, Zhang J, Sevilimedu V, Charyn J, Abbate K, Gomez EA, et al. Risk factors and racial and ethnic disparities in patients with breast cancer-related lymphedema. JAMA Oncol (2022) 8(8):1195–200. doi: 10.1001/jamaoncol.2022.1628

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Li X, Hui H, Lin Q, Yu Q, Zhou Y, Long W, et al. Validation of a breast cancer nomogram to predict lymphedema in a Chinese population. J Surg Res (2017) 210:132–8. doi: 10.1016/j.jss.2016.11.009

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Jia MM, Pan LH, Yang HB, Gao J, Guo F. Impact of neoadjuvant chemotherapy on breast cancer-related lymphedema after axillary lymph node dissection: a retrospective cohort study. Breast Cancer Res Treat. (2023). doi: 10.21203/rs.3.rs-3012109/v1

CrossRef Full Text | Google Scholar

30. Jung SY, Shin KH, Kim M, Chung SH, Lee S, Kang HS, et al. Treatment factors affecting breast cancer-related lymphedema after systemic chemotherapy and radiotherapy in stage II/III breast cancer patients. Breast Cancer Res Treat (2014) 148(1):91–8. doi: 10.1007/s10549-014-3137-x

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Basta MN, Wu LC, Kanchwala SK, Serletti JM, Tchou JC, Kovach SJ, et al. Reliable prediction of postmastectomy lymphedema: the risk assessment tool evaluating lymphedema. Am J Surg (2017) 213(6):1125–33. doi: 10.1016/j.amjsurg.2016.08.016

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Gavish I, Brenner B. Air travel and the risk of thromboembolism. Internal Emergency Med (2011) 6(2):113–6. doi: 10.1007/s11739-010-0474-6

CrossRef Full Text | Google Scholar

33. Kilbreath SL, Refshauge KM, Beith JM, Ward LC, Ung OA, Dylke ES, et al. Risk factors for lymphoedema in women with breast cancer: A large prospective cohort. Breast (2016) 28:29–36. doi: 10.1016/j.breast.2016.04.011

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Ferguson CM, Swaroop MN, Horick N, Skolny MN, Miller CL, Jammallo LS, et al. Impact of ipsilateral blood draws, injections, blood pressure measurements, and air travel on the risk of lymphedema for patients treated for breast cancer. J Clin Oncol (2016) 34(7):691–8. doi: 10.1200/JCO.2015.61.5948

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Fu MR, Ridner SH, Armer J. Post-breast cancer lymphedema: part 2. Am J Nursing: Off Magazine Am Nurses' Associstion (2009) 109(8):34–42. doi: 10.1097/01.NAJ.0000358492.91678.78

CrossRef Full Text | Google Scholar

36. Fu MR, Wang Y, Li C, Qiu Z, Axelrod D, Guth AA, et al. Machine learning for detection of lymphedema among breast cancer survivors. mHealth (2018) 4:17. doi: 10.21037/mhealth.2018.04.02

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Notash AY, Notash AY, Omidi Z, Haghighat S. Prediction of lymphedema occurrence in patients with breast cancer using the optimized combination of ensemble learning algorithm and feature selection. BMC Med Inf Decision Making (2022) 22(1):195–6. doi: 10.1186/s12911-022-01937-z

CrossRef Full Text | Google Scholar

38. Wei X, Lu Q, Jin S, Li F, Zhao Q, Cui Y, et al. Developing and validating a prediction model for lymphedema detection in breast cancer survivors. Eur J Oncol Nurs (2021) 54(4):102–3. doi: 10.1016/j.ejon.2021.102023

CrossRef Full Text | Google Scholar

Keywords: breast cancer, lymphoedema, machine learning, prediction model, cancer

Citation: Du J, Yang J, Yang Q, Zhang X, Yuan L and Fu B (2024) Comparison of machine learning models to predict the risk of breast cancer-related lymphedema among breast cancer survivors: a cross-sectional study in China. Front. Oncol. 14:1334082. doi: 10.3389/fonc.2024.1334082

Received: 06 November 2023; Accepted: 24 January 2024;
Published: 12 February 2024.

Edited by:

Chirag Patel, National Cancer Institute at Frederick (NIH), United States

Reviewed by:

Anurag Srivastava, National Institute of Cancer Prevention and Research (ICMR), India
Raghvendra Mall, Technology Innovation Institute (TII), United Arab Emirates

Copyright © 2024 Du, Yang, Yang, Zhang, Yuan and Fu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jing Yang, 815361661@qq.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.