Potential Determinants for Radiation-Induced Lymphopenia in Patients With Breast Cancer Using Interpretable Machine Learning Approach

Radiation-induced lymphopenia is known for its survival significance in patients with breast cancer treated with radiation therapy. This study aimed to evaluate the impact of radiotherapy on lymphocytes by applying machine learning strategies. We used Extreme Gradient Boosting (XGboost) to predict the event of lymphopenia (grade≥1) and conduced an independent validation. Then, we induced feature attribution analysis (Shapley additive explanation, SHAP) in explaining the XGboost models to explore the directional contribution of each feature to lymphopenia. Finally, we implemented the proof-of-concept clinical validation. The results showed that the XGboost models had rigorous generalization performances (accuracies 0.764 and ROC-AUC 0.841, respectively) in the independent cohort. The baseline lymphocyte counts are the most protective feature (SHAP = 5.226, direction of SHAP = -0.964). Baseline platelets and monocytes also played important protective roles. The usage of taxane only chemotherapy was less risk on lymphopenia than the combination of anthracycline and taxane. By the contribution analysis of dose, we identified that firstly lymphocytes were sensitive to a radiation dose less than 4Gy; secondly the irradiation volume was more important in promoting lymphopenia than the irradiation dose; thirdly the irradiation dose promoted the event of lymphopenia when the irradiation volume was fixed. Overall, our findings paved the way to clarifying the radiation dose volume effect. To avoid radiation-induced lymphopenia, irradiation volume should be kept to a minimum during the planning process, as long as the target coverage is not compromised.

Radiation-induced lymphopenia is known for its survival significance in patients with breast cancer treated with radiation therapy. This study aimed to evaluate the impact of radiotherapy on lymphocytes by applying machine learning strategies. We used Extreme Gradient Boosting (XGboost) to predict the event of lymphopenia (grade≥1) and conduced an independent validation. Then, we induced feature attribution analysis (Shapley additive explanation, SHAP) in explaining the XGboost models to explore the directional contribution of each feature to lymphopenia. Finally, we implemented the proofof-concept clinical validation. The results showed that the XGboost models had rigorous generalization performances (accuracies 0.764 and ROC-AUC 0.841, respectively) in the independent cohort. The baseline lymphocyte counts are the most protective feature (SHAP = 5.226, direction of SHAP = -0.964). Baseline platelets and monocytes also played important protective roles. The usage of taxane only chemotherapy was less risk on lymphopenia than the combination of anthracycline and taxane. By the contribution analysis of dose, we identified that firstly lymphocytes were sensitive to a radiation dose less than 4Gy; secondly the irradiation volume was more important in promoting lymphopenia than the irradiation dose; thirdly the irradiation dose promoted the event of lymphopenia when the irradiation volume was fixed. Overall, our findings paved the way to clarifying the radiation dose volume effect. To avoid radiation-induced lymphopenia, irradiation volume should be kept to a minimum during the planning process, as long as the target coverage is not compromised.

INTRODUCTION
The biological effects of radiation exposure on the immune system are double-edged. It has an immunostimulatory effect by promoting the release of tumor antigens (1), radiationinduced neoantigens (2) and chemokine that recruit effector cells into the tumor microenvironment (3). On the other side, radiation has the potential for direct cytotoxicity toward immune cells, especially lymphocytes, which are the most radiosensitive (4). Among human peripheral blood lymphocytes, T helper cells, cytotoxic T cells, and B cells display a radiosensitive phenotype (5). In the treatment of solid tumors, lymphopenia is a common side effect of radiotherapy known for decades (6).
Radiation-induced lymphopenia is associated with inferior clinical outcomes in a wide variety of solid malignancies (7,8), and more importantly, it is associated with inferior survival outcomes (9)(10)(11)(12). For example, total lymphocyte counts < 100 cells/mm 3 were associated with poor overall survival in patients with locally advanced cervical cancer (13); total lymphocyte count < 500 cells/mm 3 at 2 months was associated with short overall survival outcomes and was an independent predictor for survival in elderly patients with glioblastoma (10). Similarly, lymphopenia was an independent predictor of inferior survival in locally advanced pancreatic cancer (11). In patients with breast cancer, the five-year disease-free survival was significantly lower in patients with a ratio of lymphocyte nadir to pre-treatment lymphocyte less than 0.8 (14). Nevertheless, baseline characteristics associated with radiation-induced lymphopenia have not been thoroughly evaluated. Given its clinical implications, it is necessary to identify those baseline characteristics to predict radiation-induced lymphopenia.
Recently, some studies have investigated and modeled the significant effects of radiation dose on radiation-induced lymphopenia. In esophageal cancer patients who underwent radiotherapy, thoracic vertebral (TV) volume spared of 5-40Gy was significantly associated with higher lymphocyte nadirs (P<0.05) (15). Yovino et al. (16) demonstrated the lymphotoxic impact of conventionally fractionated brain radiotherapy for high-grade gliomas. They established that after 30 fractions of radiotherapy, 99% of the circulating lymphocyte received ≥0.5Gy. Furthermore, our group identified a model of the effective radiation dose to the circulating immune cells (EDIC) in patients with advanced esophageal squamous cell carcinoma treated with trimodality therapy (17,18). EDIC negatively correlated with lymphocyte nadir. Overall, radiation therapy was associated with lymphopenia in patients with different solid tumors. However, the association between breast cancer and radiation-induced lymphopenia is less well-studied.
In this study, we generated Extreme Gradient Boosting (XGboost) (19) models in a machine learning framework to identify the impact of radiation dose on circulating lymphocytes in patients with breast cancer. XGboost is the specific implementation of the gradient boosting algorithm. It employs more accurate approximations to find the best models and an advanced regularization technique, enhancing model training speed and generalization and reducing model complexity (20).
Next, we sought to understand the associations between lymphopenia and baseline features, including baseline blood counts, clinical and tumor characteristics, treatment regimens and especially radiation dose, in patients with breast cancer who underwent radiation therapy. Therefore, we assessed the relative importance and direction of feature contribution to lymphopenia in XGboost models via Shapley additive explanation (SHAP) approaches (21,22). Rigorous quality validations were implemented, including cross-validation, bootstrapping and Proof-of-concept clinical validation.

Description of Cohorts
Lymphopenia (grade ≥1) was defined from the Common Terminology Criteria for Adverse Events, version 4.0 (CTCAE v4.0). Patients with breast cancer who received adjuvant radiation therapy from March 2015 to October 2019 at the University of Hong Kong-Shenzhen Hospital formed the study population (Testing cohort). The eligibility criteria were: pathologically confirmed invasive breast cancer, received adjuvant radiation therapy, aged 18-years old or above, peripheral lymphocyte counts evaluated within 7 days after the end of radiation therapy in the same hospital. Exclusion criteria included patients with non-invasive breast cancer (stage 0), stage IV or recurrent breast cancer, breast lymphoma, and underlying autoimmune diseases. Patients who underwent surgery and chemotherapy in other hospitals were eligible if they received the whole course of radiotherapy at the University of Hong Kong-Shenzhen Hospital.
Another independent prospective validation cohort included patients between November 2019 and December 2020. Patients with breast cancer who underwent radiation therapy with the same criteria as the Testing cohort were selected from a prospective observational study of the Bio-Imaging Repository Databank (BIRD) project at the University of Hong Kong-Shenzhen Hospital.

Radiation Therapy Procedures
Radiotherapy techniques included 2-field tangential opposing technique (2D-fields), tangential opposing fields with an anterior SCF field (three-dimensional conformal technique, 3D-fields) and RapidArc (Varian Medical Systems, Palo Alto, CA, USA). CT scans from the skull base to the level of the first lumbar vertebra were obtained in this study. The 2D-fields technique was normally employed on patients who needed irradiation only of the breast. 3D-fields technique was usually used on patients who needed irradiation of the breast or chest wall and supraclavicular fossa or axillary fossa. RapidArc, a volume modulated arc therapy technique, was employed on patients with invasive breast cancer with N3 or N2 diseases with centrally or medially located primary tumors. with), and menopausal (pre, peri, post). All participants are females.

Predictive Models of the Lymphopenia Events
We established XGboost models by tenfold cross-validation (CV) framework via 100 bootstrapping iterations to predict radiation-induced lymphopenia. We used either all features or each feature group as input to build XGboost models and applied Lasso regressions for comparison. To estimate the explained prediction variances, we evaluated the predicting results using sensitivity, specificity, accuracy, f1-score, the area under the receiver operating characteristic curve (ROC-AUC) and the area under the precision-recall curve (PR-AUC). All samples were used in training and validation, considering XGboost's abilities to handle the missing data. In contrast, those samples without missing data were used in training and validating the Lasso regressions.
In each bootstrapping iteration, we randomly selected 80% of patients and grouped them into one patient set. The patient set was separated into a training set and a testing set (ratio 7: 3) and performed the tenfold CV to train models. Next, AUC and its minimum value were used as hyperparameters for Lasso regression in the training set. The grid-search sets were used as hyperparameters for XGboost, i.e. learning_rate from 0.01 to 0.1 step 0.01, gamma from 0 to 5 step 1, max_depth from 3 to 6 step 1, scale_pos_weight from 1 to 2 step 0.2, subsample from 0.7 to 1 step 0.1, colsample_bytree from 0.7 to 1 step 0.1, min_child_weigth from 3 to 6 step 1, max_delta_step from 0 to 5 step 1, the other hyperparameters were set as defaults. Finally, we computed the model's prediction on the remaining testing set.

Feature Attributions Analysis
To emphasize the predictive power of XGboost models, we used Shapley additive explanation (SHAP). SHAP values assign an importance value to each feature representing the effect on the model prediction. In brief, for a specific prediction, the SHAP value of a feature is defined as the change in the expected value of the model's output when this feature is observed versus when it is missing.
We computed individual SHAP values using the module TreeExplainer (Python v.0.37.0). We summarized the mean absolute SHAP value across all instances, reflecting the mean effect of each feature on predicting the lymphopenia outcome and serving as a feature's contribution measure. The bigger mean absolute SHAP value mentions the feature is more important in lymphopenia.
We further determined the directional mean absolute SHAP values by taking into account the mean value of Spearman correlations between individual SHAP values and corresponding outcomes via all iterations. The directional SHAP values more closing to 1 or -1 mentions that the feature promotes or protects the occurrence of lymphopenia more. The directional SHAP values has the same sign meaning as the odd ratio in the Lasso regression. Finally, we used a graphical layout (Cytoscape 3.7.2) in order to visualize the contributions of features (both mean absolute SHAP value and conditional SHAP values) in XGboost models for the event of lymphopenia.

Selection of Paired Patients
To study the contributions of the less important features that important features may overshadow, we selected the paired patients with controlled discrepancy of important features in the Testing cohort and Validation cohort separately. In this study, for every two patients a and b, the discrepancy was defined as the mean value of absolute relative differences in important number, n is the total important feature number, abs means absolution, f_a and f_b are the feature values in patients a and b, respectively, and f_max means the maximum of the feature. For each less important feature we need to study, the important features were defined as those with the SHAP values bigger than the less important features. Two patients with a discrepancy less than the threshold and different lymphopenia outcomes were considered one paired patient. In the selected paired patient cohort, we used paired t-test to assess the significance of the less important feature on the lymphopenia events.

Statistical Analysis
For all statistical analysis and prediction models we used R 3.6.1

Patients and Characteristics
The patient flowchart is shown in Figure 1A. In the Testing cohort, 589 patients with breast cancer who underwent radiation therapy were enrolled. We collected data about clinical characteristics, baseline and post-treatment blood test results, tumor characteristics, radiation dose and other therapy regimes (summarized in Table 1). All patients were females with a median age of 45 (1st to 3rd Qu: 39-51). The median lymphocyte count in patients before radiation therapy was 1530/µL (1st to 3rd Qu: 1200 to 1860), while it decreased to 950/µL (1st to 3rd Qu: 750 to 1170) after radiation therapy. A total of 340 (57.7%) patients had lymphopenia (grade≥1) after radiation therapy.
To validate the accuracy and robustness of our results, we adopted an independent prospective cohort (Validation cohort) enrolling 203 patients with breast cancer. The patients in the Validation cohort were also all females with a median age of 45 (1st to 3rd Qu: 39-51). The median lymphocyte count in patients before radiation therapy was 1500/µL (1st to 3rd Qu: 1230 to 1935), while it decreased to 1000/µL (1st to 3rd Qu: 700 to 1255) after radiation therapy. A total of 104 (51.2%) patients had lymphopenia (grade≥1) after radiation therapy. The characteristics of the Validation cohort were all summarized in Table S1.

Prediction of the Event of Lymphopenia
The flowchart for establishing the machine learning models is shown in Figure 1B. We trained XGboost and lasso regression in tenfold CV via 100 bootstrapping iterations in the Testing cohort to predict the binary classified lymphopenia. The models were validated in the Validation cohort across all iterations. The final XGboost models were development with the hyperparameters as follows: learning_rate=0.04, gamma=5, max_depth=3, scale_pos_weight=1.8, subsample=0.8, colsample_bytree=0.7, min_child_weigth =5, max_delta_step = 1.
In the full XGboost models, the main metrics to evaluate the classifying abilities are both accuracy (median: 0.781, 1st to 3rd Qu: 0.762 to 0.817) and ROC-AUC (median: 0.841, 1st to 3rd Qu: 0.822 to 0.856); the models were validated in Validation cohort for accuracy (median: 0.764, 1st to 3rd Qu: 0.753 to 0.774) and ROC-AUC (median: 0.841, 1st to 3rd Qu: 0.817 to 0.868), as shown in Figure 2. Other evaluation metrics in the Testing and Validation cohorts are shown in Figures S1-S4 and compared, sensitivity, specificity, F1-score, and PR-AUC. The XGboost models' evaluation metrics and prediction abilities in the Validation cohort keep up with the Lasso regression's metrics.
We also investigated the abilities of each feature group to predict lymphopenia. Radiation dose and baseline blood cells were the two important feature groups, as shown in Figure 2. In contrast, the other feature groups, including tumor characteristics, treatment regimens and clinical features, have fewer contributions to lymphopenia. These are similar to the results of lasso regressions, which are also shown in Figure 2 and Figures S1-S4.

Feature Attributions
The feature parameters, including both gain and frequency indexes, in the full XGboost models constructed via all iterations, are summarized in Table S2. In each XGboost model, the gain index represents the fractional contribution of each feature to the model based on the total gain of this feature's splits; and the frequency index represents the relative number of times a feature has been used in sub-trees in one XGboost model. The higher gain and frequency indexes mean the more important predictive feature for predicting outcome. We also checked the features' coefficients and P-values in Lasso regressions and their occurrence frequencies ( Table S3). The top 10 important    features for lymphopenia either in XGboost models or Lasso regressions are compared in Figure S5. However, both Gain and frequency indexes in XGboost cannot show the directional contributions of each feature on the occurrence of lymphopenia, which is not similar to the meaning of coefficient in Lasso regressions. We analyzed the directional SHAP values of each feature in the XGboost models, illustrated in the method section. The directional SHAP values of features in XGboost models are visualized in Figure 3. The comparison of feature contributions to the occurrence of lymphopenia between the directional SHAP values in XGboost models and the coefficients in Lasso regressions is listed in Table 2.

Baseline Blood Cells' Attributions
As shown in Figure 3A, we considered the baseline blood cell counts' contributions to the occurrence of lymphopenia in the XGboost and SHAP analysis. The baseline lymphocyte counts are negatively associated with lymphopenia events (SHAP = 5.226, direction of SHAP = -0.964). The higher the baseline lymphocyte level, the fewer lymphopenia events after treatment. Interestingly, baseline hemoglobin level promotes lymphopenia events (SHAP = 0.737, direction of SHAP = 0.378), while white blood cells, platelets and monocytes protected the patients from lymphopenia. We also found their directional contributions to the lymphopenia events were consistent with coefficients in Lasso regressions ( Table 2)

Dose Attributions
We considered the contribution of the radiation dose to the occurrence of lymphopenia in XGboost models and SHAP analysis, as shown in Figure 3B Figure S5 and Table S2).
Next, we extracted both irradiation doses and corresponding volumes of each radiation dose from the patient's DVH curves. As shown in Figures 4A-D, we sorted both the irradiation volume and dose by their contributions to lymphopenia (SHAP values). The irradiation volumes are ordered ascendingly by SHAP values (log10(volume)~1.325*SHAP value, P value<1e-8). In contrast, the corresponding irradiation dose is in the descending order with SHAP values (log2(dose)~0.603*SHAP value, P value<1e-8). The regression information is shown in Figure S6. It can be illustrated that (point 1) lymphocytes are sensitive to an irradiation dose lower than 4Gy, because the integral dose of body (median: 4Gy, 1st-3rd Qu: 3.3-5Gy) is found to be the most important dose in lymphopenia; (point 2) the irradiation volume plays a significantly greater role than the irradiation dose, in promoting lymphopenia.
We studied the relationship between irradiation dose and irradiation volume in both the Test and Validation cohort ( Figures 4E, F). Irradiation volume is commonly positively correlated with irradiation dose. Notably, the irradiation volume is low when the mean heart dose is higher both in the Testing cohort and the Validation cohort that means the higher mean heart dose is related to the smaller irradiation range. According to the points 1 and 2 we summarized above, the protective role of mean heart dose against a lymphopenia event (SHAP = 0.774,  direction of SHAP = -0.377) can be explained. That is, not the higher mean heart dose, but the smaller corresponding irradiation volume protects against a lymphopenia event.
In addition, the irradiation volume of the maximum heart dose is almost close to zero, and the positive SHAP value of the maximum heart dose (SHAP = 0.783, direction of SHAP = 0.187) mentions that the irradiation dose promotes the event of lymphopenia when the irradiation volume is fixed. Following the points above, it can be further illustrated that: (point 3) the irradiation dose promotes the occurrence of lymphopenia when  SHAP value is more than zero, the higher SHAP value the more contribution of feature to lymphopenia; direction of SHAP is range from -1 to 1, more promotive to lymphopenia when close to 1 while more protective when closet to -1. RT, radiation treatment; ER, estrogen receptors; PR, progesterone receptors; IHC, immunohistochemistry; HR, hormone receptor; HER2, human epidermal growth factor receptor 2; BCT, breast-conserving therapy; MRM, modified radical mastectomy; SLNB, Sentinel lymph node biopsy; ALND, axillary lymph node dissection. the irradiation volume is controlled. Because of the negative correlations between irradiation volume and irradiation doe of heart, the associations between the mean/maximum heart dose and SHAP value do not follow the regression relationships, it can be compared between Figures S6, S7.

Other Features' Attributions
Finally, we considered the contributions of other features to the occurrence of lymphopenia ( Figures 3C-E), especially those with relatively high SHAP value and direction of SHAP. Notably, the chemotherapy regimens (anthracycline/taxane) were found to promote the occurrence of lymphopenia (SHAP = 0.368, direction of SHAP = 0.714). Conversely, the Taxane monotherapy chemotherapy regimens were less of risk factor (SHAP = 0.420, direction of SHAP = -0.672). These findings were consistent with the results in Lasso regressions, as shown in Table 2.

Proof-of-Concept Clinical Validation
We found that maximum heart dose and V20 of ipsilateral lung promoted the lymphopenia events (SHAP = 0.783, direction of SHAP = 0.187; SHAP = 0.383, direction of SHAP = 0.308, respectively). The mean heart dose protected the patients from lymphopenia (SHAP = 0.774, direction of SHAP = -0.377) ( Table 2). However, these three radiation doses were never included in Lasso regressions. The mean heart dose promotes lymphopenia in the univariate logistic analysis in both the Testing cohort (Table 1) and the Validation cohort ( Table S1).
The contradiction between mean heart dose in SHAP analysis and univariate logistic analysis might be caused by the multicollinearity among radiation doses (Pearson's correlations > 0.181, as shown in Figure S8). Moreover, the contradiction between these three radiation doses in SHAP analysis and the lasso regressions may be because of the shrinkage function in lasso regression, which means the highly correlated but less important features may not be selected in the regression model. Therefore, we did a proof-of-concept analysis in paired patients for these three radiation doses, as illustrated in the method section and shown in Figure 1C.
As shown in Figure 5, after controlling for the main features, the mean heart dose is higher in the patients without lymphopenia events (both P values < 0.05). In comparison, the maximum heart dose and V20 of the ipsilateral lung are significantly higher in the patients with lymphopenia events (all P values < 0.05). These results are all consistent with the SHAP value analysis, indicating that our XGboost models and directional mean absolute SHAP values rigorously revealed the features' importance and their directional contribution to lymphopenia, especially those features were contradictory in univariate analysis or those features were never included in lasso regressions because of being masked by the highly correlated major features.

DISCUSSION
To best of our knowledge, our study is the first to describe the diverse potential determinants, especially including the complex radiation dose, of radiation-induced lymphopenia in the patients with breast cancer by the machine learning algorithm (XGboost). Furthermore, XGboost and SHAP interpretation approach were combined to determine the predictive performances and the feature contributions in radiation-induced lymphopenia. We found that baseline lymphocyte counts protect against while the baseline hemoglobin level impact the event of radiation-induced lymphopenia; more importantly, we summarize some regularities between radiation dose and occurrence of lymphopenia, i.e. (1) lymphocytes are sensitive to an irradiation dose lower than 4Gy; (2) the irradiation volume plays a significantly greater role than the irradiation dose, in promoting lymphopenia; (3) the irradiation dose promotes the occurrence of lymphopenia when the irradiation volume is controlled. The protective role of baseline lymphocyte counts in radiation-induce lymphopenia is consistent to the common acknowledgement. However, there were little previous works have correlated the hemoglobin and lymphopenia in radiation therapy. It is known for decades that cancer-related hypoxia and anemia are associated with decrease in radiosensitivity of tumor cells (25). In another words, it can be hypothesized that hypoxia and anemia might decrease the radiosensitivity of lymphocytes which deserves more studies in the future. For the protective effects of platelets and monocytes, frequent platelet donation is usually associated with T-cell lymphopenia in platelet donation studies (26,27), which mentioned that possible casual correlations between platelet reduction and lymphopenia but the mechanisms were still unknown. The correlations between monocytes and lymphopenia were seldom studied, but macrophage, which differentiated from monocyte, leads to either radiosensitization or radioresistance depending on different tumor types or different radiation regimen studied, and various molecular players as NF-kB, MAPKs, p53, reactive oxygen species, and inflammasomes that have been involved in these processes (28), which might mention the monocyte reduction correlated to lymphopenia at some extend.
Considering the impact of irradiation dose on lymphopenia, the contribution of integral dose of the total body also has been presented in our previous work of EDIC formula containing integral dose of the total body resulted lymphopenia in 488 esophageal cancer patients (29); V5 of ipsilateral lung/bilateral lungs have been previously reported to impact lymphopenia in non-small cell lung cancer patients (30). Similar findings have been reported in patients with early-stage lung cancer, V10 to V50 were significantly negatively correlated with the decrease of absolute lymphocyte counts following radiation (31). In patients with pancreatic cancer, V10, V15, and V20 were significantly higher in patients with severe lymphopenia (32), and the same findings in patients with esophageal squamous cell carcinoma (33). For the contribution of irradiation volume at heart, our findings are consistent with our previous study in lung cancer patients, i.e. higher heart V5 (the heart volume in 5Gy) was significant with decrease in post-radiation lymphocyte counts (31). The results of these studies were all consistent with our findings.
We illustrated that the irradiation volume had a more significant lymphotoxic impact than the irradiation dose. The lymphocytes behave as mobile organs and circulate through the blood at about 1 cycle/min (34). Therefore, this may explain our novel findings that the higher the volume of radiotherapy dose delivered may result in a higher administration of radiotherapy to the circulating lymphocytes, thereby increasing the risk of radiation-induced lymphopenia.
The impact of chemotherapy on lymphopenia is known. Our findings is consistent with findings from study of Tolaney SM et al. who demonstrated that lymphopenia (grade 3 or grade 4) was associated with the combined use of adjuvant anthracycline and taxane regimens in three breast cancer patient cohorts (35). This study revealed that the taxane only chemotherapy seemed to have less risk on lymphopenia (OR 1.2, 95%CI 0.67-2.14, P-value>0.5, as shown in Table 1) in univariate analysis which was consistent with our previous study (36), similarly in multivariable analysis. Less lymphopenia effect from taxane monotherapy may be simply explained by the less damage from the combination of anthracycline and taxane. Of note, we previously noted that single-agent taxane treatment increased serum IL-2 levels in patients with advanced breast cancer (37). IL-2 serves as a cellcycle progression signal for T lymphocytes, stimulating their proliferation and differentiation. The underpinning biology driving chemotherapy-induced lymphopenia is not fully understood, and this study further highlights the unmet need for future studies.
There are some limitations in this study. First, we used an independent prospective cohort with different population and study protocols but the same staffs who assembled these two cohorts. Our findings are needed to be verified with external cohorts in the future. Secondly, it is well known that inflammation indicators are also important in immune responses including lymphopenia, while the Testing cohort in this study were established without inflammation indictors. This limitation could be addressed in subsequent studies by measuring inflammation indicator levels in serum.
In conclusion, in patients with breast cancer who underwent radiotherapy, we found that the baseline lymphocyte, platelet and monocyte play protective roles in lymphopenia; the usage of taxane results in less impact on lymphopenia than the combination of an anthracycline with a taxane; all radiation doses promote the occurrence of lymphopenia except the mean heart dose. Especially for the contributions of complicated radiation dose on lymphopenia, we draw three conclusions: 1) lymphocytes are sensitive to an irradiation dose lower than 4Gy; 2) the irradiation volume plays a more important role in promoting the occurrence of lymphopenia than the irradiation dose; 3) the irradiation dose promotes the lymphopenia occurrence when the irradiation volume is controlled. Higher than the dose's priority, irradiation volume should be kept as small as possible during the planning process to avoid radiation-induced lymphopenia as long as the target coverage is not compromised.

DATA AVAILABILITY STATEMENT
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

ETHICS STATEMENT
The studies involving human participants were reviewed and approved by the ethics committee of the University of Hong Kong-Shenzhen Hospital. The patients/participants provided their written informed consent to participate in this study.