Skip to main content

ORIGINAL RESEARCH article

Front. Oncol., 07 March 2024
Sec. Hematologic Malignancies

Prediction of tumor lysis syndrome in childhood acute lymphoblastic leukemia based on machine learning models: a retrospective study

Yao XiaoYao Xiao1Li XiaoLi Xiao1Yang ZhangYang Zhang2Ximing XuXiming Xu3Xianmin GuanXianmin Guan1Yuxia GuoYuxia Guo1Yali ShenYali Shen1XiaoYing LeiXiaoYing Lei1Ying DouYing Dou1Jie Yu*Jie Yu1*
  • 1Department of Hematology and Oncology, Children’s Hospital of Chongqing Medical University, National Clinical Research Center for Child Health and Disorders, Chongqing Key Laboratory of Pediatrics, Ministry of Education Key Laboratory of Child Development and Disorders, Chongqing, China
  • 2College of Medical Informatics, Chongqing Medical University, Chongqing, China
  • 3Big Data Engineering Center for Children’s Medical Care, Children’s Hospital of Chongqing Medical University, Chongqing, China

Background: Tumor lysis syndrome (TLS) often occurs early after induction chemotherapy for acute lymphoblastic leukemia (ALL) and can rapidly progress. This study aimed to construct a machine learning model to predict the risk of TLS using clinical indicators at the time of ALL diagnosis.

Methods: This observational cohort study was conducted at the National Clinical Research Center for Child Health and Disease. Data were collected from pediatric ALL patients diagnosed between December 2008 and December 2021. Four machine learning models were constructed using the Least Absolute Shrinkage and Selection Operator (LASSO) to select key clinical indicators for model construction.

Results: The study included 2,243 pediatric ALL patients, and the occurrence of TLS was 8.87%. A total of 33 indicators with missing values ≤30% were collected, and 12 risk factors were selected through LASSO regression analysis. The CatBoost model with the best performance after feature screening was selected to predict the TLS of ALL patients. The CatBoost model had an AUC of 0.832 and an accuracy of 0.758. The risk factors most associated with TLS were the absence of potassium, phosphorus, aspartate transaminase (AST), white blood cell count (WBC), and urea levels.

Conclusion: We developed the first TLS prediction model for pediatric ALL to assist clinicians in risk stratification at diagnosis and in developing personalized treatment protocols. This study is registered on the China Clinical Trials Registry platform (ChiCTR2200060616).

Clinical trial registration: https://www.chictr.org.cn/, identifier ChiCTR2200060616.

1 Introduction

Acute lymphoblastic leukemia (ALL) is a malignant tumor of the hematopoietic system caused by the abnormal proliferation of bone marrow T- or B-lineage lymphocytes, and it is the most common malignant disease in children, accounting for about one-third of all childhood malignancies (1). The incidence rate of ALL is about (3~5)/100,000 (2), and the age of onset is mostly before 15 years old (3, 4). The male-to-female ratio is about 1.2:1 (5). In recent years, chemotherapy based on risk factor stratification has significantly improved the prognosis of children with ALL. In developed countries, the 5-year event-free survival (EFS) of pediatric ALL can reach more than 85%, and the overall survival (OS) can reach more than 90% (1, 6, 7). In China, the 5-year EFS of pediatric ALL reaches 80%, and the OS reaches more than 85% (8). However, about 20% of ALL patients face treatment failure, mainly due to recurrence, secondary tumors, chemotherapy toxicity, or severe complications (9). Several studies have reported that 30% to 47% of ALL patients die from treatment-related deaths (8, 10, 11), with infection and bleeding being the main causes. Among these serious complications, tumor lysis syndrome (TLS) often occurs early after induction chemotherapy in patients with ALL, and most of them progress rapidly. Acute kidney injury (AKI), cardiac arrhythmia, seizures, and even death may occur at the time of diagnosis. An in-depth understanding of TLS is beneficial to optimize the efficacy of ALL and provide a reference for TLS prevention and treatment.

TLS is an oncologic emergency caused by the massive lysis of tumor cells and the release of large amounts of potassium, phosphate, and nucleic acids into the systemic circulation. Clinical manifestations are characterized by hyperuricemia, hyperkalemia, hyperphosphatemia, and hypocalcemia (12, 13). Standard treatment for TLS includes massive hydration, allopurinol, and labyrinthine for hyperuricemia, and renal replacement therapy may be required in severe cases (14) Clinical observations suggest that TLS tends to occur in highly proliferative malignancies with heavy tumor loads or sensitive responses to initial therapy, such as Burkitt’s lymphoma and ALL (15). Thus, TLS is a great challenge in pediatric ALL. The analysis of independent risk factors for the development of TLS in children with ALL in the early stage of induction chemotherapy based on laboratory tests and clinical presentation at the time of initial diagnosis and the construction of a predictive model are crucial for the timely adoption of stronger interventions and close monitoring of high-risk patients to improve patient survival.

Currently, machine learning is widely used in the fields of disease diagnosis, prediction of prognosis, design of treatment plans, and individualized healthcare (16). Previous studies have reported the prediction models for the occurrence of TLS in AML in adults and ALL in children abroad (1719). However, machine learning-based TLS prediction models for childhood ALL have not been reported at home or abroad. Relying on the clinical research big data platform of the National Clinical Medical Research Center for Children’s Health and Diseases, this study intends to construct a machine learning model to predict the probability of TLS in ALL patients by analyzing the clinical characteristics of real-world patients with ALL to provide a basis for decision-making for the early identification and intervention of TLS.

2 Materials and methods

2.1 Study population and design

This observational retrospective cohort study was conducted at the Children’s Hospital of Chongqing Medical University, one of two national clinical research centers for child health and disease in China that completed the Clinical Science Research Big Data Platform (CSRBDP) in 2021, which is available to clinical researchers and includes more than 750,000 pediatric outpatients and inpatients as of January 2022. To establish and validate the prediction model, the subjects were randomly divided into a training set (n = 1,570) and a validation set (n = 673) in a 7:3 ratio. The inclusion criteria were as follows: (1) age between 0 and 18 years; (2) diagnosis made between December 2008 and December 2021; (3) newly diagnosed ALL patients or out-of-hospital-diagnosed patients who did not receive steroids and whose bone marrow smear showed a ratio of primitive/naïve lymphocytes ≥20%. The exclusion criteria were as follows: (1) patients with secondary tumors or immunodeficiency diseases and (2) patients who did not have uric acid (UA), potassium, phosphorus, and calcium indexes detected within the 3 days before and 7 days after baseline time, where the baseline time was defined as the time of the first steroid administration at the time of hospitalization.

The study followed the Transparent Reporting of Individual Prognostic or Diagnostic Multivariate Predictive Models (TRIPOD) specifications for model development and validation (20). Multidimensional clinical data were collected from the CSRBDP, and the study outcome was whether TLS occurred after chemotherapy. The study was registered on the Chinese clinical trial registration platform (clinicaltrials.gov, identifier: ChiCTR2200060616) and was approved by the Ethics Committee of the Children’s Hospital of Chongqing Medical University with a waiver of the informed consent requirement (File No. 2022,98). Figure 1 presents an overview of the study design.

Figure 1
www.frontiersin.org

Figure 1 Machine learning model construction process.

2.2 Data collection

A total of 33 clinically relevant indicators of ALL were collected in this study, including demographic information (e.g., age at diagnosis, gender), laboratory indicators (e.g., blood counts, liver and kidney function, electrolytes, and coagulation), and the type of steroids used at the time of the initial induction chemotherapy. Clinical and laboratory data were collected from all visits, starting with the patient’s initial diagnosis of ALL. In cases where this information was not available, data collected during the patient’s hospital stay were used instead.

2.3 Chemotherapy regimen

The chemotherapy regimen is based on international and national treatment protocols for childhood ALL (2123). It stratifies chemotherapy according to the risk level (low risk, intermediate risk, and high risk). The main chemotherapeutic regimens include window-phase treatment, VDLP(Pred+VCR+DNR+L-ASP/PEG-ASP) induction chemotherapy, CAM(CTX+Ara-C+6-MP) early intensive therapy, high-dose MTX (HDMTX) consolidation therapy, reinduction therapy, and maintenance chemotherapy. TLS mainly occurs during the window-phase treatment and induction of remission therapy phases of the initial treatment. The main drugs used in remission-induction therapy include steroids (Pred/Dex), VCR, DNR, L-ASP/PEG-ASP, and intrathecal injections (steroids, MTX, Ara-C). BCR-ABL1-positive children are given chemotherapy combined with tyrosine kinase inhibitors (TKI).

2.4 Prevention of TLS

Routine preventive measures for TLS include the following: (1) Hydration therapy: adequate hydration therapy and maintaining the balance of fluid and output is the most important measure in the prevention and treatment of TLS (24). The recommended dose of hydration solution for children is 2,000~3,000 mL/(m2/day), and for infants with a body mass less than 10 kg, it is 200 mL/(kg/day) (25). Infants should maintain a urine output of more than 4 mL/(kg/h), while children (excluding infants) should have a urine output of more than 100 mL/(m2/h) to ensure proper renal perfusion and excretion of related metabolites (19). Hourly monitoring of urine output is essential for hydrated children to detect and treat circulatory overload promptly to avoid circulatory failure. If urine output is insufficient, non-thiazide diuretics (such as furosemide 0.5–1.0 mg/kg) can be used for diuresis, although they may impair renal function and promote the deposition of uric acid in the kidneys (26). (2) Correction of hyperuricemia: treatment options for hyperuricemia include xanthine oxidase inhibitors (such as allopurinol) and uric acid oxidase (such as acrylate). Allopurinol may be the initial drug of choice for low-risk TLS and intermediate-risk patients with normal serum uric acid levels. In the absence of high serum uric acid levels, prophylactic treatment with allopurinol is recommended for patients at intermediate risk for TLS, starting 24–48 h before chemotherapy initiation with a routine oral dosage of 10 mg/(kg/day) in three divided doses (the maximum daily dose should not exceed 800 mg). Meanwhile, diuretics should be discontinued, and the dosage of drugs such as cyclosporine should be adjusted. Patients with G-6-PD negative can be given uric acid oxidase at a dosage of 1.5 mg every other day until WBC is less than 50 × 109/L, LDH is within two times the normal upper limit, and UA levels are normal. (3)Symptomatic support: correcting electrolyte disturbances and hemodialysis in emergencies can be used to treat oliguric renal failure or life-threatening metabolic disorders.

2.5 Definition of TLS

The recent international guidelines for the diagnosis of TLS include the ASCO guidelines of 2008 (27), the 2010 International Expert Consensus (25), and the 2015 BCSH guidelines (28). These guidelines primarily apply to adults. In 2021, the Pediatric Tumor Committee of the Chinese Anti-Cancer Association published guidelines for the diagnosis and treatment of TLS in Chinese children. The 2004 Cairo–Bishop definition has been widely adopted by international guidelines (29). According to the Cairo–Bishop criteria, laboratory TLS (LTLS) is diagnosed when there is a specified absolute or 25% increase/decrease from baseline in serum levels of UA, potassium, phosphorus, and/or calcium. The specified absolute serum levels are as follows: uric acid, ≥8 mg/dL; potassium, ≥6 mmol/L; phosphorus, ≥6.5 mg/dL in children and ≥4.5 mg/dL in adults; and calcium, ≤7 mg/dL. TLS is diagnosed when two or more of the proposed criteria are met, either 3 days before or 7 days after chemotherapy administration, regardless of hydration status and the use of hyperuricemic agents. Clinical TLS (CTLS) is defined as the presence of LTLS and at least one complication associated with TLS, without any other identifiable cause, such as an increase in serum creatinine (Crea) of ≥1.5 times the upper limit of normal (ULN), cardiac arrhythmia, sudden death, hand and foot cramps, or seizures.

2.6 Model development

In this study, indicators with missing rates >30% were excluded, and for indicators with missing rates ≤30%, the Random Forest-based iterative interpolation method (Miss Forest) (30) was used to fill in the missing values. The Least Absolute Shrinkage and Selection Operator (LASSO) was applied to identify influential clinical variables (predictor variables with p < 0.05) to remove irrelevant and redundant information, select the best predictor variables, and improve the predictive power of machine learning models.

Random stratified sampling divided the patients into training and validation sets in a ratio of 7:3, with the former being used to train the machine learning models and the latter being used to evaluate the model performance. Four machine learning models were constructed based on the training set, including Support Vector Machine (SVM), Random Forest (RF), CatBoost Classifier (CatBoost), and Logistic Regression (Logistic). The training set was preprocessed using random undersampling combined with the SMOTE oversampling technique to deal with sample imbalance. Hyperparameter tuning of machine learning models uses fivefold cross-validation grid search, and model construction in the training set uses fivefold cross-validation (31). The best-performing model was chosen for risk prediction based on its combined metrics of AUC, recall, specificity, and accuracy. The predictive power of the best machine learning models was explained using Shapley additive explanations (SHAP) (32).

2.7 Statistical methods

Statistical analyses were performed using R (version 4.3.0), while predictive model construction and evaluation were conducted using Python (version 3.7). Continuous variables were assessed for normality using the Shapiro–Wilk test, and nonnormal continuous variables were presented as median with interquartile range (IQR). Categorical variables were presented as frequencies and percentages (n, %). The Mann–Whitney U-test was used for continuous variables, and the Chi-square test was used for categorical variables to compare the differences in variable distribution between the training and validation cohorts. Statistical significance was defined as a two-sided p-value < 0.05.

3 Results

3.1 General patient characteristics

A total of 2,243 patients diagnosed with ALL were included in this study, including 1,332 male (59%) and 911 female (41%) patients, with an incidence of TLS of 8.87% (199/2243). A total of 33 metrics were analyzed after excluding metrics with missing values >30%. Table 1 presents a comparison of clinical and laboratory characteristics between patients in the training and validation cohorts at diagnosis. The results indicate that there were no significant differences between the two cohorts (p > 0.05). We conducted a comparison of the basic characteristics of patients with or without the TLS. The analysis revealed that gender, immunophenotypes, French–American–British classification systems (FAB), white blood cell count (WBC), creatinine (Crea), urea, phosphorus, potassium, uric acid (UA), lactate dehydrogenase (LDH), aspartate transaminase (AST), alanine transaminase (ALT), total bilirubin (TBil), fibrinogen (Fib), prothrombin time (PT), primitive immature cells, the occurrence of hemorrhage, cardiac arrhythmia, AKI, and splenomegaly had a statistically significant relationship with the occurrence of TLS (p < 0.05), as shown in Supplementary Table S1.

Table 1
www.frontiersin.org

Table 1 Comparison of demographic and clinical indicators in the training cohort and validation cohort of patients with ALL.

3.2 Feature screening

In the LASSO regression for feature screening, 33 candidate predictor variables were included. By adjusting λ, the number of variables was reduced, which resulted in a simplified model. The number of clinical characteristics of pediatric ALL patients decreased continuously with increasing log (λ) (Figure 2A). By performing 10-fold cross-validation to find the optimal λ value, LASSO selected 12 variables with nonzero regression coefficients at λ = 0.007 and log (λ) = −2.176 (Figure 2B). The 12 predictor variables that were ultimately included in the follow-up study were FAB type, WBC, phosphorus, calcium, potassium, UA, AST, blood glucose, occurrence of infection, AKI, cardiac arrhythmia, and the type of steroid used in initial induction chemotherapy.

Figure 2
www.frontiersin.org

Figure 2 Screening of variables based on Lasso regression. (A) The variation characteristics of the coefficient of variables; (B) the selection process of the optimum value of the parameter λ in the Lasso regression model by cross-validation method.

3.3 Model development and screening

The optimal parameters for developing the TLS prediction model for ALL in children through grid search and cross-validation are presented in Supplementary Table S2. Four postscreening prediction models that included 12 variables using LASSO regression feature screening were included. Among the four predictive models constructed after LASSO regression screening, the CatBoost model has the best prediction with AUC = 0.832 (95% CI: 0.810–0.854) (Figure 3A), while the Logistic model has AUC = 0.812 (95% CI: 0.777–0.847) (Figure 3B); the AUC = 0.829 (95% CI: 0.807–0.851) for the Random Forest model (Figure 3C); AUC = 0.815 (95% CI: 0.782–0.848) for the SVM model (Figure 3D). To comprehensively evaluate the prediction performance of each model, the accuracy, specificity, recall, and Brier scores of the four models were also calculated in this study. Table 2 shows the performance parameters of the four models constructed after LASSO feature screening. The CatBoost model has an accuracy of 0.758, a recall of 0.684, and a specificity of 0.825. After the screening of the LASSO regression features, we finally selected the CatBoost model as the optimal model, and the optimal model, CatBoost, was evaluated in the validation set, with an AUC of 0.803 (95% CI: 0.735–0.865, accuracy: 0.807, recall: 0.667, specificity: 0.821(Figure 4).

Figure 3
www.frontiersin.org

Figure 3 Four machine learning-based predictive model ROC curves after feature screening. (A) CatBoost; (B) Logistic; (C) Random Forest; (D) SVM.

Table 2
www.frontiersin.org

Table 2 Model performance (after feature screening).

Figure 4
www.frontiersin.org

Figure 4 The optimal model, CatBoost ROC curves, in the validation set.

3.4 Interpretation of personalized predictions

To visually explain the selected variables, this study used SHAP to illustrate how these variables affect TLS in the model. The feature importance ranking of the CatBoost is shown in Supplementary Figure S1. The factors most associated with TLS were potassium, phosphorus, AST, WBC, and UA.

In this study, an analysis was conducted to enhance the interpretability of the CatBoost model by importing the SHAP package into the Tree-Explainer class. Figure 5A displays the importance ranking of 12 risk factors evaluated by the average absolute SHAP value, while Figure 5B shows the distribution of the impact of each feature on the model output results. The feature ordering (y-axis) indicates the importance of the predictors, and the SHAP value (x-axis) is a unity index showing the impact of a particular variable on the model output results. Each dot in each row represents a patient, and the color of the dot represents the feature value: red for higher values and blue for lower values. Red and blue bars represent risk factors and protective factors, respectively; the longer bars indicate more significant feature importance. We observe that potassium has the most critical influence, with lower potassium exhibiting a positive SHAP value (dots extending to the right becoming increasingly blue) and higher blood potassium exhibiting a negative SHAP value (dots extending to the left becoming increasingly red). This suggests that when potassium concentrations are very low, it increases the risk of TLS in children. The opposite is true for WBC—the higher the WBC, the higher the predictive value for TLS. The distribution of dots can also provide valuable insights, such as in the case of UA, where we can see a dense cluster of low UA (blue dots) associated with a negative SHAP value. The instances of high uric acid (red dots) extend further to the right, indicating that high UA has a greater positive impact on the occurrence of TLS than low UA.

Figure 5
www.frontiersin.org

Figure 5 SHAP interpretation of the CatBoost model. (A) Importance ranking of features that have an impact on model prediction. (B) Impact of each feature on model prediction; in each row, each dot represents a patient, and the color of the dot represents the feature value: red represents a larger value, blue represents a lower value. (C, D) Individualized predictions of the model for two patients.

Supplementary Figure S1 displays the SHAP scatter dependency plots for the first five characteristics, revealing considerable variation in the relationship between the SHAP values and the values of the variables within each characteristic. Potassium and phosphorus show a negative, nearly linear trend across the range of values. The SHAP values for white blood cell count, uric acid, and glutamic transaminase are comparable over a range of values but then increase sharply. When potassium = 3.68 mmol/L, phosphorus = 1.49 mmol/L, AST = 33.6 U/L, WBC = 16.6*109, and UA = 512 µmol/L, the resulting SHAP value is 0. The greater the SHAP value, the higher the probability of TLS occurrence.

To illustrate the interpretability of the model, we present two typical examples: Figure 5C represents a pediatric ALL patient with phosphorus = 1.5 mmol/L, AST = 34.2 U/L, potassium = 4.52 mmol/L, WBC = 4.29*109, and UA = 153.1 mmol/L. The CatBoost model predicts a SHAP value of −1.05 for this patient, with a 25.92% probability of developing TLS. Figure 5D represents a pediatric ALL patient with AST = 249.0 U/L, WBC = 128.06*109, phosphorus = 0.36 mmol/L, and potassium = 4.38 mmol/L. Under the prediction of the CatBoost model, this patient has a SHAP value of 1.68 and an 84.29% probability of developing TLS.

4 Discussion

This study is the first to use machine learning algorithms to identify risk factors for the development of TLS in pediatric ALL patients and to construct machine learning models to assess the probability of TLS in these patients. Several studies have shown that models based on machine learning algorithms perform better than predictive models using traditional logistic regression. In 2021, Alabi et al. developed a machine-learning model to predict overall survival in tongue cancer, and when compared to traditional logistic regression using an externally validated dataset, the machine-learning model had an accuracy of 88.7%, compared to logistic regression’s accuracy of 60.4% (33). In 2022, Zhao et al. developed a machine learning-based RF model to predict the maximum creatinine value within 24 h after diagnosis of AKI patients as an independent predictor of renal function prognosis, and the RF model showed better ability in predicting the prognosis of patients with AKI compared with traditional regression models (34). Although several studies have examined the risk factors for TLS in children with ALL (35, 36), traditional logistic prediction models based on a single center for the occurrence of TLS in adults with AML and children with ALL have also been established in previous studies abroad, but machine learning-based risk models have not been reported.

TLS is the most common disease-related hematologic cancer-critical illness in children. TLS is most commonly seen in fast-growing and chemotherapy-sensitive malignancies, such as non-Hodgkin’s lymphoma, acute nonlymphoblastic leukemia, and acute lymphoblastic leukemia among hematologic neoplasms, as well as in certain solid tumors of epithelial origin, such as small-cell lung carcinomas, advanced breast carcinomas, and neural tube cell tumors (15). The incidence and prevalence of TLS are not well defined because they depend on tumor type, treatment regimen, patient-related risk factors, and preventive measures taken. Most epidemiologic data, mostly from the 1990s, show that up to 70% of children with acute leukemia present with LTLS (37) and <10% had clinical manifestations of LTLS (19, 38, 39). Early experience has led to rigorous prophylaxis and step-dosing treatment strategies that have significantly reduced the incidence of TLS (40). However, once TLS occurs, the consequences can be fatal. A retrospective study of patients with TLS from 2010 to 2013 found that the majority (58%) of patients developed acute renal failure, often with comorbid infections and sepsis, with a very low rate of seizures (1%), and an overall mortality rate of 21 (13). The most common causes of death in patients with clinical TLS are hemorrhage and renal failure (19, 27, 41). Although TLS poses a challenge to treatment, it is generally curable for most patients with TLS through early detection and the use of appropriately designed prevention programs. Regarding risk factors for developing TLS, known reports include disease type, tumor volume (assessed by solid tumor size, LDH, or white blood cell count), renal insufficiency, and dehydration status (42).

In this study, the AUC of the CatBoost model constructed with 12 predictor variables screened by logistic LASSO regression was 0.803 (95% CI: 0.735–0.865). The factors used to construct the model included FAB typing, white blood cell count at initial diagnosis, blood phosphorus, blood calcium, blood potassium, uric acid, glutamic acid aminotransferase, blood glucose, the occurrence of infections, acute kidney injury, cardiac arrhythmia, and the type of steroid used in the initial induction chemotherapy. Previous studies have shown that TLS occurs most frequently in patients with acute lymphoblastic leukemia type L3, high initial white blood cell counts, high uric acid levels, and renal insufficiency (43, 44), which is the same as the results of our analysis. A few patients with their own high baseline blood phosphorus, calcium, potassium, and uric acid levels were prone to spontaneous tumor lysis syndrome (45). In the present study, we found for the first time that the patients’ glutamine aminotransferase, blood glucose, the occurrence of infection, acute kidney injury, cardiac arrhythmia, and the type of steroid used at the time of the initial induction chemotherapy were associated with the development of TLS at the time of the initial diagnosis.

It is well known that the typical clinical manifestations of TLS are hyperuricemia, hyperkalemia, hyperphosphatemia, and hypocalcemia (25, 46, 47). However, when we used SHAP to interpret the optimal CatBoost model, the results suggested that lower potassium and lower phosphorous were more likely to occur in TLS. On the one hand, the data on potassium and phosphorus we collected were at the time of the diagnosis of ALL, when the patients had not yet started chemotherapy, and a large number of tumor cells sensitive to the response to initial chemotherapy may not have yet cracked in large numbers, and a very small number of patients with spontaneous TLS were outside the scope of our study. On the other hand, patients with initial ALL often present with fever, mouth ulcers, sore throat, chest pain, cough, abdominal pain, diarrhea, skin petechiae and ecchymosis, bleeding gums, nosebleeds, nausea, vomiting, pallor, dizziness, and shortness of breath. At this time, the patients often have poor appetite (48), and the body’s potassium and phosphorus are mainly obtained through plant and animal foods. The most frequent electrolyte abnormality with ALL patients was hypokalemia. Furthermore, hypokalemic patients more frequently experienced concurrent electrolyte disturbances (i.e., hyponatremia, hypocalcemia, hypophosphatemia, and hypomagnesemia) (49). Therefore, low potassium and low phosphorus are consistent with the state of the patients at the time of initial diagnosis.

Arrhythmia refers to an abnormality in the frequency, rhythm, site of origin, conduction velocity, or order of excitation of the heart’s impulses (50). We retrieved all ECG reports for all patients at the time of the initial ALL visit, and the results showed that the incidence of arrhythmia in 2,243 patients was 28.89% (648/2,243), of which the incidence of arrhythmia in patients with TLS was 37.19% (74/199), and the incidence of arrhythmia in patients without TLS was 28.08% (574/2,044). Interestingly, existing studies have found that patients with TLS always have electrolyte and metabolic disturbances, and it can progress to clinical toxic effects, including cardiac arrhythmias due to hyperkalemia and hypocalcemia (51). In our study, a significant proportion of patients with a diagnosis of ALL already had arrhythmias at the time of diagnosis, regardless of whether or not TLS occurred after starting chemotherapy.

However, this study has several limitations. Firstly, the predictive model was constructed based on a retrospective study conducted in a single center in China. Therefore, its predictive validity needs to be further verified using external data, particularly large-sample cohort studies involving multiple centers, different regions, and diverse ethnicities. Secondly, the MissForest algorithm used in this study has a missing value limit of <30% for interpolating mixed missing data into good data. As a result, variables with missing values exceeding 30% (such as BMI and calcitoninogen) were not included in the analysis. Thirdly, the analysis did not consider variables such as mutations in specific genes due to the high cost and low prevalence of genetic testing in the early stages. Despite these limitations, this study represents the first attempt to develop a machine-learning model for predicting the risk of TLS in children with ALL in China.

5 Conclusion

This study examined 12 risk factors associated with the development of TLS using data from ALL patients. A risk prediction model was established based on these factors. The study results demonstrated the good predictive ability of the CatBoost model. The CatBoost model can be incorporated into a clinical decision support system to assist clinicians in making effective diagnoses and treatment decisions. This can help prevent or delay the occurrence of TLS in ALL patients, particularly high-risk children, thereby reducing the economic burden on patients and society. Additionally, the use of TLS risk alerts during clinic visits for ALL patients can minimize missed diagnoses.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.

Ethics statement

The studies involving humans were approved by The ethics committee of the Children’s Hospital of Chongqing Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because The project was a retrospective study utilizing medical records obtained from previous clinical consultations, without unnecessary risk to the subjects, and informed consent was agreed to be waived with a commitment not to disclose patient privacy.

Author contributions

YX: Writing – original draft. LX: Writing – original draft. YZ: Writing – original draft. XX: Writing – review & editing. XG: Writing – review & editing. YG: Writing – review & editing. YS: Writing – review & editing. XL: Writing – review & editing. YD: Writing – review & editing. JY: Writing – review & editing.

Funding

The author(s) declare financial support was received for theresearch, authorship, and/or publication of this article. This work was supported by the Intelligence Medicine Project of Chongqing Medical University (YJSZHYX202103), the personalized treatment strategy based on individualized therapy to guide the diagnosis and treatment of relapsed acute lymphoblastic leukemia in children of Children’s Hospital of Chongqing Medical University (LY02026), and Multicenter randomized controlled clinical trial comparing dexamethasone and prednisone in induced remission in children with intermediate- and high-risk acute lymphoblastic leukemia of Children’s Hospital of Chongqing Medical University (LY03006).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2024.1337295/full#supplementary-material

References

1. Hunger SP, Mullighan CG. Acute lymphoblastic leukemia in children. N Engl J Med. (2015) 373:1541–52. doi: 10.1056/NEJMra1400972

CrossRef Full Text | Google Scholar

2. National Cancer Institute. SEER cancer statistics review 1975-2017. [EB/OL]. (2020). Available at: https://seer.cancer.gov/csr/1975_2017/browse_csr.php?sectionSEL=28&pageSEL=sect_28_table.08.

Google Scholar

3. Hunger SP, Lu X, Devidas M, Camitta BM, Gaynon PS, Winick NJ, et al. Improved survival for children and adolescents with acute lymphoblastic leukemia between 1990 and 2005: a report from the children's oncology group. J Clin Oncol. (2012) 30:1663–9. doi: 10.1200/jco.2011.37.8018

CrossRef Full Text | Google Scholar

4. Bhojwani D, Yang JJ, Pui CH. Biology of childhood acute lymphoblastic leukemia. Pediatr Clinics North America. (2015) 62:47–60. doi: 10.1016/j.pcl.2014.09.004

CrossRef Full Text | Google Scholar

5. Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer statistics, 2021. CA Cancer J Clin. (2021) 71:7–33. doi: 10.3322/caac.21654

CrossRef Full Text | Google Scholar

6. Teachey DT, Pui CH. Comparative features and outcomes between paediatric T-cell and B-cell acute lymphoblastic leukaemia. Lancet Oncol. (2019) 20:e142–54. doi: 10.1016/s1470-2045(19)30031-2

CrossRef Full Text | Google Scholar

7. Pui CH, Nichols KE, Yang JJ. Somatic and germline genomics in paediatric acute lymphoblastic leukaemia. Nat Rev Clin Oncol. (2019) 16:227–40. doi: 10.1038/s41571-018-0136-6

CrossRef Full Text | Google Scholar

8. Cui L, Li ZG, Chai YH, Yu J, Gao J, Zhu XF, et al. Outcome of children with newly diagnosed acute lymphoblastic leukemia treated with CCLG-ALL 2008: The first nation-wide prospective multicenter study in China. Am J Hematol. (2018) 93:913–20. doi: 10.1002/ajh.25124

CrossRef Full Text | Google Scholar

9. Oskarsson T, Söderhäll S, Arvidson J, Forestier E, Montgomery S, Bottai M, et al. Relapsed childhood acute lymphoblastic leukemia in the Nordic countries: prognostic factors, treatment and outcome. Haematologica. (2016) 101:68–76. doi: 10.3324/haematol.2015.131680

CrossRef Full Text | Google Scholar

10. O'Connor D, Bate J, Wade R, Clack R, Dhir S, Hough R, et al. Infection-related mortality in children with acute lymphoblastic leukemia: an analysis of infectious deaths on UKALL2003. Blood. (2014) 124:1056–61. doi: 10.1182/blood-2014-03-560847

CrossRef Full Text | Google Scholar

11. Pui CH, Pei D, Campana D, Cheng C, Sandlund JT, Bowman WP, et al. A revised definition for cure of childhood acute lymphoblastic leukemia. Leukemia. (2014) 28:2336–43. doi: 10.1038/leu.2014.142

CrossRef Full Text | Google Scholar

12. Criscuolo M, Fianchi L, Dragonetti G, Pagano L. Tumor lysis syndrome: review of pathogenesis, risk factors and management of a medical emergency. Expert Rev Hematol. (2016) 9:197–208. doi: 10.1586/17474086.2016.1127156

CrossRef Full Text | Google Scholar

13. Durani U, Shah ND, Go RS. In-hospital outcomes of tumor lysis syndrome: A population-based study using the national inpatient sample. oncologist. (2017) 22:1506–9. doi: 10.1634/theoncologist.2017-0147

CrossRef Full Text | Google Scholar

14. Barbar T, Jaffer Sathick I. Tumor lysis syndrome. Adv chronic Kidney Dis. (2021) 28:438–446.e1. doi: 10.1053/j.ackd.2021.09.007

CrossRef Full Text | Google Scholar

15. Howard SC, Jones DP, Pui CH. The tumor lysis syndrome. N Engl J Med. (2011) 364:1844–54. doi: 10.1056/NEJMra0904569

CrossRef Full Text | Google Scholar

16. Beam AL, Kohane IS. Big data and machine learning in health care. Jama. (2018) 319:1317–8. doi: 10.1001/jama.2017.18391

CrossRef Full Text | Google Scholar

17. Mato AR, Riccio BE, Qin L, Heitjan DF, Carroll M, Loren A, et al. A predictive model for the detection of tumor lysis syndrome during AML induction therapy. Leuk Lymphoma. (2006) 47:877–83. doi: 10.1080/10428190500404662

CrossRef Full Text | Google Scholar

18. Truong TH, Beyene J, Hitzler J, Abla O, Maloney AM, Weitzman S, et al. Features at presentation predict children with acute lymphoblastic leukemia at low risk for tumor lysis syndrome. Cancer. (2007) 110:1832–9. doi: 10.1002/cncr.22990

CrossRef Full Text | Google Scholar

19. Montesinos P, Lorenzo I, Martín G, Sanz J, Pérez-Sirvent ML, Martínez D, et al. Tumor lysis syndrome in patients with acute myeloid leukemia: identification of risk factors and development of a predictive model. Haematologica. (2008) 93:67–74. doi: 10.3324/haematol.11575

CrossRef Full Text | Google Scholar

20. Collins GS, Reitsma JB, Altman DG, Moons KG. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ (Clinical Res ed.). (2015) 350:g7594. doi: 10.1136/bmj.g7594

CrossRef Full Text | Google Scholar

21. Brown PA, Shah B, Advani A, Aoun P, Boyer MW, Burke PW, et al. Acute lymphoblastic leukemia, version 2.2021, NCCN clinical practice guidelines in oncology. J Natl Compr Cancer Network JNCCN. (2021) 19:1079–109. doi: 10.6004/jnccn.2021.0042

CrossRef Full Text | Google Scholar

22. Yeoh AE, Tan D, Li CK, Hori H, Tse E, Pui CH. Management of adult and paediatric acute lymphoblastic leukaemia in Asia: resource-stratified guidelines from the Asian Oncology Summit 2013. Lancet Oncol. (2013) 14:e508–23. doi: 10.1016/s1470-2045(13)70452-2

CrossRef Full Text | Google Scholar

23. Tang J, Yu J, Cai J, Zhang L, Hu S, Gao J, et al. Prognostic factors for CNS control in children with acute lymphoblastic leukemia treated without cranial irradiation. Blood. (2021) 138:331–43. doi: 10.1182/blood.2020010438

CrossRef Full Text | Google Scholar

24. Jones DP, Mahmoud H, Chesney RW. Tumor lysis syndrome: pathogenesis and management. Pediatr Nephrol (Berlin Germany). (1995) 9:206–12. doi: 10.1007/bf00860751

CrossRef Full Text | Google Scholar

25. Cairo MS, Coiffier B, Reiter A, Younes A. Recommendations for the evaluation of risk and prophylaxis of tumour lysis syndrome (TLS) in adults and children with Malignant diseases: an expert TLS panel consensus. Br J Haematol. (2010) 149:578–86. doi: 10.1111/j.1365-2141.2010.08143.x

CrossRef Full Text | Google Scholar

26. Worcester EM, Coe FL. Clinical practice. Calcium kidney stones. N Engl J Med. (2010) 363:954–63. doi: 10.1056/NEJMcp1001011

CrossRef Full Text | Google Scholar

27. Coiffier B, Altman A, Pui CH, Younes A, Cairo MS. Guidelines for the management of pediatric and adult tumor lysis syndrome: an evidence-based review. J Clin Oncol. (2008) 26:2767–78. doi: 10.1200/jco.2007.15.0177

CrossRef Full Text | Google Scholar

28. Jones GL, Will A, Jackson GH, Webb NJ, Rule S. Guidelines for the management of tumour lysis syndrome in adults and children with haematological Malignancies on behalf of the British Committee for Standards in Haematology. Br J Haematol. (2015) 169:661–71. doi: 10.1111/bjh.13403

CrossRef Full Text | Google Scholar

29. Cairo MS, Bishop M. Tumour lysis syndrome: new therapeutic strategies and classification. Br J Haematol. (2004) 127:3–11. doi: 10.1111/j.1365-2141.2004.05094.x

CrossRef Full Text | Google Scholar

30. Stekhoven DJ, Bühlmann P. MissForest–non-parametric missing value imputation for mixed-type data. Bioinf (Oxford England). (2012) 28:112–8. doi: 10.1093/bioinformatics/btr597

CrossRef Full Text | Google Scholar

31. Lorman V, Razzaghi H, Song X, Morse K, Utidjian L, Allen AJ, et al. A machine learning-based phenotype for long COVID in children: An EHR-based study from the RECOVER program. PloS One. (2023) 18:e0289774. doi: 10.1371/journal.pone.0289774

CrossRef Full Text | Google Scholar

32. Xue B, Li D, Lu C, King CR, Wildes T, Avidan MS, et al. Use of machine learning to develop and evaluate models using preoperative and intraoperative data to identify risks of postoperative complications. JAMA network Open. (2021) 4:e212240. doi: 10.1001/jamanetworkopen.2021.2240

CrossRef Full Text | Google Scholar

33. Alabi RO, Mäkitie AA, Pirinen M, Elmusrati M, Leivo I, Almangush A. Comparison of nomogram with machine learning techniques for prediction of overall survival in patients with tongue cancer. Int J Med Inf. (2021) 145:104313. doi: 10.1016/j.ijmedinf.2020.104313

CrossRef Full Text | Google Scholar

34. Zhao X, Lu Y, Li S, Guo F, Xue H, Jiang L, et al. Predicting renal function recovery and short-term reversibility among acute kidney injury patients in the ICU: comparison of machine learning methods and conventional regression. Renal failure. (2022) 44:1326–37. doi: 10.1080/0886022x.2022.2107542

CrossRef Full Text | Google Scholar

35. Xue Y, Chen J, Gao S, Zhai X, Wang N, Gao J, et al. Clinical characteristics of tumor lysis syndrome in childhood acute lymphoblastic leukemia. Sci Rep. (2021) 11:9656. doi: 10.1038/s41598-021-88912-2

CrossRef Full Text | Google Scholar

36. Kedar A, Grow W, Neiberger RE. Clinical versus laboratory tumor lysis syndrome in children with acute leukemia. Pediatr Hematol Oncol. (1995) 12:129–34. doi: 10.3109/08880019509029545

CrossRef Full Text | Google Scholar

37. Wolf G, Hegewisch-Becker S, Hossfeld DK, Stahl RA. Hyperuricemia and renal insufficiency associated with Malignant disease: urate oxidase as an efficient therapy? Am J Kidney Dis. (1999) 34:E20. doi: 10.1016/s0272-6386(99)70062-4

CrossRef Full Text | Google Scholar

38. Hande KR, Garrow GC. Acute tumor lysis syndrome in patients with high-grade non-Hodgkin's lymphoma. Am J Med. (1993) 94:133–9. doi: 10.1016/0002-9343(93)90174-N

CrossRef Full Text | Google Scholar

39. Annemans L, Moeremans K, Lamotte M, Garcia Conde J, van den Berg H, Myint H, et al. Incidence, medical resource utilisation and costs of hyperuricemia and tumour lysis syndrome in patients with acute leukaemia and non-Hodgkin's lymphoma in four European countries. Leuk Lymphoma. (2003) 44:77–83. doi: 10.1080/1042819021000054661

CrossRef Full Text | Google Scholar

40. Howard SC, Trifilio S, Gregory TK, Baxter N, McBride A. Tumor lysis syndrome in the era of novel and targeted agents in patients with hematologic Malignancies: a systematic review. Ann Hematol. (2016) 95:563–73. doi: 10.1007/s00277-015-2585-7

CrossRef Full Text | Google Scholar

41. Darmon M, Guichard I, Vincent F, Schlemmer B, Azoulay E. Prognostic significance of acute renal injury in acute tumor lysis syndrome. Leuk Lymphoma. (2010) 51:221–7. doi: 10.3109/10428190903456959

CrossRef Full Text | Google Scholar

42. Durani U, Hogan WJ. Emergencies in haematology: tumour lysis syndrome. Br J Haematol. (2020) 188:494–500. doi: 10.1111/bjh.16278

CrossRef Full Text | Google Scholar

43. Wössmann W, Schrappe M, Meyer U, Zimmermann M, Reiter A. Incidence of tumor lysis syndrome in children with advanced stage Burkitt's lymphoma/leukemia before and after introduction of prophylactic use of urate oxidase. Ann Hematol. (2003) 82:160–5. doi: 10.1007/s00277-003-0608-2

CrossRef Full Text | Google Scholar

44. Kishimoto K, Kobayashi R, Ichikawa M, Sano H, Suzuki D, Yasuda K, et al. Risk factors for tumor lysis syndrome in childhood acute myeloid leukemia treated with a uniform protocol without rasburicase prophylaxis. Leuk Lymphoma. (2015) 56:2193–5. doi: 10.3109/10428194.2014.991923

CrossRef Full Text | Google Scholar

45. Davidson MB, Thakkar S, Hix JK, Bhandarkar ND, Wong A, Schreiber MJ. Pathophysiology, clinical consequences, and treatment of tumor lysis syndrome. Am J Med. (2004) 116:546–54. doi: 10.1016/j.amjmed.2003.09.045

CrossRef Full Text | Google Scholar

46. Gertz MA. Managing tumor lysis syndrome in 2010. Leuk Lymphoma. (2010) 51:179–80. doi: 10.3109/10428190903488788

CrossRef Full Text | Google Scholar

47. Abu-Alfa AK, Younes A. Tumor lysis syndrome and acute kidney injury: evaluation, prevention, and management. Am J Kidney Dis. (2010) 55:S1–13; quiz S14-9. doi: 10.1053/j.ajkd.2009.10.056

CrossRef Full Text | Google Scholar

48. Clarke RT, Van den Bruel A, Bankhead C, Mitchell CD, Phillips B, Thompson MJ. Clinical presentation of childhood leukaemia: a systematic review and meta-analysis. Arch Dis Childhood. (2016) 101:894–901. doi: 10.1136/archdischild-2016-311251

CrossRef Full Text | Google Scholar

49. Milionis HJ, Bourantas CL, Siamopoulos KC, Elisaf MS. Acid-base and electrolyte abnormalities in patients with acute leukemia. Am J Hematol. (1999) 62:201–7. doi: 10.1002/(sici)1096-8652(199912)62:4<201::aid-ajh1>3.0.co;2-1

CrossRef Full Text | Google Scholar

50. Fu DG. Cardiac arrhythmias: diagnosis, symptoms, and treatments. Cell Biochem Biophysics. (2015) 73:291–6. doi: 10.1007/s12013-015-0626-4

CrossRef Full Text | Google Scholar

51. Bellinghieri G, Santoro D, Savica V. Emerging drugs for hyperphosphatemia. Expert Opin Emerging Drugs. (2007) 12:355–65. doi: 10.1517/14728214.12.3.355

CrossRef Full Text | Google Scholar

Keywords: machine learning, predictive modeling, acute lymphoblastic leukemia, tumor lysis syndrome, treatment toxicity

Citation: Xiao Y, Xiao L, Zhang Y, Xu X, Guan X, Guo Y, Shen Y, Lei X, Dou Y and Yu J (2024) Prediction of tumor lysis syndrome in childhood acute lymphoblastic leukemia based on machine learning models: a retrospective study. Front. Oncol. 14:1337295. doi: 10.3389/fonc.2024.1337295

Received: 12 November 2023; Accepted: 19 February 2024;
Published: 07 March 2024.

Edited by:

Massimiliano Bonifacio, University of Verona, Italy

Reviewed by:

Sezaneh Haghpanah, Shiraz University of Medical Sciences, Iran
Fatemeh Sadat Hosseini-Baharanchi, Iran University of Medical Sciences, Iran

Copyright © 2024 Xiao, Xiao, Zhang, Xu, Guan, Guo, Shen, Lei, Dou and Yu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jie Yu, eXVqaWVAaG9zcGl0YWwuY3FtdS5lZHUuY24=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.