- 1Pediatric Intensive Care Unit, Children’s Hospital of Nanjing Medical University, Nanjing, China
- 2Department of Pediatrics, The Second People’s Hospital of Changzhou Affiliated to Nanjing Medical University, Nanjing, China
- 3Department of Hematology, Children’s Hospital of Nanjing Medical University, Nanjing, China
- 4Department of Pharmacy, Children’s Hospital of Nanjing Medical University, Nanjing, China
- 5Jiangsu Key Laboratory of Children’s Major Disease Research, Children’s Hospital of Nanjing Medical University, Nanjing, China
Purpose: This study was to identify risk factors associated with delayed methotrexate (MTX) excretion in pediatric patients receiving high-dose MTX (HDMTX) therapy based on real-world data, and to develop and evaluate a predictive model.
Methods: Clinical data were retrospectively collected from 1,485 pediatric HDMTX chemotherapy cycles at the Children’s Hospital affiliated with Nanjing Medical University between 2021 and 2023. Key predictive variables were identified by Least Absolute Shrinkage and Selection Operator (LASSO) regression, Random Forest (RF), and Support Vector Machine Recursive Feature Elimination (SVM-RFE), and then incorporated into predictive models for MTX delayed excretion using Logistic Regression (LR), Naive Bayes (NB), Support Vector Machine (SVM), and Extreme Gradient Boosting (XGBoost). Bootstrap was employed to internally validate these models and identify the best-performing one, and then SHapley Additive exPlanations (SHAP) values were utilized to provide both global and local interpretations.
Results: Among the 1,485 pediatric HDMTX chemotherapy cycles, 26.1% were associated with delayed MTX excretion. Serum creatinine (Scr), total drug dose (Dose), alkaline phosphatase (ALP), creatine kinase (CK), blood urea nitrogen (Urea), gamma-glutamyl transferase (GGT), hemoglobin (HB), and height were identified as key predictors of delayed excretion. Internal validation showed that the XGBoost model performed best, with an accuracy of 0.780, an F1 score of 0.669, an area under the Receiver Operating Characteristic curve (AUROC) of 0.842, and a Brier score of 0.136. Decision Curve Analysis (DCA) also demonstrated favorable clinical utility. SHAP analysis revealed that Scr was the most important risk factor for delayed MTX excretion in the XGBoost model. This XGBoost model has been translated into a convenient tool to facilitate its utility in clinical settings.
Conclusion: The XGBoost model demonstrated good predictive performance and clinical utility for delayed MTX excretion in pediatric patients.
Introduction
Methotrexate (MTX) administered intravenously at doses of ≥500 mg/m2 or >20 mg/kg is defined as high-dose methotrexate (HDMTX) (2019). HDMTX significantly increases the serum concentration of MTX, allowing it to penetrate the blood-brain barrier and reach poorly vascularized solid tumors, which makes it widely used in first-line chemotherapy for diseases such as Acute Lymphocytic Leukemia (ALL) and Non-Hodgkin Lymphoma (NHL) (Prodduturi and Bierman, 2012). HDMTX therapy can improve the overall prognosis of patients with hematological malignancies; however, the MTX doses used are potentially lethal. While these doses inhibit tumor growth, they can also damage proliferating normal tissues, leading to toxic events such as acute liver and kidney injury, bone marrow suppression, mucositis, gastrointestinal issues, fever, and infections (2019). Studies have shown that delayed MTX excretion further increases the risk of these toxicity events (Yu et al., 2024).
Currently, most related studies suggest that age, body surface area, and drug dosage are the primary influencing factors for MTX excretion delay (Chen et al., 2020; Zhang et al., 2022; Yu et al., 2024). Additionally, creatinine clearance, drug interactions, and genetic polymorphisms also impact the in vivo clearance process of MTX (Faganel Kotnik et al., 2011; Kawase et al., 2016; Tran and Herrington, 2016; Umerez et al., 2017; Hamada et al., 2018; Hui et al., 2019; Ishizaki et al., 2020; Ueda et al., 2020; Yang et al., 2020; Ferracini et al., 2021; Ebid et al., 2022; Mao et al., 2022). However, there are several discrepancies among different studies, and many of these studies rely on traditional statistical methods to screen risk factors and build predictive models. These methods often suffer from poor sensitivity and specificity, resulting in fluctuating prediction outcomes and significant reliance on the researcher’s data analysis expertise.
This study was based on real-world data and employed machine learning methods to construct a more stable and predictive risk model. The primary aims were to explore the predictive factors for delayed MTX excretion in pediatric patients, identify children at risk of MTX excretion delay as early as possible, and to provide a basis for precision treatment, thereby reducing the incidence of toxicity events.
Materials and methods
Study subjects
This was a retrospective cohort study. The subjects were pediatric patients aged 0–16 years with hematological malignancies who received HDMTX treatment at the Children’s Hospital of Nanjing Medical University between 1 January 2021, and 31 December 2023. A total of 1,485 chemotherapy cycles were included in this study. The study was approved by the Ethics Committee of the Children’s Hospital of Nanjing Medical University (No. 202411021-1), with a waiver of informed consent.
Due to the dynamic nature of growth and development in pediatric patients, weight, height, and body surface area may vary considerably between chemotherapy cycles. In addition, laboratory assessments (e.g., complete blood count, liver and kidney function) are standardly repeated prior to each cycle to rule out contraindications. In practice, baseline values for both demographic and laboratory parameters are not static between chemotherapy cycles within the same individual. To reflect the clinical reality and capture cycle-specific risk, we modeled each treatment cycle as an independent unit, even for patients contributing multiple cycles.
The study subjects met the following inclusion criteria: ⅰ) All chemotherapy cycles were part of a sequential four-cycle HDMTX + 6-MP treatment regimen. Each cycle was spaced approximately 2 weeks apart, provided that the patient was in complete remission after prior chemotherapy, had completed baseline assessments, and had no contraindications to chemotherapy (otherwise, treatment was delayed). No other chemotherapy regimens were administered between consecutive HDMTX+6-MP cycles. ⅱ) Patients concurrently received intrathecal administration of a triple regimen consisting of MTX, dexamethasone (Dex), and cytarabine (Ara-C). ⅲ) Each chemotherapy cycle had available MTX plasma concentration data at 24 h, 48 h, and 72 h post-infusion. ⅳ) The underlying disease was either ALL or NHL, and the MTX dose administered was approximately 3–5 g/m2. Patients were excluded if they experienced severe drug hypersensitivity reactions, or were unable to complete the HDMTX chemotherapy cycle for any reason, or had incomplete data.
Delayed MTX excretion was defined as plasma MTX concentration >1 μmol/L at 48 h post-infusion (C48h), or ≥0.2 μmol/L at 72 h post-infusion (C72h) Based on this definition, patients were categorized into the delayed excretion group and the non-delayed excretion group. All patients were treated according to the standardized protocol outlined in the Chinese Society of Clinical Oncology (CSCO) (2019). At our institution, the discharge criterion was defined as an MTX plasma concentration <0.2 μmol/L.
Data collection
Clinical data were collected through the hospital’s electronic medical record system. The collected data included: a) Demographic information: age, height, weight, body surface area (BSA), and gender; b) Diagnosis and treatment information: chemotherapy cycle number, MTX dose per unit of body surface area (g/m2), total MTX dosage, and disease diagnosis; c) MTX plasma concentrations: measured at baseline (before MTX infusion), 24 h, 48 h, and 72 h post-infusion for each chemotherapy cycle, and additional measurements at extra time points performed on-demand; d) Laboratory tests: complete blood count, liver and kidney function, and biochemical indicators.
MTX plasma concentrations were determined using the Enzyme-Multiplied Immunoassay Technique (EMIT) assay kit, manufactured by Siemens Healthcare Diagnostics Inc., United States. The lower limit of detection for MTX was 0.1 μmol/L.
Research methodology
Feature selection
Three machine learning methods—Least Absolute Shrinkage and Selection Operator (LASSO) regression, Random Forest (RF), and Support Vector Machine - Recursive Feature Elimination (SVM-RFE) — were applied to the original dataset to perform feature selection. These methods are capable of identifying the most relevant variables while reducing model complexity and overfitting risk. The detailed descriptions and parameter settings for LASSO, RF, and SVM-RFE are provided in Supplementary Table S1. A consensus-based approach was used to determine the final set of key predictive features.
Model training
The key predictive features identified were used as input variables to develop machine learning models. Four different machine learning algorithms—Logistic Regression (LR), Naive Bayes (NB), Support Vector Machine (SVM), and Extreme Gradient Boosting (XGBoost) — were employed to construct predictive models for delayed excretion of HDMTX in pediatric patients (Descriptions and parameter settings for the LR, NB, SVM, and XGBoost algorithms are provided in Supplementary Table S2). To ensure comparability across models, all models were trained using the same set of input features. Subsequently, grid search and randomized search approaches were applied to identify the optimal hyperparameters for each model based on the training data. The area under the receiver operating characteristic curve (AUROC) was used as the primary evaluation metric during hyperparameter tuning.
Internal model validation
Bootstrap resampling with 1,000 iterations was used to perform internal validation of the developed models. The performance of the four machine learning models was evaluated in terms of discrimination, calibration, and clinical utility using a comprehensive set of metrics, including: accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), F1 Score, AUROC, Brier Score, and Decision Curve Analysis (DCA). These metrics facilitated the selection of the optimal predictive model.
Model interpretation
The best-performing model was interpreted using SHapley Additive exPlanations (SHAP) through the following steps: a) Feature Contribution Analysis: A SHAP value was assigned to each feature to reflect its contribution to the model’s prediction. b) SHAP Summary Plot: A summary plot was generated to visualize the relative importance of each feature and to show how the magnitude and direction of feature values influence the predicted risk of delayed MTX excretion. c) SHAP Dependency Plots: For each important feature, a dependency plot was created to explore the relationship between the feature value and the predicted outcome.
Web-based prediction tool
To facilitate the clinical application of this predictive model, we have developed a web-based application. When the corresponding feature values are provided, the application can return the probability of MTX excretion, with a waterfall plot and a force plot.
Statistical analysis
Data processing was performed using R software (version 4.4.2). Continuous variables that followed a normal distribution were expressed as mean ± standard deviation (x ± s), and intergroup comparisons were conducted using the independent samples t-test. For continuous variables with non-normal distribution, data were presented as median (first quartile, third quartile) [M (Q1, Q3)], and group comparisons were performed using the Mann-Whitney U test. Categorical variables were expressed as counts and percentages (n, %). Comparisons between groups were carried out using the chi-square (χ2) test or Fisher’s exact test. A two-sided P value <0.05 was considered statistically significant.
Results
Patient baseline characteristics
A total of 1,485 HDMTX chemotherapy cycles from 408 pediatric patients were included in this study. Among these, 387, 389, 358, and 351 cycles corresponded to the first, second, third, and fourth treatment cycles, respectively. 388 cycles were identified as delayed excretion cases, resulting in a delayed excretion rate of 26.1%. Descriptive statistics of the covariates between the two groups are presented in Table 1.

Table 1. The baseline characteristics of patients in the non-delayed excretion group and the delayed excretion group.
Feature selection using machine learning methods
Each algorithm identified a distinct set of variables, as shown in Figures 1A–C. This consensus-based feature selection approach (Figure 1D) yielded a final set of 8 key features: Serum Creatinine (Scr), Dose, Alkaline Phosphatase (ALP), Creatine Kinase (CK), Blood Urea Nitrogen (Urea), Gamma-glutamyl Transferase (GGT), Hemoglobin (HB), and Height. These eight core features were subsequently used to construct predictive models for delayed MTX excretion using four different machine learning algorithms.

Figure 1. Feature Selection Results Using LASSO, RF, and SVM-RFE. Note: (A) LASSO Regression Coefficient Path Plot and LASSO Cross-Validation Curve. (B) RF Number of Trees vs Error Plot and RF Feature Importance Ranking Plot. (C) SVM-RFE Accuracy vs Number of Features Plot and SVM-RFE Generalization Error vs Number of Features Plot. (D) Venn Diagram: The Venn diagram displays the intersection of feature sets selected by LASSO, RF, and SVM-RFE.
Prediction model construction and internal validation
Using the eight key features selected through feature selection, four machine learning models were developed based on LR, NB, SVM, and XGBoost. To assess model performance and ensure internal validity, bootstrap resampling with 1,000 iterations was performed.
Among the four models, the XGBoost model demonstrated the best overall performance across all validation metrics (Table 2; Figures 2A–D), with an AUC of 0.842 (95%CI: 0.815–0.873). The calibration curve demonstrated that the probability of MTX excretion delay predicted by the XGBoost model was relatively well matched with the actual measurement (Supplementary Figure S1).

Figure 2. ROC and DCA Curves. (A–D) ROC curves for LR, NB, SVM, and XGBoost Models. ROC curve was performed via 1,000 bootstrap iterations. The 45-degree diagonal line represents the baseline for random classification. The red line represents the mean ROC of 1,000 bootstrap iterations. The grey lines represents the actual ROC of 1,000 bootstrap iterations. (E) DCA Curves for LR, NB, SVM, and XGBoost Models.
Additionally, sensitivity analysis was performed using only the first chemotherapy cycle from each patient. A new predictive model was developed based on this subset. As shown in Supplementary Figure S2, XGBoost again emerged as the optimal model, and there was no significant difference in performance between this model and the one trained on all treatment cycles (DeLong test, P = 0.967).
In addition, DCA showed that the XGBoost model had the widest applicable threshold range (approximately 10%–80%). Within most of this range (approximately 10%–75%), the XGBoost model provided the highest benefit among all four models, reaching up to 0.2 (Figure 2E). This indicates that the XGBoost model offers the greatest clinical utility in terms of decision-making compared to the other models.
Model interpretation using SHAP
The contribution of the eight selected predictive features in the XGBoost model was interpreted using SHAP analysis, as illustrated in the Summary Plot (Figures 3A,B) and Dependence Plots (Figure 3C). Among the eight predictors, Scr was identified as the most influential feature in predicting delayed MTX excretion. Dose and ALP followed in terms of importance. Higher levels of Scr, higher MTX dose, and lower levels of ALP were associated with an increased risk of delayed excretion. In comparison, the other five variables—CK, Urea, GGT, HB, and height—showed relatively weaker influences on the model’s predictions.

Figure 3. SHAP analysis elucidates feature influence and direction in the XGBoost model. (A) Beeswarm plot: features are ranked by impact magnitude; dot color indicates feature value (yellow: high, purple: low). (B) Bar plot: features are ordered by mean |SHAP|, representing overall importance. (C) Dependence plots: illustrate how feature values affect SHAP values, indicating contribution direction. SHAP values >0 increase predicted risk.
Clinical application of the prediction model
To facilitate the practical use of the final XGBoost prediction model in clinical settings, we developed a web-based application (Figure 4). When the actual values of the eight predictive features are entered into the tool, the application automatically calculates and displays the individual risk of delayed MTX excretion for each pediatric patient. In addition to the predicted probability, the web tool generates two interpretable visualizations: Waterfall plot and Force plot. The web application is publicly accessible at https://delay-prediction-njch.ndemo.net.

Figure 4. Clinical Application of the XGBoost Prediction Model. Note: This XGBoost prediction model, built on eight clinically relevant features, offers a user-friendly interface for predicting the risk of delayed MTX excretion in pediatric patients. Upon entering the actual values of the eight features, the application automatically displays a predicted probability of 50.5% for MTX excretion delay. Furthermore, individual patient explanations are visualized through waterfall plots and force plots, which clearly illustrate the key features influencing the prediction: blue-colored features drive the prediction toward “no delay” while red-colored features push the prediction toward “delay”.
Discussion
HDMTX therapy is associated with significant antitumor efficacy; however, it also carries substantial toxicity risks (Mosleh et al., 2023). Given that children have less mature organ development and lower tolerance to drug toxicity compared to adults, they are at a higher risk of severe adverse events, including mortality. Delayed MTX excretion is typically defined based on whether plasma MTX concentrations exceed predefined threshold levels at specific time points after infusion. However, elevated MTX levels are usually detected only after delayed excretion has already occurred, limiting opportunities for early intervention. This study was a retrospective analysis of pediatric patients with ALL or NHL who received HDMTX therapy. Leveraging real-world data and machine learning techniques, we identified key predictors of delayed MTX excretion from demographic, dosing, treatment cycle, and laboratory variables. A risk prediction model was subsequently developed to identify high-risk patients before HDMTX administration. By enabling early risk stratification, this model supports the delivery of individualized, precision-based interventions to potentially reduce the incidence of delayed excretion and associated toxicities.
Most existing studies on delayed MTX excretion have traditionally relied on conventional statistical methods for feature selection and predictive model development. These traditional approaches, such as linear regression, require the fulfillment of numerous assumptions—linearity, independence, normality, homoscedasticity, among others—before they can be applied effectively. When dealing with smaller datasets, simple linear models are often the most reasonable choice due to their interpretability and straightforward implementation. However, these models’ expressiveness becomes notably limited as dataset sizes increase. In contrast, machine learning models can overcome many of the limitations imposed by traditional statistical models and incorporate techniques like regularization to mitigate overfitting (Christodoulou et al., 2019; Chen et al., 2023). Consequently, in this study, we opted to utilize machine learning methods for model construction. Machine learning models, especially more complex ones, typically exhibit superior predictive performance but are often criticized as “black boxes” due to their opaque decision-making processes. To address this interpretability issue, we employed SHAP, which leverages concepts from cooperative game theory to explain individual predictions made by machine learning models. By calculating each feature’s contribution to the prediction outcome, SHAP values provide a means to assign optimal credit allocation and local explanations. This approach allows us to better understand how the XGBoost model arrives at its decisions, thereby enhancing transparency and trustworthiness in clinical applications.
In this study, the feature selection analyses using LASSO, RF, and SVM-RFE consistently identified eight key predictors of delayed MTX excretion: Scr, total MTX dose, ALP, CK, Urea, GGT, HB, and height. These eight variables were then incorporated into four machine learning models—LR, NB, SVM, and XGBoost—for predictive modeling. Based on internal validation, the XGBoost model demonstrated superior performance compared to the other models. Therefore, we focused on interpreting the XGBoost model using SHAP to explore the contribution of each predictor in individual risk assessment.
The SHAP analysis conducted on the XGBoost model revealed that Scr had the highest contribution to the prediction of delayed MTX excretion. The dependence plot illustrated higher Scr values were consistently associated with increased model-predicted risk of delayed MTX excretion, which was consistent with the statistical associations observed in previous studies (Xu et al., 2014; Mei et al., 2018; Zhan et al., 2022). From a pathophysiological perspective, MTX and its metabolites are primarily eliminated through glomerular filtration and proximal tubular secretion into the primary urine, followed by excretion via urine (Mosleh et al., 2023). When renal function, particularly glomerular filtration rate (GFR), is compromised, the clearance of MTX decreases. Scr is a common indicator of glomerular filtration. Notably, SHAP reflects model-learned associations, not causation—while higher Scr correlates with higher predicted risk, this does not prove it directly causes delayed excretion.
The SHAP analysis revealed that total MTX dose had the second-highest contribution to predicting delayed MTX excretion, following Scr. Higher doses of MTX were associated with an increased model-predicted risk of delayed excretion. Additionally, ALP was identified as another significant predictor in the model. Lower ALP levels were associated with a model-predicted higher risk of delayed MTX excretion. This observation may reflect underlying anemia or malnutrition due to the primary disease. As malnutrition is known to affect the function of various tissues and organs, it could potentially influence drug metabolism efficiency. However, the model itself cannot establish a causal link.
CK, Urea, and GGT showed relatively moderate contributions in the SHAP feature importance ranking. The SHAP dependence plot for CK revealed a non-linear relationship between CK levels and the predicted risk of delayed MTX excretion: when CK < 50 U/L, lower values were associated with model-predicted risk scores of delayed excretion; in the intermediate range (50–100 U/L), changes in CK levels had minimal impact on the predicted risk; when CK > 100 U/L, higher values were linked to the predicted elevated risk of delayed MTX excretion. Low CK levels may be attributed to several factors, including the younger age of pediatric patients—whose baseline CK activity is naturally lower—or underlying disease-related malnutrition. However, this interpretation represents a clinical hypothesis independent of the model’s findings. The dependence plot for Urea demonstrated that: when Urea levels were below the normal reference range (1.79–6.43 mmol/L), lower values correlated with higher risk of delayed MTX excretion; within the normal range, variations in Urea had no significant effect on the model-predicted risk scores. The dependence plot of GGT indicated: within the normal reference range (8–61 U/L), changes in GGT levels did not significantly alter the predicted risk; when GGT exceeded the normal range, it may increase the risk of delayed MTX excretion. Elevated GGT levels are often indicative of hepatic or biliary dysfunction, such as impaired bile flow or liver damage (Lu and Association, 2022). Since MTX undergoes extensive hepatic metabolism, this may partially explain the association patterns identified by the model.
HB and height demonstrated relatively weaker contributions to the predictive performance of the XGBoost model. However, their SHAP dependence plots still revealed meaningful associations with the risk of delayed MTX excretion. The SHAP dependence plot for HB showed that lower HB levels were associated with a higher model-predicted risk of delayed MTX excretion. The dependence plot for height indicated a non-linear relationship: when height was below 150 cm, variations in height had minimal impact on the predicted risk; when height exceeded 150 cm, increased height was associated with higher predicted risk scores in the model. Generally, taller patients tend to have larger BSA and higher MTX clearance rates (Mei et al., 2018; Hui et al., 2019; Chen et al., 2020; Yang et al., 2020). However, they also receive higher total doses of MTX based on BSA-adjusted dosing protocols. The nonlinear patterns identified by the model may reflect a dynamic interplay between these two opposing effects: in children shorter than 150 cm, the increased clearance associated with greater height may offset the effects of higher drug doses, resulting in no significant net change in the risk of delayed excretion; in contrast, once height exceeds 150 cm, the increase in drug load may outpace the compensatory rise in clearance, leading to a greater likelihood of delayed excretion. This observation requires further investigation and validation in independent cohorts.
This was a large-sample, single-center retrospective cohort study to date investigating risk factors for delayed MTX excretion. A major strength of our work lies in the development of a web-based prediction tool based on the final XGBoost model, which allows clinicians to conveniently input patient-specific features and obtain real-time individualized risk predictions along with interpretable visualizations (e.g., waterfall and force plots). This user-friendly interface enhances the practical applicability of our findings in daily clinical practice. Despite these strengths, several limitations should be acknowledged. First, we did not include drug-drug interactions in our analysis, as the usage rates of potentially interacting medications were all below 10%. This low prevalence may reflect clinical efforts to actively avoid prescribing drugs that could potentially delay MTX excretion. Second, due to the retrospective nature of the study, genetic testing was not routinely performed at the time of treatment, and data on urine output and urine pH were frequently missing. As a result, polymorphisms in drug-metabolizing genes, urine volume, and urinary pH were not included in the analysis. Additionally, all patients in this cohort received standardized hydration, urinary alkalinization, and leucovorin rescue according to the Expert Consensus on High-Dose Methotrexate with Calcium Folinate Rescue for the Treatment of Malignant Tumors. As a result, variables such as hydration rate/volume, degree of alkalinization, and timing/dose of leucovorin were not included in the model. However, the omission of these factors may introduce unmeasured confounding bias. For instance, inadequate fluid intake and low urine output are well-established risk factors for MTX-induced nephrotoxicity, which directly delay drug excretion (Howard et al., 2016; Chen et al., 2020). If not accounted for, the model may underestimate risk in dehydrated patients while overestimating the importance of markers such as serum creatinine (Scr)—which itself can be influenced by volume status. Furthermore, urinary pH is critical for MTX solubility; when pH falls below 7.0, the risk of crystalline nephropathy increases substantially (Bezabeh et al., 2012; Kawaguchi et al., 2021). The current model did not capture this key physiological mechanism, potentially limiting its accuracy in predicting delays caused by intratubular precipitation. Concomitant use of interacting medications—such as proton pump inhibitors (PPIs), which inhibit renal tubular secretion of MTX, or other nephrotoxic agents—can significantly alter MTX pharmacokinetics (Howard et al., 2016). Quantifying the precise impact of these unmeasured confounders is challenging, but their absence may lead to an over-optimistic assessment of model performance. This could restrict the model’s generalizability across diverse clinical settings. The model may be most applicable to patients receiving optimal hydration, without significant drug interactions or additional renal insults. Future studies should aim to address these gaps by conducting multi-center prospective investigations, expanding the range of sample types, and incorporating a broader range of covariates to further refine and validate the predictive performance of the model and improve the model’s utility in supporting clinical decision-making.
In conclusion, machine learning models can serve as reliable tools for predicting the occurrence of delayed MTX excretion during HDMTX therapy in pediatric patients. Among the models evaluated, the XGBoost model, constructed using key predictors including Scr, total drug dose, ALP, CK, Urea, GGT, HB, and height, demonstrated the best predictive performance. This study provides a practical approach for early identification of high-risk patients for MTX excretion delay, thereby supporting personalized treatment decisions. By enabling timely interventions, the model has the potential to reduce the incidence of delayed excretion and associated toxicities, ultimately improving both the safety and adherence of chemotherapy in pediatric oncology.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The study was approved by the Ethics Committee of the Children’s Hospital of Nanjing Medical University (No. 202411021-1), with a waiver of informed consent. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’; legal guardians/next of kin because this was a retrospective cohort study.
Author contributions
CZ: Investigation, Writing – review and editing, Methodology, Software, Data curation, Writing – original draft, Formal Analysis. YQ: Writing – review and editing, Investigation, Formal Analysis, Data curation. YX: Writing – review and editing, Formal Analysis, Software, Methodology. LR: Writing – review and editing, Methodology, Formal Analysis, Software. YW: Methodology, Software, Writing – review and editing, Formal Analysis. KL: Methodology, Formal Analysis, Software, Writing – review and editing. HM: Software, Formal Analysis, Writing – review and editing, Methodology. FC: Project administration, Writing – review and editing, Supervision, Validation, Visualization. YF: Writing – review and editing, Supervision, Visualization, Project administration, Validation. XG: Project administration, Resources, Data curation, Supervision, Writing – review and editing, Conceptualization, Visualization, Validation.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This study was supported by the National Natural Science Foundation of China (Grant No. 82170733).
Acknowledgments
We would like to thank all of the investigators for their involvement in this study.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphar.2025.1662718/full#supplementary-material
SUPPLEMENTARY FIGURE S1 | Calibration curve for the models. (A) Calibration curve of the LR model; (B) Calibration curve of the NB model; (C) Calibration curve of the SVM model; (D) Calibration curve of the XGBoost model. The slope of Ideal (diagonal) was 1, which represents the ideal curve; Apparent represents the uncalibrated prediction curve and Bias-corrected represents the calibration prediction curve.
SUPPLEMENTARY FIGURE S2 | ROC Curves of the machine learning models in the cycle 1. (A-D) ROC curves for LR, NB, SVM, and XGBoost Models. ROC curve was performed via 1,000 bootstrap iterations. The 45-degree diagonal line represents the baseline for random classification. The red line represents the mean ROC of 1,000 bootstrap iterations. The grey lines represents the actual ROC of 1,000 bootstrap iterations.
References
Chinese Society of Clinical Oncology (CSCO) (2019). Expert consensus on high-dose methotrexate with calcium folinate rescue for the treatment of malignant tumors. Chin. Clin. Oncol. 46 (15), 761–767. doi:10.3969/j.issn.1000-8179.2019.15.748
Bezabeh, S., Mackey, A. C., Kluetz, P., Jappar, D., and Korvick, J. (2012). Accumulating evidence for a drug–drug interaction between methotrexate and proton pump inhibitors. Oncol. 17 (4), 550–554. doi:10.1634/theoncologist.2011-0431
Chen, A. R., Wang, Y. M., Lin, M., and Kuo, D. J. (2020). High-dose methotrexate in pediatric acute lymphoblastic leukemia: predictors of delayed clearance and the effect of increased hydration rate on methotrexate clearance. Cureus 12 (6), e8674. doi:10.7759/cureus.8674
Chen, L., Shi, B., and Feng, S. (2023). Where medical statistics meets Artificial intelligence. Reply. N. Engl. J. Med. 389 (25), 2403–2404. doi:10.1056/NEJMc2312332
Christodoulou, E., Ma, J., Collins, G. S., Steyerberg, E. W., Verbakel, J. Y., and Van Calster, B. (2019). A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. J. Clin. Epidemiol. 110, 12–22. doi:10.1016/j.jclinepi.2019.02.004
Ebid, A. H. I., Hossam, A., El Gammal, M. M., Soror, S., Mangoud, N. O., and Mahmoud, M. A. (2022). High dose methotrexate in adult Egyptian patients with hematological malignancies: impact of ABCB1 3435C> T rs1045642 and MTHFR 677C> T rs1801133 polymorphisms on toxicities and delayed elimination. J. Chemother. 34 (6), 381–390. doi:10.1080/1120009X.2021.2009723
Faganel Kotnik, B., Grabnar, I., Bohanec Grabar, P., Dolžan, V., and Jazbec, J. (2011). Association of genetic polymorphism in the folate metabolic pathway with methotrexate pharmacokinetics and toxicity in childhood acute lymphoblastic leukaemia and malignant lymphoma. Eur. J. Clin. Pharmacol. 67, 993–1006. doi:10.1007/s00228-011-1046-z
Ferracini, A. C., Lopes-Aguiar, L., Lourenço, G. J., Yoshida, A., Lima, C. S. P., Sarian, L. O., et al. (2021). GSTP1 and ABCB1 polymorphisms predicting toxicities and clinical management on carboplatin and paclitaxel-based chemotherapy in ovarian cancer. Clin. Transl. Sci. 14 (2), 720–728. doi:10.1111/cts.12937
Hamada, Y., Ikemura, K., Iwamoto, T., and Okuda, M. (2018). Stereoselective inhibition of renal basolateral human organic anion transporter 3 by lansoprazole enantiomers. Pharmacology 101 (3-4), 176–183. doi:10.1159/000485920
Howard, S. C., McCormick, J., Pui, C. H., Buddington, R. K., and Harvey, R. D. (2016). Preventing and managing toxicities of high-dose methotrexate. Oncol. 21 (12), 1471–1482. doi:10.1634/theoncologist.2015-0164
Hui, K. H., Chu, H. M., Fong, P. S., Cheng, W. T. F., and Lam, T. N. (2019). Population pharmacokinetic study and individual dose adjustments of high-dose methotrexate in Chinese pediatric patients with acute lymphoblastic leukemia or osteosarcoma. J. Clin. Pharmacol. 59 (4), 566–577. doi:10.1002/jcph.1349
Ishizaki, J., Nakano, C., Kitagawa, K., Suga, Y., and Sai, Y. (2020). A previously unknown drug-drug interaction is suspected in delayed elimination of plasma methotrexate in high-dose methotrexate therapy. Ann. Pharmacother. 54 (1), 29–35. doi:10.1177/1060028019870445
Kawaguchi, S., Fujiwara, S. I., Murahashi, R., Nakashima, H., Matsuoka, S., Ikeda, T., et al. (2021). Risk factors for high-dose methotrexate-induced nephrotoxicity. Int. J. Hematol. 114 (1), 79–84. doi:10.1007/s12185-021-03132-8
Kawase, A., Yamamoto, T., Egashira, S., and Iwaki, M. (2016). Stereoselective inhibition of methotrexate excretion by glucuronides of nonsteroidal anti-inflammatory drugs via multidrug resistance proteins 2 and 4. J. Pharmacol. Exp. Ther. 356 (2), 366–374. doi:10.1124/jpet.115.229104
Lu, L., and Association, C. M. (2022). Guidelines for the management of cholestatic liver diseases (2021). J. Clin. Transl. Hepatology 10 (4), 757–769. doi:10.14218/JCTH.2022.00147
Mao, J., Li, Q., Li, P., Qin, W., Chen, B., and Zhong, M. (2022). Evaluation and application of population pharmacokinetic models for identifying delayed methotrexate elimination in patients with primary central nervous system lymphoma. Front. Pharmacol. 13, 817673. doi:10.3389/fphar.2022.817673
Mei, S., Li, X., Jiang, X., Yu, K., Lin, S., and Zhao, Z. (2018). Population pharmacokinetics of high-dose methotrexate in patients with primary central nervous system lymphoma. J. Pharm. Sci. 107 (5), 1454–1460. doi:10.1016/j.xphs.2018.01.004
Mosleh, E., Snyder, S., Wu, N., Willis, D. N., Malone, R., and Hayashi, R. J. (2023). Factors influencing delayed clearance of high dose methotrexate (HDMTX) in pediatric, adolescent, and young adult oncology patients. Front. Oncol. 13, 1280587. doi:10.3389/fonc.2023.1280587
Prodduturi, P., and Bierman, P. J. (2012). Current and emerging pharmacotherapies for primary CNS lymphoma. Clin. Med. Insights Oncol. 6, 219–231. doi:10.4137/CMO.S7752
Tran, H. X., and Herrington, J. D. (2016). Effect of ceftriaxone and cefepime on high-dose methotrexate clearance. J. Oncol. Pharm. Pract. 22 (6), 801–805. doi:10.1177/1078155215608524
Ueda, H., Narumi, K., Sato, Y., Furugen, A., Kobayashi, M., and Iseki, K. (2020). Evaluation of possible pharmacokinetic interaction between methotrexate and proton pump inhibitors in rats. Pharmacol. Rep. 72, 1426–1432. doi:10.1007/s43440-020-00130-1
Umerez, M., Gutierrez-Camino, A., Munoz-Maldonado, C., Martin-Guerrero, I., and Garcia-Orad, A. (2017). MTHFR polymorphisms in childhood acute lymphoblastic leukemia: influence on methotrexate therapy. Pharmacogenomics personalized Med. 10, 69–78. doi:10.2147/PGPM.S107047
Xu, W. Q., Zhang, L. Y., Chen, X. Y., Pan, B. H., Mao, J. Q., Song, H., et al. (2014). Serum creatinine and creatinine clearance for predicting plasma methotrexate concentrations after high-dose methotrexate chemotherapy for the treatment for childhood lymphoblastic malignancies. Cancer Chemother. Pharmacol. 73, 79–86. doi:10.1007/s00280-013-2319-2
Yang, L., Wu, H., de Winter, B. C., Sheng, C.-c., Qiu, H.-q., Cheng, Y., et al. (2020). Pharmacokinetics and pharmacogenetics of high-dose methotrexate in Chinese adult patients with non-Hodgkin lymphoma: a population analysis. Cancer Chemother. Pharmacol. 85, 881–897. doi:10.1007/s00280-020-04058-4
Yu, L., Shen, J., Li, H., Zhang, M., Wang, Z., Gao, Y., et al. (2024). Factors influencing delayed high-dose methotrexate excretion and its correlation with adverse reactions after treatment in children with malignant hematological tumors. Transl. Pediatr. 13 (2), 300–309. doi:10.21037/tp-23-615
Zhan, M., Sun, Y., Zhou, F., Wang, H., Chen, Z., Yan, L., et al. (2022). Population pharmacokinetics of methotrexate in paediatric patients with acute lymphoblastic leukaemia and malignant lymphoma. Xenobiotica 52 (3), 265–273. doi:10.1080/00498254.2022.2069060
Keywords: methotrexate, delayed excretion, pediatric, machine learning, predictive model
Citation: Zhou C, Qian Y, Xue Y, Rong L, Wan Y, Leng K, Miao H, Chen F, Fang Y and Ge X (2025) Risk factor identification for delayed excretion in pediatric high-dose methotrexate therapy: a machine learning analysis of real-world data. Front. Pharmacol. 16:1662718. doi: 10.3389/fphar.2025.1662718
Received: 09 July 2025; Accepted: 27 August 2025;
Published: 17 September 2025.
Edited by:
Robert James Hayashi, Washington University in St. Louis, United StatesReviewed by:
Jingjing Xia, Xinjiang University, ChinaEma Mosleh, Eastern Maine Healthcare Systems, United States
Copyright © 2025 Zhou, Qian, Xue, Rong, Wan, Leng, Miao, Chen, Fang and Ge. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Xuhua Ge, Z2V4dWh1YUBuam11LmVkdS5jbg==
†These authors have contributed equally to this work