Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Pharmacol.

Sec. Drug Metabolism and Transport

Volume 16 - 2025 | doi: 10.3389/fphar.2025.1662718

Risk Factor Identification for Delayed Excretion in Pediatric High-Dose Methotrexate Therapy: A Machine Learning Analysis of Real-World Data

Provisionally accepted
  • 1Children's Hospital of Nanjing Medical University, Nanjing, China
  • 2The Second People’s Hospital of Changzhou affiliated to Nanjing Medical University, Changzhou, China

The final, formatted version of the article will be published soon.

Purpose: This study was to identify risk factors associated with delayed methotrexate (MTX) excretion in pediatric patients receiving high-dose MTX (HDMTX) therapy based on real-world data, and to develop and evaluate a predictive model. Methods: Clinical data were retrospectively collected from 1,485 pediatric HDMTX chemotherapy cycles at the Children's Hospital affiliated with Nanjing Medical University between 2021and 2023. Key predictive variables were identified by Least Absolute Shrinkage and Selection Operator (LASSO) regression, Random Forest (RF), and Support Vector Machine Recursive Feature Elimination (SVM-RFE), and then incorporated into predictive models for MTX delayed excretion using Logistic Regression (LR), Naive Bayes (NB), Support Vector Machine (SVM), and Extreme Gradient Boosting (XGBoost). Bootstrap was employed to internally validate these models and identify the best-performing one, and then SHapley Additive exPlanations (SHAP) values were utilized to provide both global and local interpretations. Results: Among the 1,485 pediatric HDMTX chemotherapy cycles, 26.1% were associated with delayed MTX excretion. Serum creatinine (Scr), total drug dose (Dose), alkaline phosphatase (ALP), creatine kinase (CK), blood urea nitrogen (Urea), gamma-glutamyl transferase (GGT), hemoglobin (HB), and height were identified as key predictors of delayed excretion. Internal validation showed that the XGBoost model performed best, with an accuracy of 0.780, an F1 score of 0.669, an area under the Receiver Operating Characteristic curve (AUROC) of 0.842, and a Brier score of 0.136. Decision Curve Analysis (DCA) also demonstrated favorable clinical utility. SHAP analysis revealed that Scr was the most important risk factor for delayed MTX excretion in the XGBoost model. This XGBoost model has been translated into a convenient tool to facilitate its utility in clinical settings. Conclusion: The XGBoost model demonstrated good predictive performance and clinical utility for delayed MTX excretion in pediatric patients.

Keywords: Methotrexate, Delayed excretion, pediatric, machine learning, predictive model

Received: 09 Jul 2025; Accepted: 27 Aug 2025.

Copyright: © 2025 Zhou, Qian, Xue, Rong, Wan, Leng, Miao, Chen, Fang and Ge. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Xuhua Ge, Children's Hospital of Nanjing Medical University, Nanjing, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.