ORIGINAL RESEARCH article
Front. Pediatr.
Sec. Pediatric Infectious Diseases
Volume 13 - 2025 | doi: 10.3389/fped.2025.1610187
Accurate Prediction of Sepsis from Pediatric Emergency Department to PICU Using a Machine-Learning Model
Provisionally accepted- 1Guangzhou Women and Children’s Medical Center, Guangzhou Medical University, Guangzhou, China
- 2Ewell Technology Company, Hangzhou, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Background: Timely identification of pediatric sepsis remains a critical challenge in emergency and intensive care settings due to the heterogeneous clinical presentations across age groups. Existing scoring systems often lack temporal resolution and interpretability. We aimed to develop a real-time, machine learning–based prediction framework integrating static and dynamic electronic health record (EHR) features to support early sepsis detection. Methods:This retrospective study included pediatric patients from Guangzhou Women and Children's Medical Center (GWCMC; n=1,697) and an external validation cohort from the MIMIC-III database (n=827). Irregular time-series data were imputed using a correlation-enhanced continuous time-window histogram with multivariate Gaussian processes (CTWH+MGP). We compared the predictive performance of XGBoost and gated recurrent unit (GRU)-based RNN models over a 12-hour window prior to clinical diagnosis. Model outputs were validated internally and externally using AUROC, AUPRC, and Youden index, with SHAP-based interpretability applied to identify key clinical features. Results: The CTWH+MGP-XGBoost model achieved the highest AUROC at diagnosis time (T=0 h; AUROC=0.915), while the GRU-based model demonstrated superior temporal stability across early windows. Top contributing features included lactate, white blood cell count, pH, and vasopressor use. External validation confirmed generalizability (MIMIC-III AUROC=0.905). Simulation of real-time alerts showed a median lead time of 6.2 hours before clinical diagnosis, with κ=0.82 agreement against physician-confirmed cases. Conclusions: Our results suggest that a dual-model ensemble combining interpolation-based preprocessing and interpretable machine learning enables robust early sepsis detection in pediatric populations. The system supports integration into EHR platforms for real-time clinical alerts and may inform prospective trials and quality improvement initiatives.
Keywords: Pediatric sepsis, early warning, machine learning, XGBoost, recurrent neural network, Shap, Electronic Health Records
Received: 11 Apr 2025; Accepted: 08 Sep 2025.
Copyright: © 2025 Shi, Wang, Yang, Fan, Liu, Song, Peng, Wang, Sun, Ma and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Wencheng Ma, Guangzhou Women and Children’s Medical Center, Guangzhou Medical University, Guangzhou, China
Peiqing Li, Guangzhou Women and Children’s Medical Center, Guangzhou Medical University, Guangzhou, China
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.