- 1Department of Computer Science, ETH Zurich, Zurich, Switzerland
- 2Department of Pediatric Surgery and Pediatric Traumatology, Florence-Nightingale-Hospital Düsseldorf, Düsseldorf, Germany
- 3Faculty of Medicine, University Medical Center Regensburg, Regensburg, Germany
- 4Department of Neonatology, Hospital St. Hedwig of the Order of St. John of God, University Children’s Hospital Regensburg (KUNO), Regensburg, Germany
- 5Department of Pediatric Surgery and Pediatric Orthopedics, Hospital St. Hedwig of the Order of St. John of God, University Children’s Hospital Regensburg (KUNO), Regensburg, Germany
Background: Appendicitis is a common condition among children and adolescents. Machine learning models can offer much-needed tools for improved diagnosis, severity assessment and management guidance for pediatric appendicitis. However, to be adopted in practice, such systems must be reliable, safe and robust across various medical contexts, e.g., hospitals with distinct clinical practices and patient populations.
Methods: We performed external validation of models predicting the diagnosis, management and severity of pediatric appendicitis. Trained on a cohort of 430 patients admitted to the Children’s Hospital St. Hedwig (Regensburg, Germany), the models were validated on an independent cohort of 301 patients from the Florence-Nightingale-Hospital (Düsseldorf, Germany). The data included demographic, clinical, scoring, laboratory and ultrasound parameters. In addition, we explored the benefits of model retraining and inspected variable importance.
Results: The distributions of most parameters differed between the datasets. Consequently, we saw a decrease in predictive performance for diagnosis, management and severity across most metrics. After retraining with a portion of external data, we observed gains in performance, which, nonetheless, remained lower than in the original study. Notably, the most important variables were consistent across the datasets.
Conclusions: While the performance of transferred models was satisfactory, it remained lower than on the original data. This study demonstrates challenges in transferring models between hospitals, especially when clinical practice and demographics differ or in the presence of externalities such as pandemics. We also highlight the limitations of retraining as a potential remedy since it could not restore predictive performance to the initial level.
1 Introduction
Acute appendicitis is a common condition among children and adolescents treated in pediatric surgery departments due to abdominal pain (1, 2). Diagnosis relies on clinical signs and symptoms (in particular, their dynamics and progression under close observation), laboratory tests and imaging, whereas postoperative classification is based on intraoperative findings and histology (3). Scoring systems, such as the Alvarado Score (AS) and the Pediatric Appendicitis Score (PAS), can facilitate clinical assessment (4, 5). The classical treatment of pediatric appendicitis is surgical, although conservative treatment with antibiotics can be an option in certain cases (6–8). Additionally, spontaneous resolution of uncomplicated appendicitis has been observed and reported, which supports antibiotic-free management based on supportive care for qualifying cases (9–11).
Despite new developments and technologies, early and accurate detection, preoperative classification, and treatment strategy selection are still challenging, especially in young children (1, 12, 13). Widely used clinical and laboratory parameters alone are largely non-specific for identifying appendicitis (14, 15). Imaging modalities are important tools to guide management and avoid negative appendectomies, but they have limitations, such as the operator (investigator) dependency of ultrasonography, radiation exposure with computed tomography, and the limited availability and feasibility of magnetic resonance imaging, not to mention the costs (3, 16).
Recent years have marked impressive progress in Machine Learning (ML) research and the increasing proliferation of tools built upon this technology in medicine. ML algorithms promise to aid in the detection, management and treatment of various diseases, thus improving the overall quality and effectiveness of healthcare. In relation to pediatric appendicitis, ML has been used to diagnose and manage patients suspected of developing this condition (17–27); specifically, such tools were developed to predict diagnosis, management and severity of pediatric appendicitis. These models either rely exclusively on standard clinical and laboratory data (17, 19–21, 27), or additionally utilize imaging modalities (obtained through various methods, e.g., computed tomography or ultrasonography) either directly in their raw format or by extracting hand-crafted annotations (18, 22–26).
Although promising and practical, ML-based tools for pediatric appendicitis are rarely deployed in practice due to the translational barrier inherent to medical machine learning research (28). To overcome this challenge, predictive models need to be validated on external datasets and later go through rigorous clinical trials (which tend to be complex, time-consuming and costly) (29). In this study, we take a step in this direction and follow up on our previous work where we developed ML models (23) for predicting diagnosis (appendicitis vs. no appendicitis), treatment assignment (surgical vs. conservative) and complications (complicated appendicitis vs. uncomplicated or no appendicitis) of pediatric appendicitis. Specifically, we conduct a principled external validation of the aforementioned ML tools on tabular electronic health records collected in a different hospital, exploring potential challenges associated with the transfer of our predictive models.
The original models (logistic regression, random forest and gradient boosting, all achieving strong performance) were developed on a dataset of 430 patients aged 0 to 18 years admitted with abdominal pain and suspected appendicitis to the Department of Pediatric Surgery at the tertiary Children’s Hospital St. Hedwig in Regensburg, Germany, over the period of 2016–2018 (18, 23). The original dataset consists of demographic, clinical, scoring, laboratory and ultrasound (US) predictor variables (see Table 1 for their list).1 The external validation dataset was acquired at the Department of Pediatric Surgery and Pediatric Traumatology, Florence-Nightingale-Hospital, Düsseldorf, Germany. This cohort consists of 301 pediatric patients hospitalized between 2015 and 2022, and the dataset format and predictor variables adhere to the format of the Regensburg dataset. The study design is summarized schematically in Figure 1.

Table 1. Description of the Regensburg and Düsseldorf datasets containing summary statistics for each variable. For numerical variables, we report medians alongside interquartile ranges; categorical variables are binarized and summarized as frequencies. Additionally, we report adjusted p-values from the unpaired two-sample t-test and the chi-squared test for proportions.

Figure 1. An overview of the study design. The original predictive models were trained and validated on the cohort of patients (n = 430) from Regensburg, Germany (23). This article presents the results of the external validation on another cohort (n = 301) from Düsseldorf, Germany. In particular, in this study, we validate the original models on the external data and retrain them to assess the potential for improvement.
In this retrospective study, we present an external validation of the aforementioned models on a new and independent cohort of patients. To this end, we:
1. compare the datasets to better understand their differences (Section 3.1);
2. evaluate the models without any adaptation to test their external validity under real-world distribution shift (Section 3.2);
3. retrain the models, and then evaluate and compare them again to explore possible gains in performance (Section 3.2); and
4. study feature importance across the models to elucidate their functioning (Section 3.3).
Our study demonstrates the transferability of the models across hospitals and outlines the steps necessary to facilitate their safe adaptation.
2 Material and methods
2.1 External data acquisition and description
To facilitate external validation, we collected and reviewed retrospective data from children and adolescents aged 0–17 years who were admitted to the Department of Pediatric Surgery and Pediatric Traumatology at Florence-Nightingale-Hospital in Düsseldorf with abdominal pain and suspected appendicitis from January 1st, 2015 to February 1st, 2022. Patients who had undergone an appendectomy before their admission were excluded. Similarly, we did not include subjects with chronic intestinal diseases or current antibiotic treatment if therapy was conservative. In total, 301 patients met the inclusion criteria. The study, including data acquisition and transfer, was approved by the Ethics Committee of the University of Regensburg (18-1063-101, 18-1063-3-101) and was performed in accordance with the relevant guidelines and regulations.
In terms of management, the cohort was divided into conservative and operative groups. Patients who were admitted, received supportive therapy (e.g., intravenous fluids, enemas and analgesics) and improved clinically without surgery were classified as conservative. Subjects who underwent an appendectomy were labeled as operative. For the surgical group, histological findings were recorded. In the case of negative appendectomy (i.e., histology showing a normal appendix without inflammation), the corresponding data were used to predict the diagnosis and severity but not the management.
As in the prior study (23), diagnosis (appendicitis vs. no appendicitis) was assessed for all included patients. For patients treated surgically, appendicitis diagnosis was based on histology. In the nonoperative group, patients were classified retrospectively as having appendicitis if their AS or PAS was at least 4, combined with an appendix diameter of at least 6 mm. Conservatively treated patients classified as having appendicitis were followed up after discharge for recurrences. Patients who had a recurrence and underwent secondary operation were relabeled as surgical in the analysis. The follow-up telephone interview was performed at least one year after discharge, between January 2023 and February 2024. Informed consent was obtained from the parents or legal representatives of the patients who underwent the follow-up.
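For illustration, the retrospective labeling rule described above can be expressed as a short function. This is a minimal sketch under assumed, hypothetical column names (management, histology_appendicitis, alvarado_score, pas, appendix_diameter_mm), not the actual study code.

```python
# A minimal sketch of the retrospective diagnosis labeling described above.
# Column names are illustrative assumptions, not the actual dataset schema.
import pandas as pd

def label_diagnosis(row: pd.Series) -> bool:
    """Return True if the patient is labeled as having appendicitis."""
    if row["management"] == "surgical":
        # For surgical patients, histology is the ground truth
        return bool(row["histology_appendicitis"])
    # Nonoperative group: AS or PAS of at least 4, combined with an
    # appendix diameter of at least 6 mm
    score_positive = max(row["alvarado_score"], row["pas"]) >= 4
    return score_positive and row["appendix_diameter_mm"] >= 6.0
```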
Furthermore, appendicitis severity was assessed. Patients treated non-operatively, both with and without appendicitis, with no recurrence during the follow-up period were classified as uncomplicated. For patients treated operatively, classification was based on the histology: simple/uncomplicated (subacute/catarrhal/chronic, phlegmonous) or complicated (abscess, perforation).
During the exploratory analysis presented below, we compute summary statistics across both datasets (the original from Regensburg and the external from Düsseldorf) and perform hypothesis tests for the differences between internal and external data. Specifically, we report medians and interquartile ranges (IQR) for numerical attributes and frequencies for categorical features. For statistical analysis, we utilize the unpaired two-sample t-test and Pearson's chi-squared test for the equality of proportions. We adjust the resulting p-values for multiple comparisons to control the false discovery rate using the Benjamini-Hochberg procedure (30) at the 0.05 level.
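As an illustration, this comparison can be sketched with SciPy and statsmodels as below. The Welch variant of the t-test, the helper name and the column lists are assumptions rather than the study's actual implementation.

```python
# A sketch of the cohort comparison: t-tests for numerical variables,
# chi-squared tests for categorical ones, Benjamini-Hochberg adjustment.
import numpy as np
import pandas as pd
from scipy import stats
from statsmodels.stats.multitest import multipletests

def compare_cohorts(df_reg, df_dus, numerical, categorical):
    rows = []
    for col in numerical:
        # Unpaired two-sample t-test on non-missing values
        _, p = stats.ttest_ind(df_reg[col].dropna(), df_dus[col].dropna(),
                               equal_var=False)
        rows.append((col, p))
    for col in categorical:
        # Contingency table of variable level vs. cohort membership
        values = pd.concat([df_reg[col], df_dus[col]]).to_numpy()
        cohort = np.r_[np.zeros(len(df_reg)), np.ones(len(df_dus))]
        _, p, _, _ = stats.chi2_contingency(pd.crosstab(values, cohort))
        rows.append((col, p))
    out = pd.DataFrame(rows, columns=["variable", "p_raw"])
    # Adjust p-values to control the false discovery rate
    out["p_adj"] = multipletests(out["p_raw"], method="fdr_bh")[1]
    return out
```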
2.2 Original predictive models
We leverage the dataset from the Florence-Nightingale-Hospital, Düsseldorf, for the external validation of the predictive models developed on the Regensburg cohort (23). The original analysis (23) was concerned with predicting three response variables: (i) diagnosis (appendicitis vs. no appendicitis), (ii) management (surgical vs. conservative), and (iii) severity (complicated appendicitis vs. uncomplicated or no appendicitis). In particular, logistic regression (LR), random forest (RF) (31) and gradient boosting (GB) (32) models were trained on the dataset of 430 patients with 38 predictor variables.
In the current study, we train these models on the full Regensburg cohort, replicating the original R code (23, 33) in Python (v3.11.9) using the scikit-learn library (v1.4.2). We use hyperparameter configurations and perform preprocessing steps similar to those described in the original study (23), imputing missing values with the k-nearest neighbors algorithm. Note that we limit our analysis to models trained on the full set of features and do not consider ablations with feature selection or without the US-related variables.
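A minimal sketch of such a replication with scikit-learn is given below. The hyperparameter values, including the number of neighbors for imputation, are placeholders; the actual configurations follow the original study (23).

```python
# A sketch of the model replication; hyperparameters are placeholders and
# features are assumed to be numerically encoded before imputation.
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.impute import KNNImputer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

def make_models(random_state: int = 42) -> dict:
    """One pipeline per classifier, each with k-NN imputation up front."""
    return {
        "LR": Pipeline([("impute", KNNImputer(n_neighbors=5)),
                        ("scale", StandardScaler()),
                        ("clf", LogisticRegression(max_iter=1000))]),
        "RF": Pipeline([("impute", KNNImputer(n_neighbors=5)),
                        ("clf", RandomForestClassifier(random_state=random_state))]),
        "GB": Pipeline([("impute", KNNImputer(n_neighbors=5)),
                        ("clf", GradientBoostingClassifier(random_state=random_state))]),
    }

# Usage: fit each pipeline on the full Regensburg cohort, e.g.,
# make_models()["RF"].fit(X_regensburg, y_diagnosis)
```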
2.3 Model retraining
In addition to the purely external validation, we retrain the predictive models on a combination of the Regensburg and Düsseldorf cohorts, building the models on 100% of the Regensburg data and 80% of the Düsseldorf data. In this setting, we test the models on the remaining, withheld 20% of the external dataset (the data were split at random). We conduct this experiment to gauge the possible benefits of a multicenter cohort approach and to better understand whether the predictive performance improves with the inclusion of external data points in the training set.
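The split can be sketched as follows; variable names and the random seed are illustrative assumptions.

```python
# A sketch of the retraining split: all of Regensburg plus a random 80% of
# Düsseldorf for training; the withheld 20% of Düsseldorf for testing.
import pandas as pd
from sklearn.model_selection import train_test_split

def retraining_split(X_reg, y_reg, X_dus, y_dus, seed: int = 0):
    X_dus_tr, X_dus_te, y_dus_tr, y_dus_te = train_test_split(
        X_dus, y_dus, test_size=0.2, random_state=seed)
    X_train = pd.concat([X_reg, X_dus_tr])  # 100% Regensburg + 80% Düsseldorf
    y_train = pd.concat([y_reg, y_dus_tr])
    return X_train, y_train, X_dus_te, y_dus_te
```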
2.4 Evaluation
For both original and retrained model evaluation, we report the area under the receiver operating characteristic (AUROC) and precision-recall (AUPR) curves. Additionally, we investigate the tradeoffs among sensitivity, specificity as well as positive (PPV) and negative (NPV) predictive values by varying the threshold applied to the classifiers’ output. Lastly, to better understand the models’ predictions, we compute the permutation feature importance (31) of predictor variables using the test set.
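As a sketch, both threshold-free metrics can be computed with scikit-learn; estimating AUPR via average precision is a common operationalization, though the original implementation may differ.

```python
# A sketch of the threshold-free evaluation metrics.
from sklearn.metrics import average_precision_score, roc_auc_score

def auroc_aupr(y_true, y_score):
    """AUROC and AUPR (the latter via average precision) from class scores."""
    return (roc_auc_score(y_true, y_score),
            average_precision_score(y_true, y_score))
```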
3 Results
3.1 External dataset
Both of the datasets investigated in this study are overviewed in Table 1. Therein we report summary statistics for all the variables observed across the Regensburg (n = 430) and Düsseldorf (n = 301) cohorts. Additionally, we provide the adjusted results of statistical hypothesis tests for the differences in means and proportions of values for numerical and categorical variables, respectively.
We observe significant differences in the distributions of most variables. Generally, subjects from the Düsseldorf cohort are younger and exhibit a higher frequency of clinical examination findings. Similarly, the external data exhibit overall higher laboratory parameter values for the variables correlated with appendicitis. Despite this, we observe no statistically significant difference in the neutrophil percentage, likely due to the high rate of missing values for this predictor in the external dataset (see Figure 4).
Furthermore, the Düsseldorf cohort has a lower frequency of positive US findings. We attribute this trend to the higher rate of missing values for relevant variables in the Regensburg dataset (refer to Figure 4) and the fact that the summary statistics shown in Table 1 have been calculated only across the non-missing entries without imputation. By contrast, the reported appendix diameter is significantly larger for the external dataset subjects. Lastly, it is worth noting that the information about the appendix perfusion is entirely missing in the Düsseldorf dataset.
The datasets also differ considerably in two of the response variables: diagnosis and management. The Düsseldorf cohort has a significantly higher prevalence of appendicitis cases (76.2% vs. 57.2%) with, consequently, more patients managed surgically (80.5% vs. 38.4%). While the external dataset has a higher prevalence of complicated appendicitis cases (16.4% vs. 11.9%), this difference is not statistically significant.
In summary, the external dataset from Düsseldorf and the original dataset from Regensburg exhibit statistically significant differences with regard to the distribution of the majority of the observed variables (consult Table 1), including the response variables. Moreover, the frequency of missing values also varies across the cohorts (refer to Figure 4). These dissimilarities potentially pose challenges for the generalization of predictive models across institutions.
3.2 Predictive performance
We now turn to the external validation of the ML models. Table 2 contains AUROC and AUPR measurements for predicting the diagnosis, management and severity of appendicitis on the Regensburg and Düsseldorf datasets. The results for the Regensburg cohort are taken from the original work (23) and were obtained by 10-fold cross-validation. When validating on the Düsseldorf data, we assess the variability in performance using bootstrapping. For reference, we additionally include the expected metric values for a fair coin flip (random), which serve as our baselines.
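The bootstrap procedure can be sketched as below; the number of resamples and the seed are assumptions, as the exact protocol is not restated here.

```python
# A minimal bootstrap sketch for the external-validation variability.
import numpy as np
from sklearn.metrics import roc_auc_score

def bootstrap_auroc(y_true, y_score, n_boot: int = 1000, seed: int = 0):
    """Mean and standard deviation of AUROC over bootstrap resamples."""
    rng = np.random.default_rng(seed)
    y_true, y_score = np.asarray(y_true), np.asarray(y_score)
    aurocs = []
    for _ in range(n_boot):
        idx = rng.integers(0, len(y_true), size=len(y_true))
        if np.unique(y_true[idx]).size < 2:
            continue  # skip resamples containing a single class
        aurocs.append(roc_auc_score(y_true[idx], y_score[idx]))
    return float(np.mean(aurocs)), float(np.std(aurocs))
```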

Table 2. Validation results for the logistic regression (LR), random forest (RF) and gradient boosting (GB) models predicting the diagnosis, management and severity of appendicitis. The results on the Regensburg dataset are copied from the original study (23), which conducted 10-fold cross-validation. For the Düsseldorf data, we report averages and standard deviations obtained by bootstrapping for the models trained exclusively on the Regensburg cohort (original) and retrained on both cohorts (retrained). The predictive performance is assessed with the areas under the receiver operating characteristic (AUROC) and precision-recall (AUPR) curves.
For the models trained exclusively on the Regensburg data (original), we observe a sizable decrease in the average AUROC for the diagnosis and management when evaluating on the external dataset. For example, the AUROC of the random forest model decreases from 96% to 85% for the diagnosis and from 94% to 85% for the management. In contrast, the external AUPR is comparable to that from the internal validation for these response variables. For the severity, we observe a larger overall decrease in both metrics. For instance, for the random forest, the AUROC decreases from 90% to 75%, and the AUPR drops from 70% to 45%.
Additionally, we explore the tradeoff between the sensitivity, specificity, PPV and NPV while varying the value of the threshold applied to the classifiers' output. We focus our analysis exclusively on the random forest model as it exhibits the most balanced performance across all the response variables for both datasets. These findings are summarized in Figure 2. For the diagnosis and management targets, using the threshold value explored in the original analysis (23), we observe a deterioration in the classifiers' sensitivity, specificity and NPV. For the severity target, by contrast, there is a decline in sensitivity and PPV. Arguably, these changes may be related to the prevalence shift (34) described in Section 3.1 and suggest the necessity of threshold and model recalibration.
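The threshold sweep behind Figure 2 can be sketched as follows; the threshold grid is an assumption.

```python
# A sketch of the threshold analysis: sensitivity, specificity, PPV and NPV
# as functions of the decision threshold applied to predicted probabilities.
import numpy as np

def threshold_metrics(y_true, y_prob, thresholds=np.linspace(0, 1, 101)):
    y_true = np.asarray(y_true, dtype=bool)
    y_prob = np.asarray(y_prob)
    rows = []
    for t in thresholds:
        y_pred = y_prob >= t
        tp, tn = np.sum(y_pred & y_true), np.sum(~y_pred & ~y_true)
        fp, fn = np.sum(y_pred & ~y_true), np.sum(~y_pred & y_true)
        rows.append({
            "threshold": t,
            "sensitivity": tp / (tp + fn) if tp + fn else np.nan,
            "specificity": tn / (tn + fp) if tn + fp else np.nan,
            "ppv": tp / (tp + fp) if tp + fp else np.nan,  # precision
            "npv": tn / (tn + fn) if tn + fn else np.nan,
        })
    return rows
```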

Figure 2. Sensitivity, specificity as well as positive (PPV) and negative (NPV) predictive values plotted against the value of the threshold applied to the output of the random forest model trained exclusively on the original Regensburg dataset for the (a) diagnosis, (b) management and (c) severity of appendicitis. All the metrics were assessed on the external (Düsseldorf) dataset. Bold lines correspond to the medians with the confidence bounds given by the interquartile ranges.
To verify whether the models' performance improves after including a portion of the Düsseldorf data in the training set, we retrain all the models on the aforementioned mixture of the Regensburg and Düsseldorf subjects (see the retrained models in Table 2), assessing them on the withheld portion of the external dataset. For all three classifiers, the average AUROC and AUPR metric values attained on the Düsseldorf data increase moderately after retraining. However, the resulting level of performance is still substantially lower than that of the original models on the Regensburg dataset. The lack of a larger improvement in predictive performance may be due to distribution shift, in particular the discrepancies in the missingness patterns and reporting across the two datasets (see Figure 4).
3.3 Feature importance
To elucidate the predictions made by our models on the external dataset, we calculate permutation feature importance on the test set. Specifically, we assess the importance of individual predictors by permuting (i.e., shuffling) their values and then quantifying the resulting change in the AUROC metric. The outcomes of this analysis are summarized in Figure 3. We limit our investigation to the diagnosis response variable and the random forest model given that it attained the most balanced performance across all the settings (refer back to Table 2).
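This procedure can be sketched with scikit-learn's permutation importance; note that the repeat count below is an assumption, and the article assesses variability via bootstrapping rather than repeated shuffles.

```python
# A sketch of permutation feature importance with an AUROC scorer.
from sklearn.inspection import permutation_importance

def importance_by_auroc(model, X_test, y_test, seed: int = 0):
    """Mean decrease in AUROC after shuffling each predictor."""
    result = permutation_importance(model, X_test, y_test,
                                    scoring="roc_auc", n_repeats=25,
                                    random_state=seed)
    # Assumes X_test is a pandas DataFrame with named columns
    return dict(zip(X_test.columns, result.importances_mean))
```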

Figure 3. Permutation feature importance for the random forest model predicting the diagnosis of appendicitis. The importance is quantified by the decrease in the AUROC predictive performance metric after permuting the values of the predictor variable of interest. The variability in importance is assessed using bootstrapping and it is visualized using box plots.
Similar to the original findings on the Regensburg data (23), the three most important features are the diameter and visibility of the appendix as well as peritonitis. Likewise, the surrounding tissue reaction, target sign, WBC count and neutrophil percentage have notable importance scores on average. Generally, the variable importance on the Düsseldorf data follows a pattern comparable to the results obtained previously on the Regensburg cohort. However, the variability across bootstrap resamples is considerably higher. Nonetheless, these results are not indicative of any concerning trends or spurious associations and fall well within our expectations. Notably, these observations hold for the other two response variables: for treatment, the three most important features are peritonitis, appendix diameter and WBC count, and for complications these are CRP, peritonitis and appendix diameter, which is consistent with the results reported in the Regensburg study (23). In the original analysis, the most important predictors were appendix diameter, peritonitis and CRP, respectively, for diagnosis, treatment and complications.
4 Discussion
In this article, we performed a comprehensive external validation of ML models for predicting the diagnosis, management and severity in pediatric patients with suspected appendicitis (see Figure 1). Specifically, we focused on the models initially trained on the dataset from the tertiary care hospital in Regensburg, Germany (23). To conduct the analysis, we acquired an external dataset at the Florence-Nightingale-Hospital in Düsseldorf, Germany.
We observed that the external Düsseldorf dataset presents a statistically significant shift in the distribution of the covariates (captured in Table 1), including the response variables. Furthermore, the rates of missing values differ considerably across the two hospitals (as shown in Figure 4), especially for US-related variables and the percentage of neutrophils. Such discrepancies pose substantial challenges to the transferability of ML models to settings different from those considered at the training time (35).

Figure 4. Percentages of missing values across all features for the original Regensburg and external Düsseldorf data. We observe considerable differences in the rates of missing values, especially for the ultrasonographic findings and neutrophil percentage.
In assessing the models' predictive performance (reported in Table 2), we observed that the patients' diagnoses and treatment assignments could be predicted on the external Düsseldorf data by the models trained solely on the Regensburg cohort. In particular, compared to the original analysis (23), there was no decrease in AUPR and a moderate 10 percentage point decrease in AUROC. These performance levels are close to the AUROC of 90% reported as the baseline in a recent systematic review assessing the accuracy of artificial intelligence-based tools used in pediatric appendicitis diagnosis (36). In contrast, the predictive performance for the severity decreased more substantially. In addition to AUROC and AUPR measurements, we examined the tradeoff among the sensitivity, specificity, PPV and NPV (visualized in Figure 2). Furthermore, the feature importance analysis on the external dataset (shown in Figure 3) exhibited no concerning patterns.
Several risk prediction models for appendicitis have been built, evaluated and validated, including for varied patient cohorts (adults vs. children), in- and out-patient treatments, and outcomes. A recent prospective multicenter study also evaluated the performance of risk scores to identify appendicitis among children brought to the hospital emergency department (37). It found that low appendicitis scores (≤2 for Alvarado or PAS, or ≤3 for the Shera score) can be used to preselect children who can be discharged without further evaluation, but it was unable to offer guidelines to select children who should proceed directly to surgery, indicating that patients with medium and high risk scores should undergo routine imaging examination (37). Our original and current studies relied on a wide selection of variables, including the aforementioned risk scores (Alvarado score and PAS) and imaging parameters (ultrasound) from pediatric patients suspected of appendicitis, not only to exclude children with a low probability of appendicitis, but also to predict the diagnosis, management and severity of appendicitis (11).
To explore the potential of model updating (35), we retrained the classifiers on a mixture of the two datasets. This led to a moderate increase in AUROC and AUPR across all the target variables (refer to Table 2), suggesting that model updating, indeed, helps to tackle cross-hospital distribution shifts. More generally, our empirical findings indicate some degree of transferability of the considered predictive models across the two hospitals. Nonetheless, the decrease in predictive performance across several evaluation metrics is noticeable and could not be fully mitigated by retraining alone (as demonstrated by Table 2 and Figure 2). We hypothesize that this decrease in performance may be attributed to the shift in the prevalence of appendicitis cases, different missing value and data recording patterns, and variability in patient management routines. Below, we discuss these challenges in more detail.
As stated in Section 3.1, the distribution of most parameters differed across the two datasets. Unique regional and internal hospital practices can, at least partially, explain the observed differences. Notably, the dataset from Regensburg was acquired from in-hospital patients admitted to a pediatric surgery department of a specialized pediatric hospital. The Düsseldorf dataset, on the other hand, was acquired from a pediatric surgery department of a general hospital with other surgical disciplines, such as general and orthopedic surgery. As a consequence, children aged 14 years or older were treated by general surgeons in Düsseldorf, and only those younger than 14 were seen and managed by pediatric surgeons. Consequently, only the latter group of patients was included in the study, which explains why the children from the Florence-Nightingale-Hospital were younger than those in Regensburg (see Table 1 for the corresponding medians and interquartile ranges in years).
The differences in the frequency of variable documentation reflect the internal organizational habits of the hospitals and departments, including variations in standardized admission reports and internal emergency department standards. Additionally, in Regensburg, children and adolescents were admitted by pediatric surgeons or residents in pediatric surgery or pediatrics, whereas in Düsseldorf, admissions were performed both by pediatric surgeons or residents and by residents in general or orthopedic surgery working in the emergency department. Consequently, ultrasound performance and report documentation differ across the two datasets.
Additionally, the clinical pathways and referral practices differed between the two institutions. In Düsseldorf, pediatricians and general practitioners were more likely to refer children to pediatric surgery when the diagnosis of appendicitis was already relatively clear. This preselection process likely contributed to the higher prevalence of confirmed appendicitis and surgical treatment cases in the Düsseldorf cohort. In contrast, the specialized pediatric setting in Regensburg enabled children with less specific abdominal symptoms to be evaluated by multiple specialties, such as pediatric gastroenterology, leading to a broader spectrum of differential diagnoses and, in some cases, more conservative management. Furthermore, the clinical pathway in Regensburg often involved admitting children with tenderness in the right iliac fossa to pediatric surgery for further observation, even in cases with unclear diagnosis. These patients were included in the cohort and may partly explain the higher rate of non-appendicitis cases, lower CRP values and lower frequency of surgical intervention.
Another noteworthy aspect is the time period of data acquisition. While the cohort from Regensburg included patients from January 2016 to December 2018, the Düsseldorf data were acquired from January 2015 to February 2022. The latter cohort therefore also included patients admitted during and after the COVID-19 pandemic. Negative appendectomy rates were lower during the pandemic (38), as patients might have sought medical care or been referred to the hospital only when a positive diagnosis was deemed more probable. This factor, alongside the higher frequency of delayed hospital presentations, might have contributed to the higher appendicitis prevalence and the higher rate of complicated cases observed in Düsseldorf (38, 39).
From the medical perspective, the limitations of the current study are similar to those reported in the original work that developed ML models on the Regensburg cohort (23). These include the absence of a confirmed appendicitis diagnosis for patients treated conservatively, the limited number of study participants and missing values. Additionally, unique regional and internal hospital practices reduce the comparability of the collected datasets and the transferability of the models, which, as we demonstrated, cannot be easily compensated for with model updating. Nonetheless, the observed distributions of parameters from both cohorts are clinically acceptable and display variability that is within expectations. Notably, our study allows us to contrast the setting of a specialized pediatric hospital with that of a general hospital where adult surgery and interdisciplinary surgical primary care are dominant. Lastly, the documented clinical, laboratory and ultrasound features are standardized, practical and cost-effective, enabling future analysis and comparison of our models on data from other institutions.
5 Conclusion
In this study, we performed an external validation of machine learning models for predicting the diagnosis, management and severity of pediatric appendicitis. When tested externally, the models exhibited lower predictive performance than on the original data. This was in part due to the shift in the prevalence of appendicitis cases we observed between the original and external datasets. Other possible reasons included intrinsic differences in patient demographics, clinical pathways, variations in referral practices and documentation standards for the two hospitals as well as the downstream effects of the COVID-19 pandemic. Such factors demonstrate the challenges of transferring predictive models between hospitals, which should always be done with care to avoid harmful fallout. As a potential remedy, we investigated model retraining; while it showed promise in restoring predictive performance, further research is necessary to determine the limitations of this approach, which we will explore in our future work. Additionally, we plan to investigate the possible design of the prospective evaluation and deployment of our predictive models. Specifically, we will look into defining the number of necessary blood tests and introducing standardized reporting guidelines for clinical examination and ultrasound findings.
Data availability statement
The analyzed dataset in an anonymized form is available alongside the code in the following GitHub repository: https://github.com/i6092467/pediatric-appendicitis-ml-ext. Further inquiries can be directed to the corresponding authors.
Ethics statement
The studies involving humans were approved by the Ethics Committee of the University of Regensburg (18-1063-101, 18-1063-3-101). The studies were conducted in accordance with the local legislation and institutional requirements. In accordance with national legislation and institutional requirements, written informed consent was not required from participants or their legal guardians/next of kin for analysis of retrospective data. Informed consent was obtained from participants or their legal representatives for any follow-up investigation.
Author contributions
RM: Conceptualization, Writing – original draft, Writing – review & editing, Formal analysis, Investigation, Methodology, Software, Validation, Visualization. KS: Writing – original draft, Writing – review & editing, Formal analysis, Investigation, Methodology, Software, Validation, Visualization. AP: Writing – review & editing, Formal analysis, Investigation, Software, Validation. MAH: Writing – review & editing, Data curation, Investigation. VR: Writing – review & editing, Data curation, Investigation. SW: Writing – review & editing, Supervision. CK: Writing – review & editing, Methodology, Supervision. BR: Writing – review & editing, Methodology, Supervision. JEV: Writing – review & editing, Project administration, Resources, Supervision. PRW: Conceptualization, Writing – original draft, Writing – review & editing, Data curation, Investigation, Methodology, Project administration, Supervision.
Funding
The authors declare that financial support was received for the research and/or publication of this article. The open access funding was provided by ETH Zurich.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Footnote
1. ^This dataset is available at https://github.com/i6092467/pediatric-appendicitis-ml.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Bhangu A, Søreide K, Di Saverio S, Assarsson JH, Drake FT. Acute appendicitis: modern understanding of pathogenesis, diagnosis, and management. Lancet. (2015) 386:1278–87. doi: 10.1016/s0140-6736(15)00275-5
2. Rentea RM, Peter SDS, Snyder CL. Pediatric appendicitis: state of the art review. Pediatr Surg Int. (2016) 33:269–83. doi: 10.1007/s00383-016-3990-2
3. Di Saverio S, Podda M, De Simone B, Ceresoli M, Augustin G, Gori A, et al. Diagnosis and treatment of acute appendicitis: 2020 update of the WSES Jerusalem guidelines. World J Emerg Surg. (2020) 15:27. doi: 10.1186/s13017-020-00306-3
4. Dingemann J, Ure B. Imaging and the use of scores for the diagnosis of appendicitis in children. Eur J Pediatr Surg. (2012) 22:195–200. doi: 10.1055/s-0032-1320017
5. Escribá A, Gamell AM, Fernández Y, Quintillá JM, Cubells CL. Prospective validation of two systems of classification for the diagnosis of acute appendicitis. Pediatr Emerg Care. (2011) 27:165–9. doi: 10.1097/pec.0b013e31820d6460
6. Decker E, Ndzi A, Kenny S, Harwood R. Systematic review and meta-analysis to compare the short- and long-term outcomes of non-operative management with early operative management of simple appendicitis in children after the COVID-19 pandemic. J Pediatr Surg. (2024) 59:1050–7. doi: 10.1016/j.jpedsurg.2023.12.021
7. Svensson J, Hall N, Eaton S, Pierro A, Wester T. A review of conservative treatment of acute appendicitis. Eur J Pediatr Surg. (2012) 22:185–94. doi: 10.1055/s-0032-1320014
8. Svensson JF, Patkova B, Almström M, Naji H, Hall NJ, Eaton S, et al. Nonoperative treatment with antibiotics versus surgery for acute nonperforated appendicitis in children: a pilot randomized controlled trial. Ann Surg. (2015) 261:67–71. doi: 10.1097/sla.0000000000000835
9. Ohba G, Hirobe S, Komori K. The usefulness of combined B mode and Doppler ultrasonography to guide treatment of appendicitis. Eur J Pediatr Surg. (2016) 26:533–6. doi: 10.1055/s-0035-1570756
10. Park HC, Kim MJ, Lee BH. Randomized clinical trial of antibiotic therapy for uncomplicated appendicitis. Br J Surg. (2017) 104:1785–90. doi: 10.1002/bjs.10660
11. Reis Wolfertstetter P, Ebert JB, Barop J, Denzinger M, Kertai M, Schlitt HJ, et al. Suspected simple appendicitis in children: should we use a nonoperative, antibiotic-free approach? An observational study. Children. (2024) 11:340. doi: 10.3390/children11030340
12. Benabbas R, Hanna M, Shah J, Sinert R. Diagnostic accuracy of history, physical examination, laboratory tests, and point-of-care ultrasound for pediatric acute appendicitis in the emergency department: a systematic review and meta-analysis. Acad Emerg Med. (2017) 24:523–51. doi: 10.1111/acem.13181
13. Bonadio W, Peloquin P, Brazg J, Scheinbach I, Saunders J, Okpalaji C, et al. Appendicitis in preschool aged children: regression analysis of factors associated with perforation outcome. J Pediatr Surg. (2015) 50:1569–73. doi: 10.1016/j.jpedsurg.2015.02.050
14. Acharya A, Markar SR, Ni M, Hanna GB. Biomarkers of acute appendicitis: systematic review and cost-benefit trade-off analysis. Surg Endosc. (2016) 31:1022–31. doi: 10.1007/s00464-016-5109-1
15. Zani A, Teague WJ, Clarke SA, Haddad MJ, Khurana S, Tsang T, et al. Can common serum biomarkers predict complicated appendicitis in children? Pediatr Surg Int. (2017) 33:799–805. doi: 10.1007/s00383-017-4088-1
16. Koberlein GC, Trout AT, Rigsby CK, Iyer RS, Alazraki AL, Anupindi SA, et al. ACR appropriateness criteria® suspected appendicitis-child. J Am Coll Radiol. (2019) 16:S252–63. doi: 10.1016/j.jacr.2019.02.022
17. Akmese OF, Dogan G, Kor H, Erbay H, Demir E. The use of machine learning approaches for the diagnosis of acute appendicitis. Emerg Med Int. (2020) 2020:7306435. doi: 10.1155/2020/7306435
18. Aparicio PR, Marcinkevičs R, Reis Wolfertstetter P, Wellmann S, Knorr C, Vogt JE. Learning medical risk scores for pediatric appendicitis. In: 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA). IEEE (2021). p. 1507–12.
19. Aydin E, Türkmen İU, Namli G, Öztürk Ç, Esen AB, Eray YN, et al. A novel and simple machine learning algorithm for preoperative diagnosis of acute appendicitis in children. Pediatr Surg Int. (2020) 36:735–42. doi: 10.1007/s00383-020-04655-7
20. Deleger L, Brodzinski H, Zhai H, Li Q, Lingren T, Kirkendall ES, et al. Developing and evaluating an automated appendicitis risk stratification algorithm for pediatric patients in the emergency department. J Am Med Inform Assoc. (2013) 20:e212–20. doi: 10.1136/amiajnl-2013-001962
21. Hsieh C-H, Lu R-H, Lee N-H, Chiu W-T, Hsu M-H, Li Y-CJ. Novel solutions for an old disease: diagnosis of acute appendicitis with random forest, support vector machines, and artificial neural networks. Surgery. (2011) 149:87–93. doi: 10.1016/j.surg.2010.03.023
22. Marcinkevičs R, Reis Wolfertstetter P, Klimiene U, Chin-Cheong K, Paschke A, Zerres J, et al. Interpretable and intervenable ultrasonography-based machine learning models for pediatric appendicitis. Med Image Anal. (2024) 91:103042. doi: 10.1016/j.media.2023.103042
23. Marcinkevics R, Reis Wolfertstetter P, Wellmann S, Knorr C, Vogt JE. Using machine learning to predict the diagnosis, management and severity of pediatric appendicitis. Front Pediatr. (2021) 9:662183. doi: 10.3389/fped.2021.662183
24. Rajpurkar P, Park A, Irvin J, Chute C, Bereket M, Mastrodicasa D, et al. AppendiXNet: Deep learning for diagnosis of appendicitis from a small dataset of CT exams using video pretraining. Sci Rep. (2020) 10:3958. doi: 10.1038/s41598-020-61055-6
25. Reismann J, Romualdi A, Kiss N, Minderjahn MI, Kallarackal J, Schad M, et al. Diagnosis and classification of pediatric acute appendicitis by artificial intelligence methods: an investigator-independent approach. PLoS One. (2019) 14:e0222030. doi: 10.1371/journal.pone.0222030
26. Stiel C, Elrod J, Klinke M, Herrmann J, Junge C-M, Ghadban T, et al. The modified Heidelberg and the AI appendicitis score are superior to current scores in predicting appendicitis in children: a two-center cohort study. Front Pediatr. (2020) 8:592892. doi: 10.3389/fped.2020.592892
27. Xia J, Wang Z, Yang D, Li R, Liang G, Chen H, et al. Performance optimization of support vector machine with oppositional grasshopper optimization for acute appendicitis diagnosis. Comput Biol Med. (2022) 143:105206. doi: 10.1016/j.compbiomed.2021.105206
28. Sokol K, Fackler J, Vogt JE. Artificial intelligence should genuinely support clinical reasoning and decision making to bridge the translational gap. npj Digit Med. (2025) 8:1–11. doi: 10.1038/s41746-025-01725-9
29. Wiens J, Saria S, Sendak M, Ghassemi M, Liu VX, Doshi-Velez F, et al. Do no harm: a roadmap for responsible machine learning for health care. Nat Med. (2019) 25:1337–40. doi: 10.1038/s41591-019-0548-6
30. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B. (1995) 57:289–300. doi: 10.1111/j.2517-6161.1995.tb02031.x
31. Breiman L. Random forests. Mach Learn. (2001) 45:5–32. doi: 10.1023/A:1010933404324
32. Friedman JH. Greedy function approximation: a gradient boosting machine. Ann Stat. (2001) 29:1189–232. doi: 10.1214/aos/1013203451
33. R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing (2021).
34. Godau P, Kalinowski P, Christodoulou E, Reinke A, Tizabi M, Ferrer L, et al. Deployment of image analysis algorithms under prevalence shifts. In: Medical Image Computing and Computer Assisted Intervention – MICCAI 2023. Springer (2023). p. 389–99.
35. van Royen FS, Moons KG, Geersing G-J, van Smeden M. Developing, validating, updating and judging the impact of prognostic models for respiratory diseases. Eur Respir J. (2022) 60:2200250. doi: 10.1183/13993003.00250-2022
36. Rey R, Gualtieri R, La Scala G, Posfay Barbe K. Artificial intelligence in the diagnosis and management of appendicitis in pediatric departments: a systematic review. Eur J Pediatr Surg. (2024) 34:385–91. doi: 10.1055/a-2257-5122
37. Nepogodiev D, Wilkin RJ, Bradshaw CJ, Skerritt C, Ball A, Moni-Nwinia W, et al. Appendicitis risk prediction models in children presenting with right iliac fossa pain (RIFT study): a prospective, multicentre validation study. Lancet Child Adolesc Health. (2020) 4:271–80. doi: 10.1016/S2352-4642(20)30006-7
38. Bethell GS, Gosling T, Rees CM, Sutcliffe J, Hall NJ, CASCADE Study Collaborators and RIFT Study Collaborators. Impact of the COVID-19 pandemic on management and outcomes of children with appendicitis: the children with AppendicitiS during the CoronAvirus panDEmic (CASCADE) study. J Pediatr Surg. (2022) 57:380–5. doi: 10.1016/j.jpedsurg.2022.03.029
Keywords: artificial intelligence, machine learning, predictive modeling, medical decision support systems, evaluation
Citation: Marcinkevičs R, Sokol K, Paulraj A, Hilbert MA, Rimili V, Wellmann S, Knorr C, Reingruber B, Vogt JE and Reis Wolfertstetter P (2025) External validation of predictive models for diagnosis, management and severity of pediatric appendicitis. Front. Pediatr. 13:1587488. doi: 10.3389/fped.2025.1587488
Received: 4 March 2025; Accepted: 28 July 2025;
Published: 29 August 2025.
Edited by: Francesco Morini, Sapienza University of Rome, Italy
Reviewed by: Juan A. Tovar, Hospital Universitario La Paz, Spain; Dinh Tuan Phan Le, New York City Health and Hospitals Corporation, United States
Copyright: © 2025 Marcinkevičs, Sokol, Paulraj, Hilbert, Rimili, Wellmann, Knorr, Reingruber, Vogt and Reis Wolfertstetter. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Kacper Sokol, kacper.sokol@inf.ethz.ch; Patricia Reis Wolfertstetter, patricia.reiswolfertstetter@barmherzige-regensburg.de
†These authors have contributed equally to this work and share first authorship
‡These authors have contributed equally to this work and share last authorship
§Present Addresses: Akhil Paulraj, Computer Science Department, Princeton University, Princeton, NJ, United States
Melinda A. Hilbert, Clinic for Trauma, Hand Surgery and Sports Medicine, ViDia Clinics Karlsruhe, Karlsruhe, Germany
Vivien Rimili, Department of Pediatric Surgery and Pediatric Urology, Children’s Hospital of Cologne, Cologne, Germany
Christian Knorr, Department of Pediatric Surgery, RoMed Klinikum Rosenheim, Rosenheim, Germany