ORIGINAL RESEARCH article
Front. Artif. Intell.
Sec. Medicine and Public Health
Volume 8 - 2025 | doi: 10.3389/frai.2025.1652397
Predicting Pediatric Diagnostic Imaging Patient No-show and Extended Wait-Times Using LLMs, Regression, and Tree Based Models
Provisionally accepted- 1Department of Mechanical and Industrial Engineering, University of Toronto, Toronto, Canada
- 2Research Institute, The Hospital for Sick Children, Toronto, Canada
- 3Department of Medical Imaging, University of Toronto, Toronto, Canada
- 4Department of Computer Science, University of Toronto, Toronto, Canada
- 5Department of Diagnostic & Interventional Radiology, Toronto, Canada
- 6Australian Institute for Machine Learning, University of Adelaide, Adelaide, Australia
- 7Women's and Children's Health Network, Adelaide, Australia
- 8Seattle Children's Heart Center, Seattle Washington, United States
- 9Department of Diagnostic & Interventional Radiology, The Hospital for Sick Children, Toronto, Canada
- 10Institute of Medical Science, University of Toronto, Toronto, Canada
- 11Vector Institute, Toronto, Canada
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Patients missing their appointments (no-shows) are a persistent issue that results in idle resources while delaying critical patient prognosis. Likewise, long waiting times increase frustration for patients, leaving a negative impression on the appointment. In this paper, we explore 3 modalities of diagnostic and interventional radiology appointments for pediatric patients at the Hospital for Sick Children (SickKids), Toronto, ON, Canada. Our goal was to survey machine learning methods that best predict the risk of patient no-shows and long wait-times exceeding 1 hour for scheduling teams to propose targeted downstream accommodations. We experimented with 6 predictive model types separately trained on both tasks which included extreme gradient boosting (XGBoost), Random Forest (RF), Support Vector Machine, Logistic Regression, Artificial Neural Network, and a pre-trained large language model (LLM). Utilizing 20 features containing a mixture of patient demographics and appointment related data, we experimented with different data balancing methods including instance hardness threshold (IHT) and class weighting to reduce bias in prediction. We then conducted a comparative study of the improvements made by utilizing continuous contextual data in our LLM which boasted a 51% improvement in F1 score for the wait-time model. Our XGBoost model had the best combination of AUC and F1 scores (0.96 and 0.62, respectively) for predicting no-show while RF had the best AUC and F1 scores (0.83 and 0.61, respectively) for wait-time prediction. The LLMs also performed well for 90% probability thresholds (high risk patients) while being robustly calibrated on unseen test data. Our results surveyed multiple algorithms and data balancing methods to propose the greatest performing models on our tasks, implemented a unique methodology to use LLMs on heterogeneous data within this domain, and demonstrated the greater importance of contextual appointment data over patient demographic features for a more equitable prediction algorithm. Going forward, the predictive output (calibrated probabilities of events) can be used as stochastic input for risk-based optimized scheduling to provide accommodation for patients less likely to receive quality access to healthcare.
Keywords: No-show, wait-times, Scheduling, prediction, Large Language Model, machine learning, Data balancing, Calibration
Received: 23 Jun 2025; Accepted: 31 Jul 2025.
Copyright: © 2025 Rafique, Liu, Gong, Belsito, Mccradden, Mazwi, Lee, Ohanlon, Tsang, Ertl-Wagner and Khalvati. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Farzad Khalvati, Department of Mechanical and Industrial Engineering, University of Toronto, Toronto, Canada
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.