Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Artif. Intell.

Sec. Medicine and Public Health

Volume 8 - 2025 | doi: 10.3389/frai.2025.1652397

Predicting Pediatric Diagnostic Imaging Patient No-show and Extended Wait-Times Using LLMs, Regression, and Tree Based Models

Provisionally accepted
Daniel  RafiqueDaniel Rafique1,2Xuan  LiuXuan Liu2Bo  GongBo Gong3,4Laura  BelsitoLaura Belsito5Melissa  D MccraddenMelissa D Mccradden6,7Mjaye  L MazwiMjaye L Mazwi8Wayne  LeeWayne Lee2Graham  OhanlonGraham Ohanlon2Kyle  TsangKyle Tsang2Birgit  Ertl-WagnerBirgit Ertl-Wagner10,2,3,9Farzad  KhalvatiFarzad Khalvati1,10,11,2,3,4,9*
  • 1Department of Mechanical and Industrial Engineering, University of Toronto, Toronto, Canada
  • 2Research Institute, The Hospital for Sick Children, Toronto, Canada
  • 3Department of Medical Imaging, University of Toronto, Toronto, Canada
  • 4Department of Computer Science, University of Toronto, Toronto, Canada
  • 5Department of Diagnostic & Interventional Radiology, Toronto, Canada
  • 6Australian Institute for Machine Learning, University of Adelaide, Adelaide, Australia
  • 7Women's and Children's Health Network, Adelaide, Australia
  • 8Seattle Children's Heart Center, Seattle Washington, United States
  • 9Department of Diagnostic & Interventional Radiology, The Hospital for Sick Children, Toronto, Canada
  • 10Institute of Medical Science, University of Toronto, Toronto, Canada
  • 11Vector Institute, Toronto, Canada

The final, formatted version of the article will be published soon.

Patients missing their appointments (no-shows) are a persistent issue that results in idle resources while delaying critical patient prognosis. Likewise, long waiting times increase frustration for patients, leaving a negative impression on the appointment. In this paper, we explore 3 modalities of diagnostic and interventional radiology appointments for pediatric patients at the Hospital for Sick Children (SickKids), Toronto, ON, Canada. Our goal was to survey machine learning methods that best predict the risk of patient no-shows and long wait-times exceeding 1 hour for scheduling teams to propose targeted downstream accommodations. We experimented with 6 predictive model types separately trained on both tasks which included extreme gradient boosting (XGBoost), Random Forest (RF), Support Vector Machine, Logistic Regression, Artificial Neural Network, and a pre-trained large language model (LLM). Utilizing 20 features containing a mixture of patient demographics and appointment related data, we experimented with different data balancing methods including instance hardness threshold (IHT) and class weighting to reduce bias in prediction. We then conducted a comparative study of the improvements made by utilizing continuous contextual data in our LLM which boasted a 51% improvement in F1 score for the wait-time model. Our XGBoost model had the best combination of AUC and F1 scores (0.96 and 0.62, respectively) for predicting no-show while RF had the best AUC and F1 scores (0.83 and 0.61, respectively) for wait-time prediction. The LLMs also performed well for 90% probability thresholds (high risk patients) while being robustly calibrated on unseen test data. Our results surveyed multiple algorithms and data balancing methods to propose the greatest performing models on our tasks, implemented a unique methodology to use LLMs on heterogeneous data within this domain, and demonstrated the greater importance of contextual appointment data over patient demographic features for a more equitable prediction algorithm. Going forward, the predictive output (calibrated probabilities of events) can be used as stochastic input for risk-based optimized scheduling to provide accommodation for patients less likely to receive quality access to healthcare.

Keywords: No-show, wait-times, Scheduling, prediction, Large Language Model, machine learning, Data balancing, Calibration

Received: 23 Jun 2025; Accepted: 31 Jul 2025.

Copyright: © 2025 Rafique, Liu, Gong, Belsito, Mccradden, Mazwi, Lee, Ohanlon, Tsang, Ertl-Wagner and Khalvati. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Farzad Khalvati, Department of Mechanical and Industrial Engineering, University of Toronto, Toronto, Canada

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.