Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Artif. Intell.

Sec. Medicine and Public Health

Volume 8 - 2025 | doi: 10.3389/frai.2025.1636410

This article is part of the Research TopicThe Applications of AI Techniques in Medical Data ProcessingView all 16 articles

Multi-center evaluation of machine and deep learning methods to predict glaucoma surgical outcomes

Provisionally accepted
Samuel  BarrySamuel BarrySophia  Y. WangSophia Y. Wang*
  • Byers Eye Institute, Department of Ophthalmology, Stanford University, Stanford, United States

The final, formatted version of the article will be published soon.

Purpose: To develop machine learning (ML) and neural network (NN) models to predict glaucoma surgical outcomes, including intraocular pressure (IOP), use of ocular antihypertensive medications, and need for additional glaucoma surgery, using preoperative electronic health records (EHR) from a large multicenter cohort. Methods: This cohort study included 9,386 patients who underwent glaucoma surgery across 10 institutions in the Sight Outcomes Research Collaborative (SOURCE). All patients had at least 1 year of follow-up and 2 postoperative visits with IOP measurements. Models were trained using preoperative EHR features to predict surgical failure, defined as any of the following: IOP remaining above 80% of preoperative value beyond the immediate postoperative period, increased postoperative glaucoma medications, or need for additional glaucoma surgery. Model performance was evaluated on two test sets: an internal holdout set from sites seen during training and an external holdout set. Results: Of 13,173 surgeries, 8,743 (66.4%) met failure criteria. The best-performing model for overall surgical failure prediction was a one-dimensional convolutional neural network (1D-CNN) with AUROC of 76.4% and accuracy of 71.6% on the internal test set. The top-performing classical ML model was random forest (AUROC 76.2%, accuracy 72.1%). Prediction performance was highest for IOP-related failure (AUROC 82%), followed by need for additional surgery (80%) and increased medication use (68%). AUROC declined slightly (2–4%) on the external test set. Conclusions: ML and DL models can predict glaucoma outcomes using preoperative EHR data. Translational Relevance: Prediction models may support clinical decision-making by identifying glaucoma patients at risk of poor postoperative outcomes.

Keywords: Glaucoma, Surgical outcome prediction, machine learning, deep learning, Electronic Health Records, Multicenter study, Clinical prediction models

Received: 27 May 2025; Accepted: 30 Sep 2025.

Copyright: © 2025 Barry and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Sophia Y. Wang, sywang@stanford.edu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.