Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Pediatr.

Sec. Pediatric Cardiology

Machine Learning-Based Prediction of Hemodynamically Significant Patent Ductus Arteriosus in Preterm Neonates: A Pioneering Insight

Provisionally accepted
  • 1Izmir Democracy University Buca Seyfi Demirsoy Training and Research Hospital, İzmir, Türkiye
  • 2TC Saglik Bakanligi Izmir Sehir Hastanesi, Izmir, Türkiye

The final, formatted version of the article will be published soon.

Background: Hemodynamically significant patent ductus arteriosus (hPDA) in premature infants is a common congenital cardiac anomaly associated with substantial morbidity and mortality. Traditional diagnostic methods like echocardiography face challenges such as expertise requirement and inconsistent accessibility. This study investigates the efficacy of the Random Forest machine learning model in predicting hPDA in premature infants, aiming to provide a non-invasive, objective, and reliable alternative. Methods: This retrospective study analyzed data from 657 premature infants hospitalized between 2014 and 2019. Patients were categorized into hPDA and asymptomatic PDA (aPDA) groups. The Random Forest classification model, implemented in JASP software, utilized prenatal, natal, and postnatal clinical data, including gestational week, birth weight, and the need for resuscitation at birth. Model performance was assessed using metrics such as accuracy, Area Under the Curve, F1 score, Matthews Correlation Coefficient, recall, precision, and feature importance. Results: The Random Forest model demonstrated strong predictive performance, achieving a test accuracy of 91.7%, an AUC of 0.950, an F1 score of 0.923, and an MCC of 0.775. Notably, the recall for the hPDA group was 100%. Gestational week, birth weight, and the need for resuscitation at birth were identified as the most significant predictors. The model also revealed complex relationships, showing variables deemed statistically insignificant by classical methods (e.g., gender, 5th-minute APGAR score, oligohydramnios) to be significant within the Random Forest framework. This is a provisional file, not the final typeset article Conclusions: The Random Forest model effectively predicts hPDA risk in premature infants, offering superior predictive power compared to classical statistical analyses. This approach has the potential to enhance early detection, facilitate timely interventions, and support personalized treatment strategies, thereby improving patient outcomes. Further validation through large-scale, multi-center prospective studies is essential for its integration into clinical practice.

Keywords: patent ductus arteriosus, premature infants, random forest model, machine learning, Gestational week, Birth Weight, F1 score

Received: 17 Oct 2025; Accepted: 13 Nov 2025.

Copyright: © 2025 Ay and Gunes. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Oğuzhan Ay, oguzhanay1@gmail.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.