Your new experience awaits. Try the new design now and help us make it even better

SYSTEMATIC REVIEW article

Front. Artif. Intell.

Sec. Medicine and Public Health

This article is part of the Research TopicUnlocking New Frontiers in Clinical Lab: AI driven Risk Stratification and Predictive AnalyticsView all articles

Comparative Diagnostic Accuracy of Artificial-intelligence derived risk stratification versus conventional risk stratification methods in Pulmonary Hypertension patients: A systematic review and meta-analysis

Provisionally accepted
Faizan  AhmedFaizan Ahmed1Faseeh  Haider, MDFaseeh Haider, MD2Muhammad  ArhamMuhammad Arham3Allah  DadAllah Dad3Kinza  BakhtKinza Bakht3Muhammad  Moseeb Ali HashimMuhammad Moseeb Ali Hashim4Paweł  ŁajczakPaweł Łajczak5Muhammad  HassanMuhammad Hassan2Fatima  Binte AtharFatima Binte Athar6Muhammad Adnan  HaiderMuhammad Adnan Haider7Muhammad  UsmanMuhammad Usman8Najam  GoharNajam Gohar8Tehmasp  MirzaTehmasp Mirza9Mushood  AhmedMushood Ahmed10Mark  MoshiyakhovMark Moshiyakhov1Brett  SealoveBrett Sealove1Swapnil  PatelSwapnil Patel1Jesus  AlmendralJesus Almendral1Mohamed  BakrMohamed Bakr1Yasar  SattarYasar Sattar11*Fawaz  AleneziFawaz Alenezi12
  • 1Hackensack Meridian Jersey Shore University Medical Center, Neptune City, United States
  • 2Allama Iqbal Medical College, Lahore, Pakistan
  • 3Sheikh Zayed Medical College, Rahim Yar Khan, Pakistan
  • 4University of Missouri, Columbia, United States
  • 5Slaski Uniwersytet Medyczny w Katowicach, Katowice, Poland
  • 6Karachi Medical and Dental College, Karachi, Pakistan
  • 7Hospitalist Physician, Mission Hospital, Asheville, North Carolina, Asheville, United States
  • 8Post Graduate Medical Institute Ameer-ud-Din Medical College, Lahore, Pakistan
  • 9Shalimar Medical and Dental College, Lahore, Pakistan
  • 10Rawalpindi Medical University, Rawalpindi, Pakistan
  • 11Department of Interventional Cardiology, Tidal Health, Salisbury, Maryland, Salsbury, United States
  • 12Duke University Division of Cardiology, Durham, United States

The final, formatted version of the article will be published soon.

Background: Accurate risk stratification in pulmonary hypertension (PH) is integral for optimizing therapeutic strategies and improving patient outcomes. Recent artificial intelligence (AI) models have demonstrated notable efficacy in risk stratification of PH, achieving Area Under Curve (AUC) values of 0.94 and 0.81 in internal and external validation cohorts, respectively. This meta-analysis aims to demonstrate the effectiveness of AI models in the risk stratification of PH by comparing their performance to conventional risk stratification methods. Methods: A systematic search of five databases (PubMed, Embase, ScienceDirect, Scopus, and the Cochrane Library) was conducted from inception to March 2025. Statistical analysis was performed in R (version 2024.12.1+563) using 2×2 contingency data. Sensitivity, specificity, and diagnostic odds ratio (DOR) were pooled using a bivariate random-effects model (reitsma from the mada package), while the AUC was meta-analyzed using logit-transformed values via the metagen() function from the meta package. Results: Six studies were included in the final synthesis, comprising 14,095 patients: 4,481 in internal test datasets and 4,948 in external datasets. AI risk stratification models showed significant performance with a logit mean difference of 0.26 (95% CI 0.09 - 0.43; p=0.31) having low heterogeneity (I2=14.3%) as compared to conventional methods. Furthermore, pooled sensitivity and specificity were 0.77 (95% CI 0.74 - 0.79) and 0.72 (95% CI 0.70 - 0.75) in favour of AI methods respectively. The heterogeneities for pooled sensitivity and specificity were 57.1% (p=0.04) and 91.8% (p<0.0001), underscoring high variability across all studies. Lastly, DOR was substantially high, 8.53 (6.59 - 11.04) in favour of AI models with a high heterogeneity of 73.6% (p=0.002). Heterogeneity (I2) for pooled sensitivity went to 25.9% after excluding major outlier but it remained high for pooled specificity and DOR upon leave-one-out sensitivity analysis. Conclusions: Artificial intelligence-based risk stratification demonstrates significantly higher diagnostic performance compared to conventional methods in pulmonary hypertension. The higher pooled AUC, sensitivity, specificity, and DOR highlight AI's potential to enhance predictive accuracy, guiding better treatment strategies. Nonetheless, more superior quality studies are needed to validate AI models for clinical integration.

Keywords: AI - artificial intelligence, deep learning, Diagnostic accuracy, AI prediction, Risk strategies, pulmonary hypertension

Received: 26 Aug 2025; Accepted: 31 Oct 2025.

Copyright: © 2025 Ahmed, Haider, MD, Arham, Dad, Bakht, Hashim, Łajczak, Hassan, Athar, Haider, Usman, Gohar, Mirza, Ahmed, Moshiyakhov, Sealove, Patel, Almendral, Bakr, Sattar and Alenezi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Yasar Sattar, mdyasarsattar@gmail.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.