Your new experience awaits. Try the new design now and help us make it even better

SYSTEMATIC REVIEW article

Front. Artif. Intell.

Sec. Medicine and Public Health

This article is part of the Research TopicArtificial Intelligence and Medical Image ProcessingView all 7 articles

Comparative Accuracy of Artificial Intelligence versus Manual Interpretation in Detecting Pulmonary Hypertension Across Chest Imaging Modalities: A Diagnostic Test Accuracy (DTA) Meta-Analysis

Provisionally accepted
Faizan  AhmedFaizan Ahmed1Faseeh  Haider, MDFaseeh Haider, MD2Ramsha  AliRamsha Ali3Muhammad  ArhamMuhammad Arham4Yusra  JunaidYusra Junaid5Allah  DadAllah Dad4Kinza  BakhtKinza Bakht4Maryam  AbbasiMaryam Abbasi6Bareera  Tanveer MalikBareera Tanveer Malik4Abdul  MateenAbdul Mateen7Najam  GoharNajam Gohar8Rubiya  AliRubiya Ali9Yasar  SattarYasar Sattar10*Mushood  AhmedMushood Ahmed11Mushood  AhmedMushood Ahmed1Swapnil  PatelSwapnil Patel1Jesus  AlmendralJesus Almendral1Jesus  AlmendralJesus Almendral12
  • 1Hackensack Meridian Jersey Shore University Medical Center, Neptune City, United States
  • 2Allama Iqbal Medical College, Lahore, Pakistan
  • 3Peoples University of Medical and Health Sciences, Shaheed Benazirabad, Sindh, Pakistan, Shaheed Benazairabad, Pakistan
  • 4Sheikh Zayed Medical College, Rahim Yar Khan, Pakistan
  • 5Dow University of Health Sciences Institute of Business and Health Management, Karachi, Pakistan
  • 6Jinnah Sindh Medical University, Karachi, Pakistan
  • 7Ameer ud Din Medical College, Lahore, Pakistan
  • 8Post Graduate Medical Institute Ameer-ud-Din Medical College, Lahore, Pakistan
  • 9Memorial Hermann Health System, Houston, United States
  • 10TidalHealth Nanticoke, Seaford, United States
  • 11Rawalpindi Medical University, Rawalpindi, Pakistan
  • 12Duke University Division of Cardiology, Durham, United States

The final, formatted version of the article will be published soon.

Introduction: Pulmonary hypertension (PH) has an incidence of approximately 6 cases per million adults, with a global prevalence ranging from 49 to 55 cases per million adults. Recent advancements in artificial intelligence (AI) have demonstrated promising improvements in the diagnostic accuracy of imaging for PH, achieving an area under the curve (AUC) of 0.94, compared to seasoned professionals. Research objective: To systematically synthesize available evidence on the comparative accuracy of AI versus manual interpretation in detecting PH across various chest imaging modalities i.e. chest X-ray, echocardiography, CT scan and cardiac MRI. Methods: Following PRISMA guidelines, a comprehensive search was conducted across five databases—PubMed, Embase, ScienceDirect, Scopus, and the Cochrane Library—from inception through March 2025. Statistical analysis was performed using R (version 2024.12.1+563) with 2×2 contingency data. Sensitivity, specificity, and diagnostic odds ratio (DOR) were pooled using a bivariate random-effects model (reitsma() from the mada package), while the AUC were meta-analyzed using logit-transformed values via the metagen() function from the meta package. Results: This meta-analysis of 12 studies, encompassing 7459 patients, demonstrated a statistically significant improvement in diagnostic accuracy of PH with AI integration, evidenced by a logit mean difference in AUC of 0.43 (95% CI: 0.23–0.64; p < 0.0001) and low heterogeneity (I² = 21.0%, τ² < 0.0001, p = 0.2090), which was consolidated by pooled AUC of 0.934 on bivariate model. Pooled sensitivity and specificity for AI models were 0.83 (95% CI: 0.73–0.90) and 0.91 (95% CI: 0.86–0.95), respectively, with substantial heterogeneity for sensitivity (I² = 83.8%, τ² = 0.4934, p < 0.0001) and moderate for specificity (I² = 41.5%, τ² = 0.1015, p = 0.1146); the diagnostic odds ratio was 54.26 (95% CI: 22.50–130.87) with substantial heterogeneity (I² = 70.7%, τ² = 0.8451, p = 0.0023). Sensitivity analysis showed stable estimates and did not reduce heterogeneity across outcomes. Conclusions: AI-integrated imaging significantly enhances diagnostic accuracy for pulmonary hypertension, with higher sensitivity (0.83) and specificity (0.91) compared to manual interpretation across chest imaging modalities. However, further high-quality trials with externally validated cohorts may be needed to confirm these findings and reduce variability among AI models across diverse clinical settings.

Keywords: artificial intelligence, Chest imaging, Diagnostic accuracy, Meta-analysis, pulmonary hypertension

Received: 20 Sep 2025; Accepted: 18 Dec 2025.

Copyright: © 2025 Ahmed, Haider, MD, Ali, Arham, Junaid, Dad, Bakht, Abbasi, Malik, Mateen, Gohar, Ali, Sattar, Ahmed, Ahmed, Patel, Almendral and Almendral. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Yasar Sattar

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.