SYSTEMATIC REVIEW article

Front. Psychiatry

Sec. Digital Mental Health

This article is part of the Research Topic: Advances in Artificial Intelligence Applications that Support Psychosocial Health

Artificial Intelligence in Mental Health Care: A Scoping Review of Reviews

Provisionally accepted
Mohammad S. Abu-Mahfouz, Sarah AlFehaid, Hala M. Burqan, Rabie Adel El Arab*
  • Almoosa College of Health Sciences, Al Ahsaa, Saudi Arabia

The final, formatted version of the article will be published soon.

Background: Artificial intelligence (AI) is rapidly entering mental health care, but most models remain proof-of-concept, with limited external validation and substantial risk of overfitting.

Methods: This scoping review of reviews adhered to the PRISMA-ScR checklist and Joanna Briggs Institute guidance. We searched MEDLINE, Embase, PsycINFO, and IEEE Xplore. Eligible publications encompassed systematic, scoping, narrative, integrative, meta-analytic, and patent reviews. Findings were synthesised thematically.

Results: Thirty-one reviews were included. Evidence concentrated on depression and anxiety; schizophrenia, bipolar disorder, perinatal mental health, autism spectrum conditions, older adults, nurses, and allied professionals were under-represented. Across screening, diagnosis/classification, and risk prediction, high accuracy was frequently reported under internal validation; in prior syntheses, typical internal AUCs clustered around ≈0.80–0.88 whereas externally or prospectively validated performance was scarce and typically attenuated. Signals were strongest for narrow, feedback-rich tasks, with greater decay for general-purpose models and longer prediction horizons. Conversational agents produced small-to-moderate short-term improvements in depressive symptoms (SMD ≈0.2–0.6); effects for anxiety and stress were smaller or inconsistent and varied with comparator stringency, follow-up (≤8–12 weeks vs longer), and the degree of human guidance. Most chatbot evaluations were short and small-scale, with few randomized or pragmatic trials and limited data on durability beyond 12 weeks. Real-world implementation was limited; several reviews identified usability and electronic health-record integration as prerequisites for adoption, and explainability alone rarely conferred actionability without clinician training. Ethical readiness was incomplete: privacy and bias were commonly discussed, but accountability, post-deployment monitoring, and crisis-escalation protocols were inconsistently specified. Economic evaluations were uncommon and rarely accounted for integration, maintenance, or re-training costs. Workforce outcomes (literacy, confidence, readiness) were infrequently measured. Internal and external metrics were not pooled.

Conclusions: AI applications span the mental-health care continuum but remain early in translation. Performance that appears strong under internal validation often attenuates on external or prospective testing; symptomatic gains are concentrated in depression/anxiety and may diminish over longer follow-up; and adoption is constrained by usability, EHR integration, and incomplete governance. The cross-review signal highlights consistent gaps in accountability, post-deployment monitoring and crisis escalation, equity reporting, workforce readiness, and life-cycle economics (including integration, monitoring, and re-training).

Keywords: anxiety disorders, artificial intelligence, diagnosis, digital therapeutics, mental health, mood disorders, predictive modeling, screening

Received: 18 Aug 2025; Accepted: 19 Jan 2026.

Copyright: © 2026 Abu-Mahfouz, AlFehaid, Burqan and El Arab. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Rabie Adel El Arab

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.