- 1Department of Psychiatry, The University of Hong Kong, Hong Kong, Hong Kong SAR, China
- 2Occupational Therapy Department, Kwai Chung Hospital, Hong Kong, Hong Kong SAR, China
Introduction: Artificial intelligence (AI) has emerged as a valuable tool in mental health care, with applications in the treatment of psychosis. However, its application to psychosocial functioning in psychosis remains underexplored, despite its critical role towards long-term therapeutic outcomes and recovery. The goal of this systematic review and meta-analysis is to identify, summarize, and evaluate the current evidence on AI applications in psychosocial functioning in psychosis.
Methods: A literature search was conducted using the PubMed, Scopus, and ACM Digital Library databases for articles published between January 2010 and March 2025, in accordance with the PRISMA guidelines. Quality of studies was assessed using the Prediction model Risk Of Bias Assessment Tool (PROBAST), Newcastle-Ottawa Scale (NOS), and the Cochrane Risk of Bias Tool (RoB2.0). Meta-analyses synthesized commonly used performance metrics using random-effects models, with subgroup, sensitivity and publication bias analyses.
Results: A total of 14 studies were included in this review. Various AI techniques were employed, with supervised machine learning being the most predominant. Psychosocial domains, including social function, occupational function, social cognition and quality of life, were targeted. Meta-analysis revealed moderate discriminative and predictive accuracies of AI models: pooled AUC of 0.70 (95% CI: 0.63–0.76) and RMSE of 8.15 (95% CI: 7.32–8.98). Subgroup analyses indicated higher predictive accuracy for social cognition (AUC=0.77) and clinical symptom-based predictors (RMSE=7.1), with substantial heterogeneity mainly attributed to methodological variability.
Conclusions: This review discovered the current application of AI in psychosocial functioning in psychosis, including the techniques usage, modeling approaches, targeted domains, and model performance. AI showed promise for early identification, continuous monitoring, and personalized interventions, driven by methodological advances such as ensemble learning with feature selection. Nevertheless, limitations in methodological consistency, data quality, model design, and ethical issues underscore that the field remains in its early stages. Overall, AI should complement clinical expertise, rather than replace it, in delivering psychosocial care in psychosis.
Systematic review registration: https://www.crd.york.ac.uk/prospero/, identifier CRD420251051952.
1 Introduction
Psychosis is a mental health condition characterized by a range of symptoms including hallucinations, delusions, and disorganized thinking (1). Apart from the core symptoms, a substantial portion of individuals with psychosis encounter considerable impairments in psychosocial functioning, defined as the capacity to carry out everyday activities and to engage with others and the community in a mutually satisfying way (2–6). The key components comprise social functioning, occupational functioning, quality of life (QoL), and role performance (2, 4). In particular, individuals with psychosis suffer from a decline in cognitive function, including social cognition (7, 8). Social cognition, which encompasses aspects of facial social perception, facial emotion recognition, and theory of mind (ToM), involves the cognitive processes of perceiving, interpreting, and reacting to social information (9, 10). Impaired social cognition can impact one’s performance in routine activities and employment, leading to lower life satisfaction (11–13). However, the subtlety of psychosocial deficits is less apparent than overt psychotic symptoms, complicating early detection and intervention (14). Also, the effectiveness of conventional approaches, such as psychotherapy, social cognition, and interpersonal skills training, was found to be limited by trajectories in psychosis and clinicians’ subjectivity (15). These highlight an urge for innovative, precise, and scalable approaches to manage psychosocial impairments in psychosis.
Over the past decade, Artificial Intelligence (AI) has emerged as a transformative tool in healthcare. Research & Markets (16) projects a remarkable compound annual growth rate (CAGR) of 49.1% for AI in the healthcare market over the next five years, signaling a paradigmatic shift in how medical care will be delivered and offered. AI is defined as a branch of computer science that centers around developing algorithms and systems to replicate human cognitive activities, such as problem-solving, reasoning and learning (17). Common subfields include machine learning (ML), deep learning (DL), natural language processing (NLP), and reinforcement learning (RL) (18). ML, DL, and NLP are capable of analyzing large datasets to determine patterns and characteristics, while RL excels at learning optimal behaviors based on trial and error (19–21).
In the realm of mental health, a growing number of research studies are highlighting the emerging role of AI. For instance, ML has shown potential in forecasting the risk of relapse and violent behavior by analyzing behavioral features (22, 23). NLP was used to evaluate the mental states and functioning of patients by analyzing their patterns in daily life and interpersonal interactions (24–26). Furthermore, AI has contributed to more tailored and interactive interventions, like psychological support and psychoeducation delivered through chatbots (27–29). Given these capabilities, there is a substantial potential for AI to facilitate the diagnosis, monitoring, and treatment of psychosocial functioning in psychosis, offering objective and efficient solutions that traditional methods fall short of.
Although AI has been widely utilized in psychosis research, its application to psychosocial functioning remains largely uncharted. Existing literature primarily focuses on the diagnosis and detection of psychotic disorders. For example, a systematic review examined the current evidence on applying AI techniques for detection and classification in schizophrenia (30). On the other hand, some researchers investigated the use of ML to analyze structured data for diagnosis (31, 32). Additionally, several studies have explored the use of computer vision models to detect biomarkers and forecast progression in brain imaging analysis for the psychosis population (33–35). Whilst some empirical evidence suggested the use of AI during the stage of rehabilitation in the schizophrenia population, most studies concentrated on the prediction of medication adherence and relapse (36). After gathering and reviewing the current literature, a knowledge gap was identified in the application of AI to psychosocial function in psychosis. Given that psychosocial functioning is a fundamental indicator of illness course, therapeutic outcomes and recovery in psychosis, deficits in this domain will not only hinder one’s prognosis, but also impose a substantial burden on healthcare systems (11, 37, 38). Despite the critical need for advancement in managing psychosocial deficits in psychosis, no systematic review has been done to synthesize and consolidate evidence on this aspect. This research gap accentuates a vital opportunity to explore how AI can contribute to patients’ psychosocial recovery.
2 Aims of the review
The aim of this systematic review and meta-analysis is to comprehensively identify, summarize and evaluate the current evidence regarding the application of AI to psychosocial functioning in psychosis. following research questions:
● How are AI techniques applied, and which psychosocial domains are targeted?
● What is the performance of these AI techniques?
● What are the potential benefits and limitations of clinical integration?
● What are the key areas for further research?
3 Methodology
3.1 Search strategy
This systematic review and meta-analysis is registered with PROSPERO (CRD420251051952). A literature search was conducted in three electronic databases: PubMed, Scopus, and ACM Digital Library. The systematic review was in accordance with the PRISMA guidelines. The timeframe of published articles was limited to the last 15 years, between January 2010 and March 2025. The search criteria included the following keywords: (Artificial intelligence OR AI OR Machine learning OR Natural language processing OR Deep Learning OR Neural network OR Chatbot OR Reinforcement Learning) AND (Psychosis OR Schizo*) AND (Psychosocial function OR Social Function OR Social cognition OR Cognitive remediation OR Cognitive function OR Functional outcome OR Quality of life OR Vocational skill OR Psychotherapy). The search strings used were detailed in Supplementary Data (Table 1). In the initial screening, duplicate studies were eliminated. The titles of remaining studies were then reviewed to identify relevant studies. Subsequently, the abstracts and full texts were assessed for the eligibility of the studies.
3.2 Inclusion and exclusion criteria
Studies were included if they applied AI techniques in psychosocial function; included participants with schizophrenia, schizoaffective disorder, or other psychotic disorders. Studies were excluded if they were not available in English, not apply AI technologies, not involve individuals with psychosis, not focus on psychosocial function, no full text available, or they were non-empirical studies such as review articles and commentaries.
3.3 Data extraction
The relevant information was extracted from each study and summarized into Table 1. The systematically extracted information included authors, country of origin, year of publication, sample size, outcome measures, AI techniques applied, and key findings. The process was carried out independently by two reviewers. To ensure consistency between reviewers, the review process was iteratively refined until a Cohen’s kappa coefficient (κ) of at least 0.85 was achieved.
3.4 Quality assessment
To ensure methodological rigor aligned with study design, a tailored quality assessment approach was adopted. The Newcastle-Ottawa Scale (NOS) and its adapted version were used to appraise observational cohort and cross-sectional studies respectively. The Cochrane Risk of Bias Tool (RoB 2.0) was applied to the randomized controlled trial (RCT). For non-observational predictive modeling studies, the PROBAST (Prediction Model Risk of Bias Assessment Tool) was employed. This strategy ensures that each study was evaluated using the most appropriate and context-relevant criteria. The Grading of Recommendations Assessment, Development, and Evaluation (GRADE) was used to evaluate the quality of the collected evidence. The quality ratings were conducted independently by two reviewers.
3.5 Statistical analysis
Performance metrics from the studies included in the analysis were extracted for quantitative synthesis. Only metrics that appeared in at least two independent studies were included in the meta-analysis. Metric values from external validation of the studies were prioritized. The evaluation of potential heterogeneity among studies was conducted using Cochran’s Q test, with results expressed as the I² statistic. A random-effects model was used to compute pooled estimates and their 95% confidence intervals. To investigate the sources of heterogeneity, subgroup analyses were performed. The robustness of the pooled estimates was assessed through sensitivity analyses by adopting the leave-one-out (LOO) approach. Publication bias was assessed using funnel plots and Egger’s regression tests, which was only applied when there were at least ten independent estimates to maintain statistical validity. Funnel plots were examined descriptively to identify potential asymmetry. All statistical analyses were carried out using R software (Version 4.5.0).
4 Result
4.1 Study selection & characteristics
From the initial literature search, studies were identified. Once duplicates were removed, 711 studies remained that were deemed potentially relevant. After screening titles and abstracts, 690 studies were excluded because they did not meet the inclusion criteria. Subsequently, 37 studies were included after full-text analysis. A total of 14 studies that fully met the inclusion criteria were selected in this review. Of these, 9 studies were conducted in Western countries and 5 studies were conducted in Asian countries. Figure 1 shows the PRISMA flow diagram of the search. It was noted that 3 papers were authored by conducted by a research team from Taiwan. All of which utilized the same group of subjects but different topic focuses and data types.
4.2 Characteristics of AI methodology
The included studies applied a diverse range of AI techniques, namely supervised ML, unsupervised ML, causal ML, RL, DL, AutoML, and NLP (Figure 2). While one study used both unsupervised and supervised ML, another one combined RL, DL, and NLP in its AI system.
Supervised ML was predominantly utilized within the psychosis and schizophrenia population (39–49). Random forest (RF) was the most frequently utilized model (42, 46, 48, 49). The following widely used model was support vector machines (SVM), which were employed in three studies (41, 47, 48). Other commonly adopted supervised models included linear regression (44, 48), bagging ensemble (43, 44) and boosting techniques – AdaBoost (39) and XGBoost (48). Regularized regression models, including elastic net (45), Least Absolute Shrinkage and Selection Operator (LASSO) (42), neural networks (39), k-nearest neighbors (k-NN) (46), and Bayesian networks (40) were applied in individual studies as alternative supervised models. Feature selection techniques were incorporated in three studies to enhance model performance by selecting the most predictive variables, including M5 Prime (43, 44), LASSO (42), and Recursive Feature Elimination (RFE) (41).
Unsupervised ML with the use of latent class growth analysis (49), AutoML with Tree-based Pipeline Optimization Tool (TPOT) (50), and causal ML with Greedy Fast Causal Inference (GFCI) algorithm (51) were employed in single studies. One study combined RL, DL, and NLP, utilizing algorithms such as Deep Deterministic Policy Gradients (DDPG), Twin Delayed DDPG (TD3), and Batch Constrained Q-Learning (BCQ), along with Doc2Vec, to develop its AI system (52).
4.3 Application of AI on psychosocial domains
While 9 studies focused on applying AI techniques to specific psychosocial aspects, 5 studies covered more than one domain in their application. Three main psychosocial domains were identified: social and occupational functioning, social cognition, and QoL.
4.3.1 Social and occupational functioning
Social and occupational functioning was the most researched domain (n=10), in which most studies considered functional levels and employment status as indicators of overall social recovery. Among the 10 studies, 4 studies used the Global Assessment of Functioning (GAF) as the outcome measurement tool. Ensemble learning methods, specifically RF with feature selection and bagging, were found to be the most outstanding supervised ML models in predicting outcomes, highlighting their high predictive ability with low error rates (42, 43).
4.3.2 Social cognition
4 studies focused on social cognition (39, 40, 49, 50), with one specifically investigating the ToM within the schizophrenia population (40). By embedding Bayesian networks in supervised ML, it demonstrated an outstanding performance with high accuracy in differentiating individuals with schizophrenia from healthy participants purely based on their ToM abilities on T.h.o.m.a.s, exploring hierarchical relationships among ToM facets, and identifying the most pronounced ToM deficits within schizophrenia (40). To serve a similar purpose, Badal et al. (39) achieved good to excellent ability in differentiation based on two emotion recognition tasks (BLERT & ER40), by using Gini Importance – a feature selection method in supervised ML. The use of AI techniques in classification and understanding of the complexity of social cognition is therefore emphasized.
4.3.3 Quality of life
All 4 studies used supervised ML for the prediction of QoL (43–46). Bagging ensemble with feature selection again was the highest performing AI method, with high accuracy in forecasting QoL in individual with schizophrenia based on cognitive abilities and clinical symptoms (43). All 4 studies employed dedicated QoL measures, in which QLS was frequently used (43, 44, 46).
4.4 Performance of AI applications
Across the studies, the application of AI can be categorized into three main areas: predicting psychosocial outcomes, mechanistic exploration of psychosocial function in psychosis, and providing therapeutic support in psychosocial interventions. Various metrics were utilized to assess the model’s performance. Table 2 describes the metrics adopted and the performance range among included studies.
4.4.1 Prediction of psychosocial outcomes
11 out of 14 studies focused on predicting psychosocial outcomes, including social functioning, QoL, social cognition, and global functioning (39, 41–50). Various data types, including clinical, genetic, cognitive, and behavioral data, were utilized to develop AI models with varying predictive success. Some studies demonstrated robust performance; for example, Badal et al. (39) achieved F1 scores of 0.74–0.81 in predicting social and social cognitive functioning. Similarly, Li et al. (42) reported an AUC of 0.867 and an accuracy of 79.5% in forecasting social functional improvement. Furthermore, Lin et al. (43) showcased the lowest RMSE of 6.43 when using RF to predict QoL.
However, the performance was modest in other studies. Leighton et al. (45) reported AUCs ranging from 0.703 to 0.736 for forecasting outcomes like symptom remission and vocational recovery. Walter et al. (47) reported a BAC of 59.3% for social functioning, with a 19.6% decline from the training to the testing phase. Likewise, Wang et al. (48) reported MAE values of 2.17-2.79, indicating approximately 10% of prediction error in predicting the score of the Social Functioning Scale. Moreover, low explanatory power in the variance of social cognition was reflected with a PVE of 0.2 (50).
4.4.2 Mechanistic exploration of psychosocial function
Two studies employed AI to explore the underlying mechanisms of psychosocial functioning. Miley et al. (51) yielded strong effect sizes (0.77–1.5) in examining the interplay between motivation, social affective capacity, and functional outcomes in early schizophrenia; however, they reported suboptimal model fit (CFI, 0.884, RMSEA, 0.066). While Bosco et al. (40) achieved an accuracy of 86.4% in differentiating patients from controls, they also provided insights into the hierarchical structure of ToM facets. Similarly, Badal et al. (39) achieved AUCs of 0.83–0.86 and F1 scores of 0.81–0.85 in differentiating between individuals with schizophrenia and healthy controls using facial affect recognition data, demonstrating good to excellent performance.
4.4.3 Therapeutic support in psychosocial interventions
Lin et al. (52) examined the role of AI in supporting psychosocial interventions. It employed real-time recommendation systems during psychotherapy sessions. This model attained a moderate accuracy of 64% relative to human therapist decisions, indicating a promising yet underdeveloped application.
4.4.4 External validation
Of the four studies that conducted external validation to assess model generalizability beyond their training datasets, a decline in performance was consistently observed (41, 45, 47, 51). Potential overfitting to training data was found to be a common challenge. The magnitude of the performance drop varied: de Nijs et al. (41) showed a minimal accuracy decrease of 2.3%, indicating satisfactory generalizability in the prediction model for three- and six-year psychosocial outcomes, while Walter et al. (47) reported a substantial reduction in balanced accuracy from 78.9% to 59.3%, suggesting limited applicability to new populations. Outcome-specific trends were observed, with vocational recovery maintaining strong predictive power across datasets, whereas quality of life exhibited weaker external performance (45).
4.5 Meta-analysis of performance metrics – AUC
4.5.1 Pooled AUC
12 AUC estimates from 4 independent studies were subjected to a random-effects meta-analysis (Figure 3). Each AUC estimate reported the predictive performance of AI models for psychosocial functioning in psychosis. The pooled AUC was 0.70, with the 95% Confidence Interval (CI) at 0.63-0.76, indicating moderate discriminative performance. Between-study heterogeneity was extremely high (I², 92.2%), reflecting substantial variability across outcome domains, sample sizes, and modelling approaches.
4.5.2 Subgroup analysis of AUC
Subgroup analyses revealed systematic differences across outcome types (Figure 4). Models predicting functional outcomes (k=10) yielded a pooled AUC of 0.68 (95% CI: 0.60–0.77; I², 93%), while those applied to social cognition (k=2) achieved higher performance (0.77, 95% CI: 0.69–0.85; I², 83%). A test of subgroup differences using Cochran’s Q statistic indicated a statistically significant difference between functional outcome models and social cognitive outcome models (Q(between), 9.17, degrees of freedom, 1, p, 0.003). It suggested that model performance depended strongly on the outcome domain, with higher discriminative accuracy for facial emotion recognition and social cognition than for functional outcomes such as vocational and social skills.
4.5.3 Sensitivity analysis of AUC
LOO sensitivity analysis demonstrated that sequentially excluding each study produced stable pooled AUC estimates ranging from 0.68 to 0.71, and all Cis overlapped with the overall pooled estimate (AUC, 0.70, 95% CI: 0.63–0.76). Although slight changes in heterogeneity were observed (I² reduced from 92% to approximately 90%), the variation remained statistically significant (p < 0.001). By specifically excluding the only study with a high risk of bias, the pooled AUC increased slightly to 0.71 (95% CI: 0.64–0.78), and heterogeneity decreased marginally (I², 91%, τ², 0.010, p < 0.001). These results showed that the overall findings were robust and not driven by the high-risk study, with the observed variability likely reflecting methodological and outcome-related differences across studies rather than bias from any individual source.
4.6 Meta-analysis of performance metrics - RMSE
4.6.1 Pooled RMSE
A random-effects meta-analysis was performed by aggregating six RMSE estimates from two studies that focused on the same cohort in Taiwan (n=302) (Figure 5). Because these estimates were derived from overlapping samples, the results were interpreted cautiously, acknowledging the potential for double-counting. The observed variability in RMSE values was therefore more plausibly attributed to differences in modeling strategies, predictor sets, and analytical approaches rather than to actual differences within the study population. The pooled RMSE was 8.15 (95% CI: 7.32–8.98), suggesting moderate predictive performance. Substantial heterogeneity was observed (I², 92.2%, τ², 0.012, p < 0.001).
4.6.2 Subgroup analysis of RMSE
Subgroup analyses were conducted based on predictor domains (Figure 6). When stratified by predictor domain, clinical symptom-based models produced the lowest pooled RMSE of 7.10 (95% CI: 5.78–8.43), followed by cognitive models at 8.17 (95% CI: 7.36–8.99). Conversely, genetic predictor models yielded the highest RMSE of 9.19 (95% CI: 7.96–10.41), indicating the highest error.
Figure 6. Subgroup meta-analysis of pooled RMSE by predictor domains (Genetic, Clinical, Cognitive).
Heterogeneity within these subgroups was lower than that observed in the overall pooled analysis (I², 67.7–86% vs. 92.2%), suggesting that stratifying by predictor domain partially accounted for between-model variability. The remaining heterogeneity likely reflected differences in modeling approaches, predictor characteristics, and methods of feature selection rather than random errors. Since all estimates were derived from the same cohort, the residual variability represented within-sample differences in model performance rather than actual inconsistencies between studies. Overall, these results indicated that models based on clinical symptom features exhibited superior predictive accuracy, whereas those relying solely on genetic data performed less effectively for modeling psychosocial functioning in this cohort.
4.6.3 Sensitivity analysis of RMSE
LOO sensitivity analysis showed that the pooled RMSE remained stable after sequentially excluding each estimate. The pooled values ranged narrowly from 7.86 to 8.48, and all CIs overlapped with the overall estimate, indicating that no single model substantially influenced the pooled estimate.
4.7 Publication bias
To evaluate potential publication bias, funnel plots for both AUC and RMSE were analyzed. The funnel plot for AUC (Figure 7) displayed slight asymmetry; however, Egger’s test results (p, 0.08) did not demonstrate significant small-study effects, indicating a low likelihood of publication bias. Given the restricted number of independent studies and the high heterogeneity (I² > 90%), the observed asymmetry was more likely due to methodological and clinical diversity rather than selective reporting. Consequently, the funnel plot should be interpreted descriptively rather than as definitive evidence of bias.
For RMSE, Egger’s regression test was not conducted because only six estimates were available, all sourced from the same Taiwanese cohort, which violated the assumption of statistical independence. A descriptive funnel plot (Figure 8) was created to illustrate the distribution of RMSE values in relation to their standard errors. This plot exhibited a generally symmetrical pattern around the pooled estimate, with no clear signs of bias. However, since all estimates were derived from the same dataset, this apparent symmetry should not be considered conclusive evidence against publication bias. Instead, the funnel plot served as a visual summary of estimate precision and variability, indicating that the observed differences are likely due to methodological variations across models rather than selective reporting.
4.8 Quality of evidence
4.8.1 Quality assessment
The majority of included studies demonstrated low risk of bias, suggesting good methodological quality and overall robustness of the synthesized findings. Two studies (45, 48) were rated as fair quality, reflecting some concerns, primarily regarding representativeness and sample size. The single RCT (47) showed a high overall risk of bias due to missing outcome data, whereas the prediction model study (52) was rated as low risk across all PROBAST domains. Figures 9, 10, Supplementary Data (Table 2) illustrated the results of the quality assessment.
4.8.2 GRADE
For meta-analyzed performance metrics (AUC and RMSE), the overall evidence was evaluated using the GRADE approach. Certainty of evidence was assessed using GRADE across five domains: risk of bias, inconsistency, indirectness, imprecision, and publication bias. Because the included studies were predominantly observational, the starting level was rated as ‘low’. The certainty for AUC was downgraded for inconsistency (I², 92%) and indirectness (heterogeneous outcomes and methods), while RMSE was further downgraded for imprecision due to reliance on a single cohort, resulting in overall low and very-low certainty ratings, respectively. Supplementary Data (Table 3) provided the corresponding GRADE summary of findings table.
5 Discussion
5.1 Principal findings
To our knowledge, this is the first systematic review and meta-analysis focusing on the application of AI in psychosocial function in psychosis. The 14 eligible studies highlighted the diverse use of AI techniques, predominantly using supervised ML, alongside unsupervised ML, causal ML, DRL, and AutoML. Our findings from this review aligned with another systematic review on AI targeting the schizophrenia population by Yang et al. (36) which also identified supervised ML as the most prevalent approach. Our findings were also coherent with the projection, which anticipated that ML will become the major share of the AI market in healthcare (16). However, compared with the broader application of AI in mental health, a more balanced spread across diverse technologies was evident, such as conversational AI for therapeutic support, computer vision for emotion recognition, and AI-enhanced virtual reality for immersive therapy in psychiatric disorders (53). This contrast implied that the utilization of AI in psychosis remains in its early stages, with a limited scope that primarily emphasizes ML over the broader range of AI innovations seen in other mental health domains. Three key areas of application in psychosocial function were identified, including outcome prediction, mechanistic exploration, and therapeutic support, with prediction being the most common primary task undertaken. Psychosocial domains, including social functioning, occupational functioning, social cognition, and QoL were covered.
The findings from our meta-analyses, which were based on the most commonly adopted performance metrics, indicated moderate discriminative and predictive accuracies of AI models in addressing psychosocial functioning in psychosis. Notably, subgroup analyses indicated superior performance in predicting social cognition than broader functional abilities. Also, models using psychiatric symptoms as the predictor was found to have highest predictive accuracy when compared with models with using cognitive function and genetic factors as predictors. The heterogeneity (I², 67.7% to 93%) across meta-analyses underscored multifaceted factors including varying sample sizes, divergent outcome domains, disparate modelling approach. Rather than reflecting random error or systematic bias from individual studies, this variability likely stemmed from methodological diversity and non-standardized analytic practices across models. These findings emphasized the significance of adopting standardized modeling protocols, integrating multimodal data, and including diverse cohorts to enhance the robustness and generalizability of AI-based psychosocial care in psychosis.
5.2 Integration of AI into clinical practice
5.2.1 Early identification of at-risk populations
Predictive models, particularly ensemble learning methods, demonstrated the ability to predict social and occupational function in psychosis accurately (42, 44). Instead of relying on a single model, ensemble methods train several models whose predictions were combined to improve overall accuracy and robustness. In this review, the RF and bagging ensemble specifically demonstrated outstanding predictive ability in handling complex datasets with non-linear relationships, such as those related to social functioning. Moreover, it was found that both models utilized feature selection methods, such as LASSO and M5 Prime, which excelled at selecting impactful features, thereby further enhancing the accuracy and reliability of the results. The predictive modeling showed promise in identifying individuals at risk of poor social functioning and social cognition, enabling early interventions such as social cognition and interaction training (39, 42). The finding was parallel to that of Rotenberg et al. (54), which also found that the RF model outperformed other AI approaches in clinical prediction. Nonetheless, two reviews found no superiority of AI approaches in predicting clinical outcomes (55, 56). The inconsistencies in the findings regarding the superiority of AI approaches highlighted the necessity for further research, particularly in evaluating and contrasting various models to optimize model selection in clinical applications.
Apart from facilitating the early identification of psychosocial outcomes, the excellent discriminatory ability between schizophrenia and healthy controls, as indicated by evaluation of accuracy, confidence ratings, and reaction times on facial emotion recognition tasks, suggested potential in detecting schizophrenia (51). According to Lai et al. (30), the majority of papers focused on using brain imaging data from MRI scans, PET scans, and EEG for AI-augmented detection and classification. The higher accessibility and cost-effectiveness of obtaining behavioral data compared to brain imaging techniques may broaden their applicability for the widespread detection of at-risk populations. However, given the consistent findings of high predictive ability with brain imaging data (42, 57–61), it may serve as a complementary role with neuroimaging, offering a more holistic understanding of schizophrenia.
5.2.2 Data-driven assessment & continuous monitoring
Moreover, AI holds significant promise for enabling objective, data-driven, and continuous assessment monitoring, which is particularly valuable given the fluctuating nature of psychotic symptoms and the critical need for timely interventions. Traditional clinical assessments, such as face-to-face interviews or self-reported questionnaires, which are often conducted periodically and rely on subjective reporting, may be prone to bias and fail to capture subtle or rapid changes in a patient’s psychosocial functioning (62–64). With emerging AI-driven approaches, it leveraged technologies such as mobile sensing and speech feature analysis to provide real-time, quantifiable insights (46, 48). In our review, it was found that these innovative methods offer a transformative shift from conventional practices by harnessing multi-modal data to track behavioral patterns and vocal characteristics in schizophrenia. This aligned with the growing field of digital phenotyping, where passive data from smartphones and wearables is increasingly utilized to infer mental health states, as seen in studies exploring depression and anxiety (65). However, both methods revealed limitations in generalizability—mobile sensing struggled to predict outcomes for new patients without historical data (48). At the same time, speech analysis encountered difficulties with patients whose QoL scores deviated significantly from the norm, particularly those with severe symptoms (46). This suggested that while AI can enhance monitoring, it may require personalization or larger, more diverse datasets to achieve robust performance across varied populations.
5.2.3 Personalized intervention
Apart from the wide range of AI techniques and models, this review highlighted the potential of partnerships between AI methods and traditional statistical techniques. Greedy Fast Causal Inference (GFCI) and Structural Equation Model (SEM) were combined to reveal hierarchical and causal insights, identifying social affective capacity and motivation as high-impact treatment targets (51). AI models excel at processing large, high-dimensional datasets and identifying patterns, but they often lack interpretability, earning them the “black box” label. Statistical models, designed for causal inference, complement AI by providing rigorous, interpretable results that can validate cause-and-effect relationships. This combination was supported by (66), who found that integrating the two approaches yields complementary biologically meaningful conclusions.
Similarly, techniques like the Bayesian network identified a hierarchical causal chain in ToM ability, where first-order first-person ToM is the foundational ToM skill that influences performance on more complex ToM tasks (40). In the schizophrenia population group, second-order ToM and Allocentric third-person ToM are more pronounced impairments. Clinically, this implied that interventions should prioritize strengthening foundational ToM skills, such as self-reflection and awareness of one’s mental states, before progressing to more complex social cognitive tasks. Overall, the mechanistic analyzes of socio-affective capacity and motivation’s influence on functional outcomes and hierarchical ToM structures may provide clinicians with new direction for personalized treatment approaches (40, 51).
5.2.4 Real-time recommendation systems
Deep reinforcement learning was employed to suggest conversation topics and therapeutic techniques by analyzing patient data during live sessions and providing therapists with tailored suggestions (52). This is especially beneficial for complex mental health conditions like psychosis, where individualized, context-sensitive care is significant (67). Surprisingly, none of the studies in this review utilized generative AI, commonly known as chatbots in the psychosocial treatment of psychosis, as Olawade et al. (68) emphasized the emerging trend of AI therapists in chatbots, for example, Woebot, Wysa, and BetterHelp in delivering psychotherapy like cognitive behavioral therapy without human involvement for anxiety and depression. However, Alliende et al. (69) raised concerns about the application of chatbots to psychosis due to inherent risks and limitations. It was found that the current common generative AI models, including chatbots like ChatGPT and Gemini, were trained on datasets that often reflect societal biases and stigmatizing views towards individuals with schizophrenia, potentially reinforcing misconceptions about violence or social distance.
Additionally, Arbanas et al. (70) found that patients with mental disorders, including psychosis, reported significantly lower satisfaction with chatbot responses compared to human therapists. Moreover, it revealed that about 50% of the participants found it more comfortable to communicate with humans than chatbots, with none perceiving chatbots as more professional or knowledgeable. This highlighted the irreplaceable role of human empathy and trust, particularly in psychosis, where therapeutic rapport is foundational. In contrast, real-time recommendation systems offered a promising alternative by equipping human therapists with AI-driven insights, preserving the human-led therapeutic process while enhancing treatment delivery. These systems could provide data-driven suggestions tailored to the patient’s needs, potentially improving outcomes without losing the human connection. However, their clinical applicability remained limited, as Lin et al. (52) reported only moderate accuracy in current systems, underscoring the need for further refinement of the AI system.
5.3 Implications for future AI applications & research directions
While the application of AI demonstrated potential in psychosocial functioning for psychosis, several methodological limitations have to be acknowledged to inform future research and clinical practice.
First, the generalizability of findings may be limited by specific datasets, small sample size, and consent biases. For example, five studies’ samples were particular to the Asian population, which may not represent the diversity of the global psychosis population, and vice versa. Small and varied sample sizes, ranging from 18 to 1027, affected the generalizability of findings. Additionally, consent biases may arise due to the 49% participation rate in the dataset of Leighton et al. (45), which may differ systematically from that of non-participants. Future recommendations on replicating studies in varied ethnic groups and larger, diverse cohorts may improve generalizability and reduce biased results.
Second, the quality and availability of data may affect the accuracy and reliability of AI models, particularly in terms of psychosocial function. Although the vast majority of studies utilized prospective datasets, four relied on retrospective data, which may pose a higher risk of recall and selection biases. Moreover, the only RCT included in the studies — Wang et al. (48) — was assessed as having a high risk of bias, primarily due to missing outcome data, which raises concerns about the completeness and reliability of the findings. Additionally, the predominance of cross-sectional data limited the ability to evaluate longitudinal changes, which is crucial for understanding the evolving nature of psychosocial function. To overcome these challenges, future research should prioritize prospective data collection and longitudinal analysis, which can facilitate the capture of the dynamic nature of psychosocial function in psychosis, ultimately enhancing their clinical utility and broader applicability.
Third, the design of models may impose additional constraints. Most of the models designed were static, lacking post-baseline updates or biomarkers, which limited their adaptability to individual patient trajectories. Models among studies often required inputs of specific assessment baseline scores, which may not be universally available across clinical settings. The reliance on proxy measures, such as socio-affective capacity derived from QLS items, and the untested construct validity may compromise predictive accuracy. The development of dynamic models that incorporate a broader range of variables, including additional biomarkers and standardized outcome measures, may facilitate future clinical integration.
Lastly, the risk of overfitting in ML is of concern. Given that studies predominantly employed ML as the primary approach, less than one-third externally validated their findings. The low certainty of the evidence identified using GRADE and the high risk of bias in the included RCT were primarily due to limited external validation. Also, none of the studies compared AI predictive modeling with existing clinical assessments; thus, the clinical utility of AI models remains uncertain. Future research should prioritize external validation to confirm the predictive power of these models. For future clinical integration, a comparison with traditional clinical tools is necessary to assess feasibility and effectiveness.
5.4 Ethical considerations
Considerations on ethics and privacy in applying AI techniques in the mental health clinical practice should be proactively managed. The included AI-augmented studies, especially the approaches of continuous sensing and real-time data processing, pose a high risk to data security and patient autonomy. However, there was a lack of studies detailing the strategies for managing patient consent or mitigating clinical errors. While anonymization was noted, the absence of specifics on data collection processes or representativeness across cultural, linguistic, or socioeconomic contexts raises ethical concerns about fairness and bias. Existing literature was consistent with these concerns: Fisher (71) discussed data security and bias in AI for psychiatry, Saeidnia et al. (72) highlighted privacy, consent, and fairness in mental health AI, and Gooding et al. (73) noted that only 15.2% of studies address ethical implications, with minimal focus on consent. Future research should address the above concerns to ensure equitable benefits without compromising patient welfare.
5.5 Limitations
This systematic review and meta-analysis presented certain limitations. Firstly, the relatively small number of eligible studies restricted the breadth of insights into AI applications for psychosocial functioning in psychosis, particularly given the field’s early developmental stage. This scarcity was compounded in the meta-analysis, where only partial independent studies contributed to the pooled performance metrics, thereby limiting generalizability and preventing reliable evaluation of publication bias. Secondly, substantial heterogeneity across the studies reflected variability in outcome domains, sample sizes, modeling strategies, and predictor sets such as genetic versus clinical features, limiting a comprehensive comparability of results. The overlapping cohorts involved in the meta-analyses also introduced risks of inflated precision and reduced applicability. Lastly, although the majority of included studies were considered to have good methodological rigor, two observational studies and one RCT were evaluated as of moderate quality and a high risk of bias, which may compromise the validity of the overall findings; however, sensitivity analyses confirmed that these biases did not unduly influence the results.
6 Conclusion
This systematic review and meta-analysis demonstrated the growing application of AI, particularly supervised ML, in addressing psychosocial functioning in psychosis. The included studies, largely of good methodological quality, demonstrated moderate overall performance in predicting outcomes, exploring underlying mechanisms, and providing therapeutic support across various domains, including social functioning, occupational functioning, social cognition, and QoL. Our findings suggested that clinical applications of AI could prioritize social cognition domains and leverage clinical symptoms for enhanced precision in early identification, personalized interventions and data-driven monitoring. Methodological advances, including the ensemble learning with feature selection, tailored models incorporating patient-specific data rather than one-size-fit-all models, and probabilistic models for handling heterogeneous data were also discovered. Overall, the remarkable potential of AI to enhance psychosocial care in psychosis was underscored. Nevertheless, the field remains in its early stages, with critical limitations on methodological consistency, data quality, model design, and ethical issues being highlighted in this review. These challenges necessitate careful consideration when applying AI in clinical practice. Given the dynamic and complex nature of psychosocial functioning in psychosis, AI should not be considered as replacing clinical expertise but rather complementing it. Human elements, including clinicians’ judgment and empathy, remain essential for delivering comprehensive psychosocial care, thereby fostering a collaborative relationship between AI technology and human insight. Future research is recommended to focus on bridging existing gaps and realizing AI’s full potential in promoting effective and ethical psychosocial care for individuals with psychosis.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.
Author contributions
CM: Writing – review & editing, Writing – original draft, Conceptualization, Methodology, Investigation. CC: Supervision, Methodology, Writing – review & editing. MC: Writing – review & editing, Supervision.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2025.1692177/full#supplementary-material
References
1. American Psychiatric Association. Diagnostic and statistical manual of mental disorders. 5th ed. Arlington: American Psychiatric Publishing (2013).
2. Dahuja S, Singh U, and Beniwal RP. Assessment of social functioning, quality of life, and social support in persons with schizophrenia: A gender difference. Arch Ment Health. (2022) 23:23–9. doi: 10.4103/amh.amh_81_21
3. Green MF. What are the functional consequences of neurocognitive deficits in schizophrenia? Am J Psychiatry. (1996) 153:321–30. doi: 10.1176/ajp.153.3.321
4. Lam RW, Filteau MJ, and Milev R. Clinical effectiveness: the importance of psychosocial functioning outcomes. J Affect Disord. (2011) 132:S9–13. doi: 10.1016/j.jad.2011.03.046
5. Suman A, Nehra R, Sahoo S, and Grover S. Prevalence of loneliness and its correlates among patients with schizophrenia. Int J Soc Psychiatry. (2023) 69:906–15. doi: 10.1177/00207640221141646
6. Velthorst E, Fett AKJ, Reichenberg A, Perlman G, van Os J, Bromet EJ, et al. The 20-year longitudinal trajectories of social functioning in individuals with psychotic disorders. Am J Psychiatry. (2017) 174:1075–85. doi: 10.1176/appi.ajp.2016.15111419
7. Corbera S, Kurtz MM, Achim AM, Agostoni G, Amado I, Assaf M, et al. International perspective on social cognition in schizophrenia: current stage and the next steps. Eur Psychiatry. (2025) 68:e9. doi: 10.1192/j.eurpsy.2024.1776
8. Mancuso F, Horan WP, Kern RS, and Green MF. Social cognition in psychosis: multidimensional structure, clinical correlates, and relationship with functional outcome. Schizophr Res. (2011) 125:143–51. doi: 10.1016/j.schres.2010.11.007
9. Green MF, Horan WP, and Lee J. Social cognition in schizophrenia. Nat Rev Neurosci. (2015) 16:620–31. doi: 10.1038/nrn4005
10. Uchino T, Okubo R, Takubo Y, Aoki A, Wada I, Hashimoto N, et al. Perceptions of and subjective difficulties with social cognition in schizophrenia from an internet survey: Knowledge, clinical experiences, and awareness of association with social functioning. Psychiatry Clin Neurosci. (2022) 76:429–36. doi: 10.1111/pcn.13435
11. Fett AKJ, Viechtbauer W, Dominguez MG, Penn DL, van Os J, and Krabbendam L. The relationship between neurocognition and social cognition with functional outcomes in schizophrenia: a meta-analysis. Neurosci Biobehav Rev. (2011) 35:573–88. doi: 10.1016/j.neubiorev.2010.07.001
12. Giordano GM, Pezzella P, Mucci A, Austin SF, Erfurth A, Glenthøj B, et al. Negative symptoms and social cognition as mediators of the relationship between neurocognition and functional outcome in schizophrenia. Front Psychiatry. (2024) 15:1333711. doi: 10.3389/fpsyt.2024.1333711
13. Maat A, Fett AK, and Derks E. GROUP investigators. Soc Cogn Qual Life schizophrenia. Schizophr Res. (2012) 137:212–8. doi: 10.1016/j.schres.2012.02.017
14. Griffiths SL, Wood SJ, and Birchwood M. Vulnerability to psychosocial disability in psychosis. Epidemiol Psychiatr Sci. (2019) 28:140–5. doi: 10.1017/S2045796018000495
15. Minichino A, Davies C, Karpenko O, Christodoulou N, Ramalho R, Nandha S, et al. Preventing psychosis in people at clinical high risk: an updated meta-analysis by the World Psychiatric Association Preventive Psychiatry section. Mol Psychiatry. (2025) 30:2773–82. doi: 10.1038/s41380-025-02902-8
16. Research and Markets. AI in healthcare market size, share, and trends analysis report by component (hardware, services), application, end-use, technology, region (North America, Europe, APAC, Latin America, MEA) with growth forecasts, 2025-2030 (2025). Dublin: Research and Markets. Available online at: https://www.researchandmarkets.com/reports/5351653/ai-in-healthcare-market-size-share-and-trends (Accessed August 23, 2025).
17. McCarthy J. From here to human-level AI. Artif Intell. (2007) 171:1174–82. doi: 10.1016/j.artint.2007.10.009
18. Samoili S, Lopez Cobo M, Delipetrev B, Martinez-Plumed F, Gomez Gutierrez E, and De Prato G. AI Watch: Defining artificial intelligence 2.0. EUR 30873 EN. Luxembourg: Publications Office of the European Union (2021). doi: 10.2760/019901
19. Daniel R, Radulescu A, and Niv Y. Intact reinforcement learning but impaired attentional control during multidimensional probabilistic learning in older adults. J Neurosci. (2020) 40:1084–96. doi: 10.1523/JNEUROSCI.0254-19.2019
20. Deneault A, Dumais A, Désilets M, and Hudon A. Natural language processing and schizophrenia: A scoping review of uses and challenges. J Pers Med. (2024) 14:744. doi: 10.3390/jpm14070744
21. Razzaq K and Shah M. Machine learning and deep learning paradigms: From techniques to practical applications and research frontiers. Computers. (2025) 14:93. doi: 10.3390/computers14030093
22. Solanes A, Mezquida G, Janssen J, Amoretti S, Lobo A, González-Pinto A, et al. Combining MRI and clinical data to detect high relapse risk after the first episode of psychosis. Schizophr. (2022) 8:100. doi: 10.1038/s41537-022-00309-w
23. Wang KZ, Bani-Fatemi A, Adanty C, Harripaul R, Griffiths J, Kolla N, et al. Prediction of physical violence in schizophrenia with machine learning algorithms. Psychiatry Res. (2020) 289:112960. doi: 10.1016/j.psychres.2020.112960
24. Figueroa-Barra A, Del Aguila D, Cerda M, Gaspar PA, Terissi LD, Durán M, et al. Automatic language analysis identifies and predicts schizophrenia in first episode of psychosis. Schizophr. (2022) 8:53. doi: 10.1038/s41537-022-00259-3
25. Tang SX, Kriz R, Cho S, Park SJ, Harowitz J, Gur RE, et al. Natural language processing methods are sensitive to sub-clinical linguistic differences in schizophrenia spectrum disorders. NPJ Schizophr. (2021) 7:25. doi: 10.1038/s41537-021-00154-3
26. Zhang T, Schoene AM, Ji S, and Ananiadou S. Natural language processing applied to mental illness detection: a narrative review. NPJ Digit Med. (2022) 5:46. doi: 10.1038/s41746-022-00589-7
27. Balan R, Dobrean A, and Poetar CR. Use of automated conversational agents in improving young population mental health: a scoping review. NPJ Digit Med. (2024) 7:75. doi: 10.1038/s41746-024-01072-1
28. Ho A, Hancock J, and Miner AS. Psychological, relational, and emotional effects of self-disclosure after conversations with a chatbot. J Commun. (2018) 68:712–33. doi: 10.1093/joc/jqy026
29. Miner AS, Milstein A, Schueller S, Hegde R, Mangurian C, and Linos E. Smartphone-based conversational agents and responses to questions about mental health, interpersonal violence, and physical health. JAMA Intern Med. (2016) 176:619–25. doi: 10.1001/jamainternmed.2016.0400
30. Lai JW, Ang CKE, Acharya UR, and Cheong KH. Schizophrenia: A survey of artificial intelligence techniques applied to detection and classification. Int J Environ Res Public Health. (2021) 18:6099. doi: 10.3390/ijerph18116099
31. Bracher-Smith M, Rees E, Menzies G, Walters JTR, O’Donovan MC, Owen MJ, et al. Machine learning for prediction of schizophrenia using genetic and demographic factors in the UK biobank. Schizophr Res. (2022) 246:156–64. doi: 10.1016/j.schres.2022.06.006
32. Wu CS, Luedtke AR, Sadikova E, Tsai HJ, Liao SC, Liu CC, et al. Development and validation of a machine learning individualized treatment rule in first-episode schizophrenia. JAMA Netw Open. (2020) 3:e1921660. doi: 10.1001/jamanetworkopen.2019.21660
33. Anderson KM, Collins MA, Chin R, Ge T, Rosenberg MD, and Holmes AJ. Transcriptional and imaging-genetic association of cortical interneurons, brain function, and schizophrenia risk. Nat Commun. (2020) 11:2889. doi: 10.1038/s41467-020-16710-x
34. Kong L, Zhang Y, Wu XM, Wang XX, Wu HS, Li SB, et al. The network characteristics in schizophrenia with prominent negative symptoms: a multimodal fusion study. Schizophr. (2024) 10:10. doi: 10.1038/s41537-023-00408-2
35. Stauffer EM, Bethlehem RAI, Dorfschmidt L, Won H, Warrier V, and Bullmore ET. The genetic relationships between brain structure and schizophrenia. Nat Commun. (2023) 14:7820. doi: 10.1038/s41467-023-43567-7
36. Yang H, Chang F, Zhu D, Fumie M, and Liu Z. Application of artificial intelligence in schizophrenia rehabilitation management: a systematic scoping review. (2024). doi: 10.48550/arXiv.2405.10883
37. Allott K, Liu P, Proffitt TM, and Killackey E. Cognition at illness onset as a predictor of later functional outcome in early psychosis: systematic review and methodological critique. Schizophr Res. (2011) 125:221–35. doi: 10.1016/j.schres.2010.11.001
38. Stouten LH, Veling W, Laan W, van der Helm M, and van der Gaag M. Psychosocial functioning in first-episode psychosis and associations with neurocognition, social cognition, psychotic and affective symptoms. Early Interv Psychiatry. (2017) 11:23–36. doi: 10.1111/eip.12210
39. Badal VD, Depp CA, Hitchcock PF, Penn DL, Harvey PD, and Pinkham AE. Computational methods for integrative evaluation of confidence, accuracy, and reaction time in facial affect recognition in schizophrenia. Schizophr Res Cognit. (2021) 25:100196. doi: 10.1016/j.scog.2021.100196
40. Bosco FM, Colle L, Salvini R, and Gabbatore I. A machine-learning approach to investigating the complexity of theory of mind in individuals with schizophrenia. Heliyon. (2024) 10:e30693. doi: 10.1016/j.heliyon.2024.e30693
41. de Nijs J, Burger TJ, Janssen RJ, Kia SM, van Opstal DPJ, de Koning MB, et al. Individualized prediction of three- and six-year outcomes of psychosis in a longitudinal multicenter study: a machine learning approach. NPJ Schizophr. (2021) 7:34. doi: 10.1038/s41537-021-00162-3
42. Li C, Wang W, Guo Q, Jiang L, Qiao K, Hu Y, et al. Deep learning system for brain image-aided diagnosis of multiple major mental disorders. bioRxiv. (2022). doi: 10.1101/2022.06.01.22275855
43. Lin E, Lin CH, and Lane HY. Applying a bagging ensemble machine learning approach to predict the functional outcome of schizophrenia with clinical symptoms and cognitive functions. Sci Rep. (2021) 11:6922. doi: 10.1038/s41598-021-86382-0
44. Lin E, Lin CH, and Lane HY. Prediction of functional outcomes of schizophrenia with genetic biomarkers using a bagging ensemble machine learning method. Sci Rep. (2021) 11:10179. doi: 10.1038/s41598-021-89540-6
45. Leighton SP, Upthegrove R, Krishnadas R, Benros ME, Broome MR, Gkoutos GV, et al. Development and validation of multivariable prediction models of remission, recovery, and quality of life outcomes in people with first episode psychosis: a machine learning approach. Lancet Digit Health. (2019) 1:e261–70. doi: 10.1016/S2589-7500(19)30121-9
46. Shibata Y, Victorino JF, Natsuyama T, Okamoto N, Yoshimura R, and Shibata T. Estimation of subjective quality of life in schizophrenic patients using speech features. Front Rehabil Sci. (2023) 4:1121034. doi: 10.3389/fresc.2023.1121034
47. Walter N, Wenzel J, Haas SS, Squarcina L, Bonivento C, Ruef A, et al. A multivariate cognitive approach to predict social functioning in recent onset psychosis in response to computerized cognitive training. Prog Neuropsychopharmacol Biol Psychiatry. (2024) 128:110864. doi: 10.1016/j.pnpbp.2023.110864
48. Wang W, Mirjafari S, Harari G, Ben-Zeev D, Brian R, Choudhury T, et al. Social sensing: Assessing social functioning of patients living with schizophrenia using mobile phone sensing. In: Proceedings of the 2020 CHI conference on human factors in computing systems; 2020. ACM, New York, NY, USA (2020). p. 1–15. doi: 10.1145/3313831.3376855
49. Miley K, Bronstein MV, Ma S, Lee H, Green MF, Ventura J, et al. Trajectories and predictors of response to social cognition training in people with schizophrenia: A proof-of-concept machine learning study. Schizophr Res. (2024) 266:92–9. doi: 10.1016/j.schres.2024.02.027
50. Lin E, Lin CH, and Lane HY. Inference of social cognition in schizophrenia patients with neurocognitive domains and neurocognitive tests using automated machine learning. Asian J Psychiatry. (2024) 91:103866. doi: 10.1016/j.ajp.2023.103866
51. Miley K, Meyer-Kalos P, Ma S, Bond DJ, Kummerfeld E, and Vinogradov S. Causal pathways to social and occupational functioning in the first episode of schizophrenia: uncovering unmet treatment needs. Psychol Med. (2023) 53:2041–9. doi: 10.1017/S0033291721003780
52. Lin B, Cecchi G, and Bouneffouf D. Psychotherapy AI companion with reinforcement learning recommendations and interpreta ble policy dynamics. In: Companion proceedings of the ACM web conference 2023 (WWW ‘23 companion); 2023. New York, NY, USA: ACM (2023). p. 932–9. doi: 10.1145/3543873.3587623
53. Dehbozorgi R, Zangeneh S, Khooshab E, Nia DH, Hanif HR, Samian P, et al. The application of artificial intelligence in the field of mental health: a systematic review. BMC Psychiatry. (2025) 25:132. doi: 10.1186/s12888-025-06483-2
54. Rotenberg LS, Borges-Júnior RG, Lafer B, Salvini R, and Dias RS. Exploring machine learning to predict depressive relapses of bipolar disorder patients. J Affect Disord. (2021) 295:681–7. doi: 10.1016/j.jad.2021.08.127
55. Sajjadian M, Lam RW, Milev R, Rotzinger S, Frey BN, Soares CN, et al. Machine learning in the prediction of depression treatment outcomes: a systematic review and meta-analysis. Psychol Med. (2021) 51:2742–51. doi: 10.1017/S0033291721003871
56. Tay JL, Htun KK, and Sim K. Prediction of clinical outcomes in psychotic disorders using artificial intelligence methods: A scoping review. Brain Sci. (2024) 14:878. doi: 10.3390/brainsci14090878
57. Kalmady SV, Greiner R, Agrawal R, Shivakumar V, Narayanaswamy JC, Brown MRG, et al. Towards artificial intelligence in mental health by improving schizophrenia prediction with multiple brain parcellation ensemble learning. NPJ Schizophr. (2019) 5:2. doi: 10.1038/s41537-018-0070-8
58. Shi D, Li Y, Zhang H, Yao X, Wang S, Wang G, et al. Machine learning of schizophrenia detection with structural and functional neuroimaging. Dis Markers. (2021) 2021:9963824. doi: 10.1155/2021/9963824
59. Yassin W, Nakatani H, Zhu Y, Kojima M, Owada K, Kuwabara H, et al. Machine-learning classification using neuroimaging data in schizophrenia, autism, ultra-high risk and first-episode psychosis. Transl Psychiatry. (2020) 10:278. doi: 10.1038/s41398-020-00965-5
60. Zhang J, Rao VM, Tian Y, Yang Y, Acosta N, Wan Z, et al. Detecting schizophrenia with 3D structural brain MRI using deep learning. Sci Rep. (2023) 13:14433. doi: 10.1038/s41598-023-41359-z
61. de Filippis R, Carbone EA, Gaetano R, Bruni A, Pugliese V, Segura-Garcia C, et al. Machine learning techniques in a structural and functional MRI diagnostic approach in schizophrenia: a systematic review. Neuropsychiatr Dis Treat. (2019) 15:1605–27. doi: 10.2147/NDT.S202418
62. López-Díaz Á, Palermo-Zeballos FJ, Gutierrez-Rojas L, Alameda L, Gotor-Sánchez-Luengo F, Garrido-Torres N, et al. Proxy measures for the assessment of psychotic and affective symptoms in studies using electronic health records. BJPsych Open. (2024) 10:e22. doi: 10.1192/bjo.2023.623
63. Ben-Zeev D, McHugo GJ, Xie H, Dobbins K, and Young MA. Comparing retrospective reports to real-time/real-place mobile assessments in individuals with schizophrenia and a nonclinical comparison group. Schizophr Bull. (2012) 38:396–404. doi: 10.1093/schbul/sbr171
64. Woolway GE, Legge SE, Lynham AJ, Smart SE, Hubbard L, Daniel ER, et al. Assessing the validity of a self-reported clinical diagnosis of schizophrenia. Schizophr (Heidelb). (2024) 10:99. doi: 10.1038/s41537-024-00526-5
65. Choi A, Ooi A, and Lottridge D. Digital phenotyping for stress, anxiety, and mild depression: Systematic literature review. JMIR Mhealth Uhealth. (2024) 12:e40689. doi: 10.2196/40689
66. Rajula HSR, Verlato G, Manchia M, Antonucci N, and Fanos V. Comparison of conventional statistical methods with machine learning in medicine: diagnosis, drug development, and treatment. Med (Kaunas). (2020) 56:455. doi: 10.3390/medicina56090455
67. Fernández-Miranda JJ, Díaz-Fernández S, and López-Muñoz F. Effectiveness of more personalized, case-managed, and multicomponent treatment for patients with severe schizophrenia compared to the standard treatment: A ten-year follow-up. J Pers Med. (2022) 12:1101. doi: 10.3390/jpm12071101
68. Olawade DB, Wada OZ, Odetayo A, David-Olawade AC, Asaolu F, and Eberhardt J. Enhancing mental health with Artificial Intelligence: Current trends and future prospects. J Med Surg Public Health. (2024) 3:100099. doi: 10.1016/j.glmedi.2024.100099
69. Alliende LM, Sands BR, and Mittal VA. Chatbots and stigma in schizophrenia: the need for transparency. Schizophr Bull. (2024) 50:957–60. doi: 10.1093/schbul/sbae105
70. Arbanas G, Periša A, Biliškov I, Sušac J, Badurina M, and Arbanas D. Patients prefer human psychiatrists over chatbots: a cross-sectional study. Croat Med J. (2025) 66:13–9. doi: 10.3325/cmj.2025.66.13
71. Fisher CE. The real ethical issues with AI for clinical psychiatry. Int Rev Psychiatry. (2025) 37:14–20. doi: 10.1080/09540261.2024.2376575
72. Saeidnia HR, Hashemi Fotami SG, Lund B, and Ghiasi N. Ethical considerations in artificial intelligence interventions for mental health and well-being: Ensuring responsible implementation and impact. Soc Sci. (2024) 13:381. doi: 10.3390/socsci13070381
Keywords: AI, artificial intelligence, machine learning, psychosis, psychosocial functioning
Citation: Mok CHY, Cheng CPW and Chu MHW (2025) Application of artificial intelligence and psychosocial functioning in psychosis: a systematic review and meta-analysis. Front. Psychiatry 16:1692177. doi: 10.3389/fpsyt.2025.1692177
Received: 25 August 2025; Accepted: 20 October 2025;
Published: 05 November 2025.
Edited by:
Sandra Carvalho, University of Minho, PortugalReviewed by:
Valeria Cioffi, Phenomena Research Group, ItalyMohammad Hossein Salemi, University of Tehran, Iran
Copyright © 2025 Mok, Cheng and Chu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Calvin Pak Wing Cheng, Y2hlbmdwc3lAaGt1Lmhr