EDITORIAL article
Front. Psychiatry
Sec. Schizophrenia
Volume 16 - 2025 | doi: 10.3389/fpsyt.2025.1666275
This article is part of the Research Topic: Natural Language Processing and Artificial Intelligence tools to explore the relationship between language and schizophrenia from diagnosis to care
"Editorial: Natural Language Processing and Artificial Intelligence tools to explore the relationship between language and schizophrenia from diagnosis to care"
Provisionally accepted
- 1L@bISEN, Yncrea Ouest, 20 Rue Cuirasse Bretagne, 29228, Brest, France
- 2Institute of Behavioral Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, New York, United States
- 3Zucker Hillside Hospital, Glen Oaks, United States
- 4Institute of Psychiatry and Neuroscience of Paris (IPNP), INSERM U1266, Team: Pathophysiology of Psychiatric Disorders: Development and Vulnerability, Université Paris Cité, Paris, France
- 5GHU Paris Psychiatrie et Neurosciences, CJAAD, Evaluation, Prevention and Therapeutic Innovation Department, Hôpital Sainte Anne, Paris, France
- 6CNRS GDR 3557-Institut de Psychiatrie, Paris, France
- 7URCI University Hospital Department of Adult Psychiatry, Brest, France
- 8Sorbonne University, Inserm, Pierre-Louis Institute of Epidemiology and Public Health, Paris, France
- 9IMT Atlantique, Lab-STICC, Campus de Brest, Technopôle Brest-Iroise, Brest, France
Schizophrenia is a heterogeneous disorder classically defined by three symptom clusters: positive symptoms (such as hallucinations and delusions), negative symptoms (including affective flattening, avolition, and social withdrawal), and disorganization symptoms (notably thought disorder and incoherent speech). These clusters manifest notably in language disturbances characterized by fragmented, disorganized, or impoverished speech, alongside motor and praxis difficulties and impaired social interactions, reflecting underlying neural circuit disruptions (1). Several (4,5).
In this context, the evaluation of the spoken language of patients with schizophrenia spectrum disorders (SSD) or at risk of developing schizophrenia has repeatedly demonstrated its prognostic and diagnostic value (6,7). Language is a proxy for mental activity, and language disorders, which account for most of the expression of "formal thought disorders", manifest as disorganized speech, loss of coherence, and altered emotional expression (8). Natural language processing (NLP) techniques yield a whole collection of new features that contribute to the clinical picture of SSD and of clinical high risk for psychosis (CHR-P). While semantic coherence emerges as the principal marker in many studies (9), others have emphasized emotional prosody (10). Nevertheless, most linguistic markers, when considered independently, lack specificity for schizophrenia, and symptomatic alterations extend beyond the diagnostic framework of schizophrenia or psychosis alone (11).
A multimodal approach that integrates linguistic data with other clinical and biological markers therefore holds promise for enhancing the accuracy and richness of the detection and assessment of at-risk patients (12,13).
The objective of this Research Topic was to bring together the most advanced studies on diagnosing SSD and/or CHR-P and on predicting patients' outcomes from linguistic markers, and to consider how these markers can be combined across different pathological contexts and trajectories.
Among the articles that make up this topic, that of Just et al. (14) highlights semantic incoherence, namely the inability to maintain a logical thread in discourse. The authors demonstrate that semantic incoherence is a key element of non-affective psychosis that could help in diagnosis. Negative symptom scores correlate with coherence independently of the embedding model; when the Word2Vec method is used, inpatient care, the disorganization score, and the excitement score are additionally associated with coherence.
In addition to semantic incoherence, whose predictive value is discussed below, the article by Olson et al. (15) demonstrates that the tone of discourse can also be important. Using Linguistic Inquiry and Word Count, they find that, despite the absence of significant differences in the count of emotionally charged terms, the tone of speech becomes more "negative" in CHR-P patients, particularly when their positive symptoms are high. This result illustrates the subtlety of language and the complex interactions between its various levels. In both the clinical interview and the classification criteria, the assessment of emotional states is an integral part of the diagnostic process. In clinical practice, emotional states are spontaneously perceived and often identified through vocal expression, particularly prosody, intonation, rhythm, or even intensity.
To build on these results, it is worth recalling that paralinguistic abnormalities linked to emotion (e.g., monotone voice, prosody flattening) (16,17,18) probably reflect affect in a way that complements verbal content when the detection of psychiatric disorders is at stake.
Regarding predictive value, the study by Kim-Dufor et al. (19) uses transcripts of free-speech interviews as input to a machine learning model (XGBoost (20)) to automatically classify patients, with 82% accuracy, into three categories: not at risk, at risk, and first psychotic episode. The authors also examine the respective contributions of the linguistic markers in the transcribed speech to the classification and conclude that semantic coherence, frequency of the pronoun "I," and filled pauses help in predicting patients' outcomes. This approach reconciles algorithmic performance and clinical intelligibility, making the "black box" more transparent, which is an essential condition for the acceptability of AI tools in everyday psychiatric practice.
Finally, the relationships between the aforementioned linguistic anomalies and their neural correlates have also been studied in this topic. Applying the PRISMA method to 37 imaging studies, Alonso-Sánchez et al. (21) explored the link between brain alterations and linguistic disturbances, assessed through semantic coherence, maximal semantic coherence, or disorganization of thought. Whether patients are at ultra-high risk (UHR), have already had a first episode of psychosis (FEP), or present with SSD, structural and functional modifications appear. These are mainly driven by differences in semantic processing during both speech production and comprehension, whether analyzed via NLP models or elicited with specific experimental paradigms. Functional changes are also found in relation to disorders of encoding and/or word selection, two functions closely intertwined in the construction of semantically coherent discourse.
The pattern of visible changes also seems to extend from patients with FEP to those with schizophrenia, through UHR patients. The linguistic and cognitive disruptions highlighted throughout this special issue not only deepen our understanding of schizophrenia spectrum disorders but also pave the way toward more faithful cognitive models of language processing, potentially surpassing existing connectionist frameworks (22).
Beyond the diagnostic domain, these AI-driven approaches hold great promise: by automating language analysis, they could provide more objective, faster, and more cost-effective evaluations than traditional clinical methods. Such tools could contribute to a psychiatry that is more precise, personalized, and predictive, capable of detecting subtle changes well before the onset of severe symptoms. This progress could also, one day, enable innovative applications such as a digital twin of the brain's language functions, offering unprecedented insights into psychosis.
It is important to emphasize that artificial intelligence will never replace the clinician but will rather become a most valuable ally, especially in the complex and inherently subjective field of mental health. Once known as a "language disease," schizophrenia today finds in digital language processing an innovative tool for understanding and care, opening new horizons for both research and clinical practice.
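As a purely illustrative aside, the three transcript-level markers mentioned above for the classification study (semantic coherence, frequency of the pronoun "I," and filled pauses) can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the method of any article in this topic: the filler list, the two-dimensional toy embeddings, and all function names are hypothetical. Coherence is approximated here as the mean cosine similarity between averaged word vectors of consecutive sentences, which is one common simplification among several.

```python
import math
import re
from typing import Dict, List, Optional, Sequence, Set

# Hypothetical filler inventory, for illustration only.
FILLED_PAUSES: Set[str] = {"uh", "um", "er", "erm", "hmm"}


def tokenize(text: str) -> List[str]:
    """Lowercase the text and return runs of ASCII letters as tokens."""
    return re.findall(r"[a-z]+", text.lower())


def token_rate(tokens: Sequence[str], targets: Set[str]) -> float:
    """Share of tokens that belong to `targets` (0.0 for an empty input)."""
    if not tokens:
        return 0.0
    return sum(1 for t in tokens if t in targets) / len(tokens)


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def sentence_vector(
    tokens: Sequence[str], embeddings: Dict[str, List[float]]
) -> Optional[List[float]]:
    """Average the vectors of known tokens; None if no token is known."""
    known = [embeddings[t] for t in tokens if t in embeddings]
    if not known:
        return None
    dim = len(known[0])
    return [sum(vec[i] for vec in known) / len(known) for i in range(dim)]


def coherence(text: str, embeddings: Dict[str, List[float]]) -> Optional[float]:
    """Mean cosine similarity between consecutive sentence vectors.

    Returns None when fewer than two sentences can be embedded.
    """
    sentences = re.split(r"[.!?]+", text)
    vectors = [sentence_vector(tokenize(s), embeddings) for s in sentences]
    usable = [v for v in vectors if v is not None]
    sims = [cosine(a, b) for a, b in zip(usable, usable[1:])]
    return sum(sims) / len(sims) if sims else None


# Toy embeddings invented for this example; a real pipeline would load
# vectors from a trained model instead.
TOY_EMBEDDINGS: Dict[str, List[float]] = {
    "dog": [1.0, 0.0],
    "cat": [0.9, 0.1],
    "tax": [0.0, 1.0],
}

tokens = tokenize("Um I think I uh went home.")
features = {
    "first_person_rate": token_rate(tokens, {"i"}),
    "filled_pause_rate": token_rate(tokens, FILLED_PAUSES),
    "coherence": coherence("The dog barked. The cat ran.", TOY_EMBEDDINGS),
}
print(features)
```

In a setting like the one described above, such features would be computed per interview and passed to a classifier. The sketch stops at feature extraction and makes no claim about the feature definitions or model settings used in the cited study.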
Keywords: schizophrenia, natural language processing, linguistic markers, psychosis, early detection
Received: 15 Jul 2025; Accepted: 28 Jul 2025.
Copyright: © 2025 Dufor, Nikzad, Lucarini and Lemey. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Olivier Dufor, L@bISEN, Yncrea Ouest, 20 Rue Cuirasse Bretagne, 29228, Brest, France
Christophe Lemey, URCI University Hospital Department of Adult Psychiatry, Brest, France
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.