ORIGINAL RESEARCH article
Front. Psychiatry
Sec. Digital Mental Health
Volume 16 - 2025 | doi: 10.3389/fpsyt.2025.1608163
This article is part of the Research TopicApplication of chatbot Natural Language Processing models to psychotherapy and behavioral mood healthView all 12 articles
Psychotherapist Remarks' ML classifier: Insights from LLM and Topic Modeling Application
Provisionally accepted- Institute of Psychology (RAS), Moscow, Russia
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
This paper presents the development of a machine learning (ML) classification model for analyzing topics in therapist remarks within a psychotherapeutic context. The study applies BERTopic, a MLbased topic modeling technique, to unstructured dialogues from two distinct groups of therapists: classical (founders of therapeutic schools, such as Carl Rogers, Fritz Perls, and Albert Ellis) and modern (representing various psychotherapeutic approaches). This methodology enables the identification and characterization of recurring topics across these groups. We detail the implementation of the BERTopic algorithm, including the construction of a vector space from therapist remarks, dimensionality reduction, clustering, and optimization of topic representations. To enhance interpretability, the study integrates expert assessment and manual refinement of the topic structure alongside automated modeling. The results reveal the most common and stable topics in therapists' discourse, providing insights into how language patterns in therapy emerge and persist across different therapeutic styles. The extracted topics are utilized as features for training a ML classifier, which is then applied to a case study examining thematic structures in therapy sessions conducted by Carl Rogers and modern Cognitive Behavioral Therapy (CBT) practitioners. The analysis uncovered distinct thematic compositions, identifying key topics that characterize each group's therapeutic practice. The resulting model is publicly available, offering broad applications in psychotherapy research and training. This study contributes to the growing intersection of ML and psychotherapy by demonstrating how automated methods can enhance both therapeutic practice, clinical supervision, and education. It underscores the potential of topic modeling as a powerful tool for deepening our understanding of therapist communication.
Keywords: Psychotherapy, therapist, Language, Speech, Topic Modeling, machine learning, BERTopic, ML classifier
Received: 08 Apr 2025; Accepted: 30 Jun 2025.
Copyright: © 2025 Vanin, Bolshev and Panfilova. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Vadim Bolshev, Institute of Psychology (RAS), Moscow, Russia
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.