METHODS article
Front. Artif. Intell.
Sec. AI for Human Learning and Behavior Change
Predicting Adolescent Depressive Symptoms Using Teacher-Reported Textual Descriptions of Abnormal Behaviors: a study based on machine learning
Provisionally accepted- 1Tianjin University, Tianjin, China
- 2Tianjin Anding Hospital, Tianjin, China
- 3The Second Xiangya Hospital of Central South University, Changsha, China
- 4The Second Xiangya Hospital of Central South University National Clinical Research Center for Mental and Mental Disorders, Changsha, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Objective: This study aimed to develop and compare machine learning (ML) models for predicting depressive symptoms in adolescents, based on teacher-reported textual descriptions of student behaviors. Methods: Participants were 441 adolescents from Tianjin, China. Their teachers provided written reports on behavioral or emotional concerns, while the students completed the Patient Health Questionnaire-9 (PHQ-9). Text data from reports were processed using Term Frequency-Inverse Document Frequency (TF-IDF). Four ML models—Random Forest (RF), Support Vector Machine (SVM), eXtreme Gradient Boosting (XGBoost), and Least Absolute Shrinkage and Selection Operator (LASSO)—were trained and evaluated using a 80/20 data split and 5-fold cross-validation. Results: PHQ-9 screening identified 71.7% (n=316) of adolescents with clinically significant depressive symptoms (score ≥10). The Random Forest (RF) model demonstrated superior performance, achieving a recall of 0.97, accuracy of 0.91, precision of 0.92, and F1-score of 0.92. SVM and XGBoost also showed good performance, while LASSO was the weakest. The analysis demonstrated that teacher reports could identify depressive symptoms with up to 97% recall. Conclusion: Machine learning, particularly Random Forest, can effectively predict adolescent depressive symptoms from teacher-reported text. This approach offers a practical and efficient tool for early identification in school settings, facilitating timely intervention.
Keywords: adolescent depression, machine learning, Prediction model, randomForest, Teacher report
Received: 29 Oct 2025; Accepted: 15 Dec 2025.
Copyright: © 2025 Wumaierjiang, Yan, Yuan, Song, Hou, Xu, Sun, Zhou, Yin and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Jiansong Zhou
Huifang Yin
Guangming Xu
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
