ORIGINAL RESEARCH article
Front. Psychiatry
Sec. Psychopathology
Volume 16 - 2025 | doi: 10.3389/fpsyt.2025.1648585
This article is part of the Research TopicGlobal Youth Mental Health Crisis: Understanding Challenges and Advancing Solutions in PsychopathologyView all articles
Machine Learning Models for Predicting the Risk of Depressive Symptoms in Chinese College Students
Provisionally accepted- 1Guangzhou University, Guangzhou, China
- 2South China Normal University, Guangzhou, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Depression is highly prevalent among college students, and accurately identifying risk factors is essential for timely intervention. Given the limitations of traditional linear models in managing high-dimensional data, this study employed machine learning techniques to predict depressive symptoms. Data were collected from 1,635 Chinese college students and included 38 sociodemographic, psychological, and social variables. Four machine learning algorithms, Random Forest, XGBoost, LightGBM, and Support Vector Machine, were evaluated. Results showed that the RF model achieved the highest discriminant performance with an AUC of 0.87 and an accuracy of 0.79, and identified key predictors such as sleep disturbance, perceived stress, experiential avoidance, and self-criticism. SHAP (SHapley Additive exPlanations) analysis further revealed that deteriorating sleep quality and heightened stress levels significantly increased the risk of depressive symptoms. These findings validate the effectiveness of Random Forest in capturing complex data interactions and offer actionable insights for targeted mental health interventions. Future studies should improve generalizability by incorporating more diverse samples and physiological biomarkers.
Keywords: machine learning, depressive symptoms, Risk factors, college students, Random forest (bagging) and machine learning
Received: 17 Jun 2025; Accepted: 07 Jul 2025.
Copyright: © 2025 Yu, Kong, Yu, Ni, Chen and Liao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Chengfu Yu, Guangzhou University, Guangzhou, China
Xiangxuan Kong, Guangzhou University, Guangzhou, China
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.