ORIGINAL RESEARCH article

Front. Psychiatry

Sec. Psychopathology

Volume 16 - 2025 | doi: 10.3389/fpsyt.2025.1648585

This article is part of the Research TopicGlobal Youth Mental Health Crisis: Understanding Challenges and Advancing Solutions in PsychopathologyView all articles

Machine Learning Models for Predicting the Risk of Depressive Symptoms in Chinese College Students

Provisionally accepted
Chengfu  YuChengfu Yu1*Xiangxuan  KongXiangxuan Kong1*Weijie  YuWeijie Yu1Xingcan  NiXingcan Ni1Jing  ChenJing Chen1Xiaoyan  LiaoXiaoyan Liao2
  • 1Guangzhou University, Guangzhou, China
  • 2South China Normal University, Guangzhou, China

The final, formatted version of the article will be published soon.

Depression is highly prevalent among college students, and accurately identifying risk factors is essential for timely intervention. Given the limitations of traditional linear models in managing high-dimensional data, this study employed machine learning techniques to predict depressive symptoms. Data were collected from 1,635 Chinese college students and included 38 sociodemographic, psychological, and social variables. Four machine learning algorithms, Random Forest, XGBoost, LightGBM, and Support Vector Machine, were evaluated. Results showed that the RF model achieved the highest discriminant performance with an AUC of 0.87 and an accuracy of 0.79, and identified key predictors such as sleep disturbance, perceived stress, experiential avoidance, and self-criticism. SHAP (SHapley Additive exPlanations) analysis further revealed that deteriorating sleep quality and heightened stress levels significantly increased the risk of depressive symptoms. These findings validate the effectiveness of Random Forest in capturing complex data interactions and offer actionable insights for targeted mental health interventions. Future studies should improve generalizability by incorporating more diverse samples and physiological biomarkers.

Keywords: machine learning, depressive symptoms, Risk factors, college students, Random forest (bagging) and machine learning

Received: 17 Jun 2025; Accepted: 07 Jul 2025.

Copyright: © 2025 Yu, Kong, Yu, Ni, Chen and Liao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Chengfu Yu, Guangzhou University, Guangzhou, China
Xiangxuan Kong, Guangzhou University, Guangzhou, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.