Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Psychol., 29 January 2026

Sec. Educational Psychology

Volume 16 - 2025 | https://doi.org/10.3389/fpsyg.2025.1684274

Explaining factors influencing students' depression with a deep learning approach


Xinyu Li&#x;Xinyu Li1Yunyi Hu
&#x;Yunyi Hu2*Huohong Chen&#x;Huohong Chen3Xingxing Wang
&#x;Xingxing Wang4*Feng GongFeng Gong4
  • 1School of Computer Science and Engineering, Central South University, Changsha, China
  • 2School of Information Resource Management, Renmin University of China, Beijing, China
  • 3School of Educational Science, Hunan Normal University, Changsha, China
  • 4School of Marxism, Central South University, Changsha, China

Objective: Student mental health has emerged as an increasingly prominent issue in sustainable educational healthcare systems. Accurately and promptly identifying students' depression and analyzing the key factors associated with it are crucial for improving student mental health.

Method: We propose an artificial intelligence algorithm, GLNet, that integrates Mamba and convolutional layers to extract features from students' demographic, academic, and lifestyle information for depression analysis. The performance of GLNet is validated on the publicly available Student Depression Dataset.

Results: GLNet achieves an accuracy of 88.84% on the Student Depression Dataset, outperforming other methods and verifying its effectiveness in student depression analysis. Factor contribution analysis indicates that academic pressure and financial stress may be associated with student depression, while healthy dietary habits and academic satisfaction may alleviate depression. Subgroup analysis further reveals that a higher Cumulative Grade Point Average may be positively correlated with depression in female students, and unhealthy dietary habits may be linked to depression among doctoral students.

Conclusion: GLNet can serve as a reliable tool for enhancing student mental health. It also provides valuable insights for educators to identify students at risk of depression, contributing to the optimization of student mental health intervention strategies.

1 Introduction

Depression is a common mental illness characterized by persistent low mood, loss of interest or pleasure in activities, impaired memory, and feelings of fatigue or lack of energy (Arnone et al., 2024; Li et al., 2025; Maes et al., 2024; Haghish and Czajkowski, 2024). With the rapid development of society and the increasing pace of life, the incidence of depression among student populations has shown an upward trend (Liu et al., 2022). According to the World Health Organization (WHO), there are over 400 million people suffering from depression worldwide (Liu et al., 2024), posing a huge challenge to public health systems (Yılmaz et al., 2025). Depression can lead to a decline in students' academic performance and social abilities, and trigger a chain reaction, such as communication barriers between students and teachers, which severely hinders students' overall development (Verboom et al., 2014; Sörberg Wallin et al., 2019). Therefore, designing an accurate depression diagnosis mechanism for students and quantifying the risk factors is a necessary measure to improve students' mental health.

The etiology of depression can be understood through a biopsychosocial model, which posits that depression arises from a complex interplay of biological vulnerabilities, psychological processes, and social-contextual factors (Borrell-Carrió et al., 2004; Schotte et al., 2006; Maxwell and Cole, 2009). Several theoretical models elaborate on the multifaceted mechanisms underlying student depression. Specifically, research related to the interaction mechanism of social structure, stress, and mental health (Aneshensel et al., 1991; Morris et al., 2008), including the Diathesis-Stress Model (Colodro-Conde et al., 2018; Arnau-Soler et al., 2019; Harkness and Stewart, 2026), posits that external environmental stressors, such as academic pressure, financial difficulties, and interpersonal conflicts, interact with individuals' inherent vulnerabilities, thereby increasing the risk of depression. Beck's Cognitive Theory of Depression (Haaga et al., 1991; Haaga and Beck, 1995; Zaiden and Mahfar, 2024) emphasizes that hopelessness about the future may be associated with negative cognitive biases, including selective attention to adverse information, overgeneralization, and catastrophizing thinking, which in turn predispose individuals to depression. Social Role Theory (Kim and Kwon, 2025; Hsu et al., 2021) explains gender differences in depression risk, attributing these disparities to distinct social roles and expectations assigned to men and women. Specifically, women often face dual role conflict, as they are pressured to achieve academic excellence while conforming to traditional feminine role norms, inducing psychological stress. Therefore, these models reveal how external stressors, cognitive processing biases, and social structural factors interact to shape students' depression risk, highlighting the “external-internal interaction” nature of depressive pathogenesis.

Research on the mechanisms underlying the interactions among psychological needs, development, and mental health (Tang et al., 2020), such as Self-Determination Theory (Moore et al., 2021; Leow et al., 2016), identifies three basic psychological needs, including autonomy, competence, and relatedness. Satisfaction of these needs enhances intrinsic motivation and promotes mental health, whereas unmet needs may induce depression. For example, students' satisfaction with their learning experience may serve as a key factor influencing their risk of depression. Borbély's Two-Process Model (Borbély, 1987; Abhilash and Shafer, 2024) explains how sleep demand accumulates with wakefulness duration, as insufficient sleep impairs individuals' emotional regulation, thereby increasing the risk of depression. Additionally, Erikson's Theory of Psychosocial Development (Knight, 2017; Degges-White, 2017; Sekowski et al., 2024) reveals that individuals face unique developmental tasks and psychological challenges at different stages of life. As individuals age, their cognitive abilities, such as rational analysis and emotional regulation, gradually improve, and psychological resilience is enhanced. However, the dominant risk factors for depression vary across developmental stages, implying potential differences in the manifestations of depression among students of different educational levels and age groups. These models supplement understanding of depression mechanisms by focusing on individuals' internal psychological needs, developmental laws, and basic physiological regulation, emphasizing that the fulfillment of intrinsic needs and the maintenance of developmental balance are crucial for analyzing student depression.

Traditional statistical methods and many early machine learning models often assess them in isolation. These approaches fail to capture the complex, non-linear interplay between factors, which is central to the psychopathology of depression. Artificial intelligence algorithms have been proven capable of modeling the relationship between factors and outcomes, and have shown effectiveness in the diagnosis of individuals' mental health (Mimikou et al., 2025; Zhang X. et al., 2023; Algarni and Thayananthan, 2025). Specifically, Hong et al. (2025) used logistic regression to model factors in disabled elderly individuals to predict their levels of depression. Vu et al. (2025) employed XGBoost to model individual information such as academic, and financial status to predict depression. Risqi et al. (2025) used SVM to model behavioral data of college students to predict whether they have persistent depression. Gohari et al. (2024) used random forests to model factors such as family life and school belongingness in Canadian adolescents to predict depression. Song et al. (2025) used LightGBM to model factors in elderly individuals to predict their depression. However, these methods struggle to explore the deeper information in the data and are highly dependent on manually selected high-quality features.

Deep learning can extract deep information from the data through multiple hidden layers, enabling the exploration of complex patterns in factors for depression (Zamani et al., 2025). Specifically, Baek and Chung (2020) used multiple separate dense layers to model the lifestyle and health status of individuals from the Korean National Health and Nutrition Examination Survey to predict their depression levels. Saha et al. (2024) proposed DeprMVM, which uses a multilayer perceptron (MLP) to extract deep features from individuals' psychological and sociodemographic information, followed by using SVM for depression prediction. However, these methods struggle to model the global contextual relationships between factors, limiting the performance of the model. Additionally, Mamba (Dao and Gu, 2024) has shown potential in computer vision and text processing by using structured spatial models (SSM) to efficiently extract global information from input data and improve the linear scalability of data processing. However, Mamba's potential in enhancing depression detection in students has not been fully explored. Global information can reveal trends in the data and the relationships between distant factors (Tang et al., 2024; Khan and Ghosh, 2021). Local information can capture the details within the data (El-Rashidy et al., 2024; Kostopoulos et al., 2019). Exploring and combining both global and local information in the data could provide a more comprehensive perspective, helping to further improve the performance of student depression prediction.

Explaining the decision-making process of deep learning methods may help educators better identify the triggers of depression. However, deep learning methods are often regarded as a black-box approach, making their interpretability analysis a challenge (Xu and Yang, 2025). The Shapley Additive Explanations (SHAP) method (Lundberg and Lee, 2017), as a game-theory-based interpretability technique, provides a powerful tool for the interpretability of deep learning models. SHAP quantifies the importance of features by calculating their marginal contributions to the model's output and reveals the decision-making rationale of the model in a consistent and interpretable way. Therefore, using the SHAP method to explain the impact of different factors on student depression may be effective for educators in enhancing students' mental health.

To address this gap, we propose GLNet, a deep learning model designed to concurrently analyze both local and global features within a student's profile. Crucially, by integrating GLNet with SHAP, an explainability method, we aim not only to predict but to explain the drivers of depression, turning a “black box” into a transparent tool for psychological insight. Specifically, this study aims to: (1) Validate the performance of GLNet in accurately identifying student depression. (2) Identify and quantify the key risk and protective factors and their complex contributions. (3) Investigate the heterogeneity of these factors across different student subgroups, providing tailored insights for targeted interventions.

2 Materials and methods

2.1 Dataset description

We use the publicly available Student Depression Dataset (SDD; Jurado, 2025; Shamim, 2025). SDD was originally gathered through anonymized, self-reported surveys distributed across various educational institutions, and sensitive information was removed or encoded to protect individual privacy, ensuring compliance with ethical research standards. This dataset contains 27,901 student samples. After removing samples with missing information, a total of 27,898 samples are used in this study. Each student sample includes their demographic information, academic indicators, lifestyle, wellbeing, other additional factors, and whether they are experiencing depression using standardized depression scales. We follow data processing approaches (Zeng et al., 2025; Chowdhury and Das, 2025; Raza et al., 2025; Risqi et al., 2025) for SDD, ensuring that all student information is fully anonymized without any disclosure of personal privacy information. Moreover, demographic factors include the age of the student in years, the gender of the student, and the city or region where the student resides. Academic indicators include the cumulative grade point average of the student, how satisfied the student is with their studies, and the level of pressure the student faces in academic settings, such as stress from exams, assignments, and overall academic expectations. Lifestyle and Wellbeing factors include the average number of hours the student sleeps per day, the student's eating patterns and nutritional habits, the average number of hours per day the student dedicates to study, the student's satisfaction with their work environment, and the pressure related to work responsibilities. Additional factors include the field of study of the student, the academic degree or program that the student is pursuing, whether the student has ever experienced suicidal ideation, whether the student has a family history of mental illness, and the degree of stress experienced due to financial concerns.

We split the dataset into training and testing sets with an 80:20 ratio, with 15% of the training set used as a validation set for hyperparameter tuning. Table 1 shows the demographic characteristics of the data. Notably, the depression status, gender, and age of the students in the training and testing dataset remain consistent.

Table 1
www.frontiersin.org

Table 1. Summary of the datasets used in this study.

2.2 Data encoding

To analyze students' depression using the artificial intelligence algorithm GLNet, we have performed one-hot encoding on students' potential depression risk factors and their depression status. Specifically, students with a depressed status are encoded as 1, and those with a non-depressed status are encoded as 0. To facilitate the factor contribution analysis and quantify the impact of different factors on student depression, we have re-encoded the factors “Profession,” “Degree,” “City,” “Sleep Duration” and “Dietary Habits.” Specifically, we have split these factors into multiple binary factors. Finally, we obtain 114 encoded factors as inputs for GLNet.

2.3 Model development

The overall pipeline of our model can be divided into three steps: (1) We use global and local blocks (GLBs) to extract global and local features to predict students' depression. (2) We input the prediction results of GLNet into a weighted cross-entropy loss function to optimize the trainable parameters within GLNet. (3) We use SHAP to quantify the contribution of factors in the trained GLNet model's decision-making process, thereby explaining the impact of different factors on students' depression.

2.3.1 Structure of GLNet

To facilitate the subsequent extraction of global and local features, we first use a fully connected layer to linearly map the encoded 114 student features to 128 deep features, which can be expressed as:

xfc=ffc(x),    (1)

where ffc is the fully connected layer. We then input the xfc into GLBs to explore the global and local features in the data, and employ residual connections to aggregate the shallow and deep features of the network for student depression prediction.

Global and local block for feature extraction: as shown in Figures 1B, C, GLNet is primarily composed of GLBs. The GLBs use Mamba and convolutional layers, respectively, to explore the global and local features in students' academic and lifestyle information. Specifically, the Mamba layer (Dao and Gu, 2024), based on State-Space Models (SSM), introduces dynamic systems that can adaptively update based on the current state of the sequence, which allows for more efficient modeling of long-range dependencies in the input data. The Mamba layer can be represented as:

fmamba(ht)=F(ht-1,xt,θ),    (2)
Figure 1
Flowchart illustrating a machine learning process for predicting depression. Section A details data preparation: student information is encoded and normalized. Section B describes the Global and Local Block (GLB), incorporating steps like Mamba, convolution, Leaky ReLU, expansion/compression, and batch normalization. Section C outlines the GLNet, showing a sequence of fully connected layers and GLBs with residual connections leading to prediction of depression or non-depression.

Figure 1. Overall pipeline of the proposed GLNet. (A) All data are encoded and normalized to be input into the network. (B) The Global and Local Block is employed for deep feature extraction. (C) GLNet performs student depression analysis.

where ht is the state at time step t. xt is the input at time step t. θ represents the parameters of the model, and F represents the function that updates the state at each time step based on the previous state and the current input. Additionally, Convolution is regarded as an effective method for extracting local information from data due to its localized receptive field (Sadasivan et al., 2025; Kim et al., 2025). Therefore, the process of GLBs can be represented as:

fglb(xfc)=BN(ffc(β(fconv(fmamba(xfc))))),    (3)

where fconv is the 1D convolutions with kernel size of 3. β is the Leaky ReLU activation function for regulization. BN is the Batch Normalization. ffc in GLBs are used to either expand the features to twice their original size or compress them to half.

Shallow and deep feature aggregation: we use four GLBs to extract features from student information. Specifically, the first two GLBs utilize fully connected layers to expand the features to twice their original size, while the last two compress them to half. We use the residual connection to link shallow and deep features with the same dimensionality, forming a U-shaped structure, which can mitigate the vanishing gradient problem, aiding in the robust training of the network and enabling the extraction of deeper features. Finally, the resulting features are used for students' depression prediction, which can be expressed as:

fxglb1=fglbup(xfc),    (4)
y=fglbdowm(fglbdowm(fglbup(xglb1))+xglb1),    (5)

where fglbup represents the fully connected layer used to expand the features to twice their size, while fglbdown represents the fully connected layer used to compress the features to half of their original size.

2.3.2 Factor contribution analysis in GLNet with SHAP

The interpretability of Deep Neural Networks (DNN) has been a challenge for decades (Xu and Yang, 2025; Shah and Sureja, 2025). By explaining the impact of different factors on DNN's decision-making, we can better understand the factors contributing to the formation of depression, helping educators provide timely support for students. Shapley Additive Explanations (SHAP) is a unified method for interpreting deep learning models, which explains the model's output by calculating the SHAP values for each feature. The SHAP value represents the contribution of that feature to the model's output (Lundberg and Lee, 2017). Specifically, the SHAP method calculates the impact of each feature on the final prediction by comparing the model's prediction when the feature is included and excluded, and sums the contributions of all features, which can be expressed as:

ϕi(f)=SN\{i}|S|!(|N|-|S|-1)!|N|![f(S{i})-f(S)]    (6)

where S is a subset chosen from the feature set N, and does not contain the feature i. N is the full set of features. fS represents the prediction made by the model trained on the feature set S, while f(S∪{i}) is the prediction made by the model trained on the feature set S augmented with feature i. |S| is the size of the subset S. |N| is the size of the full feature set N. The term |S|!(|N|-|S|-1)!|N|! is the weight coefficient, which represents the probability of the feature order, reflecting the impact of the position of feature i in all possible permutations on the contribution value. We visualize the factor contribution using SHAP values to clearly show the positive and negative contribution of the factors on depression prediction.

2.4 Compared methods and evaluation metrics

To validate the effectiveness of GLNet, we compare it with five machine learning methods, including Logistic Regression (LR; Hong et al., 2025), Random Forest (RF; Gohari et al., 2024), XGBoost (Vu et al., 2025), Support Vector Machine (SVM; Risqi et al., 2025), and LightGBM (Song et al., 2025), as well as three deep learning methods, (Baek and Chung, 2020), (Zhou et al., 2024), and DeprMVM (Saha et al., 2024). To ensure a fair comparison, we follow the configurations described in their papers and the public code. Specifically, we set the maximum number of iterations for LR to 100. For SVM, we set the degree of the polynomial kernel to 3. For RF and XGBoost, we use 100 trees for modeling the factors. LightGBM uses Gradient Boosting Decision Trees to model factors for predicting depression. Baek and Chung use dense layers to extract features for predicting depression. Zhou et al. integrated random forest and fully connected layers for depression prediction. DeprMVM combined the MLP and SVM for depression prediction (Baek and Chung, 2020), Zhou et al., 2024).

To evaluate the performance of the models in predicting student depression, we use metrics such as Area Under the Curve (AUC), Accuracy (ACC), Sensitivity (SEN), Specificity (SPE), and F1_Score to quantitatively assess the model's prediction of whether students are in depression.

ACC=TP+TNTP+TN+FP+FN,    (7)
SEN=TPTP+FN,    (8)
SPE=TNTN+FP,    (9)
F1_Score=2×TP2TP+FP+FN,    (10)

where True Positives (TP) are the number of students correctly predicted as depressed by the model. True Negatives (TN) are the number of students correctly predicted as non-depressed by the model. False Positives (FP) are the number of students incorrectly predicted as depressed by the model, and False Negatives (FN) are the number of students incorrectly predicted as non-depressed by the model. Additionally, we also use a 95% confidence interval to assess the robustness of the models.

2.5 Implementation details

We train all models using NVIDIA 2080 Ti GPUs with PyTorch. We standardize all factors to have a mean of 0 and a standard deviation of 1. During training, we use dropout to randomly drop 10% of the factors and apply a 10% data shift for data augmentation to enhance the robustness of the model training. We use the Ranger optimizer (Wright and Demeure, 2021) with an initial learning rate of 1e-4, which is gradually reduced during the training process. We use the weighted cross-entropy loss function to optimize this depression prediction task. The batch size for training is set to 256.

3 Results

3.1 Comparison with other methods

To ensure the feasibility of contribution analysis using GLNet and its effectiveness in predicting student depression, we compare it with other methods. As shown in Table 2, GLNet achieves the best performance. Specifically, compared to the machine learning method SVM, GLNet improves AUC and ACC by 4.28 and 6.73%, respectively, which may be because GLNet is able to explore deeper information within the data. Compared to the deep learning method DeprMVM, GLNet improves AUC and ACC by 2.89 and 4.04%, respectively, which may be because GLNet effectively explore both global and local information in the data through the Mamba and convolution, demonstrating its effectiveness in predicting student depression. Therefore, GLNet can be used to accurately analyze the impact of different factors on students' depression.

Table 2
www.frontiersin.org

Table 2. Comparison of different methods.

3.2 Effectiveness of weighted cross-entropy loss

To verify the effectiveness of the weighted cross-entropy loss, we compare GLNet with the model without using the weighted cross-entropy loss. As shown in Table 3, the AUC and ACC of the model without using the weighted cross-entropy loss decrease by 2.71 and 2.41%, respectively, demonstrating the effectiveness of the weighted cross-entropy loss.

Table 3
www.frontiersin.org

Table 3. Effectiveness of weighted cross-entropy loss.

3.3 Analysis of factors influencing students' depression

We use the SHAP method (Lundberg and Lee, 2017) to quantify the impact of students' academic and lifestyle habits on their depression. As shown in Figure 2, the x-axis represents SHAP values, and the y-axis arranges factors based on their impact. Each point represents a sample. We find that having suicidal thoughts, academic pressure, financial stress, work/study hours, family history of mental illness, and sleep duration of less than 5 h are the most significant factors contributing to student depression. In contrast, older age, healthy eating habits, and satisfaction with one's academic performance are typically associated with a lower likelihood of depression. These findings may be helpful for enhancing the optimization of educational healthcare systems.

Figure 2
A SHAP summary plot analyzing factors impacting suicidal thoughts. The horizontal axis shows SHAP values, indicating the impact on the model output. Factors like academic pressure and financial stress have the highest positive impact, while sleep duration has a strong negative influence. Color gradients represent feature values, from low (blue) to high (red).

Figure 2. Summary of contributions from different factors on the model's decision. The vertical axis represents the importance ranking of features across different factors, with higher positions indicating greater contribution to the model's decision-making. The horizontal axis denotes SHAP values: the higher the positive value, the greater the contribution to the model's determination of students' depression; conversely, the larger the absolute value of the negative value, the stronger the inhibitory effect on this determination. Each point in the figure corresponds to a student, with the colors reflecting the values of the factors from low (blue) to high (red).

3.4 Analysis of GLNet on data subgrouped by gender

To explore the impact of students' academic and life factors on depression across different genders, we conduct a contribution analysis on students of different genders. As shown in Figure 3, we find that for male students, those with an educational level at Grade 12 (senior year of high school) have a lower likelihood of depression. For female students, the higher their Cumulative Grade Point Average (CGPA), the more likely they are to experience depression. These findings may be helpful for enhancing the optimization of educational healthcare systems.

Figure 3
Two side-by-side SHAP value plots showing feature impact on model output. The left plot includes features such as suicidal thoughts, academic pressure, financial stress, and more, with red indicating high impact and blue indicating low. The right plot displays a similar setup with slight changes in the feature list, like study satisfaction and CGPA. Both plots have SHAP values ranging from -0.25 to 0.25.

Figure 3. Subgroup analysis of GLNet on students subgrouped by the genders.

3.5 Analysis of GLNet on data subgrouped by educational levels

To explore the impact of various factors on depression among students at different educational levels, we conduct a subgroup analysis on high school students, undergraduates, master's students, and doctoral students. As shown in Figure 4, we find that the effects of different factors on depression vary significantly across students at different educational levels. Specifically, longer working and study hours significantly increase the risk of depression among high school students, undergraduates, and master's students, but have no significant impact on doctoral students. In addition, unhealthy dietary habits significantly increase the risk of depression among doctoral students. These findings may be valuable for enhancing the optimization of educational healthcare systems.

Figure 4
SHAP value charts illustrate the impact of various features on the model output for suicidal thoughts prediction. Features include academic pressure, financial stress, dietary habits, age, and sleep duration. Each chart shows feature value distributions, with higher SHAP values indicating greater impact. The charts compare different demographics, including city, degree, and dietary habits, showing variations in influence across these categories.

Figure 4. Subgroup analysis of GLNet on data subgrouped by the educational levels.

4 Discussion

We propose GLNet to explain the impact of various factors of students, such as their demographic, academic, lifestyle, wellbeing, and additional factors, on their depression. GLNet achieves an AUC of 93.36% and an ACC of 88.84% on the publicly available SDD dataset, outperforming other machine learning and deep learning methods, which may be because GLNet effectively explores the global and local information in the factors. In addition, GLNet only takes 0.0006 s to diagnose the depression of a single student, demonstrating its potential value in the application of educational healthcare systems. To our knowledge, we are the first to explore the potential of the latest artificial intelligence framework, Mamba, in predicting student depression and to accurately investigate the predictive factors of student depression. This study quantifies the decision-making process of GLNet using the SHAP method to visualize the potential impact of various factors on student depression.

We find that academic pressure may be associated with student depression. Students often equate academic performance with their own (Park et al., 2007; Fahd et al., 2021; Fairlamb, 2022). If excessive pressure leads to failure to meet expectations, it may trigger low self-esteem, making them feel useless (Nguyen et al., 2019), thereby inducing depression. We also find that students with financial stress may be associated with depression, because students facing a shortage of tuition fees and living expenses constantly worry about whether they can complete their studies and maintain a basic standard of living, and this uncertainty consumes their psychological energy, and gradually turns into depression (Zhang C. et al., 2023). Moreover, comparing themselves to classmates in better financial circumstances may be associated with feelings of inferiority, reducing students' sense of self-worth (Amani et al., 2024). Students who engage in prolonged overstudy may also be associated with depression, as prolonged overstudy leads to neurotransmitter imbalance and induces chronic fatigue (Vieriu and Hainagiu, 2025), weakening the body's tolerance to stress. Moreover, numerous studies confirm that depression has a genetic predisposition (Dionisio-García et al., 2023). Individuals with a family history of mental illness may carry susceptibility genes related to emotional regulation (Liu et al., 2021), making it difficult to regulate negative emotions. In addition, families with a history of mental illness may have unhealthy interaction patterns, such as negative stress-coping styles like excessive criticism and emotional neglect (Reupert and Maybery, 2016; Mitchell and Abraham, 2018; Zhou et al., 2023), and students may acquire these patterns during their growth, increasing their risk of depression. Insufficient sleep also easily contributes to student depression, because insufficient sleep overactivates the emotional centers in students' brains, such as the amygdala, while weakening the rational regulatory centers (Eum and Si, 2025), making students more reactive to negative emotions. Furthermore, we find that suicidal ideation may be associated with depression among students. Although suicidal ideation is one of the core symptoms of depression, it can serve as a strong predictive signal for the occurrence or exacerbation of depression (Tamsir et al., 2025; Wani et al., 2022; Amanat et al., 2022), enhancing the predictive performance of the machine learning model.

Through subgroup analysis, we identify differences in the potential impact of various factors on depression among students of different genders. For male students, those with an educational level at Grade 12 may have a lower likelihood of depression. This may be because Grade 12 is a critical transitional period for further education, with heavy academic tasks but clear goals at this stage (Strom et al., 2022; Fernandes et al., 2023), such as preparing for entrance exams, which prevents them from becoming confused about the future and thus may reduce the likelihood of depressive emotions. For female students, a high CGPA may be associated with their depression. This paradoxical finding offers a compelling real-world illustration of how social psychological theories, such as Social Role Theory and the concept of maladaptive perfectionism, operate in an educational context. This may be due to the dual social and cultural expectations placed on women, as they are required to excel academically while implicitly expected to conform to traditional roles, such as being gentle, family-oriented, and socially adept (Wong, 2007; Horta and Tang, 2023). Female students with high CGPAs may fall into a perfectionist trap due to their overpursuit of “all-round excellence.” As they not only need to maintain outstanding academic performance but may also be expected to behave appropriately in interpersonal relationships, these dual pressures drive them to set excessively high standards for themselves, and once they fail to meet expectations in a given area, they are prone to intense self-criticism, which may be associated with depression.

Through subgroup analysis, we find that different factors may have potential effects on students with different educational levels. Longer working and study hours may be associated with depression among high school students, undergraduates, and master's students, but may have no significant impact on doctoral students. This may be because doctoral students' research is mostly based on independent topic selection, and their long working hours are often associated with intrinsic motivations such as exploring areas of interest and pursuing academic breakthroughs, rather than passively completing standardized tasks (Satinsky et al., 2021). This state of investing in one's own goals can reduce the sense of oppression (Reese et al., 2015; Chazan et al., 2022; Guan et al., 2023). Furthermore, after undergoing academic training at the undergraduate and master's levels, doctoral students have developed more mature stress-coping patterns, and they may be more receptive to the academic norm of “long-term investment and delayed rewards.” In addition, unhealthy dietary habits may be associated with depression among doctoral students. This may be because, compared with students at other academic stages, doctoral students' research work requires more intensive mental labor. Meanwhile, if doctoral students maintain unhealthy diets over the long term, such as insufficient protein intake leading to amino acid deficiency, impairing serotonin synthesis, it can weaken their brain's emotional regulation and fatigue recovery capacities. The combination of this impairment of physiological foundations and academic pressure may make them more prone to developing depression. Furthermore, doctoral students have greater dietary autonomy than students at other academic levels, and long-term unhealthy dietary habits often reflect a reduced priority on self-care (Bayes et al., 2022).

Regarding the potential association between high CGPA and depression among female students, we recommend that educators optimize the existing academic evaluation system to reduce reliance on a single GPA-oriented model and incorporate process-oriented assessments, such as research practices and competency demonstrations. Concurrently, educators should offer counseling programs on perfectionism and psychological adjustment to help female students establish reasonable self-expectations and reduce excessive self-criticism. For doctoral students specifically, we suggest that educators organize thematic lectures on the adaptation of a scientific diet to mental work, provide nutritional guidelines in campus canteens, such as recommendations for high-protein diets, and strengthen doctoral students' self-care awareness to mitigate the potential for depression.

5 Limitation and future work

Although this study has made certain progress, there are still some limitations. Firstly, this study mainly explores the impact of students' demographic, academic, and life information on their depression, while other information, such as their interactions with supervisors or classmates, has not been taken into account. In addition, GLNet has only been validated on the SDD dataset collected from cities in India, and its generalizability to other datasets has not yet been verified. In the future, we plan to collect more data to validate the robustness of the model, promoting the potential development of healthcare systems.

6 Conclusion

This study proposes a deep learning network called GLNet, which combines global and local information from students' academic and life factors to accurately diagnose student depression. The study finds that high academic and financial pressure, long study and work hours, and insufficient sleep may be associated with student depression. Furthermore, subgroup analyses by gender and education level reveal that female students with high CGPA may be associated with depression, and that unhealthy dietary habits may be associated with depression among doctoral students. These findings provide important empirical support for the prevention and intervention of student depression.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding authors.

Author contributions

XL: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review & editing. YH: Conceptualization, Formal analysis, Investigation, Methodology, Resources, Writing – original draft, Writing – review & editing. HC: Conceptualization, Formal analysis, Investigation, Writing – original draft, Writing – review & editing. XW: Conceptualization, Data curation, Funding acquisition, Investigation, Resources, Supervision, Writing – original draft, Writing – review & editing. FG: Conceptualization, Data curation, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This research is supported by the Key Project of the Propaganda Department of Hunan Province at the Academy of Marxism (24YBM27), the Education and Science Planning Project of Hunan Province (ND248077) and the Social Science Review Committee Project of Hunan Province (XSP24YBZ011).

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Abhilash, L., and Shafer, O. T. (2024). A two-process model of drosophila sleep reveals an inter-dependence between circadian clock speed and the rate of sleep pressure decay. Sleep 47:zsad277. doi: 10.1093/sleep/zsad277

PubMed Abstract | Crossref Full Text | Google Scholar

Algarni, A. M., and Thayananthan, V. (2025). Cybersecurity for analyzing artificial intelligence (AI)-based assistive technology and systems in digital health. Systems 13:439. doi: 10.3390/systems13060439

Crossref Full Text | Google Scholar

Amanat, A., Rizwan, M., Javed, A. R., Abdelhaq, M., Alsaqour, R., Pandya, S., et al. (2022). Deep learning for depression detection from textual data. Electronics 11:676. doi: 10.3390/electronics11050676

Crossref Full Text | Google Scholar

Amani, D. Z., Taqiyah, I., and Iswinarti. (2024). Factors influencing inferiority feelings: a systematic review. J. Sci. Res. Educ. Technol. 3, 1756–1766. doi: 10.58526/jsret.v3i4.591

Crossref Full Text | Google Scholar

Aneshensel, C. S., Rutter, C. M., and Lachenbruch, P. A. (1991). Social structure, stress, and mental health: competing conceptual and analytic models. Am. Sociol. Rev. 56, 166–178. doi: 10.2307/2095777

Crossref Full Text | Google Scholar

Arnau-Soler, A., Adams, M. J., Clarke, T.-K., MacIntyre, D. J., Milburn, K., Navrady, L., et al. (2019). A validation of the diathesis-stress model for depression in generation scotland. Transl. Psychiatry 9:25. doi: 10.1038/s41398-018-0356-7

PubMed Abstract | Crossref Full Text | Google Scholar

Arnone, D., Karmegam, S. R., Östlundh, L., Alkhyeli, F., Alhammadi, L., Alhammadi, S., et al. (2024). Risk of suicidal behavior in patients with major depression and bipolar disorder-a systematic review and meta-analysis of registry-based studies. Neurosci. Biobehav. Rev. 159:105594. doi: 10.1016/j.neubiorev.2024.105594

Crossref Full Text | Google Scholar

Baek, J.-W., and Chung, K. (2020). Context deep neural network model for predicting depression risk using multiple regression. IEEE Access 8, 18171–18181. doi: 10.1109/ACCESS.2020.2968393

Crossref Full Text | Google Scholar

Bayes, J., Schloss, J., and Sibbritt, D. (2022). The effect of a mediterranean diet on the symptoms of depression in young males (the “AMMEND: a mediterranean diet in men with depression” study): a randomized controlled trial. Am. J. Clin. Nutr. 116, 572–580. doi: 10.1093/ajcn/nqac106

PubMed Abstract | Crossref Full Text | Google Scholar

Borbély, A. (1987). The S-deficiency hypothesis of depression and the two-process model of sleep regulation. Pharmacopsychiatry 20, 23–29. doi: 10.1055/s-2007-1017069

PubMed Abstract | Crossref Full Text | Google Scholar

Borrell-Carrió, F., Suchman, A. L., and Epstein, R. M. (2004). The biopsychosocial model 25 years later: principles, practice, and scientific inquiry. Ann. Fam. Med. 2, 576–582. doi: 10.1370/afm.245

PubMed Abstract | Crossref Full Text | Google Scholar

Chazan, D. J., Pelletier, G. N., and Daniels, L. M. (2022). Achievement goal theory review: an application to school psychology. Can. Journal of Sch. Psychol. 37, 40–56. doi: 10.1177/08295735211058319

Crossref Full Text | Google Scholar

Chowdhury, S., and Das, D. K. (2025). “Navigation of a research study on student depression incorporating various machine learning approaches,” in 2025 4th International Conference on Innovative Mechanisms for Industry Applications (ICIMIA) (Piscataway, NJ: IEEE), 1854–1859.

Google Scholar

Colodro-Conde, L., Couvy-Duchesne, B., Zhu, G., Coventry, W. L., Byrne, E. M., Gordon, S., et al. (2018). A direct test of the diathesis-stress model for depression. Mol. Psychiatry 23, 1590–1596. doi: 10.1038/mp.2017.130

PubMed Abstract | Crossref Full Text | Google Scholar

Dao, T., and Gu, A. (2024). “Transformers are SSMs: Generalized models and efficient algorithms through structured state space duality,” in International Conference on Machine Learning (ICML) [Proceedings of Machine Learning Research (PMLR)].

Google Scholar

Degges-White, S. (2017). “Erikson's theory of psychosocial development,” in College Student Development: Applying Theory to Practice on the Diverse Campus, ed K. K. Wendy (New York, NY: Springer Publishing Company), 35. doi: 10.1891/9780826118165.0004

Crossref Full Text | Google Scholar

Dionisio-García, D. M., Genis-Mendoza, A. D., González-Castro, T. B., Tovilla-Zárate, C. A., Juarez-Rojop, I. E., López-Narváez, M. L., et al. (2023). DNA methylation of genes involved in the HPA axis in presence of suicide behavior: a systematic review. Brain Sci. 13:584. doi: 10.3390/brainsci13040584

PubMed Abstract | Crossref Full Text | Google Scholar

El-Rashidy, M. A., Khodeir, N. A., Farouk, A., Aslan, H. K., and El-Fishawy, N. A. (2024). Attention-based contextual local and global features for urgent posts classification in moocs discussion forums. AIN Shams Eng. J. 15:102605. doi: 10.1016/j.asej.2023.102605

Crossref Full Text | Google Scholar

Eum, M.-J., and Si, E. (2025). Association between depression and the short sleep duration on weekdays and weekends in south Korean adults: a cross-sectional study using the 2016 and 2018 Korea National Health and Mutrition examination survey. Healthcare 13:1043. doi: 10.3390/healthcare13091043

Crossref Full Text | Google Scholar

Fahd, S., Bukhari, F. K., Akhtar, K., Siddique, U., and Javed, F. (2021). Impact of self-worth perception on achievement motivation among university students. J. Manag.Pract. Human. Soc. Sci. 5, 21–27. doi: 10.33152/jmphss-5.2.3

Crossref Full Text | Google Scholar

Fairlamb, S. (2022). We need to talk about self-esteem: the effect of contingent self-worth on student achievement and well-being. Scholarsh. Teach. Learn. Psychol. 8:45. doi: 10.1037/stl0000205

Crossref Full Text | Google Scholar

Fernandes, M. S. V., Mendonça, C. R., da Silva, T. M. V., Noll, P. R. S., de Abreu, L. C., and Noll, M. (2023). Relationship between depression and quality of life among students: a systematic review and meta-analysis. Sci. Rep. 13:6715. doi: 10.1038/s41598-023-33584-3

PubMed Abstract | Crossref Full Text | Google Scholar

Gohari, M. R., Doggett, A., Patte, K. A., Ferro, M. A., Dubin, J. A., Hilario, C., et al. (2024). Using random forest to identify correlates of depression symptoms among adolescents. Soc. Psychiatry Psychiatr. Epidemiol. 59, 2063–2071. doi: 10.1007/s00127-024-02695-1

PubMed Abstract | Crossref Full Text | Google Scholar

Guan, X., Zhang, Y., Li, Y., Zhang, Y., Wang, J., Li, X., et al. (2023). The effect of self-investment on hoarding tendency of Chinese college students: role of psychological connections. Int. J. Mental Health Promotion 25, 755–766. doi: 10.32604/ijmhp.2023.027086

Crossref Full Text | Google Scholar

Haaga, D. A., and Beck, A. T. (1995). Perspectives on depressive realism: implications for cognitive theory of depression. Behaviour research and therapyBehav. Res. Ther. 33, 41–48. doi: 10.1016/0005-7967(94)E0016-C

Crossref Full Text | Google Scholar

Haaga, D. A., Dyck, M. J., and Ernst, D. (1991). Empirical status of cognitive theory of depression. Psychol. Bull. 110:215. doi: 10.1037//0033-2909.110.2.215

PubMed Abstract | Crossref Full Text | Google Scholar

Haghish, E., and Czajkowski, N. (2024). Reconsidering false positives in machine learning binary classification models of suicidal behavior. Curr. Psychol. 43, 10117–10121. doi: 10.1007/s12144-023-05174-z

Crossref Full Text | Google Scholar

Harkness, K. L., and Stewart, J. G. (2026). “Stress and diathesis-stress models in depression,” in APA Handbook of Depression: Classification, Co-occurring Conditions, and Etiological Processes, eds J. W. Pettit, T. M. Olino, R. C. Boyd, B. C. Chu, E. P. Hayden, and D. A. Pizzagalli (Washington, DC: American Psychological Association), 441–458. doi: 10.1037/0000437-024

Crossref Full Text | Google Scholar

Hong, S., Lu, B., Wang, S., and Jiang, Y. (2025). Comparison of logistic regression and machine learning methods for predicting depression risks among disabled elderly individuals: results from the china health and retirement longitudinal study. BMC Psychiatry 25:128. doi: 10.1186/s12888-025-06577-x

PubMed Abstract | Crossref Full Text | Google Scholar

Horta, H., and Tang, L. (2023). Male and female academics' gendered perceptions of academic work and career progression in China. High. Educ. Q. 77, 515–536. doi: 10.1111/hequ.12419

Crossref Full Text | Google Scholar

Hsu, N., Badura, K. L., Newman, D. A., and Speach, M. E. P. (2021). Gender, “masculinity,” and “femininity”: a meta-analytic review of gender differences in agency and communion. Psychol. Bull. 147:987. doi: 10.1037/bul0000343

Crossref Full Text | Google Scholar

Jurado, I. C. (2025). Student Depression Dataset. OpenML. Available online at: https://www.openml.org/search?type=data&status=active&id=46753&sort=runs (Accessed March 12, 2025).

Google Scholar

Khan, A., and Ghosh, S. K. (2021). Student performance analysis and prediction in classroom learning: a review of educational data mining studies. Educ. Inform. Technol. 26, 205–240. doi: 10.1007/s10639-020-10230-3

Crossref Full Text | Google Scholar

Kim, J., and Kwon, K. Y. (2025). How social engagement shapes depressive symptoms in later life: uncovering gendered and asymmetric effects. Innov. Aging. 9:igaf096. doi: 10.1093/geroni/igaf096

PubMed Abstract | Crossref Full Text | Google Scholar

Kim, N., Lim, H., Li, Q., Li, X., Kim, S., and Kim, J. (2025). Enhancing review-based recommendations through local and global feature fusion. Electronics 14:2540. doi: 10.3390/electronics14132540

Crossref Full Text | Google Scholar

Knight, Z. G. (2017). A proposed model of psychodynamic psychotherapy linked to erik erikson's eight stages of psychosocial development. Clin. Psychol. Psychother. 24, 1047–1058. doi: 10.1002/cpp.2066

PubMed Abstract | Crossref Full Text | Google Scholar

Kostopoulos, G., Karlos, S., and Kotsiantis, S. (2019). Multiview learning for early prognosis of academic performance: a case study. IEEE Trans. Learn. Technol. 12, 212–224. doi: 10.1109/TLT.2019.2911581

Crossref Full Text | Google Scholar

Leow, K., Lee, M., and Lynch, M. F. (2016). “Big five personality and depressive symptoms: a self-determination theory perspective on students' positive relationships with others,” in Ideas Research You Can Use: VISTAS 2016 (Alexandria, VA: American Counseling Association), 2–11.

Google Scholar

Li, Y., Shen, X., and Wang, J.-L. (2025). Association between negative life events and suicidal behavior in adolescents: roles of core self-evaluation, depression and gender. Soc. Psychiatry Psychiatr. Epidemiol. 60, 673–684. doi: 10.1007/s00127-024-02725-y

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, J., Ning, W., Zhang, N., Zhu, B., and Mao, Y. (2024). Estimation of the global disease burden of depression and anxiety between 1990 and 2044: an analysis of the global burden of disease study 2019. Healthcare 12:1721. doi: 10.3390/healthcare12171721

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, N., Sun, S., Wang, P., Sun, Y., Hu, Q., and Wang, X. (2021). The mechanism of secretion and metabolism of gut-derived 5-hydroxytryptamine. Int. J. Mol. Sci. 22:7931. doi: 10.3390/ijms22157931

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, X.-Q., Guo, Y.-X., Zhang, W.-J., and Gao, W.-J. (2022). Influencing factors, prediction and prevention of depression in college students: a literature review. World J. Psychiatry 12:860. doi: 10.5498/wjp.v12.i7.860

PubMed Abstract | Crossref Full Text | Google Scholar

Lundberg, S., and Lee, S. I. (2017). A unified approach to interpreting model predictions. arXiv [preprint]. arXiv:1705.07874. doi: 10.48550/arXiv.1705.07874

Crossref Full Text | Google Scholar

Maes, M., Zhou, B., Jirakran, K., Vasupanrajit, A., Boonchaya-Anant, P., Tunvirachaisakul, C., et al. (2024). Towards a major methodological shift in depression research by assessing continuous scores of recurrence of illness, lifetime and current suicidal behaviors and phenome features. J. Affect. Disord. 350, 728–740. doi: 10.1016/j.jad.2024.01.150

PubMed Abstract | Crossref Full Text | Google Scholar

Maxwell, M. A., and Cole, D. A. (2009). Weight change and appetite disturbance as symptoms of adolescent depression: toward an integrative biopsychosocial model. Clin. Psychol. Rev. 29, 260–273. doi: 10.1016/j.cpr.2009.01.007

PubMed Abstract | Crossref Full Text | Google Scholar

Mimikou, C., Kokkotis, C., Tsiptsios, D., Tsamakis, K., Savvidou, S., Modig, L., et al. (2025). Explainable machine learning in the prediction of depression. Diagnostics 15:1412. doi: 10.3390/diagnostics15111412

PubMed Abstract | Crossref Full Text | Google Scholar

Mitchell, J. M., and Abraham, K. M. (2018). Parental mental illness and the transition to college: coping, psychological adjustment, and parent-child relationships. J. Child Fam. Stud. 27, 2966–2977. doi: 10.1007/s10826-018-1133-1

Crossref Full Text | Google Scholar

Moore, E., Holding, A. C., Moore, A., Levine, S. L., Powers, T. A., Zuroff, D. C., et al. (2021). The role of goal-related autonomy: a self-determination theory analysis of perfectionism, poor goal progress, and depressive symptoms. Journal of Counseling PsychologyJ. Couns. Psychol. 68:88. doi: 10.1037/cou0000438

Crossref Full Text | Google Scholar

Morris, M. C., Ciesla, J. A., and Garber, J. (2008). A prospective study of the cognitive-stress model of depressive symptoms in adolescents. J. Abnorm. Psychol. 117:719. doi: 10.1037/a0013741

PubMed Abstract | Crossref Full Text | Google Scholar

Nguyen, D. T., Wright, E. P., Dedding, C., Pham, T. T., and Bunders, J. (2019). Low self-esteem and its association with anxiety, depression, and suicidal ideation in Vietnamese secondary school students: a cross-sectional study. Front. Psychiatry 10:438641. doi: 10.3389/fpsyt.2019.00698

PubMed Abstract | Crossref Full Text | Google Scholar

Park, L. E., Crocker, J., and Kiefer, A. K. (2007). Contingencies of self-worth, academic failure, and goal pursuit. Pers. Soc. Psychol. Bull. 33, 1503–1517. doi: 10.1177/0146167207305538

PubMed Abstract | Crossref Full Text | Google Scholar

Raza, R., Wang, G., Wong, K. W., Laga, H., and Fisichella, M. (2025). “ITL-LIME: instance-based transfer learning for enhancing local explanations in low-resource data settings,” in Proceedings of the 34th ACM International Conference on Information and Knowledge Management [Association for Computing Machinery (ACM)], 2482–2492. doi: 10.1145/3746252.3761183

Crossref Full Text | Google Scholar

Reese, G., Proch, J., and Finn, C. (2015). Identification with all humanity: the role of self-definition and self-investment. Eur. J. Soc. Psychol. 45, 426–440. doi: 10.1002/ejsp.2102

Crossref Full Text | Google Scholar

Reupert, A., and Maybery, D. (2016). What do we know about families where parents have a mental illness? A systematic review. Child Youth Serv. 37, 98–111. doi: 10.1080/0145935X.2016.1104037

Crossref Full Text | Google Scholar

Risqi, M. K., Prastya, I. W. D., and Vikri, M. J. (2025). Comparison of decision tree algorithms and support vector machine (SVM) in depression classification in students. Eduvest J. Univ. Stud. 5, 4557–4567. doi: 10.59188/eduvest.v5i4.51108

Crossref Full Text | Google Scholar

Sadasivan, A., Gananathan, K., Pal Nesamony Rose Mary, J. D., and Balasubramanian, S. (2025). A systematic survey of graph convolutional networks for artificial intelligence applications. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 15:e70012. doi: 10.1002/widm.70012

Crossref Full Text | Google Scholar

Saha, D. K., Hossain, T., Safran, M., Alfarhood, S., Mridha, M., and Che, D. (2024). Ensemble of hybrid model based technique for early detecting of depression based on SVM and neural networks. Sci. Rep. 14:25470. doi: 10.1038/s41598-024-77193-0

PubMed Abstract | Crossref Full Text | Google Scholar

Satinsky, E. N., Kimura, T., Kiang, M. V., Abebe, R., Cunningham, S., Lee, H., et al. (2021). Systematic review and meta-analysis of depression, anxiety, and suicidal ideation among ph. d. students. Sci. Rep. 11:14370. doi: 10.1038/s41598-021-93687-7

Crossref Full Text | Google Scholar

Schotte, C. K., Van Den Bossche, B., De Doncker, D., Claes, S., and Cosyns, P. (2006). A biopsychosocial model as a guide for psychoeducation and treatment of depression. Depress. Anxiety 23, 312–324. doi: 10.1002/da.20177

PubMed Abstract | Crossref Full Text | Google Scholar

Sekowski, M., Wielogórska, M., and Lester, D. (2024). Psychosocial identity, intimacy and suicidality in young adults. Archiv. Suicide Res. 28, 1279–1293. doi: 10.1080/13811118.2023.2300320

PubMed Abstract | Crossref Full Text | Google Scholar

Shah, M., and Sureja, N. (2025). A comprehensive review of bias in deep learning models: methods, impacts, and future directions. Archiv. Comput. Methods Eng. 32, 255–267. doi: 10.1007/s11831-024-10134-2

Crossref Full Text | Google Scholar

Shamim, A. (2025). Student Depression Dataset. Kaggle. Available online at: https://www.kaggle.com/datasets/adilshamim8/student-depression-dataset (Accessed March 12, 2025).

Google Scholar

Song, Y. L. Q., Chen, L., Liu, H., and Liu, Y. (2025). Machine learning algorithms to predict depression in older adults in China: a cross-sectional study. Front. Public Health 12:1462387. doi: 10.3389/fpubh.2024.1462387

PubMed Abstract | Crossref Full Text | Google Scholar

Sörberg Wallin, A., Koupil, I., Gustafsson, J.-E., Zammit, S., Allebeck, P., and Falkstedt, D. (2019). Academic performance, externalizing disorders and depression: 26,000 adolescents followed into adulthood. Soc. Psychiatry Psychiatr. Epidemiol. 54, 977–986. doi: 10.1007/s00127-019-01668-z

PubMed Abstract | Crossref Full Text | Google Scholar

Strom, P. S., Hendon, K. L., Strom, R. D., and Wang, C.-h. (2022). High school student stress and school improvement. Sch. Commun. J., 32:205-228.

Google Scholar

Tamsir, I. S., Putrada, A. G., and Wicaksono, R. L. (2025). “SHAP for explainable AI in using AdaBoost-based depression detection on academic students,” in 2025 International Conference on Data Science and Its Applications (ICoDSA) (Piscataway, NJ: IEEE), 795–800.

Google Scholar

Tang, J., Li, J., Zou, J., Huang, Y., Ding, S., and Zhao, L. (2024). “MGS-Net: fusing global and local feature enhancements for healthcare education management of myasthenia gravis using speech data,” in 2024 IEEE International Conference on Systems, Man, and Cybernetics (SMC) (Piscataway, NJ: IEEE), 3912–3917.

Google Scholar

Tang, M., Wang, D., and Guerrien, A. (2020). A systematic review and meta-analysis on basic psychological need satisfaction, motivation, and well-being in later life: contributions of self-determination theory. Psych. J. 9, 5–33. doi: 10.1002/pchj.293

PubMed Abstract | Crossref Full Text | Google Scholar

Verboom, C. E., Sijtsema, J. J., Verhulst, F. C., Penninx, B. W., and Ormel, J. (2014). Longitudinal associations between depressive problems, academic performance, and social functioning in adolescent boys and girls. Dev. Psychol. 50:247. doi: 10.1037/a0032547

PubMed Abstract | Crossref Full Text | Google Scholar

Vieriu, A. M., and Hainagiu, S. M. (2025). Perfectionism and workaholism as barriers to lifelong learning and occupational sustainability: a cross-professional analysis. Sustainability 17:6512. doi: 10.3390/su17146512

Crossref Full Text | Google Scholar

Vu, T., Yamamoto, M., Tay, J. T., Watanabe, N., Kuriya, Y., Oya, A., et al. (2025). Prediction of depressive disorder using machine learning approaches: findings from the nhanes. BMC Med. Inform. Decis. Mak. 25, 1–12. doi: 10.1186/s12911-025-02903-1

PubMed Abstract | Crossref Full Text | Google Scholar

Wani, M. A., ELAffendi, M. A., Shakil, K. A., Imran, A. S., and Abd El-Latif, A. A. (2022). Depression screening in humans with AI and deep learning techniques. IEEE Trans. Comput. Soc. Syst. 10, 2074–2089. doi: 10.1109/TCSS.2022.3200213

Crossref Full Text | Google Scholar

Wong, W. C. (2007). Double standards in sex: differences between social expectation and practice among early school leavers. J. Epidemiol. Commun. Health 61, 763–763. doi: 10.1136/jech.2006.056390

Crossref Full Text | Google Scholar

Wright, L., and Demeure, N. (2021). Ranger21: a synergistic deep learning optimizer. arXiv [preprint]. arXiv:2106.13731. doi: 10.48550/arXiv.2106.13731

Crossref Full Text | Google Scholar

Xu, B., and Yang, G. (2025). Interpretability research of deep learning: a literature survey. Inform. Fusion 115:102721. doi: 10.1016/j.inffus.2024.102721

Crossref Full Text | Google Scholar

Yılmaz, S., Boz, C., Özsarı, S., Yılmaz, F., Fidan Türkön, B., and Mete, A. (2025). Effects of neurological disorders on health expenditure and economic output: dynamic panel analysis for oecd countries. Systems 15:521. doi: 10.3390/systems13070521

Crossref Full Text | Google Scholar

Zaiden, F. B., and Mahfar, M. B. (2024). Depression among adolescents: a review of beck's cognitive behaviour theory. J. Sains Kesihatan Malays., 22, 1–11. doi: 10.17576/JSKM-2024-2201-01

Crossref Full Text | Google Scholar

Zamani, S., Rostami, A., Nguyen, M., Sinha, R., and Madanian, S. (2025). A gamified ai-driven system for depression monitoring and management. Appl. Sci. 15:7088. doi: 10.3390/app15137088

Crossref Full Text | Google Scholar

Zeng, J., Li, X., Zheng, S., and Yang, L. (2025). “A study on classification of students depression based on gradient boosting algorithm,” in 2025 IEEE 34th Wireless and Optical Communications Conference (WOCC) (Piscataway, NJ: IEEE), 170–173.

Google Scholar

Zhang, C., Zhang, Q., Zhuang, H., and Xu, W. (2023). The reciprocal relationship between depression, social anxiety and aggression in chinese adolescents: the moderating effects of family functioning. J. Affect. Disord. 329, 379–384. doi: 10.1016/j.jad.2023.02.134

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, X., Li, A., Cui, N., Guo, B., Khan, H. T., and Zhang, L. (2023). The association between internet use and depression risk among Chinese adults, middle-aged and older, with disabilities. Systems 11:264. doi: 10.3390/systems11050264

Crossref Full Text | Google Scholar

Zhou, H.-y., Zhu, W.-q., Xiao, W.-y., Huang, Y.-t., Ju, K., Zheng, H., et al. (2023). Feeling unloved is the most robust sign of adolescent depression linking to family communication patterns. J. Res. Adolesc. 33, 418–430. doi: 10.1111/jora.12813

PubMed Abstract | Crossref Full Text | Google Scholar

Zhou, Y., Zhang, X., Gong, J., Wang, T., Gong, L., Li, K., et al. (2024). Identifying the risk of depression in a large sample of adolescents: an artificial neural network based on random forest. J. Adolesc. 96, 1485–1497. doi: 10.1002/jad.12357

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: contribution analysis, deep learning, depression, high education, Mamba, mental health

Citation: Li X, Hu Y, Chen H, Wang X and Gong F (2026) Explaining factors influencing students' depression with a deep learning approach. Front. Psychol. 16:1684274. doi: 10.3389/fpsyg.2025.1684274

Received: 12 August 2025; Revised: 31 December 2025;
Accepted: 30 December 2025; Published: 29 January 2026.

Edited by:

Daniel H. Robinson, The University of Texas at Arlington, United States

Reviewed by:

Sandra Carvalho, University of Minho, Portugal
S. Kuzhalvaimozhi Sathyan, The National Institute of Engineering, India

Copyright © 2026 Li, Hu, Chen, Wang and Gong. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yunyi Hu, aHV5dW55aUBydWMuZWR1LmNu; Xingxing Wang, eHh3YW5nQGNzdS5lZHUuY24=

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.