- Department of Clinical Laboratory, The Third Hospital Of LongYan, LongYan, Fujian, China
Objective: To understand the global trends in depression and identify potential early risk factors for its detection.
Methods: This study is the first to integrate the 2021 Global Burden of Disease (GBD) data with machine learning techniques to explore the risk factors of adolescent depression. A machine learning-based model was constructed, and SHAP (SHapley Additive exPlanations) plots were utilized for interpretive analysis.
Results: From 1990 to 2021, the incidence and disability-adjusted life years (DALYs) of depression continued to rise globally among the 10–24 age group, particularly in high socio-demographic index(SDI) regions. Greenland, the United States of America, and Palestine had the highest rates of depression globally. Among the eight machine learning models evaluated, random forest (RF) proved to be the most reliable. SHAP analysis revealed that elevated levels of S100β (0.330), NSE (0.060), and PLT (0.031) significantly increased the risk of depression.
Conclusion: Our study shows an increasing trend of depression in the global 10–24 age group. Additionally, elevated S100β, NSE, and PLT are identified as key risk factors for depression.
1 Introduction
Depression disorder is not merely a short-term mood disturbance; it is a chronic and potentially recurrent mood disorder. Common symptoms include persistent low mood, loss of interest, difficulty concentrating, changes in appetite, fatigue, sleep disturbances, and self-criticism. With the rapid global economic development, depression among adolescents has become an increasingly serious issue, drawing public attention. According to data from the World Health Organization (WHO), suicide is the fourth leading cause of death worldwide, with depression being the primary contributor (1). Some studies predict that by 2030, depression may become the leading cause of disease burden among adolescents (2). Therefore, there is an urgent need to adopt effective strategies to address the growing burden of depression, particularly through early detection and intervention to reduce its long-term impact on the adolescent population.
Currently, traditional diagnostic methods for depression rely on the patient’s clinical symptoms, symptom history, and diagnostic criteria. The most commonly used diagnostic tools include the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) (3) and the International Classification of Diseases, 11th Edition (ICD-11) by the World Health Organization. However, the accuracy of these assessment tools is influenced by participants’ compliance and the subjective interpretation of healthcare professionals. Additionally, the consistency of diagnosis is challenging to guarantee due to differences in economic, cultural, and healthcare levels. Furthermore, these diagnostic methods exhibit several limitations in the context of adolescent mental health diagnosis: 1. The behavioral patterns and severity of impairment in adolescents often differ from those in adults. 2. The social and environmental context of adolescent development is not adequately considered. 3. The diagnostic process places excessive emphasis on internal factors, neglecting the unique characteristics of the adolescent population. 4. The distinction between normal and abnormal behavior is sometimes subjective and arbitrary. Therefore, this study aims to address these gaps and propose a new diagnostic framework or approach, with the goal of improving diagnostic accuracy for the adolescent population.
In recent years, machine learning (ML) technology has been widely applied in the detection of depression (4, 5). Compared to traditional methods, machine learning enables more precise analysis of complex relationships between variables. Through big data analysis, it effectively predicts the risk of depression (6). When handling complex data, machine learning can filter out more valuable information and provide more reliable judgments.
This study aims to explore the burden of depression in the 10–24 age group and, through objective blood biomarkers, exclude some subjective factors. By combining machine learning analysis, we seek to identify potential risk factors for depression. Through this approach, we hope to provide more effective support for the early detection and intervention of depression.
2 Methods
2.1 Data source
The data for this study were obtained from the Global Burden of Disease (GBD) 2021 study, coordinated by the Institute for Health Metrics and Evaluation (IHME). The GBD study collects data from a wide range of sources, including censuses, vital registration systems, household surveys, health service use statistics, disease registries, and published scientific literature. All data inputs were standardized and adjusted to improve comparability across time and locations. Data were processed using the GBD’s standardized analytical tools, including cause of death ensemble modeling (CODEm), DisMod-MR 2.1 for non-fatal estimation, and spatiotemporal Gaussian process regression (ST-GPR). Detailed data processing protocols are publicly available at the IHME GBD methodology website (http://www.healthdata.org/gbd/). IHME follows the University of Washington’s website terms and conditions: http://www.washington.edu/online/terms/.
In addition, for this study, the diagnostic criteria for 250 adolescent depression patients and 200 healthy controls from Longyan Third Hospital in China followed the depression diagnostic standards from DSM-5, consistent with those in the GBD database. All diagnoses of depression were confirmed by experienced clinical psychologists through interviews and clinical assessments, meeting the following criteria:
Unipolar Depression: Patients must have experienced significant symptoms of mood depression or loss of interest in the past two weeks, with other common symptoms including: 1. Reduced concentration and attention; 2. Lowered self-esteem; 3. Guilt feelings and sense of worthlessness (even in mild episodes); 4. Pessimism about the future; 5. Thoughts or behaviors of self-harm or suicide; 6. Sleep disturbances; 7. Reduced appetite.
Bipolar Disorder: Patients in the current depressive episode must also meet the diagnostic criteria for bipolar disorder, including at least one history of manic or hypomanic episodes. Bipolar I Disorder: At least one manic episode (may be accompanied by a depressive episode), with no threshold for hypomanic episodes (mania alone is sufficient for diagnosis). Bipolar II Disorder: At least one hypomanic episode + at least one depressive episode, with no history of manic episodes.
Healthy Control Group: No symptoms or history of depression, excluded based on the Hamilton Depression Scale (HAMD) and Hamilton Anxiety Scale (HAMA).
In addition, this study did not compare the small sample data with the GBD dataset. We used the GBD dataset to study the epidemiological trends of depression in the adolescent population and understand the increasing burden of depression in this group. Therefore, the development of machine learning methods provides more effective support for the early detection and intervention of depression in adolescents.
According to the 10 EPV principle (which requires at least 10 samples per predictor variable), with 450 samples, 250 events, and 15 predictor variables, the EPV = 16.67, which is well above the commonly recommended 10 EPV standard (7).
2.2 Data collection
Depression disorder is not just a mental disorder; its occurrence is closely related to various physiological and biochemical factors. It may be associated with neural damage and neuroinflammation (8), thyroid function (9), the body’s metabolism (10), inflammatory responses (11), and cardiovascular factors (12). Therefore, data were collected from laboratory test results, including age, gender, NSE, S100β, FT3, FT4, TSH, BUN, CRE, LDH, CK, CK-Mb, WBC, RBC, HGB, PLT, among other parameters. The detection of NSE, S100β, FT3, FT4, and TSH is performed using chemiluminescence immunoassay (CLIA), utilizing the Snibe MAGLUMI X8 analyzer (https://www.snibe.com/en/product/CLIA_analyzer/40.html) and its associated reagents. The measurements of BUN, CRE, LDH, CK, and CK-Mb are conducted using colorimetric methods, with the Mindray BS-2000M analyzer(https://www.mindray.com/en/products/laboratory-diagnostics/chemistry/large-test-volume/bs-2000m).The analysis of WBC, RBC, HGB, and PLT is carried out using impedance and flow cytometry methods, employing the Sysmex XS-1000i analyzer (https://www.sysmex-wca.com/products/products-detail/xs-1000i/) and its corresponding consumables. All participants had no organic damage or underlying diseases. The diagnosis of depression was based on DSM-5, and the severity of depressive symptoms was assessed using the 17-item Hamilton Depression Rating Scale (HAMD-17).Level 1: Mild depressive episode; Level 2: Moderate depressive episode; Level 3: Severe depressive episode without psychotic features; Level 4: Severe depressive episode with psychotic features (13). And it also applies to this region (14, 15). This study has been approved by the Ethics Committee of The Third Hospital Of LongYan, and informed consent has been obtained from the patients or their guardians.
2.3 Testing methods
CLIA:CLIA is an immunoassay technique based on the generation of light signals through chemical reactions. In CLIA, the binding of antibodies to antigens is marked with chemiluminescent substances (such as enzyme labels or other chemiluminescent probes). Under certain reaction conditions, the label emits detectable light. During the reaction, when the antigen (target substance) binds to the antibody, light signals are generated through reactions with enzymes or other chemical reagents. These light signals are quantified using a photometer or chemiluminescence analyzer.
Colorimetric methods:Colorimetric methods are a common analytical technique based on the relationship between the color change in a solution and the concentration of the sample. In colorimetric analysis, a reagent is typically used that reacts with the target molecule to produce a color change. When the target molecule (such as proteins, enzymes, or other chemicals) reacts with the reagent, the color of the solution changes. This change can be measured using instruments such as a spectrophotometer to obtain absorbance values. By using a standard curve relating absorbance to known concentrations, the target substance in the sample can be quantified.
Impedance method:The impedance method analyzes changes in impedance when cells or particles are exposed to an electric field. The movement and morphological changes of cells in the electric field affect the conductivity, and changes in impedance can provide information about the size, shape, number, and biological activity of the cells. The impedance method is suitable for cell counting, cell biology research, and cell viability testing. Typically, electrodes or sensors make contact with the cells in the sample to measure changes in resistance or capacitance.
Flow cytometry:Flow cytometry is an efficient single-cell analysis technique that analyzes scattered light and fluorescence signals produced when cells pass through a laser beam. When cells are suspended in liquid and pass through a light source, they interact with the light; the scattered light provides information about the cell’s size and granularity, while fluorescence signals can be used to detect specific molecules or antigens labeled within the cells. Flow cytometry allows high-speed, multi-parameter analysis of cells and can be used for cell counting, cell cycle analysis, immunophenotyping, and cell sorting.
2.4 Data processing
In the data analysis, the Kolmogorov-Smirnov test was used to assess the normality of the data. If the p-value is less than 0.05, the data are considered not to follow a normal distribution. normally distributed variables were described using absolute numbers and percentages, means, and standard deviations, while non-normally distributed variables were represented by medians and interquartile ranges. The analysis of categorical variables was performed using the chi-square test or Fisher’s exact test, and the comparison of continuous variables was done using the two-sample t-test or Mann-Whitney U test. All hypothesis tests were two-sided, and the significance level was set at α = 0.05. If the p-value is less than 0.05, the null hypothesis is rejected, and the result is considered statistically significant. A linear regression model was used to calculate the annual average percentage change (EAPC) value and its 95% confidence interval (CI). The criteria for determining trends were as follows: if EAPC>0, it indicated an increase in the variable over time; if EAPC<0, it indicated a decrease over time (16).
2.5 Model construction and evaluation
The 450 patients were randomly divided into a training set and a validation set in a 7:3 ratio. The ratio of 7:3 is commonly used as a rule of thumb when splitting a dataset into training and test sets (17). A univariate logistic regression was used, and the forest plot was constructed based on the training set. Eight machine learning algorithms were used to construct predictive models, including Decision Tree (DT), K-Nearest Neighbors (K-NN), Light Gradient Boosting Machine (LightGBM), Logistic Regression (LR), Naive Bayes (NB), Random Forest (RF), Support Vector Machine (SVM), and Extreme Gradient Boosting (XGBoost). The performance of each model was evaluated by comparing metrics such as AUC, F1 score, sensitivity, specificity, recall, and accuracy.Finally, this study used Random Forest for model construction. The parameters for the random forest are as follows: nestimators=500, nodesize=1, maxfeatures=3. For the development of the machine learning model, 5-fold cross-validation was used to train the model, which helped identify the optimal hyperparameters.
To further understand the prediction principles of the models, Shapley Additive Explanations (SHAP) plots were used. SHAP values provide a universal metric for explaining each feature’s contribution to the model’s predictions. By analyzing the SHAP value density scatter plots for each feature, the influence of each variable on the model output from the validation set was determined. The ranking of variables was based on the total SHAP values across all samples. All data analyses were performed using the statistical software R (version 4.4.2).
3 Results
3.1 Overall trend
From 1990 to 2021, the incidence of depression among the global population aged 10–24 years increased from 3085.52 (2307.81–4105.56) to 3909.88 (2841.30–5171.10), while the DALYs increased from 454.05 (291.66–657.43) to 567.77 (359.80–828.13). The Annual Average Percentage Change (EAPC) for incidence and DALYs were 0.16 (-0.06–0.39) and 0.21 (0.02–0.41), respectively.
In terms of gender distribution, the number of female patients was higher than that of males, but the increase in male incidence was more pronounced. The EAPC for males was 0.27 (0.06–0.49) for incidence and 0.31 (0.12–0.50) for DALYs, which was significantly higher than the corresponding values for females, which were 0.11 (-0.12–0.33) and 0.16 (-0.03–0.36), respectively. Regarding the Socio-Demographic Index (SDI), the incidence and DALYs in high-SDI regions were significantly higher than in other regions, with the EAPC growth rate far exceeding that of low and middle-SDI regions, which were 0.99 (0.74–1.24) and 0.93 (0.71–1.15), respectively (Table 1). Specifically, high-SDI regions have shown consistent growth since 1990, with an increase far exceeding other SDI-level regions, which is in line with the EAPC trend. From a time trend perspective, the global incidence and DALYs of depression remained stable and fluctuating until 2019. However, since the outbreak of the COVID-19 pandemic in 2019, this trend has significantly increased (Table 1, Figure 1).

Table 1. Incidence Rates, DALYs Rates of depressive disorders(10–24 year) by sex, SDI in 1990 -2021 (Based on the GBD dataset).

Figure 1. Temporal trend of incidence,DALYs rates for the burden of depressive disorders(10–24 year) by sex, SDI from 1990 to 2021. (Based on the GBD dataset).
At the national level, the top three countries with the highest incidence and DALYs were: Greenland, with an incidence of 15273.51 (10282.59–21274.47) and DALYs of 2114.78 (1289.81–3263.91); the United States of America, with an incidence of 10214.88 (8022.64–12897.19) and DALYs of 1431.52 (969.19–2070.3); and Palestine, with an incidence of 9287.97 (6090.49–13597.09) and DALYs of 1311.01 (765.05–2081.03) (Figure 2; Supplementary Table S1).

Figure 2. The incidence and disability-adjusted life years (DALYs) rates for depressive disorders(10-24 year) in 204 countries from 1990-2021. (Based on the GBD dataset) (A) Incidence; (B) DALYs.
3.2 General information of patients
A total of 450 patient data were collected in this study, including 200 non-depressed individuals (44.44%) and 250 depressed patients (55.56%). The gender distribution of patients was 206 males (45.78%) and 244 females (54.22%). The results indicated that the levels of NSE, S100β, WBC, and PLT in depressed patients were significantly higher than those in the non-depressed group, while FT3, BUN, and HGB levels were significantly lower in the depressed group (Table 2). Additionally, the levels of NSE and S100β increased progressively with the severity of depressive symptoms (Supplementary Table S2).
Univariate logistic regression analysis showed significant statistical differences between the two groups for PLT (1.011, 1.007–1.016), HGB (0.976, 0.961–0.991), WBC (1.323, 1.151–1.535), CRE (0.983, 0.967–0.999), BUN (0.625, 0.488–0.791), FT3 (1.136, 1.022–1.266), FT4 (0.598, 0.37–0.95), S100β (4.791, 3.41–7.294), and NSE (1.404, 1.293–1.543) (Figure 3).

Figure 3. The forest plot of the OR of the selected variables. Forest plot for outcome in Univariate Logistic Regression Analysis. (Based on 450 patients).
3.3 Machine learning model construction and evaluation
We applied the nine aforementioned variables to construct eight machine learning models, including DT, K-NN, LightGBM, LR, NB, RF, SVM, and XGBoost. The performance metrics of each model are shown as follows (Figure 4). Among all machine learning methods, the Random Forest (RF) model generally performed the best, demonstrating superior performance. Specifically, the RF model achieved the highest levels in F1 score (0.94), recall (0.95), negative predictive value (0.92), and sensitivity (0.95). Other metrics, such as accuracy (0.93), positive predictive value (0.93), and specificity (0.89), also remained at a high level. Ultimately, the RF model’s ROC curve area under the curve (AUC) was the highest, reaching 0.975 (0.955-0.996), indicating that RF might be the most reliable model among the eight machine learning methods, offering a good balance across all evaluation metrics. Additionally, the XGBoost model also exhibited strong performance, with results comparable to RF.

Figure 4. Performance metrics of the predictive model in the training and validation sets (based on 450 patients). (A) Training metrics; (B) Testing metrics; (C) ROC curve for training set; (D) ROC curve for testing set.
3.4 Variable importance and SHAP plot
We ranked the importance of each variable using the RF model and provided a visual interpretation through SHAP plots (Figure 5). The results indicated that S100β, NSE, and PLT were the top three most important variables. Specifically, S100β was identified as the most important variable, having the greatest impact on the predictive model, followed by NSE, with both variables showing similar levels of influence.

Figure 5. (A) Feature Importance of Random Forest (RF). (B) A visual interpretation through SHAP plots. In the plot, yellow represents higher feature values, while red indicates lower feature values. (Based on 450 patients).
The SHAP plot provides an in-depth analysis of model predictions. Through the SHAP value density scatter plot for each feature, we can assess the extent of each variable’s influence on the model output for individual samples in the validation set. The variables are ranked according to the total SHAP value sum across all samples. In the plot, yellow represents higher feature values, while red indicates lower feature values. The SHAP values on the x-axis represent the contribution of each variable to the model’s output. Negative SHAP values suggest that the variable has a protective effect against depression, whereas positive SHAP values indicate an increased likelihood of depression. The results showed that the SHAP value for S100β was 0.330, indicating that an increase in S100β significantly raised the risk of depression. The next most influential variables were NSE (0.060) and PLT (0.031).
4 Discussion
In this study, we utilized the GBD database to investigate the trends and burden of depression among individuals aged 10–24 from 1990 to 2021. The results indicate a significant upward trend in both the incidence of depression and DALYs in this age group, particularly during the COVID-19 pandemic, when this trend became more pronounced (18, 19). These data reflect the increasing burden of depression among young people aged 10-24. Due to global differences in economy, culture, and education levels, there may be biases in the depression measurement tools, leading to inaccurate assessments (20, 21). Therefore, to detect early signs of depression in the youth population, we adopted blood biomarkers and developed a simple predictive model using eight machine learning methods. The model effectively explained the relevant indicators for early depression, with the Random Forest (RF) model demonstrating the best performance, achieving the highest levels in F1 score (0.94), recall (0.95), negative predictive value (0.92), sensitivity (0.95), and AUC (0.975), indicating its reliability for this classification task. To further interpret the model, we used SHAP plots to identify the key variables influencing depression risk. The results showed that S100β and NSE were the most important variables, followed by PLT.
This study also found significant differences in depression incidence across different countries and regions with varying SDI, particularly in high-SDI areas. In economically developed regions, factors such as intense competition, high levels of academic and work-related stress, high living costs, and fast-paced lifestyles may contribute to the onset of depression, especially in the United States, where these issues have caused a substantial societal burden (22).The possible contributing factors to depression are as follows:1. Academic and Employment Pressure: In high-SDI regions, especially in cities with intense educational competition, students often face high-intensity study tasks and anxiety about future employment. This pressure leads to a high incidence of depression among adolescents.2. Impact of Social Media: The widespread use of smartphones, especially among adolescents, and the prolonged immersion in the virtual world result in a lack of real-life social interactions, which leads to feelings of loneliness, anxiety, and emotional distress, potentially triggering depression.3. Work pressure and career competition: In high-SDI regions, the pressure of career advancement, long working hours, and the fear of unemployment make work stress a significant factor in the development of depression (23). 4.Excessive information consumption: In the era of short videos, adolescents often lack the ability to distinguish false information, which causes a significant psychological gap and leads to feelings of inferiority and a self-destructive mentality.
Meanwhile, low-SDI regions face a lack of medical resources and insufficient awareness and education about depression, leading to underreporting and data imbalances. Some studies suggest that individuals with lower educational attainment are more prone to depression (24, 25), while higher education levels are associated with improved socioeconomic status, better employment and income opportunities, and greater attention to physical health, all of which contribute to a reduced incidence of depression (25, 26). However, these studies often overlook the educational stages, particularly during high school and university, when adolescents are at a heightened risk for depression (27, 28).
With the continued increase in the burden of depression among the youth population and the influence of cultural and economic differences across countries, a single measurement tool may not accurately assess depressive symptoms. In this study, we employed machine learning methods to identify the most important risk factors for depression, with S100β and NSE emerging as key indicators. Several studies have shown that S100β and NSE are biomarkers of neuronal damage (29, 30). Additionally, S100β is associated with brain neuroinflammation, such as in conditions like epilepsy (31, 32). A substantial body of evidence suggests that depression is not only a mental health disorder but also involves neuroinflammation, as well as damage to neurons and glial cell functions (33). Both genetic and environmental factors are important contributors to depression. The destruction and activation of microglia trigger neuroinflammation, which may be a crucial pathological mechanism of depression (34, 35).Microglial cells regulate neuronal function and survival through direct interactions with neurons (36) and the release of cytokines such as IL-1β and IL-6 (37). Additionally, activated microglia alter the brain’s microenvironment by releasing molecules that influence neuronal activity (8). These changes ultimately result in neuronal damage. Upon cellular injury, S100β protein is released (38). S100β interacts with pattern recognition receptors (PRRs) on immune cells, activating the nuclear factor kappa-light-chain-enhancer of activated B cells (NF-κB) signaling pathway (39), which further promotes microglial activation, driving them into a pro-inflammatory state. This leads to the release of additional inflammatory cytokines, exacerbating the immune response and intensifying neuronal damage.
We also identified the importance of PLT in depression and found that PLT levels were significantly higher in patients with depression compared to those without. Some studies suggest that patients with depression have lower levels of Omega-3 polyunsaturated fatty acids, and Omega-3 fatty acids may aid in the treatment of depression (40, 41). Omega-3 polyunsaturated fatty acids reduce and improve neuroinflammation by modulating arachidonic acid (AA) and cytokines. The main metabolite of arachidonic acid is thromboxane A2, which promotes platelet aggregation and vasoconstriction.
Overall, this study reveals the increasing trend in the incidence and DALYs of depression among individuals aged 10–24 globally, and through machine learning methods, we identified key risk factors for depression, such as S100β, NSE, and PLT, providing simple and effective auxiliary tools for the early detection of depression. Therefore, we recommend that biomarkers such as S100β and NSE be included in routine screening for adolescent depression, alongside traditional psychological screening tools. This approach could enhance diagnostic accuracy and facilitate earlier interventions, ultimately improving the mental health of the adolescent population.
However, this study has several limitations. First, global data were sourced from the GBD database, but differences in geography, culture, and economic levels across regions may lead to imbalances in the attention given to and reporting of depression, potentially affecting the representativeness of the data. Second, the clinical data were obtained from a single hospital, which raises concerns regarding the generalizability of the findings; therefore, further multi-center studies are needed to validate the results. Additionally, the limited sample size may increase the risk of overfitting in the machine learning model, which could affect its generalization ability. Future research should address potential overfitting issues and conduct external validation to enhance the robustness and applicability of the model.
In conclusion, our study indicates that depression rates among the global youth population aged 10–24 are on the rise. Additionally, elevated levels of S100β, NSE, and PLT are identified as risk factors for the development of depression.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by Ethics Committee of The Third Hospital Of LongYan. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.
Author contributions
YG: Writing – original draft. HL: Writing – original draft. AC: Writing – original draft, Data curation. JG: Investigation, Writing – original draft. YL: Investigation, Writing – original draft. ZL: Writing – review & editing, Funding acquisition, Writing – original draft, Supervision.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This project has been supported by the Longyan Science and Technology Bureau(2023LYF17101).
Acknowledgments
We thank Tongdeng you(Fuzhou Changle District People’s Hospital) for their assistance.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2025.1594074/full#supplementary-material
References
1. Organization WH. Mental health of adolescents (2024). Available online at: https://wwwwhoint/zh/news-room/fact-sheets/detail/adolescent-mental-health (Accessed October 10, 2024).
2. Patel V, Saxena S, Lund C, Thornicroft G, Baingana F, Bolton P, et al. The Lancet Commission on global mental health and sustainable development. Lancet (London England). (2018) 392:1553–98. doi: 10.1016/s0140-6736(18)31612-x
3. American Psychiatric Association, DSM-5 Task Force. Diagnostic and statistical manual of mental disorders: DSM-5™ (5th ed.). American Psychiatric Publishing, Inc. (2013). doi: 10.1176/appi.books.9780890425596
4. Ji J, Dong W, Li J, Peng J, Feng C, Liu R, et al. Depressive and mania mood state detection through voice as a biomarker using machine learning. Front neurology. (2024) 15:1394210. doi: 10.3389/fneur.2024.1394210
5. Liang L, Wang Y, Ma H, Zhang R, Liu R, Zhu R, et al. Enhanced classification and severity prediction of major depressive disorder using acoustic features and machine learning. Front Psychiatry. (2024) 15:1422020. doi: 10.3389/fpsyt.2024.1422020
6. Nemesure MD, Heinz MV, Huang R, and Jacobson NC. Predictive modeling of depression and anxiety using electronic health records and a novel machine learning approach with artificial intelligence. Sci Rep. (2021) 11:1980. doi: 10.1038/s41598-021-81368-4
7. Peduzzi P, Concato J, Kemper E, Holford TR, and Feinstein AR. A simulation study of the number of events per variable in logistic regression analysis. J Clin Epidemiol. (1996) 49:1373–9. doi: 10.1016/s0895-4356(96)00236-3
8. Malau IA, Chang JP, Lin YW, Chang CC, Chiu WC, and Su KP. Omega-3 fatty acids and neuroinflammation in depression: targeting damage-associated molecular patterns and neural biomarkers. Cells. (2024) 13(21):1791. doi: 10.3390/cells13211791
9. Chen S, Wang J, Lang X, and Zhang XY. Clinical correlates, lipid metabolic parameters and thyroid hormones are associated with abnormal glucose metabolism in first-episode and drug-naïve major depressive disorder patients with suicide attempts: A large cross-sectional study. J Affect Disord. (2025) 380:10–6. doi: 10.1016/j.jad.2025.03.102
10. Zhu N, Faucon AL, Kuja-Halkola R, Landén M, Xu H, Carrero JJ, et al. Prevalence of severe mental illness and its associations with health outcomes in patients with CKD: A swedish nationwide study. Am J Kidney diseases: Off J Natl Kidney Foundation. (2025) 85:577–88.e1. doi: 10.1053/j.ajkd.2024.12.004
11. Pethő B, Kovács M, Simon D, Tóth T, Hajnal AS, Csulak T, et al. Investigation of peripheral inflammatory biomarkers in association with suicide risk in major depressive disorder. Front Psychiatry. (2024) 15:1321354. doi: 10.3389/fpsyt.2024.1321354
12. Zeng J, Qiu Y, Yang C, Fan X, Zhou X, Zhang C, et al. Cardiovascular diseases and depression: A meta-analysis and Mendelian randomization analysis. Mol Psychiatry. (2025). doi: 10.1038/s41380-025-03003-2
13. Zimmerman M, Martinez JH, Young D, Chelminski I, and Dalrymple K. Severity classification on the hamilton depression rating scale. J Affect Disord. (2013) 150:384–8. doi: 10.1016/j.jad.2013.04.028
14. Sun XY, Li YX, Yu CQ, and Li LM. Reliability and validity of depression scales of Chinese version: a systematic review. Zhonghua liu xing bing xue za zhi = Zhonghua liuxingbingxue zazhi. (2017) 38:110–6. doi: 10.3760/cma.j.issn.0254-6450.2017.01.021
15. Zheng YP, Zhao JP, Phillips M, Liu JB, Cai MF, Sun SQ, et al. Validity and reliability of the chinese hamilton depression rating scale. Br J psychiatry: J Ment science. (1988) 152:660–4. doi: 10.1192/bjp.152.5.660
16. Graham SM, Walsh AL, Molyneux EM, Phiri AJ, and Molyneux ME. Clinical presentation of non-typhoidal Salmonella bacteraemia in Malawian children. Trans R Soc Trop Med Hygiene. (2000) 94:310–4. doi: 10.1016/s0035-9203(00)90337-7
17. Lee SC, An C, Yoo J, Park S, Shin D, and Han CH. Development and validation of a nomogram to predict pulmonary function and the presence of chronic obstructive pulmonary disease in a Korean population. BMC pulmonary Med. (2021) 21:32. doi: 10.1186/s12890-021-01391-z
18. Quesada-Puga C, Cañadas GR, Gómez-Urquiza JL, Aguayo-Estremera R, Ortega-Campos E, Romero-Béjar JL, et al. Depression in nursing students during the COVID-19 pandemic: Systematic review and meta-analysis. PloS One. (2024) 19:e0304900. doi: 10.1371/journal.pone.0304900
19. Al Maqbali M, Alsayed A, Hughes C, Hacker E, and Dickens GL. Stress, anxiety, depression and sleep disturbance among healthcare professional during the COVID-19 pandemic: An umbrella review of 72 meta-analyses. PloS One. (2024) 19:e0302597. doi: 10.1371/journal.pone.0302597
20. Stevanovic D, Jafari P, Knez R, Franic T, Atilola O, Davidovic N, et al. Can we really use available scales for child and adolescent psychopathology across cultures? A systematic review of cross-cultural measurement invariance data. Transcultural Psychiatry. (2017) 54:125–52. doi: 10.1177/1363461516689215
21. Green HE, Gabel LN, Stewart EK, Kotelnikova Y, and Hayden EP. Latent structure and measurement invariance of the Depression Self-Rating Scale for Children across sex and age. psychol assessment. (2024) 36:552–61. doi: 10.1037/pas0001327
22. Greenberg PE, Fournier AA, Sisitsky T, Pike CT, and Kessler RC. The economic burden of adults with major depressive disorder in the United States (2005 and 2010). J Clin Psychiatry. (2015) 76:155–62. doi: 10.4088/JCP.14m09298
23. Kachi Y, Inoue A, Eguchi H, Kawakami N, Shimazu A, and Tsutsumi A. Occupational stress and the risk of turnover: a large prospective cohort study of employees in Japan. BMC Public Health. (2020) 20:174. doi: 10.1186/s12889-020-8289-5
24. Vu T, Dawadi R, Yamamoto M, Tay JT, Watanabe N, Kuriya Y, et al. Prediction of depressive disorder using machine learning approaches: findings from the NHANES. BMC Med Inf decision making. (2025) 25:83. doi: 10.1186/s12911-025-02903-1
25. Bauldry S. Variation in the protective effect of higher education against depression. Soc Ment Health. (2015) 5:145–61. doi: 10.1177/2156869314564399
26. Patria B. The longitudinal effects of education on depression: Finding from the Indonesian national survey. Front Public Health. (2022) 10:1017995. doi: 10.3389/fpubh.2022.1017995
27. Blossom JB, Adrian MC, Stoep AV, and McCauley E. Mechanisms of change in the prevention of depression: an indicated school-based prevention trial at the transition to high school. J Am Acad Child Adolesc Psychiatry. (2020) 59:541–51. doi: 10.1016/j.jaac.2019.05.031
28. Chowdhury AI, Ghosh S, Hasan MF, Khandakar KAS, and Azad F. Prevalence of insomnia among university students in South Asian Region: a systematic review of studies. J Prev Med hygiene. (2020) 61:E525–e9. doi: 10.15167/2421-4248/jpmh2020.61.4.1634
29. Qu Y, Jin H, Abuduxukuer R, Qi S, Si XK, Zhang P, et al. The association between serum S100β levels and prognosis in acute stroke patients after intravenous thrombolysis: a multicenter prospective cohort study. BMC Med. (2024) 22:304. doi: 10.1186/s12916-024-03517-6
30. Langeh U and Singh S. Targeting S100B protein as a surrogate biomarker and its role in various neurological disorders. Curr neuropharmacology. (2021) 19:265–77. doi: 10.2174/1570159x18666200729100427
31. Kalsariya RA, Kavila D, Shorter S, Negi D, Goodall ICA, Boussios S, et al. Molecular biomarkers of glial activation and injury in epilepsy. Drug Discov Today. (2025) 30:104289. doi: 10.1016/j.drudis.2025.104289
32. Pleines UE, Morganti-Kossmann MC, Rancan M, Joller H, Trentz O, and Kossmann T. S-100 beta reflects the extent of injury and outcome, whereas neuronal specific enolase is a better indicator of neuroinflammation in patients with severe traumatic brain injury. J neurotrauma. (2001) 18:491–8. doi: 10.1089/089771501300227297
33. Gao C, Jiang J, Tan Y, and Chen S. Microglia in neurodegenerative diseases: mechanism and potential therapeutic targets. Signal transduction targeted Ther. (2023) 8:359. doi: 10.1038/s41392-023-01588-0
34. Wang H, He Y, Sun Z, Ren S, Liu M, Wang G, et al. Microglia in depression: an overview of microglia in the pathogenesis and treatment of depression. J neuroinflammation. (2022) 19:132. doi: 10.1186/s12974-022-02492-0
35. Li B, Yang W, Ge T, Wang Y, and Cui R. Stress induced microglial activation contributes to depression. Pharmacol Res. (2022) 179:106145. doi: 10.1016/j.phrs.2022.106145
36. Cserép C, Pósfai B, and Dénes Á. Shaping neuronal fate: functional heterogeneity of direct microglia-neuron interactions. Neuron. (2021) 109:222–40. doi: 10.1016/j.neuron.2020.11.007
37. Yoshimura R, Hori H, Ikenouchi-Sugita A, Umene-Nakano W, Ueda N, and Nakamura J. Higher plasma interleukin-6 (IL-6) level is associated with SSRI- or SNRI-refractory depression. Prog Neuropsychopharmacol Biol Psychiatry. (2009) 33:722–6. doi: 10.1016/j.pnpbp.2009.03.020
38. Gonzalez LL, Garrie K, and Turner MD. Role of S100 proteins in health and disease. Biochim Biophys Acta Mol Cell Res. (2020) 1867:118677. doi: 10.1016/j.bbamcr.2020.118677
39. Liu T, Zhang L, Joo D, and Sun SC. NF-κB signaling in inflammation. Signal transduction targeted Ther. (2017) 2:17023–. doi: 10.1038/sigtrans.2017.23
40. Parletta N, Zarnowiecki D, Cho J, Wilson A, Procter N, Gordon A, et al. People with schizophrenia and depression have a low omega-3 index. Prostaglandins leukotrienes essential Fatty Acids. (2016) 110:42–7. doi: 10.1016/j.plefa.2016.05.007
Keywords: depressive disorder, machine learning, S100β, NSE, GBD
Citation: Guo Y, Lu H, Chen A, Guo J, Lai Y and Lu Z (2025) The burden of depressive disorder among the global 10–24 age group and the construction of an early risk factors model. Front. Psychiatry 16:1594074. doi: 10.3389/fpsyt.2025.1594074
Received: 15 March 2025; Accepted: 27 May 2025;
Published: 16 June 2025.
Edited by:
Thien Vu, National Institutes of Biomedical Innovation, Health and Nutrition, JapanReviewed by:
Sri Idaiani, National Research and Innovation Agency (BRIN), IndonesiaYangguang Lu, Wenzhou Medical University, China
Copyright © 2025 Guo, Lu, Chen, Guo, Lai and Lu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Zhengyou Lu, MTMzOTU5NzUzOTZAMTYzLmNvbQ==