ORIGINAL RESEARCH article

Front. Psychiatry, 21 May 2019

Sec. Mood Disorders

Volume 10 - 2019 | https://doi.org/10.3389/fpsyt.2019.00315

The Reliability and Validity of the Center for Epidemiologic Studies Depression Scale (CES-D) for Chinese University Students

  • 1. Department of Public Health and Preventive Medicine, School of Basic Medicine, Jinan University, Guangzhou, China

  • 2. Division of Medical Psychology and Behavior Science, School of Basic Medicine, Jinan University, Guangzhou, China

  • 3. Medical Imaging Center, First Affiliated Hospital of Jinan University, Guangzhou, China

  • 4. International School, Jinan University, Guangzhou, China

  • 5. Center for Brain Science and Brain-Inspired Intelligence, Guangdong–Hong Kong–Macao Greater Bay Area, Guangzhou, China

Article metrics

View details

283

Citations

43,4k

Views

9,9k

Downloads

Abstract

Aims: Depression is prevalent among university students worldwide, and the prevalence appears to be increasing. As an intermediate stage between being healthy and having depression, students with subthreshold depression could develop worsening depression or recover with intervention to prevent depression. The Center for Epidemiologic Studies Depression Scale (CES-D) is a useful tool to assess subthreshold depression. The primary purpose of the current study was to evaluate the psychometric characteristics of CES-D in Chinese university students. Secondly, we aimed to describe the prevalence of subthreshold depression among the student sample and examine its demographic correlates.

Methods: A total of 2,068 university students participated in the study, and they were asked to respond to the Chinese CES-D, Beck Depression Inventory-II (BDI-II), and Positive and Negative Affect Schedule (PANAS). The factor structure was evaluated by conducting exploratory (EFA) and confirmatory factor analysis (CFA) using a structural equation modeling approach. The reliability was assessed by calculating Cronbach’s alpha, inter-item correlation, and item-total correlation coefficients. The prevalence of subthreshold depression was calculated and demographic correlates of gender, grade, and major were examined by multiple regression.

Results: The final sample included 1,920 participants. The EFA results suggested extraction of three factors (somatic symptoms, negative affect, and anhedonia) that account for 52.68% of total variance. The CFA results suggested that the newly derived model with 14 items was the best fit for our data. Six items were removed from the original scale (item 9, 10, 13, 15, 17, and 19). The Cronbach’s alpha of the 14-item CES-D was 0.87. The prevalence of subthreshold depression among university students reached 32.7% for the 20-item CES-D and 31% for the 14-item CES-D, although there was no significant difference of prevalence in gender, grade, and major.

Conclusions: The CES-D has good reliability and validity for assessing subthreshold depression in Chinese university students.

Introduction

Depression is a common but serious mental illness typically characterized by sad, hopeless, or anxious feelings. The age of onset for depression has been falling, making university students particularly vulnerable to developing depression (1). In addition, university can be a challenging time for students, as students struggle with leaving home for the first time, living independently, increasing academic pressures, forming new relationships, and making important decisions. A growing body of evidence suggests that depression is prevalent among university students worldwide (2, 3), and this prevalence appears to be increasing. Untreated depression can persist for a long period, which may interfere with students’ daily lives, including academic performance and social functioning (4). In severe cases, depression may induce substance abuse (5, 6) and suicide (7). Recognition of the warning signs and early diagnosis of depression are, therefore, crucial for treating depressive symptoms and preventing depression from returning.

A prodromal phase of depression is regarded as subthreshold depression, in which depressive symptoms do not meet the criteria for a major depressive disorder (8). As an intermediate stage between being healthy and having depression, individuals with subthreshold depression could be worsened and develop depression (9) or recover with intervention and finally depression could be prevented (10). A useful tool to assess subthreshold depression is the Center for Epidemiologic Studies Depression Scale (CES-D), which was designed for use in epidemiology studies to assess degrees of depressive symptoms and detect at-risk individuals for depression in the general population (11). The CES-D is a self-rating 20-item scale with a recommended threshold score of 16 for indicating the presence of subthreshold depression. The current literature on CES-D has reported at least 20 factor solutions in different populations and subpopulations. Several items were questioned on their validity and psychometric properties. For instance, item 17 (“I had crying spells”) is biased by gender, as suggested by the differential item functioning analyses (1214). The two items (item 15 “People were unfriendly” and item 19 “I felt that people dislike me”) measure interpersonal problems, which are not consistent with theories of depression and widely used diagnosis criteria for depression (15, 16). The CES-D seems to be the only test that includes an interpersonal factor, but the other widely used instruments, such as the Beck Depression Inventory (BDI), the Hamilton Rating Scale for Depression (HRSD), and the Zung Self-rating Depression Scale (SDS) (16) do not have such a factor. The CES-D has demonstrated good reliability and validity across various Chinese populations, such as those who attempt suicide (17), patients with type 2 diabetes (18), primary care patients (19), and the elderly community (20). It is important to examine the reliability and validity of CES-D Chinese version in university students in order to advance detection and intervention of subthreshold depression.

The primary purpose of the current study was to evaluate the psychometric characteristics of CES-D in Chinese university students. Secondly, we aimed to describe the prevalence of subthreshold depression among the student sample and examine its demographic correlates.

Methods

Participants and Procedure

The study was conducted in Guangzhou in Southeastern China. There are more than ten universities in Guangzhou, but only five of them are comprehensive universities that include a variety of majors, such as literature and management, science and engineering, and medicine. Among the five comprehensive universities, two are national key universities and the other three are ordinary universities. In order to obtain a representative sample, we randomly selected one from two national key universities and one from three ordinary universities. We recruited students from one national key university (Jinan University) and one ordinary university (Guangzhou University) as the study participants. A stratified cluster selection strategy was used to recruit the participants. We stratified the sample into three majors: literature and management, science and engineering, and medicine. Five classes of each major were randomly selected during the 2016/2017 academic year, and all students from the selected classes were invited to participate in the study. We personally contacted the students and invited them to participate in their respective classrooms after the end of a class. Several studies raised problems associated with small samples in factor analysis and suggested large sample size. For instance, the effect of sample size on the results of factor analysis was empirically tested and the authors reported that larger samples tend to produce more accurate solutions (21). As the sample size increases, sampling error is reduced, factor analysis solutions become more stable and more reliably produce the factorial structure of the population (22). Given that the recommendations regarding sample size for factor analysis, a large sample size is expected (23). In total, 2,068 students were recruited in the study as participants.

Instruments

The Center for Epidemiologic Studies Depression Scale

The CES-D scale was developed to screen for depression by measuring the frequency of events and ideas over the past week (11). The CES-D scale is a 20-item instrument with each item rated on a four-point scale ranging from 0 (“rarely or none of the time”) to 3 (“most or all of the time”). Four of the items are positive statements which are inversely scored for calculating the total score. The total score ranges from 0 to 60 and a higher score indicates a greater risk of depression. For the original CES-D scale, a total score of 16 or greater is considered as indicative of subthreshold depression (11). However, a number of studies have evaluated the diagnostic accuracy of the CES-D to detect depression at the general population and proposed a variety of cut-off scores, such as a cut-off score of 18 among a very old population living in residential homes (24), a cut-off score of 21 among type 2 diabetes and primary care patients (18), and a cut-off score of 22 among elders (25). Using a meta-analytic approach, a previous study systematically reviewed 28 CES-D studies and proposed an optimal cut-off score of 20 with sensitivity of 0.83, specificity of 0.78, and diagnostic odds ratio of 16.64 (26). The present study adopted a cut-off score of 20 for detecting subthreshold depression. The CES-D had been validated in a variety of Chinese samples. For instance, good reliability was demonstrated in suicide attempters and residents with Cronbach’s alpha values of 0.940 and 0.895, and a three-factor with 14 items was the best fit (17). Similarly, a sample of 3,686 primary care patients demonstrated good internal consistency (ခωH = 0.855) and good test–retest reliability (ICC = 0.91), and a bi-factor structure with 20 items was the best fit (19). The previous Chinese version (19, 27) was used and it was verified by back-translation (Supplementary Material).

The Beck Depression Inventory-II

The BDI-II was developed to screen for depression and it has been widely used to measure the severity of depression (28). The scale consists of 21 items, and each item is rated on a four-point scale ranging from 0 (“I do not feel sad”) to 3 (“I am so sad or unhappy that I can’t stand it”). The subscales of BDI-II consisted of somatic-affective (item 15, 16, 18–20) and cognitive factor (item 1–14, 17, 21) for undergraduate students (29). The total score ranges from 0 to 63, and a higher score suggests more severe depression. The severity of depression can be categorized into minimal depression (score 0 to 13), mild depression (score 14 to 19), moderate depression (score 20 to 28), and severe depression (score 29 to 63) (28). The Chinese version of BDI-II (30) had been validated in university students in mainland China (31) and Taiwan (32) with Cronbach’s alphas of 0.85 and 0.88, respectively.

The Positive and Negative Affect Schedule

The Positive and Negative Affect Schedule (PANAS) has been widely used to measure both positive and negative affect (33). The questionnaire contains two 10-item scales, and each item is rated on a five-point scale ranging from 1 (“very slightly or not at all”) to 5 (“very much”). The total score ranges from 10 to 50 for both positive and negative affect, and a higher score indicates a higher positive emotion and a higher negative emotion, respectively. The Chinese version of PANAS (34) had been validated in residents from community, with Cronbach’s alpha for positive and negative affect of 0.85 and 0.83, respectively.

Data Analyses

The potential gender bias of item 17 was evaluated by estimating the differential item functioning using an item response theory approach (12). We produced the non-parametric item characteristic curves that were smoothed with a Gaussian kernel using jMetrik 4.1.1. An exploratory factor analysis (EFA) was followed by a confirmatory factor analysis (CFA) by splitting the data set into halves. We performed an EFA with the first half (Sample One) and then used the results to fit a CFA model to the second half of the data (Sample Two). EFA was performed using principal component analysis and oblique promax rotation. According to the statistics literature, a factor loading of 0.5 is used as the cut-off score for the most accepted norm of EFA (23, 35). CFA was performed using the weighted least squares with mean and variance adjustment (WLSMV) estimator (36, 37). All CFA models were estimated using Mplus 7.4 software, and the loading of the first indicator in each factor is automatically fixed to be 1.0. The model derived by EFA and five recommended models (11, 12, 18, 38, 39) were evaluated by CFA with and without the three items (item 15, 17, and 19). Multiple indices for fitness were used: root mean squared error of approximation (RMSEA) must be less than 0.08, with 90% confidence interval values below 0.10, Tucker–Lewis index (TLI) and comparative fit index (CFI) must be greater than 0.90 (40), and change in chi-square given the change in degrees of freedom should be less than 5.0. The Akaike information criterion (AIC) and Bayesian information criterion (BIC) were used to compare the non-nested competing models. Based on a structural equation modeling (SEM) approach, we calculated average variance extracted (AVE) and composite reliability (CR) (41, 42). Correlation coefficients between subscale scores of CES-D, BDI-II, and PANAS were calculated. To investigate the relationships between the underlying constructs of the CES-D and BDI-II, we built a two-factor measurement model to explore the latent structure of the CES-D and BDI-II in Sample Two (43), and this was checked by CFA. The reliability was evaluated by calculating Cronbach’s α, inter-item correlation, and corrected item-total correlation coefficients. The prevalence of subthreshold depression in the university student sample was calculated using a cut-off score of 20 for the 20-item CES-D (26). In addition, we performed receiver operator characteristic (ROC) analysis to determine the optimal cut-off score for the revised CES-D and calculated the prevalence. The relationship between the CES-D and demographic correlates was investigated by multiple regression.

Ethics

This study was carried out in accordance with the recommendations of the Declaration of Helsinki with written informed consent from all subjects. The protocol was approved by the Ethics Committee of the School of Medical Science at Jinan University, China. For students with positive results, a notice indicating that they are at risk of subthreshold depression and developing depression was given. We further provided some guidance and suggestions, such as gaining access to university counseling service and talking to a friend or a family member.

Results

Demographic Characteristics

A total of 2,200 questionnaires were distributed, and 2,068 questionnaires were returned (94.00%). Invalid questionnaires with any questions unanswered were excluded and the final sample included 1,920 questionnaires (92.84%). Of these, 710 (36.98%) were male and 1,210 (63.02%) were female. There were 1,315 (68.49%) junior grade students (freshman and sophomore) and 605 (31.51%) senior grade students (junior, senior and fifth grade). Regarding the major, 592 (30.83%) literature and management, 619 (32.24%) science and engineering, and 709 (36.93%) medicine majors were included in the study. The average age of the sample in years was 20 (SD = 1.68).

Differential Item Functioning Analyses

To verify that item 17 (“I had crying spells”) produced gender bias, differential item functioning analysis was conducted. As shown in Figure 1, the item characteristic curve of men differed markedly from the curve of women, suggesting that women were more likely to choose a higher response option than men. For the remaining items, their item characteristic curves demonstrated negligible difference between the male and female group. An example of item 7 was also presented in Figure 1 for illustrative purposes.

Figure 1

Figure 1

Item characteristic curves for items 17 and 7.

Psychometric Properties of the Chinese CES-D

Each of the 1,920 participants was randomly assigned to Sample One or Sample Two. As a result, 963 and 957 participants were randomly assigned to Sample One and Sample Two, respectively. Results of independent t tests and chi-square tests indicated that the students in the two samples were not different with regard to CES-D total score (t = 1.068, p = 0.286), gender (χ2 = 1.536, p = 0.215), grade (χ2 = 0.002, p = 0.965), and major (χ2 = 2.524, p = 0.283). The results suggested that the random split assignment is appropriate. An initial EFA, including all 20 items, was conducted. A series of statistics indicated that EFA is appropriate for the current dataset, including Kaiser–Meyer–Olkin (KMO) = 0.939 and Bartlett’s p < 0.001. The EFA results on Sample One suggested extraction of three factors, which accounts for 52.68% of total variance. The three factors were somatic symptoms (item 1∼3, 5∼7, 11), negative affect (item 14, 15, 17∼20), and anhedonia (item 4, 8, 12, 16). Three of the 20 items, including items 9, 10, and 13, had weak factor loadings (<0.5), suggesting removal of the three items (23, 35). Pattern and structure coefficients are presented in Table 1. The model derived by EFA and five previously reported models were evaluated by CFA with and without the three items (item 15, 17, and 19). The fit indices are summarized in Table 2. Two of the seven models fit the data well, that is, the Carleton model and model derived by EFA but without the three items. Considering the AIC and BIC values as well as the difference in the AIC and BIC values, this suggests that the model derived by EFA but without the three items fit the data best. The CR values for the three factors were 0.855 (somatic symptoms), 0.794 (depressed affect), and 0.804 (anhedonia), suggesting satisfactory construct reliability. The AVE values were 0.465 (somatic symptoms), 0.562 (depressed affect), and 0.510 (anhedonia), suggesting acceptable convergent validity. The graphical expression of the path diagram of the revised EFA model was presented in Figure 2. The factor loadings for each item ranged from 0.499 to 0.860.

Table 1

CES-D ItemPattern coefficientsStructure coefficients
Factor 1Factor 2Factor 3Factor 1Factor 2Factor 3
180.770.800.51
190.730.790.51
170.730.66
150.670.69
140.610.68
200.600.67
100.470.660.59
90.380.640.540.61
160.800.81
80.790.79
120.730.76
40.720.71
50.840.74
70.740.80
60.600.580.76
110.580.58
10.570.510.67
20.530.54
30.500.570.68
130.360.530.55

Pattern and Structure Matrices.

Factor 1: Negative affect; Factor 2: Anhedonia; Factor 3: Somatic symptoms. CES-D, Center for Epidemiologic Studies Depression Scale.

Table 2

ModelFactors (items)WLSMV χ2/dfTLICFIRMSEA (90% CI)AICBIC
Radloff et al. (11)4 (20)6.130.9390.9480.073 (0.069, 0.078)38,926.00639,247.017
Yen et al. (38)3 (17)5.840.9480.9550.071 (0.066, 0.076)33,887.85334,150.499
Lee et al. (39)2 (19)10.040.8990.9110.097 (0.093, 0.102)37,373.73837,655.839
Carleton et al. (12)3 (14)4.020.9700.9750.056 (0.050, 0.063)28,365.53628,584.407
Zhang et al. (18)4 (20)5.980.9410.9490.072 (0.068, 0.076)38,905.70839,226.719
Newly derived model3 (17)5.280.9540.9600.067 (0.062, 0.072)33,534.18533,796.830
The revised EFA model3 (14)3.740.9720.9780.053 (0.047, 0.060)28,342.43528,561.306

CFA fit indices and model comparisons for CES-D.

CFA, confirmatory factor analysis.

Figure 2

Figure 2

Path diagram of the revised exploratory factor analysis (EFA) model.

The mean scores of the 14-item CES-D, BDI-II, positive affect, and negative affect were 12.38 (SD = 6.89), 9.80 (SD = 8.90), 27.39 (SD = 7.08), and 17.75 (SD = 6.18), respectively. Correlations between the 14-item CES-D, BDI-II, and PANAS are presented in Table 3. The Chinese CES-D scores were significantly correlated with the BDI-II (r = 0.74, P < 0.01), positive affect (r = −0.58, P < 0.01), and negative affect (r = 0.63, P < 0.01). We built a measurement model for the 14-item CES-D (somatic symptoms, negative affect and anhedonia) and BDI-II (somatic-affective and cognitive factor). The path diagram of this measurement model is presented in Figure 3. The factor loadings for CES-D items range from 0.531 to 0.846 and for CES-D based first-order factors range from 0.723 to 0.924. The factor loadings for BDI-II items range from 0.524 to 0.823 and for BDI-II based first-order factors range from 0.902 to 0.973. The measurement model produces a correlation of 0.889 between the CES-D and BDI-II based on second-order factor analysis. The two-factor model with correlated factors fits the data well (χ2 = 1,922.809, df = 554, RMSEA = 0.051, CFI = 0.937, TLI = 0.932).

Table 3

CES-D-14 itemCES-D-20 itemBDI-IICES-D1CES-D2CES-D3BDI-II1BDI-II2PANA
CES-D-200.975**
BDI-II0.735**0.744**
CES-D10.909**0.895**0.679**
CES-D20.788**0.824**0.613**0.666**
CES-D30.743**0.674**0.508**0.456**0.400**
BDI-II10.632**0.618**0.845**0.632**0.487**0.392**
BDI-II20.719**0.735**0.975**0.645**0.615**0.513**0.720**
PA−0.578**−0.537**−0.479**−0.452**−0.368**−0.596**−0.422**−0.464**
NA0.631**0.667**0.563**0.586**0.589**0.388**0.415**0.578**−0.236**

Correlations between the subscales of CES-D, BDI-II, and PANAS.

CES-D-14 item, 14-item CES-D total score; CES-D-20 item, 20-item CES-D total score; BDI-II, BDI-II total score; CES-D1, Somatic symptoms; CES-D2, Depressed symptoms; CES-D3, Anhedonia; BDI-II1, Somatic-affective; BDI-II2, Cognitive factor; PA, Positive affect schedule total score; NA, Negative affect schedule total score. BDI-II, Beck Depression Inventory-II; PANAS, Positive and Negative Affect Schedule.

**P < 0.01.

Figure 3

Figure 3

Measurement model for the subscales of CES-D and BDI-II.

The 14-item CES-D scale had satisfactory internal consistency with a Cronbach’s coefficient alpha of 0.87. The average inter-item correlation for all items was 0.32, which was acceptable and suggested that the items measure the same construct well. The average inter-item correlation with the reverse-scored items (item 4, 8, 12, and 16) and without the reverse-scored items was 0.45 and 0.37, respectively. The corrected item-total correlation coefficients for all items ranged from 0.400 (item 11) to 0.697 (item 6) (Table 4).

Table 4

Scale mean if item deletedCorrected item-total correlationCronbach’s alpha if item deleted
1. I was bothered by things that usually don’t bother me.11.560.5330.857
2. I did not feel like eating; my appetite was poor.11.770.4020.863
3. I felt that I could not shake off the blues even with help from my family or friends.11.700.6080.852
4. I felt I was just as good as other people.10.980.4120.864
5. I had trouble keeping my mind on what I was doing.11.230.5020.858
6. I felt depressed.11.430.6970.847
7. I felt that everything I did was an effort.11.550.6480.850
8. I felt hopeful about the future.11.250.4650.860
11. My sleep was restless.11.680.4000.864
12. I was happy.11.180.5360.856
14. felt lonely.11.480.5560.855
16. I enjoyed life.11.420.5010.858
18. I felt sad.11.650.5780.854
20. I could not get “going.”12.050.5040.859

Internal consistency of CES-D.

Prevalence of Subthreshold Depression

For the 20-item CES-D, the scores ranged from 0 to 57, and the average score was 16.03 (SD = 9.62). The prevalence of subthreshold depression was 32.7% considering the cut-off score of 20. For the 14-item CES-D, the scores ranged from 0 to 42 and the average score was 12.38 (SD = 6.89). The ROC curve of the 14-item CES-D is drawn in Figure 4, with an area under the curve (AUC) of 0.903 (95%CI: 0.887 to 0.919). The optimal cut-off point of 15.5 was determined by maximizing both the sensitivity (0.840) and specificity (0.824), with positive predictive values of 0.547 and negative predictive values of 0.953. The prevalence of subthreshold depression was 31% considering a cut-off value of 16. A multiple regression was calculated to predict the CES-D score based on demographic variables of gender, grade, and major. The results show that the model is not significant, F(4, 1,915) = 2.128, P = 0.075. This suggests that the model cannot significantly predict the CES-D score.

Figure 4

Figure 4

Receiver operator characteristic (ROC) curve analysis of the 14-item CES-D.

Discussion

To our best knowledge, this is the first study to evaluate the psychometric properties of the Chinese version of the CES-D for assessing subthreshold depression in Chinese university students. The results indicate that the CES-D is a reliable and valid instrument for assessing subthreshold depression in Chinese university students. The fit statistics suggested that the newly derived model with 14 items provides the best fit for the current data. The prevalence of subthreshold depression is 32.7% for the 20-item CES-D and 31% for the 14-item CES-D, and there is no significant difference in the demographic correlates of gender, grade, and major.

Psychometric Properties of the Chinese CES-D

Previous studies have highlighted that item 17 (“I had crying spells”) may have gender bias (1213, 14, 44). As suggested by differential functioning analysis, the female group tended to respond higher compared with the male group. This finding is consistent with previous studies that reported that item 17 has gender bias (12, 13, 45). It is important to remove item 17 from the CES-D to achieve validation of a summary score to indicate the level of depression. In addition to item 17, the two interpersonal items (item 15, 19) may also be problematic. First, there is no theoretical support for including social items in an assessment of depression. The current DSM-V diagnosis manual (Supplementary Material) does not consider interpersonal problems a criteria to depression (15). In contrast, the two interpersonal items may assess symptoms of other disorders, such as social anxiety disorder (16, 46). Secondly, there are only two items for the interpersonal factor, which would result in psychometric difficulties (44, 47). Finally, a number of studies removed the two interpersonal items and produced a more validated measure of depression (12, 13). One study suggested that the two interpersonal items were unable to distinguish non-depressed and depressed patients with HIV/AIDS (48). Finally, in the original CES-D model developed by Radloff (1977) and other CES-D models (47, 49, 50), it is noted that the correlation between the interpersonal factor and other factors is very low. As a result, we evaluated the EFA models with and without the three items (item 15, 17, and 19).

The present study used confirmatory analysis to investigate factor structures of the Chinese CES-D, and the results suggested that the newly derived model with 14 items produced the best fit indices. The current results replicate the increasingly demonstrably robust results from Carleton et al. (12) with a Chinese sample and provide support for the three-factor structure of CES-D suggested by Carleton et al. (12). There is a small difference between the current model and the Carleton model. It was noted that items 3 (“blues”) and 6 (“felt depressed”) moved from the depressed affect to somatic symptoms, whereas item 20 (“could not get going”) moved from somatic symptoms to the depressed affect. It seems that the Chinese university students were confused about the difference between depressed affect and somatic symptoms. Numerous studies have shown that the Chinese tend to express somatic symptoms of depression, whereas people in Western countries tend to emphasize psychological symptoms of depression. For example, a study found that Chinese had higher endorsement rates for somatic symptoms compared with Euro-Australians, who in turn had higher endorsement rates for psychological symptoms (51). These differences would be lessened as Chinese-Australians adapt to mainstream Australian society (52). Similarly, another study also found that the Chinese reported more somatic symptoms than the Euro-Canadians (53). However, several studies have failed to find support for the relationship between culture and symptom expression. For instance, using the CES-D, Yen et al. (38) found that a Chinese student sample reported a significantly lower level of somatic depressive symptom endorsement compared with an American student sample (38). Importantly, several studies highlighted the role that cultural norms play in symptom expression. Drawing from a social identity perspective, a study found that increased somatic symptom expression occurred only when Asian participants were willing to endorse collectivism norms and identified strongly with Asian culture (54).

In the general populations, the CES-D has exhibited a good internal consistency with Cronbach’s alpha coefficients ranging from 0.83 to 0.95 (5557). The CES-D also showed good reliability, with a Cronbach’s alpha from 0.89 to 0.92 in university students from Japan, US, and Taiwan (5860). Furthermore, the Chinese version of CES-D has shown satisfactory reliability in children, American Chinese women, community residents, and elderly in Hong Kong, with Cronbach’s alpha values of 0.82, 0.86, 0.86, and 0.90 (17, 6163). In the present study, the Cronbach’s alpha reached 0.87, indicating a good reliability when used in university students. The BDI-II and PANAS were used to evaluate the criterion validity of the CES-D. The results showed that the CES-D scores were positively correlated with BDI-II and negative affect scores and negatively correlated with positive affect scores, demonstrating a good criterion validity of the Chinese CES-D in university students. Furthermore, the two-factor measurement model fits well with the data, suggesting that there is an overlap in the constructs underlying the subscales of the CES-D and BDI-II. This is consistent with the correlation between total scores of the two scales and the theory on depression.

Prevalence of subthreshold depression among the Chinese university students.

The prevalence of subthreshold depression, as assessed by the 20-item CES-D and 14-item CES-D, in university students was 32.7% and 31%, respectively. This is slightly higher than the 23.8% and 30.39% incidence reported in two meta-analysis studies on Chinese university students (64, 65). This is reasonable since the majority of studies used diverse measures, such as the Self-rating Depression Scale (66), BDI (67), and Hamilton Depression Scale (68), which evaluated depression rather than subthreshold depression. In addition, we found no significant differences in subthreshold depression with regard to gender, grade and major. This is consistent with previous studies suggesting no significant differences in depressive symptoms between male and female students (6971). This may be because Chinese female university students are equal as their male peers in many ways, such as political rights, job opportunities, and pressure from academics and life. Regarding grade, previous studies have shown inconsistent findings (71, 72) that may be explained by different measurement tools and sample errors.

Limitations

First, we did not have a diagnostic instrument for depression diagnosis, although a number of excellent diagnostic instruments exist for depression diagnosis, such as Diagnostic Interview Schedule (DIS) and Structured Clinical Interview for DSM Disorder (SCID). However, using such instruments is time consuming and unfeasible in population-based surveys. As a result, we lack a gold standard for depression diagnosis to investigate sensitivity, specificity, positive and negative predictive values of the Chinese CES-D to predict depression or subthreshold depression. Second, the sample was recruited from only two universities in Guangzhou, which limited the generalization of the results of subthreshold depression prevalence to a larger university population in China. Third, the sample was not followed up and test-retest reliability could not be examined. Finally, we only investigated the university student sample. Construct and external validity should be investigated in clinical samples.

Conclusions

The present findings indicate that the three-factor structure with 14 items of CES-D has satisfactory psychometric properties as an instrument for assessing subthreshold depressive symptoms in Chinese university students. The prevalence of subthreshold depression reaches 32.7% for the 20-item CES-D and 31% for the 14-item CES-D, and there is no significant difference in the variables of gender, grade, and major.

Funding

This work was supported by research grants from the National Natural Science Foundation of China (81601969, 81671670, 81501456, and 81471650); Science and Technology Program of Guangdong (2018B030334001); Fundamental Research Funds for the Central Universities; Planned Science and Technology Project of Guangdong Province, China (2014B020212022); and Planned Science and Technology Project of Guangzhou, China (201508020004, 20160402007, and 201604020184).

Statements

Data availability statement

All datasets generated for this study are included in the manuscript and the supplementary files.

Ethics statement

This study was carried out in accordance with the recommendations of Declaration of Helsinki with written informed consent from all subjects. All subjects gave written informed consent in accordance with the Declaration of Helsinki. The protocol was approved by the Ethics Committee of School of Medical Science at Jinan University, China.

Author contributions

LJ contributed to data collection, data analysis, discussion on results, writing, and preparation of the manuscript. YW contributed to study design, data analyses, writing and preparation of the manuscript. YZ, RL, HW, CL, and YLW contributed to data collection, data analyses, and discussion on results. QT contributed to study design, data collection, results interpretation, discussion of results, writing, and preparation of the manuscript. All authors read and approved the final manuscript.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2019.00315/full#supplementary-material

References

  • 1

    KesslerRCBerglundPDemlerOJinRMerikangasKRWaltersEE. Lifetime prevalence and age-of-onset distributions of DSM-IV disorders in the National Comorbidity Survey Replication. Arch Gen Psychiatry (2005) 62(6):593602. doi: 10.1001/archpsyc.62.6.593

  • 2

    IbrahimAKKellySJAdamsCEGlazebrookC. A systematic review of studies of depression prevalence in university students. J Psychiatr Res (2013) 47(3):391400. doi: 10.1016/j.jpsychires.2012.11.015

  • 3

    VoelkerR. Mounting student depression taxing campus mental health services. Jama (2003) 289(16):2055–6. doi: 10.1001/jama.289.16.2055

  • 4

    MihăilescuAIDiaconescuLVCiobanuAMDonisanTMihailescuC. The impact of anxiety and depression on academic performance in undergraduate medical students. Eur Psychiatry (2016) 33:S341S342. doi: 10.1016/j.eurpsy.2016.01.761

  • 5

    BucknerJDKeoughMESchmidtNB. Problematic alcohol and cannabis use among young adults: the roles of depression and discomfort and distress tolerance. Addict Behav (2007) 32(9):1957–63. doi: 10.1016/j.addbeh.2006.12.019

  • 6

    MushquashARStewartSHSherrySBSherryDLMushquashCJMacKinnonAL. Depressive symptoms are a vulnerability factor for heavy episodic drinking: a short-term, four-wave longitudinal study of undergraduate women. Addict Behav (2013) 38(5):2180–6. doi: 10.1016/j.addbeh.2012.11.008

  • 7

    RubinR. Recent suicides highlight need to address depression in medical students and residents. Jama (2014) 312(17):1725–7. doi: 10.1001/jama.2014.13505

  • 8

    CuijpersPSmitF. Subthreshold depression as a risk indicator for major depressive disorder: a systematic review of prospective studies. Acta Psychiatr Scand (2004) 109(5):325–31. doi: 10.1111/j.1600-0447.2004.00301.x

  • 9

    FogelJEatonWWFordDE. Minor depression as a predictor of the first onset of major depressive disorder over a 15-year follow-up. Acta Psychiatr Scand (2006) 113(1):3643. doi: 10.1111/j.1600-0447.2005.00654.x

  • 10

    ClarkeGNHornbrookMLynchFPolenMGaleJBeardsleeWet al. a randomized trial of a group cognitive intervention for preventing depression in adolescent offspring of depressed parents. Arch Gen Psychiatry (2001) 58(12):1127–34. doi: 10.1001/archpsyc.58.12.1127

  • 11

    RadloffLS. The CES-D Scale: a Self-Report Depression Scale for Research in the general population. Appl Psychol Meas (1977) 1(3):385401. doi: 10.1177/014662167700100306

  • 12

    CarletonRNThibodeauMATealeMJNWelchPGAbramsMPRobinsonTet al. The center for epidemiologic studies depression scale: a review with a theoretical and empirical examination of item content and factor structure. PLoS One (2013) 8(3):e58067. doi: 10.1371/journal.pone.0058067

  • 13

    ColeSRKawachiIMallerSJBerkmanLF. Test of item-response bias in the CES-D scale: experience from the New Haven EPESE study. J Clin Epidemiol (2000) 53(3):285–9. doi: 10.1016/S0895-4356(99)00151-1

  • 14

    YangFMJonesRN. Center for Epidemiologic Studies—Depression scale (CES-D) item response bias found with Mantel-Haenszel method was successfully replicated using latent variable modeling. J Clin Epidemiol (2007) 60(11):1195–200. doi: 10.1016/j.jclinepi.2007.02.008

  • 15

    A. American Psychiatric: diagnostic and statistical manual of mental disorders (DSM-5®). Am Psychiatr Pub (2013) 155–88. doi: 10.1176/appi.books.9780890425596

  • 16

    ShaferAB. Meta-analysis of the factor structures of four depression questionnaires: Beck, CES-D, Hamilton, and Zung. J Clin Psychol (2006) 62(1):123–46. doi: 10.1002/jclp.20213

  • 17

    YangLJiaC-XQinP. Reliability and validity of the Center for Epidemiologic Studies Depression Scale (CES-D) among suicide attempters and comparison residents in rural China. BMC Psychiatry (2015) 15:76. doi: 10.1186/s12888-015-0458-1

  • 18

    ZhangYTingRZWLamMHBLamS-PYeungRONanHet al. Measuring depression with CES-D in Chinese patients with type 2 diabetes: the validity and its comparison to PHQ-9. BMC Psychiatry (2015) 15:198. doi: 10.1186/s12888-015-0580-0

  • 19

    ChinWYChoiEPHChanKTYWongCKH. The Psychometric Properties of the Center for Epidemiologic Studies Depression Scale in Chinese Primary Care Patients: Factor Structure, Construct Validity, Reliability, Sensitivity and Responsiveness. PLoS One (2015) 10(8):e0135131. doi: 10.1371/journal.pone.0135131

  • 20

    O’HalloranAMKennyRAKing-KallimanisBL. The latent factors of depression from the short forms of the CES-D are consistent, reliable and valid in community-living older adults. Eur Geriatr Med (2014) 5(2):97102. doi: 10.1016/j.eurger.2013.12.004

  • 21

    CostelloABOsborneJ. Best Practices in Exploratory Factor Analysis: Four Recommendations for Getting the Most From Your Analysis. Pract Assess Res Eval (2005) 10:19.

  • 22

    MacCallumRCWidamanKZhangSHongS. Sample Size in Factor Analysis. Psychol Methods (1999) 4:8499. doi: 10.1037/1082-989X.4.1.84

  • 23

    ComreyALLeeHB. A first course in factor analysis. 2nd ed. New Jersey, United States: Lawrence Erlbaum Associates, Inc, Hillsdale (1992).

  • 24

    DozemanEvan SchaikDjFau - van MarwijkHWJvan MarwijkHwFau -StekMLStekMlFau - van der HorstHEvan der HorstHeFau - BeekmanATFBeekmanAT. The center for epidemiological studies depression scale (CES-D) is an adequate screening instrument for depressive and anxiety disorders in a very old population living in residential homes. Int J Geriatr Psych (2011) 26(3):239–46. doi: 10.1002/gps.2519

  • 25

    ChengS-TChanACM. The Center for Epidemiologic Studies Depression Scale in older Chinese: thresholds for long and short forms. Int J Geriatric Psychiatry (2005) 20(5):465–70. doi: 10.1002/gps.1314

  • 26

    VilagutGForeroCGBarbagliaGAlonsoJ. Screening for Depression in the General Population with the Center for Epidemiologic Studies Depression (CES-D): a Systematic Review with Meta-Analysis. PLoS One (2016) 11(5):e0155431–e0155431. doi: 10.1371/journal.pone.0155431

  • 27

    ZhangBFokkemaMCuijpersPLiJSmitsNBeekmanA. Measurement invariance of the center for epidemiological studies depression scale (CES-D) among chinese and dutch elderly. BMC Med Res Methodol (2011) 11(1):74. doi: 10.1186/1471-2288-11-74

  • 28

    BeckATSteerRABrownGK. Manual for Beck Depression Inventory-II. San Antonio: TX, Psychology Corporation (1996). doi: 10.1037/t00742-000

  • 29

    StorchEARobertiJWRothDA. Factor structure, concurrent validity, and internal consistency of the beck depression inventory—second edition in a sample of college students. Depression Anxiety (2004) 19(3):187–9. doi: 10.1002/da.20002

  • 30

    ZhengYWeiLLianggueGGuochenZChenggueW. Applicability of the Chinese beck depression inventory. Compr Psychiatry (1988) 29(5):484–9. doi: 10.1016/0010-440X(88)90063-6

  • 31

    YangWWuDPengF. Application of Chinese Version of Beck Depression Inventory-II to Chinese first-year college students. Chin J Clin Psychol (2012) 20(6):762–4.

  • 32

    WuP-C. Measurement invariance and latent mean differences of the Beck Depression Inventory II across gender groups. J Psychoeduc Assess (2010) 28(6):551–63. doi: 10.1177/0734282909360772

  • 33

    WatsonDClarkLATellegenA. Development and validation of brief measures of positive and negative affect: the PANAS scales. J Pers Soc Psychol (1988) 54(6):1063. doi: 10.1037/0022-3514.54.6.1063

  • 34

    HuangLYangTLiZ. Applicability of the Positive and Negative Affect Scale in Chinese. Chin Ment Health J (2003) 17(1):54–6.

  • 35

    HairJBlackWBabinBAndersonR. Multivariate Data Analysis. 7th Edition, Essex, England: Prentice Hall (2009).

  • 36

    BeauducelAHerzbergPY. On the performance of maximum likelihood versus means and variance adjusted weighted least squares estimation in CFA. Struct Equation Model (2006) 13(2):186203. doi: 10.1207/s15328007sem1302_2

  • 37

    FloraDBCurranPJ. An Empirical Evaluation of Alternative Methods of Estimation for Confirmatory Factor Analysis With Ordinal Data. Psychol Methods (2004) 9(4):466–91. doi: 10.1037/1082-989X.9.4.466

  • 38

    YenSRobinsCJLinN. A cross-cultural comparison of depressive symptom manifestation: China and the United States. J Consult Clin Psychol (2000) 68(6):993–9. doi: 10.1037/0022-006X.68.6.993

  • 39

    LeeSWStewartSMByrneBMWongJPSHoSYLeePWHet al. Factor structure of the Center for Epidemiological Studies Depression scale in Hong Kong adolescents. J Pers Assess (2008) 90(2):175–84. doi: 10.1080/00223890701845385

  • 40

    HooperDCoughlanJMullenM. Structural equation modelling: guidelines for determining model fit. Electron J Bus Res Methods (2008) 6(1):5360. doi: 10.21427/D7CF7R

  • 41

    LuomaJBO’HairAKKohlenbergBSHayesSCFletcherL. The development and psychometric properties of a new measure of perceived stigma toward substance users. Subst Use Misuse (2010) 45(1–2):4757. doi: 10.3109/10826080902864712

  • 42

    RaykovTShroutPE. Reliability of scales with general structure: point and interval estimation using a structural equation modeling approach. Struct Equation Model (2002) 9(2):195212. doi: 10.1207/S15328007SEM0902_3

  • 43

    SkorikovVBVanderVoortDJ. Relationships between the Underlying Constructs of the Beck Depression Inventory and the Center for Epidemiological Studies Depression Scale. Educ Psychol Meas (2003) 63(2):319–35. doi: 10.1177/0013164402251035

  • 44

    MillerTQMarkidesKSBlackSA. The factor structure of the CES-D in two surveys of elderly Mexican Americans. J Gerontol B Psychol Sci Soc Sci (1997) 52(5):S259S269. doi: 10.1093/geronb/52B.5.S259

  • 45

    StommelMGivenBAGivenCWKalaianHASchulzRMcCorkleR. Gender bias in the measurement properties of the Center for Epidemiologic Studies Depression Scale (CES-D). Psychiatry Res (1993) 49(3):239–50. doi: 10.1016/0165-1781(93)90064-N

  • 46

    StansburyJPRiedLDVelozoCA. Unidimensionality and bandwidth in the Center for Epidemiologic Studies Depression (CES–D) scale. J Pers Assess (2006) 86(1):1022. doi: 10.1207/s15327752jpa8601_03

  • 47

    WilliamsCDTaylorTRMakambiKHarrellJPalmerJRRosenbergLet al. CES-D four-factor structure is confirmed, but not invariant, in a large cohort of African American women. Psychiatry Res (2007) 150(2):173–80. doi: 10.1016/j.psychres.2006.02.007

  • 48

    JuddFKMijchANormanT. The evaluation of depression in inpatients with HIV disease. Aust N Z J Psychiatry (1999) 33(3):344–52. doi: 10.1046/j.1440-1614.1999.00579.x

  • 49

    MackinnonAMcCallumJAndrewsGAndersonI. The center for epidemiological studies depression scale in older community samples in Indonesia, North Korea, Myanmar, Sri Lanka, and Thailand. J Gerontol B Psychol Sci Soc Sci (1998) 53(6):P343–P352. doi: 10.1093/geronb/53B.6.P343

  • 50

    CheungC-KBagleyC. Validating an American scale in Hong Kong: the center for epidemiological studies depression scale (CES-D). J Psychol (1998) 132(2):169–86. doi: 10.1080/00223989809599157

  • 51

    ParkerGCheahYCRoyK. Do the Chinese somatize depression? A cross-cultural study. Soc Psychiatry Psychiatr Epidemiol (2001) 36(6):287–93. doi: 10.1007/s001270170046

  • 52

    ParkerGChanBTullyLEisenbruchM. Depression in the Chinese: the impact of acculturation. Psychol Med (2005) 35(10):1475–83. doi: 10.1017/S0033291705005623

  • 53

    RyderAGYangJZhuXYaoSYiJHeineSJet al. The cultural shaping of depression: somatic symptoms in China, psychological symptoms in North America? J Abnormal Psychol (2008) 117(2):300–13. doi: 10.1037/0021-843X.117.2.300

  • 54

    ChangMXLJettenJCruwysTHaslamC. Cultural identity and the expression of depression: a social identity perspective. J Commun Appl Soc Psychol (2017) 27(1):1634. doi: 10.1002/casp.2291

  • 55

    DemirchyanAPetrosyanVThompsonME. Psychometric value of the Center for Epidemiologic Studies Depression (CES-D) scale for screening of depressive symptoms in Armenian population. J Affective Disord (2011) 133(3):489–98. doi: 10.1016/j.jad.2011.04.042

  • 56

    MalakoutiSKPachanaNANajiBKahaniSSaeedkhaniM. Reliability, validity and factor structure of the CES-D in Iranian elderly. Asian J Psychiatry (2015) 18:8690. doi: 10.1016/j.ajp.2015.08.007

  • 57

    FountoulakisKIacovidesAKleanthousSSamolisSKaprinisSGSitzoglouKet al. Reliability, Validity and Psychometric Properties of the Greek Translation of the Center for Epidemiological Studies-Depression (CES-D) Scale. BMC Psychiatry (2001) 1(1):3. doi: 10.1186/1471-244X-1-3

  • 58

    SheanGBaldwinG. Sensitivity and specificity of depression questionnaires in a college-age sample. J Genet Psychol (2008) 169(3):281–92. doi: 10.3200/GNTP.169.3.281-292

  • 59

    UmegakiYTodoN. Psychometric properties of the Japanese CES-D, SDS, and PHQ-9 depression scales in university students. Psychol Assess (2017) 29(3):354–9. doi: 10.1037/pas0000351

  • 60

    ChangEChenR. A Study of Depression Factors in Taiwanese Students of Department of Design. Eurasia J Math Sci Technol Educ (2018) 14(1):197204. doi: 10.12973/ejmste/79632

  • 61

    William LiHCChungOKJHoKY. Center for Epidemiologic Studies Depression Scale for Children: psychometric testing of the Chinese version. J Adv Nurs (2010) 66(11):2582–91. doi: 10.1111/j.1365-2648.2010.05440.x

  • 62

    LiZHicksMH-R. The CES-D in Chinese American women: construct validity, diagnostic validity for major depression, and cultural response bias. Psychiatry Res (2010) 175(3):227–32. doi: 10.1016/j.psychres.2009.03.007

  • 63

    BoeyKW. Cross-validation of a short form of the CES-D in Chinese elderly. Int J Geriatric Psychiatry (1999) 14(8):608–17. doi: 10.1002/(SICI)1099-1166(199908)14:8<608::AID-GPS991>3.0.CO;2-Z

  • 64

    LeiX-YXiaoL-MLiuY-NLiY-M. Prevalence of Depression among Chinese University Students: a Meta-Analysis. PLoS One (2016) 11(4):e0153454. doi: 10.1371/journal.pone.0153454

  • 65

    JiangCXLiZZChenPChenLZ. Prevalence of Depression Among College-Goers in Mainland China: a methodical evaluation and meta-analysis. Medicine (Baltimore) (2015) 94(50):e2071. doi: 10.1097/MD.0000000000002071

  • 66

    ZungWWK. A self-rating depression scale. Arch Gen Psychiatry (1965) 12(1):6370. doi: 10.1001/archpsyc.1965.01720310065008

  • 67

    BeckATWardCHMendelsonMMockJErbaughJ. An Inventory for Measuring Depression. Arch Gen Psychiatry (1961) 4(6):561–71. doi: 10.1001/archpsyc.1961.01710120031004

  • 68

    HamiltonM. A rating scale for depression. J Neurol Neurosurg Psychiatry (1960) 23:5661. doi: 10.1136/jnnp.23.1.56

  • 69

    TakagakiKOkamotoYJinninRMoriANishiyamaYYamamuraTet al. Behavioral characteristics of subthreshold depression. J Affective Disord (2014) 168:472–5. doi: 10.1016/j.jad.2014.07.018

  • 70

    ChenLWangLQiuXHYangXXQiaoZXYangYJet al. Depression among Chinese University Students: prevalence and Socio-Demographic Correlates. PLoS One (2013) 8(3):e58379. doi: 10.1371/journal.pone.0058379

  • 71

    BayramNBilgelN. The prevalence and socio-demographic correlations of depression, anxiety and stress among a group of university students. Soc Psychiatry Psychiatr Epidemiol (2008) 43(8):667–72. doi: 10.1007/s00127-008-0345-x

  • 72

    BostanciMOzdelOOguzhanogluNOzdelLErginAErginNet al. Depressive symptomatology among university students in Denizli, Turkey: prevalence and sociodemographic correlates. Croatian Med J (2005) 46(1):96100.

Summary

Keywords

Center for Epidemiologic Studies Depression Scale, reliability, validity, students, depression

Citation

Jiang L, Wang Y, Zhang Y, Li R, Wu H, Li C, Wu Y and Tao Q (2019) The Reliability and Validity of the Center for Epidemiologic Studies Depression Scale (CES-D) for Chinese University Students. Front. Psychiatry 10:315. doi: 10.3389/fpsyt.2019.00315

Received

27 February 2019

Accepted

24 April 2019

Published

21 May 2019

Volume

10 - 2019

Edited by

Menachem Ben-Ezra, Ariel University, Israel

Reviewed by

Nuno Madeira, University of Coimbra, Portugal Forough Mortazavi, Sabzevar University of Medical Sciences, Iran

Updates

Copyright

*Correspondence: Qian Tao,

†These authors have contributed equally to this work.

This article was submitted to Mood and Anxiety Disorders, a section of the journal Frontiers in Psychiatry

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics