Stability of the Distribution of Patient Health Questionnaire-9 Scores Against Age in the General Population: Data From the National Health and Nutrition Examination Survey

Tomitaka, Shinichiro; Kawasaki, Yohei; Ide, Kazuki; Akutagawa, Maiko; Ono, Yutaka; Furukawa, Toshiaki A.

doi:10.3389/fpsyt.2018.00390

ORIGINAL RESEARCH article

Front. Psychiatry, 23 August 2018

Sec. Public Mental Health

Volume 9 - 2018 | https://doi.org/10.3389/fpsyt.2018.00390

Shinichiro Tomitaka¹,²*

Yohei Kawasaki³

Kazuki Ide⁴,⁵,⁶

Maiko Akutagawa⁶

Yutaka Ono⁷

Toshiaki A. Furukawa²

¹Department of Mental Health, Panasonic Health Center, Tokyo, Japan
²Department of Health Promotion and Human Behavior, Kyoto University Graduate School of Medicine, School of Public Health, Kyoto, Japan
³Clinical Research Center, Chiba University Hospital, Chiba, Japan
⁴Department of Pharmacoepidemiology, Graduate School of Medicine and Public Health, Kyoto University, Kyoto, Japan
⁵Center for the Promotion of Interdisciplinary Education and Research, Kyoto University, Kyoto, Japan
⁶Department of Drug Evaluation and Informatics, School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka, Japan
⁷Center for the Development of Cognitive Behavior Therapy Training, Tokyo, Japan

Background: Epidemiological studies using the nine-item Patient Health Questionnaire (PHQ-9) have reported inconsistencies regarding the relationship between age and total scores. To determine whether this discrepancy is due to the stability of the distribution of PHQ-9 total scores against age, we investigated whether the total score distribution remains stable during adulthood, and also investigated the mathematical patterns of the total score distribution.

Methods: The present study utilized data from 15,847 participants of the 2009–2014 United States National Health and Nutrition Examination Survey, all of whom responded to all PHQ-9 items. The stability of the total score distribution among different age groups was examined using overlap coefficients and graphical analysis.

Results: High overlap coefficients were observed between all age groups for the distributions of PHQ-9 total scores, suggesting that the distribution of PHQ-9 total scores remains stable against age. Graphical analysis demonstrated that distributions of PHQ-9 total scores were similar across age groups. In addition, distributions of PHQ-9 total scores exhibited an exponential pattern, except at the lower end of the distribution.

Conclusions: Our findings indicate that the stability of the distribution of PHQ-9 total scores throughout adulthood may underlie inconsistencies in the evidence regarding age-related changes in total depression scores.

Introduction

Clinical and scientific research studies have aimed to identify the age at which people become more susceptible to depression (1). Because the diagnosis of depression is based on the degree of depressive symptoms, there has been much interest in understanding the relationship between age and depressive symptoms in the general population (2–4). Numerous epidemiological studies have attempted to clarify age-related changes in depressive symptoms (1, 3, 5–7). However, epidemiological studies have reported inconsistent evidence regarding age-related changes in total scores on depression screening scales during adulthood, suggesting a difficulty in replicating the relationship between age and depressive symptoms using traditional screening instruments.

The Center for Epidemiologic Studies Depression Scale (CES-D), developed by Radloff in 1977, is a self-reported depressive symptoms questionnaire that now serves as a screening tool for depression in primary care and research settings (2). Although some studies have reported inconsistent results (1), several cross-sectional surveys and longitudinal studies—the majority of which utilized the CES-D—have demonstrated that the trajectory of depressive symptom scores follows a U-shaped pattern, with total scores being high during young adulthood, decreasing during middle adulthood, and then increasing again after the age of 70 (5, 6, 8, 9).

The nine-item Patient Health Questionnaire (PHQ-9), developed in the 1990s (10), consists of questions associated with the nine criteria for depression outlined in the Diagnostic and Statistical Manual of Mental Disorders (11), and is widely used for the self-rating of depression worldwide (12). Kroenke et al. reported that a PHQ-9 score ≥10 had 88% sensitivity and 88% specificity for major depression in an assessment of criterion validity (13). Several lines of evidence suggest that PHQ-9 total score distributions are right-skewed in the general population, with scores of 5, 10, and 15 corresponding approximately to percentile ranks of 80, 95, and 99%, respectively (14, 15).

Of note, epidemiological studies using the PHQ-9 and its related versions have reported greater inconsistency in age-related changes in depressive symptoms than those using the CES-D (5, 6, 14–18). Contrary to the results obtained using the CES-D, the National Health and Nutrition Examination Survey (NHANES) and the Behavioral Risk Factor Surveillance System (BRFSS) in the United States reported that the trajectory of symptom changes followed a reverse U-shaped pattern, with total scores being low during young adulthood, increasing during middle adulthood, and then decreasing again after the age of 60 (16, 19). In contrast, three German surveys and an Indian survey reported that total PHQ-9 scores increased with age (14, 15, 17, 20), while a population-based study in China reported that total scores decreased with age (18). Furthermore, epidemiological studies using other depression screening scales, such as the Hospital Anxiety and Depression Scale (HADS), have reported trajectories that differ from those obtained using the CES-D (21, 22).

Although the reason for the difficulty in replicating the relationship between age and total depression scores remains unclear, one possibility is that the effect of age on total scores using such a scale is so small that it is difficult to reproduce the association between age and total scores. Generally, it is easy to demonstrate large effects. Conversely, it is difficult to replicate small effects because they are easily neutralized by other factors (23). Epidemiological studies have reported that sex, ethnicity, education level, employment status, marital status, and household income significantly influence depressive symptoms (17, 19, 24). In addition, most of these factors differ according to age group in population-based studies. For example, there is an increasing amount of racial and ethnic diversity among young age groups in industrialized nations. Thus, if the effect of age on the PHQ-9 total scores is small, and the other factors differ according to age group, it will be difficult to reproduce the relationship between age and PHQ-9 total scores.

Consistent with this hypothesis, although cross-sectional surveys using other depression screening scales have reported inconsistent trajectories of depressive symptom scores, the reported age-related changes in depressive symptoms have been relatively mild (5, 9, 21, 22). Furthermore, in one previous study, the distribution of total scores on the CES-D was stable during middle adulthood, during which time no significant differences in mean total scores were observed (Figure 1) (9). These results suggest that the stability of the total score distribution for depression screening scales against age underlies inconsistencies in the evidence regarding the relationship between age and total depression scores. To test the hypothesis that inconsistencies in the evidence regarding the relationship between age and total depression scores are due to the stability of the distribution of PHQ-9 total scores against age, it is first necessary to ascertain the similarities in the distribution of PHQ-9 total scores throughout adulthood. Of note, our hypothesis is based on the degree of similarity rather than the equality of the distributions.

Figure 1. CES-D distributions for the middle adulthood group. The distributions of CES-D scores were similar among the middle adulthood group (https://doi.org/10.1371/journal.pone.0114624). CES-D, Center for Epidemiologic Studies Depression Scale.

Previous studies have reported that total PHQ-9 scores in the general population follow an exponential distribution, except at the lower end of the distribution (25). This exponential pattern has been observed in previous studies that have utilized other scales, such as the CES-D, Kessler Psychological Distress Scale (K6), and Clinical Interview Schedule-Revised (CIS-R) (9, 26, 27). In general, an exponential distribution emerges when the total stability of the distribution is observed in conjunction with individual variability in the indicated variables (28, 29), suggesting that the exponential pattern of the distribution is linked to the stability of the total score distribution.

The present study investigated whether the distribution of total scores on the PHQ-9 remains stable across all age groups in the general population. The degree of similarity for total score distributions among age groups was quantified using overlap coefficients. Furthermore, we used graphical analysis to evaluate the similarity of the distributions among age groups (30) and examined whether the distribution of total PHQ-9 scores followed the same exponential pattern.

We used data from the NHANES—a national survey designed to assess the health and nutritional status of adults and children in the United States (31) that includes the PHQ-9 (12). The PHQ-9 data from the NHANES are suitable for verifying the aforementioned assumption due to the large sample sizes and limited selection bias.

Methods

Dataset

We analyzed data from the 2009–2014 NHANES. The NHANES includes a nationally representative sample of civilian, non-institutionalized US residents selected using a multistage design. The survey consisted of a household interview and an examination conducted in a mobile examination center. All survey participants provided written informed consent. De-identified data from the NHANES are available for researchers worldwide (32). For the 2009–2014 NHANES, 41,035 participants were selected. The sociodemographic characteristics of the 2009–2014 NHANES samples are reported in detail elsewhere (32). Among all participants selected, approximately half were selected to respond to the PHQ-9. We used data from 17,899 participants aged 18 and older. Since our local institutional review board does not regard de-identified public data analysis as human subjects research, our research did not require ethical approval from the board.

Measures

In the 2009–2014 NHANES, depressive symptoms were assessed using the PHQ-9. Respondents self-rated the frequency of a variety of depressive symptoms within the past 2 weeks along a 4-point scale, as follows: 0 = “not at all,” 1 = “several days,” 2 = “more than half the days,” and 3 = “nearly every day.” Total scores ranged from 0 to 27.
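The scoring rule described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code (the analysis itself was run in JMP): the function name `phq9_total` is hypothetical, and treating any value outside 0–3 as a missing response is an assumption about how non-responses are coded.

```python
from typing import Optional, Sequence

PHQ9_ITEMS = 9                  # number of items on the PHQ-9
VALID_RESPONSES = {0, 1, 2, 3}  # 0 = not at all ... 3 = nearly every day


def phq9_total(responses: Sequence[Optional[int]]) -> Optional[int]:
    """Return the PHQ-9 total score (0-27).

    Returns None when the record is incomplete, i.e. when it does not
    contain exactly nine items or when any item is missing or outside
    the 0-3 range (assumed here to mean "no valid response").
    """
    if len(responses) != PHQ9_ITEMS:
        return None
    if any(r not in VALID_RESPONSES for r in responses):
        return None
    return sum(responses)
```

Returning `None` for incomplete records mirrors the complete-case approach of the study, in which only participants who answered all nine items contribute a total score.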

Analysis

Participants were categorized into the following age groups: 18–19, 20–29, 30–39, 40–49, 50–59, 60–69, 70–79, and 80 years and older. Descriptive statistics (e.g., mean, standard deviation, skewness, kurtosis, and frequency curve) were calculated for each age group. To estimate the proportion of high PHQ-9 total scores, we calculated the 90th percentile of the PHQ-9 total score for each age group. The Kruskal–Wallis test was used to analyze differences among the age groups.
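As a rough sketch of this step, the age grouping and the per-group descriptive statistics could be computed as follows. The function names are hypothetical, and two estimator choices are assumptions rather than details from the paper: skewness and kurtosis use simple population moments (kurtosis on the excess scale, so a normal distribution gives 0), and the 90th percentile uses the nearest-rank rule. Statistical packages such as JMP may apply different small-sample corrections or interpolation.

```python
from statistics import mean, pstdev
from typing import Dict, Sequence


def age_group(age: int) -> str:
    """Map an age in years to the age-group labels used in the analysis."""
    if age < 18:
        raise ValueError("the analysis covers participants aged 18 and older")
    if age <= 19:
        return "18-19"
    if age >= 80:
        return "80+"
    decade = (age // 10) * 10
    return f"{decade}-{decade + 9}"


def describe(scores: Sequence[int]) -> Dict[str, float]:
    """Mean, SD, skewness, excess kurtosis and 90th percentile of total scores."""
    n = len(scores)
    m = mean(scores)
    sd = pstdev(scores)  # population SD; assumes the scores are not all equal
    skewness = sum((x - m) ** 3 for x in scores) / n / sd ** 3
    excess_kurtosis = sum((x - m) ** 4 for x in scores) / n / sd ** 4 - 3
    rank = (9 * n + 9) // 10  # nearest-rank rule: ceil(0.9 * n) in integer arithmetic
    p90 = sorted(scores)[rank - 1]
    return {
        "mean": m,
        "sd": sd,
        "skewness": skewness,
        "excess_kurtosis": excess_kurtosis,
        "p90": p90,
    }
```

Grouping total scores by `age_group(age)` and calling `describe` on each group would yield one row per age group, comparable in structure to a table of participant characteristics.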

To evaluate the similarity of the distributions between different age groups, we calculated the overlap coefficient of the distributions (33). Overlap coefficients represent the proportion of overlap between two probability distributions (Figure 2) and serve as a measure of the similarity between observed distributions (33). Recently, researchers have used effect sizes such as Cohen's d to quantify the degree of difference between two groups (34). Estimating the degree of difference based on effect sizes may serve as an alternative to similarity indices. However, effect sizes focus on the difference between a representative value (mean or median), rather than the pattern or the distribution itself. Thus, we used overlap coefficients to assess the similarities among distributions. Overlap coefficients range between zero and one.
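For discrete scores such as PHQ-9 totals, the overlap coefficient can be illustrated as the sum, over all possible scores, of the smaller of the two relative frequencies. The sketch below assumes this discrete form; the paper cites (33) for the method but does not spell out the estimator, so the function names and the exact computation are illustrative assumptions.

```python
from collections import Counter
from typing import List, Sequence


def relative_frequencies(scores: Sequence[int], max_score: int = 27) -> List[float]:
    """Relative frequency of each total score from 0 to max_score."""
    counts = Counter(scores)
    n = len(scores)
    return [counts.get(s, 0) / n for s in range(max_score + 1)]


def overlap_coefficient(p: Sequence[float], q: Sequence[float]) -> float:
    """Shared probability mass of two discrete distributions on the same support.

    Returns 1.0 for identical distributions and 0.0 for distributions
    that never assign mass to the same score.
    """
    if len(p) != len(q):
        raise ValueError("distributions must cover the same score range")
    return sum(min(a, b) for a, b in zip(p, q))
```

For example, two groups with relative frequencies (0.5, 0.5, 0) and (0.25, 0.5, 0.25) over three scores share 0.25 + 0.5 + 0 = 0.75 of their probability mass. Unlike an effect size, this value changes whenever any part of the distribution shifts, even if the means stay the same.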

Figure 2. Overlap coefficient for two normal distributions. The overlap coefficient (shaded area) for two normal distributions with unequal means and variances.

Furthermore, we used graphical analysis to confirm the similarity of the distribution patterns between different age groups. Graphical analysis is advantageous in that it enables one to observe patterns in the data. Indeed, such analyses allowed us to detect common distribution patterns among different age groups. We used both normal and log-normal scales (i.e., linear and logarithmic frequency axes). Because an exponential pattern appears as a straight line on a logarithmic axis, the latter enables one to identify the range of an exponential pattern. The regression curve for an exponential model was estimated for each age group. All analyses were conducted using JMP Version 11 for Windows (SAS Institute, Inc., Cary, NC, USA).
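One common way to estimate an exponential curve y = a·e^(bx) is ordinary least squares on the logarithm of the frequencies, which is the same transformation that makes an exponential pattern look linear on a logarithmic axis. The sketch below takes that approach as an assumption: the paper does not state whether the curves were fitted on the log scale or by nonlinear regression, nor which score range was used, and `fit_exponential` is a hypothetical name.

```python
import math
from typing import Sequence, Tuple


def fit_exponential(xs: Sequence[float], ys: Sequence[float]) -> Tuple[float, float, float]:
    """Fit y = a * exp(b * x) by least squares on ln(y).

    Returns (a, b, r2), where r2 is the coefficient of determination
    of the straight-line fit on the log scale. Points with y <= 0 are
    skipped because their logarithm is undefined.
    """
    pts = [(x, math.log(y)) for x, y in zip(xs, ys) if y > 0]
    n = len(pts)
    if n < 2:
        raise ValueError("need at least two points with positive frequency")
    mx = sum(x for x, _ in pts) / n
    mz = sum(z for _, z in pts) / n
    sxx = sum((x - mx) ** 2 for x, _ in pts)
    sxz = sum((x - mx) * (z - mz) for x, z in pts)
    szz = sum((z - mz) ** 2 for _, z in pts)
    b = sxz / sxx                  # slope on the log scale
    a = math.exp(mz - b * mx)      # intercept back-transformed to the original scale
    r2 = sxz ** 2 / (sxx * szz) if szz > 0 else 1.0
    return a, b, r2
```

Here `xs` would be the total scores and `ys` the relative frequencies within one age group. Excluding the zero score (and possibly the sparse upper tail) before fitting is a design choice the caller has to make.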

Results

Analysis of PHQ-9 Total Scores

Because we analyzed the distribution of total scores, participants who did not respond to all PHQ-9 items (2,052 individuals) were excluded from the analysis. The final sample for the analysis consisted of 15,847 individuals. The characteristics of the final sample and descriptive statistics for the distributions of PHQ-9 total scores according to age are shown in Table 1. Mean and median values were highly similar among age groups. Of note, the skewness and kurtosis values for all age groups were close to 2 and 5, respectively. For reference, the skewness and excess kurtosis of any exponential distribution are 2 and 6, and those of any normal distribution are 0 and 0, respectively.
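The reference values quoted for the exponential distribution can be checked from its raw moments, E[X^n] = n!/λ^n for rate λ. The short sketch below (with the hypothetical function name `exponential_shape`) converts raw moments to central moments and shows that the result does not depend on the rate. Kurtosis is reported on the excess scale, the convention under which a normal distribution has kurtosis 0.

```python
from math import factorial
from typing import Tuple


def exponential_shape(rate: float) -> Tuple[float, float]:
    """Skewness and excess kurtosis of an exponential distribution with the given rate."""
    # Raw moments E[X^k] = k! / rate^k for k = 1..4
    m1, m2, m3, m4 = (factorial(k) / rate ** k for k in range(1, 5))
    variance = m2 - m1 ** 2
    mu3 = m3 - 3 * m1 * m2 + 2 * m1 ** 3                      # third central moment
    mu4 = m4 - 4 * m1 * m3 + 6 * m1 ** 2 * m2 - 3 * m1 ** 4   # fourth central moment
    skewness = mu3 / variance ** 1.5
    excess_kurtosis = mu4 / variance ** 2 - 3
    return skewness, excess_kurtosis
```

Because the rate cancels in both ratios, every exponential distribution has the same shape statistics, which is why they serve as a fixed benchmark for the observed values.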

Table 1. Participant characteristics.

As depicted in Figure 3, the mean values of the PHQ-9 total scores exhibited a reverse U-shaped pattern, being low during young adulthood, increasing during middle adulthood, and then decreasing during older adulthood, consistent with the findings of a previous study using NHANES data (16). Kruskal–Wallis analysis revealed that PHQ-9 total scores differed significantly among the age groups (χ² = 61.8425, P < 0.0001).
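For readers unfamiliar with the Kruskal–Wallis test, the statistic can be sketched as below. This is a textbook version with mid-ranks and the usual tie correction, written as an illustration rather than a reproduction of the JMP computation; `kruskal_wallis_h` is a hypothetical name. With eight age groups, the resulting H is compared against a chi-square distribution with 7 degrees of freedom.

```python
from itertools import groupby
from typing import List, Sequence


def kruskal_wallis_h(groups: Sequence[Sequence[float]]) -> float:
    """Tie-corrected Kruskal-Wallis H statistic for two or more samples."""
    # Pool all observations, remembering which group each one came from.
    pooled = sorted((value, g) for g, sample in enumerate(groups) for value in sample)
    n = len(pooled)
    rank_sums: List[float] = [0.0] * len(groups)
    tie_term = 0
    position = 0  # number of observations already ranked
    for _, run in groupby(pooled, key=lambda pair: pair[0]):
        tied = list(run)
        t = len(tied)
        mid_rank = position + (t + 1) / 2  # average of ranks position+1 .. position+t
        for _, g in tied:
            rank_sums[g] += mid_rank
        tie_term += t ** 3 - t
        position += t
    h = 12 / (n * (n + 1)) * sum(
        r * r / len(sample) for r, sample in zip(rank_sums, groups)
    ) - 3 * (n + 1)
    correction = 1 - tie_term / (n ** 3 - n)
    if correction == 0:
        raise ValueError("all observations are identical; H is undefined")
    return h / correction
```

The tie correction matters for PHQ-9 data, because total scores take only 28 distinct values and a large share of respondents score zero.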

Overlap Coefficients Among Different Age Groups

The overlap coefficient between different age groups ranged from 0.84 (between the 18–19 and 50s groups) to 0.96 (between the 50s and 60s groups), demonstrating the general similarity of the distribution patterns among different age groups. The overlap coefficient between adjacent age groups ranged from 0.93 to 0.96, indicating that distributions for adjacent age groups were highly similar. Of note, the overlap coefficient among the middle and older adult groups (30s, 40s, 50s, 60s, 70s, and 80+) was over 0.9 for any combination, whereas that between the young adult groups (18–19 and 20s) and the middle/older adult groups was approximately 0.9 or lower, indicating that the distribution of PHQ-9 total scores changed more between young and middle adulthood than between middle adulthood and old age. This seemingly contradicts the finding that mean PHQ-9 total scores changed more between the 50s and 80+ groups than between the 18–19 and 50s groups (Figure 3).

Figure 3. Relationship between age and mean PHQ-9 total score. Mean PHQ-9 total scores exhibited a reverse U-shaped pattern: Mean scores were low during young adulthood, increased during middle adulthood, and then decreased again during old age. PHQ-9, Patient Health Questionnaire-9.

Graphical Analysis of the Distributions

The distributions of the PHQ-9 total scores were compared among the young age group (18–19, 20s, and 30s), the middle age group (30s, 40s, 50s, and 60s), and the older age group (60s, 70s, and 80+) (Figure 4). The distributions of the PHQ-9 total scores were commonly right-skewed and similar among all age groups. Although the mean of the PHQ-9 total scores appeared to change throughout adulthood (Figure 3), it was difficult to distinguish each age group graphically (Figure 4). These findings were in accordance with the fact that the overlap coefficient between adjacent age groups remained high throughout adulthood (Table 2). The frequencies of the zero score were 25.3, 27.9, 32.9, 33.3, 34.6, 36.0, 38.6, and 36.6% for the 18–19, 20–29, 30–39, 40–49, 50–59, 60–69, 70–79, and 80+ years groups, respectively. These results indicate that the frequency of the zero score tended to increase with age, although it was slightly lower in the 80+ group than in the 70–79 group.

Figure 4. Distributions of PHQ-9 total scores according to age group. Distributions of PHQ-9 total scores for the young adulthood group (A), middle adulthood group (B), and old age group (C). The distributions of PHQ-9 total scores were similar among all age groups. PHQ-9, Patient Health Questionnaire-9.

Table 2. Overlap coefficients among different age groups.

Although mean PHQ-9 total scores changed more between the 50s and 80+ groups than between the 18–19 and 50s groups (Figure 3), the overlap coefficient between the 50s and 80+ groups (0.91) was higher than that between the 18–19 and 50s groups (0.84). To clarify this discrepancy, the distributions of the PHQ-9 total scores were compared between the 18–19 and 50s groups, and between the 50s and 80+ groups (Figure 5). As indicated by the black arrows, from 0 points to 6 points, the distributions of the total PHQ-9 scores for the 18–19 and 50s groups (Figure 5A) were less similar to one another than those of the 50s and 80+ groups (Figure 5B), supporting the finding that the overlap coefficient between the 18–19 and 50s groups was lower than that between the 50s and 80+ groups. As shown in Figure 5A, the relative frequencies in the 18–19 group were lower at 0 points, higher from 2 to 6 points, and lower again from 9 to 16 points than those in the 50s group. These findings indicate that the differences between the 18–19 and 50s distributions fell on both sides of the mean value (3.2) and therefore largely offset one another, resulting in a relatively small difference in mean total PHQ-9 scores between the two groups.

Figure 5. Distributions of the PHQ-9 total scores between 18–19 and 50s, and between 50s and 80+. The distributions of the PHQ-9 total scores were compared between the 18–19 and 50s groups (A), and between the 50s and 80+ groups (B). As indicated by the black arrows, from 0 points to 6 points, the distributions of the PHQ-9 total scores for the 18–19 and 50s groups were less similar to one another than those of the 50s and 80+ groups.

To demonstrate the pattern of the PHQ-9 total score distribution, we evaluated the distributions using a log-normal scale (Figure 6). The distributions for all age groups exhibited a linear pattern on a log-normal scale. However, for PHQ-9 scores over 10 points, the curves for each age group fluctuated randomly, reflecting the small sample sizes in each age group. In fact, the percentage of PHQ-9 total scores over 10 points was less than 1%. As indicated by the arrows in Figure 6, all groups exhibited higher frequencies at the zero score than those predicted based on the linear log-normal pattern. The divergence of the actual data from the predicted linear pattern at the zero score increased with increasing age.

Figure 6. Distributions of PHQ-9 total scores according to age group using a log-normal scale. Distributions of PHQ-9 total scores for the young adulthood group (A), middle adulthood group (B), and old age group (C) on a log-normal scale. The distributions for all age groups exhibited a linear pattern on a log-normal scale. However, for PHQ-9 scores over 10 points, the curves for each age group fluctuated randomly. As indicated by the arrows, all groups exhibited higher frequencies at the zero score than those predicted based on the linear log-normal pattern. PHQ-9, Patient Health Questionnaire-9.

Regression Curves for the Exponential Model

The fitting curves for the exponential model were calculated for the 18–19 years (y = 0.2438e^(−0.25x), R² = 0.93), 20–29 years (y = 0.2391e^(−0.24x), R² = 0.98), 30–39 years (y = 0.1979e^(−0.22x), R² = 0.95), 40–49 years (y = 0.174e^(−0.19x), R² = 0.93), 50–59 years (y = 0.16e^(−0.18x), R² = 0.95), 60–69 years (y = 0.1974e^(−0.21x), R² = 0.96), 70–79 years (y = 0.2141e^(−0.24x), R² = 0.96), and 80+ years groups (y = 0.2233e^(−0.25x), R² = 0.94). In accordance with the findings obtained using the log-normal scale, high coefficients of determination for the exponential model were observed among all age groups.
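The excess of zero scores over the exponential pattern can be illustrated by comparing the reported zero-score frequencies with the value each fitted curve takes at x = 0, which is simply its leading coefficient. The sketch below uses only numbers reported in this article, but it rests on an assumption: that x is the total score and y is the relative frequency expressed as a proportion. The paper does not state the score range over which the curves were fitted, so the differences are indicative only.

```python
# Leading coefficient of each fitted curve, i.e. the model's value at x = 0.
FITTED_AT_ZERO = {
    "18-19": 0.2438, "20-29": 0.2391, "30-39": 0.1979, "40-49": 0.174,
    "50-59": 0.16, "60-69": 0.1974, "70-79": 0.2141, "80+": 0.2233,
}

# Reported frequency of the zero score, converted from percent to proportion.
OBSERVED_AT_ZERO = {
    "18-19": 0.253, "20-29": 0.279, "30-39": 0.329, "40-49": 0.333,
    "50-59": 0.346, "60-69": 0.360, "70-79": 0.386, "80+": 0.366,
}


def zero_score_excess() -> dict:
    """Observed minus model-predicted relative frequency at a total score of 0."""
    return {
        group: OBSERVED_AT_ZERO[group] - FITTED_AT_ZERO[group]
        for group in FITTED_AT_ZERO
    }


if __name__ == "__main__":
    for group, excess in zero_score_excess().items():
        print(f"{group:>5}: {excess:+.4f}")
```

Under this reading, the excess is positive in every age group, is smallest in the 18–19 group (about 0.01), and is considerably larger from the 30s onward.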

Discussion

The present study aimed to evaluate the hypothesis that evidence regarding age-related changes in total depression scores on screening scales is inconsistent due to the stability of the total score distribution against age. Our findings indicated that the distribution of PHQ-9 total scores was generally stable throughout all periods of adulthood. These findings demonstrate the stability of the PHQ-9 total score distribution against age, suggesting a small effect of age on the distribution and supporting our hypothesis.

In the present study, the mean values of the PHQ-9 total scores exhibited a reverse U-shaped pattern during adulthood, consistent with the findings of a previous study using NHANES data (16). Although the PHQ-9 total score was significantly different among the age groups in the present study, this may be attributable to the large sample size (15,847 individuals). If a sample size is over 10,000, a significant difference is likely to be found even when the difference between groups is negligible (34). In addition, as our hypothesis was based on the similarity rather than the equality of the distributions, the identification of significant differences with large sample sizes does not contradict the hypothesis.
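The dependence of statistical significance on sample size can be illustrated with a simple two-sample z-test, in which the same small mean difference yields very different P-values at different sample sizes. The numbers in the usage note (a 0.2-point difference with a standard deviation of 4) are hypothetical and chosen only for illustration. The z-test is a stand-in for the rank-based test used in the study, and `two_sample_z` is a hypothetical name.

```python
import math
from typing import Tuple


def two_sample_z(mean_diff: float, sd: float, n_per_group: int) -> Tuple[float, float]:
    """z statistic and two-sided P-value for a difference between two group means.

    Assumes equal group sizes and a common, known standard deviation.
    """
    standard_error = sd * math.sqrt(2 / n_per_group)
    z = mean_diff / standard_error
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail probability
    return z, p_value
```

With these illustrative values the standardized difference is 0.2 / 4 = 0.05 regardless of sample size. Calling `two_sample_z(0.2, 4, 100)` gives a P-value well above 0.05, whereas `two_sample_z(0.2, 4, 10_000)` gives one below 0.001, because the z statistic grows with the square root of the sample size.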

It remains unclear why the evidence regarding age-related changes in total scores appears to be more inconsistent for the PHQ-9 than the CES-D. However, our findings indicated that the distribution of PHQ-9 total scores was generally stable throughout adulthood. These results suggest that the stability of the distribution of the PHQ-9 total scores throughout adulthood is related to inconsistencies in evidence regarding age-related changes in total scores. Further research is required to clarify why the distribution of total scores obtained using the PHQ-9 is stable throughout a wide age range.

Overlap coefficients and graphical analysis revealed that the distribution of PHQ-9 total scores was more stable during middle adulthood and old age than during young adulthood. This result is consistent with the previous finding that the distribution of CES-D total scores was more stable during middle adulthood than during young adulthood (9). These findings suggest that, although the total score distribution on depression screening scales is generally stable throughout all periods of adulthood, the stability of the distribution increases during middle adulthood. Of note, our results demonstrate that differences in the mean values do not necessarily correspond to differences among the distributions themselves. In fact, although mean PHQ-9 total scores differed more between the 50s and 80+ groups than between the 18–19 and 50s groups, the distributions of PHQ-9 total scores were more similar to each other for the 50s and 80+ groups than for the 18–19 and 50s groups. These findings suggest that overlap coefficients and graphical analysis are adequate for evaluating the magnitude of the similarity of the distributions between groups.

In general, age has a strong impact on biological indices, such as cardiopulmonary function, exercise capacity, and brain function (35). While these indices remain stable against age in some individuals, they fluctuate in others, altering the distribution of the variables with age. To the best of our knowledge, there are few physiological variables for which it is difficult to distinguish between the distributions of individuals in their 30s vs. 60s. Conversely, although each individual's depressive symptoms often change throughout life, the distribution of PHQ-9 total scores remains stable against age. Moreover, in the previous study, the distribution of CES-D total scores was stable against age between the ages of 30 and 70 years (9). These findings could be regarded as a unique feature of depressive symptom scores. The degree to which these findings can be generalized to other depression screening scales is unknown but warrants further investigation.

In the present study, the distribution of PHQ-9 total scores followed an exponential distribution, except at the lower end of the distribution, consistent with the findings of previous studies using the CES-D, K6, and CIS-R (9, 26, 27). The exponential pattern appears to coincide with the stability of the distribution. In general, an exponential distribution arises when total stability and individual variability are observed together (i.e., maximum entropy) (28, 29). Although the mechanism underlying this exponential pattern remains unclear, our recent simulation study has demonstrated that, if the latent trait of depressive symptoms follows an exponential distribution, the total scores of a depression rating scale will follow an exponential pattern, except at the lower end of the distribution (36).
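The link between maximum entropy and the exponential distribution can be made explicit with a standard textbook derivation. It treats the score as a continuous non-negative variable x and interprets "total stability" as a fixed population mean μ. Both are simplifying assumptions made here for illustration, not claims from the cited works.

```latex
\begin{aligned}
&\text{Maximize } H[p] = -\int_0^\infty p(x)\,\ln p(x)\,dx
 \quad\text{subject to}\quad
 \int_0^\infty p(x)\,dx = 1, \qquad \int_0^\infty x\,p(x)\,dx = \mu . \\[4pt]
&\mathcal{L}[p] = -\int_0^\infty p\,\ln p\,dx
 - \alpha\left(\int_0^\infty p\,dx - 1\right)
 - \beta\left(\int_0^\infty x\,p\,dx - \mu\right) \\[4pt]
&\frac{\delta \mathcal{L}}{\delta p(x)} = -\ln p(x) - 1 - \alpha - \beta x = 0
 \;\;\Longrightarrow\;\; p(x) = e^{-1-\alpha}\, e^{-\beta x} \\[4pt]
&\text{Normalization and the mean constraint give } e^{-1-\alpha} = \beta = \frac{1}{\mu}
 \;\;\Longrightarrow\;\; p(x) = \frac{1}{\mu}\, e^{-x/\mu}, \quad x \ge 0 .
\end{aligned}
```

In words: among all distributions on the non-negative axis with a given mean, the exponential distribution is the one with the greatest entropy, that is, the one that assumes the least beyond the fixed mean.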

The present study has some notable limitations. In this study, we evaluated the similarities among distributions using overlap coefficients and graphical analysis. However, one major limitation of these methods is the lack of a unified approach to the interpretation of results. Thus, even after obtaining the results of these analyses, we were unable to describe the degree of similarity using unified descriptors, such as “small,” “medium,” “large,” etc. (37). Further research is required to develop unified descriptors for the interpretation of these results.

Furthermore, although the age at which individuals become more susceptible to depression remains an important clinical question, evidence regarding age-related changes in total depression scores during adulthood remains inconsistent, possibly due to the stability of the total score distribution against age. Thus, it may be more appropriate to focus on differences in socioeconomic status in each age group. While our results suggest that the stability of the total score distribution against age during adulthood may be generalized to other depression screening scales, further study is required to verify this hypothesis.

Availability of Data and Materials

The data that support the findings of this study are available in the NHANES repository, https://www.cdc.gov/nchs/nhanes/index.htm.

Author Contributions

ST made substantial contributions to conception and design, acquisition of data, and analysis and interpretation of data and was a major contributor in writing the manuscript. YK, KI, and MA were involved in the interpretation of the data and in revising the manuscript. YO and TF were involved in the study design and revising the manuscript critically. All authors read and approved the final manuscript.

Funding

This work was supported by a research grant from the JSPS KAKENHI (grant number 18K03145). The funding source had no role in designing the study, in the collection, analysis and interpretation of data, in the writing of the report, or in the decision to submit the article for publication.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We would like to thank the Centers for Disease Control and Prevention for providing the data for this study.

References

1. Jorm AF. Does old age reduce the risk of anxiety and depression? A review of epidemiological studies across the adult life span. Psychol Med. (2000) 30:11–22. doi: 10.1017/S0033291799001452

2. Radloff LS. The CES-D scale. Appl Psychol Meas. (1977) 1:385–401. doi: 10.1177/014662167700100306

3. Weissman MM, Sholomskas D, Pottenger M, Prusoff BA, Locke BZ. Assessing depressive symptoms in five psychiatric populations: a validation study. Am J Epidemiol. (1977) 106:203–14. doi: 10.1093/oxfordjournals.aje.a112455

4. Kessler RC, Barker PR, Colpe LJ, Epstein JF, Gfroerer JC, Hiripi E, et al. Screening for serious mental illness in the general population. Arch Gen Psychiatry (2003) 60:184. doi: 10.1001/archpsyc.60.2.184

5. Kessler RC, Foster C, Webster PS, House JS. The relationship between age and depressive symptoms in two national surveys. Psychol Aging (1992) 7:119–26. doi: 10.1037/0882-7974.7.1.119

6. Sutin AR, Terracciano A, Milaneschi Y, An Y, Ferrucci L, Zonderman AB. The trajectory of depressive symptoms across the adult life span. JAMA Psychiatry (2013) 70:803–11. doi: 10.1001/jamapsychiatry.2013.193

7. Walsemann KM, Gee GC, Geronimus AT. Ethnic differences in trajectories of depressive symptoms: disadvantage in family background, high school experiences, and adult characteristics. J Health Soc Behav. (2009) 50:82–98. doi: 10.1177/002214650905000106

8. Oh DH, Kim SA, Lee HY, Seo JY, Choi B-Y, Nam JH. Prevalence and correlates of depressive symptoms in Korean adults: results of a 2009 Korean community health survey. J Korean Med Sci. (2013) 28:128. doi: 10.3346/jkms.2013.28.1.128

9. Tomitaka S, Kawasaki Y, Furukawa T. Right tail of the distribution of depressive symptoms is stable and follows an exponential curve during middle adulthood. PLoS ONE (2015) 10:e0114624. doi: 10.1371/journal.pone.0114624

10. Kroenke K, Spitzer RL, Williams JB. The PHQ-9: validity of a brief depression severity measure. J Gen Intern Med. (2001) 16:606–13. doi: 10.1046/j.1525-1497.2001.016009606.x

11. American Psychiatric Association. Task Force on Nomenclature and Statistics. Diagnostic and Statistical Manual of Mental Disorders. Washington, DC: American Psychiatric Association (1986).

12. Kroenke K, Spitzer RL, Williams JBW, Löwe B. The patient health questionnaire somatic, anxiety, and depressive symptom scales: a systematic review. Gen Hosp Psychiatry (2010) 32:345–59. doi: 10.1016/j.genhosppsych.2010.03.006

13. Kroenke K, Spitzer RL. The PHQ-9: a new depression diagnostic and severity measure. Psychiatr Ann. (2002) 32:509–15. doi: 10.3928/0048-5713-20020901-06

14. Kocalevent R-D, Hinz A, Brähler E. Standardization of the depression screener Patient Health Questionnaire (PHQ-9) in the general population. Gen Hosp Psychiatry (2013) 35:551–5. doi: 10.1016/j.genhosppsych.2013.04.006

15. Rief W, Nanke A, Klaiberg A, Braehler E. Base rates for panic and depression according to the Brief Patient Health Questionnaire: a population-based study. J Affect Disord. (2004) 82:271–6. doi: 10.1016/j.jad.2003.11.006

16. Pratt LA, Brody DJ. Depression in the U.S. household population, 2009–2012. NCHS Data Brief (2014) 172:1–8.

17. Poongothai S, Pradeepa R, Ganesan A, Mohan V. Reliability and validity of a modified PHQ-9 item inventory (PHQ-12) as a screening instrument for assessing depression in Asian Indians (CURES-65). J Assoc Physicians India (2009) 57:147–52.

18. Wang W, Bian Q, Zhao Y, Li X, Wang W, Du J, et al. Reliability and validity of the Chinese version of the Patient Health Questionnaire (PHQ-9) in the general population. Gen Hosp Psychiatry (2014) 36:539–44. doi: 10.1016/j.genhosppsych.2014.05.021

19. Dhingra SS, Kroenke K, Zack MM, Strine TW, Balluz LS. PHQ-8 days: a measurement option for DSM-5 Major Depressive Disorder (MDD) severity. Popul Health Metr. (2011) 9:11. doi: 10.1186/1478-7954-9-11

20. Löwe B, Wahl I, Rose M, Spitzer C, Glaesmer H, Wingenfeld K, et al. A 4-item measure of depression and anxiety: validation and standardization of the Patient Health Questionnaire-4 (PHQ-4) in the general population. J Affect Disord. (2010) 122:86–95. doi: 10.1016/j.jad.2009.06.019

21. Stordal E, Bjartveit Kruger M, Dahl NH, Kruger O, Mykletun A, Dahl AA. Depression in relation to age and gender in the general population: the Nord-Trondelag Health Study (HUNT). Acta Psychiatr Scand. (2001) 104:210–6. doi: 10.1034/j.1600-0447.2001.00130.x

22. Hinz A, Brähler E. Normative values for the hospital anxiety and depression scale (HADS) in the general German population. J Psychosom Res. (2011) 71:74–8. doi: 10.1016/j.jpsychores.2011.01.005

23. Tal J. Strategy and Statistics in Clinical Trials: A Non-Statistician's Guide to Thinking, Designing, and Executing. Waltham, MA: Academic Press (2011).

24. Galenkamp H, Stronks K, Snijder MB, Derks EM. Measurement invariance testing of the PHQ-9 in a multi-ethnic population in Europe: the HELIUS study. BMC Psychiatry (2017) 17:349. doi: 10.1186/s12888-017-1506-9

25. Tomitaka S, Kawasaki Y, Ide K, Akutagawa M, Yamada H, Ono Y, et al. Distributional patterns of item responses and total scores on the PHQ-9 in the general population: data from the National Health and Nutrition Examination Survey. BMC Psychiatry (2018) 18:108. doi: 10.1186/s12888-018-1696-9

26. Tomitaka S, Kawasaki Y, Ide K, Akutagawa M, Yamada H, Ono Y, et al. Pattern analysis of total item score and item response of the Kessler Screening Scale for Psychological Distress (K6) in a nationally representative sample of US adults. PeerJ (2017) 5:e2987. doi: 10.7717/peerj.2987

27. Melzer D, Tom BDM, Brugha TS, Fryers T, Meltzer H. Common mental disorder symptom counts in populations: are there distinct case groups above epidemiological cut-offs? Psychol Med. (2002) 32:1195–201. doi: 10.1017/S0033291702006049

28. Dragulescu A, Yakovenko VM. Statistical mechanics of money. Eur Phys J B (2000) 17:723–9. doi: 10.1007/s100510070114

29. Irwin RJ, Irwin TC. Stability of an exponential distribution for New Zealand taxable personal income. New Zeal Econ Pap. (2017) 51:49–59. doi: 10.1080/00779954.2015.1130739

30. Jebb AT, Parrigon S, Woo SE. Exploratory data analysis as a foundation of inductive research. Hum Resour Manag Rev. (2017) 27:265–76. doi: 10.1016/j.hrmr.2016.08.003

31. CDC. About the National Health and Nutrition Examination Survey. CDC/National Cent Heal Stat. (2013) Available online at: https://www.cdc.gov/nchs/nhanes/about_nhanes.htm (Accessed April 20, 2017).

32. National Center for Health Statistics. About the National Health and Nutrition Examination Survey. NHANES (2014).

33. Inman HF, Bradley EL. The overlapping coefficient as a measure of agreement between probability distributions and point estimation of the overlap of two normal densities. Commun Stat Theor Methods (1989) 18:3851–74. doi: 10.1080/03610928908830127

34. Sullivan GM, Feinn R. Using effect size-or why the P value is not enough. J Grad Med Educ. (2012) 4:279–82. doi: 10.4300/JGME-D-12-00156.1

35. Masoro EJ, Austad SN. Handbook of the Biology of Aging. San Diego, CA: Elsevier; Academic Press (2006).

36. Tomitaka S, Kawasaki Y, Ide K, Akutagawa M, Yamada H, Furukawa TA. Exponential distribution of total depressive symptom scores in relation to exponential latent trait and item threshold distributions: a simulation study. BMC Res Notes (2017) 10:614. doi: 10.1186/s13104-017-2937-6

37. Zinchuk V, Wu Y, Grossenbacher-Zinchuk O. Bridging the gap between qualitative and quantitative colocalization results in fluorescence microscopy studies. Sci Rep. (2013) 3:1365. doi: 10.1038/srep01365

Keywords: depressive symptoms, Patient Health Questionnaire-9, age, National Health and Nutrition Examination Survey, exponential distribution, overlap coefficients

Citation: Tomitaka S, Kawasaki Y, Ide K, Akutagawa M, Ono Y and Furukawa TA (2018) Stability of the Distribution of Patient Health Questionnaire-9 Scores Against Age in the General Population: Data From the National Health and Nutrition Examination Survey. Front. Psychiatry 9:390. doi: 10.3389/fpsyt.2018.00390

Received: 09 May 2018; Accepted: 02 August 2018;
Published: 23 August 2018.

Edited by:

Meichun Mohler-Kuo, School of Nursing Sciences, University of Applied Sciences and Arts of Western Switzerland, Switzerland

Reviewed by:

Chantal Ski, Queen's University Belfast, United Kingdom
Jude Uzoma Ohaeri, University of Nigeria, Nsukka, Nigeria

Copyright © 2018 Tomitaka, Kawasaki, Ide, Akutagawa, Ono and Furukawa. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Shinichiro Tomitaka, tomitaka.shinichiro@jp.panasonic.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.