- 1Departamento de Ciencias Sociales, Universidad de Los Lagos, Osorno, Chile
- 2Consorcio Latinoamericano de Investigación (CLATI), Temuco, Chile
- 3Departamento de Procesos Terapéuticos, Facultad de Ciencias de la Salud, Universidad Católica de Temuco, Temuco, Chile
- 4Escuela de Psicología, Facultad de Ciencias Sociales y Comunicaciones, Universidad Santo Tomás, Temuco, Chile
- 5Universidad Nacional de Huancavelica, Huancavelica, Peru
- 6Escuela de Kinesiología, Facultad de Salud, Universidad Santo Tomas, Temuco, Chile
- 7Universidad Nacional Daniel Alcides Carrión, Cerro de Pasco, Peru
- 8Universidad Nacional Intercultural de la Amazonia, Pucallpa, Peru
- 9Huertas College, Caguas, Puerto Rico
- 10Coordinador de campos Clínicos, Universidad Santo Tomás, Temuco, Chile
- 11Instituto Dominicano para el Estudio de la Salud Integral y la Psicología Aplicada (IDESIP), Santo Domingo, Dominican Republic
- 12Departamento de Ciencias Sociales, Universidad de La Costa, Barranquilla, Colombia
- 13Departamento de Ciencias Básicas, Universidad Metropolitana, Barranquilla, Colombia
- 14Facultad de Educación, Universidad Nacional Intercultural de la Selva Central Juan Santos Atahualpa, La Merced, Chanchamayo, Peru
- 15Laboratorio de Neurociencias y Psicología Experimental, Área de Farmacología, Facultad de Ciencias Médicas, Universidad Nacional de Cuyo, Mendoza, Argentina
- 16Universidad Nacional Autónoma Altoandina de Tarma,Tarma, Peru
Introduction: The prevalence of depressive symptoms among healthcare professionals has increased significantly, highlighting the need for valid and reliable ultra-rapid screening tools in high-demand clinical settings.
Objective: To analyze the reliability and convergent validity of the Patient Health Questionnaire-2 (PHQ-2) among healthcare professionals in Argentina.
Method: A cross-sectional eHealth study was conducted with 2,835 healthcare professionals (47% men, 53% women) working in public (57.2%) and private (42.8%) institutions. Participants completed the PHQ-2, PHQ-9, GAD-7, and Mini-Z scales in two phases, with a three-month interval. The convergent validity of the test was determined by comparing it with the original version of the PHQ, an anxiety test (GAD-7), and a burnout test (Mini-Z). In addition, internal consistency was calculated using Cronbach's alpha coefficient and McDonald's omega coefficient.
Results: The PHQ-2 showed a unidimensional structure, explaining 73% of the variance, and demonstrated strong convergent validity, with high correlations with the PHQ-9 (r = 0.836; p < 0.001) and the GAD-7 (r = 0.724; p < 0.001), and a moderate correlation with the Mini-Z (r = 0.568; p < 0.001). Internal consistency was satisfactory (α = 0.75; ω = 0.85), confirming its reliability as a screening tool.
Conclusions: The PHQ-2 is a valid and reliable instrument for the ultra-rapid detection of depressive symptoms in healthcare professionals in Argentina. Its strong psychometric properties and brief format make it a useful tool for mental health screening in high-demand healthcare settings.
1 Introduction
The COVID-19 pandemic caused a widespread deterioration of mental health worldwide; anxiety and depression became one of the most recurrent emotional disorders (1–3). In particular, healthcare professionals faced exceptional psychological demands throughout the pandemic, resulting from continuous clinical duties, increased workloads, and the absence of clear expectations about how the crisis would evolve (4–6). In this context, emotional disorders such as depression became some of the most disabling mental conditions, affecting both individual well-being and professional performance (7–9).
In the general population, the prevalence of depressive symptoms reached 28.18% during the pandemic and decreased to 20.0% in the post-pandemic period (10, 11). These rates were consistently higher among healthcare professionals, who exhibited greater levels of depressive symptoms in both phases.
Recent studies have reported that between 22.8% and 37.1% of healthcare professionals experienced depressive symptoms during the pandemic (12, 13). In the post-pandemic period, the figures ranged from 20.5% to 50.7%, consistently exceeding the rates observed in the general population (14–16). However, in Latin America the situation was even more critical, due to the structural fragility of healthcare systems, the scarcity of resources, and political instability; factors that exacerbated the psychological burden on healthcare workers (17, 18). In countries such as Argentina, Colombia, and Peru, the prevalence of depressive symptoms among healthcare personnel ranged between 59.18% and 91.7% (19–21); with an increase in the incidence of occupational burnout (59.8%) and cognitive dysfunctions (69.2%), associated with the care of patients infected with coronavirus (22, 23).
The repercussions of depression in this group go beyond the individual level. Several studies have shown that depressive symptoms are linked to concentration difficulties, which increase the risk of medical errors and reduce the quality of care (23, 24). These results emphasize the need for rapid and effective tools to detect depressive symptoms among healthcare professionals. This is particularly relevant in hospital environments, where high clinical workloads and limited resources often hinder the implementation of systematic assessments (25–27).
The PHQ-2 is the abbreviated version of the Patient Health Questionnaire-9 (PHQ-9). It consists of two items that evaluate the core symptoms of depression, depressed mood and anhedonia, as recognized by the DSM (28, 29). This brief version demonstrated adequate reliability and convergent validity across different clinical contexts. In the original validation study, conducted in eight primary care clinics and seven obstetrics and gynecology centers, it achieved a sensitivity of 83% and a specificity of 90% for major depression (28). Subsequently, similar diagnostic accuracy was confirmed, with a sensitivity of 86% and a specificity of 78% in comparable clinical samples (30). Likewise, a study reported an area under the curve (AUC) of 0.84, and a meta-analysis including more than 40,000 participants estimated an overall AUC of 0.89 (31, 32). In Latin America, studies in rural Mexico and Brazil also confirmed its validity. The Mexican study reported a sensitivity of 80% and a specificity of 87%, whereas the Brazilian sample of women in primary care demonstrated adequate discriminant validity (33, 34).
Moreover, recent research in hospital and community settings has confirmed the usefulness of the PHQ-2 as a brief screening tool, emphasizing its value in contexts with limited time and resources (35–37). However, in Argentina, there are still no reports describing its psychometric properties. Across Latin America, the only formal validation published to date refers to healthcare professionals in Colombia (38). That investigation, conducted with 725 physicians and nurses during the COVID-19 pandemic, demonstrated high convergent validity between the PHQ-2 and PHQ-9 (r = 0.860; p < 0.001), adequate internal consistency (α = 0.80; ω = 0.76), and strong item–total correlations (r = 0.910 and r = 0.924). These results confirm that the PHQ-2 is a reliable and valid instrument for the rapid detection of depressive symptoms in high-demand healthcare settings. Therefore, this research aimed to analyze its reliability and convergent validity among Argentine healthcare professionals.
2 Materials and methods
This study constitutes a secondary analysis of original data obtained from an international multicenter project that evaluated mental health indicators in Latin America and the Caribbean following the COVID-19 pandemic (INV.140-03-001-18). Between May 12, 2022, and November 27, 2023, data were collected through an electronic questionnaire administered in two stages. The instrument was distributed using a snowball sampling strategy. The overall sample included more than 23,000 participants from 15 Latin American countries, encompassing healthcare professionals, outpatients, university students, and community members from diverse occupational backgrounds.
This analysis examined the internal reliability of the PHQ-2 and its construct validity through factor analysis. Convergent validity was also tested using correlations with the PHQ-9, GAD-7, and Mini-Z scales, allowing for the assessment of its internal structure and its relationship with depression, anxiety, and burnout.
2.1 Participants
From the total database, 3,931 records corresponded to physicians and nurses from Argentina. Among them, 2,948 participants (75%) were selected because they had completed the initial form, and their identity and professional role were verified through cross-validation of personal and institutional email addresses. However, only 2,835 subjects completed the second digital form, which therefore constituted the final analyzed sample. The sample size included all valid available cases, with no a priori estimation.
2.2 Procedure
In the original study, an automated Google form was used that included a unique identifier for each subject, demographic data (age, sex, type of profession, employment status, and institutional email). Additionally, questions on lifestyle habits and health conditions were included, along with the administration of two psychometric instruments: the PHQ-9 and the GAD-7. To date, two publications have documented the psychometric properties of one of the instruments used in this research (2, 39).
From the 3,931 initial records of Argentine healthcare personnel, 75% (n = 2,948) were randomly selected through an automated algorithm that prioritized verifiable personal and institutional email addresses. This subsample was used to cross-validate identity and data consistency according to methodological criteria, which optimized internal validity and reduced potential bias. Operational criteria were also applied to ensure a manageable response rate and an appropriate logistical workload. Finally, 2,835 subjects completed the second digital form, constituting the final analyzed sample.
The selected participants received a second questionnaire that included new questions on lifestyle and health conditions, along with the Mini-Z and PHQ-2 instruments. The estimated duration of the first assessment was approximately eight minutes, while the second required about six minutes. A minimum interval of three months was established between both phases based on two main considerations. First, to reduce the possibility of learning or recall bias resulting from the repetition of similar items between the PHQ-9 and PHQ-2. Second, to accommodate the logistical conditions of the multicenter project (INV.140-03-001-18), which was implemented in successive stages during the pandemic and, in several cases, made the simultaneous administration of both instruments unfeasible.
To minimize selection bias, follow-up communications were sent only to participants who had completed the first phase and provided valid email addresses, allowing personal and institutional cross-validation. This measure reduced the risk of identification errors and contributed to maintaining the integrity of the longitudinal data. Additionally, to prevent multiple submissions, the Google Form was configured to accept only one response per institutional email account. This restriction was complemented by a manual review of duplicates through the comparison of unique identifiers and key demographic variables, further reinforcing the reliability and uniqueness of the data collection process.
In the first assessment phase, 983 records were excluded (3,931 → 2,948), and in the second phase, an additional 113 participants (2,948 → 2,835) were removed. These exclusions were due to unverified professional identity, incomplete forms, duplicate records, or multiple entries with inconsistent identifiers.
2.3 Instrument
The PHQ-2 is a brief tool derived from the PHQ-9 (28). It consists of two items that evaluate depressed mood and anhedonia over the previous two weeks. Each item is rated on a scale from 0 ("not at all") to 3 ("nearly every day"), with a total score ranging from 0 to 6. These two items represent the core symptoms of depression defined by the DSM diagnostic criteria and form the diagnostic basis of a major depressive episode. The original proposal has been replicated in multiple settings, demonstrating that this ultra-brief version maintains adequate validity and clinical utility (28). The validation of the Spanish version in a Colombian population, specifically among healthcare professionals, reported high internal consistency (Cronbach’s α = 0.80) (38). A cutoff score of ≥ 3 has been recommended as an indicator of major depression, supported by robust meta-analytic evidence (40).
To assess the convergent validity of PHQ-2, three measures associated with depressive symptoms were used. First, the original nine-item version of PHQ was administered (41). This scale measures the frequency of depressive symptoms over the previous two weeks. Each item is scored from 0 (“not at all”) to 3 (“nearly every day”), yielding a total score from 0 to 27. Scores of 10 or higher indicate moderate to severe depression. The PHQ-9 has demonstrated excellent psychometric properties and has been validated in a wide range of clinical and population-based studies worldwide (42–44).
Likewise, the Generalized Anxiety Disorder scale (GAD-7) was used (45). It is a brief seven-item instrument that assesses symptoms of generalized anxiety over the previous two weeks. Each item is rated on a Likert scale from 0 (“not at all”) to 3 (“nearly every day”), yielding a total score ranging from 0 to 21. Scores of 10 or higher indicate moderate to severe anxiety. The Spanish version has shown adequate validity among healthcare professionals in Colombia and Bolivia (46). It is a widely used measure in Latin America and the Caribbean, with excellent psychometric properties (2, 39).
Finally, the Mini-Z scale was administered (47, 48). This brief 10-item instrument assesses burnout and psychosocial factors in the workplace, including job satisfaction, workload, perceived control, stress, and alignment with institutional values. Item 1 evaluates emotional exhaustion and is considered the main indicator of burnout, with scores ≥ 3 indicating clinical risk. Responses are recorded using Likert or dichotomous formats, as appropriate. The Spanish version for Spanish-speaking populations, developed by our working research team, was used (49).
The inclusion of the GAD-7 and the Mini-Z aimed to examine the convergent validity of PHQ-2 with emotional constructs closely related to depression, such as anxiety and burnout. Several studies have demonstrated that these conditions often coexist among healthcare professionals, although they represent distinct constructs (50–52). This approach allowed us to analyze whether PHQ-2 was significantly associated with related emotional states, reinforcing its convergent validity without compromising its specificity as a depression screening tool.
2.4 Data analysis
A multi-stage statistical analysis was conducted to assess the psychometric properties of the PHQ-2. First, descriptive statistics were calculated for each item, including mean (M), standard deviation (SD), kurtosis, skewness, and 95% confidence intervals. Subsequently, convergent validity was examined using Pearson’s correlation coefficients between the total PHQ-2 score and the PHQ-9, GAD-7, and Mini-Z scores. This procedure aimed to verify whether the PHQ-2 measured the same theoretical construct related to depressive symptomatology. Although the items followed a Likert-type format, the total score was treated as a continuous variable, in accordance with widely accepted psychometric standards (53).
The normality of score distributions was assessed using the Kolmogorov–Smirnov test, which was nonsignificant and therefore supported the use of parametric statistics. Consequently, Pearson’s correlations were performed, and the use of Spearman’s coefficients was ruled out since no assumption violations were detected. Internal consistency was then estimated through Cronbach’s alpha and McDonald’s omega coefficients, as well as corrected item–total correlations, to determine item homogeneity.
Additionally, the unidimensionality of the PHQ-2 was explored through a factor analysis, acknowledging that with two items this procedure yields a just-identified model (with no degrees of freedom). Therefore, the analysis was interpreted solely as complementary evidence (focused on factor loadings and inter-item correlation), and internal validity was primarily supported by reliability and convergent validity, as recommended by scale development guidelines (54, 55). According to structural modeling guidelines, two-indicator factors may be used for confirmatory purposes under these precautions and with limited interpretation (56, 57). All analyses were conducted using R software, version 1.3.1056.
2.5 Ethical considerations
Participants were previously informed about the objectives of the study and provided their informed consent through an online form. To ensure the accuracy of the information, each individual’s participation and professional role were verified via telephone and email contact. Additionally, participants were provided with a document containing psychological recommendations for managing stress and depressive symptoms. This consent also included the possibility that their data could be used in subsequent analyses for research purposes, including the validation of psychometric instruments. The study was conducted in accordance with national and international ethical guidelines for research involving human beings, in line with the Declaration of Helsinki of 1975, revised in 2008. The project was approved by the ethics committee of Universidad de La Costa (Code 086/2020 and 173/2022) which led the study.
3 Results
Participants ranged in age from 29.18 to 47.04 years (M = 39.22, SD = 13.61); 47% were men (n = 1,332) and 53% were women (n = 1,503). Regarding professional category, 28.36% were general practitioners, 37.24% specialists, and 34.4% nursing staff. Among the total participants, 42.8% worked in private institutions and 57.2% in the public healthcare system. Table 1 shows that the two PHQ-2 items presented similar means, indicating consistency in responses. Likewise, the distributions were approximately symmetric, without pronounced tails, and the narrow confidence intervals demonstrate the stability of the estimates.
The confirmatory factor analysis (CFA) results (Table 2) supported the unidimensional structure of the PHQ-2. Because this model is just-identified (with no degrees of freedom), global fit indices (χ², RMSEA, CFI, SRMR) were not reported, as they are uninformative for this type of structure. Instead, emphasis was placed on the factor loadings and inter-item correlation as evidence of unidimensionality.
The Confirmatory Factor Analysis (CFA) model of the PHQ-2 showed high standardized loadings for both items (λ = 0.82 and λ = 0.84), indicating strong saturation on the latent depression factor. The associated errors (ϵ = 0.29 and 0.28) were low, suggesting little unexplained residual variance. Similarly, the variance explained by each item was high (λ² = 0.71 and 0.73, respectively), supporting the stability of the proposed model. These results were interpreted as complementary evidence of unidimensionality, while construct validity was primarily supported by internal consistency and convergent validity.
3.1 Convergent validity
Then, correlations were conducted between the PHQ-2 and three relevant measures: the original version of the PHQ-9, the Generalized Anxiety Disorder test (GAD-7), and a Burnout Test (Mini-Z) (see Table 3). A very strong correlation was observed with the PHQ-9 (r = 0.836; p < 0.001), confirming that both instruments assess the same theoretical construct. Likewise, a high correlation was identified with the GAD-7 (r = 0.724; p < 0.001) and a moderate but significant correlation with the Mini-Z (r = 0.568; p < 0.001). This differential pattern of magnitudes is consistent with the convergent validity of the PHQ-2 and indicates a stronger alignment with depressive symptomatology than with anxiety or burnout; however, given shared internalizing variance and the two-item format, these findings should be interpreted as convergence predominance rather than strict discriminant validity or diagnostic specificity.
3.2 Reliability analysis
Subsequently, internal consistency analyses of the PHQ-2 were performed. As shown in Table 4, the instrument demonstrated acceptable reliability, with a Cronbach’s alpha coefficient of 0.75 (95% CI: 0.70–0.79), indicating adequate response consistency. The item-total correlations were high, with values of 0.889 for Item 1 and 0.898 for Item 2, both significantly above the recommended minimum threshold of 0.30. These results reflect a homogeneous internal structure, supporting the stability and reliability of the PHQ-2 for use in this population.
4 Discussion
We evaluated the reliability and convergent validity of the PHQ-2 for the rapid detection of depressive symptoms among Argentine healthcare professionals. The results showed a unidimensional structure, suggesting that the two test items assess a single construct. The corrected item–total correlations (0.889 and 0.898) substantially exceeded the recommended threshold, showing that both items contribute similarly and robustly to the measurement of depression. Additionally, the descriptive indicators showed approximate symmetry (skewness near zero) and low kurtosis, suggesting the absence of bias in item interpretation and providing additional evidence of the instrument’s stability in application. Nevertheless, because the PHQ-2 is a two-item scale, its CFA results should be interpreted with caution; models of such low complexity provide limited information, and beyond the inter-item correlation they do not allow a meaningful assessment of global fit (58). Therefore, in our study the CFA was used as complementary evidence of unidimensionality, whereas construct validity was primarily supported by reliability and convergent validity analyses (54, 55). A previous study warned that the interpretation of fit indices in simple models depends on sample size, number of items, and structural complexity (59). Even so, a recent validation of the PHQ-4 in a Chilean population has shown that, under conditions of high inter-item correlation and large sample sizes, two-indicator factors can be psychometrically acceptable (60).
Beyond methodological limitations, the psychometric robustness of the PHQ-2 is supported both theoretically and clinically. Its two items, focused on the core symptoms of depression (depressed mood and anhedonia), provide an efficient and accurate assessment of depressive symptomatology, to the extent that it is considered a reliable and valid ultra-brief instrument for detecting major depression (31, 61, 62). Despite its brevity, it achieves high levels of sensitivity (S) and specificity (E), and its clinical utility (AUC) has been widely demonstrated in both clinical and community settings (63–67). In this regard, an individual participant data meta-analysis with over 40,000 participants estimated an overall AUC close to 0.90, with high levels of S and E compared to diagnostic interviews (31). Complementarily, a recent meta-analysis in Spanish-speaking populations confirmed equivalent performance, with pooled S and E indicators of 0.89 and an AUC of 0.87, reinforcing its clinical validity in Latin America and Spanish-speaking communities (68).
Regional studies have shown that the PHQ-2 is a valid and useful instrument for the rapid detection of depressive symptoms. In Chile, found that a cut-off score of ≥3 yielded a sensitivity of 74.6%, a specificity of 93.9%, and an AUC of 0.92 in a rural population (69). Among patients with chronic illnesses, another study reported a diagnostic accuracy of 0.92 for detecting depression cases (67). In Mexico, a community-based study observed comparable performance, with a sensitivity of 80.0%, specificity of 86.9%, and an AUC of 0.89 using a cut-off score of 3 (33). Finally, in Spain, confirmed its validity in primary care, establishing an optimal cut-off score of ≥2 and corroborating its unidimensional structure in the general population (70). This evidence reinforces the clinical utility of the PHQ-2 as a brief, reliable, and valid instrument for the rapid detection of depressive symptoms across various Spanish-speaking contexts, both within and beyond hospital settings.
On the other hand, the convergence of the PHQ-2 with other instruments capturing emotional dimensions related to depression in hospital settings was examined-specifically the PHQ-9, GAD-7, and Mini-Z. First, a high correlation was observed with the PHQ-9 (r = 0.836; p < 0.001), confirming that the two core items of the PHQ-2 accurately reflect the depressive construct assessed by the full scale. This result reinforces its convergent validity and aligns with international evidence, which has documented consistent associations across various populations and clinical settings.
In Chinese populations, very high correlations have been reported between the PHQ-2 and the PHQ-9, both among adolescents (r = 0.86–0.89) and among older adults living in rural areas (71). In primary care settings, a study conducted in Ethiopia reported a strong correlation (r = 0.83), while research from Saudi Arabia also found a high correlation (r = 0.79; R² = 0.62) (72). These findings highlight the ability of the PHQ-2 to function as a brief and effective screening tool in everyday clinical practice settings.
Additionally, adequate levels of convergence between the PHQ-2 and the PHQ-9 have been documented in specific clinical populations, such as patients with heart failure (73), migraine (74), adults with chronic obstructive pulmonary disease (75), and women with infertility (64).
In the Latin American context, a study conducted in Costa Rica reported a very high correlation (r = 0.843; p < 0.001) between the PHQ-9 and GAD-7 in the general population (76); and another conducted in Colombia identified a significant association (ρ = 0.70; p < 0.01) between PHQ-9 and PHQ-2 among primary care users (77). Finally, the only research to date in a population of healthcare professionals in Latin America that has analyzed the correlation between the PHQ-2 and the PHQ-9 was conducted in Colombia (38). This research reported a high correlation (r = 0.860; p < 0.001), reinforcing the robustness of the PHQ-2 as a brief and valid instrument for screening depressive symptoms in high-demand clinical environments such as hospitals and workplaces.
Secondly, the PHQ-2 demonstrated adequate convergence with the GAD-7 (r = 0.724; p < 0.001), indicating that it also captures emotional distress dimensions related to anxiety. This finding is consistent with evidence reported across different international contexts. In Chile, a similar association was observed among migrants (65). In Germany, a moderate correlation (r = 0.64) was reported in both the general population and primary care, confirming the frequent comorbidity between brief depression and generalized anxiety (78). In Iran, significant correlations were identified in patients with infertility, supporting the usefulness of the PHQ-2 in clinical contexts linked to anxiety (79). Consistent correlations between the PHQ-2 and GAD-7 have also been reported among U.S. college students, ranging from ρ = 0.45 to 0.65, supporting the stability of this relationship in younger populations (80).
Thirdly, the PHQ-2 showed a moderate correlation with the Mini-Z (r = 0.568; p < 0.001), supporting its applicability in identifying emotional distress associated with work overload and occupational stress. In Puerto Rico, a study of employees found a positive association between depression and the emotional exhaustion and cynicism dimensions of the Maslach Burnout Inventory (81). Furthermore, moderate correlations among burnout, depression, and anxiety have been reported, confirming that these are related yet conceptually distinct constructs (50).
In hospital settings, this link is reinforced by the role of chronic stress due to work overload in the development of emotional problems such as anxiety and, ultimately, depression. Evidence indicates that prolonged exposure to stress activates the hypothalamic–pituitary–adrenal (HPA) axis, increasing cortisol release and altering neuronal plasticity in the hippocampus and prefrontal cortex (82). These alterations impair emotional regulation and executive functioning, leading to poorer decision-making and the emergence of depressive symptoms (83). Moreover, burnout has been associated with reduced levels of Brain-Derived Neurotrophic Factor (BDNF), a neurotrophin essential for resilience, whose decrease exacerbates cognitive and emotional dysfunction (84). The convergence of these mechanisms reinforces the relationship between chronic occupational stress and vulnerability to depression (85).
Taken together, the available evidence and our findings confirm that the PHQ-2 captures a depressive core closely linked to burnout and other mechanisms related to occupational stress, reinforcing its psychometric robustness and clinical applicability in high-demand healthcare settings.
Finally, the PHQ-2 demonstrated adequate reliability, with a Cronbach’s alpha coefficient of 0.75 and a McDonald’s omega coefficient of 0.85, indicating satisfactory internal consistency. These results are consistent with those obtained in previous validations conducted among healthcare professionals. In Colombia, a study reported an alpha of 0.80 and an omega of 0.76 among medical and nursing staff (38). In Chile, similar coefficients of 0.78 and 0.79 were observed in a rural community sample (69). The similarity of these findings reinforces the psychometric stability of the PHQ-2 as a reliable tool.
Undoubtedly, the PHQ-2 could be successfully integrated among ultra-rapid depression screening tests, as it maintains its diagnostic validity even in its abbreviated two-item version (36, 86). Indeed, the simplification of assessment scales is a key strategy for optimizing the detection of depressive symptoms without compromising validity (87). Its use in time-constrained settings, common in some hospital services, reduces the burden on both professionals and patients, ensuring accurate detection. In this context, the PHQ-2 facilitates the early identification of at-risk cases and supports associated interventions.
This trend toward brief assessment tools is also evident in instruments such as the Beck Depression Inventory–FastScreen (BDI-FS) and the 10-item Center for Epidemiologic Studies Depression Scale (CES-D-10), abbreviated versions of the BDI-II and CES-D. Both have shown strong psychometric performance, maintaining sensitivity and specificity levels comparable to their original versions (88–90). Their usefulness has been demonstrated in occupational environments characterized by high exposure to stressors, which underscores the relevance of our findings. In this regard, the PHQ-2 can be included among ultra-brief tools with clinical validity for detecting depressive symptoms across diverse contexts, distinguished by its simplicity and ease of use.
However, our study presents some limitations that should be noted. First, the cross-sectional design prevents the establishment of causal relationships between the variables analyzed; moreover, the premorbid status of the participants was not known. The sampling, although appropriate for this type of population, may limit the generalizability of the findings to all healthcare professionals in Argentina. Additionally, factorial invariance by sex was not assessed, which could be explored in future research to strengthen the structural validity of the instrument.
It should also be acknowledged that the factor analysis applied to the PHQ-2 has a limited interpretative scope, since two-item models are just-identified and lack degrees of freedom. Therefore, this procedure was used solely as complementary evidence of unidimensionality; whereas internal validity relied mainly on internal consistency and convergent validity indices.
Another limitation concerns the three-month interval between the PHQ-9 and PHQ-2 administrations. Although this separation helped minimize potential memory effects from repeated items and was consistent with the multicenter study’s data collection schedule, it may also have led to variations in participants’ depressive symptoms. Therefore, convergent validity results should be interpreted with caution, and future studies should apply both versions concurrently to obtain more accurate estimates.
Similarly, a formal analysis of divergent validity was not conducted, which limits the ability to confirm that the PHQ-2 specifically measures depressive symptoms and not overlapping constructs such as anxiety or burnout. Future research should incorporate measures of divergent validity to strengthen the psychometric evidence of the instrument.
Finally, online administration may have limited the participation of professionals with restricted access to digital resources, introducing a potential selection bias. Future studies could adopt mixed methodologies, combining online surveys with in-person data collection to improve sample representativeness. Likewise, it would be relevant to compare the PHQ-2 with other ultra-rapid scales to assess its diagnostic accuracy across different clinical contexts.
Despite these limitations, the present study provides consistent psychometric evidence supporting the reliability and validity of the PHQ-2 among healthcare professionals in Argentina. Its strong internal consistency and high correlation with the original nine-item version (PHQ-9) support its convergent validity, confirming that both instruments assess the same depressive construct. In addition, the moderate associations observed with anxiety and burnout indicators reinforce its concurrent validity, demonstrating that the PHQ-2 can also capture emotional dimensions frequently intertwined with depressive symptomatology. Together, these findings confirm the theoretical coherence and measurement stability of the instrument. Although future studies should continue to examine its factorial invariance and temporal stability, the PHQ-2 stands out as a brief and reliable measure that could be useful for the rapid detection of depressive symptoms in healthcare populations.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by Ethics Committee of the Universidad de La Costa, Barranquilla, Colombia (Approval Ref: ACT No. 089 of 2021). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
NC: Conceptualization, Methodology, Project administration, Resources, Writing – original draft, Writing – review & editing, Data curation, Funding acquisition, Investigation, Software, Supervision. BM-A: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing, Software. J-CC: Data curation, Investigation, Project administration, Supervision, Writing – original draft, Writing – review & editing, Conceptualization, Funding acquisition, Methodology, Resources, Visualization. LC-A: Data curation, Formal analysis, Validation, Visualization, Writing – original draft, Writing – review & editing, Conceptualization, Investigation, Methodology, Project administration. CQ-A: Formal analysis, Methodology, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing, Data curation. CS: Data curation, Formal analysis, Investigation, Validation, Writing – original draft, Writing – review & editing, Methodology, Project administration, Software, Visualization. DS: Writing – original draft, Writing – review & editing, Data curation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization. DR-C: Writing – original draft, Writing – review & editing, Data curation, Formal analysis, Investigation, Methodology, Software, Supervision, Validation, Visualization. WB: Data curation, Formal analysis, Methodology, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. JM: Data curation, Investigation, Methodology, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. RD-S: Data curation, Formal analysis, Investigation, Methodology, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. CC: Data curation, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. YF: Writing – original draft, Writing – review & editing, Conceptualization, Data curation, Formal analysis, Methodology, Software, Supervision, Validation, Visualization. RQ-A: Data curation, Formal analysis, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Writing – original draft, Writing – review & editing. CA-D: Data curation, Formal analysis, Investigation, Methodology, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. AP-R: Conceptualization, Data curation, Funding acquisition, Methodology, Project administration, Resources, Supervision, Validation, Writing – original draft, Writing – review & editing. PG: Conceptualization, Formal analysis, Methodology, Project administration, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. JC: Conceptualization, Data curation, Investigation, Methodology, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. NL: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Resources, Software, Supervision, Writing – original draft, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research and/or publication of this article. This study was funded by la Universidad de la Costa under the research project code INV.140-03-001-18.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Byrne A, Barber R, and Lim CH. Impact of the COVID-19 pandemic – a mental health service perspective. Prog Neurol Psychiatry. (2021) 25:27. doi: 10.1002/pnp.708
2. López N, Coronado JC, Ripoll-Córdoba D, Caldichoury N, Quispe-Ayala C, Morales-Asencio B, et al. Factorial invariance of the generalized anxiety disorder scale (GAD-7) in Latin America and the Caribbean. Front Psychiatry. (2025) 16. doi: 10.3389/fpsyt.2025.1529424
3. Santomauro DF, Mantilla Herrera AM, Shadid J, Zheng P, Ashbaugh C, Pigott DM, et al. Global prevalence and burden of depressive and anxiety disorders in 204 countries and territories in 2020 due to the COVID-19 pandemic. Lancet. (2021) 398:1700–12. doi: 10.1016/S0140-6736(21)02143-7
4. Chutiyami M, Cheong AMY, Salihu D, Bello UM, Ndwiga D, Maharaj R, et al. COVID-19 pandemic and overall mental health of healthcare professionals globally: A meta-review of systematic reviews. Front Psychiatry. (2022) 12. doi: 10.3389/fpsyt.2021.804525
5. Kelly D, Schroeder S, and Leighton K. Anxiety, depression, stress, burnout, and professional quality of life among the hospital workforce during a global health pandemic. J Rural Health. (2022) 38:795–804. doi: 10.1111/jrh.12659
6. Majid U, Hussain SAS, Zahid A, Haider MH, and Arora R. Mental health outcomes in health care providers during the COVID-19 pandemic: an umbrella review. Health Promot Int. (2023) 38:1–11. doi: 10.1093/heapro/daad025
7. Garcia GPA, Fracarolli IFL, dos Santos HEC, de Oliveira SA, Martins BG, Santin Junior LJ, et al. Depression, anxiety and stress in health professionals in the COVID-19 context. Int J Environ Res Public Health. (2022) 19:4402. doi: 10.3390/ijerph19074402
8. Al Maqbali M, Alsayed A, Hughes C, Hacker E, and Dickens GL. Stress, anxiety, depression and sleep disturbance among healthcare professional during the COVID-19 pandemic: An umbrella review of 72 meta-analyses. PLoS One. (2024) 19:e0302597. doi: 10.1371/journal.pone.0302597
9. Bada W, Villacorta YR, Novoa-Pallares O, Torres A, Gonzalez C, Wisum U, et al. Effectiveness of the PHQ-9 for detecting depressive symptoms in Peruvian Amazonian indigenous people. Gen Hosp Psychiatry. (2025) 97:1–2. doi: 10.1016/j.genhosppsych.2025.08.014
10. Mahmud S, Mohsin M, Dewan M, and Muyeed A. The global prevalence of depression, anxiety, stress, and insomnia among general population during COVID-19 pandemic: A systematic review and meta-analysis. Trends Psychol. (2022) 31:143–70. doi: 10.1007/s43076-021-00116-9
11. Min S, Jeong YH, Kim J, Koo JW, and Ahn YM. The aftermath: post-pandemic psychiatric implications of the COVID-19 pandemic, a South Korean perspective. Front Psychiatry. (2021) 12. doi: 10.3389/fpsyt.2021.671722
12. Pappa S, Ntella V, Giannakas T, Giannakoulis VG, Papoutsi E, and Katsaounou P. Prevalence of depression, anxiety, and insomnia among healthcare workers during the COVID-19 pandemic: A systematic review and meta-analysis. Brain Behav Immun. (2020) 88:901–7. doi: 10.1016/j.bbi.2020.05.026
13. Marvaldi M, Mallet J, Dubertret C, Moro MR, and Guessoum SB. Anxiety, depression, trauma-related, and sleep disorders among healthcare workers during the COVID-19 pandemic: A systematic review and meta-analysis. Neurosci Biobehav Rev. (2021) 126:252–64. doi: 10.1016/j.neubiorev.2021.03.024
14. Liu D, Zhou Y, Tao X, Cheng Y, and Tao R. Mental health symptoms and associated factors among primary healthcare workers in China during the post-pandemic era. Front Public Health. (2024) 12. doi: 10.3389/fpubh.2024.1374667
15. Iraís PFL, Edén PTM, and Erika RT. Post-pandemic depression, anxiety, and stress: A look at the mental health of medical and administrative staff. Acta Psychol (Amst). (2025) 253:104706. doi: 10.1016/j.actpsy.2025.104706
16. Grünheid T and Hazem A. Mental wellbeing of frontline health workers post-pandemic: lessons learned and a way forward. Front Public Health. (2023) 11. doi: 10.3389/fpubh.2023.1204662
17. Escobar-Viera CG, Cernuzzi LC, Miller RS, Rodríguez-Marín HJ, Vieta E, González Toñánez M, et al. Feasibility of mHealth interventions for depressive symptoms in Latin America: a systematic review. Int Rev Psychiatry. (2021) 33:300–11. doi: 10.1080/09540261.2021.1887822
18. Limenih G, MacDougall A, Wedlake M, and Nouvet E. Depression and global mental health in the global south: A critical analysis of policy and discourse. Int J Soc Determinants Health Health Services. (2024) 54:95–107. doi: 10.1177/27551938231220230
19. Giardino DL, Huck-Iriart C, Riddick M, and Garay A. The endless quarantine: the impact of the COVID-19 outbreak on healthcare workers after three months of mandatory social isolation in Argentina. Sleep Med. (2020) 76:16–25. doi: 10.1016/j.sleep.2020.09.022
20. Moya-Salazar J, Saciga-Saavedra W, Cañari B, and Contreras-Pulache H. Depression in healthcare workers from the COVID-19 Care and Isolation Center - Villa Panamericana: a single-center prospective study in Peru. Einstein (São Paulo). (2022) 20:1–6. doi: 10.31744/einstein_journal/2022AO6707
21. Peñaranda A, García E, Pérez-Herrera LC, Trojan A, Peñaranda D, Molina J, et al. Effect of the COVID-19 pandemic on the mental health, daily and occupational activities among health professionals in Colombia: a national study. BMC Psychiatry. (2022) 22:682. doi: 10.1186/s12888-022-04337-9
22. Caldichoury N, García-Roncallo P, Saldías C, Zurita B, Castellanos C, Herrera-Pino J, et al. Impacto psicológico del COVID-19 en los trabajadores sanitarios durante el segundo año de pandemia en Latinoamérica: estudio de encuesta transversal. Rev Colomb Psiquiatr. (2023) 54:250–259. doi: 10.1016/j.rcp.2023.04.010
23. Soto-Añari M, Rivera-Fernández C, Ramos-Vargas L, Denegri-Solis L, Herrera-Pino J, Camargo L, et al. Prevalencia y Factores Asociados con las Quejas Cognitivas Subjetivas del Personal Sanitario Latinoamericano Durante la Pandemia de COVID-19. Rev Colomb Psiquiatr. (2024) 53:340–6. doi: 10.1016/j.rcp.2022.07.008
24. Ekpenyong E and Edako C. Prevalence of iatrogenesis in the Nigerian healthcare delivery system: A conceptual analysis. SSRN Electronic J. (2024). doi: 10.2139/ssrn.5040697
25. Brower KJ. Professional stigma of mental health issues: physicians are both the cause and solution. Acad Med. (2021) 96:635–40. doi: 10.1097/ACM.0000000000003998
26. Fekadu A, Demissie M, Birhane R, Medhin G, Bitew T, Hailemariam M, et al. Under detection of depression in primary care settings in low and middle-income countries: a systematic review and meta-analysis. Syst Rev. (2022) 11:21. doi: 10.1186/s13643-022-01893-9
27. Søvold LE, Naslund JA, Kousoulis AA, Saxena S, Qoronfleh MW, Grobler C, et al. Prioritizing the mental health and well-being of healthcare workers: an urgent global public health priority. Front Public Health. (2021) 9. doi: 10.3389/fpubh.2021.679397
28. Kroenke K, Spitzer RL, and Williams JBW. The patient health questionnaire-2. Med Care. (2003) 41:1284–92. doi: 10.1097/01.MLR.0000093487.78664.3C
29. Spitzer RL. Validation and utility of a self-report version of PRIME-MD: the PHQ primary care study. JAMA. (1999) 282:1737. doi: 10.1001/jama.282.18.1737
30. Arroll B, Goodyear-Smith F, Crengle S, Gunn J, Kerse N, Fishman T, et al. Validation of PHQ-2 and PHQ-9 to screen for major depression in the primary care population. Ann Family Med. (2010) 8:348–53. doi: 10.1370/afm.1139
31. Levis B, Sun Y, He C, Wu Y, Krishnan A, Bhandari PM, et al. Accuracy of the PHQ-2 alone and in combination with the PHQ-9 for screening to detect major depression. JAMA. (2020) 323:2290. doi: 10.1001/jama.2020.6504
32. Löwe B, Kroenke K, and Gräfe K. Detecting and monitoring depression with a two-item questionnaire (PHQ-2). J Psychosom Res. (2005) 58:163–71. doi: 10.1016/j.jpsychores.2004.09.006
33. Arrieta J, Aguerrebere M, Raviola G, Flores H, Elliott P, Espinosa A, et al. Validity and utility of the patient health questionnaire (PHQ)-2 and PHQ-9 for screening and diagnosis of depression in rural chiapas, Mexico: A cross-sectional study. J Clin Psychol. (2017) 73:1076–90. doi: 10.1002/jclp.22390
34. De Lima Osório F, Vilela Mendes A, Crippa JA, and Loureiro SR. Study of the discriminative validity of the PHQ-9 and PHQ-2 in a sample of Brazilian women in the context of primary health care. Perspect Psychiatr Care. (2009) 45:216–27. doi: 10.1111/j.1744-6163.2009.00224.x
35. Larsen A, Pintye J, Odhiambo B, Mwongeli N, Marwa MM, Watoyi S, et al. Comparing depression screening tools (CESD-10, EPDS, PHQ-9, and PHQ-2) for diagnostic performance and epidemiologic associations among postpartum Kenyan women: Implications for research and practice. J Affect Disord. (2023) 324:637–44. doi: 10.1016/j.jad.2022.12.101
36. Giuliani M, Gorini A, Barbieri S, Veglia F, and Tremoli E. Examination of the best cut-off points of PHQ-2 and GAD-2 for detecting depression and anxiety in Italian cardiovascular inpatients. Psychol Health. (2021) 36:1088–101. doi: 10.1080/08870446.2020.1830093
37. Lino VTS, Portela MC, Camacho LAB, Atie S, Lima MJB, Rodrigues NCP, et al. Screening for depression in low-income elderly patients at the primary care level: use of the patient health questionnaire-2. PLoS One. (2014) 9:e113778. doi: 10.1371/journal.pone.0113778
38. Camargo L, Soto-Añari M, Caldichoury-Obando N, Ramos-Henderson M, Porto MF, Salomón S, et al. Validez del cuestionario PHQ-2 para la detección de depresión en personal sanitario Colombiano durante Covid-19. Rev Chil Neuropsiquiatr. (2022) 60:281–8. doi: 10.4067/s0717-92272022000300281
39. López N, Morales-Asencio B, Ripoll-Córdoba D, Coronado-López J, Caldichoury N, Quispe-Ayala C, et al. Internal validity and reliability of the GAD-7 test in Latin America. Chronic Stress. (2025) 9:1–9. doi: 10.1177/24705470251315260
40. Manea L, Gilbody S, Hewitt C, North A, Plummer F, Richardson R, et al. Identifying depression with the PHQ-2: A diagnostic meta-analysis. J Affect Disord. (2016) 203:382–95. doi: 10.1016/j.jad.2016.06.003
41. Kroenke K, Spitzer RL, and Williams JBW. The PHQ-9: validity of a brief depression severity measure. J Gen Intern Med. (2001) 16:606–13. doi: 10.1046/j.1525-1497.2001.016009606.x
42. Faro A, Nunes D, and Falk D. Depressive symptomatology in Brazil: perspectives of statistical and psychometrics analyses of the PHQ-9 at four time-points (2020–2023) in the COVID-19 pandemic. Front Psychol. (2025) 16. doi: 10.3389/fpsyg.2025.1440054
43. Di Matteo R, Bolgeo T, Simonelli N, Dal Molin A, Bassola B, Lusignani M, et al. Psychometric properties and measurement invariance of the patient health questionnaire 9 in an italian coronary heart disease population. J Cardiovasc Nurs. (2025) 40:452–461. doi: 10.1097/JCN.0000000000001178
44. Bianchi R, Verkuilen J, Toker S, Schonfeld IS, Gerber M, Brähler E, et al. Is the PHQ-9 a unidimensional measure of depression? A 58,272-participant study. Psychol Assess. (2022) 34:595–603. doi: 10.1037/pas0001124
45. Spitzer RL, Kroenke K, Williams JBW, and Löwe B. A brief measure for assessing generalized anxiety disorder. Arch Intern Med. (2006) 166:1092. doi: 10.1001/archinte.166.10.1092
46. Porto MF, Ocampo-Barba N, Flores-Valdivia G, Caldichoury N, and López N. Propiedades psicométricas del GAD-7 para detectar ansiedad generaliza en profesionales sanitarios de Bolivia. Rev Peru Med Exp Salud Publica. (2022) 39:115–6. doi: 10.17843/rpmesp.2022.391.8620
47. Shanafelt TD, West CP, Sloan JA, Novotny PJ, Poland GA, Menaker R, et al. Career fit and burnout among academic faculty. Arch Intern Med. (2009) 169:990. doi: 10.1001/archinternmed.2009.70
48. Shanafelt TD, Gorringe G, Menaker R, Storz KA, Reeves D, Buskirk SJ, et al. Impact of organizational leadership on physician burnout and satisfaction. Mayo Clin Proc. (2015) 90:432–40. doi: 10.1016/j.mayocp.2015.01.012
49. Rivera-Fernández C, Soto-Añari M, Camargo L, Caldichoury N, Ramos L, Porto MF, et al. Validación del Mini-Z para la detección de burnout en personal sanitario Colombiano. Rev Colomb Psiquiatr. (2023) 52:S132–3. doi: 10.1016/j.rcp.2021.04.002
50. Koutsimani P, Montgomery A, and Georganta K. The relationship between burnout, depression, and anxiety: A systematic review and meta-analysis. Front Psychol. (2019) 10. doi: 10.3389/fpsyg.2019.00284
51. Shanafelt TD, Hasan O, Dyrbye LN, Sinsky C, Satele D, Sloan J, et al. Changes in burnout and satisfaction with work-life balance in physicians and the general US working population between 2011 and 2014. Mayo Clin Proc. (2015) 90:1600–13. doi: 10.1016/j.mayocp.2015.08.023
52. Pokhrel NB, Khadayat R, and Tulachan P. Depression, anxiety, and burnout among medical students and residents of a medical school in Nepal: a cross-sectional study. BMC Psychiatry. (2020) 20:298. doi: 10.1186/s12888-020-02645-6
53. Carifio J and Perla R. Resolving the 50-year debate around using and misusing Likert scales. Med Educ. (2008) 42:1150–2. doi: 10.1111/j.1365-2923.2008.03172.x
54. Worthington RL and Whittaker TA. Scale development research. Couns Psychol. (2006) 34:806–38. doi: 10.1177/0011000006288127
55. Carpenter S. Ten steps in scale development and reporting: A guide for researchers. Commun Methods Meas. (2018) 12:25–44. doi: 10.1080/19312458.2017.1396583
56. Kline RB. Principles and practice of structural equation modeling. 4.a ed. New York, NY, USA: The Guilford Press (2016).
58. Eisinga R, Grotenhuis Mt, and Pelzer B. The reliability of a two-item scale: Pearson, Cronbach, or Spearman-Brown? Int J Public Health. (2013) 58:637–42. doi: 10.1007/s00038-012-0416-3
59. Goretzko D, Siemund K, and Sterner P. Evaluating model fit of measurement models in confirmatory factor analysis. Educ Psychol Meas. (2024) 84:123–44. doi: 10.1177/00131644231163813
60. Moreno X and Moreno F. Factor structure and measurement invariance of the Patient Health Questionnaire-4 among the Chilean population. PLoS One. (2024) 19:e0304623. doi: 10.1371/journal.pone.0304623
61. Maurer DM, Raymond TJ, and Davis BN. Depression: screening and diagnosis. Am Fam Physician. (2018) 98:508–15.
62. Gelaye B, Wilson I, Berhane HY, Deyessa N, Bahretibeb Y, Wondimagegn D, et al. Diagnostic validity of the Patient Health Questionnaire-2 (PHQ-2) among Ethiopian adults. Compr Psychiatry. (2016) 70:216–21. doi: 10.1016/j.comppsych.2016.07.011
63. Junaid K, Akram I, Daood M, and Khan A. Validity of the patient health questionnaires (phq-2 and phq-9) for screening depression among human immunodeficiency virus patients in Lahore, Pakistan. World J Virol. (2022) 11:341–51. doi: 10.5501/wjv.v11.i5.341
64. Maroufizadeh S, Omani-Samani R, Almasi-Hashiani A, Amini P, and Sepidarkish M. The reliability and validity of the Patient Health Questionnaire-9 (PHQ-9) and PHQ-2 in patients with infertility. Reprod Health. (2019) 16:137. doi: 10.1186/s12978-019-0802-x
65. Errazuriz A, Beltrán R, Torres R, and Passi-Solar A. The validity and reliability of the PHQ-9 and PHQ-2 on screening for major depression in spanish speaking immigrants in Chile: A cross-sectional study. Int J Environ Res Public Health. (2022) 19:13975. doi: 10.3390/ijerph192113975
66. Ghazisaeedi M, Mahmoodi H, Arpaci I, Mehrdar S, and Barzegari S. Validity, reliability, and optimal cut-off scores of the WHO-5, PHQ-9, and PHQ-2 to screen depression among university students in Iran. Int J Ment Health Addict. (2022) 20:1824–33. doi: 10.1007/s11469-021-00483-5
67. Martínez P, Soto-Brandt G, Brandt S, Guajardo V, and Rojas G. Validación inicial de Patient Health Questionnaire 2-Item para detectar síntomas depresivos en diabéticos y/o hipertensos en la atención primaria de salud. Rev Med Chil. (2020) 148:1614–8. doi: 10.4067/S0034-98872020001101614
68. Martinez A, Teklu SM, Tahir P, and Garcia ME. Validity of the spanish-language patient health questionnaires 2 and 9. JAMA Netw Open. (2023) 6:e2336529. doi: 10.1001/jamanetworkopen.2023.36529
69. Caneo C, Toro P, and Ferreccio C. Validity and performance of the patient health questionnaire (PHQ-2) for screening of depression in a rural Chilean cohort. Community Ment Health J. (2020) 56:1284–91. doi: 10.1007/s10597-020-00605-8
70. Gómez-Gómez I, Benítez I, Bellón J, Moreno-Peral P, Oliván-Blázquez B, Clavería A, et al. Utility of PHQ-2, PHQ-8 and PHQ-9 for detecting major depression in primary health care: a validation study in Spain. Psychol Med. (2023) 53:5625–35. doi: 10.1017/S0033291722002835
71. Liu ZW, Yu Y, Hu M, Liu HM, Zhou L, and Xiao SY. PHQ-9 and PHQ-2 for screening depression in chinese rural elderly. PLoS One. (2016) 11:e0151042. doi: 10.1371/journal.pone.0151042
72. Habtamu K, Birhane R, Medhin G, Hanlon C, and Fekadu A. Psychometric properties of screening questionnaires to detect depression in primary healthcare setting in rural Ethiopia. BMC Primary Care. (2022) 23:138. doi: 10.1186/s12875-022-01755-2
73. Piepenburg SM, Faller H, Gelbrich G, Störk S, Warrings B, Ertl G, et al. Comparative potential of the 2-item versus the 9-item patient health questionnaire to predict death or rehospitalization in heart failure. Circ Heart Fail. (2015) 8:464–72. doi: 10.1161/CIRCHEARTFAILURE.114.001488
74. Seo JG and Park SP. Validation of the Patient Health Questionnaire-9 (PHQ-9) and PHQ-2 in patients with migraine. J Headache Pain. (2015) 16:65. doi: 10.1186/s10194-015-0552-2
75. Liu M, Chang Y, Wang D, Fang J, Carson S, Li B, et al. Validation of the patient health questionnaire (PHQ)-9 and PHQ-2 in patients with COPD in China. BMC Psychiatry. (2025) 25:637. doi: 10.1186/s12888-025-07052-3
76. González-Sánchez A, Ortega-Moreno R, Villegas-Barahona G, Carazo-Vargas E, Arias-LeClaire H, and Vicente-Galindo P. New cut-off points of PHQ-9 and its variants, in Costa Rica: a nationwide observational study. Sci Rep. (2023) 13:14295. doi: 10.1038/s41598-023-41560-0
77. Cassiani-Miranda CA, Cuadros-Cruz AK, Torres-Pinzón H, Scoppetta O, Pinzón-Tarrazona JH, López-Fuentes WY, et al. Validez del Cuestionario de salud del paciente-9 (PHQ-9) para cribado de depresión en adultos usuarios de Atención Primaria en Bucaramanga, Colombia. Rev Colomb Psiquiatr. (2021) 50:11–21. doi: 10.1016/j.rcp.2019.09.001
78. Löwe B, Decker O, Müller S, Brähler E, Schellberg D, Herzog W, et al. Validation and standardization of the generalized anxiety disorder screener (GAD-7) in the general population. Med Care. (2008) 46:266–74. doi: 10.1097/MLR.0b013e318160d093
79. Ghaheri A, Omani-Samani R, Sepidarkish M, Hosseini M, and Maroufizadeh S. The four-item patient health questionnaire for anxiety and depression: A validation study in infertile patients. Int J Fertil Steril. (2020) 4:234–9. doi: 10.22074/ijfs.2020.44412
80. Byrd-Bredbenner C, Eck K, and Quick V. Psychometric properties of the generalized anxiety disorder-7 and generalized anxiety disorder-mini in United States university students. Front Psychol. (2020) 11. doi: 10.3389/fpsyg.2020.550533
81. Rosario-Hernández E, Rovira-Millán LV, Merino-Soto C, and Angulo-Ramos M. Review of the psychometric properties of the Patient Health Questionnaire-9 (PHQ-9) Spanish version in a sample of Puerto Rican workers. Front Psychiatry. (2023) 14. doi: 10.3389/fpsyt.2023.1024676
82. McEwen BS. Neurobiological and systemic effects of chronic stress. Chronic Stress. (2017) 1:1–11. doi: 10.1177/2470547017692328
83. Orosz A, Federspiel A, Haisch S, Seeher C, Dierks T, and Cattapan K. A biological perspective on differences and similarities between burnout and depression. Neurosci Biobehav Rev. (2017) 73:112–22. doi: 10.1016/j.neubiorev.2016.12.005
84. Cortés-Álvarez NY, Lara-Morales A, Bautista-Rodríguez E, Marmolejo-Murillo LG, Díaz Jiménez A, Vergara Hernández LA, et al. Job burnout, cognitive functioning, and Brain-derived neurotrophic factor expression among hospital Mexican nurses. PLoS One. (2024) 19:e0304092. doi: 10.1371/journal.pone.0304092
85. Pihlaja M, Peräkylä J, Erkkilä EH, Tapio E, Vertanen M, and Hartikainen KM. Altered neural processes underlying executive function in occupational burnout—Basis for a novel EEG biomarker. Front Hum Neurosci. (2023) 17. doi: 10.3389/fnhum.2023.1194714
86. Garber BB, Chen J, Beliveau A, Farwell DG, Bewley AF, Birkeland AC, et al. Using the Patient Health Questionnaire-2 to improve depression screening in head and neck cancer patients. Am J Otolaryngol. (2023) 44:103724. doi: 10.1016/j.amjoto.2022.103724
87. Hlynsson JI and Carlbring P. Diagnostic accuracy and clinical utility of the PHQ-2 and GAD-2: a comparison with long-format measures for depression and anxiety. Front Psychol. (2024) 15. doi: 10.3389/fpsyg.2024.1259997
88. Zhou J, Radojčić MR, Ashton-James CE, Yang H, Chen Z, Wang R, et al. Optimal cut-offs of depression screening tools during the COVID-19 pandemic: a systematic review. BMC Psychiatry. (2023) 23:953. doi: 10.1186/s12888-023-05455-8
89. Kondo K, Antick JR, Ayers CK, Kansagara D, and Chopra P. Depression screening tools for patients with kidney failure. Clin J Am Soc Nephrology. (2020) 15:1785–95. doi: 10.2215/CJN.05540420
Keywords: depression, healthcare professionals, validation, test, PHQ-2 screening
Citation: Caldichoury N, Morales-Asencio B, Coronado J-C, Castellanos-Alvarenga LM, Quispe-Ayala C, Saldías C, Salazar D, Ripoll-Córdoba D, Bada W, Martínez J, Duhalde-Sanhueza R, Castellanos C, Flórez Y, Quincho-Apumayta R, Ardila-Duarte C, Patiño-Rivera AR, Gargiulo PA, Cárdenas J and López N (2025) Reliability and convergent validity of the PHQ-2 for the rapid detection of depressive symptoms in healthcare professionals in Argentina. Front. Psychiatry 16:1652072. doi: 10.3389/fpsyt.2025.1652072
Received: 23 June 2025; Accepted: 12 November 2025; Revised: 05 November 2025;
Published: 28 November 2025.
Edited by:
Marcin Siwek, Jagiellonian University, PolandReviewed by:
Lorenz B. Dehn, University Hospital OWL of Bielefeld University, GermanySri Idaiani, National Research and Innovation Agency (BRIN), Indonesia
Asmare Belete, Wollo University, Ethiopia
Copyright © 2025 Caldichoury, Morales-Asencio, Coronado, Castellanos-Alvarenga, Quispe-Ayala, Saldías, Salazar, Ripoll-Córdoba, Bada, Martínez, Duhalde-Sanhueza, Castellanos, Flórez, Quincho-Apumayta, Ardila-Duarte, Patiño-Rivera, Gargiulo, Cárdenas and López. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Norman López, bm9ybWFubG9wZXp2ZUBzYW50b3RvbWFzLmNs