Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Psychol., 29 July 2025

Sec. Quantitative Psychology and Measurement

Volume 16 - 2025 | https://doi.org/10.3389/fpsyg.2025.1592614

Psychometric evaluation and updated community norms of the WHO-5 well-being index, based on a representative German sample

  • 1Department of Social Welfare, Ernst-Abbe University of Applied Sciences Jena, Jena, Germany
  • 2Institute of Delinquency and Crime Prevention, Zurich University of Applied Sciences, Zurich, Switzerland
  • 3University of Zurich, Zürich, Switzerland
  • 4Clinic for Child and Adolescent Psychiatry/Psychotherapy, Ulm University, Ulm, Germany
  • 5Department of Psychosomatic Medicine and Psychotherapy, Justus Liebig University Giessen, Giessen, Germany
  • 6Department of Psychosomatic Medicine and Psychotherapy, University Medical Center of the Johannes Gutenberg University Mainz, Mainz, Germany
  • 7Department of Medical Psychology and Sociology, Leipzig University, Leipzig, Germany

Introduction: The World Health Organization-Five Well-Being Index (WHO-5) is a widely used self-report measure for evaluating mental well-being in both general and clinical populations. This study examines the psychometric properties of the WHO-5 using a large, representative sample of the adult population in Germany (N = 2,515) and presents updated population norms.

Methods: Analyses included item-level statistics such as means, standard deviations, and inter-item correlations. Construct validity was evaluated through correlations with measures of depression (PHQ-2), anxiety (GAD-2), somatic symptoms (SSS-8), and loneliness (UCLA Loneliness Scale). Internal consistency was measured using coefficient omega, while factorial validity was tested through confirmatory factor analysis based on a one-factor model. Measurement invariance was assessed across gender and age groups using multi-group confirmatory factor analyses. Population norms are reported for the total sample and various age groups.

Results: The findings confirm the strong psychometric properties of the WHO-5, including its internal consistency and construct validity. Measurement invariance results support comparability of scores across gender and age. The updated norms offer.

Discussion: These updated norms support the continued implementation of the WHO-5 as a practical tool for population-based prevention and mental health care planning.

1 Introduction

The World Health Organization-Five Well-Being Index (WHO-5) is an internationally recognized instrument for assessing subjective psychological well-being. Initially introduced in the late 1990s, the WHO-5 has gained widespread use in both research and clinical practice due to its brevity and generic nature, which facilitate its application across diverse populations (see Topp et al., 2015). The WHO-5 was derived from the longer WHO-10, itself an abridged version of a 28-item instrument developed during a multicenter European study (Topp et al., 2015). Due to its generic structure, the WHO-5 is suitable for comparisons between general population norms and clinical populations, irrespective of specific diagnoses. It is thus widely used as a benchmark for monitoring remission or assessing well-being in various contexts [e.g., older adults (Allgaier et al., 2013), individuals with diabetes (George and Randhawa, 2024; Sommer et al., 2024), stroke survivors (Damsbo et al., 2020), cancer patients (Aerts et al., 2014), individuals with alcohol use disorders (Elholm et al., 2011), individuals with schizophrenia (Fekih-Romdhane et al., 2024), individuals with sleep disturbances (Hartwig et al., 2019), individuals with multiple sclerosis (Andreasen et al., 2010), individuals with personality disorders (Lara-Cabrera et al., 2020), and individuals with bereavement (Reitsma et al., 2024); see Domenech et al., 2025 for an overview]. During the COVID-19 pandemic, the WHO-5 was widely used in cross-national studies (e.g., Gallemit et al., 2024; Lara-Cabrera et al., 2022), and even in settings affected by geopolitical crises (e.g., Alnaser et al., 2025). Validation studies in various languages, including Chinese, German, Malay, Swedish, Turkish, Azerbaijani, and Arabic, have consistently shown high internal consistencies (e.g., Aliyev et al., 2024; Brähler et al., 2007; Kassab Alshayea, 2023; Khosravi et al., 2015; Eser et al., 2019; Faruk et al., 2021; Fung et al., 2022; Löve et al., 2014; Suhaimi et al., 2022) ranging from α = 0.75 (Bangla version; Faruk et al., 2021) to α = 0.94 (Persian sample; Khosravi et al., 2015). Accordingly, confirmatory factor analyses (CFA) conducted across diverse populations consistently support the WHO-5’s unidimensional structure (e.g., Guðmundsdóttir et al., 2014; Perera et al., 2020; Cosma et al., 2022; Sischka et al., 2025). Measurement invariance across cultures has been extensively tested, albeit with mixed findings (e.g., Sischka et al., 2020; Caycho-Rodríguez et al., 2023; Jami and Kemmelmeier, 2020). Some studies demonstrated invariance across gender (e.g., Fekih-Romdhane et al., 2023; Perera et al., 2020; Yang et al., 2023) and age groups (e.g., Yang et al., 2023; Cosma et al., 2022). However, systematic investigations of measurement invariance across clinical and non-clinical samples remain scarce (e.g., Cosma et al., 2022; Eser et al., 2019), highlighting the need for further research in this area. The construct validity of the WHO-5 has been robustly supported through substantial correlations with a wide range of conceptually related constructs. Strong negative associations have been documented with depressive symptoms (e.g., de Wit et al., 2007; Gallemit et al., 2024; Möller-Leimkühler et al., 2007), generalized anxiety (e.g., Gallemit et al., 2024; Pheko et al., 2023; Fekih-Romdhane et al., 2024), somatic complaints (e.g., Brähler et al., 2007; Cosma et al., 2022; Sischka et al., 2025), and loneliness (e.g., Sischka et al., 2025; Pheko et al., 2023). Positive correlations with quality of life, overall life satisfaction, and other measures of well-being (e.g., Clarke et al., 2011; Lambert et al., 2014) further support the construct validity of the WHO-5. Although the WHO-5 has been extensively validated, studies involving representative samples of the general population remain limited. Existing German norms, based on data collected in 2004, are now outdated (Brähler et al., 2007). Profound societal and health-related changes have created a pressing need for updated benchmarks for the general population. Recent normative data from countries like Denmark (Moeller et al., 2023) highlight the importance of deriving updated reference values from large, representative community samples. Accordignly, this study aims to address this gap by evaluating the psychometric properties of the WHO-5 and providing updated norms based on a representative sample of the German population.

2 Methods and materials

2.1 Procedure

The WHO-5 was administered as part of a comprehensive survey conducted by Leipzig University between June and October 2021. Data collection was managed by USUMA Markt- und Sozialforschung, an independent research institute specializing in social and market research (e.g., Kliem et al., 2015; Kliem et al., 2016). The survey had three main objectives: (1) to estimate the prevalence of physical and mental health conditions and associated risk behaviors, (2) to explore contributing factors to these conditions, and (3) to validate psychological instruments and update German population norms.

The survey included two components. First, interviewers gathered demographic and household information through structured interviews aligned with the standards of the German Federal Statistical Office (Statistisches Bundesamt). Second, participants independently completed paper-based questionnaires in the interviewer’s presence but without their direct involvement. Interviewers remained available to address any questions. All participants provided informed consent, and those under 18 years required parental consent. Confidentiality and data protection measures were communicated in detail to all participants. The study adhered to the Declaration of Helsinki (World Medical Association, 2024) and was approved by the Ethics Committee of the Medical Faculty at Leipzig University (AZ: 298/21-ek).

2.2 Sample description

The survey utilized the ADM (Arbeitskreis Deutscher Markt- und Sozialforschungsinstitute) sampling system to generate a representative sample of the German general population. Sampling involved three stages: (1) regional stratification to identify 258 sampling points across Germany, (2) random selection of 5,676 households using a random-route procedure, and (3) selection of target individuals within households using a Kish grid (Kish, 1949). Of the K = 5,908 households contacted, N = 2,515 completed the survey, resulting in a response rate of 42.6%. Data from N = 2,515 participants (51.6% female) were analyzed. Detailed sample characteristics are provided in Table 1, and an overview of the sampling procedure is shown in Figure 1. Additional descriptive statistics for the sample, stratified by the same groups used in the measurement invariance analyses and by normative comparison groups (see below), are available in the Supplemental material (see Supplementary Tables S1–S3).

Table 1
www.frontiersin.org

Table 1. Demographic characteristics of the study sample.

Figure 1
The survey began with 5676 target households. After excluding vacant dwellings and ineligible populations, the initial sample consisted of 5934 individuals. Losses occurred due to non-contact, refusals, travel, illness and interview denials. A total of 2526 interviews were conducted. The final analytical sample includes 2515 valid cases.

Figure 1. Flowchart of sampling procedure and reasons for nonparticipation.

2.3 Measures

This survey was designed to serve various epidemiological research purposes; however, only the measures relevant to the validation process are discussed in this paper. Alongside comprehensive demographic data (see Table 1), information on health-related behaviors, including the number of sick days, doctor visits, and hospital admissions, was collected. The following instruments were utilized to validate the scales examined in this study.

2.3.1 The World Health Organization-five well-being index

The World Health Organization-Five Well-Being Index (WHO-5) (Bech et al., 2003; German version: Brähler et al., 2007) consists of five positively phrased items assessing well-being over the past 14 days. Participants rate each statement on a scale from 0 (at no time) to 5 (all of the time), yielding a total score range of 0–25. Scores are conventionally converted to a percentage scale ranging from 0 (lowest well-being) to 100 (highest well-being) by multiplying the raw score by four (Bech et al., 2003).

2.3.2 The patient health questionnaire-4

The patient health questionnaire-4 (PHQ-4) (Löwe et al., 2010; Wicke et al., 2022) assess symptoms of depression (PHQ-2) and generalized anxiety disorder (GAD-2) on subscales of two items each. Each item is rated on a four-point scale (0 = not at all to 3 = nearly every day), resulting in subscale scores ranging from 0 to 6. Higher scores indicate greater symptom severity. Internal consistency for the PHQ-2, and GAD-2 in this study was acceptable to good (PHQ-2: α = 0.82, ω = 0.82; GAD-2: α = 0.76, ω = 0.77, respectively).

2.3.3 UCLA loneliness scale (short form)

Loneliness was measured using a three-item version of the UCLA Loneliness Scale (Hughes et al., 2004; German version: Klein et al., 2021). Participants rated items on a five-point Likert scale (0 = never to 4 = very often), with higher scores indicating greater loneliness. The total score ranges from 0 to 12. The German version demonstrated very good reliability in this study (α = 0.90, ω = 0.91).

2.3.4 Somatic symptom scale

The Somatic symptom scale (SSS-8) (Gierk et al., 2014) measures somatic symptom burden based on eight items, each rated on a five-point scale (0 = not at all to 4 = very strongly). Participants reported how much they had been affected by specific complaints over the past 7 days, with higher scores indicating higher burden. Internal consistency in the present sample was α = 0.86, ω = 0.90.

2.3.5 European health interview survey quality of life 8-item index

The European health interview survey quality of life 8-item index (EUROHIS-QOL) (Schmidt et al., 2006; German version: Brähler et al., 2007; Hettich et al., 2022) is a quality of life measure consisting of eight items (overall QoL, general health, energy, daily living activity, self-esteem, social relationships, finances, and home). However, conceptually the psychological, physical, social and environmental domains are each represented by two items. All answer scales have a 5-point response format on a Likert scale, ranging from “not at all” to “completely.” The overall QOL score is formed by a simple summation of scores on the eight items, with higher scores indicating better QOL. Each item is answered individually on a five-point scale. Internal consistency in the sample at hand was α = 0.90, ω = 0.96.

2.4 Statistical analysis

2.4.1 Missing data

The proportion of missing responses for the WHO-5 items ranged from 0.08 to 0.20%. To address missing data, multiple imputation via chained equations was applied, following the procedure described in van Buuren and Groothuis-Oudshoorn (2011). Sociodemographic variables and all scales relevant for assessing construct validity were included in the imputation model to predict missing values. Predictive mean matching was utilized to ensure plausible item values by selecting observed values closest to the predicted values (ŷ). The imputations were carried out using the “mice” package in R (van Buuren and Groothuis-Oudshoorn, 2011). All subsequent analyses were conducted on one imputed dataset.

2.4.2 Item characteristics

Means (M), standard deviations (SD), kurtosis (Kurt), and skewness (Skew) for all WHO-5 items were calculated for the entire sample and for subgroups based on gender. Additionally, inter-item correlations (r) were calculated providing insight into the degree of association between items, indicating whether items measure similar aspects of the underlying construct. Furthermore, item-test correlations (rit) were calculated to assess the extent to which each item correlates with the sum of the remaining items. For this purpose, the item under consideration was excluded from the total score to avoid artificial inflation of the correlation coefficient and to obtain a corrected item-total correlation. Cronbach’s alpha if one item deleted (α−1) was calculated to examine the potential impact of each item on overall scale reliability; decreases in alpha following item deletion indicate that the item contributes positively to internal consistency, whereas increases suggest that the item may not align well with the scale’s overall structure. Lastly, we calculated item (Pi) and total score difficulty (Pt) by transforming raw scores to a standardized scale ranging from 0 to 100. This linear transformation was based on the minimum and maximum possible scores per item, allowing for comparability across items. Group differences in item means were assessed using Cohen’s d to quantify effect sizes.

2.4.3 Construct validity

Construct validity was assessed by examining correlations between the WHO-5 and scales measuring depression (PHQ-2), anxiety (GAD-2), loneliness (UCLA Loneliness Scale-3), somatization (SSS-8), and quality of life (EROHIS-QOL-8). The following hypotheses guided the analysis: higher well-being would be associated with (a) lower scores on depression, (b) lower scores on anxiety, and (c) lower scores on somatization. Additionally, it was hypothesized that well-being would positively correlate with quality of life. In addition to the correlation analyses, we estimated a structural equation model (SEM) to examine the construct validity of the WHO-5 using latent variables. The model included the WHO-5 as a latent factor and its associations with conceptually related constructs.

2.4.4 Internal consistency

To provide a robust measure of internal consistency, McDonald’s ω was calculated using the “semTools” package in R (Jorgensen et al., 2021). This approach complements the use of Cronbach’s α, which can be sensitive to certain assumptions (McNeish, 2018).

2.4.5 Factorial validity and measurement invariance

To confirm the unidimensional structure of the WHO-5, confirmatory factor analyses (CFA) were conducted using the “lavaan” package in R (Rosseel, 2019). The analyses employed weighted least square means and variance adjusted (WLSMV) estimation, which is suitable for categorical data. To evaluate goodness-of-fit of the relevant model, the following four criteria were considered. While the RMSEA and the 90% confidence interval both assess absolute model fit, the two additionally calculated criteria (Comparative Fit Index [CFI] and Tucker Lewis Index [TLI]) measure relative model fit compared to the “null” model. RMSEA values < 0.050 represent a close fit, values between 0.050 and 0.080 represent a reasonably close fit, and values > 0.080 represent an unacceptable model (Hu and Bentler, 1999). Regarding CFI and TLI, Hu and Bentler (1999) suggested a CFI and TLI > 0.900 for an adequate fit and a CFI and TLI > 0.950 for a good model fit. In addition, the Standardized Root Mean Square Residual (SRMR) was considered as an indicator of the average standardized residuals between observed and predicted covariances; SRMR values < 0.080 are generally interpreted as indicative of good fit (Hu and Bentler, 1999). Measurement invariance (MI) was assessed using multiple group confirmatory factor analysis (MGCFA), adhering to the procedures recommended by Wu and Estabrook (2016). The theta parameterization was applied, with models identified by fixing latent factor means and variances to 0 and 1, setting item intercepts to 0, and constraining residual variances to 1.

Five nested models were tested: (i) configural invariance (unconstrained except for identification constraints), (ii) threshold invariance (equal thresholds across groups), (iii) weak invariance (equal factor loadings), (iv) strong invariance (equal intercepts), and (v) full invariance (equal residual variances). The parameter constraints for each model are visualized in Supplementary Figure S1 and described in detail in Supplementary Table S4. Cut-off criteria by Chen (2007) were applied, with CFI changes of < −0.01 and RMSEA changes of ≥0.015 signaling lack of invariance. MGCFA analyses were conducted across gender, age (below and above median age), and combined age-gender groups. Non-binary cases were excluded from the gender-based analyses due to their small numbers. In addition to age and sex, we also examined measurement invariance across subgroups defined by anxiety symptom severity (GAD-2), depression symptom severity (Löwe et al., 2004) and somatic symptom burden (SSS-8), as differential response behavior in these groups may affect comparability of scores. For subgroup classification, we applied established cut-off scores [GAD-2: ≥3.0 (Löwe et al., 2010); PHQ-2 (Löwe et al., 2010); ≥3.0, and SSS-8 (Gierk et al., 2014): ≥8.0] for each scale to distinguish between individuals with clinically relevant versus non-clinically relevant symptom levels.

3 Results

3.1 Item characteristics

Supplementary Table S5 provides a detailed overview of totel score and item-level statistics. The WHO-5 yielded a mean score of M = 16.89 (SD = 5.74), with a negatively skewed distribution (Skew = −0.90) and near-normal kurtosis (Kurt = 0.08). Difficulty for the total score was Pt = 67.56, indicating a generally positive self-reported well-being in the sample. On item level mean scores ranged from M = 3.36 (#4) to 3.55 (#2), with moderate variability (SD = 1.14 to 1.32), indicating generally high levels of well-being within the sample. All items showed negative skewness (Skew = −1.06 to −0.75), suggesting a response pattern skewed toward higher endorsements. Item difficulty ranged between Pi = 67.25 (#4) and p = 71.08 (#2), reflecting moderate item difficulty. Corrected item-total correlations (rit) were consistently high (rit = 0.83 to 0.87), indicating strong discrimination and conceptual alignment of the items with the overall construct. Cronbach’s alpha remained stable across all deletion scenarios (α−1 = 0.93 to 0.94), demonstrating that each item contributes positively to the internal consistency of the scale. Supplementary Table S6 provides a detailed overview of the inter-item correlations for the WHO-5 ranging from r = 0.84 (95% CI [0.83, 0.85]) to r = 0.76 (95% CI [0.74, 0.77]). Furthermore, Supplementary Table S7 reports M and SD for gender subgroups. Male participants generally reported higher mean well-being scores and exhibited lower variability on most items. Effect sizes for gender differences were small, with Cohen’s d ranging from ES = 0.05 (#1; 95% CI [0.03, 0.13]) to 0.13 (#4; 95% CI [−0.05, 0.20]).

3.2 Construct validity

Correlations between the WHO-5 and related scales supported its construct validity. As hypothesized, the WHO-5 showed strong negative correlations with PHQ-2 [r(2,413) = −0.64, p < 0.001], GAD-2 [r(2,413) = −0.53, p < 0.001], UCLA Loneliness Scale-3 [r(2,413) = −0.54, p < 0.001], and SSS-8 [r(2,413) = −0.59, p < 0.001]. A strong positive correlation was observed with EROHIS-QOL-8 [r(2,413) = 0.68, p < 0.001]. A comprehensive correlation table is provided in Supplementary Table S8. In Supplementary Table S9, the correlations between individual WHO-5 items and theoretically relevant constructs are reported, providing evidence for construct-related validity on item level. The SEM results showed the expected pattern of correlations, with strong negative associations with depression and anxiety, and moderate associations with the other health indicators. Detailed parameter estimates are reported in Supplementary Table S10.

3.3 Population norms

Percentile ranks of WHO-5 scores for the overall sample are presented in Table 2. Additional norms stratified by gender and age are available in Supplementary Tables S11, S12.

Table 2
www.frontiersin.org

Table 2. Population based norms (cumulative percentiles) of the WHO-5 scores (total sample).

3.4 Internal consistency

The internal consistency of the WHO-5 was excellent, with both Cronbach’s α and McDonald’s ω yielding values of 0.95 for the full sample.

3.5 Factorial validity

CFA results provided evidence of unidimensionality for the WHO-5. Fit indices indicated good model fit with robust CFI = 0.977, robust TLI = 0.954, and SRMR = 0.016. Despite the elevated RMSEA value, the overall fit indices, including the robust CFI, TLI, and SRMR, provide compelling evidence supporting a well-fitting unidimensional model of the WHO-5. The robust RMSEA was calculated at 0.17 (90% CI [0.153, 0.186]), which may be attributed to its sensitivity to model simplicity (see discussion). Considering the strong fit indicated by the other indices, this discrepancy is unlikely to compromise the validity of the model. Standardized factor loadings ranged from 0.89 to 0.94, reinforcing the scale’s unidimensional structure (see Supplementary Table S5). Path diagrams are included in Supplementary Figures S2, S3.

3.6 Measurement invariance

Measurement invariance analyses yielded satisfactory fit indices across all steps and groups. The fit statistics are presented in Supplementary Table S13, supporting the comparability of WHO-5 scores across gender and age groups. With regard to anxiety, depression, and somatic symptom severity, the analyses indicated largely consistent model fit across configural, threshold, metric, and scalar levels. These findings support the robustness of the WHO-5 across varying levels of psychological and somatic symptomatology and are presented in Supplementary Table S14.

4 Discussion

This study evaluated the psychometric properties of the WHO-5 in a large, representative sample of the German general population. The results demonstrated high internal consistency, as evidenced by McDonald’s ω. Despite the high RMSEA, the overall fit indices (robust CFI, TLI, SRMR) provide strong evidence for a well-fitting unidimensional model of the WHO-5. It has been shown that RMSEA has significant issues with simpler models that have few degrees of freedom. This is particularly relevant for simple path models and CFAs, which often have relatively few df. In such cases, RMSEA can incorrectly indicate poor model fit, even when the model fits the data well (Kenny et al., 2015). The problem arises from the construct of RMSEA as an absolute fit index that incorporates model complexity (Hu and Bentler, 1999). To account for simplicity, RMSEA applies a penalty for fewer df. This penalty can cause models with few df to exhibit poor RMSEA values, even when they fit the data well. Kenny et al. (2015) demonstrated through simulations that models with few df might even show poor RMSEA values despite a non-significant chi-square test (indicating no significant discrepancy between the model and the data). This conclusion is further supported by the finding that in the more constrained models of the MGCFA, which are characterized by a greater number of df, the RMSEA values consistently indicate an acceptable model fit. This suggests that the inclusion of additional constraints and the resulting increase in degrees of freedom may mitigate the sensitivity of RMSEA, allowing it to reflect a more accurate assessment of model fit under these conditions. Correspondingly, the measurement invariance analyses confirmed comparable factor structures across gender and age subgroups, supporting the use of WHO-5 for comparisons across these demographics. Furthermore, measurement invariance analyses demonstrated that the WHO-5 functioned equivalently across individuals with and without elevated anxiety, depression, and somatic symptoms. Full scalar and strict invariance were established for all three constructs (GAD-2, PHQ-2, and SSS-8), indicating that comparisons of latent well-being scores across symptom groups are psychometrically valid. Additionally, correlations with related constructs, including depression, anxiety, loneliness, somatization, and quality of life, provided strong evidence for construct validity. These findings align with prior normative studies—such as Moeller et al. (2023) and affirm the WHO-5’s utility as a reliable and valid measure of well-being in general population settings. Moeller et al. (2023) found a slightly lower mean well-being in a comparable sample of the Danish general population (Mpercentage-scale = 63.9, SDpercentage-scale = 22.0 compared to Mpercentage-scale = 67.56, SDpercentage-scale = 22.96 in the study at hand). Differences may seem unexpected, as Denmark is consistently ranked among the world’s “happiest” countries. A possible explanation for the higher well-being scores in the German sample compared to the Danish one lies in methodological and contextual differences between the studies. A key distinction is the timing of data collection. The Danish data were gathered at the onset of the COVID-19 pandemic, a period marked by high uncertainty, social isolation, and economic concerns, which likely had a negative impact on subjective well-being. In contrast, the German data were collected in June 2021, when the pandemic situation had significantly improved. By that time, infection rates had declined following a strict winter lockdown, widespread restrictions were lifted, and the vaccination campaign was well advanced, contributing to a greater sense of security in the population. Compared to norm data from the German general population collected in 2004 (Brähler et al., 2007), the present study found significantly higher levels of well-being across all subsamples. This increase in well-being among the German population is likely attributable to a combination of long-term societal improvements and greater awareness of mental health. Between 2004 and 2021, Germany has experienced economic stability and growth, characterized by low unemployment rates (decline in unemployment from 10.5% in 2004 to 5.7% in 2021), improved healthcare access (e.g., increased coverage for psychotherapy, and broader access to preventive healthcare services), and an expansion of social security systems (e.g., introduction of the statutory minimum wage in 2015, strengthening of parental benefits through “Elterngeld” and “ElterngeldPlus”), all of which are key determinants of subjective well-being. Additionally, mental health awareness has increased, fostering greater acceptance of psychological support, improved access to therapy, and a broader adoption of self-care practices. These developments may have contributed to more effective coping mechanisms, enabling individuals to better manage stress and maintain higher well-being levels. Given these shifts, the findings underscore the importance of regularly updating well-being norms to reflect societal changes.

4.1 Limitations

While this study utilized data from a large representative sample, certain limitations should be acknowledged. The response rate, at 42.6%, is consistent with other general population studies (e.g., Kliem et al., 2024; Kliem et al., 2018) but raises concerns about potential non-response bias. Efforts were made to ensure representativeness. However, the lack of demographic information for non-responders limits the ability to assess the extent of such bias. Access to registry-based data, which requires governmental authorization in Germany, could mitigate this issue in future research. While the presented norms and psychometric properties can serve as valuable reference data for epidemiological and clinical research, the generalizability to clinical samples is limited. Nevertheless, population-based norms are essential for calculating metrics such as the Reliable Change Index (RCI; Jacobson and Truax, 1991) in clinical samples. Thus, despite being derived from a non-clinical population, these norms have direct clinical relevance by enabling the evaluation of meaningful individual-level changes in treatment contexts. Second, the study relies entirely on self-report data. Without external validation or objective behavioral or clinical measures, it cannot be ruled out that generalized negative self-evaluation might drive the observed association patterns (e.g., between well-being and depressive symptoms). Third, the data were collected exclusively within the German general population. While this ensures national representativeness, it may limit the generalizability of the findings to countries with different or more heterogeneous cultural norms and attitudes. However, given the extensive body of validation studies across diverse cultural contexts, it is reasonable to assume that the WHO-5 also functions reliably in other populations. Nevertheless, country-specific normative data are still needed, as cut-off values and score distributions may vary substantially. Moreover, it remains unclear how the COVID-19 pandemic has affected normative values in other countries, particularly in light of varying public health responses and demographic differences. Some countries implemented less restrictive infection control measures or were less severely impacted due to younger average population age structures - whereas Germany, as an aging society, may have experienced more pronounced effects on mental well-being. This underscores the importance of regularly updating national norms to account for such contextual and demographic variability. Fourth, the cross-sectional design of the study restricts conclusions regarding the predictive validity, test–retest reliability, and longitudinal measurement invariance of the German version of the WHO-5. To establish the temporal stability and predictive utility of the instrument, future research should adopt longitudinal approaches. Despite these limitations, the updated German WHO-5 norms provide valuable reference values for public mental health monitoring and screening purposes. Given its brevity, ease of administration, and strong psychometric properties, the WHO-5 is particularly well-suited for routine use in large-scale surveys, digital applications, and preventive healthcare. These updated norms therefore support the continued implementation of the WHO-5 as a practical tool for population-based prevention and mental health care planning.

Data availability statement

The datasets presented in this article are not readily available because the datasets generated and/or analyzed during the current study are not publicly available as ethics board approval did not include open data sharing. Questions concerning the data should be addressed to E. Brähler. Requests to access the datasets should be directed to RWxtYXIuQnJhZWhsZXJAbWVkaXppbi51bmktbGVpcHppZy5kZQ==.

Ethics statement

The studies involving humans were approved by Ethics Committee of the Medical Faculty at Leipzig University (AZ: 298/21-ek). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.

Author contributions

SK: Conceptualization, Formal analysis, Writing – original draft, Writing – review & editing. AL: Formal analysis, Writing – original draft, Writing – review & editing. SF: Writing – review & editing. DB: Writing – review & editing. VC: Writing – review & editing. CS: Writing – review & editing. HK: Writing – review & editing. EB: Writing – review & editing, Data curation, Project administration. JF: Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. The research did not receive specific funding, but was performed as part of the employment of the authors. SK, AL, and SF Ernst-Abbe Hochschule - University of Applied Sciences Jena, CS and JF University Clinic Ulm, DB Zurich University of Applied Sciences, HK and EB University Mainz. The funders were neither involved in manuscript writing, editing, approval, or decision to publish. Open access funding by Zurich University of Applied Sciences (ZHAW).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Generative AI statement

The authors declare that Gen AI was used in the creation of this manuscript. The authors utilized ChatGPT to enhance the linguistic quality of the manuscript, as they are not native English speakers. Following this process, the authors carefully reviewed and revised the text to ensure accuracy and clarity, taking full responsibility for the final content of the publication.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2025.1592614/full#supplementary-material

References

Aerts, L., Christiaens, M. R., Enzlin, P., Neven, P., and Amant, F. (2014). Sexual functioning in women after mastectomy versus breast conserving therapy for early-stage breast cancer: a prospective controlled study. Breast 23, 629–636. doi: 10.1016/j.breast.2014.06.012

PubMed Abstract | Crossref Full Text | Google Scholar

Aliyev, B., Rustamov, E., Satici, S. A., and Zalova Nuriyeva, U. (2024). Azerbaijani adaptation of the WHO-5 wellbeing index: investigating its relationship with psychological distress, resilience, and life satisfaction. BMC Psychol. 12:100. doi: 10.1186/s40359-024-01593-0

PubMed Abstract | Crossref Full Text | Google Scholar

Allgaier, A. K., Kramer, D., Saravo, B., Mergl, R., Fejtkova, S., and Hegerl, U. (2013). Beside the geriatric depression scale: the WHO-five well-being index as a valid screening tool for depression in nursing homes. Int. J. Geriatr. Psychiatry 28, 1197–1204. doi: 10.1002/gps.3944

PubMed Abstract | Crossref Full Text | Google Scholar

Alnaser, M. Z., Alhamad, H., Alotaibi, N., Alhamdan, N., and Manee, F. (2025). Beyond borders: investigating the impact of the 2023 conflict in Gaza on generalized anxiety disorders and associated somatic symptoms and wellbeing among the Kuwait population: a global call for public intervention programs. Front. Public Health 13:1407906. doi: 10.3389/fpubh.2025.1407906

PubMed Abstract | Crossref Full Text | Google Scholar

Andreasen, A. K., Spliid, P. E., Andersen, H., and Jakobsen, J. (2010). Fatigue and processing speed are related in multiple sclerosis. Eur. J. Neurol. 17, 212–218. doi: 10.1111/j.1468-1331.2009.02776.x

PubMed Abstract | Crossref Full Text | Google Scholar

Bech, P., Olsen, L. R., Kjoller, M., and Rasmussen, N. K. (2003). Measuring well-being rather than the absence of distress symptoms: a comparison of the SF-36 mental health subscale and the WHO-five well-being scale. Int. J. Methods Psychiatr. Res. 12, 85–91. doi: 10.1002/mpr.145

PubMed Abstract | Crossref Full Text | Google Scholar

Brähler, E., Mühlan, H., Albani, C., and Schmidt, S. (2007). Teststatistische Prüfung und Normierung der deutschen Versionen des EUROHIS-QOL Lebensqualität-Index und des WHO-5 Wohlbefindens-Index. Diagnostica 53, 83–96. doi: 10.1026/0012-1924.53.2.83

Crossref Full Text | Google Scholar

Caycho-Rodríguez, T., Vilca, L. W., Valencia, P. D., Carbajal-León, C., Reyes-Bossio, M., White, M., et al. (2023). Is the meaning of subjective well-being similar in Latin American countries? A cross-cultural measurement invariance study of the WHO-5 well-being index during the COVID-19 pandemic. BMC Psychol. 11:102. doi: 10.1186/s40359-023-01149-8

PubMed Abstract | Crossref Full Text | Google Scholar

Chen, F. F. (2007). Sensitivity of goodness of fit indexes to lack of measurement invariance. Struct. Equ. Model. 14, 464–504. doi: 10.1080/10705510701301834

Crossref Full Text | Google Scholar

Clarke, A., Friede, T., Putz, R., Ashdown, J., Martin, S., Blake, A., et al. (2011). Warwick–Edinburgh mental well-being scale (WEMWBS): validated for teenage school students in England and Scotland. A mixed methods assessment. BMC Public Health 11:487. doi: 10.1186/1471-2458-11-487

PubMed Abstract | Crossref Full Text | Google Scholar

Cosma, A., Költő, A., Chzhen, Y., Kleszczewska, D., Kalman, M., and Martin, G. (2022). Measurement invariance of the WHO-5 well-being index: evidence from 15 European countries. Int. J. Environ. Res. Public Health 19:9798. doi: 10.3390/ijerph19169798

PubMed Abstract | Crossref Full Text | Google Scholar

Damsbo, A. G., Kraglund, K. L., Buttenschøn, H. N., Johnsen, S. P., Andersen, G., and Mortensen, J. K. (2020). Predictors for wellbeing and characteristics of mental health after stroke. J. Affect. Disord. 264, 358–364. doi: 10.1016/j.jad.2019.12.032

PubMed Abstract | Crossref Full Text | Google Scholar

de Wit, M., Pouwer, F., Gemke, R. J. B. J., Delemarre-van de Waal, H. A., and Snoek, F. J. (2007). Validation of the WHO-5 well-being index in adolescents with type 1 diabetes. Diabetes Care 30, 2003–2006. doi: 10.2337/dc07-0447

PubMed Abstract | Crossref Full Text | Google Scholar

Domenech, A., Kasujee, I., Koscielny, V., and Griffiths, C. E. M. (2025). Systematic review of the use of the WHO-5 well-being index across different disease areas. Adv. Ther. 42, 1–15. doi: 10.1007/s12325-025-03266-9

PubMed Abstract | Crossref Full Text | Google Scholar

Elholm, B., Larsen, K., Hornnes, N., Zierau, F., and Becker, U. (2011). Alcohol withdrawal syndrome: symptom-triggered versus fixed-schedule treatment in an outpatient setting. Alcohol Alcohol. 46, 318–323. doi: 10.1093/alcalc/agr020

PubMed Abstract | Crossref Full Text | Google Scholar

Eser, E., Çevik, C., Baydur, H., Güneş, S., Esgin, T. A., Öztekin, Ç. S., et al. (2019). Reliability and validity of the Turkish version of the WHO-5, in adults and older adults for its use in primary care settings. Prim. Health Care Res. Dev. 20:e100. doi: 10.1017/S1463423619000343

PubMed Abstract | Crossref Full Text | Google Scholar

Faruk, M. O., Alam, F., Chowdhury, K. U. A., and Soron, T. R. (2021). Validation of the Bangla WHO-5 well-being index. Glob. Ment. Health 8:e26. doi: 10.1017/gmh.2021.26

PubMed Abstract | Crossref Full Text | Google Scholar

Fekih-Romdhane, F., Al Mouzakzak, F., Abilmona, G., Dahdouh, O., and Hallit, S. (2024). Validation and optimal cut-off score of the World Health Organization well-being index (WHO-5) as a screening tool for depression among patients with schizophrenia. BMC Psychiatry 24:391. doi: 10.1186/s12888-024-05814-z

PubMed Abstract | Crossref Full Text | Google Scholar

Fekih-Romdhane, F., Cherif, W., Alhuwailah, A., Fawaz, M., Shuwiekh, H., Helmy, M., et al. (2023). Cross-country validation of the Arabic version of the WHO-5 well-being index in non-clinical young adults from six Arab countries. Res. Square. doi: 10.21203/rs.3.rs-2988215/v1

Crossref Full Text | Google Scholar

Fung, S.-F., Kong, C. Y. W., Liu, Y.-M., Huang, Q., Xiong, Z., Jiang, Z., et al. (2022). Validity and psychometric evaluation of the Chinese version of the 5-item WHO well-being index. Front. Public Health 10:872436. doi: 10.3389/fpubh.2022.872436

PubMed Abstract | Crossref Full Text | Google Scholar

Gallemit, I. M. J. S., Mordeno, I. G., Simon, P. D., and Ferolino, M. A. L. (2024). Assessing the psychometric properties of the World Health Organization -five well-being index (WHO-5) in Filipino samples amid the COVID-19 pandemic. BMC Psychol. 12:580. doi: 10.1186/s40359-024-01941-0

PubMed Abstract | Crossref Full Text | Google Scholar

George, J., and Randhawa, K. (2024). World Health Organization 5 (WHO-5) well-being index and problem areas in diabetes 5 scale (PAID-5): a cross-sectional study in screening tools for depression and anxiety patients with type 1 and type 2 diabetes in Kerala, India. Clin. Diabetol. 13, 164–169. doi: 10.5603/cd.99696

Crossref Full Text | Google Scholar

Gierk, B., Kohlmann, S., Toussaint, A., Wahl, I., Brünahl, C. A., Murray, A. M., et al. (2014). Assessing somatic symptom burden: a psychometric comparison of the patient health questionnaire-15 (PHQ-15) and the somatic symptom scale-8 (SSS-8). J. Psychosom. Res. 78, 352–355. doi: 10.1016/j.jpsychores.2014.11.006

Crossref Full Text | Google Scholar

Guðmundsdóttir, H. B., Olason, D. P., Guðmundsdóttir, D. G., and Sigurðsson, J. F. (2014). A psychometric evaluation of the Icelandic version of the WHO-5. Scand. J. Psychol. 55, 567–572. doi: 10.1111/sjop.12156

PubMed Abstract | Crossref Full Text | Google Scholar

Hartwig, E. M., Rufino, K. A., Palmer, C. A., Shepard, C., Alfano, C. A., Schanzer, B., et al. (2019). Trajectories of self-reported sleep disturbance across inpatient psychiatric treatment predict clinical outcome in comorbid major depressive disorder and generalized anxiety disorder. J. Affect. Disord. 251, 248–255. doi: 10.1016/j.jad.2019.03.069

PubMed Abstract | Crossref Full Text | Google Scholar

Hettich, N., Beutel, M. E., Krakau, L., and Brähler, E. (2022). Quality of life: updated psychometric properties and new norm values in a representative German sample focusing socioeconomics and mental health. Int. J. Public Health 67:1605188. doi: 10.3389/ijph.2022.1605188

Crossref Full Text | Google Scholar

Hughes, M. E., Waite, L. J., Hawkley, L. C., and Cacioppo, J. T. (2004). A short scale for measuring loneliness in large surveys: results from two population-based studies. Res. Aging 26, 655–672. doi: 10.1177/0164027504268574

PubMed Abstract | Crossref Full Text | Google Scholar

Hu, L.-t., and Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure analysis: conventional criteria versus new alternatives. Struct. Equ. Model. 6, 1–55. doi: 10.1080/10705519909540118

Crossref Full Text | Google Scholar

Jacobson, N. S., and Truax, P. (1991). Clinical significance: a statistical approach to defining meaningful change in psychotherapy research. J. Consult. Clin. Psychol. 59, 12–19. doi: 10.1037//0022-006x.59.1.12

PubMed Abstract | Crossref Full Text | Google Scholar

Jami, W. A., and Kemmelmeier, M. (2020). Assessing well-being across space and time: measurement equivalence of the WHO-5 in 36 European countries and over 8 years. J. Well-Being Assess. 4, 419–445. doi: 10.1007/s41543-021-00042-8

Crossref Full Text | Google Scholar

Jorgensen, T. D., Pornprasertmanit, S., Schoemann, A. M., and Rosseel, Y. (2021). SemTools: Useful tools for structural equation modeling. Available online at: https://CRAN.R-project.org/package=semTools.

Google Scholar

Kassab Alshayea, A. (2023). Development and validation of an Arabic version of the World Health Organization well-being index (WHO-5). J. Psychopathol. Behav. Assess. 45, 247–255. doi: 10.1007/s10862-023-10027-x

Crossref Full Text | Google Scholar

Kenny, D. A., Kaniskan, B., and McCoach, D. B. (2015). The performance of RMSEA in models with small degrees of freedom. Sociol. Methods Res., 44, 486–507. doi: 10.1177/0049124114543236

Crossref Full Text | Google Scholar

Khosravi, A., Mousavi, S. A., Chaman, R., Kish, M. S., Ashrafi, E., Khalili, M., et al. (2015). Reliability and validity of the Persian version of the World Health Organization-five well-being index. Shahroud J. Med. Sci. 1, 17–19. doi: 10.22100/ijhs.v1i1.24

Crossref Full Text | Google Scholar

Kish, L. (1949). A procedure for objective respondent selection within the household. J. Am. Stat. Assoc, 44, 380–387. doi: 10.1080/01621459.1949.10483314

Crossref Full Text | Google Scholar

Klein, E. M., Zenger, M., Tibubos, A. N., Ernst, M., Reiner, I., Schmalbach, B., et al. (2021). Loneliness and its relation to mental health in the general population: validation and norm values of a brief measure. J. Affect. Disord. Rep. 4:100120. doi: 10.1016/j.jadr.2021.100120

PubMed Abstract | Crossref Full Text | Google Scholar

Kliem, S., Lohmann, A., Mößle, T., and Brähler, E. (2018). Psychometric properties and measurement invariance of the Beck hopelessness scale (BHS): results from a German representative population sample. BMC Psychiatry 18:110. doi: 10.1186/s12888-018-1646-6

PubMed Abstract | Crossref Full Text | Google Scholar

Kliem, S., Mößle, T., Rehbein, F., Hellmann, D. F., Zenger, M., and Brähler, E. (2015). A brief form of the perceived social support questionnaire (F-SozU) was developed, validated, and standardized. J. Clin. Epidemiol. 68, 551–562. doi: 10.1016/j.jclinepi.2014.11.003

PubMed Abstract | Crossref Full Text | Google Scholar

Kliem, S., Mößle, T., Zenger, M., Strauß, B., Brähler, E., and Hilbert, A. (2016). The eating disorder examination-questionnaire 8: a brief measure of eating disorder psychopathology (EDE-Q8). Int. J. Eat. Disord. 49, 613–616. doi: 10.1002/eat.22487

PubMed Abstract | Crossref Full Text | Google Scholar

Kliem, S., Sachser, C., Lohmann, A., Baier, D., Brähler, E., Gündel, H., et al. (2024). Psychometric evaluation and community norms of the PHQ-9, based on a representative German sample. Front. Psych. 15:1483782. doi: 10.3389/fpsyt.2024.1483782

PubMed Abstract | Crossref Full Text | Google Scholar

Lambert, M., Fleming, T., Ameratunga, S., Robinson, E., Crengle, S., Sheridan, J., et al. (2014). Looking on the bright side: an assessment of factors associated with adolescents’ happiness. Adv. Ment. Health 12, 101–109. doi: 10.1080/18374905.2014.11081888

Crossref Full Text | Google Scholar

Lara-Cabrera, M. L., Betancort, M., Muñoz-Rubilar, A., Rodríguez-Novo, N., Bjerkeset, O., and De las Cuevas, C. (2022). Psychometric properties of the WHO-5 Well-Being Index among nurses during the COVID-19 pandemic: A cross-sectional study in three countries. Int. J. Environ. Res. Public Health, 19, 10106. doi: 10.3390/ijerph191610106

Crossref Full Text | Google Scholar

Lara-Cabrera, M. L., Mundal, I. P., and De Las Cuevas, C. (2020). Patient-reported well-being: psychometric properties of the world health organization well-being index in specialised community mental health settings. Psychiatry Res. 291:113268. doi: 10.1016/j.psychres.2020.113268

PubMed Abstract | Crossref Full Text | Google Scholar

Löve, J., Andersson, L., Moore, C. D., and Hensing, G. (2014). Psychometric analysis of the Swedish translation of the WHO well-being index. Qual. Life Res. 23, 293–297. doi: 10.1007/s11136-013-0447-0

PubMed Abstract | Crossref Full Text | Google Scholar

Löwe, B., Spitzer, R. L., Gräfe, K., Kroenke, K., Quenter, A., Zipfel, S., et al. (2004). Comparative validity of three screening questionnaires for DSM-IV depressive disorders and physicians’ diagnoses. J. Affect. Disord. 78, 131–140. doi: 10.1016/s0165-0327(02)00237-9

PubMed Abstract | Crossref Full Text | Google Scholar

Löwe, B., Wahl, I., Rose, M., Spitzer, C., Glaesmer, H., Wingenfeld, K., et al. (2010). A 4-item measure of depression and anxiety: validation and standardization of the patient health Questionnaire-4 (PHQ-4) in the general population. J. Affect. Disord. 122, 86–95. doi: 10.1016/j.jad.2009.06.019

PubMed Abstract | Crossref Full Text | Google Scholar

McNeish, D. (2018). Thanks coefficient alpha, we’ll take it from here. Psychol. Methods 23, 412–433. doi: 10.1037/met0000144

PubMed Abstract | Crossref Full Text | Google Scholar

Moeller, S. B., Gondan, M., Austin, S. F., Slade, M., and Simonsen, S. (2023). National norms of mental health for Denmark. Nord. J. Psychiatry 77, 617–623. doi: 10.1080/08039488.2023.2202637

PubMed Abstract | Crossref Full Text | Google Scholar

Möller-Leimkühler, A. M., Paulus, N. C., and Heller, J. (2007). Male depression in a population sample of young males. Risk and symptom profiles. Der Nervenarzt, 78, 641–650. doi: 10.1007/s00115-006-2173-0

Crossref Full Text | Google Scholar

Perera, B. P. R., Jayasuriya, R., Caldera, A., and Wickremasinghe, A. R. (2020). Assessing mental well-being in a Sinhala speaking Sri Lankan population: validation of the WHO-5 well-being index. Health Qual. Life Outcomes 18:305. doi: 10.1186/s12955-020-01532-8

PubMed Abstract | Crossref Full Text | Google Scholar

Pheko, M. M., Mhaka-Mutepfa, M., Mfolwe, L. M., Mathoothe, K. D., Chua, B. S., and Low, W. Y. (2023). Psychometrics validation of the WHO-5 wellbeing index in Botswana, Zimbabwe, and Malaysia: coping with COVID-19 pandemic in the global south study. SAGE Open 13, 1–11. doi: 10.1177/21582440231198387

Crossref Full Text | Google Scholar

Reitsma, L., Mooren, T. M., Mouthaan, J., van Hoof, M., Groen, S. P. N., van Dijk, I., et al. (2024). A latent class analysis on indicators of early prolonged grief disorder and well-being among Dutch adults bereaved during the first year of the COVID-19 pandemic. Clin. Psychol. Psychother. 31:e3054. doi: 10.1002/cpp.3054

PubMed Abstract | Crossref Full Text | Google Scholar

Rosseel, Y. (2019). Lavaan: Latent variable analysis. Available online at: https://cran.r-project.org/web/packages/lavaan/index.html.

Google Scholar

Schmidt, S., Mühlan, H., and Power, M. (2006). The EUROHIS-QOL 8-item index: psychometric results of a cross-cultural field study. Eur. J. Pub. Health 16, 420–428. doi: 10.1093/eurpub/cki155

PubMed Abstract | Crossref Full Text | Google Scholar

Sischka, P. E., Costa, A. P., Steffgen, G., and Schmidt, A. F. (2020). The WHO-5 well-being index – validation based on item response theory and the analysis of measurement invariance across 35 countries. J. Affect. Disord. Rep. 1:100020. doi: 10.1016/j.jadr.2020.100020

PubMed Abstract | Crossref Full Text | Google Scholar

Sischka, P. E., Martin, G., Residori, C., Hammami, N., Page, N., Schnohr, C., et al. (2025). Cross-national validation of the WHO-5 well-being index within adolescent populations: findings from 43 countries. Assessment. doi: 10.1177/10731911241309452

PubMed Abstract | Crossref Full Text | Google Scholar

Sommer, J., Borgmann, S. O., Gontscharuk, V., Zaharia, O. P., Maalmi, H., Herder, C., et al. (2024). Depression symptoms, wellbeing, health-related quality of life, and diabetes-related distress in novel subtypes of recent-onset diabetes in Germany: a 5-year observational follow-up study. Lancet Diabetes Endocrinol. 12, 890–903. doi: 10.1016/S2213-8587(24)00234-1

PubMed Abstract | Crossref Full Text | Google Scholar

Suhaimi, A. F., Makki, S. M., Tan, K.-A., Silim, U. A., and Ibrahim, N. (2022). Translation and validation of the Malay version of the WHO-5 well-being index: reliability and validity evidence from a sample of type 2 diabetes mellitus patients. Int. J. Environ. Res. Public Health 19:4415. doi: 10.3390/ijerph19074415

PubMed Abstract | Crossref Full Text | Google Scholar

Topp, C. W., Østergaard, S. D., Søndergaard, S., and Bech, P. (2015). The WHO-5 well-being index: a systematic review of the literature. Psychother. Psychosom. 84, 167–176. doi: 10.1159/000376585

PubMed Abstract | Crossref Full Text | Google Scholar

van Buuren, S., and Groothuis-Oudshoorn, K. (2011). Mice: multivariate imputation by chained equations in R. J. Stat. Softw. 45, 1–67. doi: 10.18637/jss.v045.i03

Crossref Full Text | Google Scholar

Wicke, T., Schäfer, M., Müller, K., and Löwe, B. (2022). Update of the standardization of the Patient Health Questionnaire-4 (PHQ-4) in the general population. J. Affect. Disord., 303, 1–7. doi: 10.1016/j.jad.2022.06.054

Crossref Full Text | Google Scholar

World Medical Association (2024). World medical association declaration of Helsinki: ethical principles for medical research involving human participants. JAMA 310, 2191–2194. doi: 10.1001/jama.2024.21972

Crossref Full Text | Google Scholar

Wu, H., and Estabrook, R. (2016). Identification of confirmatory factor analysis models of different levels of invariance for ordered categorical outcomes. Psychometrika, 81, 1014–1045. doi: 10.1007/s11336-016-9506-0

Crossref Full Text | Google Scholar

Yang, N., Ma, H., Huang, M., Lau, E. Y. Y., Fong, D., Wang, M., et al. (2023). Measurement properties and optimal cutoff point of the WHO-5 among Chinese healthcare students. Psychol. Res. Behav. Manag. 16, 5141–5158. doi: 10.2147/PRBM.S437219

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: well-being, self-report questionnaire, population norms, psychometrics, measurement invariance

Citation: Kliem S, Lohmann A, Fischer S, Baier D, Clemens V, Sachser C, Kampling H, Brähler E and Fegert JM (2025) Psychometric evaluation and updated community norms of the WHO-5 well-being index, based on a representative German sample. Front. Psychol. 16:1592614. doi: 10.3389/fpsyg.2025.1592614

Received: 12 March 2025; Accepted: 26 June 2025;
Published: 29 July 2025.

Edited by:

Begoña Espejo, University of Valencia, Spain

Reviewed by:

Irene Checa, University of Valencia, Spain
Cesar Merino-Soto, Monterrey Institute of Technology and Higher Education (ITESM), Mexico

Copyright © 2025 Kliem, Lohmann, Fischer, Baier, Clemens, Sachser, Kampling, Brähler and Fegert. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Sören Kliem, c29lcmVuLmtsaWVtQGVhaC1qZW5hLmRl

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.