Psychometric inspection of an internalized homonegativity measure

Kerry, Matthew James; Pfister, Andreas; Krüger, Paula

doi:10.3389/fpsyg.2025.1569382

ORIGINAL RESEARCH article

Front. Psychol., 09 September 2025

Sec. Quantitative Psychology and Measurement

Volume 16 - 2025 | https://doi.org/10.3389/fpsyg.2025.1569382

This article is part of the Research TopicBreaking Barriers in LGBT+ Health: Innovations and InsightsView all 12 articles

Psychometric inspection of an internalized homonegativity measure

Matthew James Kerry¹^*

Andreas Pfister¹^†

Paula Krüger²^†

¹Department of Health, Zurich University of Applied Sciences, Winterthur, Switzerland
²Department of Social Work, Lucerne University of Applied Sciences and Arts, Lucerne, Switzerland

Internalized homonegativity (IH) is a substantively important construct linked to various health-related quality-of-life indicators. Despite IH’s prominence in the homosexuality literature, however, several measurement challenges are posed to advancing its empirical evidence base. This study aimed to bring modern psychometric methods to bear on a measure of IH in a general population sample of self-reported gay, lesbian, and bisexual respondents in Switzerland (n = 988). Specifically, we used a prospective observational cross-sectional design with questionnaire methodology to examine three aspects of the validity of a 7-item IH instrument: (1) structure validity, (2) cross-linguistic validity, and (3) “known-groups” (discriminant) validity. Our findings indicated support for the 7-item IH measure’s essential unidimensionality. Furthermore, we found support for IH’s measurement equivalence across German- and French-speaking regions, whereas mixed support was found for its extension to bisexual respondents. Finally, the IH measure exhibited discriminant validity, such that depression and poor self-reported health status were associated with higher IH scores. In conclusion, the IH instrument may be used as a unidimensional measure across German- and French-speaking general populations; however, further research should focus on extending its linguistic validity and measurement equivalence to bisexual and transgender populations.

1 Introduction

Meyer and Dean (1998) defined internalized homophobia as “the gay person’s direction of negative social attitudes toward the self, leading to a devaluation of the self and resultant internal conflicts and poor self-regard” (p. 161). However, the term “internalized homophobia” has been critiqued and further differentiated. Wider societal factors shape the devaluation of the self. Internalized homophobia “…is not simply a product of personal, subjective, and ‘irrational’ fears” (Berg et al., 2016, p. 542). Thus, the term “internalized homonegativity” (IH) may be more appropriate and is increasingly used in contemporary scientific literature (see, e.g., Badenes-Ribera et al., 2018; Lefevor et al., 2023).

In Europe, the situation for LGBT+ persons varies greatly across sovereign states. The importance of IH for oppression, stigmatization, and minority stress is relevant to Switzerland as a whole (Krüger et al., 2023). For example, minority stress theory has postulated that marginalized intersections (racial gay minorities, or immigrant gay minorities) may be particularly at-risk of elevated IH (Frost and Meyer, 2023). Pertinent to the current study, minority stress theory posits that stigmatized individuals suffer negative health outcomes as a result of enduring societal discrimination-borne chronic stress. Health disparities between marginalized group members include the domains of physical health, substance abuse, and mental health.

Empirical research has shown IH to be associated with depression (Davidson et al., 2017), anxiety (DiPlacido, 1998), and substance use disorders (Moody et al., 2018). IH has also exhibited meta-analytic evidence for its association with bodily preconceptions (Badenes-Ribera et al., 2018) and suicidal ideation (Williams et al., 2023). IH has also been found to be related to aggression (Berg et al., 2016), partner violence (Badenes-Ribera et al., 2019), sexually transmitted infections (Berg et al., 2015), and intergroup relations (van Leeuwen et al., 2016). The negative consequences of IH are postulated to result from a psychological dilemma, specifically, a self-image disjuncture between romantic desires and public shame-borne feelings of belittlement and low self-esteem (Berg et al., 2016).

Despite IH’s substantive correlations with various health-related quality-of-life indicators, challenges remain regarding measurement consistency. For example, a systematic mapping review of empirical research identified nine distinct instruments (Berg et al., 2016). Although past studies have purported to measure IH’s measurement using 1-factor, 2-factor, or 3-factor structures, recent modern psychometric evidence has provided evidence for a 1-factor (unidimensional) solution (Wickham et al., 2021). It should also be noted that the differing number of factor solutions may simply be owed to sample variation rather than instrument instability. Indeed, given our diverse target sample of Switzerland—with several official national languages and cultures—we aimed to validate a simpler instrument that may be more robust to sampling variation, rather than an overly specific instrument that may be less generalizable. A unidimensional solution may be advantageous for both administrative and analytical ease, as well as for minimizing response burden via shorter instruments and fewer items (Reeve, 2003).

Given previous inconsistency regarding IH’s latent-factor structure, the current study aims to bring modern psychometric evidence to bear on a previously validated, adapted 7-item IH measure for use in Switzerland (Smolenski et al., 2010). To balance construct coverage (breadth) and brevity, the first two items from each of Smolenski’s three factors of IH were adopted. A seventh item from the original moral subfactor was added based on the author’s subject matter expert knowledge of the potential applicability within Switzerland. Specifically, the current study examines the following four hypotheses pertaining to IH’s internal structure validity, cross-linguistic validity, and “known-groups” (sensitivity) validity.

H1: We hypothesize that the IH measure will exhibit essential unidimensionality.

H2: We hypothesize that the IH measure will exhibit sufficient cross-linguistic validity evidence, defined as the overall measure indicating small practical effect sizes (Cohen’s d < 0.20) across German and French languages in Switzerland.

Empirical mapping of IH has exhibited positive associations with depression (Berg et al., 2016). Additionally, a recent meta-analysis of k = 68 studies, yielding n = 151 effect sizes, found that IH is negatively associated (r = −0.28) with general health status (Lefevor et al., 2023). Based on these findings, we hypothesized the following “known-groups” validity test for our locally developed IH instrument:

H3: We hypothesize that the IH measure would be positively associated with depression scores.

H4: We hypothesize that the IH measure would be negatively associated with general health status.

We examined H3 and H4 using both median-split t-tests to compare high- and low-IH score groups (i.e., “known groups” discriminant validity), as well as correlational analysis.

2 Methods

2.1 Sample

In total, 2,064 lesbian, gay, bisexual, and transgender (including non-binary) individuals participated in the study. The survey was available in German, French, Italian, or English. Although “Romansch” is the national language of Switzerland, it is spoken by only 0.4% of the population, and most Romansch speakers are also fluent in German. Due to minimal model-identification requirements (n > 100), only the German and French language versions of the IH measure were included in the current psychometric examination. In total, n = 1,174 eligible respondents self-identified as German- or French-speaking and as gay, lesbian, or bisexual. Twenty-three respondents (n = 23) were missing data on the focal IH measure in the system and were listwise deleted from the dataset. Due to space limitations, the full missing data analysis and data cleaning technical report is listed in Supplementary material. An additional 163 respondents were identified as careless based on extreme responding across normal- and reverse-scored items and were removed, thereby reducing the effective sample to n = 988 for analyses.

2.2 Design

Responses were collected via purposive sampling in a Swiss-national cross-sectional survey. Specifically, potential participants were recruited via advertisements in traditional and social media outlets by the universities, the Federal Office of Public Health, and LGBT organizations (e.g., TGNS, LOS, and Les Klamydia’s, Pink Cross). In addition, sampling frames included selected institutions from the healthcare sector, which were asked whether they could draw attention to the study on their homepage and/or by flyer display. A cross-sectional design with survey methodology was conducted (see Krüger et al., 2023, for details). On behalf of the Federal Office of Public Health, data were collected from mid-May to mid-July 2021 in a national survey aimed at describing the health status and healthcare access of LGBT people living in Switzerland.

2.3 Measures

Several socio-demographic data were collected for the present study, including sexual identity, defined as the person’s self-identification (e.g., gay, lesbian, and bisexual) (Patterson et al., 2017). The focal measure of the current short report is a 7-item Internalized Homonegativity Scale. It is adapted from Smolenski et al.’s (2010) 7-item revised Reactions to Homosexuality scale. Specifically, 6 of the 7 items were adopted and underwent standard forward-back translation protocols with native speakers of German and French. The final item from the original first subscale was replaced with one of Smolenski’s items from his fourth factor to increase construct breadth and content-relevance, based on the authors’ knowledge of the Swiss population, “Homosexuality is morally acceptable to me.” Respondents indicate their agreement on a 6-point Likert-type scale ranging from “strongly disagree” (1) to “strongly agree” (6). An example item is “I feel comfortable in gay/lesbian bars.” Internal consistency, indexed by McDonald’s omega (ω) reliability coefficient, was adequate at ω = 0.85. In addition, we administered a single-item self-reported health status measure on a 6-point Likert-type scale (very poor – very good). We also administered the 9-item Patient Health Questionnaire for assessing depression, which also exhibited strong internal consistency reliability in the current study, ω = 0.88 (Kroenke et al., 2001). Responses to the Patient Health Questionnaire are recorded on a 4-point Likert-type scale ranging from “not at all” (1) to “nearly every day” (4).

2.4 Analyses

Data cleaning and classical analyses were conducted in software IBM SPSS v29 (IBM Corp., 2023). Specifically, missing data, parallel analyses, and maximum likelihood estimation were conducted (O’Connor, 2000). Unidimensional indices were computed using Dueber’s (2017) bifactor indices calculator. Cross-linguistic validity was assessed using modern psychometric analyses conducted in software IRTPRO v5.1 with graded response model specifications (Cai et al., 2020). Specifically, differential item functioning (DIF) or, item bias, testing proceeded with magnitude assessment, comprising conventional effect size (ES) estimation according to Cohen’s (1988) rubric of small (d ≥ 0.20), medium (d ≥ 0.50), and large (d ≥ 0.80) using VisualDF (Meade, 2010). “Known-groups” validity was tested using an independent t-test on median splits of the grouping variables self-rated health and depression.

3 Results

Summary sample descriptive characteristics are presented in Table 1. Univariate item-level descriptive statistics, frequency response patterns, and graphical inspection of scale-level normal Q–Q plots provided tentative evidence for inferring univariate-normal distributional assumptions.

Table 1

Table 1. Summary of descriptive characteristics.

Specifically, nearly all items’ skewness (<3) and kurtosis (<7) values were within normality-range recommendations for large sample sizes (n > 300) (see Table 2). Item 6, however, exhibited non-normal descriptive and may be a candidate for exclusion in future studies. This item was retained in the current study. Therefore, analyses proceeded with parametric tests.

Table 2

Table 2. Descriptive statistics and factor loading of the IH items.

3.1 Structural validity

Parallel analysis of raw data suggested a potential 2-factor solution for extraction (Figure 1). This was, however, further scrutinized with exploratory factor analysis using maximum likelihood estimation, which extracted only a single factor. Specifically, the first/s eigenvalue ratio was computed as 2.67/0.97 = 2.75, suggesting negligible multidimensionality in the total sample. Finally, a bifactor model was estimated in order to compute hierarchical-omega (ωH) values for the general factor, which indicates the percentage (%) of reliable variance that can be contributed to the general factor. The current IH scale exhibited ωH = 0.90, supporting the 7-item scale’s interpretation as “essentially unidimensional” (ωH > 0.80; Reise et al., 2013, p. 224). Given past-reported factor solutions, however, we also used a confirmatory factor analysis to compare our 1-factor to 2- and 3-factor solutions as well. Items loaded onto previously reported “source” factor-loading solutions (Smolenski et al., 2010). A 1-factor model exhibited good fit to the data, χ²(488) = 1,603.53, p < 0.001, RMSEA = 0.05, BIC = 16,344.61. A 2-factor model exhibited an adequate, but comparatively worse fit to the data, χ²(487) = 1,602.71, p = 0.371, RMSEA = 0.06, BIC = 16,480.12. Finally, a 3-factor model also exhibited an adequate, but comparatively worse fit to the data, χ²(486) = 1601.19, p = 0.224, RMSEA = 0.06, BIC = 16,494.29. These findings support the retention of our 1-factor measures of IH.

Figure 1

Line graph showing three lines labeled rawdata, means, and percntyl. The x-axis is labeled root and ranges from one to seven. The y-axis ranges from negative one to three. Rawdata starts high at two, then sharply declines, while means and percntyl remain relatively flat near zero.

Figure 1. Parallel analysis for latent-factor structure. Raw data, raw-eigenvalue estimates; means, mean-eigenvalue estimates; percentile, 95% confidence interval eigenvalue estimates.

3.2 Cross-linguistic validity

The two-step procedure for item bias (differential item functioning, DIF) proceeded with statistical detection. Unsurprisingly, given the relatively large sample size (n = 988), traditional statistical indices detected nominally significant DIF across five out of seven items, although only two items indicated DIF on the slope parameter (see Table 3). Step 2’s magnitude assessment proceeded with computation of ES standardized differences (ESSDs). As exhibited in Table 3, large ESs were observed for reversed-coded item 2 and item 3. All other ESs were small according to Cohen’s (1988) rubric. Indeed, the IH-overall score indicated an ESSD of only 0.11. These findings support IH cross-linguistic validity across Swiss-German-speaking and Swiss-French-speaking samples.

Table 3

Table 3. Summary DIF statistics by total and slope parameter estimates for language.

Excluding the likely reverse-scored artifact item, the highest practical DIF and overall DIF of the IH measure are illustrated in Figure 2. As shown, on average, French respondents with equal estimated standing as German respondents on latent-IH scored approximately 0.40 points lower in agreement with the item, “I don’t mind being seen in public with an obviously gay/lesbian or bisexual person.” The overall measure, however, indicated very little difference between German- and French-language versions. Taken together, little evidence for meaningful item bias was detected in the current sample, although future research should continue to examine extensions into additional languages.

Figure 2

Two line graphs compare survey responses between French and German participants. The first graph, titled

Figure 2. Item and instrument response curves.

3.3 Known-groups validity

Independent-sample t-tests were conducted based on median splits of focal variables, namely PHQ-Depression and self-reported health status. The t-test for depression was significant in the hypothesized direction, t(986) = −4.56, p = 0.008, with an additional effect size calculated without group classification, r = 0.14, p < 0.001. The t-test for health status was also significant in the hypothesized direction, t(986) = 1.71, p = 0.043, with an additional effect size calculated without group classification, r = −0.10, p = 0.002. These findings support hypotheses 3 and 4 for the IH measure’s sensitivity across known groups. The T-test results are illustrated in Figure 3.

Figure 3

Two bar graphs showing internalized homonegativity. Left graph: comparison by PHQ-Depression, with higher values for high depression (orange) versus low depression (blue), t(986) = -4.56, p = 0.008. Right graph: comparison by health status, with higher values for low health status (blue) versus high health status (orange), t(986) = 1.71, p = 0.043. Error bars are present.

Figure 3. Independent t-tests of internalized homonegativity by (A) depression and (B) health status. “Low” and “High” groups correspond to median splits on the focal variables. N = 988.

3.4 Exploratory analyses

Because we extended the current IH measure to bisexual populations by adapting item wording, we explored whether measurement equivalence holds in this new group. Although this was not the focus of the current paper, we repeated the two-step DIF assessment above to detect potential item bias resulting from adapting the wording for the IH measurement among bisexual respondents. The summarized results, reported in Table 4, indicated that more items were detected for potential DIF (4 of 7). Step 2’s magnitude assessment indicated that three items exhibited medium to large DIF, whereas item 5 exhibited small DIF.

Table 4

Table 4. Summary DIF statistics by total and slope parameter estimates for homosexual vs. bisexual.

These findings need further examination in a larger sample.

Finally, an independent t-test indicated no significant difference between sexual identity, t(986) = −0.34, p = 0.741 (Figure 4).

Figure 4

Figure 4. Independent t-test of internalized homonegativity by sexual orientation. N = 988.

4 Discussion

This study aimed to bring psychometric evidence to bear on a measure of internal homonegativity. Specifically, four hypotheses were tested as follows: (1) IH would exhibit essential unidimensionality, (2) IH would exhibit cross-linguistic validity, (3) IH would positively associate with depression, and (4) IH would negatively associate with health status. Test results of the hypotheses are summarized below.

First, regarding dimensionality, our parallel analysis results indicated that the IH exhibited a strong general factor, which was confirmed by follow-up exploratory factor analysis. Furthermore, bifactor indices were calculated to compute a hierarchical-omega (ωH) coefficient for the general factor, which satisfied the condition for “essential unidimensionality.” Therefore, IH scores are interpretable as unidimensional construct indicators in future research. This finding is consistent with recent modern psychometric results (Wickham et al., 2021).

Second, regarding cross-linguistic validity, we conducted a statistical and practical item bias (DIF) assessment. Statistical findings indicated two potentially DIF items of the IH. Follow-up practical assessment revealed only one of these items to be potentially problematic, with an effect size d > 0.50. The follow-up assessment further revealed, however, that the reverse-scored item exhibited a large DIF, although we recommend retention of this item for potential “careless responding” detection in future research (Ward and Meade, 2023). The overall IH exhibited minimal practical DIF, suggesting sufficient cross-linguistic validity across Swiss-German and Swiss-French administrations.

Third, independent t-tests were conducted to test “known-groups” differences on the IH measure. Consistent with the literature linking IH to depression, IH scores were significantly higher for high-depression groups compared to low-depression groups (Newcomb and Mustanski, 2010). Fourth, and complementary to this, in accordance with the literature supporting the health benefits of low IH, IH scores were significantly lower among individuals reporting better health status (Lefevor et al., 2023).

Practically, the IH measure may be administered in both German- and French-speaking parts of Switzerland, whether in clinical or general population settings. Its brevity (7-item) offers benefits to aggregate public health monitoring and research endeavors. Further validation in clinical settings should be conducted and is elaborated in detail below.

5 Limitations and future research

Cross-linguistic validity can and should be extended to additional language regions (Italian and English). Although our sample sizes were low for Swiss-Italian (n = 65) and English-speaking (n = 57) respondents in Switzerland, the preliminary findings reported in Table 3 provide tentative evidence for the appropriateness of administering the IH measure in more language regions. These tentative findings should be further examined using larger and more representative samples. Furthermore, potential “self-presentation” of the homonegativity self-report may be examined for convergent validity with validated implicit measures, such as conditional reasoning tests (James and LeBreton, 2010) or through other-reported measures for corroboration. For example, respondents may not acknowledge experiencing IH (Lord, 1980), yet they still exhibit negative consequences associated with latent IH. The self-report may be triangulated to understand the construct representation in IH questionnaires. Furthermore, future research should focus on extending the validity evidence of the IH measure to additional domains, such as responsiveness over time and criterion validity. The reverse-scored item may be a crude way of identifying careless responders, and this should be more deeply examined by response pattern indices. Relatedly, item 6 “Homosexuality is morally acceptable to me” exhibited negative skew and high leptokurtosis. Given that 97% of respondents replied in the highest two response categories, this item may reflect outdated perceptions and may be a candidate for deletion in future studies. The shortening of the instrument in the current sample indicated minimal compromise to internal consistency reliability with a 6-item estimate of ω = 0.78. Finally, it should be noted that linguistic equivalence does not necessarily equate to cultural equivalence. Although our DIF analyses indicated linguistic equivalence, further content validation studies (cognitive interviews, pilot tests) should be conducted in future studies.

6 Conclusion

The IH measure demonstrated essential unidimensionality, sufficient cross-linguistic validity between German and French language versions, and “known groups” validity. Further linguistic validations may be prioritized, but preliminary evidence supports the IH measure as a short, reliable, and tentatively valid instrument for future public health research.

Data availability statement

The data was provided by the Federal Office of Public Health and is available to researchers upon written request.

Ethics statement

The requirement of ethical approval was waived by the Federal Office of Public Health for the studies involving humans because the data satisfied Switzerland’s Human Research Act regarding noninterventional de-identified data (HRA, RS 810.30). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

MK: Methodology, Writing – original draft. AP: Conceptualization, Data curation, Investigation, Resources, Writing – original draft. PK: Conceptualization, Data curation, Investigation, Project administration, Resources, Supervision, Writing – original draft.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. Open access funding by Zurich University of Applied Sciences (ZHAW).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2025.1569382/full#supplementary-material

References

Badenes-Ribera, L., Fabris, M. A., and Longobardi, C. (2018). The relationship between internalized homonegativity and body image concerns in sexual minority men: a meta-analysis. Psychol. Sex. 9, 251–268. doi: 10.1080/19419899.2018.1476905

Crossref Full Text | Google Scholar

Badenes-Ribera, L., Sánchez-Meca, J., and Longobardi, C. (2019). The relationship between internalized homophobia and intimate partner violence in same-sex relationships: a meta-analysis. Trauma Violence Abuse 20, 331–343. doi: 10.1177/1524838017708781

PubMed Abstract | Crossref Full Text | Google Scholar

Berg, R. C., Munthe-Kaas, H. M., and Ross, M. W. (2016). Internalized Homonegativity: a systematic mapping review of empirical research. J. Homosex. 63, 541–558. doi: 10.1080/00918369.2015.1083788

PubMed Abstract | Crossref Full Text | Google Scholar

Berg, R. C., Weatherburn, P., Ross, M. W., and Schmidt, A. J. (2015). The relationship of internalized homonegativity to sexual health and well-being among men in 38 European countries who have sex with men. J. Gay Lesbian Ment. Health 19, 285–302. doi: 10.1080/19359705.2015.1024375

PubMed Abstract | Crossref Full Text | Google Scholar

Cai, L., Thissen, D., and Du Toit, S. H. C. (2020). IRTPRO for windows. Version 5. Lincolnwood: Scientific Software International.

Google Scholar

Cohen, J. (1988). Statistical power analysis for the behavioral sciences. 2nd Edn. Hillsdale, NJ: Erlbaum.

Google Scholar

Davidson, K., McLaren, S., Jenkins, M., Corboy, D., Gibbs, P. M., and Molloy, M. (2017). Internalized homonegativity, sense of belonging, and depressive symptoms among Australian gay men. J. Homosex. 64, 450–465. doi: 10.1080/00918369.2016.1190215

PubMed Abstract | Crossref Full Text | Google Scholar

DiPlacido, J. (1998). Minority stress among lesbians, gay men, and bisexuals: a consequence of heterosexism, homophobia, and stigmatization. Thousand Oaks, CA: Sage Publications.

Google Scholar

Dueber, D. M. (2017). Bifactor indices calculator: a Microsoft excel-based tool to calculate various indices relevant to bifactor CFA models. doi: 10.13023/edp.tool.01

Crossref Full Text | Google Scholar

Frost, D. M., and Meyer, I. H. (2023). Minority stress theory: application, critique, and continued relevance. Curr. Opin. Psychol. 51:101579. doi: 10.1016/j.copsyc.2023.101579

PubMed Abstract | Crossref Full Text | Google Scholar

IBM Corp. (2023). IBM SPSS statistics for Windows. Version 29.0.20.0. Armonk, NY: IBM Corp.

Google Scholar

James, L. R., and LeBreton, J. M. (2010). Assessing aggression using conditional reasoning. Curr. Dir. Psychol. Sci. 19, 30–35. doi: 10.1177/0963721409359279

Crossref Full Text | Google Scholar

Kroenke, K., Spitzer, R. L., and Williams, J. B. (2001). The PHQ-9: validity of a brief depression severity measure. J. Gen. Intern. Med. 16, 606–613. doi: 10.1046/j.1525-1497.2001.016009606.x

PubMed Abstract | Crossref Full Text | Google Scholar

Krüger, P., Pfister, A., Eder, M., and Mikolasek, M. (2023). Gesundheit von LGBT-Personen in der Schweiz. Lucerne, Switzerland: Nomos Verlagsgesellschaft.

Google Scholar

Lefevor, G. T., Larsen, E. R., Golightly, R. M., and Landrum, M. (2023). Unpacking the internalized Homonegativity-health relationship: how the measurement of internalized Homonegativity and health matter and the contribution of religiousness. Arch. Sex. Behav. 52, 921–944. doi: 10.1007/s10508-022-02436-y

PubMed Abstract | Crossref Full Text | Google Scholar

Lord, F. M. (1980). Applications of item response theory to practical testing problems. New York, NY: Routledge.

Google Scholar

Meade, A. W. (2010). A taxonomy of effect size measures for the differential functioning of items and scales. J. Appl. Psychol. 95, 728–743. doi: 10.1037/a0018966

PubMed Abstract | Crossref Full Text | Google Scholar

Meyer, I. H., and Dean, L. (1998). “Internalized homophobia, intimacy, and sexual behavior among gay and bisexual men” in Stigma and sexual orientation: Understanding prejudice against lesbians, gay men, and bisexuals. eds. G. M. Herek and G. Herek (Thousand Oaks, CA: Sage Publications), 160–186.

Google Scholar

Moody, R. L., Starks, T. J., Grov, C., and Parsons, J. T. (2018). Internalized homophobia and drug use in a national cohort of gay and bisexual men: examining depression, sexual anxiety, and gay community attachment as mediating factors. Arch. Sex. Behav. 47, 1133–1144. doi: 10.1007/s10508-017-1009-2

PubMed Abstract | Crossref Full Text | Google Scholar

Newcomb, M. E., and Mustanski, B. (2010). Internalized homophobia and internalizing mental health problems: a meta-analytic review. Clin. Psychol. Rev. 30, 1019–1029. doi: 10.1016/j.cpr.2010.07.003

PubMed Abstract | Crossref Full Text | Google Scholar

O’Connor, B. P. (2000). SPSS and SAS programs for determining the number of components using parallel analysis and velicer’s MAP test. Behav. Res. Methods Instrum. Comput. 32, 396–402. doi: 10.3758/bf03200807

PubMed Abstract | Crossref Full Text | Google Scholar

Patterson, J. G., Jabson, J. M., and Bowen, D. J. (2017). Measuring sexual and gender minority populations in health surveillance. LGBT Health 4, 82–105. doi: 10.1089/lgbt.2016.0026

PubMed Abstract | Crossref Full Text | Google Scholar

Reeve, B. B. (2003). Item response theory modeling in health outcomes measurement. Expert Rev. Pharmacoecon. Outcomes Res. 3, 131–145. doi: 10.1586/14737167.3.2.131

PubMed Abstract | Crossref Full Text | Google Scholar

Reise, S. P., Bonifay, W. E., and Haviland, M. G. (2013). Scoring and modeling psychological measures in the presence of multidimensionality. J. Pers. Assess. 95, 129–140. doi: 10.1080/00223891.2012.725437

PubMed Abstract | Crossref Full Text | Google Scholar

Smolenski, D. J., Diamond, P. M., Ross, M. W., and Rosser, B. R. S. (2010). Revision, criterion validity, and multigroup assessment of the reactions to homosexuality scale. J. Pers. Assess. 92, 568–576. doi: 10.1080/00223891.2010.513300

PubMed Abstract | Crossref Full Text | Google Scholar

van Leeuwen, F., Miton, H., Firat, R. B., and Boyer, P. (2016). Perception of gay men as defectors and commitment to group defense predict aggressive homophobia. Evol. Psychol. 14, 1–8. doi: 10.1177/1474704916657833

Crossref Full Text | Google Scholar

Ward, M. K., and Meade, A. W. (2023). Dealing with careless responding in survey data: prevention, identification, and recommended best practices. Annu. Rev. Psychol. 74, 577–596. doi: 10.1146/annurev-psych-040422-045007

PubMed Abstract | Crossref Full Text | Google Scholar

Wickham, R. E., Gutierrez, R., Giordano, B. L., Rostosky, S. S., and Riggle, E. D. B. (2021). Gender and generational differences in the internalized homophobia questionnaire: an alignment IRT analysis. Assessment 28, 1159–1172. doi: 10.1177/1073191119893010

PubMed Abstract | Crossref Full Text | Google Scholar

Williams, D. Y., Hall, W. J., Dawes, H. C., Srivastava, A., Radtke, S. R., Ramon, M., et al. (2023). Relationships between internalized stigma and depression and suicide risk among queer youth in the United States: a systematic review and meta-analysis. Front. Psych. 14:1205581. doi: 10.3389/fpsyt.2023.1205581

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: internalized homonegativity, sexual identity, LGBTQ persons, structural validity, cross-linguistic validity, discriminant validity

Citation: Kerry MJ, Pfister A and Krüger P (2025) Psychometric inspection of an internalized homonegativity measure. Front. Psychol. 16:1569382. doi: 10.3389/fpsyg.2025.1569382

Received: 31 January 2025; Accepted: 20 June 2025;
Published: 09 September 2025.

Edited by:

Piotr Karniej, WSB MERITO University in Wroclaw, Poland

Reviewed by:

Banu Cingöz-Ulu, Middle East Technical University, Türkiye
Marcel Hackbart, Fachhochschule des Mittelstands, Germany

Copyright © 2025 Kerry, Pfister and Krüger. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Matthew James Kerry, a2VyckB6aGF3LmNo

^†ORCID: Andreas Pfister, https://orcid.org/0000-0002-9242-8260
Paula Krüger, https://orcid.org/0000-0001-8573-2793

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.