Your new experience awaits. Try the new design now and help us make it even better

SYSTEMATIC REVIEW article

Front. Psychol., 09 January 2026

Sec. Quantitative Psychology and Measurement

Volume 16 - 2025 | https://doi.org/10.3389/fpsyg.2025.1591800

Evaluating alcohol consumption in adolescents and young adults: a meta-analysis of the psychometric properties of measurement instruments

  • 1Carrera de Psicología, Grupo de Investigación en Psicología Comunitaria y Salud (GIPCS), Pontificia Universidad Católica del Ecuador, Esmeraldas, Ecuador
  • 2Carrera de Psicología Clínica, Grupo de Investigación en Neurociencia Clínica Aplicada (GINCA), Universidad Politécnica Salesiana, Cuenca, Ecuador
  • 3Departamento de Psicología, Universidad de Oviedo, Oviedo, España

Introduction: Early alcohol consumption is a public health concern among young people in most Western countries. However, psychometric evidence on the tools used to assess alcohol consumption and its associated factors is limited. This meta-analysis aimed to evaluate the overall reliability and measurement quality of instruments assessing alcohol consumption, following standardized methodological frameworks.

Method: The protocol was registered in PROSPERO (ID: CRD4202424533078) and followed PRISMA guidelines for systematic review and meta-analysis. The search was conducted in PubMed, Scopus, PsycINFO, and Web of Science using the PECOS strategy. Methodological characteristics of the studies and the reported psychometric properties of the instruments were documented. Reliability estimates and confidence intervals were analyzed using a restricted maximum likelihood random-effects model. Methodological quality and risk of bias were evaluated based on the COSMIN checklist for systematic reviews of outcome measurement instruments and the Terwee quality criteria, ensuring the inclusion of studies with acceptable internal consistency (α ≥ 0.70) and appropriate model fit (CFI ≥ 0.90; RMSEA ≤ 0.08).

Results: Twenty-seven studies met the inclusion criteria, reporting Cronbach’s alpha coefficients between 0.71 and 0.98. The pooled reliability coefficient was α = 0.88 (SE = 0.0088; 95% CI = 0.86–0.89), indicating high internal consistency. Significant heterogeneity (I2 = 99.6%) was observed, primarily related to differences in sample size and item number. In addition, according to COSMIN and Terwee assessments the Alcohol Use Disorders Identification Test (AUDIT) and the Brief Young Adult Alcohol Consequences Questionnaire (BYAACQ) demonstrated the strongest psychometric robustness, supported by consistent reliability and validity across cultural contexts.

Conclusion: This meta-analysis provides strong evidence of reliability and adequate structural validity for instruments measuring alcohol consumption in adolescents and young adults. The AUDIT and BYAACQ emerge as the most reliable and valid options, while the CLASS, PRQ, and ACQ-SF-R also represent promising, psychometrically viable alternatives for assessing beliefs, parental norms, and craving associated with alcohol use. These instruments can guide early detection and preventive interventions, supporting more consistent measurement standards in public health and behavioral research.

1 Introduction

Alcohol is the most widely consumed psychoactive substance in the world (Gomà-i-Freixanet et al., 2023). In most Western countries, alcohol consumption is a significant public health concern among adolescents and young adults (Smit et al., 2020; Campbell et al., 2022; Stamates et al., 2023). In 2021, approximately 2.3 billion people consumed alcoholic beverages worldwide, and about 283 million people over the age of 15 were diagnosed with alcohol use disorder (World Health Organization, 2021).

In recent decades, reports from Anglo-Saxon countries indicate that the prevalence of alcohol consumption among adolescents and young adults ranges from 20 to 70% (Morales Quintero et al., 2019; Villarosa-Hurlocker et al., 2020; Hummer et al., 2022). The highest prevalence is observed at age 15 (Toner et al., 2019). In 2019, the global per capita alcohol consumption among individuals aged 15 years and older was 5.5 liters of pure alcohol (World Health Organization, 2023). In North America, 78.2% of individuals aged 15 or older reported drinking alcohol in the past year (Dermody et al., 2023).

Early alcohol consumption, widespread normalization, and low risk perception (Sánchez-García et al., 2020) result from culturally ingrained norms in society. The pervasive nature of alcohol consumption underscores the need for early detection (Hadland et al., 2019; Duffy et al., 2023). Alcohol consumption has a profound impact on adolescent and young adult development, education, and social and family relationships. It is also significantly associated with unemployment, low socioeconomic status, traffic accidents, and violence (Klimkiewicz et al., 2021; Voss et al., 2021).

Risk behaviors associated with alcohol consumption among adolescents and young adults generally follow two patterns. The first is continuous and moderate consumption throughout the week without intoxication, commonly referred to as the Mediterranean drinking style (Callinan et al., 2022). The second is characterized by heavy episodic drinking, also known as binge drinking, commonly referred to as the Anglo-Saxon drinking style (Cox et al., 2022).

Several validated and reliable psychometric instruments are available for assessing alcohol consumption (Rodrigues et al., 2021; Delawalla et al., 2023). The Alcohol Use Disorders Identification Test (AUDIT) and its shorter version, the AUDIT-C, are the most widely used psychometric tools for diagnosing alcohol consumption in the general population (Toner et al., 2019). The AUDIT is the most commonly used international instrument (Ohtani et al., 2023) due to its affordability, ease of administration, and rapid application. It has demonstrated strong psychometric reliability for detecting hazardous drinking and dependence in various countries (O'Brien et al., 2020; Klimkiewicz et al., 2021; Duffy et al., 2023).

Unlike Toner et al. (2019), the present meta-analysis broadens the scope by including studies published between 2019 and 2024 and integrates both reliability estimates and model-fit indices (CFI, TLI, RMSEA, SRMR) of psychometric instruments used to assess alcohol consumption and associated factors (Ramírez et al., 2025). Furthermore, it compares instruments that evaluate consumption levels, motives, consequences, and protective factors, thus providing an updated and comprehensive synthesis of psychometric evidence supporting research and clinical applications in adolescent and young adult populations.

The objective of this systematic review and meta-analysis was to evaluate the overall reliability and model fit of psychometric instruments reporting Cronbach’s alpha as a measure of internal consistency, and to identify the most frequently used instruments along with their key psychometric characteristics.

Specific objectives: (1) To describe the methodological and psychometric characteristics of the instruments identified in the systematic review; (2) To analyze the overall and instrument-specific reliability based on Cronbach’s alpha coefficients; (3) To evaluate the overall and instrument-specific model fit using confirmatory indices such as the Comparative Fit Index (CFI), Tucker–Lewis Index (TLI), Root Mean Square Error of Approximation (RMSEA), and Standardized Root Mean Square Residual (SRMR).

1.1 Materials and methods

A systematic review and meta-analysis were conducted following the PRISMA Declaration guidelines (Page et al., 2021). The meta-analysis protocol was registered in PROSPERO 2024 (ID: CRD42024533078).

The search terms were defined using the PECOS strategy. Population: Children, adolescents, and young adults up to 25 years old. Exposure: Assessment, diagnosis, or measurement of alcohol consumption. Comparison: Psychometric instruments, questionnaires, scales, and tests used. Outcome: Generalized reliability of psychometric instruments. Study design: Quantitative studies reporting psychometric data of interest.

1.2 Inclusion criteria

Studies were included if they met the following criteria: (1) Original quantitative studies assessing alcohol consumption, (2) Studies conducted with children, adolescents, and young adults (up to 25 years old), (3) Studies that used psychometric instruments to assess alcohol consumption, (4) Studies that reported psychometric data and overall reliability of the instrument, (5) Studies with a sample size of at least 10 participants per item of the instrument, (6) Studies published in the last 5 years (2019–2024).

1.3 Exclusion criteria

Studies were excluded if they met any of the following criteria: (1) Systematic reviews, meta-analyses, book chapters, theses, conference proceedings, or abstracts, (2) Studies on adult populations over 25 years old, (3) Studies reporting participants with polysubstance use or other drug consumption, (4) Qualitative studies using interviews or surveys, (5) Studies with participants diagnosed with dual pathology or psychiatric disorders, (6) Studies that conducted evaluations exclusively in an online format.

1.4 Sources of information

The search was conducted in the following databases: PubMed, Scopus, Web of Science, and PsycINFO. These databases were selected based on their scientific quality and international relevance in health and social sciences. Additionally, the reference lists of included studies were examined to minimize exclusion bias due to database coverage.

1.5 Search strategy

The search phrase was developed using a combination of terms frequently used in original research and review studies on alcohol consumption: (“Alcoholism” OR “Alcohol Drinking” OR “Binge Drinking” OR “Underage Drinking”) AND (“Child” OR “Adolescent” OR “Young Adult”) AND (“Assessment” OR “Measure” OR “Measurement” OR “Diagnosis”) AND (“Test” OR “Scale” OR “Questionnaire”). The search terms were adapted to each database (see Supplementary S1).

The decision to limit the search to studies published in the last 5 years was based on a bibliometric analysis of academic production trends in alcohol consumption research. Publication metrics from the four databases indicated a significant decline in studies on this topic since 2019: 779 studies (2019), 679 (2020), 617 (2021), 421 (2022), 392 (2023), and 58 in the first quarter of 2024 (see Supplementary S2).

This decline may be due to shifts in research priorities, particularly the focus on COVID-19 during this period. Therefore, including studies up to 2024 ensures that the findings incorporate reliable and relevant methodological approaches, population trends, and recent developments in the field.

1.6 Review strategy

The systematic search was conducted on March 18, 2024. Search records were verified by the principal investigator and a collaborator. After ensuring consistency across the four databases, the records were downloaded in RIS format.

The Rayyan web application (Ouzzani et al., 2016) was used to code the records. Functions such as duplicate detection, inclusion, exclusion, and potential inclusion were applied. Tags were used to document the reasons for selection decisions. All data were stored in the Rayyan system.

A total of 4,412 records were identified. Duplicates were removed using both algorithmic and manual detection, eliminating 1,462 articles due to over 90% content overlap. Based on a rigorous title and keyword screening, 2,760 articles were excluded according to the predetermined inclusion/exclusion criteria. After screening abstracts, 112 additional studies were excluded.

Among the 78 full-text studies assessed, 51 were excluded for the following reasons: Cronbach’s alpha was not reported as a global measure (n = 11). Studies did not report psychometric data (n = 15). The mean participant age exceeded 25 years (n = 10). Full-text access was unavailable (n = 4). The study was outside the publication range (n = 3). Online-only assessment was performed (n = 4). The study was a preliminary investigation (n = 1). The instrument used was not a psychometric measurement tool (n = 1). The study was based on surveys rather than psychometric instruments (n = 1). The study included participants who used non-alcoholic drugs (n = 1).

The final selection was determined through full-text peer review by the reviewer and a collaborator. Disagreements were resolved through deliberation and consensus. Ultimately, 27 articles were included in the systematic review and meta-analysis (Figure 1).

Figure 1
Flowchart depicting the identification and screening process of studies. Initially, 4,412 records are identified from databases: Web of Science (1,019), PsycINFO (812), PubMed (601), and Scopus (1,980). After removing 1,462 duplicates, 2,950 titles are inspected. From these, 2,760 records are excluded for not meeting inclusion criteria. Abstract inspection reduces the count to 190, and further full-text inspection narrows it to 78. Ultimately, 27 studies are included. Exclusions during abstract and full-text inspections are listed with specific reasons.

Figure 1. PRISMA flowchart.

1.7 Data extraction process

Two primary data categories were defined: Methodological characteristics of the included studies and Psychometric properties of the instruments.

The data were extracted according to predefined variables and cross-checked by the reviewer and collaborator. Discrepancies were resolved through comparison and verification.

The following variables were collected: (1) Study characteristics: Author, year of publication, title, journal, quartile ranking, and sample size. (2) Instrument characteristics: Psychometric test, number of items, country, participant age, gender distribution, and prevalence of risky or harmful alcohol consumption.

Psychometric properties: (1), Reliability (Cronbach’s alpha), (2) Goodness-of-fit indices for the structural model, including: Comparative Fit Index (CFI), Root Mean Square Error of Approximation (RMSEA), Tucker-Lewis Index (TLI), Standardized Root Mean Square Residual (SRMR).

1.8 Risk of bias and quality assessment

The methodological quality of the included studies and the measurement properties of the instruments were evaluated according to the quality criteria proposed by Terwee et al. (2007), the guidelines of the COSMIN Manual for Systematic Reviews of Outcome Measurement Instruments (Mokkink et al., 2024), and the methodological recommendations of the JBI Manual (Stephenson et al., 2020), specific to psychometric reviews.

Each study was analyzed across the following psychometric quality domains: reliability (internal consistency and temporal stability), structural validity (model fit indices: CFI, TLI, RMSEA, and SRMR), content validity (item representativeness and conceptual clarity), and cross-cultural validity (linguistic adaptation and population relevance). Internal consistency was considered acceptable when Cronbach’s alpha was ≥ 0.70, following Terwee et al. (2007).

Model fit indices were interpreted according to COSMIN guidelines: CFI ≥ 0.90, TLI ≥ 0.90, RMSEA ≤ 0.08, and SRMR ≤ 0.05, indicating good structural model fit. The risk of bias was rated across four levels (very good, adequate, doubtful, or inadequate), considering study design, sample size adequacy (minimum of 10 participants per instrument item), and completeness of reported psychometric information.

The evaluation was independently conducted by two reviewers, with an inter-rater agreement of 96.4%, and discrepancies were resolved by consensus. Detailed results of this assessment, including the scoring of measurement properties and methodological quality following the Terwee criteria and COSMIN risk-of-bias rubric, are provided in Supplementary S3.

Finally, potential sources of heterogeneity were explored through subgroup and sensitivity analyses to distinguish statistical variability from methodological heterogeneity (Harrer et al., 2021).

1.9 Statistical analysis

This study employed reliability generalization and meta-analytic approaches to evaluate internal consistency and model fit metrics of alcohol assessment instruments. Cumulative reliability was estimated using Cronbach’s alpha, transformed through the Hakstian–Whalen method to facilitate random-effects modeling (Hakstian and Whalen, 1976).

The studies also reported model fit indices, including the Comparative Fit Index (CFI), the Tucker–Lewis Index (TLI), the Root Mean Square Error of Approximation (RMSEA), and the Standardized Root Mean Square Residual (SRMR). Heterogeneity was assessed using Cochran’s Q, I2, H2, and τ2 statistics (Higgins and Thompson, 2002). Studies deviating from alpha > 0.70 or a 10:1 participant-to-item ratio were excluded using trimming techniques.

Potential publication bias was evaluated through funnel plot asymmetry and Egger’s regression test. The meta-analysis provided a comprehensive synthesis of reliability and validity indicators across diverse applications, allowing for the generalization of psychometric properties in alcohol assessment.

All statistical analyses were conducted using The Jamovi Project (2023) version 2.4.8 and R (Metafor package), version 2023.09.1 + 494.

2 Results

A total of 27 scientific articles were included in this systematic review and meta-analysis. These studies report the use of psychometric instruments to assess alcohol consumption in adolescents and young adults. The characteristics of the studies and instruments are described below (Table 1).

Table 1
www.frontiersin.org

Table 1. Articles selected for meta-analysis.

2.1 Characteristics of the included studies

The highest frequency of publication occurred in 2020, with 10 studies representing 37.0% of the total. The journal with the highest number of publications was Psychological Assessment (n = 5; 18.5%). Most studies were published in Q1-indexed journals (n = 23; 85.2%).

In terms of geographical distribution, most studies were conducted in North America (n = 20; 54.1%), followed by Europe (n = 12; 32.4%). The United States was the country with the highest number of publications (n = 19; 51.4%).

The total sample across all studies comprised 78,972 participants, with a minimum sample size of 124 and a maximum of 10,954. The mean age of participants was 19.1 years (SD = 2.90), with an age range between 12.5 and 23.6 years.

The mean number of items in the psychometric instruments documented was 12.7, with a minimum of 2 and a maximum of 24 items per instrument.

2.2 Documented psychometric instruments

Fourteen psychometric instruments were identified (see Supplementary S4).

The most frequently used was the Alcohol Use Disorders Identification Test (AUDIT; Saunders et al., 1993), reported in 25.9% of the studies. The Brief Young Adult Alcohol Consequences Questionnaire (BYAACQ; Kahler et al., 2005) was used in 14.8%, while the College Life Alcohol Salience Scale (CLASS; Osberg et al., 2010; Prince et al., 2018) was documented in 11.3% of the studies. The Protective Drinking Practices Scale (PDPS; Martin et al., 2020) and the Parental Rules Toward Alcohol Use (PRQ; Van der Vorst et al., 2006) were each used in 7.4% of the studies.

Additional instruments identified included the AR2i (Cortés et al., 2017), Brief Situational Confidence Questionnaire–Alcohol (BSCQ; Delaney et al., 2020), Alcohol Sensitivity Questionnaire (ASQ; O’Neill et al., 2002), Alcohol-Induced Blackout Measure (ABOM; Miller et al., 2019), Perceived Access to Alcohol and Other Drug Scale (Kuntsche et al., 2008), Protective Behavioral Strategies Scale-20 (S-PBSS-20; Sánchez-García et al., 2020), Trauma-Related Drinking to Cope (TRD; Hawn et al., 2020), Alcohol Craving Questionnaire–Short Form–Revised (ACQ-SF-R; Singleton et al., 1994), and Responsible Drinking Identity (PARDI; Leary et al., 2023).

The instruments were classified based on their primary function: Assessment of risk and severity of alcohol consumption: Used in eight studies (AUDIT: Saunders et al., 1993; AR2i: Cortés et al., 2017). Evaluation of alcohol-related consequences: Used in six studies (BYAACQ: Kahler et al., 2005; ABOM: Miller et al., 2019; S-PBSS-20: Sánchez-García et al., 2020). Examination of motives and reasons for alcohol consumption: Reported in six studies (ACQ-SF-R: Singleton et al., 1994; ASQ: O’Neill et al., 2002; CLASS: Osberg et al., 2010; Prince et al., 2018; TRD: Hawn et al., 2020). Assessment of risk and protective factors: Used in seven studies (PRQ: Van der Vorst et al., 2006; Perceived Access to Alcohol and Other Drug Scale: Kuntsche et al., 2008; PDPS: Martin et al., 2020; BSCQ: Delaney et al., 2020; PARDI: Leary et al., 2023).

In the analysis of characteristics related to language, population, and application context, four instruments stand out as the most representative: The Alcohol Use Disorders Identification Test (AUDIT; Saunders et al., 1993) is the instrument with the highest number of cultural and linguistic adaptations (English, Spanish, Portuguese, Chinese, Arabic, among others). It is commonly used as a screening tool in both general and clinical populations, particularly among adults and young adults.

The Brief Young Adult Alcohol Consequences Questionnaire (BYAACQ; Kahler et al., 2005) is available in English, Spanish, and Chinese. It is frequently applied in young adult populations to identify the negative consequences associated with alcohol consumption.

The College Life Alcohol Salience Scale (CLASS; Osberg et al., 2010) was originally developed in English and later adapted into Spanish by Bravo et al. (2019). Its primary use is within university settings to assess beliefs and attitudes toward alcohol use among young adults.

The Parental Rules toward Alcohol Use Questionnaire (PRQ; van der Vorst et al., 2005; Van der Vorst et al., 2006) is available in Dutch and English. It has been employed with adolescents aged 12–18 years, particularly within parent–child dyads in the Netherlands and the United Kingdom, to examine family norms and parental rule-setting regarding alcohol use.

2.3 Descriptive analysis of Cronbach’s alpha and fit indices

The overall reliability (Cronbach’s alpha) across studies indicated good internal consistency among the measurement instrument items (M = 0.876; SD = 0.0560).

Additionally, fit indices were reported as follows: Comparative Fit Index (CFI): M = 0.941; SD = 0.0271 (23 studies), Tucker-Lewis Index (TLI): M = 0.918; SD = 0.0537 (14 studies), Root Mean Square Error of Approximation (RMSEA): M = 0.0655; SD = 0.0414 (23 studies), Standardized Root Mean Square Residual (SRMR): M = 0.0414; SD = 0.0225 (13 studies). These values suggest a good fit of the theoretical model to the observed data (Table 2).

Table 2
www.frontiersin.org

Table 2. Descriptive analysis of Cronbach’s alpha and fit indices.

2.4 Instrument-specific reliability and model-fit analysis

The most frequently used psychometric instruments- those included in two or more studies- were the Alcohol Use Disorders Identification Test (AUDIT), the Parental Rules toward Alcohol Use Questionnaire (PRQ), the College Life Alcohol Salience Scale (CLASS), and the Brief Young Adult Alcohol Consequences Questionnaire (BYAACQ) (see Supplementary S5).

Seven studies employing the AUDIT reported a mean Cronbach’s alpha of 0.83 (SD = 0.068). Model-fit indices were satisfactory: CFI averaged 0.96 (SD = 0.040; 3 studies), TLI averaged 0.97 (SD = 0.0064; 2 studies), RMSEA averaged 0.032 (SD = 0.018; 3 studies), and SRMR was 0.024 (1 study). These findings confirm strong internal consistency and adequate model fit across different cultural and population contexts.

Seven studies (from two original articles) using the PRQ showed a mean Cronbach’s alpha of 0.89 (SD = 0.020). The mean CFI was 0.94 (SD = 0.026; 7 studies), TLI averaged 0.89 (SD = 0.055; 7 studies), RMSEA averaged 0.087 (SD = 0.026; 7 studies), and SRMR averaged 0.040 (SD = 0.029; 7 studies). Although reliability values were high, some RMSEA values indicated moderate model misfit, possibly reflecting differences in sample age and item structure.

Six studies (from two original articles) applying the CLASS reported a mean Cronbach’s alpha of 0.89 (SD = 0.034). The mean CFI was 0.94 (SD = 0.014; 6 studies), TLI averaged 0.94 (SD = 0.024; 3 studies), RMSEA averaged 0.051 (SD = 0.023; 6 studies), and SRMR averaged 0.042 (SD = 0.0087; 3 studies). These results indicate good internal consistency and adequate model fit.

Four studies using the BYAACQ presented a mean Cronbach’s alpha of 0.90 (SD = 0.026). Available model-fit indices were: CFI = 0.97 (1 study), TLI = 0.91 (1 study), and RMSEA = 0.055 (1 study), while SRMR was not reported. Overall, the BYAACQ demonstrated strong reliability and satisfactory structural validity among adolescent and young adult populations.

2.5 Random effects model and heterogeneity statistics

The 27 studies included in this meta-analysis reported a total of 37 reliability measures. These measures were pooled, and Cronbach’s alpha reliability estimates, along with their confidence intervals, were calculated using a restricted maximum likelihood random-effects model (Table 3).

Table 3
www.frontiersin.org

Table 3. Random-effects model and heterogeneity statistics.

The reliability estimation criterion was Cronbach’s alpha > 0.70, while the fit index criteria were: Comparative Fit Index (CFI) > 0.90; Tucker-Lewis Index (TLI) > 0.90; Standardized Root Mean Square Residual (SRMR) < 0.04; Root Mean Square Error of Approximation (RMSEA) < 0.08. Heterogeneity was assessed using Cochran’s Q and the I2 statistic, where an I2 value of 50% indicates low heterogeneity. Possible publication bias was evaluated using the Funnel Plot and Egger’s test.

The pooled Cronbach’s alpha reliability coefficient was α = 0.877 (SE = 0.00883; 95% CI = 0.859–0.894). However, significant statistical heterogeneity was detected across the included studies (I2 = 99.65%; Q = 9615.553; p < 0.001), likely due to differences in sample size and instrument length.

A global sensitivity analysis of Cronbach’s alpha was conducted using four models (see Supplementary S6).

The sensitivity analysis is summarized as follows: in Model 1, Egger’s coefficient was −5.301, and I2 was 99.65%. In the second model, studies with an alpha lower than 0.80 were removed. Minor differences were observed in the coefficients, with Egger = −3.317 and I2 = 95.5%. In the third model, studies with an alpha greater than 0.90 were excluded, resulting in coefficients of Egger = −3.109 and I2 = 98.85%. In Model 4, studies with alpha values lower than 0.83 and higher than 0.87 were removed, yielding an Egger coefficient of −3.537 and I2 = 61.73% (Figure 2).

Figure 2
Forest plot and funnel plot for a meta-analysis, labeled Model 4 with seven studies. The forest plot shows effect sizes ranging from 0.83 to 0.87 with confidence intervals. The random-effects model summary is 0.86. The funnel plot displays standard errors against effect sizes, indicating possible publication bias.

Figure 2. Forest and funnel plot of Cronbach’s alpha global sensitivity analysis.

Thus, outliers appear to influence the initial model. However, these results should be interpreted with caution. Addressing this issue will be a priority objective for future studies in the following year.

2.5.1 Comparative fit index

The random-effects meta-analysis of CFI values resulted in a pooled estimate of 0.941 (SE = 0.00564; 95% CI = 0.930–0.952), indicating a good model fit. However, significant heterogeneity was observed (I2 = 99.89%; Q = 13383.862; p < 0.001), attributed to differences in sample size and instrument structure (Figure 3).

Figure 3
Forest plot and funnel plot for CFI with 23 studies. The forest plot shows point estimates and confidence intervals from individual studies, ranging from 0.88 to 0.98, with an overall random effects model estimate of 0.94 (95% CI: 0.93, 0.95). The funnel plot displays the standard error against CFI values to assess publication bias, forming a symmetric funnel.

Figure 3. Forest and funnel plot for CFI meta-analysis.

2.5.2 Tucker-Lewis index

The pooled estimate for TLI was 0.918 (SE = 0.0143; 95% CI = 0.890–0.946), suggesting a good model fit. However, high heterogeneity was detected (I2 = 99.94%; Q = 9922.357; p < 0.001), likely due to variations in sample size and number of instrument items (Figure 4).

Figure 4
Forest plot and funnel plot related to TLI values from multiple studies. The forest plot shows individual study estimates with confidence intervals, and a pooled overall effect. The funnel plot displays standard error against effect size to assess publication bias, with most points symmetrically distributed around the pooled effect line.

Figure 4. Forest and funnel plot for TLI meta-analysis.

2.5.3 Root mean square error of approximation

The pooled estimate for RMSEA was 0.0672 (SE = 0.0107; 95% CI = 0.046–0.088), suggesting a moderately adequate fit, considering the recommended threshold of < 0.08. However, moderate heterogeneity was detected (I2 = 63.06%; Q = 54.870; p < 0.001) (Figure 5).

Figure 5
Forest plot and funnel plot showing RMSEA values for 23 studies. The forest plot includes effect sizes with confidence intervals, and a summary diamond for the random effects model. The funnel plot displays standard error versus effect size, with data points distributed symmetrically, suggesting no publication bias.

Figure 5. Forest and funnel plot for RMSEA meta-analysis.

2.5.4 Standardized root mean square residual

The pooled estimate for SRMR was 0.0375 (SE = 0.00860; 95% CI = 0.021–0.054), indicating a good model fit, given the recommended cut-off value of 0.05. The heterogeneity analysis showed moderate heterogeneity (I2 = 35.09%; Q = 14.573; p = 0.0266) (Figure 6).

Figure 6
A forest plot and funnel plot displaying a meta-analysis of thirteen studies on standardised root mean square residual (SRMR). The forest plot shows each study's effect size with confidence intervals, and the overall effect size from a random effects model is 0.04 [0.02, 0.05]. The funnel plot displays effect sizes against standard errors, with most points clustered towards the top.

Figure 6. Forest and funnel plot for SRMR meta-analysis.

3 Discussion

The objective of this study was to analyze the reliability of psychometric instruments used to assess alcohol consumption among adolescents and young adults. In health and social sciences, internal consistency reliability is commonly assessed using Cronbach’s alpha (Doval et al., 2023). The 27 studies included in this meta-analysis allowed us to document 27 Cronbach’s alpha measures, considering studies with multiple samples.

The Alcohol Use Disorders Identification Test (AUDIT) with a two-factor structure, was the most widely used instrument for diagnosing alcohol consumption (Morales Quintero et al., 2019; López et al., 2019; Watterson et al., 2019; Hallit et al., 2020; Villarosa-Hurlocker et al., 2020; Delawalla et al., 2023; Gomà-i-Freixanet et al., 2023). This structure includes a consumption factor with three items and a consequences factor with six items (Seay and Feely, 2020). In this meta-analysis, Cronbach’s alpha for the AUDIT ranged from 0.79 to 0.98, consistent with previous studies reporting values between 0.81 and 0.87 (Dermody et al., 2023).

The Brief Young Adult Alcohol Consequences Questionnaire (BYAACQ) was the second most frequently used instrument. This tool is particularly sensitive to detecting the negative consequences of alcohol consumption among students and demonstrates high internal consistency (Zamboanga et al., 2019; Zhang et al., 2019; Tucker et al., 2020; Stamates et al., 2023). In this meta-analysis, the BYAACQ’s Cronbach’s alpha ranged from 0.88 to 0.94.

In the comparison of the most frequently used instruments, the AUDIT shows the best psychometric balance, combining acceptable internal consistency (α = 0.83, SD = 0.067), which confirms its reliability across diverse cultural and population contexts, consistent with the findings of Toner et al. (2019) and Dermody et al. (2023). However, the present meta-analysis provides additional evidence supporting an excellent model fit (CFI = 0.96, TLI = 0.97, RMSEA = 0.032, SRMR = 0.024).

The BYAACQ exhibits the highest reliability (α = 0.90) and satisfactory model fit (CFI = 0.97, RMSEA = 0.055), although these results are based on fewer studies and with some fit indices unreported. The CLASS demonstrates good reliability (α = 0.89) and acceptable fit (CFI = 0.94, RMSEA = 0.051), particularly in university samples; however, not all fit indices were reported. In contrast, the PRQ shows strong internal consistency (α = 0.89) but a higher RMSEA value (0.087) and incomplete reporting of fit indices, suggesting moderate structural misfit.

3.1 Reliability

The 37 reliability measures analyzed in this meta-analysis indicate good internal consistency across the psychometric instruments. The overall Cronbach’s alpha estimate was 0.88 (95% CI: 0.86–0.89), aligning with the recommended range of 0.80 to 0.90 (Streiner, 2003; Oviedo and Campo-Arias, 2005). All instruments exceeded the 0.70 threshold despite being applied in diverse cultural contexts, including Mexico, Ecuador, Argentina, Spain, Portugal, the United States, the United Kingdom, the Netherlands, Australia, China, and Lebanon.

Based on the interpretation guidelines by George and Mallery (2010), no instrument in this study reported a questionable Cronbach’s alpha (below 0.70). Instead: Three instruments had acceptable Cronbach’s alpha values above 0.70 (Sánchez-García et al., 2020; Delawalla et al., 2023; Gomà-i-Freixanet et al., 2023). Eighteen instruments reported Cronbach’s alpha above 0.80 (Morales Quintero et al., 2019; López et al., 2019; Watterson et al., 2019; Zamboanga et al., 2019; Hawn et al., 2020; Villarosa-Hurlocker et al., 2020; Bravo et al., 2019; Delaney et al., 2020; Rodrigues et al., 2021; Trager et al., 2021; McKay et al., 2021; Stamates et al., 2023; Leary et al., 2023). Sixteen instruments showed excellent Cronbach’s alpha values exceeding 0.90 (Zhang et al., 2019; Miller et al., 2019; Lui, 2019; Tucker et al., 2020; Martin et al., 2020; Lewis et al., 2020; Hallit et al., 2020; Lui et al., 2020; Trager et al., 2021; Motos Sellés et al., 2021; Davis et al., 2021; McKay et al., 2021; Jordan et al., 2021; Leary et al., 2023).

However, very high Cronbach’s alpha values may indicate item redundancy or overfitting, leading to overestimated internal consistency (Streiner, 2003). For example, an alpha of 0.98 (Hallit et al., 2020) suggests potential item duplication. The optimal threshold for Cronbach’s alpha is recommended at 0.90 ± 0.02, with values above this suggesting redundancy (Streiner, 2003).

In the context of this meta-analysis, high alpha values may inflate pooled reliability estimates and increase heterogeneity across studies. This effect may result from item redundancy rather than genuine internal consistency, which helps to interpret the aggregated reliability more accurately. This finding also underscores the need to complement alpha with additional reliability indicators, such as McDonald’s omega (ω), which is based on factor analysis and provides a more comprehensive evaluation of the internal structure (Hayes and Coutts, 2020).

3.2 Psychometric fit indices

The Comparative Fit Index (CFI) = 0.94 and Tucker-Lewis Index (TLI) = 0.92 indicate good model fit. However, a high degree of heterogeneity among studies was observed, likely due to differences in sample size, number of items, and sociocultural characteristics, rather than random variation. The Root Mean Square Error of Approximation (RMSEA) = 0.067 suggests a moderate fit, aligning with Hoogland and Boomsma’s (1998) recommendation that values below 0.05 are considered optimal. Similarly, the Standardized Root Mean Square Residual (SRMR) = 0.040 indicates an acceptable fit for the meta-analysis model. Heterogeneity was moderate for RMSEA and low for SRMR.

Variability in the models reflects real differences between studies rather than random error. This underscores the importance of considering heterogeneity when interpreting meta-analysis results, particularly for clinical applications or future research. Fit indices should always be evaluated collectively and within the study’s specific context.

3.3 Limitations

One limitation of this meta-analysis is its reliance on Cronbach’s alpha as the primary reliability metric. While widely used, Cronbach’s alpha assumes unidimensionality, tau-equivalence (equal factor loadings), and independent errors, which may not always hold true (Doval et al., 2023; Kalkbrenner, 2024). The indiscriminate use of Cronbach’s alpha has led to its misconception as the “gold standard” of reliability, despite its limitations (Elosua Oliden and Zumbo, 2008). Cronbach’s alpha depends on several statistical assumptions, including normality of test scores, unidimensionality, unit-weighted scores, essential tau-equivalence, and independent errors (Edwards et al., 2021). It is more informative when complemented with McDonald’s Omega (ω), which provides a more robust measure of reliability (Cho and Kim, 2015; Trizano-Hermosilla and Alvarado, 2016).

To strengthen future meta-analyses, studies should report additional psychometric fit indices Comparative Fit Index (CFI), Tucker-Lewis Index (TLI), Root Mean Square Error of Approximation (RMSEA), Standardized Root Mean Square Residual (SRMR) and Average Variance Extracted (AVE), which helps evaluate construct validity. Not all studies report fit indices, limiting meta-analytic insights into the reliability of psychometric tools.

Second, heterogeneity analysis is essential in meta-analysis (Higgins and Thompson, 2002). Bias in reliability estimation may stem from differences in sample characteristics, psychometric tools, or sociodemographic factors. To enhance accuracy, sensitivity analyses should sequentially exclude studies to determine their impact on heterogeneity (Patsopoulos et al., 2008).

Finally, cultural and demographic variations—across North America, South America, Europe, Asia, and Oceania—must be considered when interpreting results. Differences in sample size, instrument structure, and sociocultural contexts may introduce variability in psychometric estimates. As Cortés et al. (2021) emphasizes, these contextual aspects can influence the validity and reliability of measurement instruments and should therefore be carefully examined in future cross-cultural research.

3.4 Strengths

First, it integrates two of the most widely recognized methodological standards for psychometric evaluation: the Quality Criteria for Measurement Properties proposed by Terwee et al. (2007) and the COSMIN Guidelines for systematic reviews of outcome measurement instruments (Mokkink et al., 2024). The joint application of these frameworks ensures a transparent and reproducible process for assessing measurement quality and minimizes risk of bias in the synthesis of psychometric evidence.

Second, the meta-analysis applies a restricted maximum likelihood random-effects model, which strengthens the precision and generalizability of pooled reliability estimates. Third, the inclusion and evaluation of multiple model fit indices (CFI, TLI, RMSEA, SRMR) expand the evidence base beyond internal consistency, offering a comprehensive perspective on structural and construct validity across diverse samples.

Finally, although cultural variability was identified as a potential source of heterogeneity, the broad international coverage of the included studies -spanning North America, South America, Europe, Asia, and Oceania- also represents a major strength. This diversity enhances the external validity of the findings and underscores the cross-cultural relevance of psychometric research on alcohol-related instruments.

3.5 Conclusion

This systematic review and meta-analysis demonstrate that all included studies reported Cronbach’s alpha values above 0.70, with an overall mean Cronbach’s alpha of 0.88. The psychometric properties of the instruments used to assess alcohol consumption among adolescents and young adults are adequate and reliable.

The Alcohol Use Disorders Identification Test (AUDIT) was the most frequently used and demonstrated strong reliability (α > 0.70 in all cases). The Brief Young Adult Alcohol Consequences Questionnaire (BYAACQ) also exhibited high internal consistency (α > 0.80 in all studies). Together, these two instruments emerge as the most psychometrically robust options for assessing alcohol-related behaviors in this population.

According to the criteria (Terwee and COSMIN), most instruments achieved a “positive” or “adequate” rating across key domains of internal consistency, structural validity, and content validity. The AUDIT and BYAACQ demonstrated the highest overall methodological quality and lowest risk of bias. Instruments such as CLASS, PRQ, and ACQ-SF-R were rated as “intermediate” mainly due to incomplete reporting of model fit indices or limited cross-cultural validation evidence.

From a practical and clinical perspective, the AUDIT and BYAACQ stand out as reliable, cost-effective, and easy-to-administer tools for detecting alcohol risk patterns and consequences in adolescents and young adults. Their psychometric robustness supports their application not only in research but also in school-based screenings, primary care settings, and community prevention programs, where early detection of risky consumption can improve timely intervention and reduce long-term harm.

The incorporation of standardized methodological frameworks, such as the Terwee criteria and COSMIN checklist, strengthens the interpretive validity of these conclusions and minimizes potential bias in synthesizing psychometric evidence. This approach establishes a benchmark for future studies aiming to evaluate and compare alcohol-related measurement tools using reproducible and transparent quality standards.

Finally, integrating these tools into clinical practice and public health initiatives can enhance the identification of hazardous drinking behaviors, promote individualized interventions, and improve the monitoring of preventive and therapeutic outcomes. Future research should prioritize cross-cultural adaptation and longitudinal validation of these instruments, in alignment with COSMIN guidelines, to ensure their reliability, validity, and responsiveness across diverse sociocultural contexts.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

HS-S: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Software, Writing – original draft, Writing – review & editing. AR: Data curation, Formal analysis, Software, Writing – review & editing. LB-B: Formal analysis, Methodology, Visualization, Writing – review & editing. FR-D: Conceptualization, Methodology, Supervision, Visualization, Writing – review & editing. FH: Software, Supervision, Writing – review & editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. Acknowledgment to the Pontifical Catholic University of Ecuador, Esmeraldas, for hosting the research. The authors would like to express their gratitude to the Universidad Politécnica Salesiana, Cuenca, Ecuador, and especially to Dr. Juan Cárdenas Tapia, sdb, PhD, for funding the publication of this research.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that Generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2025.1591800/full#supplementary-material

References

Bravo, A. J., Pilatti, A., Pearson, M. R., Read, J. P., Mezquita, L., Ibáñez, M. I., et al. (2019). Cross-cultural examination of negative alcohol-related consequences: measurement invariance of the young adult alcohol consequences questionnaire in Spain, Argentina, and USA. Psychol. Assess. 31, 631–642. doi: 10.1037/pas0000689,

PubMed Abstract | Crossref Full Text | Google Scholar

Callinan, S., Livingston, M., Dietze, P., Gmel, G., and Room, R. (2022). Age-based differences in quantity and frequency of consumption when screening for harmful alcohol use. Addiction 117, 2431–2437. doi: 10.1111/add.15904,

PubMed Abstract | Crossref Full Text | Google Scholar

Campbell, K. W., Pebley, K., MacKillop, J., and Murphy, J. G. (2022). Measurement invariance of the young adult alcohol consequences questionnaire across college status, race, and childhood SES in a diverse community sample. Psychol. Addict. Behav. 36, 824–836. doi: 10.1037/adb0000789,

PubMed Abstract | Crossref Full Text | Google Scholar

Cho, E., and Kim, S. (2015). Cronbach’s coefficient alpha: well known but poorly understood. Organ. Res. Methods 18, 207–230. doi: 10.1177/1094428114555994

Crossref Full Text | Google Scholar

Cortés, M. L., Morales-Quintero, L. A., Rojas, J. L., Moral, M. V., Flores, M., and Rodríguez-Díaz, F. J. (2021). Alcohol consumption patterns and perception of risk in Mexican students. Rev. Iberoam. Psicol. Salud 12, 17–33. doi: 10.23923/j.rips.2021.01.042

Crossref Full Text | Google Scholar

Cortés, M. T., Giménez-Costa, J. A., Motos-Sellés, P., and Sancerni-Beitia, M. D. (2017). Revision of AUDIT consumption items to improve the screening of youth binge drinking. Front. Psychol. 8:910. doi: 10.3389/fpsyg.2017.00910,

PubMed Abstract | Crossref Full Text | Google Scholar

Cox, M., Chaney, B., McDonald, L., and Beth Miller, M. (2022). Assessing alcohol use in situ: correlates of self-report vs. objective alcohol consumption. Addict. Behav. 129:107278. doi: 10.1016/j.addbeh.2022.107278,

PubMed Abstract | Crossref Full Text | Google Scholar

Davis, C. N., Piasecki, T. M., Bartholow, B. D., and Slutske, W. S. (2021). Effects of alcohol sensitivity on alcohol-induced blackouts and passing out: an examination of the alcohol sensitivity questionnaire among underage drinkers. Alcohol. Clin. Exp. Res. 45, 1149–1160. doi: 10.1111/acer.14607,

PubMed Abstract | Crossref Full Text | Google Scholar

Delaney, D. J., Bernstein, M. H., Harlow, L. L., Farrow, M., Martin, R. A., and Stein, L. A. R. (2020). The brief situational confidence questionnaire for alcohol: a psychometric assessment with incarcerated youth. Psychol. Assess. 32, 254–264. doi: 10.1037/pas0000780,

PubMed Abstract | Crossref Full Text | Google Scholar

Delawalla, C. N., Lee, T. T. C., and Keen, M. A. (2023). The use of MMPI-3 scales to assess personality-based vulnerabilities for alcohol use and problems. Psychol. Assess. 35, 633–645. doi: 10.1037/pas0001245,

PubMed Abstract | Crossref Full Text | Google Scholar

Dermody, S. S., Uhrig, A., Moore, A., Raessi, T., and Abramovich, A. (2023). A narrative systematic review of the gender inclusivity of measures of harmful drinking and their psychometric properties among transgender adults. Addiction 118, 1649–1660. doi: 10.1111/add.16212,

PubMed Abstract | Crossref Full Text | Google Scholar

Doval, E., Viladrich, C., and Angulo-Brunet, A. (2023). Coefficient alpha: the resistance of a classic. Psicothema 1, 5–20. doi: 10.7334/psicothema2022.321,

PubMed Abstract | Crossref Full Text | Google Scholar

Duffy, F. F., Sudom, K., Jones, M., Fear, N. T., Greenberg, N., Adler, A. M., et al. (2023). Calibrating the alcohol use disorders identification test-consumption (AUDIT-C) for detecting alcohol-related problems among Canadian, UK and US soldiers: cross-sectional pre-deployment and post-deployment survey results. BMJ Open 13:e068619. doi: 10.1136/bmjopen-2022-068619,

PubMed Abstract | Crossref Full Text | Google Scholar

Edwards, A. A., Joyner, K. J., and Schatschneider, C. (2021). A simulation study on the performance of different reliability estimation methods. Educ. Psychol. Meas. 81, 1089–1117. doi: 10.1177/0013164421994184,

PubMed Abstract | Crossref Full Text | Google Scholar

Elosua Oliden, P., and Zumbo, B. D. (2008). Coeficientes de fiabilidad para escalas de respuesta categórica ordenada. Psicothema 20, 896–901.

Google Scholar

George, D., and Mallery, P. (2010). SPSS for windows step by step: a simple study guide and reference, 17.0 update, 10/e. Boston: Pearson Education India.

Google Scholar

Gomà-i-Freixanet, M., Ferrero-Rincón, G., and Granero, R. (2023). Assessing alcohol expectations in university students: the APNE scale. Int. J. Ment. Health Addict. 21, 4259–4274. doi: 10.1007/s11469-022-00854-6

Crossref Full Text | Google Scholar

Hadland, S. E., Knight, J. R., and Harris, S. K. (2019). Alcohol use disorder: a pediatric-onset condition needing early detection and intervention. Pediatrics 143:e20183654. doi: 10.1542/peds.2018-3654,

PubMed Abstract | Crossref Full Text | Google Scholar

Hakstian, A. R., and Whalen, T. E. (1976). A K-sample significance test for independent alpha coefficients. Psychometrika 41, 219–231. doi: 10.1007/BF02291840

Crossref Full Text | Google Scholar

Hallit, J., Salameh, P., Haddad, C., Sacre, H., Soufia, M., Akel, M., et al. (2020). Validation of the AUDIT scale and factors associated with alcohol use disorder in adolescents: results of a National Lebanese Study. BMC Pediatr. 20:205. doi: 10.1186/s12887-020-02116-7,

PubMed Abstract | Crossref Full Text | Google Scholar

Harrer, M., Cuijpers, P., Furukawa, T., and Ebert, D. (2021). Doing meta-analysis with R: A hands-on guide : Chapman and Hall/CRC. doi: 10.1201/9781003107347

Crossref Full Text | Google Scholar

Hawn, S. E., Aggen, S. H., Cusack, S. E., Spit for Science Working GroupDick, D., and Amstadter, A. B. (2020). Examination of a novel measure of trauma-related drinking to cope. J. Clin. Psychol. 76, 1938–1964. doi: 10.1002/jclp.22972,

PubMed Abstract | Crossref Full Text | Google Scholar

Hayes, A. F., and Coutts, J. J. (2020). Use omega rather than Cronbach’s alpha for estimating reliability. But…. Commun. Methods Meas. 14, 1–24. doi: 10.1080/19312458.2020.1718629

Crossref Full Text | Google Scholar

Higgins, J. P., and Thompson, S. G. (2002). Quantifying heterogeneity in a meta-analysis. Stat. Med. 21, 1539–1558. doi: 10.1002/sim.1186,

PubMed Abstract | Crossref Full Text | Google Scholar

Hoogland, J. J., and Boomsma, A. (1998). Robustness studies in covariance structure modeling: an overview and a meta-analysis. Sociol. Methods Res. 26, 329–367. doi: 10.1177/0049124198026003003

Crossref Full Text | Google Scholar

Hummer, J. F., Tucker, J. S., Rodriguez, A., Davis, J. P., and D'Amico, E. J. (2022). A longitudinal study of alcohol and Cannabis use in young adulthood: exploring racial and ethnic differences in the effects of peer and parental influences from middle adolescence. J. Stud. Alcohol Drugs 83, 684–694. doi: 10.15288/jsad.21-00050,

PubMed Abstract | Crossref Full Text | Google Scholar

Jordan, H. R., Colvin, K. F., Kim, K. Y. E., Martin, J. L., and Madson, M. B. (2021). Psychometric validation of the protective drinking practices scale in college students across the United States. Exp. Clin. Psychopharmacol. 29, 251–260. doi: 10.1037/pha0000471,

PubMed Abstract | Crossref Full Text | Google Scholar

Kahler, C. W., Strong, D. R., and Read, J. P. (2005). Toward efficient and comprehensive measurement of the alcohol problems continuum in college students: the brief young adult alcohol consequences questionnaire. Alcoholism 29, 1180–1189. doi: 10.1097/01.alc.0000171940.95813.a,

PubMed Abstract | Crossref Full Text | Google Scholar

Kalkbrenner, M. T. (2024). Choosing between Cronbach’s coefficient alpha, McDonald’s coefficient omega, and coefficient H: confidence intervals and the advantages and drawbacks of interpretive guidelines. Meas. Eval. Couns. Dev. 57, 93–105. doi: 10.1080/07481756.2023.2283637

Crossref Full Text | Google Scholar

Klimkiewicz, A., Jakubczyk, A., Mach, A., Abramowska, M., Szczypiński, J., Berent, D., et al. (2021). Psychometric properties of the polish version of the alcohol use disorders identification test (AUDIT). Drug Alcohol Depend. 218:108427. doi: 10.1016/j.drugalcdep.2020.108427,

PubMed Abstract | Crossref Full Text | Google Scholar

Kuntsche, E., Kuendig, H., and Gmel, G. (2008). Alcohol outlet density, perceived availability and adolescent alcohol use: a multilevel structural equation model. J. Epidemiol. Community Health 62, 811–816. doi: 10.1136/jech.2007.065367,

PubMed Abstract | Crossref Full Text | Google Scholar

Leary, A. V., Dvorak, R. D., Burr, E. K., Peterson, R., De Leon, A. N., Klaver, S. J., et al. (2023). Development and validation of the personal assessment of responsible drinking identity (PARDI) with a college student sample. Psychol. Assess. 35, 618–632. doi: 10.1037/pas0001236,

PubMed Abstract | Crossref Full Text | Google Scholar

Lewis, M. A., Litt, D. M., King, K. M., Fairlie, A. M., Waldron, K. A., Garcia, T. A., et al. (2020). Examining the ecological validity of the prototype willingness model for adolescent and young adult alcohol use. Psychol. Addict. Behav. 34, 293–302. doi: 10.1037/adb0000533,

PubMed Abstract | Crossref Full Text | Google Scholar

López, V., Paladines, B., Vaca, S., Cacho, R., Fernández-Montalvo, J., and Ruisoto, P. (2019). Psychometric properties and factor structure of an Ecuadorian version of the alcohol use disorders identification test (AUDIT) in college students. PLoS One 14:e0219618. doi: 10.1371/journal.pone.0219618,

PubMed Abstract | Crossref Full Text | Google Scholar

Lui, P. P. (2019). College alcohol beliefs: measurement invariance, mean differences, and correlations with alcohol use outcomes across sociodemographic groups. J. Couns. Psychol. 66, 487–495. doi: 10.1037/cou0000338,

PubMed Abstract | Crossref Full Text | Google Scholar

Lui, P. P., Berkley, S. R., and Zamboanga, B. L. (2020). College alcohol belief and alcohol use: testing moderations by cultural orientations and ethnicity. J. Couns. Psychol. 67, 184–194. doi: 10.1037/cou0000374,

PubMed Abstract | Crossref Full Text | Google Scholar

Martin, J. L., Colvin, K. F., Madson, M. B., Zamboanga, B. L., and Pazienza, R. (2020). Optimal assessment of protective behavioral strategies among college drinkers: an item response theory analysis. Psychol. Assess. 32, 394–406. doi: 10.1037/pas0000799,

PubMed Abstract | Crossref Full Text | Google Scholar

McKay, M. T., Perry, J. L., Cole, J. C., Percy, A., and Sumnall, H. R. (2021). One rule for one, and a different rule for another: the case of the parental rules about alcohol questionnaire. Drug Alcohol Depend. 225:108824. doi: 10.1016/j.drugalcdep.2021.108824,

PubMed Abstract | Crossref Full Text | Google Scholar

Miller, M. B., DiBello, A. M., Merrill, J. E., and Carey, K. B. (2019). Development and initial validation of the alcohol-induced blackout measure. Addict. Behav. 99:106079. doi: 10.1016/j.addbeh.2019.106079,

PubMed Abstract | Crossref Full Text | Google Scholar

Mokkink, L. B., Elsman, E. B. M., and Terwee, C. B. (2024). The COSMIN guideline for systematic reviews of patient-reported outcome measures (PROMs). Qual. Life Res. 33, 2929–2939. doi: 10.1007/s11136-024-03761-6,

PubMed Abstract | Crossref Full Text | Google Scholar

Morales Quintero, L. A., Moral Jiménez, M. V., Rojas Solís, J. L., Bringas Molleda, C., Soto Chilaca, A., and Rodríguez Díaz, F. J. (2019). Psychometric properties of the alcohol use disorders identification test (AUDIT) in adolescents and young adults from southern Mexico. Alcohol 81, 39–46. doi: 10.1016/j.alcohol.2019.05.002,

PubMed Abstract | Crossref Full Text | Google Scholar

Motos Sellés, P., Cortés Tomás, M. T., and Giménez Costa, J. A. (2021). Diagnostic utility of new short versions of AUDIT to detect binge drinking in undergraduate students. Clín. Salud 32, 49–54. doi: 10.5093/clysa2020a23

Crossref Full Text | Google Scholar

O’Neill, S. E., Sher, K. J., and Bartholow, B. D. (2002). Alcohol susceptibility and tolerance in young adults. Alcoholism 26:119A.

Google Scholar

O'Brien, H., Callinan, S., Livingston, M., Doyle, J. S., and Dietze, P. M. (2020). Population patterns in alcohol use disorders identification test (AUDIT) scores in the Australian population; 2007-2016. Aust. N. Z. J. Public Health 44, 462–467. doi: 10.1111/1753-6405.13043,

PubMed Abstract | Crossref Full Text | Google Scholar

Ohtani, Y., Ueno, F., Kimura, M., Matsushita, S., Mimura, M., and Uchida, H. (2023). Highly endorsed screening and assessment scales for alcohol problems: a systematic review. Neuropsychopharmacol. Rep. 43, 470–481. doi: 10.1002/npr2.12363,

PubMed Abstract | Crossref Full Text | Google Scholar

Osberg, T. M., Atkins, L., Buchholz, L., Shirshova, V., Swiantek, A., Whitley, J., et al. (2010). Development and validation of the college life alcohol salience scale: a measure of beliefs about the role of alcohol in college life. Psychol. Addict. Behav. 24, 1–12. doi: 10.1037/a0018197,

PubMed Abstract | Crossref Full Text | Google Scholar

Ouzzani, M., Hammady, H., Fedorowicz, Z., and Elmagarmid, A. (2016). Rayyan-a web and mobile app for systematic reviews. Syst. Rev. 5:210. doi: 10.1186/s13643-016-0384-4,

PubMed Abstract | Crossref Full Text | Google Scholar

Oviedo, H. C., and Campo-Arias, A. (2005). Aproximación al uso del coeficiente alfa de Cronbach. Rev. Colomb. Psiquiatr. 34, 572–580.

Google Scholar

Page, M. J., McKenzie, J. E., Bossuyt, P. M., Boutron, I., Hoffmann, T. C., Mulrow, C. D., et al. (2021). The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ 372:n71. doi: 10.1136/bmj.n71,

PubMed Abstract | Crossref Full Text | Google Scholar

Patsopoulos, N. A., Evangelou, E., and Ioannidis, J. P. (2008). Sensitivity of between-study heterogeneity in meta-analysis: proposed metrics and empirical evaluation. Int. J. Epidemiol. 37, 1148–1157. doi: 10.1093/ije/dyn065,

PubMed Abstract | Crossref Full Text | Google Scholar

Prince, M. A., Pearson, M. R., Bravo, A. J., and Montes, K. S. (2018). A quantification of the alcohol use-consequences association in college student and clinical populations: a large, multi-sample study. Am. J. Addict. 27, 116–123. doi: 10.1111/ajad.12686,

PubMed Abstract | Crossref Full Text | Google Scholar

Ramírez, A., Burgos-Benavides, L., Sinchi, H., Quito-Calle, J. V., Herrero Díez, F., and Rodríguez-Díaz, F. J. (2025). Adaptation and validation of psychological assessment questionnaires using confirmatory factor analysis: a tutorial for planning and reporting analysis. Preprints. doi: 10.20944/preprints202502.1192.v1

Crossref Full Text | Google Scholar

Rodrigues, R., López-Caneda, E., Almeida-Antunes, N., Sampaio, A., and Crego, A. (2021). Portuguese validation of the alcohol craving questionnaire-short form-revised. PLoS One 16:e0251733. doi: 10.1371/journal.pone.0251733,

PubMed Abstract | Crossref Full Text | Google Scholar

Sánchez-García, M., Lozano-Rojas, Ó. M., Díaz-Batanero, C., Carmona-Márquez, J., Rojas-Tejada, A. J., and Fernández-Calderón, F. (2020). Spanish adaptation of the protective behavioral strategies Scale-20 (S-PBSS-20) and evaluation of its psychometric properties in university students. Psicothema 32, 598–606. doi: 10.7334/psicothema2020.172,

PubMed Abstract | Crossref Full Text | Google Scholar

Saunders, J. B., Aasland, O. G., Babor, T. F., de la Fuente, J. R., and Grant, M. (1993). Development of the alcohol use disorders identification test (AUDIT): WHO collaborative project on early detection of persons with harmful alcohol consumption--II. Addiction 88, 791–804. doi: 10.1111/j.1360-0443.1993.tb02093.x,

PubMed Abstract | Crossref Full Text | Google Scholar

Seay, K. D., and Feely, M. (2020). Assessment of the validity of the AUDIT factor structure in parents involved with child protective services. Am. J. Drug Alcohol Abuse 46, 546–552. doi: 10.1080/00952990.2020.1722685,

PubMed Abstract | Crossref Full Text | Google Scholar

Singleton, E. G., Henningfield, J. E., and Tiffany, S. (1994). Alcohol craving questionnaire: ACQ-Now: background and administration manual. Baltimore: NIDA Addiction Research Centre.

Google Scholar

Smit, K., Voogt, C., Otten, R., Kleinjan, M., and Kuntsche, E. (2020). Alcohol expectancies change in early to middle adolescence as a function of the exposure to parental alcohol use. Drug Alcohol Depend. 211:107938. doi: 10.1016/j.drugalcdep.2020.107938,

PubMed Abstract | Crossref Full Text | Google Scholar

Stamates, A. L., Yang, M., and Lau-Barraco, C. (2023). Validation of the brief young adult alcohol consequences questionnaire among student and nonstudent young adults. Exp. Clin. Psychopharmacol. 31, 643–651. doi: 10.1037/pha0000615,

PubMed Abstract | Crossref Full Text | Google Scholar

Stephenson, M., Riitano, D., Wilson, S., Leonardi-Bee, J., Mabire, C., Cooper, K., et al. (2020). “Chapter 12: systematic reviews of measurement properties” in JBI manual for evidence synthesis. eds. E. Aromataris and Z. Munn (Adelaida, Australia: JBI).

Google Scholar

Streiner, D. L. (2003). Starting at the beginning: an introduction to coefficient alpha and internal consistency. J. Pers. Assess. 80, 99–103. doi: 10.1207/S15327752JPA8001_18,

PubMed Abstract | Crossref Full Text | Google Scholar

Terwee, C. B., Bot, S. D., de Boer, M. R., van der Windt, D. A., Knol, D. L., Dekker, J., et al. (2007). Quality criteria were proposed for measurement properties of health status questionnaires. J. Clin. Epidemiol. 60, 34–42. doi: 10.1016/j.jclinepi.2006.03.012,

PubMed Abstract | Crossref Full Text | Google Scholar

The Jamovi Project (2023). jamovi. (Version 2.4) [Computer Software]. Available online at: https://www.jamovi.org (Accessed April 07, 2024).

Google Scholar

Toner, P., Böhnke, J. R., Andersen, P., and McCambridge, J. (2019). Alcohol screening and assessment measures for young people: a systematic review and meta-analysis of validation studies. Drug Alcohol Depend. 202, 39–49. doi: 10.1016/j.drugalcdep.2019.01.030,

PubMed Abstract | Crossref Full Text | Google Scholar

Trager, B. M., Koning, I. M., and Turrisi, R. (2021). Exploratory and confirmatory factor analysis of the parental rules toward adolescent drinking questionnaire: two factors are better than the original one. Addict. Behav. 117:106855. doi: 10.1016/j.addbeh.2021.106855,

PubMed Abstract | Crossref Full Text | Google Scholar

Trizano-Hermosilla, I., and Alvarado, J. M. (2016). Best alternatives to Cronbach's alpha reliability in realistic conditions: congeneric and asymmetrical measurements. Front. Psychol. 7:769. doi: 10.3389/fpsyg.2016.00769,

PubMed Abstract | Crossref Full Text | Google Scholar

Tucker, J. A., Bacon, J. P., Chandler, S. D., Lindstrom, K., and Cheong, J. (2020). Utility of digital respondent driven sampling to recruit community-dwelling emerging adults for assessment of drinking and related risks. Addict. Behav. 110:106536. doi: 10.1016/j.addbeh.2020.106536,

PubMed Abstract | Crossref Full Text | Google Scholar

Van der Vorst, H., Engels, R. C., Meeus, W., and Deković, M. (2006). The impact of alcohol-specific rules, parental norms about early drinking and parental alcohol use on adolescents drinking behavior. J. Child Psychol. Psychiatry 47, 1299–1306. doi: 10.1111/j.1469-7610.2006.01680.x,

PubMed Abstract | Crossref Full Text | Google Scholar

Villarosa-Hurlocker, M. C., Schutts, J. W., Madson, M. B., Jordan, H. R., Whitley, R. B., and Mohn, R. C. (2020). Screening for alcohol use disorders in college student drinkers with the AUDIT and the USAUDIT: a receiver operating characteristic curve analysis. Am. J. Drug Alcohol Abuse 46, 531–545. doi: 10.1080/00952990.2020.1712410,

PubMed Abstract | Crossref Full Text | Google Scholar

Van der Vorst, H., Engels, R. C., Meeus, W., Deković, M., and Van Leeuwe, J. (2005). The role of alcohol-specific socialization in adolescents’ drinking behaviour. Addiction (Abingdon, England), 100, 1464–1476. doi: 10.1111/j.1360-0443.2005.01193.x

Crossref Full Text | Google Scholar

Voss, A. T., Floyd, R. G., Campbell, K. W., Dennhardt, A. A., MacKillop, J., and Murphy, J. G. (2021). Psychometric evaluation of the reward probability index in emerging adult drinkers. Psychol. Addict. Behav. 35, 432–443. doi: 10.1037/adb0000712,

PubMed Abstract | Crossref Full Text | Google Scholar

Watterson, J., Gabbe, B., Dietze, P., Bowring, A., and Rosenfeld, J. V. (2019). Comparing short versions of the alcohol use disorders identification test (AUDIT) in a military cohor. J. R. Army Med. Corps 165, 312–316. doi: 10.1136/jramc-2018-001024,

PubMed Abstract | Crossref Full Text | Google Scholar

World Health Organization 2021 Global alcohol action plan 2022–2030 to strengthen implementation of the Global Strategy to Reduce the Harmful Use of Alcohol. Available online at: https://www.who.int/publications/m/item/global-alcohol-action-plan-second-draft-unedited (Accessed June 10, 2024).

Google Scholar

World Health Organization 2023 World health statistics: monitoring health for the SDGs, Sustainable Development Goals. Available online at: https://www.who.int/publications/i/item/9789240074323 (Accessed June 10, 2024).

Google Scholar

Zamboanga, B. L., Audley, S., Olthuis, J. V., Blumenthal, H., Tomaso, C. C., Bui, N., et al. (2019). Validation of a seven-factor structure for the motives for playing drinking games measure. Assessment 26, 582–603. doi: 10.1177/1073191117701191,

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, M. X., Pesigan, I. J. A., Kahler, C. W., Yip, M. C. W., Yu, S., and Wu, A. M. S. (2019). Psychometric properties of a Chinese version of the brief young adult alcohol consequences questionnaire (B-YAACQ). Addict. Behav. 90, 389–394. doi: 10.1016/j.addbeh.2018.11.045,

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: alcohol, adolescents, psychometric, meta-analysis, young adult

Citation: Sinchi-Sinchi H, Ramírez A, Burgos-Benavides L, Rodríguez-Díaz FJ and Herrero Díez FJ (2026) Evaluating alcohol consumption in adolescents and young adults: a meta-analysis of the psychometric properties of measurement instruments. Front. Psychol. 16:1591800. doi: 10.3389/fpsyg.2025.1591800

Received: 21 March 2025; Revised: 06 November 2025; Accepted: 15 December 2025;
Published: 09 January 2026.

Edited by:

Gudberg K. Jonsson, University of Iceland, Iceland

Reviewed by:

Sonia Lorente, Autonomous University of Barcelona, Spain
María Luisa García-Gomar, Autonomous University of Baja California, Mexico

Copyright © 2026 Sinchi-Sinchi, Ramírez, Burgos-Benavides, Rodríguez-Díaz and Herrero Díez. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Hugo Sinchi-Sinchi, aGZzaW5jaGlAcHVjZXNlLmVkdS5lYw==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.