Your new experience awaits. Try the new design now and help us make it even better

SYSTEMATIC REVIEW article

Front. Psychol., 23 January 2026

Sec. Educational Psychology

Volume 16 - 2025 | https://doi.org/10.3389/fpsyg.2025.1675957

A reliability generalization meta-analysis of self-report measures of statistics anxiety

  • 1Department of Educational Sciences, Gaziantep University, Gaziantep, Türkiye
  • 2Ministry of National Education, Gaziantep, Türkiye

Objective and Method: In this study, it was aimed to obtain a general reliability coefficient for each of the statistics anxiety scales. For this purpose, Web of Science, ERIC and Scopus databases and Google Scholar search engine were searched according to certain criteria and 84 Cronbach’s alpha coefficients reported for the whole scale were reached. Reliability generalization meta-analysis method was applied to the obtained alpha coefficients within the scope of meta-analysis, which is a quantitative method.

Results and Conclusion: The mean value of the alpha coefficients for which the transformation formula was applied was found to be.927 under the random effects model, and the findings were statistically significant. In addition, the mean alpha value of each statistics anxiety scale was.931 for STARS,.917 for SAS,.918 for SAS-10 and.951 for WAESTA. Analog to the ANOVA and meta-regression analyses were conducted to reveal the heterogeneity of alpha coefficients in the overall analysis. Analog to ANOVA was applied for five different categorical variables, and according to the findings, it was observed that the mean alpha value differed statistically significantly depending on the scale type variable. Moreover, it was found that the mean scale score and the standard deviation of the mean scale score were statistically significant predictors of the mean alpha value.

1 Introduction

Statistics is a branch of science that covers the processes of making inferences from numerical data related to the organization, analysis or interpretation of data collected in daily life or school life. Statistics, which is widely used, has become a branch of science utilized in many different fields such as engineering, health sciences, social sciences and natural sciences (Akdeniz, 2015). An individual who will conduct a scientific study is expected to have some competencies in order to collect data, to bring these data together in a certain order and to interpret them correctly (Erkuş, 2011). In order to have these competencies, statistics education has become a necessity for individuals receiving undergraduate and graduate education (Onwuegbuzie, 1993).

The fact that statistics course is taught as a compulsory course to undergraduate and graduate students has brought along the need to examine the factors of success or failure in this course. Success in statistics course is affected by affective factors as well as cognitive factors (Yaşar, 2014). Feedback about the course (Williams, 2012), difficulties encountered in the course (Feinberg and Halperin, 1978), and past experiences associated with the mathematics course (Maysick, 1985) are among the cognitive factors affecting success in statistics course. The factors affecting the success of this course, such as the past academic experiences of each student taking the course, negative attitudes towards the teacher teaching the course, and the perception of interpreting numerical data paved the way for the formation of concerns about the course (Baloğlu, 2017). As a result of the formation of such concerns about the statistics course in individuals, the concept of “statistics anxiety” was added to the literature. Onwuegbuzie (1993) defines statistics anxiety as the state of anxiety that occurs when an individual encounters statistical processes such as organizing, analyzing or interpreting data at any time.

Since the ability to interpret numerical data in statistics is similar to the content of the mathematics course, the first studies on statistics anxiety tried to explain it with mathematics anxiety (Schacht and Stewart, 1990). There are also studies in which the relationship between statistical course success and situational anxiety was tried to be explained (Bendig and Hughes, 1954; Fisch, 1971). In this case, in the most general sense, statistical anxiety is examined under three subcomponents: situational causes, environmental causes and characterological causes (Onwuegbuzie, 1993). Situational causes of statistics anxiety include the role of the instructor teaching the statistics course (O’Bryant et al., 2021; Tonsing, 2018), pedagogical behaviors (Asare, 2023), course feedback (Williams, 2012), course content and pace of the course (Pan and Tang, 2004). Environmental causes of statistics anxiety are classified as the age (Bui and Alfaro, 2011), gender (Edirisooriya and Lipscomb, 2021), and past experiences with mathematics (Maysick, 1985). Under the characterological causes of statistical anxiety, individual characteristics such as academic motivation, learning styles (Kesici et al., 2011), self-perception (Esnard et al., 2021), self-identity (Najmi et al., 2018), and academic resilience (Ali and Gaber, 2022) stand out.

1.1 Statistics anxiety scales

Statistics anxiety scales have been developed by scientists in order to determine the level of this anxiety and to take measures for the result. Among these scales, STARS developed by Cruise et al. (1985) and SAS developed by Vigil-Colet et al. (2008) are the most widely used statistics anxiety scales worldwide. In addition to these scales, the SAS-10 scale, which was revised by Pretorius and Norman (1992) from the Mathematics Anxiety Scale developed by Betz (1978) to a statistics anxiety scale, and the WAESTA scales developed by Faber et al. (2018) are also widely used statistics anxiety scales. These scales have been adapted to many cultures and are widely used. There are also the SAM scale developed by Earp (2007) and the SAI scale developed by Zeidner (1991), which are not widely used.

1.2 Statistics anxiety rating scale (STARS)

STARS, developed by Cruise et al. (1985) on a five-point Likert scale, is the most widely used statistics anxiety scale. As a result of the factor analysis conducted, the STARS scale consisting of 51 items and 6 subscales was obtained. Worth of Statistics subscale consists of 16 items, Interpretation Anxiety subscale consists of 11 items, and Test and Class Anxiety subscale consists of 8 items. In addition, Computation Self-Concept subscale consists of 7 items, Fear of Asking for Help subscale consists of 4 items, and finally Fear of Statistics Teacher subscale consists of 5 items. When examining the subscales of STARS, it is seen that the first three subscales (WS, IA, TCA) are related to statistical anxiety, while the other three subscales (CSC, FAH, FST) are related to statistical competence. The internal consistency coefficient for the reliability of STARS for the sample of 537 participants was reported as 0.96 and the test–retest reliability coefficient was reported as 0.76, and the internal consistency coefficients of the subscales were 0.94, 0.87, 0.68, 0.88, 0.89 and 0.80, respectively.

1.3 Statistical anxiety scale (SAS)

In the SAS developed by Vigil-Colet et al. (2008), it was thought that the subscales of STARS other than statistics anxiety (fear of statistics teacher, self-perception, value of statistics) were not suitable for the purpose of measuring statistics anxiety. It was thought to be more useful since it has fewer items than STARS. Exploratory factor analysis was applied to ensure the validity of the SAS, which is a five-point Likert-type scale consisting of 24 items, and as a result of the analysis, it was decided that the scale had three subscales. Each of these subscales consists of 8 items. The factor loadings of the items in the first subscale vary between 0.88 and 0.57, the factor loadings of the items in the second subscale vary between 0.92 and 0.50, and the factor loadings of the items in the third subscale vary between 0.89 and 0.34. These subscales were named as Examination Anxiety, Asking for Help Anxiety and Interpretation Anxiety. The alpha coefficient for the sample of 159 undergraduate students for whom the SAS was developed was reported as 0.911. In addition, the alpha coefficients of the subscales of the SAS were determined as 0.87, 0.92 and 0.82, respectively.

1.4 WAESTA scale

The WAESTA scale developed by Faber et al. (2018) consists of 17 items. The aim was to create a measurement tool that is simpler and free from conceptual limitations than the scales previously developed in this field. In order to gather validity evidence based on internal structure of the scale, the factor loadings of each item were calculated using principal component analysis and it was seen that the scale consisted of three subscales. These subscales consist of the Worry subscale, Avoidance subscale, and Emotional subscale. Among these subscales, the factor loadings of the items belonging to the Worry subscale (consisting of eight items) ranged between 0.49 and 0.74, and the factor loadings of the items belonging to the Avoidance subscale (consisting of four items) ranged between 0.49 and 0.76. Finally, the factor loadings of the items belonging to the subscale named Emotional cognition (consisting of five items) vary between 0.53 and 0.70. Of the three subscales, The internal consistency coefficient for the sample in which the measurement tool was developed was reported as 0.94 and the split-half reliability coefficient was reported as 0.91.

1.5 The statistics anxiety scale (SAS-10)

The SAS-10 was created by revising the 10-item Mathematics Anxiety Scale (MAS) developed by Betz (1978) by replacing mathematics terms with statistical terms (Pretorius and Norman, 1992). The internal consistency coefficient for the sample in which the unidimensional SAS-10 scale was developed was reported as 0.94 and the split-half reliability were reported as 0.91. The relationship between the SAS-10 and the STAI (State–Trait Anxiety Inventory; Spielberger et al., 1970), whose reliability and validity have been previously proven, was examined and a statistically significant relationship was found between them.

In addition to the developed statistics anxiety scales, there are many statistics anxiety scales adapted to different cultures in the literature. The number of items, the name of the scale and the number of subscales of the adapted scales vary according to the adapted culture. The widespread use of statistics anxiety scales around the world will bring about the differentiation of the conditions of application of the scale. The reliability of a scale is the degree to which it is free from random errors that may occur during its application (Büyüköztürk et al., 2012). The language in which the measurement tool is applied, the length of the test, the objectivity of scoring, the factors related to the instructions of the scale and the factors related to the application conditions will affect the reliability of the sample. In this case, fluctuations in the reliability values in the samples to which the scale is applied are expected. The situation of fluctuations in the reliability coefficient has brought about the need to investigate this variability in a systematic way. The most accurate way to estimate the reliability coefficient for a scale is to systematically combine the reliability coefficients of the studies in which the scale is used (Şen and Yıldırım, 2023). This method is called “reliability generalization” in the literature. The definition of reliability generalization was first used by Vacha-Haase (1998). According to Vacha-Haase (1998), reliability generalization meta-analysis provides important evidence about the amount and source of variation in reliability value. It provides guidance on whether the scale will be appropriate for the sample to which it will be applied (Taylor, 2012). It also provides important evidence for researchers in interpreting data and comparing results (Leech et al., 2011).

Looking at the literature, it was seen that in addition to the studies examining the level of statistical anxiety, the predictors of statistical anxiety were also addressed (Faber and Drexler, 2019; O'Bryant, 2017; Valle et al., 2021; Zhang et al., 2021). There are also studies investigating the relationship of statistics anxiety with affective factors such as perception, attitude, and self-identity (Altun et al., 2021; Kesici et al., 2011; Mji, 2009; Perepiczka et al., 2011; Sesé et al., 2015). Moreover, studies examining the effect of statistics anxiety on variables such as gender, age, and academic performance were also found (Eshet et al., 2022; Hsiao and Chiang, 2011; Sandoz et al., 2017; Valle et al., 2021; Wu et al., 2022). Apart from the studies investigating the variables associated with statistical anxiety, studies in which reliability generalization meta-analysis was conducted were also examined. It was examined whether the sample type (Özdemir et al., 2020; Shields and Caruso, 2003; Wallace and Wheeler, 2002), the type of publication of the study (Ock et al., 2021), the language in which the measurement tool was applied (Kıyıcı and Kahraman, 2022), and the length of the test (Fitzgerald, 1996), which are thought to affect the mean reliability value obtained, affect the mean reliability value. In addition, reliability generalization meta-analysis studies in which continuous variables such as the year the study was published, the mean score of the scale and the mean age of the sample were investigated as predictors of the mean reliability value are also available in the literature (Sen, 2022; Wallace and Wheeler, 2002). There is no study examining the reliability generalization meta-analysis method on statistics anxiety. Increasing the number of such studies is extremely important in terms of identifying previously examined studies in the literature on reliability generalization meta-analysis and reaching more comprehensive results. It is thought that finding a general reliability value of the statistics anxiety scales to be obtained from the current study will contribute to the relevant literature. Moreover, it is thought that it will guide scientists in future studies. In this direction, while analyzing the studies included in the research, answers to the following questions were sought:

1. What is the reliability induction rate of the studies reached during the research process?

2. What is the mean value of the internal consistency coefficients of each statistics anxiety scale and its sub-dimensions included in the study?

3. What is the mean value of the overall internal consistency coefficients of all statistics anxiety scales included in the study?

4. Do the variables such as the measurement tool, type of publication, continent and education level where the scale is applied, and the language used in the studies included in the research have an effect on the mean value of internal consistency coefficient?

5. Is the mean age of the individuals participating in the study, the ratio of the number of women to the number of men, the mean score of the scale, the standard deviation of the mean score, the year of the study, or the number of items predictive of the mean value of internal consistency coefficients?

2 Method

In the current study, reliability generalization meta-analysis method, which is one of the meta-analysis methods, was used by bringing together the reliability coefficients of the studies in which statistics anxiety scales were used. The way to estimate an accurate reliability coefficient for a specific measurement tool is to combine the reliability coefficients obtained from numerous studies (Şen and Yıldırım, 2023, p. 318). The method of combining the results of numerous studies in a meta-analysis to obtain a single result is also used in combining the reliability coefficients of measurement tools. Reliability generalization is a method of estimating a common reliability coefficient by combining the reliability coefficients obtained from a scale used in various studies (Vacha-Haase, 1998, p. 12). In this context, the reliability coefficients in the studies in which the STARS, SAS, WAESTA, SAS-10 scales were used among the statistics anxiety scales were brought together and the reliability generalization meta-analysis method was used for each scale. Moreover, reliability generalization meta-analysis was also performed for the subscales of multidimensional and widely used statistics anxiety scales (STARS, SAS). Furthermore, pooled reliability coefficient was obtained for the whole of the statistics anxiety scales (STARS, SAS, WAESTA, SAS-10, SAM, SAQ). When the reliability coefficient preferred by the studies included in the analysis was examined, it was seen that the most commonly used reliability coefficient was Cronbach’s alpha coefficient, one of the internal consistency coefficients. For this reason, the analyses were conducted with the alpha coefficient.

2.1 Data sources and search strategies

While reviewing the studies, it was aimed to reach all scales measuring statistics anxiety and all studies in which these scales were used without determining a specific year interval. At this context, Web of Science, Scopus and ERIC databases and Google Scholar search engine were searched, respectively. In the inclusion of the studies in the current study, the criteria of being published in English, using a measurement tool that measures statistics anxiety, and reporting the reliability coefficient of the whole scale or its subscales were taken into consideration. In line with these criteria, databases were searched with some keywords as well as the citations of the statistics anxiety scales identified between May 2023 and July 2023. While ERIC, Web of Science and Scopus were searched with the words “statistical anxiety” and “statistics anxiety,” Google Scholar was searched with the search model “scale OR measure OR questionnaire OR inventory” AND “intitle:statistical anxiety” AND “scale OR measure OR questionnaire OR inventory” AND “intitle:statistics anxiety.” As a result of these searches, a total of 1,598 articles, theses or published papers were reached. Meta-analysis was conducted with 84 alpha coefficients obtained from 81 studies as a result of eliminating 1,444 studies that did not meet the above criteria. Furthermore, 73 studies in which only the reliability coefficient of the subscales was reported were also recorded to be used in the reliability generalization meta-analysis of the subscales. In the selection of studies to be included in the meta-analysis, the process should be carried out in accordance with a checklist. In addition to the REGEMA (Reliability Generalization Meta-Analysis) checklist developed by Sanchez‐Meca et al. (2013) for reliability generalization meta-analysis, PRISMA (Preferred Reporting Items for Systematic reviews and Meta-Analysis) developed by Moher et al. (2009) is the most widely used checklist for meta-analysis. The studies included in the analysis are shown in the PRISMA flow diagram in Figure 1.

Figure 1
Flowchart depicting a meta-analysis process. Starting with an electronic database search yielding 1,598 results, duplicates reduced it to 1338. Screening criteria included English studies with specific scales, narrowing to 792 full-texts. Of these, 81 were included in the meta-analysis, and 73 reported only subscale reliability. Exclusions involved abstracts and studies with insufficient data, incorrect scale use, or non-English publications.

Figure 1. PRISMA flowchart: Studies included in the study.

The 157 reliability coefficients obtained from 154 studies recorded during the data collection phase were reported. Of the 115 studies in which STARS was used, 24 studies reported the alpha coefficient for the whole scale but not for the subscales. And, there are 27 studies in which the alpha coefficient was reported for the whole STARS and subscales. On the other hand, there are 64 studies in which the alpha coefficient is not reported for the whole STARS, but only for the subscales. A similar situation also applies to SAS. Alpha coefficient for the whole SAS was reported in 5 out of 27 studies in which SAS was used. There are 14 studies reporting alpha coefficient for the whole SAS and subscales. On the other hand, there were 8 studies in which the alpha coefficient was reported for subscales but not for the whole SAS, and 5 studies in which the WAESTA scale was used. Four studies using the SAS-10 scale were recorded. One study using SAM and one study using SAQ were also included in the analysis (Earp, 2007; Idika and Ojong, 2020). In addition, studies using two different statistics anxiety scales (Altun et al., 2021; Chew et al., 2018; Grajzel, 2019; Igbokwe et al., 2017) were recorded as a second study with the same name.

In a study using the SAI recorded during the data collection phase (Zeidner, 1991), only the alpha coefficient for the subscales was reported. Similarly, in a study using the SAM (Vahedi et al., 2011), only the alpha coefficient for the subscales was reported. On the other hand, in two studies (Asare, 2023; Jones et al., 2022) where the reliability coefficient for the subscales of STARS was reported, the omega coefficient was reported. Since only studies reporting alpha coefficients were included in the analysis, the two studies reporting omega coefficients (Asare, 2023; Jones et al., 2022) were excluded from the analysis. In this case, a total of four studies were excluded from the analysis.

2.2 Data coding and coder reliability

In addition to the alpha coefficients of the studies included in the analysis, the year of publication, number of items, the scale used, the language in which the scale was applied, the type of publication of the study, the type of sample, the continent in which the scale was applied, were also recorded. Sample size, the ratio of the number of women to the number of men in the sample, the mean scale score, the standard deviation of the mean scale score, and the mean age were also collected. The scales used in these data were coded as STARS, SAS, WAESTA, SAS-10, SAM and SAQ. Moreover, the language in which the scale was administered was coded as English and non-English. The continent variable was coded as Asia, Europe, America, Africa and Australia. But there are studies that collected data from more than one continent (Chew et al., 2018; Gibeau et al., 2023). The sample type variable was coded as undergraduate, graduate and mixed (undergraduate and graduate).

For coder reliability, 20% of the researches were randomly selected from the data. After checking the reliability coefficient and n values for the selected data, When the data obtained from the researchers was evaluated, it was found that the inter-researcher agreement coefficient was 100%.

2.3 Data analysis

The present meta-analysis study aims to apply a reliability generalization meta-analysis to the 84 reliability coefficients reported for the scales in 81 studies that used different statistics anxiety scales. Additionally, 73 studies reporting reliability coefficients for subscales were included in the study to calculate the mean alpha coefficient for subscales. Analyses were conducted with the CMA v3 program. Method of moments (MM, also known as the DerSimonian and Laird method) was used as estimator method during the analysis.

In the study, firstly, the reliability induction rate was determined. In this way, the case of reporting reliability coefficient in primary studies will be revealed. Since the alpha coefficients of the scales are mostly reported at values of 0.70 and above, the distribution of alpha coefficients will show negative skewness (Beretvas and Pastor, 2003). In this context, the researchers contend that it would be more appropriate to apply a transformation formula to the alpha coefficients (Bonett, 2002, 2010; Hakstian et al., 1976). It was deemed appropriate to apply Bonett (2002) transformation formula to the alpha coefficients in order to stabilize the variance as well as to eliminate the skewness in the alpha coefficients. The transformation formulas for variance variability and alpha coefficients are as follows (Equations 1,2):

L i = ln ( 1 α ̂ i )     (1)
V ( L i ) = 2 J i ( J i 1 ) ( n i 2 )     (2)

After applying the transformation formula to the alpha coefficients, the effect size to be used in the meta-analysis should be selected. Since the random effects model is mostly used in social sciences and it is thought that there is more than one source of variability in this model, the random effects model was chosen in this study. Following the selection of the effect size, in addition to the forest plot presented to test the heterogeneity of the data group, the Q-statistic and the I2 value to determine the amount of heterogeneity were calculated. The Q-statistic aims to test whether there is a statistically significant chi-square value (Sedgwick, 2015). On the other hand, 50% percentile value is a limit for the calculated I2 value (Cleophas and Zwinderman, 2007). After the heterogeneity of the data group was tested, analog to the ANOVA analysis was conducted under the mixed effects model in the analysis of categorical variables in order to determine the sources of heterogeneity (Hedges, 1982). In the analysis of continuous variables, meta-regression analysis with maximum likelihood estimation method was applied (Hedges and Olkin, 2014). Furthermore, the QB -statistic (Q-between) was used to determine statistical significance in moderator variables. R2 estimation was used to determine the proportion of variance explained by continuous variables in meta-regression analysis (Yıldırım, 2021).

In the study, six different methods are mentioned to determine whether there is publication bias (Card, 2015). These methods are the funnel plot (Light and Pillemer, 1984), the trim and fill method developed by Duval and Tweedie (2000), the fail-safe N method proposed by Rosenthal (1979) and Orwin (1983) fail-safe N method. Analyses were performed with these methods. In addition, the regression analysis developed by Egger et al. (1997) and the rank correlation test developed by Begg and Mazumdar (1994) were utilized. The data was analyzed using the CMA v3 program.

2.4 Demographic characteristics of the primary studies

The distribution of the studies in which statistics anxiety scales were used according to years is shown in Figure 2. When Figure 2 is examined, it is seen that the use of statistics anxiety scales has increased over the years and the most commonly used year is 2022.

Figure 2
Line graph showing frequency from 1985 to 2023. The values remain low and stable until around 2009, then increase with peaks around 2011, 2019, and a sharp peak in 2022 before dropping in 2023.

Figure 2. Number of studies using statistics anxiety scale according to years.

The demographic characteristics of the studies included in the analysis are shown in Table 1. When Table 1 is examined, the measurement tools consist of 6 different scales, namely STARS (61.90%), SAS (25.00%), WAESTA (5.95%), SAS-10 (4.76%), SAM (1.19%) and SAQ (1.19%). When the publication types of the studies were analyzed, it was seen that they consist of thesis (13.04%), article (76.19%) or paper (10.71%). The measurement tools were applied not only in the continent where they were developed but also in Asia (19.04%), Europe (29.76%), America (39.28%), Africa (11.90%) and Australia (1.19%). The educational level of the participants was grouped in three different levels: undergraduate students (68.24%), graduate students (23.52%), and mixed (8.24%). The studies using the instruments measuring statistics anxiety were grouped in two different ways: studies using the English version (56.52%) and studies using non-English versions (43.47%). The non-English versions were Arabic, Malay, Italian, German, Turkish, Chinese, Spanish, Hebrew, French and Dutch. Eighty-four reliability coefficient obtained from 81 studies for reliability generalization meta-analysis are presented in Supplementary Table 1. As stated in Supplementary Table 1, no year range was determined during the screening, and it was aimed to reach all published studies. In Supplementary Table 1, in addition to the reliability coefficients of the samples, the number of the sample, the ratio of the number of women to the number of men in the sample, the scale applied, the number of items in the scale, the continent where the study was conducted, the type of study published, the education level of the participants, and the language in which the scale was applied are given. There is also information on the number of items of the measurement tool, the year of publication of the study and the mean age of the sample.

Table 1
www.frontiersin.org

Table 1. Characteristics of the studies included in the study.

According to Supplementary Table 1, there were a total of 18,809 participants in the studies included in the meta-analysis. In addition to the number of participants, there are also studies reporting the mean age of participants. In 53 of the 81 studies, the mean age of the participants was reported. According to the reported mean ages, the mean age of the participants ranged between 18.23 and 40.73. The number of studies reporting the gender of the participants is also quite high. Only 19 studies did not report the number of men and women participating in the study, while the number of men and women was reported in the remaining 65 studies. Accordingly, in the current study, the ratio of the number of women participating in the research to the number of men was taken for each study. The female to male ratio of the participants is between 0.70 and 12.67.

Since the number of items of the measurement tools used in the studies is also an important variable for reliability generalization meta-analysis, the number of items for each study is reported. The number of items of the scales varied between 10 and 51. In addition, there are 21 studies reporting the mean scale score recorded during the data collection phase. The mean scale scores of these studies ranged from 43.46 to 198.84 for STARS and from 46.10 to 74.70 for SAS. For WAESTA, the mean scores ranged from 40.62 to 42.66, and for SAS-10, the mean scale scores ranged from 26.07 to 34.19. The standard deviation of the mean scale scores of these studies is between 1.1 and 59.06. The publication years of the studies included in the analysis ranged between 1985 and 2023 (Median = 2017).

Figure 3 shows the stem and leaf graphs of the 84 raw alpha values. When the alpha coefficients are analyzed, it can be inferred that the reliability of all of these values is at a sufficient level (O’Rourke et al., 2005). In Figure 3, the unweighted mean alpha coefficient was calculated as 0.913 (Median = 0.930, SD = 0.056). According to the graph, the lowest alpha coefficient was 0.74 and the highest alpha coefficient was 0.98. The values of the 84 reliability coefficients show an asymmetry of −1.514.

Figure 3
A text listing with a decimal point to the left, containing:

Figure 3. Stem and leaf plot of raw alpha values.

3 Results

3.1 Reliability induction

In some studies, instead of reporting the reliability coefficient of the sample, the reliability coefficient of the sample in which the scale was developed is reported. This situation is called reliability induction in the literature (Shields and Caruso, 2004; Şen and Yıldırım, 2023). In the current research, there are quite a number of studies in which the reliability coefficients of the sample of the studies in which statistics anxiety scales were used were not reported. In 204 studies in which statistics anxiety scales were used, the reliability coefficient of the applied sample was not reported and was not included in the analysis. In this case, 157 studies in which the reliability coefficient of the sample to which the scale was applied was reported were included in the analysis. Since 204 studies in which the statistics anxiety scale was used and the reliability coefficient was not reported were not included in the analysis, the reliability induction rate was determined as 57.2%. As can be seen from here, more than half of the studies that should have used the measurement tool and reported reliability did not take this requirement into account. This can be considered as an important methodological problem.

3.2 Publication bias

In order to determine whether there was any publication bias in the data group. According to the classical fail-safe N value, the value was found to be 10,026. This value is considerably higher than the value of 425 obtained with the formula NR (5 k + 10). In this case, it is seen that the combined alpha coefficients are not biased according to the classical perpetrator-safe N method. According to Orwin’s fail-safe N method, the number of missing data should be 2,120 for Fisher’s z to be 0.01. Since this number is almost impossible to reach, it can be said that the current study is not biased according to Orwin’s fail-safe N method. Moreover, according to the findings of Begg and Mazumdar’s rank correlation test, Kendall’s two-tailed p value is not statistically significant (p = 0.432) and the standard error value is negative (τ = −0.05). According to this test, the two-tailed p value should not be statistically significant to ensure the absence of publication bias. In addition, according to the Egger’s test analyzed, the t value is not statistically significant [t(83) = 0.183, p = 0.427]. Egger’s one-tailed p > 0.05 indicates that the current finding is not biased. Finally, the findings of Duval and Tweedie’s trim and fill method are shown in Figure 4.

Figure 4
Funnel plot showing the standard error by Fisher’s Z value, with data points scattered around a central vertical line. The plot displays a triangular pattern indicating whether there is publication bias.

Figure 4. Funnel plot obtained as a result of the trim and fill method.

When Figure 4 is examined, the data shows a symmetrical distribution. In addition, the fact that there are only two fictitious studies that need to be added to ensure that the publication is not biased shows that the data group is not biased. Moreover, the forest plots created to determine the heterogeneity of the data group is shown in Figure 5.

Figure 5
A forest plot displaying the results of multiple studies listed in a table. The table includes columns for study name, Fisher's Z, standard error, variance, lower and upper limits, Z-value, and p-value. The plot on the right visually represents each study's confidence intervals, with squares indicating Fisher's Z and horizontal lines showing the 95% confidence intervals. The x-axis spans from -4.00 to 4.00, with labels

Figure 5. Forest plot of the studies included in the meta-analysis.

3.3 Reliability generalization meta-analysis for each scale

The statistics anxiety scales included in the reliability generalization meta-analysis were analyzed with a total of six different measurement tools: STARS, SAS, SAS-10, WAESTA, SAQ and SAM. It was aimed to conduct reliability generalization meta-analysis for each of these scales, and since only one study using SAM and SAQ scales was included in the analysis, no reliability generalization meta-analysis was conducted for these studies. Accordingly, the reliability coefficients of the studies using the STARS, SAS, SAS-10 and WAESTA scales as well as the pooled reliability values of the subscales of STARS and SAS are shown in Table 2. Table 2 also includes the descriptive statistics of these measurement tools.

Table 2
www.frontiersin.org

Table 2. Mean reliability values of statistics anxiety scales and subscales.

The internal consistency coefficient of the sample in which the measurement tools were developed, the total number of studies in which the scale was used and the number of items in the scale are shown in Table 2. Moreover, for each of these scales, the 95% confidence interval of the alpha values for which Bonett (2002) transformation formula was applied, and the lowest and highest alpha coefficients in the included studies were also presented. In addition, the Q statistic with degrees of freedom for testing heterogeneity and I2 values for the amount of heterogeneity are shown. Finally, Orwin’s fail-safe N value was examined to determine whether the data formed by the alpha coefficients of the scales were subject to publication bias.

According to Table 2, the alpha coefficient of the sample in which STARS was developed was reported as 0.96, while the original alpha coefficients of the subscales were 0.94, 0.87, 0.68, 0.88, 0.89 and 0.80, respectively. In addition to the 52 studies in which STARS was used and included in the analysis, the number of studies in which subscales were used and included in the analysis were 85, 90, 91, 83, 88 and 82, respectively. As a result of the reliability generalization meta-analysis applied to the alpha coefficients of these studies with a range between 0.740 and 0.980, the mean alpha coefficient for the whole STARS was calculated as 0.931 (95% CI: 0.917–0.942), and the mean alpha coefficients for the subscales were calculated as 0.911, 0.864, 0.865, 0.847, 0.822 and 0.780, respectively. Table 2 shows that heterogeneity was achieved according to the Q values calculated for the whole STARS and each of the subscales (p < 0.01). Furthermore, the I2 values calculated for the heterogeneity amounts of these scales are quite high (>75%). Considering the Orwin fail-safe N value examined for publication bias, it can be concluded that the data groups formed for STARS and subscales of STARS are not biased.

Table 2 presents the findings of the reliability generalization meta-analysis for the whole 24-item SAS and its three subscales as well as demographic characteristics. While the alpha coefficient for the sample in which the SAS was developed was reported as 0.91, the original alpha coefficients for the subscales of the SAS were 0.87, 0.92 and 0.82, respectively. In addition to reporting the alpha coefficients of 21 studies in which the SAS was used, the number of studies in which the alpha coefficients of the subscales were also reported were 22, 22 and 21, respectively. As a result of the reliability generalization meta-analysis applied to alpha values ranging from 0.790 to 0.950, the mean alpha value for the whole SAS was determined as 0.917 (95% CI: 0.989–0.933). The mean alpha coefficients for the subscales of the SAS were 0.895, 0.927 and 0.876, respectively. According to the Q value calculated for the whole and subscales of the SAS, heterogeneity was achieved (p < 0.01) and the I2 value calculated for the amount of heterogeneity was quite high (>75%). Considering the Orwin fail-safe N value examined for publication bias, it can be concluded that the data groups created for the SAS and its subscales are not biased.

Although the 17-item WAESTA scale has three subscales, reliability generalization meta-analysis was conducted for this scale since there are few studies using the WAESTA scale included in the analysis in the current study. Reliability generalization meta-analysis was not applied to the subscales. Five studies using this scale were included in the analysis and the alpha coefficient for the sample in which WAESTA was developed was reported as 0.94. The mean alpha coefficient of the studies using the WAESTA scale with a range between 0.940 and 0.960 was analyzed as 0.951 (95% CI: 0.941–0.960). According to the Q-statistic calculated to test the heterogeneity of the studies using WAESTA, heterogeneity was achieved (p < 0.01). In addition, the I2 value calculated for the amount of heterogeneity exceeded the 50% limit. Considering the Orwin fail-safe N value examined for publication bias, it can be concluded that the data groups created for WAESTA are not biased.

Four studies using the 10-item, unidimensional SAS-10 scale were included in the analysis and the range of these studies varied between 0.750 and 0.980. The alpha coefficient of the sample in which SAS-10 was developed was reported as 0.90. The mean alpha coefficient of the studies using this scale was analyzed as 0.918 (95% CI: 0.836–0.959). The heterogeneity of the analyzed data group was ensured according to the calculated Q-statistic (p < 0.01) and the amount of heterogeneity was quite high (>75%) according to the calculated I2 value. Considering the Orwin fail-safe N value examined for publication bias, it can be concluded that the data groups created for SAS-10 are not biased.

3.4 Overall effect size

In addition to the reliability generalization meta-analysis conducted for each of the scales, all of the studies using these scales were combined and the overall alpha mean of all studies was created. In Table 3, the general reliability coefficient formed by combining the reliability coefficients of the studies included in the meta-analysis is reported under the random effects model. Moreover, the heterogeneity test for the model is also presented in Table 3. The Q value calculated to determine the heterogeneity of the data group was reported as 3596.36. This value is well above the 0.05 confidence interval limit (sd = 83, χ2 = 105.27) with 84 degrees of freedom in the chi-square table. In this case, the heterogeneity of the data is ensured. The I2 value calculated to determine the amount of heterogeneity was found to be 97.6%. This value shows that the amount of heterogeneity of the data group is quite high.

Table 3
www.frontiersin.org

Table 3. Effect sizes and heterogeneity test.

The mean of the alpha coefficients using Bonett (2002) transformation formula was 0.928 (95% CI: 0.917–0.937) under the random effects model and this value is statistically significant (p < 0.01). It is seen that the mean of the transformed alpha values is higher than the mean of the raw alpha values shown in Figure 3.

3.5 Sub-group analysis of categorical variables

In addition to the statistical significance of all studies included in the analysis, categorical and continuous variables were determined to investigate the source of heterogeneity of the studies. Analog to the ANOVA analysis was performed for categorical variables and meta-regression analysis for continuous variables. In Supplementary Table 1, five different categorical variables were identified: the type of scale used, the type of publication of the study, the language in which the scale was applied, the educational level of the sample, and the continent in which the study was applied. Moreover, the continuous variables in Supplementary Table 1 are the year of publication of the study, the ratio of the number of women to the number of men in the sample, the number of items in the scale, the sample size, the mean age of the sample, the mean score of the scale and the standard deviation of the mean score of the scale. Analog to the ANOVA findings for categorical variables are shown in Table 4.

Table 4
www.frontiersin.org

Table 4. Analog to the ANOVA results of categorical variables.

In the analog to the ANOVA analysis, the difference between the scales was first investigated. STARS (k = 52), SAS (k = 21), WAESTA (k = 5) and SAS-10 (k = 4) scales were analyzed with alpha coefficients. Since there was only one study using SAM and SAQ scales, they were not included in the analog to the ANOVA analysis. It was observed that the mean alpha coefficients of the 83 studies included in the analog to the ANOVA analysis differed according to the scale types (Q = 14.364, sd = 3, p < 0.05). In this case, it can be said that the heterogeneity of the mean alpha coefficient is due to the type of scale used. When the mean alpha coefficients of the scale types are compared, it is seen that the scale with the highest mean alpha value is WAESTA (0.952), followed by STARS (0.931), and finally SAS (0.918) and SAS-10 (0.918) scales together.

No statistically significant difference was found between the mean alpha coefficients of other categorical variables in the current study (p > 0.05). According to Table 4, no statistically significant difference was found according to the variables of education level (Q = 2.656, sd = 2, p = 0.265), continent where the scale was applied (Q = 2.780, sd = 3, p = 0.433), language where the scale was applied (Q = 0.619, sd = 1, p = 0.431) and type of publication of the study (Q = 1.142, sd = 2, p = 0.613). The non-significant p values indicate that comparable findings were not reached between subgroups.

3.6 Moderator analysis of continuous variables

In the present study, continuous variables were determined in addition to categorical variables. The year the study was published, the size of the sample, the number of items in the scale, the mean age of the sample, the ratio of the number of women to the number of men in the sample, the mean scale score and the standard deviation of the mean scale score were determined as continuous variables. The findings of the meta-regression analysis applied to investigate the effect of each continuous variable on the mean alpha coefficient are shown in Table 5. The reason for applying meta-regression analysis separately for each variable instead of multiple is missing data. Since the data of different studies could not be accessed under each heading, there are less than 15 studies when the general model is established.

Table 5
www.frontiersin.org

Table 5. Results of meta-regression analysis applied separately for each continuous variable.

According to the meta-regression findings, the variables of scale mean score and standard deviation of scale mean score are statistically significant predictors of the mean reliability value (p < 0.05). As predicted, there is a significant and positive relationship between these variables and mean alpha values. A one-unit increase in the mean scale score is expected to increase the mean reliability value by 0.0056. Furthermore, a one-unit increase in the standard deviation of the mean scale score is expected to increase the mean reliability value by 0.0198 points. Here, it is seen that the standard deviation of the mean scale score affects the mean alpha value more than the mean scale score. The R2 value, which is the indicator of the explained variance ratio, is 24% for the mean scale score and 24% for the standard deviation of the mean scale score. According to Table 5, other continuous variables were not statistically significant predictors of the mean alpha coefficient (p > 0.05). In the multiple regression model for the combined reliability value, the entire model was statistically significant (QM = 3593.76, p < 0.01) and explained 98% of the total variance. In addition, the model for residual values is also statistically significant (Q = 3424.61, p < 0.01).

4 Conclusion and discussion

The aim of the present study is to establish an overall reliability value for all of the statistics anxiety scales, as well as to apply reliability generalization meta-analysis for each statistics anxiety scale (STARS, SAS, WAESTA, SAS-10). Bonett (2010) conversion formula was applied to the alpha coefficients of the studies in which statistics anxiety scales were used. Since the mean alpha value obtained from the transformed alpha coefficients is above 0.90, the applications are reliable (Büyüköztürk, 2021). In the literature, there are studies in which the mean alpha coefficient of each scale was calculated in addition to reliability generalization meta-analysis for different scales measuring the same construct (Graham and Christiansen, 2009; Graham et al., 2011). Reliability generalization meta-analysis was also conducted for the subscales of multidimensional statistics anxiety scales. As a result of this analysis, it was concluded that the reliability of each subscale was above 0.70 (Clark and Watson, 2016). When the literature is examined, there are studies in which the mean alpha coefficients for the subscales were analyzed in addition to the mean alpha value for the entire measurement tool (Kıyıcı and Kahraman, 2022; Graham and Christiansen, 2009). This is similar to the method of the current study. In addition to the reliability generalization meta-analysis applied for each scale, reliability generalization meta-analysis was also applied for all of the statistics anxiety scales. Since the mean alpha coefficient obtained as a result of the analysis conducted under the random effects model was above 0.70, it was seen that the applications of the scale were reliable (DeVellis, 1991; Tavakol and Dennick, 2011). Moreover, the heterogeneity of the data group was ensured and it was determined that the amount of heterogeneity was quite high. The scope of the current study includes articles, proceedings, and theses that meet the specified criteria and use at least one of the scales mentioned above, accessible via the Web of Science, Scopus, ERIC databases, and Google Scholar search engine as of July 2023. Only studies published in Turkish or English were included in the research. In many studies, it was observed that the reliability coefficient of the sample in which the scale was used was not reported. Since the conditions of each application of the scale are different, the reliability value should be reported (Shields and Caruso, 2003).

In order to investigate the sources of heterogeneity in the heterogeneous data group, analog to the ANOVA was applied for categorical variables and meta-regression analysis was applied for continuous variables. The categorical variables for which analog to the ANOVA was applied were examined in five subgroups: the type of scale, the educational level of the sample, the continent in which the scale was applied, the language in which the scale was applied, and the type of publication of the study. In line with the findings obtained, it is seen that the mean alpha value differs statistically significantly according to the scale type variable. Grajzel (2019) applied STARS and SAS measurement tools to the same sample and reported the alpha coefficient of STARS as 0.890 and the alpha coefficient of SAS as 0.940. This does not coincide with the results obtained in the present study on the other hand, Igbokwe et al. (2017), similar to Grajzel (2019), reported the alpha coefficients of STARS and SAS measurement tools for the same sample as 0.960 for STARS and 0.910 for SAS. This supports the Analog to the ANOVA findings of the current study classified according to scale type.

In the present study, the mean alpha coefficient does not differ statistically significantly according to other categorical variables (the educational level of the sample, the continent where the scale was applied, the language in which the scale was applied, the type of publication of the study). In the literature, there are studies in which statistics anxiety differs according to the educational level of the sample (Fitzgerald, 1996), as well as studies in which there is no statistically significant difference between statistics anxiety and educational level (Benson, 1989). On the other hand, there are studies in the literature where there is a difference in the level of statistics anxiety according to continents (Kesici et al., 2011; Onwuegbuzie, 1999). Moreover, although there was no statistically significant difference in the current study in terms of the language in which the scale was applied, there is a study in the literature in which there was a difference in the level of statistics anxiety according to the language in which the scale was applied (Förster and Maur, 2015). In this study, it was revealed that the group whose mother tongue was different had more statistics anxiety than the group whose mother tongue was German. The reason for this situation can be explained by the inability to understand statistical concepts correctly, followed by a higher level of statistics anxiety. In the present study, no statistically significant difference was found between statistics anxiety and the type of publication of the study. In a different study using reliability generalization meta-analysis method (Sen, 2022), no difference was found between the mean alpha coefficient and the type of publication. On the other hand, there is a study showing that the type of extension is one of the variables affecting statistics anxiety (Fitzgerald, 1996). Since the aforementioned study tried to explain statistics anxiety together with attitude and achievement factors, different results may have been obtained from the present study.

In order to determine the continuous variables affecting the mean alpha coefficient, meta-regression analysis was performed for seven moderator variables (year of publication of the study, size of the sample, number of items, mean age of the sample, ratio of the number of women to the number of men, mean scale score and standard deviation of the mean scale score). As a result of the analysis, it was found that the mean scale score and the standard deviation of the mean scale score were statistically significant predictors of the mean alpha value. This may be due to the different number of items in the scales. Similarly, in another study in which reliability generalization meta-analysis was applied (López-Pina et al., 2015), the mean scale score and standard deviation of the mean scale score were found to be predictors of the mean reliability value. This supports the current study. On the other hand, unlike the conclusion of the current study that year of publication has no effect on statistics anxiety, a study concluding that year of publication has an effect on statistics anxiety was found (Fitzgerald, 1996). Since the year of publication was classified and analyzed as categorical variables in groups of ten instead of continuous variables in the aforementioned study, different results may have been reached. In addition, the result that the mean alpha coefficient was not affected by the sample size in the current study was also reached in a study (Sen, 2022) in which reliability generalization meta-analysis was applied. In the present study, the number of items was not one of the variables affecting the mean alpha coefficient. In the literature, it was found that test length affects statistics anxiety (Fitzgerald, 1996). Contrary, in a study investigating attitude towards mathematics (Bradford, 1990), it was revealed that the length of the test affects attitude. In the aforementioned studies, test length was classified and transformed into a categorical variable and analyzes were conducted in this way. In this case, it can be considered natural to reach different results with the present study. On the other hand, the study concluded that the mean age had no effect on the mean alpha coefficient. Another variable whose effect on the mean alpha coefficient was investigated is the mean age of the sample. In the present study, it was concluded that the mean age did not affect the mean alpha coefficient. Similarly, there are studies in the literature (Bui and Alfaro, 2011; Fitzgerald, 1996) that concluded that age has no effect on statistics anxiety. On the other hand, there are also studies showing that groups with a higher mean age exhibit higher statistics anxiety than younger groups (Bell, 2003; Edirisooriya and Lipscomb, 2021). Since younger groups are more familiar with computerized applications (Prensky, 2001), they may have lower statistics anxiety in statistical applications. Finally, in this study, it was determined that the ratio of the number of women to the number of men in the sample had no effect on the mean alpha coefficient. In the literature, there are studies that show that the level of statistics anxiety differs statistically significantly according to gender, rather than the ratio of the number of women to the number of men in samples that aim to determine statistics anxiety (Edirisooriya and Lipscomb, 2021; Gibeau et al., 2023; Hsiao and Chiang, 2011; MacArthur, 2020). In her study, Demaria-Mitton (1987) found that the level of statistics anxiety did not differ statistically significantly according to gender. Furthermore, Mandap (2016) investigated the difference of statistics anxiety of the group to which STARS was applied according to gender on the subscales of STARS. According to the results, a statistically significant difference was found only for one of the subscales according to gender. According to the findings, women’s level of fear of asking for help was found to be higher than that of men. When we look at this result, it is seen that there is a statistically significant difference only for one of the subscales rather than the whole scale. For this reason, it can be considered that the aforementioned study may have differed from the result of the current study, since analyzes were not conducted for the whole scale.

Bonett (2010) conversion formula was applied to the alpha coefficients in the study. In future studies, different conversion formulas (Fisher z, Hakstian et al., 1976) can be applied to different reliability values (composite reliability, test–retest). In addition, researchers who want to examine the variables affecting statistical anxiety, unlike the variables investigated in the current study, can examine the effect of variables such as the field of education of the scale and the country where the scale is applied.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.

Author contributions

EÖ: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. IY: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declared that financial support was not received for this work and/or its publication.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that Generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2025.1675957/full#supplementary-material

References

Akdeniz, F. (2015). İstatistikte yeni eğilimler ve gelişmeler. Sosyal Bilimler Araştırma Dergisi 4, 1–11.

Google Scholar

Asare, P. Y. (2023). Profiling teacher pedagogical behaviours in plummeting postgraduate students’ anxiety in statistics. Cogent Educ. 10:2222656. doi: 10.1080/2331186X.2023.2222656

Crossref Full Text | Google Scholar

Begg, C. B., and Mazumdar, M. (1994). Operating characteristics of a rank correlation test for publication bias. Biometrics 50, 1088–1101. doi: 10.2307/2533446,

PubMed Abstract | Crossref Full Text | Google Scholar

Bell, J. A. (2003). Statistics anxiety: the nontraditional student. Education 124, 157–163.

Google Scholar

Bendig, A. W., and Hughes, J. B. (1954). Student attitude and achievement in a course in introductory statistics. J. Educ. Psychol. 45, 268–276. doi: 10.1037/h0057391

Crossref Full Text | Google Scholar

Benson, J. (1989). Structural components of statistical test anxiety in adults: an exploratory model. J. Exp. Educ. 57, 247–261. doi: 10.1080/00220973.1989.10806509

Crossref Full Text | Google Scholar

Beretvas, S. N., and Pastor, D. A. (2003). Using mixed-effects models in reliability generalization studies. Educ. Psychol. Meas. 63, 75–95. doi: 10.1177/0013164402239318

Crossref Full Text | Google Scholar

Betz, N. E. (1978). Prevalence, distribution and correlates of math anxiety in college students. J. Couns. Psychol. 25, 441–448. doi: 10.1037/0022-0167.25.5.441

Crossref Full Text | Google Scholar

Bonett, D. G. (2002). Sample size requirements for testing and estimating coefficient alpha. J. Educ. Behav. Stat. 27, 335–340. doi: 10.3102/10769986027004335

Crossref Full Text | Google Scholar

Bonett, D. G. (2010). Varying coefficient meta-analytic methods for alpha reliability. Psychol. Methods 15, 368–385. doi: 10.1037/a0020142,

PubMed Abstract | Crossref Full Text | Google Scholar

Bradford, J. W. (1990). A meta-analysis of selected research on student attitudes towards mathematics : The University of Iowa.

Google Scholar

Büyüköztürk, Ş. (2021). Sosyal Bilimler İçin Veri Analizi El Kitabı (29. baskı). Pegem Akademi, Ankara.

Google Scholar

Büyüköztürk, Ş., Kılıç Çakmak, E., Akgün, Ö. E., Karadeniz, Ş., and Demirel, F. (2012). Örnekleme yöntemleri.

Google Scholar

Card, N. A. (2015). Applied meta-analysis for social science research. New York: Guilford Publications.

Google Scholar

Clark, L. A., and Watson, D. (2016). Constructing validity: basic issues in objective scale development. Psychological Assessment 7, 309–319. doi: 10.1037/14805-012

Crossref Full Text | Google Scholar

Cleophas, T. J., and Zwinderman, A. H. (2007). Meta-analysis. Circulation 115, 2870–2875. doi: 10.1161/CIRCULATIONAHA.105.594960,

PubMed Abstract | Crossref Full Text | Google Scholar

Cruise, R. J., Cash, R. W., and Bolton, D. L. (1985). “Development and validation of an instrument to measure statistical anxiety” in Paper presented at the annual meeting of the American Statistical Association statistics education section, Las Vegas, Nevada.

Google Scholar

Demaria-Mitton, P. A. 1987 Locus-of-control. Gender and type of major as correlates to statistics anxiety in college students. Unpublished doctoral dissertation, American U niversity

Google Scholar

DeVellis, R. F. (1991). Scale development: Theory and applications. California: Sage Publications, 1–113.

Google Scholar

Duval, S., and Tweedie, R. (2000). Trim and fll: a simple funnel-plot-based method of testing and adjusting for publication bias in meta-analysis. Biometrics 56, 455–463. doi: 10.1111/j.0006-341X.2000.00455.x

Crossref Full Text | Google Scholar

Egger, M., Smith, G. D., Schneider, M., and Minder, C. (1997). Bias in meta-analysis detected by a simple, graphical test. BMJ 315, 629–634. doi: 10.1136/bmj.315.7109.629,

PubMed Abstract | Crossref Full Text | Google Scholar

Erkuş, A. (2011). Davranış bilimleri için bilimsel araştırma süreci. Ankara: Seçkin.

Google Scholar

Faber, G., Drexler, H., Stappert, A., and Eichhorn, J. (2018). Education science students’ statistics anxiety: developing and analyzing a scale for measuring their worry, avoidance, and emotionality cognitions. Int. J. Educ. Psychol. 7, 248–285. doi: 10.17583/ijep.2018.2872

Crossref Full Text | Google Scholar

Feinberg, L. B., and Halperin, S. (1978). Affective and cognitive correlates of course performance in introductory statistics. J. Exp. Educ. 46, 11–18. doi: 10.1080/00220973.1978.11011637

Crossref Full Text | Google Scholar

Fisch, R. (1971). Course evaluation, test anxiety, and final test results in a statistics course. Z. Entwicklungspsychol. Paedagog. Psychol. 3, 212–228.

Google Scholar

Fitzgerald, S. M. (1996). The relationship between anxiety and statistics achievement: a meta-analysis : The University of Toledo.

Google Scholar

Graham, J. M., and Christiansen, K. (2009). The reliability of romantic love: a reliability generalization meta-analysis. Pers. Relat. 16, 49–66. doi: 10.1111/j.1475-6811.2009.01209.x

Crossref Full Text | Google Scholar

Graham, J. M., Diebels, K. J., and Barnow, Z. B. (2011). The reliability of relationship satisfaction: a reliability generalization meta-analysis. J. Fam. Psychol. 25, 39–48. doi: 10.1037/a0022441,

PubMed Abstract | Crossref Full Text | Google Scholar

Hakstian, A. R., Whalen, T. E., and Masson, M. E. (1976). AK-sample procedure for comparatively assessing multivariate association. Psychol. Bull. 83, 922–927. doi: 10.1037/0033-2909.83.5.922

Crossref Full Text | Google Scholar

Hedges, L. V. (1982). Fitting categorical models to effect sizes from a series of experiments. J. Educ. Stat. 7, 119–137. doi: 10.3102/10769986007002119

Crossref Full Text | Google Scholar

Hedges, L. V., and Olkin, I. (2014). Statistical methods for meta-analysis. Orlando: Academic press.

Google Scholar

Jones, A., Knibb, G., and Christiansen, P. (2022). Statistics anxiety, predictions of exam performance and actual exam performance in UK psychology students. PsyArXiv. doi: 10.31234/osf.io/edc2n

Crossref Full Text | Google Scholar

Kıyıcı, G., and Kahraman, N. (2022). A meta-analytic reliability generalization study of the computational thinking scale. Sci. Insights Educ. Front. 13, 1859–1874. doi: 10.15354/sief.22.ma011

Crossref Full Text | Google Scholar

Leech, N. L., Onwuegbuzie, A. J., and O'Conner, R. (2011). Assessing internal consistency in counseling research. Counsel. Outcome Res. Eval. 2, 115–125. doi: 10.1177/2150137811414873

Crossref Full Text | Google Scholar

Light, R. J., and Pillemer, D. B. (1984). Summing up: The science of reviewing research. Massachusetts: Harvard University Press.

Google Scholar

López-Pina, J. A., Sánchez-Meca, J., López-López, J. A., Marín-Martínez, F., Núñez-Núñez, R. M., Rosa-Alcázar, A. I., et al. (2015). The Yale–Brown obsessive compulsive scale: a reliability generalization metaanalysis. Assessment 22, 619–628. doi: 10.1177/1073191114551954,

PubMed Abstract | Crossref Full Text | Google Scholar

Moher, D., Liberati, A., Tetzlaff, J., and Altman, D. G.Prisma Group (2009). Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLoS Med. 151:e1000097. doi: 10.7326/0003-4819-151-4-200908180-00135,

PubMed Abstract | Crossref Full Text | Google Scholar

O’Rourke, N., Hatcher, L., and Stepanski, E. J. (2005). A step-by-step approach to using SAS for univariate & multivariate statistics. Cary, NC, United States: SAS Institute.

Google Scholar

Ock, J., McAbee, S. T., Ercan, S., Shaw, A., and Oswald, F. L. (2021). Reliability generalization analysis of the core self-evaluations scale. Pract. Assess. Res. Eval. 26:6. doi: 10.7275/zsc7-jw58

Crossref Full Text | Google Scholar

Orwin, R. G. (1983). A fail-safe N for effect size in meta-analysis. J. Educ. Stat. 8, 157–159. doi: 10.2307/1164923

Crossref Full Text | Google Scholar

Özdemir, V., Yıldırım, Y., and Tan, Ş. (2020). A meta-analytic reliability generalization study of the Oxford happiness scale in Turkish sample. J. Meas. Eval. Educ. Psychol. 11, 374–404. doi: 10.21031/epod.766266

Crossref Full Text | Google Scholar

Prensky, M. (2001). Digital natives, digital immigrants part II: do they really think differently? On Horizon 9, 1–6. doi: 10.1108/10748120110424843

Crossref Full Text | Google Scholar

Rosenthal, R. (1979). The file drawer problem and tolerance for null results. Psychol. Bull. 86, 638–641. doi: 10.1037/0033-2909.86.3.638

Crossref Full Text | Google Scholar

Sanchez‐Meca, J., López‐López, J. A., and López‐Pina, J. A. (2013). Some recommended statistical analytic practices when reliability generalization studies are conducted. British Journal of Mathematical and Statistical Psychology 66, 402–425. doi: 10.1111/j.2044-8317.2012.02057.x,

PubMed Abstract | Crossref Full Text | Google Scholar

Sánchez-Meca, J., Marín-Martínez, F., López-López, J. A., Núñez-Núñez, R. M., Rubio-Aparicio, M., López-García, J. J., et al. (2021). Improving the reporting quality of reliability generalization meta-analyses: the REGEMA checklist. Res. Synth. Methods 12, 516–536. doi: 10.1002/jrsm.1487,

PubMed Abstract | Crossref Full Text | Google Scholar

Schacht, S., and Stewart, B. J. (1990). What's funny about statistics? A technique for reducing student anxiety. Teach. Sociol. 18, 52–56. doi: 10.2307/1318231

Crossref Full Text | Google Scholar

Sedgwick, P. (2015). Meta-analyses: what is heterogeneity? BMJ 350, 14–35. doi: 10.1136/bmj.h1435,

PubMed Abstract | Crossref Full Text | Google Scholar

Sen, S. (2022). A reliability generalization meta-analysis of Runco ideational behavior scale. Creat. Res. J. 34, 178–194. doi: 10.1080/10400419.2021.1960719

Crossref Full Text | Google Scholar

Şen, S., and Yıldırım, İ. (2023). Meta-analysis applications with CMA. Ankara: Anı Publishing.

Google Scholar

Shields, A. L., and Caruso, J. C. (2003). Reliability generalization of the alcohol use disorders identification test. Educ. Psychol. Meas. 63, 404–413. doi: 10.1177/0013164403063003004

Crossref Full Text | Google Scholar

Shields, A. L., and Caruso, J. C. (2004). A reliability induction and reliability generalization study of the CAGE questionnaire. Educational and Psychological Measurement, 64, 254–270.

Google Scholar

Spielberger, C. D., Gonzalez-Reigosa, F., Martinez-Urrutia, A., Natalicio, L. F., and Natalicio, D. S. (1970). The state-trait anxiety inventory. Revista Interamericana de Psicologia/Interamerican journal of psychology, 5.

Google Scholar

Tavakol, M., and Dennick, R. (2011). Making sense of Cronbach’s alpha. International journal of medical education, 2:53–55. doi: 10.5116/ijme.4dfb.8dfd

Crossref Full Text | Google Scholar

Taylor, R. 2012 Review of the motivated strategies for learning questionnaire (MSLQ) using reliability generalization techniques to assess scale reliability. Doctoral dissertation

Google Scholar

Vacha-Haase, T. (1998). Reliability generalization: exploring variance in measurement error affecting score reliability across studies. Educ. Psychol. Meas. 58, 6–20. doi: 10.1177/0013164498058001002

Crossref Full Text | Google Scholar

Vahedi, S., Farrokhi, F., and Bevrani, H. (2011). A confirmatory factor analysis of the structure of statistics anxiety measure: an examination of four alternative models. Iran. J. Psychiatry 6:92,

PubMed Abstract | Google Scholar

Wallace, K. A., and Wheeler, A. J. (2002). Reliability generalization of the life satisfaction index. Educ. Psychol. Meas. 62, 674–684. doi: 10.1177/0013164402062004009

Crossref Full Text | Google Scholar

Yaşar, M. (2014). İstatistiğe yönelik tutum ölçeği: Geçerlilik ve güvenirlik çalışması. Pamukkale Üniversitesi Eğitim Fakültesi Dergisi 36, 59–75. doi: 10.9779/PUJE640

Crossref Full Text | Google Scholar

Yıldırım, A. B. (2021). The effect of exercise on the total number of BrdU+ cell counts in rats’ hippocampal dentate gyrus: A meta-analysis study. Brain Research, 1766, 147512.

Google Scholar

Zeidner, M. (1991). Statistics and mathematics anxiety in social science students: some interesting parallels. Br. J. Educ. Psychol. 61, 319–328. doi: 10.1111/j.2044-8279.1991.tb00989.x,

PubMed Abstract | Crossref Full Text | Google Scholar

Studies included in the meta-analysis

Abd Hamid, H. S., and Sulaiman, M. K. (2016). “Factor structure of statistics anxiety rating scale (STARs): EFA and CFA Using Malaysian Undergraduate Psychology Students” in 7th International Conference on University Learning and Teaching (InCULT 2014) Proceedings: Educate to Innovate (pp. 149–156) (Singapore: Springer).

Google Scholar

Abdalla, S. (2021). Prevalence and predictors of statistics anxiety among sociology and social work students. J. Arts Soc. Sci.

Google Scholar

Abdulghani, R. 2023 Predictors of Statistics Anxiety Among Graduate Students in Saudi Arabia. Doctoral dissertation, Andrews University

Google Scholar

Adegboye, O. A., and Jawid, A. (2016). Multivariate multilevel models for attitudes toward statistics: multi-disciplinary settings in Afghanistan. J. Appl. Stat. 43, 244–261. doi: 10.1080/02664763.2015.1091445

Crossref Full Text | Google Scholar

Afdal, A., Alizamar, A., Ilyas, A., Zikra, Z., Taufik, T., Erlamsyah, E., et al. (2019). Contribution of statistical anxiety to student learning outcomes: study in universitas negeri Padang. J. Phys. Conf. Ser. 1157:042126. doi: 10.1088/1742-6596/1157/4/042126

Crossref Full Text | Google Scholar

Ali, S. I., and Gaber, S. A. (2022). Modeling the causal relationships between statistical self-efficacy, academic hardiness, and statistics anxiety among postgraduate students at King Faisal University. J. Posit. Sch. Psychol., 1734–1749.

Google Scholar

Alizamar, A., Afdal, A., Ifdil, I., Ardi, Z., Ilyas, A., and Zikra, Z., & & Febriani, R. D. (2019). Are there statistical anxiety differences between male and female students?. In J. Phys. Conf. Ser. (Vol. 1157, No., p.:042127). IOP Publishing. doi:doi: 10.1088/1742-6596/1157/4/042127

Crossref Full Text | Google Scholar

Altun, M., Yılmaz, G. K., Demir, B., and Çelik, H. S. (2021). Statistical anxiety and metacognitive awareness levels of graduate students studying in mathematics education program. Eur. J. Educ. Stud. 9.

Google Scholar

Baloğlu, M., Abbassi, A., and Kesici, S. (2017). Multivariate relationships between statistics anxiety and motivational beliefs. Education 137, 430–444.

Google Scholar

Baloğlu, M. (2002). Psychometric properties of the statistics anxiety rating scale. Psychol. Rep. 90, 315–325. doi: 10.2466/pr0.2002.90.1.315,

PubMed Abstract | Crossref Full Text | Google Scholar

Baloğlu, M. (2003). Individual differences in statistics anxiety among college students. Pers. Individ. Differ. 34, 855–865. doi: 10.1016/S0191-8869(02)00076-4

Crossref Full Text | Google Scholar

Baloğlu, M., and Zelhart, P. (2004). Üniversite öğrencileri arasında yüksek ve düşük istatistik kaygısının ayrıştırıcıları. Eğitim ve Bilim 29.

Google Scholar

Baloğlu, M., Deniz, M. E., and Kesici, Ş. (2011). A descriptive study of individual and cross-cultural differences in statistics anxiety. Learn. Individ. Differ. 21, 387–391. doi: 10.1016/j.lindif.2011.03.003

Crossref Full Text | Google Scholar

Bathurst, N. G. 2015 Using a self-management project to improve student performance in an online introductory statistics course. San José State University

Google Scholar

Bektaş, H., Akman, S., and Yeşilaltay, E. (2021). İstatistik kaygı ölçeğinin (SAS) psikometrik özelliklerinin incelenmesi. İstanbul Gelişim Üniversitesi Sosyal Bilimler Dergisi 8, 1–14. doi: 10.17336/igusbd.627197

Crossref Full Text | Google Scholar

Bell, A. A. 2022 Social desirability and other predictors of statistics anxiety at the graduate level (doctoral dissertation, Capella University)

Google Scholar

Beurze, S. M., Donders, A. R. T., Zielhuis, G. A., de Vegt, F., and Verbeek, A. L. (2013). Statistics anxiety: a barrier for education in research methodology for medical students? Med. Sci. Educ. 23, 377–384.

Google Scholar

Bourne, V. J., and Nesbit, R. J. (2018). Do attitudes towards statistics influence the decision to study psychology at degree level? A pilot investigation. Psychol. Teach. Rev. 24, 55–63. doi: 10.53841/bpsptr.2018.24.2.55

Crossref Full Text | Google Scholar

Bui, N. H., and Alfaro, M. A. (2011). Statistics anxiety and science attitudes: age, gender, and ethnicity factors. Coll. Stud. J. 45, 573–585.

Google Scholar

Casinillo, E. L., Lina, E. R., Casinillo, L. F., Batidor, P. G., and Lebante, M. R. (2022). Econometric evidence on statistical anxiety of engineering students during the new normal setup. Philipp. Soc. Sci. J. 5, 9–17. doi: 10.52006/main.v5i4.564

Crossref Full Text | Google Scholar

Chau, Q. 2018 Exploration of statistics anxiety among doctoral students in health sciences related disciplines Seton Hall University

Google Scholar

Chew, K. H. P., and Dillon, D. B. (2014a). “Individual differences in statistics anxiety among students in Singapore” in Proceedings of the international conference on managing the Asian century: ICMAC 2013 (Singapore: Springer Singapore), 293–302.

Google Scholar

Chew, K. H. P., and Dillon, D. B. (2014b). Statistics anxiety and the big five personality factors. Procedia Soc. Behav. Sci. 112, 1177–1186. doi: 10.1016/j.sbspro.2014.01.1282

Crossref Full Text | Google Scholar

Chew, P. K., and Dillon, D. B. (2014c). Reliability and validity of the statistical anxiety scale among students in Singapore and Australia. J. Trop. Psychol. 4:e7. doi: 10.1017/jtp.2014.7

Crossref Full Text | Google Scholar

Chew, P. K., Dillon, D. B., and Swinbourne, A. L. (2018). An examination of the internal consistency and structure of the statistical anxiety rating scale (STARS). PLoS One 13:e0194195. doi: 10.1371/journal.pone.0194195,

PubMed Abstract | Crossref Full Text | Google Scholar

Chew, P. K., Swinbourne, A., and Dillon, D. B. (2014). No evidence of attentional bias in statistics anxiety. Eur. J. Soc. Behav. Sci. 10, 1451–1465. doi: 10.15405/ejsbs.131

Crossref Full Text | Google Scholar

Chiesi, F., Primi, C., and Carmona, J. (2011). Measuring statistics anxiety: cross-country validity of the statistical anxiety scale (SAS). J. Psychoeduc. Assess. 29, 559–569. doi: 10.1177/0734282911404985

Crossref Full Text | Google Scholar

Collins, K. M., and Onwuegbuzie, A. J. (2006). “Relatıonshıp between Readıng Abılıty and Statıstıcs Anxıety among Afrıcan-Amerıcan graduate students: Implıcatıons for the Teachıng and Learnıng of Statıstıcs” in 7th international conference on teaching statistics (Bahia, Brazil).

Google Scholar

Collins, K. M., and Onwuegbuzie, A. J. (2007). I cannot read my statistics textbook: the relationship between reading ability and statistics anxiety. J. Negro Educ. 7, 118–129.

Google Scholar

Cravalho, P. F. 2010 Learning statistics using concept maps: effects on anxiety and performance. San Jose state university

Google Scholar

de Vink, I. C. 2017 The relationship between statistics anxiety and statistical performance (bachelor's thesis)

Google Scholar

DeVaney, T. A. (2016). Confirmatory factor analysis of the statistical anxiety rating scale with online graduate students. Psychol. Rep. 118, 565–586. doi: 10.1177/0033294116644093,

PubMed Abstract | Crossref Full Text | Google Scholar

Dunn, K. (2014). Why wait? The influence of academic self-regulation, intrinsic motivation, and statistics anxiety on procrastination in online statistics. Innov. High. Educ. 39, 33–44. doi: 10.1007/s10755-013-9256-1

Crossref Full Text | Google Scholar

Durak, I., and Karagöz, Y. (2021). Adaptation of statistics anxiety scale to Turkish: validity and reliability study. Int. J. Assess. Tools Educ. 8, 667–683. doi: 10.21449/ijate.863225

Crossref Full Text | Google Scholar

Earp, M. S. 2007 Development and validation of the statistics anxiety measure (doctoral dissertation, University of Denver)

Google Scholar

Edirisooriya, M. L., and Lipscomb, T. J. (2021). Gender influence on statistics anxiety among graduate students. J. Res. Sci. Math. Technol. Educ. 4, 63–74. doi: 10.31756/jrsmte.421,

PubMed Abstract | Crossref Full Text | Google Scholar

Elmore, P. B. (1993). Statistics achievement: A function of attitudes and related experiences.

Google Scholar

Eshet, Y., Steinberger, P., and Grinautsky, K. (2021). Relationship between statistics anxiety and academic dishonesty: a comparison between learning environments in social sciences. Sustainability 13:1564. doi: 10.3390/su13031564

Crossref Full Text | Google Scholar

Eshet, Y., Steinberger, P., and Grinautsky, K. (2022). Does statistics anxiety impact academic dishonesty? Academic challenges in the age of distance learning. Int. J. Educ. Integr. 18, 1–19. doi: 10.1007/s40979-022-00117-w,

PubMed Abstract | Crossref Full Text | Google Scholar

Esnard, T. R., Alladin, F. M., and Samlal, K. C. (2021). Prior mathematics performance, statistics anxiety, self-efficacy and expectations for performance in statistics: a survey of social sciences students in a Caribbean institution of higher education. Stat. Educ. Res. J. 20, 4–4. doi: 10.52041/serj.v20i1.98

Crossref Full Text | Google Scholar

Fabbricatore, R., Parola, A., Pepicelli, G., and Palumbo, F. (2022). Successful factors in statistics learning for non-STEM courses students: a PLS-PM approach. Statistic. Applic. Ital. J. Appl. Statistic., (2). doi:doi: 10.26398/IJAS.0034-010

Crossref Full Text | Google Scholar

Faber, G., and Drexler, H. (2019). Predicting education science students' statistics anxiety: the role of prior experiences within a framework of domain-specific motivation constructs. High. Learn. Res. Commun. 9:n1. doi: 10.18870/hlrc.v9i1.435

Crossref Full Text | Google Scholar

Faber, G., Drexler, H., Stappert, A., and Eichhorn, J. (2018). Education science students’ statistics anxiety: developing and analyzing a scale for measuring their worry, avoidance, and emotionality cognitions. Int. J. Educ. Psychol. 7, 248–285. doi: 10.17583/ijep.2018.2872

Crossref Full Text | Google Scholar

Fairlamb, S., Papadopoulou, H., and Bourne, V. J. (2022). Reach for the STARS? The role of academic contingent self-worth in statistics anxiety and learning. Learn. Motiv. 78:101815. doi: 10.1016/j.lmot.2022.101815

Crossref Full Text | Google Scholar

Fernando, W. W. D. P., Wathsanie, M. V., Kavinda, R. P. M., and Lankeshwara, P. (2020). Identifying the causative factors for statistical anxiety among undergraduates (with special reference to general sir John Kotelawala defence university). J. Univers. Kelaniya 34:80. doi: 10.4038/kalyani.v34i1.49

Crossref Full Text | Google Scholar

Ferrando, P. J., and Navarro-González, D. (2018). Assessing the quality and usefulness of factor-analytic applications to personality measures: a study with the statistical anxiety scale. Pers. Individ. Differ. 123, 81–86. doi: 10.1016/j.paid.2017.11.014

Crossref Full Text | Google Scholar

Förster, M., and Maur, A. (2015). Statistics anxiety and selfconcept of beginning students in the social sciences–a matter of gender and socio-cultural background. Zeitschrift für Hochschulentwicklung 10, 67–90.

Google Scholar

Frey-Clark, M., Natesan, P., and O’Bryant, M. (2019). Assessing statistical anxiety among online and traditional students. Front. Psychol. 10:1440. doi: 10.3389/fpsyg.2019.01440

Crossref Full Text | Google Scholar

Gibeau, R. M., Maloney, E. A., Béland, S., Lalande, D., Cantinotti, M., Williot, A., et al. (2023). The correlates of statistics anxiety: relationships with spatial anxiety, mathematics anxiety and gender. J. Numer. Cogn. 9, 16–43. doi: 10.5964/jnc.8199

Crossref Full Text | Google Scholar

Grajzel, K. (2019). Validation of the statistical anxiety scale among college students in the United States : University of Colorado Colorado Springs.

Google Scholar

Güler, N., Taşdelen-Teker, G., and İlhan, M. (2019). Lisansüstü eğitim öğrencilerine yönelik istatistik kaygısı ölçeği’nin Türkçeye uyarlanması. J. Measure. Eval. Educ. Psychol. 10, 435–450. doi: 10.21031/epod.550765

Crossref Full Text | Google Scholar

Hanna, D., Shevlin, M., and Dempster, M. (2008). The structure of the statistics anxiety rating scale: a confirmatory factor analysis using UK psychology students. Pers. Individ. Differ. 45, 68–74. doi: 10.1016/j.paid.2008.02.021

Crossref Full Text | Google Scholar

Heretick, D. M., and Tanguma, J. (2021). Anxiety and attitudes toward statistics and research among younger and older nontraditional adult learners. J. Contin. High. Educ. 69, 87–99. doi: 10.1080/07377363.2020.1784690

Crossref Full Text | Google Scholar

Hoegler, S., and Nelson, M. (2018). The influence of anxiety and self-efficacy on statistics performance: a path analysis. Psi Chi J. Psychol. Res. 23, 364–375. doi: 10.24839/2325-7342.JN23.5.364

Crossref Full Text | Google Scholar

Hsiao, T. Y. (2010). The statistical anxiety rating scale: further evidence for multidimensionality. Psychol. Rep. 107, 977–982. doi: 10.2466/07.11.PR0.107.6.977-982,

PubMed Abstract | Crossref Full Text | Google Scholar

Hsiao, T. Y., and Chiang, S. (2011). Gender differences in statistics anxiety among graduate students learning English as a foreign language. Soc. Behav. Pers. 39, 41–42. doi: 10.2224/sbp.2011.39.1.41

Crossref Full Text | Google Scholar

Hubbard, A. 2022 Motivation and statistics anxiety among adult online learners: The mediating role of self-regulated learning (doctoral dissertation, Walden University)

Google Scholar

Idika, D. O., and Ojong, B. M. Demographıc Characterıstıcs and response effect of Statıstıcal Anxıety among Unıversıty students in cross Rıver state, Nıgerıa. International Journal of Scientific Research and Engineering Development, 3, 1024–1040.

Google Scholar

Igbokwe, D. O., Oyewole, F. O., Prekake, T. S., and Agoha, B. C. E. (2017). Factor structure, reliability and validity of two statistical anxiety measures in a sample of Nigerian psychology undergraduates. J. Nigerian Psychol. Res. Abuja 5.

Google Scholar

Jiao, Q. G., and Onwuegbuzie, A. J. (2012). The role of statıstıcs anxıety in cooperatıve learnıng groups in graduate-level research methodology courses. Int. J. Univ. Teach. Fac. Dev. 3:311.

Google Scholar

Kafadar, T. (2022). Sosyal Bilgiler Eğitimi Lisansüstü Öğrencilerinin İstatistik Kaygılarının İncelenmesi. Sakarya Üniversitesi Eğitim Fakültesi Dergisi 22, 226–236. doi: 10.53629/sakaefd.1195896

Crossref Full Text | Google Scholar

Kangni, A. C. 2021 The influence of the instructor’s visibility in an instructional statistics video on student’s perceived social presence, anxiety, and performance (master's thesis, University of Twente)

Google Scholar

Tonsing, K. N. (2018). Instructor immediacy and statistics anxiety in social work undergraduate students. Soc. Work. Educ. 37, 223–233. doi: 10.1080/02615479.2017.1395009

Crossref Full Text | Google Scholar

Keeley, J., Zayac, R., and Correia, C. (2008). Curvilinear relationships between statistics anxiety and performance among undergraduate students: evidence for optimal anxiety. Stat. Educ. Res. J. 7, 4–15. doi: 10.52041/serj.v7i1.477

Crossref Full Text | Google Scholar

Kesici, Ş., Baloğlu, M., and Deniz, M. E. (2011). Self-regulated learning strategies in relation with statistics anxiety. Learn. Individ. Differ. 21, 472–477. doi: 10.1016/j.lindif.2011.02.006

Crossref Full Text | Google Scholar

Lalande, D., Cantinotti, M., Williot, A., Gagnon, J., and Cousineau, D. (2019). Three pathways from achievement goals to academic performance in an undergraduate statistics course. Stat. Educ. Res. J. 18, 94–105. doi: 10.52041/serj.v18i1.152

Crossref Full Text | Google Scholar

Larwin, K., and Harvey, M. (2012). A demonstration of a systematic item-reduction approach using structural equation modeling. Pract. Assess. Res. Eval., 17, 8. doi:doi: 10.7275/0nem-w659

Crossref Full Text | Google Scholar

Lavasani, M. G., Weisani, M., and Shariati, F. (2014). The role of achievement goals, academic motivation in statistics anxiety: testing a causal model. Procedia Soc. Behav. Sci. 114, 933–938. doi: 10.1016/j.sbspro.2013.12.810

Crossref Full Text | Google Scholar

Lavidas, K., Manesis, D., and Gialamas, V. (2021). Investigation of the statistical anxiety rating scale psychometric properties with a sample of Greek students. Int. J. Educ. Psychol. 10, 116–142. doi: 10.17583/ijep.2021.6032

Crossref Full Text | Google Scholar

Levpušček, M. P., and Cukon, M. (2022). That old devil called ‘statistics’: statistics anxiety in university students and related factors. Cent. Educ. Policy Stud. J. 12, 147–168. doi: 10.26529/cepsj.826

Crossref Full Text | Google Scholar

Lin, Y. H. (2021). Clustering and profile analysis on statistics anxiety styles of undergraduate students. Int. J. Intel. Technol. Appl. Statistic. 14. doi: 10.6148/IJITAS.202112.14(4).0004

Crossref Full Text | Google Scholar

Lin, Y. J., and Tang, H. (2017). Exploring student perceptions of the use of open educational resources to reduce statistics anxiety. J. Form. Des. Learn. 1, 110–125. doi: 10.1007/s41686-017-0007-z

Crossref Full Text | Google Scholar

Liu, L. (2021). Assessing statistics anxiety in an online or hybrid setting: the adaption and development of a new instrument SASOH. Stat. Educ. Res. J. 20, 2–2. doi: 10.52041/serj.v20i1.96

Crossref Full Text | Google Scholar

Liu, S., Onwuegbuzie, A. J., and Meng, L. (2011). Examination of the score reliability and validity of the statistics anxiety rating scale. J. Educ. Enq. 11.

Google Scholar

Lorenzo Seva, U., Vigil Colet, A., and Ferrando Piera, P. J. (2022). Development of a revised version of the statistical anxiety scale. Psicothema. doi: 10.7334/psicothema2022.231

Crossref Full Text | Google Scholar

Maat, S. M., and Rosli, M. K. (2016). The Rasch model analysis for statistical anxiety rating scale (STARS). Creat. Educ. 07, 2820–2828. doi: 10.4236/ce.2016.718261,

PubMed Abstract | Crossref Full Text | Google Scholar

Maat, S. M., Mohd, N., Ahmad, C. N. C., and Puteh, M. (2022). Statistical anxiety and achievement among university students. TEM J., 11(1),:420. MacArthur, K. R. (2020). Avoiding over-diagnosis: Exploring the role of gender in changes over time in statistics anxiety and attitudes. Numeracy (1), 4. doi:doi: 10.18421/TEM111-53

Crossref Full Text | Google Scholar

Maat, S. M., Zakaria, E., and Rosli, M. K. (2017). The validation of statistical anxiety rating scale using confirmatory factor analysis among graduate students. Int. J. Novel Res. Educ. Learn. 4, 160–167.

Google Scholar

MacArthur, K. R. (2020). Avoiding over-diagnosis: exploring the role of gender in changes over time in statistics anxiety and attitudes. Numeracy 13:4. doi: 10.5038/1936-4660.13.1.4

Crossref Full Text | Google Scholar

MacArthur, K. R., and Santo, J. B. (2023). A multi-level analysis of the effects of statistics anxiety/attitudes on trajectories of exam scores. J. Stat. Data Sci. Educ. 31, 102–112. doi: 10.1080/26939169.2022.2093805

Crossref Full Text | Google Scholar

Macher, D., Paechter, M., Papousek, I., and Ruggeri, K. (2012). Statistics anxiety, trait anxiety, learning behavior, and academic performance. Eur. J. Psychol. Educ. 27, 483–498. doi: 10.1007/s10212-011-0090-5

Crossref Full Text | Google Scholar

Macher, D., Paechter, M., Papousek, I., Ruggeri, K., Freudenthaler, H. H., and Arendasy, M. (2013). Statistics anxiety, state anxiety during an examination, and academic achievement. Br. J. Educ. Psychol. 83, 535–549. doi: 10.1111/j.2044-8279.2012.02081.x,

PubMed Abstract | Crossref Full Text | Google Scholar

Mandap, C. M. (2016). Examining gender differences in statistics anxiety among college students. Int. J. Educ. Res. 4, 358–366.

Google Scholar

Maysick, D. J. 1985 The relationship of self-concept, sex, age, major, anxiety, mathematics background, and mathematics ability to the level of mathematics and statistical anxiety among college juniors. Andrews University

Google Scholar

McCaughey, N. J., Hill, T. G., and Mackinnon, S. P. (2022). The association of self-efficacy, anxiety sensitivity, and perfectionism with statistics and math anxiety. Pers. Sci. 3, 1–23. doi: 10.5964/ps.7091

Crossref Full Text | Google Scholar

McDonald, S., and Barnard, M. P. (2023). The influence of prior experience with mathematics and A-level science subjects on statistics anxiety in undergraduate psychology students. Psychol. Teach. Rev. 29, 37–50. doi: 10.53841/bpsptr.2023.29.1.37

Crossref Full Text | Google Scholar

McGrath, A. L., Ferns, A., Greiner, L., Wanamaker, K., and Brown, S. (2015). Reducing anxiety and increasing self-efficacy within an advanced graduate psychology statistics course. Can. J. Scholarsh. Teach. Learn. 6:5. doi: 10.5206/cjsotl-rcacea.2015.1.5

Crossref Full Text | Google Scholar

McIntee, S. E., Goulet-Pelletier, J. C., Williot, A., Deck-Léger, E., Lalande, D., Cantinotti, M., et al. (2022). (mal) adaptive cognitions as predictors of statistics anxiety. Stat. Educ. Res. J. 21:5. doi: 10.52041/serj.v21i1.364

Crossref Full Text | Google Scholar

Mji, A. (2009). Differences in university students' attitudes and anxiety about statistics. Psychol. Rep. 104, 737–744. doi: 10.2466/PR0.104.3.737-744,

PubMed Abstract | Crossref Full Text | Google Scholar

Mji, A., and Onwuegbuzie, A. J. (2004). Evidence of score reliability and validity of the statistical anxiety rating scale among technikon students in South Africa. Meas. Eval. Couns. Dev. 36, 238–251. doi: 10.1080/07481756.2004.11909745

Crossref Full Text | Google Scholar

Mokhele, M. 2018 The association of statistics anxiety, attitude toward statistics and mathematics self-concept with performance in a business statistics course (doctoral dissertation, University of the Free State)

Google Scholar

Najmi, A., Raza, S. A., and Qazi, W. (2018). Does statistics anxiety affect students' performance in higher education? The role of students' commitment, self-concept and adaptability. Int. J. Manag. Educ. 12, 95–113. doi: 10.1504/IJMIE.2018.090705

Crossref Full Text | Google Scholar

Nesbit, R. J., and Bourne, V. J. (2018). Statistics anxiety rating scale (STARS) use in psychology students: a review and analysis with an undergraduate sample. Psychol. Teach. Rev. 24, 101–110. doi: 10.53841/bpsptr.2018.24.2.101

Crossref Full Text | Google Scholar

Neumann, D. L., and Hood, M. (2009). The effects of using a wiki on student engagement and learning of report writing skills in a university statistics course. Australas. J. Educ. Technol. 25. doi: 10.14742/ajet.1141

Crossref Full Text | Google Scholar

Nielsen, T., and Kreiner, S. (2018). Measuring statistical anxiety and attitudes towards statistics: development of a comprehensive Danish instrument (HFS-R). Cogent Educ. 5:1521574. doi: 10.1080/2331186X.2018.1521574

Crossref Full Text | Google Scholar

O’Bryant, M., Natesan Batley, P., and Onwuegbuzie, A. J. (2021). Validation of an adapted version of the statistical anxiety scale in English and its relationship to attitudes toward statistics. SAGE Open 11:21582440211001378. doi: 10.1177/21582440211001378

Crossref Full Text | Google Scholar

O'Bryant, M. J. (2017). How attitudes towards statistics courses and the field of statistics predicts statistics anxiety among undergraduate social science majors: a validation of the statistical anxiety scale. Ann Arbor, MI: ProQuest LLC.

Google Scholar

Ogbogo, S., and Amadi, G. N. (2018). Statistics self-efficacy and learning styles as predictor of statistics anxiety. Int. J. Sci. Res. Manag. 6, 240–248. doi: 10.18535/ijsrm/v6i4.el06

Crossref Full Text | Google Scholar

Onwuegbuzie, A. J. 1993 The interaction of statistics test anxiety and examination condition in statistics achievement of post-baccalaureate non-statistics majors. University of South Carolina

Google Scholar

Onwuegbuzie, A. J. (1999). Statistics anxiety among African American graduate students: an affective filter? J. Black Psychol. 25, 189–209. doi: 10.1177/0095798499025002004

Crossref Full Text | Google Scholar

Onwuegbuzie, A. J. (2000a). I'll begin my statistics assignment tomorrow: the relationship between statistics anxiety and academic procrastination.

Google Scholar

Onwuegbuzie, A. J. (2000b). Statistics anxiety and the role of self-perceptions. J. Educ. Res. 93, 323–330. doi: 10.1080/00220670009598724

Crossref Full Text | Google Scholar

Onwuegbuzie, A. J. (2004). Academic procrastination and statistics anxiety. Assess. Eval. High. Educ. 29, 3–19. doi: 10.1080/0260293042000160384

Crossref Full Text | Google Scholar

Onwuegbuzie, A. J., and Daley, C. E. (1999). Perfectionism and statistics anxiety. Pers. Individ. Differ. 26, 1089–1102. doi: 10.1016/S0191-8869(98)00214-1

Crossref Full Text | Google Scholar

Onwuegbuzie, A. J., DaRos, D., and Ryan, J. M. (1997). The components of statistics anxiety: a phenomenological study. Focus. Learn. Probl. Math. 19, 11–35. doi: 10.46743/2160-3715/2015.2101

Crossref Full Text | Google Scholar

Pan, W., and Tang, M. (2004). Examining the effectiveness of innovative instructional methods on reducing statistics anxiety for graduate students in the social sciences. J. Instr. Psychol. 31.

Google Scholar

Papousek, I., Ruggeri, K., Macher, D., Paechter, M., Heene, M., Weiss, E. M., et al. (2012). Psychometric evaluation and experimental validation of the statistics anxiety rating scale. J. Pers. Assess. 94, 82–91. doi: 10.1080/00223891.2011.627959,

PubMed Abstract | Crossref Full Text | Google Scholar

Paul, L., Parveen, T., Ahmed, O., and Aktar, R. (2018). Adaptation study of the statistical anxiety scale on a Bangladeshi sample. Bulg. J. Sci. Educ. Policy 12, 380–401.

Google Scholar

Perepiczka, M., Chandler, N., and Becerra, M. (2011). Relationship between graduate students' statistics self-efficacy, statistics anxiety, attitude toward statistics, and social support. Prof. Counselor 1, 99–108. doi: 10.15241/mpa.1.2.99

Crossref Full Text | Google Scholar

Pretorius, T. B., and Norman, A. M. (1992). Psychometric data on the statistics anxiety scale for a sample of south African students. Educ. Psychol. Meas. 52, 933–937. doi: 10.1177/0013164492052004015

Crossref Full Text | Google Scholar

Primi, C., and Chiesi, F. (2016). “Statistics anxiety: a mediator in learning probability” in 13th international congress on mathematical education, 1–7.

Google Scholar

Primi, C., and Chiesi, F. (2018). “The role of mathematics anxiety and statistics anxiety in learning statistics” in 10th international conference on teaching statistics (Kyoto, Japan). Retrieved fromhttps://iase-web.org/icots/10/proceedings/pdfs/ICOTS10_5E2.pdf

Google Scholar

Ratanaolarn, T. (2016). The development of a structural equation model of graduate students’ statistics achievement. J. Behav. Sci. 11, 153–168. https://so06.tci-thaijo.org/index.php/IJBS/article/view/63288

Google Scholar

Rodarte-Luna, B., and Sherry, A. (2008). Sex differences in the relation between statistics anxiety and cognitive/learning strategies. Contemp. Educ. Psychol. 33, 327–344. doi: 10.1016/j.cedpsych.2007.03.002

Crossref Full Text | Google Scholar

Rosli, M. K., Mistima, S., and Rosli, N. (2017). Students’ attitude and anxiety towards statistics a descriptive analysis. Res. Educ. Psychol. 1, 47–56.

Google Scholar

Saidi, S. S., and Siew, N. M. (2022). Assessing secondary school students’ statistical reasoning, attitude towards statistics, and statistics anxiety. Stat. Educ. Res. J. 21, 6–6. doi: 10.52041/serj.v21i1.67

Crossref Full Text | Google Scholar

Sandoz, E. K., Butcher, G., and Protti, T. A. (2017). A preliminary examination of willingness and importance as moderators of the relationship between statistics anxiety and performance. J. Context. Behav. Sci. 6, 47–52. doi: 10.1016/j.jcbs.2017.02.002

Crossref Full Text | Google Scholar

Sesé, A. J., Jiménez López, R., Montaño Moreno, J. J., and Palmer Pol, A. L. (2015). Can attitudes toward statistics and statistics anxiety explain students' performance?. J. Psychodidactics, 2015, vol. 20, num., p. 285–304.

Google Scholar

Sockol, L. E., Ellison, W. D., Stutts, L. A., and Knouse, L. E. (2021). Improving quantitative abilities and attitudes in clinical psychology courses: longitudinal assessment of a blended learning intervention. Teach. Psychol. 48, 316–327. doi: 10.1177/0098628320979894

Crossref Full Text | Google Scholar

Starcher, K. (2016). Reducing statistics anxiety in undergraduate business students. Curr. ACADEMIC 19.

Google Scholar

Stefanovska–Petkovska, M., and Velikj, V. (2020). Statistics for all: analyses of attitudes and anxiety towards learning statistics in undergraduate and postgraduate students in medical and management sciences. For the publisher 114.

Google Scholar

Steinberger, P. (2020). Assessing the statistical anxiety rating scale as applied to prospective teachers in an Israeli teacher-training college. Stud. Educ. Eval. 64:100829. doi: 10.1016/j.stueduc.2019.100829

Crossref Full Text | Google Scholar

Steinberger, P., Eshet, Y., and Grinautsky, K. (2021). No anxious student is left behind: statistics anxiety, personality traits, and academic dishonesty—lessons from COVID-19. Sustainability 13:4762. doi: 10.3390/su13094762

Crossref Full Text | Google Scholar

Teman, E. D. (2013a). A Rasch analysis of the statistical anxiety rating scale. J. Appl. Meas. 14, 414–434,

PubMed Abstract | Google Scholar

Teman, E. D. (2013b). Factorial invariance of the statistical anxiety rating scale across sex and students' classification. Compr. Psychol. 2, 02–20. doi: 10.2466/02.20.CP.2.1

Crossref Full Text | Google Scholar

Tran, M. H. (2018). Examining the relationship between statistics anxiety and attentional bias.

Google Scholar

Valle, N., Antonenko, P., Valle, D., Dawson, K., Huggins-Manley, A. C., and Baiser, B. (2021). The influence of task-value scaffolding in a predictive learning analytics dashboard on learners' statistics anxiety, motivation, and performance. Comput. Educ. 173:104288. doi: 10.1016/j.compedu.2021.104288

Crossref Full Text | Google Scholar

van Dijck, J. P., Abrahamse, E., Kesteloot, S., Willems, R., and Fias, W. (2022). Statistics anxiety in Flanders: exploring its level, antecedents, and performance impact across professional and academic bachelor programs in psychology. Int. Electron. J. Elem. Educ. 14, 363–385. doi: 10.26822/iejee.2022.250

Crossref Full Text | Google Scholar

Vigil-Colet, A., Lorenzo-Seva, U., and Condon, L. (2008). Development and validation of the statistical anxiety scale. Psicothema 20, 174–180,

PubMed Abstract | Google Scholar

Watson, F. S., Hess, M. R., Onwuegbuzie, A. J., Ferron, J. M., Kromrey, J. D., Hogarty, K. Y., et al. (2004). “Anxiety and attitudes in graduate statistics education: measuring and monitoring change during the semester” in Annual meeting of the eastern educational research association (Clearwater, Florida).

Google Scholar

Watson, F. S., Lang, T. R., Kromrey, J. D., Ferron, J. M., Hess, M. R., and Hogarty, K. Y. 2003 An assessment blueprint for EncStat: a statistics anxiety intervention program

Google Scholar

Watson, F. S., Lang, Thomas R., and Kromrey, Jeffrey D. 2002 Breaking ground for EncStat: a statistics anxiety intervention program

Google Scholar

Welch, P. S., Jacks, M. E., Smiley, L. A., Walden, C. E., Clark, W. D., and Nguyen, C. A. (2015). A study of statistics anxiety levels of graduate dental hygiene students. Am. Dent. Hyg. Assoc. 89, 46–54.

Google Scholar

Williams, A. (2012). Online homework vs. traditional homework: statistics anxiety and self-efficacy in an educational statistics course. Technol. Innov. Stat. Educ. 6. doi: 10.5070/T561011683

Crossref Full Text | Google Scholar

Williams, A. (2014). An exploration of preference for numerical information in relation to math self-concept and statistics anxiety in a graduate statistics course. J. Stat. Educ. 22. doi: 10.1080/10691898.2014.11889693,

PubMed Abstract | Crossref Full Text | Google Scholar

Williams, A. S. (2010). Statistics anxiety and instructor immediacy. J. Stat. Educ. 18:187. doi: 10.1080/10691898.2010.11889495

Crossref Full Text | Google Scholar

Williams, A. S. (2013). Worry, intolerance of uncertainty, and statistics anxiety. Stat. Educ. Res. J. 12, 48–59. doi: 10.52041/serj.v12i1.321

Crossref Full Text | Google Scholar

Williams, A. S. (2015). Statistics anxiety and worry: the roles of worry beliefs, negative problem orientation, and cognitive avoidance. Stat. Educ. Res. J. 14, 53–75. doi: 10.52041/serj.v14i2.261

Crossref Full Text | Google Scholar

Wu, R., Chen, J., Li, Q., and Zhou, H. (2022). Reducing the influence of perfectionism and statistics anxiety on college student performance in statistics courses. Front. Psychol. 13:1011278. doi: 10.3389/fpsyg.2022.1011278,

PubMed Abstract | Crossref Full Text | Google Scholar

Yaffe, Y., Bouhnik, O., and Burg, D. (2020). Further examination of the structure of the statistical anxiety rating scale (STARS) in international use. J. Appl. Meas. 21, 235–242.

Google Scholar

Yeşildağ, A. Y. (2022). Sağlık Yönetimi Alanında Lisansüstü Eğitim Gören Öğretim Elemanlarının İstatistiksel Kaygı Düzeylerinin İncelenmesi. Avrupa Bilim Teknoloji Dergisi, (45), 88–92. doi:doi: 10.31590/ejosat.1222420

Crossref Full Text | Google Scholar

Younesi, S., Tavakoli, V., Hosseini, S., and Hashemizadeh, V. (2014). The relation between emotional self-efficacy and meta-cognitive believes with statistical anxiety. Indian J. Fundam. Appl. Life Sci. 4, 433–440.

Google Scholar

Yulianto, A. (2022). The effect of statistics anxiety on statistics courses achievement in psychology students. Cendikia: Media Jurnal Ilmiah Pendidikan 13. http://www.iocscience.org/ejournal/index.php/Cendikia

Google Scholar

Yunis, F. A. (2011). “Factors influencing the psychology student in dealing with statistics courses” in Avaliação Psicológica: Guia de consulta para estudantes e profissionais de psicologia. eds. R. Ambiel, et al. (Brasil–São Paulo: Casa da Psicologia).

Google Scholar

Zahan, F. N., Kawsar, L. A., Islam, M. A., and Bhuia, M. R. (2020). Internal consistency and structure of the statistics anxiety rating scale: a multivariate analysis among undergraduate students of SUST, Sylhet.

Google Scholar

Zhang, J. W., Kessler, E., and Braasch, J. L. (2021). Self-compassion mindsets can predict statistics course performance via intelligence mindsets and statistics anxiety. Learn. Individ. Differ. 90:102047. doi: 10.1016/j.lindif.2021.102047

Crossref Full Text | Google Scholar

Zimmerman, W. A., and Johnson, G. (2017). Exploring factors related to completion of an online undergraduate-level introductory statistics course. Online Learn. 21, 191–205. doi: 10.24059/olj.v21i3.1017

Crossref Full Text | Google Scholar

Keywords: reliability generalization, meta-analysis, reliability induction, statistics anxiety, reliability

Citation: Özdemir EÖ and Yildirim I (2026) A reliability generalization meta-analysis of self-report measures of statistics anxiety. Front. Psychol. 16:1675957. doi: 10.3389/fpsyg.2025.1675957

Received: 29 July 2025; Revised: 10 November 2025; Accepted: 24 November 2025;
Published: 23 January 2026.

Edited by:

Silvia Cristina da Costa Dutra, University of Zaragoza, Spain

Reviewed by:

Leonardo Carlucci, University of Foggia, Italy
Nail Ilhan, Inonu University, Türkiye

Copyright © 2026 Özdemir and Yildirim. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ibrahim Yildirim, aXlpbGRpcmltODRAZ21haWwuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.