- 1Department of Educational Sciences, Gaziantep University, Gaziantep, Türkiye
- 2Ministry of National Education, Gaziantep, Türkiye
Objective and Method: In this study, it was aimed to obtain a general reliability coefficient for each of the statistics anxiety scales. For this purpose, Web of Science, ERIC and Scopus databases and Google Scholar search engine were searched according to certain criteria and 84 Cronbach’s alpha coefficients reported for the whole scale were reached. Reliability generalization meta-analysis method was applied to the obtained alpha coefficients within the scope of meta-analysis, which is a quantitative method.
Results and Conclusion: The mean value of the alpha coefficients for which the transformation formula was applied was found to be.927 under the random effects model, and the findings were statistically significant. In addition, the mean alpha value of each statistics anxiety scale was.931 for STARS,.917 for SAS,.918 for SAS-10 and.951 for WAESTA. Analog to the ANOVA and meta-regression analyses were conducted to reveal the heterogeneity of alpha coefficients in the overall analysis. Analog to ANOVA was applied for five different categorical variables, and according to the findings, it was observed that the mean alpha value differed statistically significantly depending on the scale type variable. Moreover, it was found that the mean scale score and the standard deviation of the mean scale score were statistically significant predictors of the mean alpha value.
1 Introduction
Statistics is a branch of science that covers the processes of making inferences from numerical data related to the organization, analysis or interpretation of data collected in daily life or school life. Statistics, which is widely used, has become a branch of science utilized in many different fields such as engineering, health sciences, social sciences and natural sciences (Akdeniz, 2015). An individual who will conduct a scientific study is expected to have some competencies in order to collect data, to bring these data together in a certain order and to interpret them correctly (Erkuş, 2011). In order to have these competencies, statistics education has become a necessity for individuals receiving undergraduate and graduate education (Onwuegbuzie, 1993).
The fact that statistics course is taught as a compulsory course to undergraduate and graduate students has brought along the need to examine the factors of success or failure in this course. Success in statistics course is affected by affective factors as well as cognitive factors (Yaşar, 2014). Feedback about the course (Williams, 2012), difficulties encountered in the course (Feinberg and Halperin, 1978), and past experiences associated with the mathematics course (Maysick, 1985) are among the cognitive factors affecting success in statistics course. The factors affecting the success of this course, such as the past academic experiences of each student taking the course, negative attitudes towards the teacher teaching the course, and the perception of interpreting numerical data paved the way for the formation of concerns about the course (Baloğlu, 2017). As a result of the formation of such concerns about the statistics course in individuals, the concept of “statistics anxiety” was added to the literature. Onwuegbuzie (1993) defines statistics anxiety as the state of anxiety that occurs when an individual encounters statistical processes such as organizing, analyzing or interpreting data at any time.
Since the ability to interpret numerical data in statistics is similar to the content of the mathematics course, the first studies on statistics anxiety tried to explain it with mathematics anxiety (Schacht and Stewart, 1990). There are also studies in which the relationship between statistical course success and situational anxiety was tried to be explained (Bendig and Hughes, 1954; Fisch, 1971). In this case, in the most general sense, statistical anxiety is examined under three subcomponents: situational causes, environmental causes and characterological causes (Onwuegbuzie, 1993). Situational causes of statistics anxiety include the role of the instructor teaching the statistics course (O’Bryant et al., 2021; Tonsing, 2018), pedagogical behaviors (Asare, 2023), course feedback (Williams, 2012), course content and pace of the course (Pan and Tang, 2004). Environmental causes of statistics anxiety are classified as the age (Bui and Alfaro, 2011), gender (Edirisooriya and Lipscomb, 2021), and past experiences with mathematics (Maysick, 1985). Under the characterological causes of statistical anxiety, individual characteristics such as academic motivation, learning styles (Kesici et al., 2011), self-perception (Esnard et al., 2021), self-identity (Najmi et al., 2018), and academic resilience (Ali and Gaber, 2022) stand out.
1.1 Statistics anxiety scales
Statistics anxiety scales have been developed by scientists in order to determine the level of this anxiety and to take measures for the result. Among these scales, STARS developed by Cruise et al. (1985) and SAS developed by Vigil-Colet et al. (2008) are the most widely used statistics anxiety scales worldwide. In addition to these scales, the SAS-10 scale, which was revised by Pretorius and Norman (1992) from the Mathematics Anxiety Scale developed by Betz (1978) to a statistics anxiety scale, and the WAESTA scales developed by Faber et al. (2018) are also widely used statistics anxiety scales. These scales have been adapted to many cultures and are widely used. There are also the SAM scale developed by Earp (2007) and the SAI scale developed by Zeidner (1991), which are not widely used.
1.2 Statistics anxiety rating scale (STARS)
STARS, developed by Cruise et al. (1985) on a five-point Likert scale, is the most widely used statistics anxiety scale. As a result of the factor analysis conducted, the STARS scale consisting of 51 items and 6 subscales was obtained. Worth of Statistics subscale consists of 16 items, Interpretation Anxiety subscale consists of 11 items, and Test and Class Anxiety subscale consists of 8 items. In addition, Computation Self-Concept subscale consists of 7 items, Fear of Asking for Help subscale consists of 4 items, and finally Fear of Statistics Teacher subscale consists of 5 items. When examining the subscales of STARS, it is seen that the first three subscales (WS, IA, TCA) are related to statistical anxiety, while the other three subscales (CSC, FAH, FST) are related to statistical competence. The internal consistency coefficient for the reliability of STARS for the sample of 537 participants was reported as 0.96 and the test–retest reliability coefficient was reported as 0.76, and the internal consistency coefficients of the subscales were 0.94, 0.87, 0.68, 0.88, 0.89 and 0.80, respectively.
1.3 Statistical anxiety scale (SAS)
In the SAS developed by Vigil-Colet et al. (2008), it was thought that the subscales of STARS other than statistics anxiety (fear of statistics teacher, self-perception, value of statistics) were not suitable for the purpose of measuring statistics anxiety. It was thought to be more useful since it has fewer items than STARS. Exploratory factor analysis was applied to ensure the validity of the SAS, which is a five-point Likert-type scale consisting of 24 items, and as a result of the analysis, it was decided that the scale had three subscales. Each of these subscales consists of 8 items. The factor loadings of the items in the first subscale vary between 0.88 and 0.57, the factor loadings of the items in the second subscale vary between 0.92 and 0.50, and the factor loadings of the items in the third subscale vary between 0.89 and 0.34. These subscales were named as Examination Anxiety, Asking for Help Anxiety and Interpretation Anxiety. The alpha coefficient for the sample of 159 undergraduate students for whom the SAS was developed was reported as 0.911. In addition, the alpha coefficients of the subscales of the SAS were determined as 0.87, 0.92 and 0.82, respectively.
1.4 WAESTA scale
The WAESTA scale developed by Faber et al. (2018) consists of 17 items. The aim was to create a measurement tool that is simpler and free from conceptual limitations than the scales previously developed in this field. In order to gather validity evidence based on internal structure of the scale, the factor loadings of each item were calculated using principal component analysis and it was seen that the scale consisted of three subscales. These subscales consist of the Worry subscale, Avoidance subscale, and Emotional subscale. Among these subscales, the factor loadings of the items belonging to the Worry subscale (consisting of eight items) ranged between 0.49 and 0.74, and the factor loadings of the items belonging to the Avoidance subscale (consisting of four items) ranged between 0.49 and 0.76. Finally, the factor loadings of the items belonging to the subscale named Emotional cognition (consisting of five items) vary between 0.53 and 0.70. Of the three subscales, The internal consistency coefficient for the sample in which the measurement tool was developed was reported as 0.94 and the split-half reliability coefficient was reported as 0.91.
1.5 The statistics anxiety scale (SAS-10)
The SAS-10 was created by revising the 10-item Mathematics Anxiety Scale (MAS) developed by Betz (1978) by replacing mathematics terms with statistical terms (Pretorius and Norman, 1992). The internal consistency coefficient for the sample in which the unidimensional SAS-10 scale was developed was reported as 0.94 and the split-half reliability were reported as 0.91. The relationship between the SAS-10 and the STAI (State–Trait Anxiety Inventory; Spielberger et al., 1970), whose reliability and validity have been previously proven, was examined and a statistically significant relationship was found between them.
In addition to the developed statistics anxiety scales, there are many statistics anxiety scales adapted to different cultures in the literature. The number of items, the name of the scale and the number of subscales of the adapted scales vary according to the adapted culture. The widespread use of statistics anxiety scales around the world will bring about the differentiation of the conditions of application of the scale. The reliability of a scale is the degree to which it is free from random errors that may occur during its application (Büyüköztürk et al., 2012). The language in which the measurement tool is applied, the length of the test, the objectivity of scoring, the factors related to the instructions of the scale and the factors related to the application conditions will affect the reliability of the sample. In this case, fluctuations in the reliability values in the samples to which the scale is applied are expected. The situation of fluctuations in the reliability coefficient has brought about the need to investigate this variability in a systematic way. The most accurate way to estimate the reliability coefficient for a scale is to systematically combine the reliability coefficients of the studies in which the scale is used (Şen and Yıldırım, 2023). This method is called “reliability generalization” in the literature. The definition of reliability generalization was first used by Vacha-Haase (1998). According to Vacha-Haase (1998), reliability generalization meta-analysis provides important evidence about the amount and source of variation in reliability value. It provides guidance on whether the scale will be appropriate for the sample to which it will be applied (Taylor, 2012). It also provides important evidence for researchers in interpreting data and comparing results (Leech et al., 2011).
Looking at the literature, it was seen that in addition to the studies examining the level of statistical anxiety, the predictors of statistical anxiety were also addressed (Faber and Drexler, 2019; O'Bryant, 2017; Valle et al., 2021; Zhang et al., 2021). There are also studies investigating the relationship of statistics anxiety with affective factors such as perception, attitude, and self-identity (Altun et al., 2021; Kesici et al., 2011; Mji, 2009; Perepiczka et al., 2011; Sesé et al., 2015). Moreover, studies examining the effect of statistics anxiety on variables such as gender, age, and academic performance were also found (Eshet et al., 2022; Hsiao and Chiang, 2011; Sandoz et al., 2017; Valle et al., 2021; Wu et al., 2022). Apart from the studies investigating the variables associated with statistical anxiety, studies in which reliability generalization meta-analysis was conducted were also examined. It was examined whether the sample type (Özdemir et al., 2020; Shields and Caruso, 2003; Wallace and Wheeler, 2002), the type of publication of the study (Ock et al., 2021), the language in which the measurement tool was applied (Kıyıcı and Kahraman, 2022), and the length of the test (Fitzgerald, 1996), which are thought to affect the mean reliability value obtained, affect the mean reliability value. In addition, reliability generalization meta-analysis studies in which continuous variables such as the year the study was published, the mean score of the scale and the mean age of the sample were investigated as predictors of the mean reliability value are also available in the literature (Sen, 2022; Wallace and Wheeler, 2002). There is no study examining the reliability generalization meta-analysis method on statistics anxiety. Increasing the number of such studies is extremely important in terms of identifying previously examined studies in the literature on reliability generalization meta-analysis and reaching more comprehensive results. It is thought that finding a general reliability value of the statistics anxiety scales to be obtained from the current study will contribute to the relevant literature. Moreover, it is thought that it will guide scientists in future studies. In this direction, while analyzing the studies included in the research, answers to the following questions were sought:
1. What is the reliability induction rate of the studies reached during the research process?
2. What is the mean value of the internal consistency coefficients of each statistics anxiety scale and its sub-dimensions included in the study?
3. What is the mean value of the overall internal consistency coefficients of all statistics anxiety scales included in the study?
4. Do the variables such as the measurement tool, type of publication, continent and education level where the scale is applied, and the language used in the studies included in the research have an effect on the mean value of internal consistency coefficient?
5. Is the mean age of the individuals participating in the study, the ratio of the number of women to the number of men, the mean score of the scale, the standard deviation of the mean score, the year of the study, or the number of items predictive of the mean value of internal consistency coefficients?
2 Method
In the current study, reliability generalization meta-analysis method, which is one of the meta-analysis methods, was used by bringing together the reliability coefficients of the studies in which statistics anxiety scales were used. The way to estimate an accurate reliability coefficient for a specific measurement tool is to combine the reliability coefficients obtained from numerous studies (Şen and Yıldırım, 2023, p. 318). The method of combining the results of numerous studies in a meta-analysis to obtain a single result is also used in combining the reliability coefficients of measurement tools. Reliability generalization is a method of estimating a common reliability coefficient by combining the reliability coefficients obtained from a scale used in various studies (Vacha-Haase, 1998, p. 12). In this context, the reliability coefficients in the studies in which the STARS, SAS, WAESTA, SAS-10 scales were used among the statistics anxiety scales were brought together and the reliability generalization meta-analysis method was used for each scale. Moreover, reliability generalization meta-analysis was also performed for the subscales of multidimensional and widely used statistics anxiety scales (STARS, SAS). Furthermore, pooled reliability coefficient was obtained for the whole of the statistics anxiety scales (STARS, SAS, WAESTA, SAS-10, SAM, SAQ). When the reliability coefficient preferred by the studies included in the analysis was examined, it was seen that the most commonly used reliability coefficient was Cronbach’s alpha coefficient, one of the internal consistency coefficients. For this reason, the analyses were conducted with the alpha coefficient.
2.1 Data sources and search strategies
While reviewing the studies, it was aimed to reach all scales measuring statistics anxiety and all studies in which these scales were used without determining a specific year interval. At this context, Web of Science, Scopus and ERIC databases and Google Scholar search engine were searched, respectively. In the inclusion of the studies in the current study, the criteria of being published in English, using a measurement tool that measures statistics anxiety, and reporting the reliability coefficient of the whole scale or its subscales were taken into consideration. In line with these criteria, databases were searched with some keywords as well as the citations of the statistics anxiety scales identified between May 2023 and July 2023. While ERIC, Web of Science and Scopus were searched with the words “statistical anxiety” and “statistics anxiety,” Google Scholar was searched with the search model “scale OR measure OR questionnaire OR inventory” AND “intitle:statistical anxiety” AND “scale OR measure OR questionnaire OR inventory” AND “intitle:statistics anxiety.” As a result of these searches, a total of 1,598 articles, theses or published papers were reached. Meta-analysis was conducted with 84 alpha coefficients obtained from 81 studies as a result of eliminating 1,444 studies that did not meet the above criteria. Furthermore, 73 studies in which only the reliability coefficient of the subscales was reported were also recorded to be used in the reliability generalization meta-analysis of the subscales. In the selection of studies to be included in the meta-analysis, the process should be carried out in accordance with a checklist. In addition to the REGEMA (Reliability Generalization Meta-Analysis) checklist developed by Sanchez‐Meca et al. (2013) for reliability generalization meta-analysis, PRISMA (Preferred Reporting Items for Systematic reviews and Meta-Analysis) developed by Moher et al. (2009) is the most widely used checklist for meta-analysis. The studies included in the analysis are shown in the PRISMA flow diagram in Figure 1.
The 157 reliability coefficients obtained from 154 studies recorded during the data collection phase were reported. Of the 115 studies in which STARS was used, 24 studies reported the alpha coefficient for the whole scale but not for the subscales. And, there are 27 studies in which the alpha coefficient was reported for the whole STARS and subscales. On the other hand, there are 64 studies in which the alpha coefficient is not reported for the whole STARS, but only for the subscales. A similar situation also applies to SAS. Alpha coefficient for the whole SAS was reported in 5 out of 27 studies in which SAS was used. There are 14 studies reporting alpha coefficient for the whole SAS and subscales. On the other hand, there were 8 studies in which the alpha coefficient was reported for subscales but not for the whole SAS, and 5 studies in which the WAESTA scale was used. Four studies using the SAS-10 scale were recorded. One study using SAM and one study using SAQ were also included in the analysis (Earp, 2007; Idika and Ojong, 2020). In addition, studies using two different statistics anxiety scales (Altun et al., 2021; Chew et al., 2018; Grajzel, 2019; Igbokwe et al., 2017) were recorded as a second study with the same name.
In a study using the SAI recorded during the data collection phase (Zeidner, 1991), only the alpha coefficient for the subscales was reported. Similarly, in a study using the SAM (Vahedi et al., 2011), only the alpha coefficient for the subscales was reported. On the other hand, in two studies (Asare, 2023; Jones et al., 2022) where the reliability coefficient for the subscales of STARS was reported, the omega coefficient was reported. Since only studies reporting alpha coefficients were included in the analysis, the two studies reporting omega coefficients (Asare, 2023; Jones et al., 2022) were excluded from the analysis. In this case, a total of four studies were excluded from the analysis.
2.2 Data coding and coder reliability
In addition to the alpha coefficients of the studies included in the analysis, the year of publication, number of items, the scale used, the language in which the scale was applied, the type of publication of the study, the type of sample, the continent in which the scale was applied, were also recorded. Sample size, the ratio of the number of women to the number of men in the sample, the mean scale score, the standard deviation of the mean scale score, and the mean age were also collected. The scales used in these data were coded as STARS, SAS, WAESTA, SAS-10, SAM and SAQ. Moreover, the language in which the scale was administered was coded as English and non-English. The continent variable was coded as Asia, Europe, America, Africa and Australia. But there are studies that collected data from more than one continent (Chew et al., 2018; Gibeau et al., 2023). The sample type variable was coded as undergraduate, graduate and mixed (undergraduate and graduate).
For coder reliability, 20% of the researches were randomly selected from the data. After checking the reliability coefficient and n values for the selected data, When the data obtained from the researchers was evaluated, it was found that the inter-researcher agreement coefficient was 100%.
2.3 Data analysis
The present meta-analysis study aims to apply a reliability generalization meta-analysis to the 84 reliability coefficients reported for the scales in 81 studies that used different statistics anxiety scales. Additionally, 73 studies reporting reliability coefficients for subscales were included in the study to calculate the mean alpha coefficient for subscales. Analyses were conducted with the CMA v3 program. Method of moments (MM, also known as the DerSimonian and Laird method) was used as estimator method during the analysis.
In the study, firstly, the reliability induction rate was determined. In this way, the case of reporting reliability coefficient in primary studies will be revealed. Since the alpha coefficients of the scales are mostly reported at values of 0.70 and above, the distribution of alpha coefficients will show negative skewness (Beretvas and Pastor, 2003). In this context, the researchers contend that it would be more appropriate to apply a transformation formula to the alpha coefficients (Bonett, 2002, 2010; Hakstian et al., 1976). It was deemed appropriate to apply Bonett (2002) transformation formula to the alpha coefficients in order to stabilize the variance as well as to eliminate the skewness in the alpha coefficients. The transformation formulas for variance variability and alpha coefficients are as follows (Equations 1,2):
After applying the transformation formula to the alpha coefficients, the effect size to be used in the meta-analysis should be selected. Since the random effects model is mostly used in social sciences and it is thought that there is more than one source of variability in this model, the random effects model was chosen in this study. Following the selection of the effect size, in addition to the forest plot presented to test the heterogeneity of the data group, the Q-statistic and the I2 value to determine the amount of heterogeneity were calculated. The Q-statistic aims to test whether there is a statistically significant chi-square value (Sedgwick, 2015). On the other hand, 50% percentile value is a limit for the calculated I2 value (Cleophas and Zwinderman, 2007). After the heterogeneity of the data group was tested, analog to the ANOVA analysis was conducted under the mixed effects model in the analysis of categorical variables in order to determine the sources of heterogeneity (Hedges, 1982). In the analysis of continuous variables, meta-regression analysis with maximum likelihood estimation method was applied (Hedges and Olkin, 2014). Furthermore, the QB -statistic (Q-between) was used to determine statistical significance in moderator variables. R2 estimation was used to determine the proportion of variance explained by continuous variables in meta-regression analysis (Yıldırım, 2021).
In the study, six different methods are mentioned to determine whether there is publication bias (Card, 2015). These methods are the funnel plot (Light and Pillemer, 1984), the trim and fill method developed by Duval and Tweedie (2000), the fail-safe N method proposed by Rosenthal (1979) and Orwin (1983) fail-safe N method. Analyses were performed with these methods. In addition, the regression analysis developed by Egger et al. (1997) and the rank correlation test developed by Begg and Mazumdar (1994) were utilized. The data was analyzed using the CMA v3 program.
2.4 Demographic characteristics of the primary studies
The distribution of the studies in which statistics anxiety scales were used according to years is shown in Figure 2. When Figure 2 is examined, it is seen that the use of statistics anxiety scales has increased over the years and the most commonly used year is 2022.
The demographic characteristics of the studies included in the analysis are shown in Table 1. When Table 1 is examined, the measurement tools consist of 6 different scales, namely STARS (61.90%), SAS (25.00%), WAESTA (5.95%), SAS-10 (4.76%), SAM (1.19%) and SAQ (1.19%). When the publication types of the studies were analyzed, it was seen that they consist of thesis (13.04%), article (76.19%) or paper (10.71%). The measurement tools were applied not only in the continent where they were developed but also in Asia (19.04%), Europe (29.76%), America (39.28%), Africa (11.90%) and Australia (1.19%). The educational level of the participants was grouped in three different levels: undergraduate students (68.24%), graduate students (23.52%), and mixed (8.24%). The studies using the instruments measuring statistics anxiety were grouped in two different ways: studies using the English version (56.52%) and studies using non-English versions (43.47%). The non-English versions were Arabic, Malay, Italian, German, Turkish, Chinese, Spanish, Hebrew, French and Dutch. Eighty-four reliability coefficient obtained from 81 studies for reliability generalization meta-analysis are presented in Supplementary Table 1. As stated in Supplementary Table 1, no year range was determined during the screening, and it was aimed to reach all published studies. In Supplementary Table 1, in addition to the reliability coefficients of the samples, the number of the sample, the ratio of the number of women to the number of men in the sample, the scale applied, the number of items in the scale, the continent where the study was conducted, the type of study published, the education level of the participants, and the language in which the scale was applied are given. There is also information on the number of items of the measurement tool, the year of publication of the study and the mean age of the sample.
According to Supplementary Table 1, there were a total of 18,809 participants in the studies included in the meta-analysis. In addition to the number of participants, there are also studies reporting the mean age of participants. In 53 of the 81 studies, the mean age of the participants was reported. According to the reported mean ages, the mean age of the participants ranged between 18.23 and 40.73. The number of studies reporting the gender of the participants is also quite high. Only 19 studies did not report the number of men and women participating in the study, while the number of men and women was reported in the remaining 65 studies. Accordingly, in the current study, the ratio of the number of women participating in the research to the number of men was taken for each study. The female to male ratio of the participants is between 0.70 and 12.67.
Since the number of items of the measurement tools used in the studies is also an important variable for reliability generalization meta-analysis, the number of items for each study is reported. The number of items of the scales varied between 10 and 51. In addition, there are 21 studies reporting the mean scale score recorded during the data collection phase. The mean scale scores of these studies ranged from 43.46 to 198.84 for STARS and from 46.10 to 74.70 for SAS. For WAESTA, the mean scores ranged from 40.62 to 42.66, and for SAS-10, the mean scale scores ranged from 26.07 to 34.19. The standard deviation of the mean scale scores of these studies is between 1.1 and 59.06. The publication years of the studies included in the analysis ranged between 1985 and 2023 (Median = 2017).
Figure 3 shows the stem and leaf graphs of the 84 raw alpha values. When the alpha coefficients are analyzed, it can be inferred that the reliability of all of these values is at a sufficient level (O’Rourke et al., 2005). In Figure 3, the unweighted mean alpha coefficient was calculated as 0.913 (Median = 0.930, SD = 0.056). According to the graph, the lowest alpha coefficient was 0.74 and the highest alpha coefficient was 0.98. The values of the 84 reliability coefficients show an asymmetry of −1.514.
3 Results
3.1 Reliability induction
In some studies, instead of reporting the reliability coefficient of the sample, the reliability coefficient of the sample in which the scale was developed is reported. This situation is called reliability induction in the literature (Shields and Caruso, 2004; Şen and Yıldırım, 2023). In the current research, there are quite a number of studies in which the reliability coefficients of the sample of the studies in which statistics anxiety scales were used were not reported. In 204 studies in which statistics anxiety scales were used, the reliability coefficient of the applied sample was not reported and was not included in the analysis. In this case, 157 studies in which the reliability coefficient of the sample to which the scale was applied was reported were included in the analysis. Since 204 studies in which the statistics anxiety scale was used and the reliability coefficient was not reported were not included in the analysis, the reliability induction rate was determined as 57.2%. As can be seen from here, more than half of the studies that should have used the measurement tool and reported reliability did not take this requirement into account. This can be considered as an important methodological problem.
3.2 Publication bias
In order to determine whether there was any publication bias in the data group. According to the classical fail-safe N value, the value was found to be 10,026. This value is considerably higher than the value of 425 obtained with the formula NR (5 k + 10). In this case, it is seen that the combined alpha coefficients are not biased according to the classical perpetrator-safe N method. According to Orwin’s fail-safe N method, the number of missing data should be 2,120 for Fisher’s z to be 0.01. Since this number is almost impossible to reach, it can be said that the current study is not biased according to Orwin’s fail-safe N method. Moreover, according to the findings of Begg and Mazumdar’s rank correlation test, Kendall’s two-tailed p value is not statistically significant (p = 0.432) and the standard error value is negative (τ = −0.05). According to this test, the two-tailed p value should not be statistically significant to ensure the absence of publication bias. In addition, according to the Egger’s test analyzed, the t value is not statistically significant [t(83) = 0.183, p = 0.427]. Egger’s one-tailed p > 0.05 indicates that the current finding is not biased. Finally, the findings of Duval and Tweedie’s trim and fill method are shown in Figure 4.
When Figure 4 is examined, the data shows a symmetrical distribution. In addition, the fact that there are only two fictitious studies that need to be added to ensure that the publication is not biased shows that the data group is not biased. Moreover, the forest plots created to determine the heterogeneity of the data group is shown in Figure 5.
3.3 Reliability generalization meta-analysis for each scale
The statistics anxiety scales included in the reliability generalization meta-analysis were analyzed with a total of six different measurement tools: STARS, SAS, SAS-10, WAESTA, SAQ and SAM. It was aimed to conduct reliability generalization meta-analysis for each of these scales, and since only one study using SAM and SAQ scales was included in the analysis, no reliability generalization meta-analysis was conducted for these studies. Accordingly, the reliability coefficients of the studies using the STARS, SAS, SAS-10 and WAESTA scales as well as the pooled reliability values of the subscales of STARS and SAS are shown in Table 2. Table 2 also includes the descriptive statistics of these measurement tools.
The internal consistency coefficient of the sample in which the measurement tools were developed, the total number of studies in which the scale was used and the number of items in the scale are shown in Table 2. Moreover, for each of these scales, the 95% confidence interval of the alpha values for which Bonett (2002) transformation formula was applied, and the lowest and highest alpha coefficients in the included studies were also presented. In addition, the Q statistic with degrees of freedom for testing heterogeneity and I2 values for the amount of heterogeneity are shown. Finally, Orwin’s fail-safe N value was examined to determine whether the data formed by the alpha coefficients of the scales were subject to publication bias.
According to Table 2, the alpha coefficient of the sample in which STARS was developed was reported as 0.96, while the original alpha coefficients of the subscales were 0.94, 0.87, 0.68, 0.88, 0.89 and 0.80, respectively. In addition to the 52 studies in which STARS was used and included in the analysis, the number of studies in which subscales were used and included in the analysis were 85, 90, 91, 83, 88 and 82, respectively. As a result of the reliability generalization meta-analysis applied to the alpha coefficients of these studies with a range between 0.740 and 0.980, the mean alpha coefficient for the whole STARS was calculated as 0.931 (95% CI: 0.917–0.942), and the mean alpha coefficients for the subscales were calculated as 0.911, 0.864, 0.865, 0.847, 0.822 and 0.780, respectively. Table 2 shows that heterogeneity was achieved according to the Q values calculated for the whole STARS and each of the subscales (p < 0.01). Furthermore, the I2 values calculated for the heterogeneity amounts of these scales are quite high (>75%). Considering the Orwin fail-safe N value examined for publication bias, it can be concluded that the data groups formed for STARS and subscales of STARS are not biased.
Table 2 presents the findings of the reliability generalization meta-analysis for the whole 24-item SAS and its three subscales as well as demographic characteristics. While the alpha coefficient for the sample in which the SAS was developed was reported as 0.91, the original alpha coefficients for the subscales of the SAS were 0.87, 0.92 and 0.82, respectively. In addition to reporting the alpha coefficients of 21 studies in which the SAS was used, the number of studies in which the alpha coefficients of the subscales were also reported were 22, 22 and 21, respectively. As a result of the reliability generalization meta-analysis applied to alpha values ranging from 0.790 to 0.950, the mean alpha value for the whole SAS was determined as 0.917 (95% CI: 0.989–0.933). The mean alpha coefficients for the subscales of the SAS were 0.895, 0.927 and 0.876, respectively. According to the Q value calculated for the whole and subscales of the SAS, heterogeneity was achieved (p < 0.01) and the I2 value calculated for the amount of heterogeneity was quite high (>75%). Considering the Orwin fail-safe N value examined for publication bias, it can be concluded that the data groups created for the SAS and its subscales are not biased.
Although the 17-item WAESTA scale has three subscales, reliability generalization meta-analysis was conducted for this scale since there are few studies using the WAESTA scale included in the analysis in the current study. Reliability generalization meta-analysis was not applied to the subscales. Five studies using this scale were included in the analysis and the alpha coefficient for the sample in which WAESTA was developed was reported as 0.94. The mean alpha coefficient of the studies using the WAESTA scale with a range between 0.940 and 0.960 was analyzed as 0.951 (95% CI: 0.941–0.960). According to the Q-statistic calculated to test the heterogeneity of the studies using WAESTA, heterogeneity was achieved (p < 0.01). In addition, the I2 value calculated for the amount of heterogeneity exceeded the 50% limit. Considering the Orwin fail-safe N value examined for publication bias, it can be concluded that the data groups created for WAESTA are not biased.
Four studies using the 10-item, unidimensional SAS-10 scale were included in the analysis and the range of these studies varied between 0.750 and 0.980. The alpha coefficient of the sample in which SAS-10 was developed was reported as 0.90. The mean alpha coefficient of the studies using this scale was analyzed as 0.918 (95% CI: 0.836–0.959). The heterogeneity of the analyzed data group was ensured according to the calculated Q-statistic (p < 0.01) and the amount of heterogeneity was quite high (>75%) according to the calculated I2 value. Considering the Orwin fail-safe N value examined for publication bias, it can be concluded that the data groups created for SAS-10 are not biased.
3.4 Overall effect size
In addition to the reliability generalization meta-analysis conducted for each of the scales, all of the studies using these scales were combined and the overall alpha mean of all studies was created. In Table 3, the general reliability coefficient formed by combining the reliability coefficients of the studies included in the meta-analysis is reported under the random effects model. Moreover, the heterogeneity test for the model is also presented in Table 3. The Q value calculated to determine the heterogeneity of the data group was reported as 3596.36. This value is well above the 0.05 confidence interval limit (sd = 83, χ2 = 105.27) with 84 degrees of freedom in the chi-square table. In this case, the heterogeneity of the data is ensured. The I2 value calculated to determine the amount of heterogeneity was found to be 97.6%. This value shows that the amount of heterogeneity of the data group is quite high.
The mean of the alpha coefficients using Bonett (2002) transformation formula was 0.928 (95% CI: 0.917–0.937) under the random effects model and this value is statistically significant (p < 0.01). It is seen that the mean of the transformed alpha values is higher than the mean of the raw alpha values shown in Figure 3.
3.5 Sub-group analysis of categorical variables
In addition to the statistical significance of all studies included in the analysis, categorical and continuous variables were determined to investigate the source of heterogeneity of the studies. Analog to the ANOVA analysis was performed for categorical variables and meta-regression analysis for continuous variables. In Supplementary Table 1, five different categorical variables were identified: the type of scale used, the type of publication of the study, the language in which the scale was applied, the educational level of the sample, and the continent in which the study was applied. Moreover, the continuous variables in Supplementary Table 1 are the year of publication of the study, the ratio of the number of women to the number of men in the sample, the number of items in the scale, the sample size, the mean age of the sample, the mean score of the scale and the standard deviation of the mean score of the scale. Analog to the ANOVA findings for categorical variables are shown in Table 4.
In the analog to the ANOVA analysis, the difference between the scales was first investigated. STARS (k = 52), SAS (k = 21), WAESTA (k = 5) and SAS-10 (k = 4) scales were analyzed with alpha coefficients. Since there was only one study using SAM and SAQ scales, they were not included in the analog to the ANOVA analysis. It was observed that the mean alpha coefficients of the 83 studies included in the analog to the ANOVA analysis differed according to the scale types (Q = 14.364, sd = 3, p < 0.05). In this case, it can be said that the heterogeneity of the mean alpha coefficient is due to the type of scale used. When the mean alpha coefficients of the scale types are compared, it is seen that the scale with the highest mean alpha value is WAESTA (0.952), followed by STARS (0.931), and finally SAS (0.918) and SAS-10 (0.918) scales together.
No statistically significant difference was found between the mean alpha coefficients of other categorical variables in the current study (p > 0.05). According to Table 4, no statistically significant difference was found according to the variables of education level (Q = 2.656, sd = 2, p = 0.265), continent where the scale was applied (Q = 2.780, sd = 3, p = 0.433), language where the scale was applied (Q = 0.619, sd = 1, p = 0.431) and type of publication of the study (Q = 1.142, sd = 2, p = 0.613). The non-significant p values indicate that comparable findings were not reached between subgroups.
3.6 Moderator analysis of continuous variables
In the present study, continuous variables were determined in addition to categorical variables. The year the study was published, the size of the sample, the number of items in the scale, the mean age of the sample, the ratio of the number of women to the number of men in the sample, the mean scale score and the standard deviation of the mean scale score were determined as continuous variables. The findings of the meta-regression analysis applied to investigate the effect of each continuous variable on the mean alpha coefficient are shown in Table 5. The reason for applying meta-regression analysis separately for each variable instead of multiple is missing data. Since the data of different studies could not be accessed under each heading, there are less than 15 studies when the general model is established.
According to the meta-regression findings, the variables of scale mean score and standard deviation of scale mean score are statistically significant predictors of the mean reliability value (p < 0.05). As predicted, there is a significant and positive relationship between these variables and mean alpha values. A one-unit increase in the mean scale score is expected to increase the mean reliability value by 0.0056. Furthermore, a one-unit increase in the standard deviation of the mean scale score is expected to increase the mean reliability value by 0.0198 points. Here, it is seen that the standard deviation of the mean scale score affects the mean alpha value more than the mean scale score. The R2 value, which is the indicator of the explained variance ratio, is 24% for the mean scale score and 24% for the standard deviation of the mean scale score. According to Table 5, other continuous variables were not statistically significant predictors of the mean alpha coefficient (p > 0.05). In the multiple regression model for the combined reliability value, the entire model was statistically significant (QM = 3593.76, p < 0.01) and explained 98% of the total variance. In addition, the model for residual values is also statistically significant (Q = 3424.61, p < 0.01).
4 Conclusion and discussion
The aim of the present study is to establish an overall reliability value for all of the statistics anxiety scales, as well as to apply reliability generalization meta-analysis for each statistics anxiety scale (STARS, SAS, WAESTA, SAS-10). Bonett (2010) conversion formula was applied to the alpha coefficients of the studies in which statistics anxiety scales were used. Since the mean alpha value obtained from the transformed alpha coefficients is above 0.90, the applications are reliable (Büyüköztürk, 2021). In the literature, there are studies in which the mean alpha coefficient of each scale was calculated in addition to reliability generalization meta-analysis for different scales measuring the same construct (Graham and Christiansen, 2009; Graham et al., 2011). Reliability generalization meta-analysis was also conducted for the subscales of multidimensional statistics anxiety scales. As a result of this analysis, it was concluded that the reliability of each subscale was above 0.70 (Clark and Watson, 2016). When the literature is examined, there are studies in which the mean alpha coefficients for the subscales were analyzed in addition to the mean alpha value for the entire measurement tool (Kıyıcı and Kahraman, 2022; Graham and Christiansen, 2009). This is similar to the method of the current study. In addition to the reliability generalization meta-analysis applied for each scale, reliability generalization meta-analysis was also applied for all of the statistics anxiety scales. Since the mean alpha coefficient obtained as a result of the analysis conducted under the random effects model was above 0.70, it was seen that the applications of the scale were reliable (DeVellis, 1991; Tavakol and Dennick, 2011). Moreover, the heterogeneity of the data group was ensured and it was determined that the amount of heterogeneity was quite high. The scope of the current study includes articles, proceedings, and theses that meet the specified criteria and use at least one of the scales mentioned above, accessible via the Web of Science, Scopus, ERIC databases, and Google Scholar search engine as of July 2023. Only studies published in Turkish or English were included in the research. In many studies, it was observed that the reliability coefficient of the sample in which the scale was used was not reported. Since the conditions of each application of the scale are different, the reliability value should be reported (Shields and Caruso, 2003).
In order to investigate the sources of heterogeneity in the heterogeneous data group, analog to the ANOVA was applied for categorical variables and meta-regression analysis was applied for continuous variables. The categorical variables for which analog to the ANOVA was applied were examined in five subgroups: the type of scale, the educational level of the sample, the continent in which the scale was applied, the language in which the scale was applied, and the type of publication of the study. In line with the findings obtained, it is seen that the mean alpha value differs statistically significantly according to the scale type variable. Grajzel (2019) applied STARS and SAS measurement tools to the same sample and reported the alpha coefficient of STARS as 0.890 and the alpha coefficient of SAS as 0.940. This does not coincide with the results obtained in the present study on the other hand, Igbokwe et al. (2017), similar to Grajzel (2019), reported the alpha coefficients of STARS and SAS measurement tools for the same sample as 0.960 for STARS and 0.910 for SAS. This supports the Analog to the ANOVA findings of the current study classified according to scale type.
In the present study, the mean alpha coefficient does not differ statistically significantly according to other categorical variables (the educational level of the sample, the continent where the scale was applied, the language in which the scale was applied, the type of publication of the study). In the literature, there are studies in which statistics anxiety differs according to the educational level of the sample (Fitzgerald, 1996), as well as studies in which there is no statistically significant difference between statistics anxiety and educational level (Benson, 1989). On the other hand, there are studies in the literature where there is a difference in the level of statistics anxiety according to continents (Kesici et al., 2011; Onwuegbuzie, 1999). Moreover, although there was no statistically significant difference in the current study in terms of the language in which the scale was applied, there is a study in the literature in which there was a difference in the level of statistics anxiety according to the language in which the scale was applied (Förster and Maur, 2015). In this study, it was revealed that the group whose mother tongue was different had more statistics anxiety than the group whose mother tongue was German. The reason for this situation can be explained by the inability to understand statistical concepts correctly, followed by a higher level of statistics anxiety. In the present study, no statistically significant difference was found between statistics anxiety and the type of publication of the study. In a different study using reliability generalization meta-analysis method (Sen, 2022), no difference was found between the mean alpha coefficient and the type of publication. On the other hand, there is a study showing that the type of extension is one of the variables affecting statistics anxiety (Fitzgerald, 1996). Since the aforementioned study tried to explain statistics anxiety together with attitude and achievement factors, different results may have been obtained from the present study.
In order to determine the continuous variables affecting the mean alpha coefficient, meta-regression analysis was performed for seven moderator variables (year of publication of the study, size of the sample, number of items, mean age of the sample, ratio of the number of women to the number of men, mean scale score and standard deviation of the mean scale score). As a result of the analysis, it was found that the mean scale score and the standard deviation of the mean scale score were statistically significant predictors of the mean alpha value. This may be due to the different number of items in the scales. Similarly, in another study in which reliability generalization meta-analysis was applied (López-Pina et al., 2015), the mean scale score and standard deviation of the mean scale score were found to be predictors of the mean reliability value. This supports the current study. On the other hand, unlike the conclusion of the current study that year of publication has no effect on statistics anxiety, a study concluding that year of publication has an effect on statistics anxiety was found (Fitzgerald, 1996). Since the year of publication was classified and analyzed as categorical variables in groups of ten instead of continuous variables in the aforementioned study, different results may have been reached. In addition, the result that the mean alpha coefficient was not affected by the sample size in the current study was also reached in a study (Sen, 2022) in which reliability generalization meta-analysis was applied. In the present study, the number of items was not one of the variables affecting the mean alpha coefficient. In the literature, it was found that test length affects statistics anxiety (Fitzgerald, 1996). Contrary, in a study investigating attitude towards mathematics (Bradford, 1990), it was revealed that the length of the test affects attitude. In the aforementioned studies, test length was classified and transformed into a categorical variable and analyzes were conducted in this way. In this case, it can be considered natural to reach different results with the present study. On the other hand, the study concluded that the mean age had no effect on the mean alpha coefficient. Another variable whose effect on the mean alpha coefficient was investigated is the mean age of the sample. In the present study, it was concluded that the mean age did not affect the mean alpha coefficient. Similarly, there are studies in the literature (Bui and Alfaro, 2011; Fitzgerald, 1996) that concluded that age has no effect on statistics anxiety. On the other hand, there are also studies showing that groups with a higher mean age exhibit higher statistics anxiety than younger groups (Bell, 2003; Edirisooriya and Lipscomb, 2021). Since younger groups are more familiar with computerized applications (Prensky, 2001), they may have lower statistics anxiety in statistical applications. Finally, in this study, it was determined that the ratio of the number of women to the number of men in the sample had no effect on the mean alpha coefficient. In the literature, there are studies that show that the level of statistics anxiety differs statistically significantly according to gender, rather than the ratio of the number of women to the number of men in samples that aim to determine statistics anxiety (Edirisooriya and Lipscomb, 2021; Gibeau et al., 2023; Hsiao and Chiang, 2011; MacArthur, 2020). In her study, Demaria-Mitton (1987) found that the level of statistics anxiety did not differ statistically significantly according to gender. Furthermore, Mandap (2016) investigated the difference of statistics anxiety of the group to which STARS was applied according to gender on the subscales of STARS. According to the results, a statistically significant difference was found only for one of the subscales according to gender. According to the findings, women’s level of fear of asking for help was found to be higher than that of men. When we look at this result, it is seen that there is a statistically significant difference only for one of the subscales rather than the whole scale. For this reason, it can be considered that the aforementioned study may have differed from the result of the current study, since analyzes were not conducted for the whole scale.
Bonett (2010) conversion formula was applied to the alpha coefficients in the study. In future studies, different conversion formulas (Fisher z, Hakstian et al., 1976) can be applied to different reliability values (composite reliability, test–retest). In addition, researchers who want to examine the variables affecting statistical anxiety, unlike the variables investigated in the current study, can examine the effect of variables such as the field of education of the scale and the country where the scale is applied.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.
Author contributions
EÖ: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. IY: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing.
Funding
The author(s) declared that financial support was not received for this work and/or its publication.
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that Generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2025.1675957/full#supplementary-material
References
Akdeniz, F. (2015). İstatistikte yeni eğilimler ve gelişmeler. Sosyal Bilimler Araştırma Dergisi 4, 1–11.
Asare, P. Y. (2023). Profiling teacher pedagogical behaviours in plummeting postgraduate students’ anxiety in statistics. Cogent Educ. 10:2222656. doi: 10.1080/2331186X.2023.2222656
Begg, C. B., and Mazumdar, M. (1994). Operating characteristics of a rank correlation test for publication bias. Biometrics 50, 1088–1101. doi: 10.2307/2533446,
Bendig, A. W., and Hughes, J. B. (1954). Student attitude and achievement in a course in introductory statistics. J. Educ. Psychol. 45, 268–276. doi: 10.1037/h0057391
Benson, J. (1989). Structural components of statistical test anxiety in adults: an exploratory model. J. Exp. Educ. 57, 247–261. doi: 10.1080/00220973.1989.10806509
Beretvas, S. N., and Pastor, D. A. (2003). Using mixed-effects models in reliability generalization studies. Educ. Psychol. Meas. 63, 75–95. doi: 10.1177/0013164402239318
Betz, N. E. (1978). Prevalence, distribution and correlates of math anxiety in college students. J. Couns. Psychol. 25, 441–448. doi: 10.1037/0022-0167.25.5.441
Bonett, D. G. (2002). Sample size requirements for testing and estimating coefficient alpha. J. Educ. Behav. Stat. 27, 335–340. doi: 10.3102/10769986027004335
Bonett, D. G. (2010). Varying coefficient meta-analytic methods for alpha reliability. Psychol. Methods 15, 368–385. doi: 10.1037/a0020142,
Bradford, J. W. (1990). A meta-analysis of selected research on student attitudes towards mathematics : The University of Iowa.
Büyüköztürk, Ş. (2021). Sosyal Bilimler İçin Veri Analizi El Kitabı (29. baskı). Pegem Akademi, Ankara.
Büyüköztürk, Ş., Kılıç Çakmak, E., Akgün, Ö. E., Karadeniz, Ş., and Demirel, F. (2012). Örnekleme yöntemleri.
Card, N. A. (2015). Applied meta-analysis for social science research. New York: Guilford Publications.
Clark, L. A., and Watson, D. (2016). Constructing validity: basic issues in objective scale development. Psychological Assessment 7, 309–319. doi: 10.1037/14805-012
Cleophas, T. J., and Zwinderman, A. H. (2007). Meta-analysis. Circulation 115, 2870–2875. doi: 10.1161/CIRCULATIONAHA.105.594960,
Cruise, R. J., Cash, R. W., and Bolton, D. L. (1985). “Development and validation of an instrument to measure statistical anxiety” in Paper presented at the annual meeting of the American Statistical Association statistics education section, Las Vegas, Nevada.
Demaria-Mitton, P. A. 1987 Locus-of-control. Gender and type of major as correlates to statistics anxiety in college students. Unpublished doctoral dissertation, American U niversity
DeVellis, R. F. (1991). Scale development: Theory and applications. California: Sage Publications, 1–113.
Duval, S., and Tweedie, R. (2000). Trim and fll: a simple funnel-plot-based method of testing and adjusting for publication bias in meta-analysis. Biometrics 56, 455–463. doi: 10.1111/j.0006-341X.2000.00455.x
Egger, M., Smith, G. D., Schneider, M., and Minder, C. (1997). Bias in meta-analysis detected by a simple, graphical test. BMJ 315, 629–634. doi: 10.1136/bmj.315.7109.629,
Faber, G., Drexler, H., Stappert, A., and Eichhorn, J. (2018). Education science students’ statistics anxiety: developing and analyzing a scale for measuring their worry, avoidance, and emotionality cognitions. Int. J. Educ. Psychol. 7, 248–285. doi: 10.17583/ijep.2018.2872
Feinberg, L. B., and Halperin, S. (1978). Affective and cognitive correlates of course performance in introductory statistics. J. Exp. Educ. 46, 11–18. doi: 10.1080/00220973.1978.11011637
Fisch, R. (1971). Course evaluation, test anxiety, and final test results in a statistics course. Z. Entwicklungspsychol. Paedagog. Psychol. 3, 212–228.
Fitzgerald, S. M. (1996). The relationship between anxiety and statistics achievement: a meta-analysis : The University of Toledo.
Graham, J. M., and Christiansen, K. (2009). The reliability of romantic love: a reliability generalization meta-analysis. Pers. Relat. 16, 49–66. doi: 10.1111/j.1475-6811.2009.01209.x
Graham, J. M., Diebels, K. J., and Barnow, Z. B. (2011). The reliability of relationship satisfaction: a reliability generalization meta-analysis. J. Fam. Psychol. 25, 39–48. doi: 10.1037/a0022441,
Hakstian, A. R., Whalen, T. E., and Masson, M. E. (1976). AK-sample procedure for comparatively assessing multivariate association. Psychol. Bull. 83, 922–927. doi: 10.1037/0033-2909.83.5.922
Hedges, L. V. (1982). Fitting categorical models to effect sizes from a series of experiments. J. Educ. Stat. 7, 119–137. doi: 10.3102/10769986007002119
Hedges, L. V., and Olkin, I. (2014). Statistical methods for meta-analysis. Orlando: Academic press.
Jones, A., Knibb, G., and Christiansen, P. (2022). Statistics anxiety, predictions of exam performance and actual exam performance in UK psychology students. PsyArXiv. doi: 10.31234/osf.io/edc2n
Kıyıcı, G., and Kahraman, N. (2022). A meta-analytic reliability generalization study of the computational thinking scale. Sci. Insights Educ. Front. 13, 1859–1874. doi: 10.15354/sief.22.ma011
Leech, N. L., Onwuegbuzie, A. J., and O'Conner, R. (2011). Assessing internal consistency in counseling research. Counsel. Outcome Res. Eval. 2, 115–125. doi: 10.1177/2150137811414873
Light, R. J., and Pillemer, D. B. (1984). Summing up: The science of reviewing research. Massachusetts: Harvard University Press.
López-Pina, J. A., Sánchez-Meca, J., López-López, J. A., Marín-Martínez, F., Núñez-Núñez, R. M., Rosa-Alcázar, A. I., et al. (2015). The Yale–Brown obsessive compulsive scale: a reliability generalization metaanalysis. Assessment 22, 619–628. doi: 10.1177/1073191114551954,
Moher, D., Liberati, A., Tetzlaff, J., and Altman, D. G.Prisma Group (2009). Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLoS Med. 151:e1000097. doi: 10.7326/0003-4819-151-4-200908180-00135,
O’Rourke, N., Hatcher, L., and Stepanski, E. J. (2005). A step-by-step approach to using SAS for univariate & multivariate statistics. Cary, NC, United States: SAS Institute.
Ock, J., McAbee, S. T., Ercan, S., Shaw, A., and Oswald, F. L. (2021). Reliability generalization analysis of the core self-evaluations scale. Pract. Assess. Res. Eval. 26:6. doi: 10.7275/zsc7-jw58
Orwin, R. G. (1983). A fail-safe N for effect size in meta-analysis. J. Educ. Stat. 8, 157–159. doi: 10.2307/1164923
Özdemir, V., Yıldırım, Y., and Tan, Ş. (2020). A meta-analytic reliability generalization study of the Oxford happiness scale in Turkish sample. J. Meas. Eval. Educ. Psychol. 11, 374–404. doi: 10.21031/epod.766266
Prensky, M. (2001). Digital natives, digital immigrants part II: do they really think differently? On Horizon 9, 1–6. doi: 10.1108/10748120110424843
Rosenthal, R. (1979). The file drawer problem and tolerance for null results. Psychol. Bull. 86, 638–641. doi: 10.1037/0033-2909.86.3.638
Sanchez‐Meca, J., López‐López, J. A., and López‐Pina, J. A. (2013). Some recommended statistical analytic practices when reliability generalization studies are conducted. British Journal of Mathematical and Statistical Psychology 66, 402–425. doi: 10.1111/j.2044-8317.2012.02057.x,
Sánchez-Meca, J., Marín-Martínez, F., López-López, J. A., Núñez-Núñez, R. M., Rubio-Aparicio, M., López-García, J. J., et al. (2021). Improving the reporting quality of reliability generalization meta-analyses: the REGEMA checklist. Res. Synth. Methods 12, 516–536. doi: 10.1002/jrsm.1487,
Schacht, S., and Stewart, B. J. (1990). What's funny about statistics? A technique for reducing student anxiety. Teach. Sociol. 18, 52–56. doi: 10.2307/1318231
Sen, S. (2022). A reliability generalization meta-analysis of Runco ideational behavior scale. Creat. Res. J. 34, 178–194. doi: 10.1080/10400419.2021.1960719
Shields, A. L., and Caruso, J. C. (2003). Reliability generalization of the alcohol use disorders identification test. Educ. Psychol. Meas. 63, 404–413. doi: 10.1177/0013164403063003004
Shields, A. L., and Caruso, J. C. (2004). A reliability induction and reliability generalization study of the CAGE questionnaire. Educational and Psychological Measurement, 64, 254–270.
Spielberger, C. D., Gonzalez-Reigosa, F., Martinez-Urrutia, A., Natalicio, L. F., and Natalicio, D. S. (1970). The state-trait anxiety inventory. Revista Interamericana de Psicologia/Interamerican journal of psychology, 5.
Tavakol, M., and Dennick, R. (2011). Making sense of Cronbach’s alpha. International journal of medical education, 2:53–55. doi: 10.5116/ijme.4dfb.8dfd
Taylor, R. 2012 Review of the motivated strategies for learning questionnaire (MSLQ) using reliability generalization techniques to assess scale reliability. Doctoral dissertation
Vacha-Haase, T. (1998). Reliability generalization: exploring variance in measurement error affecting score reliability across studies. Educ. Psychol. Meas. 58, 6–20. doi: 10.1177/0013164498058001002
Vahedi, S., Farrokhi, F., and Bevrani, H. (2011). A confirmatory factor analysis of the structure of statistics anxiety measure: an examination of four alternative models. Iran. J. Psychiatry 6:92,
Wallace, K. A., and Wheeler, A. J. (2002). Reliability generalization of the life satisfaction index. Educ. Psychol. Meas. 62, 674–684. doi: 10.1177/0013164402062004009
Yaşar, M. (2014). İstatistiğe yönelik tutum ölçeği: Geçerlilik ve güvenirlik çalışması. Pamukkale Üniversitesi Eğitim Fakültesi Dergisi 36, 59–75. doi: 10.9779/PUJE640
Yıldırım, A. B. (2021). The effect of exercise on the total number of BrdU+ cell counts in rats’ hippocampal dentate gyrus: A meta-analysis study. Brain Research, 1766, 147512.
Zeidner, M. (1991). Statistics and mathematics anxiety in social science students: some interesting parallels. Br. J. Educ. Psychol. 61, 319–328. doi: 10.1111/j.2044-8279.1991.tb00989.x,
Studies included in the meta-analysis
Abd Hamid, H. S., and Sulaiman, M. K. (2016). “Factor structure of statistics anxiety rating scale (STARs): EFA and CFA Using Malaysian Undergraduate Psychology Students” in 7th International Conference on University Learning and Teaching (InCULT 2014) Proceedings: Educate to Innovate (pp. 149–156) (Singapore: Springer).
Abdalla, S. (2021). Prevalence and predictors of statistics anxiety among sociology and social work students. J. Arts Soc. Sci.
Abdulghani, R. 2023 Predictors of Statistics Anxiety Among Graduate Students in Saudi Arabia. Doctoral dissertation, Andrews University
Adegboye, O. A., and Jawid, A. (2016). Multivariate multilevel models for attitudes toward statistics: multi-disciplinary settings in Afghanistan. J. Appl. Stat. 43, 244–261. doi: 10.1080/02664763.2015.1091445
Afdal, A., Alizamar, A., Ilyas, A., Zikra, Z., Taufik, T., Erlamsyah, E., et al. (2019). Contribution of statistical anxiety to student learning outcomes: study in universitas negeri Padang. J. Phys. Conf. Ser. 1157:042126. doi: 10.1088/1742-6596/1157/4/042126
Ali, S. I., and Gaber, S. A. (2022). Modeling the causal relationships between statistical self-efficacy, academic hardiness, and statistics anxiety among postgraduate students at King Faisal University. J. Posit. Sch. Psychol., 1734–1749.
Alizamar, A., Afdal, A., Ifdil, I., Ardi, Z., Ilyas, A., and Zikra, Z., & & Febriani, R. D. (2019). Are there statistical anxiety differences between male and female students?. In J. Phys. Conf. Ser. (Vol. 1157, No., p.:042127). IOP Publishing. doi:doi: 10.1088/1742-6596/1157/4/042127
Altun, M., Yılmaz, G. K., Demir, B., and Çelik, H. S. (2021). Statistical anxiety and metacognitive awareness levels of graduate students studying in mathematics education program. Eur. J. Educ. Stud. 9.
Baloğlu, M., Abbassi, A., and Kesici, S. (2017). Multivariate relationships between statistics anxiety and motivational beliefs. Education 137, 430–444.
Baloğlu, M. (2002). Psychometric properties of the statistics anxiety rating scale. Psychol. Rep. 90, 315–325. doi: 10.2466/pr0.2002.90.1.315,
Baloğlu, M. (2003). Individual differences in statistics anxiety among college students. Pers. Individ. Differ. 34, 855–865. doi: 10.1016/S0191-8869(02)00076-4
Baloğlu, M., and Zelhart, P. (2004). Üniversite öğrencileri arasında yüksek ve düşük istatistik kaygısının ayrıştırıcıları. Eğitim ve Bilim 29.
Baloğlu, M., Deniz, M. E., and Kesici, Ş. (2011). A descriptive study of individual and cross-cultural differences in statistics anxiety. Learn. Individ. Differ. 21, 387–391. doi: 10.1016/j.lindif.2011.03.003
Bathurst, N. G. 2015 Using a self-management project to improve student performance in an online introductory statistics course. San José State University
Bektaş, H., Akman, S., and Yeşilaltay, E. (2021). İstatistik kaygı ölçeğinin (SAS) psikometrik özelliklerinin incelenmesi. İstanbul Gelişim Üniversitesi Sosyal Bilimler Dergisi 8, 1–14. doi: 10.17336/igusbd.627197
Bell, A. A. 2022 Social desirability and other predictors of statistics anxiety at the graduate level (doctoral dissertation, Capella University)
Beurze, S. M., Donders, A. R. T., Zielhuis, G. A., de Vegt, F., and Verbeek, A. L. (2013). Statistics anxiety: a barrier for education in research methodology for medical students? Med. Sci. Educ. 23, 377–384.
Bourne, V. J., and Nesbit, R. J. (2018). Do attitudes towards statistics influence the decision to study psychology at degree level? A pilot investigation. Psychol. Teach. Rev. 24, 55–63. doi: 10.53841/bpsptr.2018.24.2.55
Bui, N. H., and Alfaro, M. A. (2011). Statistics anxiety and science attitudes: age, gender, and ethnicity factors. Coll. Stud. J. 45, 573–585.
Casinillo, E. L., Lina, E. R., Casinillo, L. F., Batidor, P. G., and Lebante, M. R. (2022). Econometric evidence on statistical anxiety of engineering students during the new normal setup. Philipp. Soc. Sci. J. 5, 9–17. doi: 10.52006/main.v5i4.564
Chau, Q. 2018 Exploration of statistics anxiety among doctoral students in health sciences related disciplines Seton Hall University
Chew, K. H. P., and Dillon, D. B. (2014a). “Individual differences in statistics anxiety among students in Singapore” in Proceedings of the international conference on managing the Asian century: ICMAC 2013 (Singapore: Springer Singapore), 293–302.
Chew, K. H. P., and Dillon, D. B. (2014b). Statistics anxiety and the big five personality factors. Procedia Soc. Behav. Sci. 112, 1177–1186. doi: 10.1016/j.sbspro.2014.01.1282
Chew, P. K., and Dillon, D. B. (2014c). Reliability and validity of the statistical anxiety scale among students in Singapore and Australia. J. Trop. Psychol. 4:e7. doi: 10.1017/jtp.2014.7
Chew, P. K., Dillon, D. B., and Swinbourne, A. L. (2018). An examination of the internal consistency and structure of the statistical anxiety rating scale (STARS). PLoS One 13:e0194195. doi: 10.1371/journal.pone.0194195,
Chew, P. K., Swinbourne, A., and Dillon, D. B. (2014). No evidence of attentional bias in statistics anxiety. Eur. J. Soc. Behav. Sci. 10, 1451–1465. doi: 10.15405/ejsbs.131
Chiesi, F., Primi, C., and Carmona, J. (2011). Measuring statistics anxiety: cross-country validity of the statistical anxiety scale (SAS). J. Psychoeduc. Assess. 29, 559–569. doi: 10.1177/0734282911404985
Collins, K. M., and Onwuegbuzie, A. J. (2006). “Relatıonshıp between Readıng Abılıty and Statıstıcs Anxıety among Afrıcan-Amerıcan graduate students: Implıcatıons for the Teachıng and Learnıng of Statıstıcs” in 7th international conference on teaching statistics (Bahia, Brazil).
Collins, K. M., and Onwuegbuzie, A. J. (2007). I cannot read my statistics textbook: the relationship between reading ability and statistics anxiety. J. Negro Educ. 7, 118–129.
Cravalho, P. F. 2010 Learning statistics using concept maps: effects on anxiety and performance. San Jose state university
de Vink, I. C. 2017 The relationship between statistics anxiety and statistical performance (bachelor's thesis)
DeVaney, T. A. (2016). Confirmatory factor analysis of the statistical anxiety rating scale with online graduate students. Psychol. Rep. 118, 565–586. doi: 10.1177/0033294116644093,
Dunn, K. (2014). Why wait? The influence of academic self-regulation, intrinsic motivation, and statistics anxiety on procrastination in online statistics. Innov. High. Educ. 39, 33–44. doi: 10.1007/s10755-013-9256-1
Durak, I., and Karagöz, Y. (2021). Adaptation of statistics anxiety scale to Turkish: validity and reliability study. Int. J. Assess. Tools Educ. 8, 667–683. doi: 10.21449/ijate.863225
Earp, M. S. 2007 Development and validation of the statistics anxiety measure (doctoral dissertation, University of Denver)
Edirisooriya, M. L., and Lipscomb, T. J. (2021). Gender influence on statistics anxiety among graduate students. J. Res. Sci. Math. Technol. Educ. 4, 63–74. doi: 10.31756/jrsmte.421,
Eshet, Y., Steinberger, P., and Grinautsky, K. (2021). Relationship between statistics anxiety and academic dishonesty: a comparison between learning environments in social sciences. Sustainability 13:1564. doi: 10.3390/su13031564
Eshet, Y., Steinberger, P., and Grinautsky, K. (2022). Does statistics anxiety impact academic dishonesty? Academic challenges in the age of distance learning. Int. J. Educ. Integr. 18, 1–19. doi: 10.1007/s40979-022-00117-w,
Esnard, T. R., Alladin, F. M., and Samlal, K. C. (2021). Prior mathematics performance, statistics anxiety, self-efficacy and expectations for performance in statistics: a survey of social sciences students in a Caribbean institution of higher education. Stat. Educ. Res. J. 20, 4–4. doi: 10.52041/serj.v20i1.98
Fabbricatore, R., Parola, A., Pepicelli, G., and Palumbo, F. (2022). Successful factors in statistics learning for non-STEM courses students: a PLS-PM approach. Statistic. Applic. Ital. J. Appl. Statistic., (2). doi:doi: 10.26398/IJAS.0034-010
Faber, G., and Drexler, H. (2019). Predicting education science students' statistics anxiety: the role of prior experiences within a framework of domain-specific motivation constructs. High. Learn. Res. Commun. 9:n1. doi: 10.18870/hlrc.v9i1.435
Faber, G., Drexler, H., Stappert, A., and Eichhorn, J. (2018). Education science students’ statistics anxiety: developing and analyzing a scale for measuring their worry, avoidance, and emotionality cognitions. Int. J. Educ. Psychol. 7, 248–285. doi: 10.17583/ijep.2018.2872
Fairlamb, S., Papadopoulou, H., and Bourne, V. J. (2022). Reach for the STARS? The role of academic contingent self-worth in statistics anxiety and learning. Learn. Motiv. 78:101815. doi: 10.1016/j.lmot.2022.101815
Fernando, W. W. D. P., Wathsanie, M. V., Kavinda, R. P. M., and Lankeshwara, P. (2020). Identifying the causative factors for statistical anxiety among undergraduates (with special reference to general sir John Kotelawala defence university). J. Univers. Kelaniya 34:80. doi: 10.4038/kalyani.v34i1.49
Ferrando, P. J., and Navarro-González, D. (2018). Assessing the quality and usefulness of factor-analytic applications to personality measures: a study with the statistical anxiety scale. Pers. Individ. Differ. 123, 81–86. doi: 10.1016/j.paid.2017.11.014
Förster, M., and Maur, A. (2015). Statistics anxiety and selfconcept of beginning students in the social sciences–a matter of gender and socio-cultural background. Zeitschrift für Hochschulentwicklung 10, 67–90.
Frey-Clark, M., Natesan, P., and O’Bryant, M. (2019). Assessing statistical anxiety among online and traditional students. Front. Psychol. 10:1440. doi: 10.3389/fpsyg.2019.01440
Gibeau, R. M., Maloney, E. A., Béland, S., Lalande, D., Cantinotti, M., Williot, A., et al. (2023). The correlates of statistics anxiety: relationships with spatial anxiety, mathematics anxiety and gender. J. Numer. Cogn. 9, 16–43. doi: 10.5964/jnc.8199
Grajzel, K. (2019). Validation of the statistical anxiety scale among college students in the United States : University of Colorado Colorado Springs.
Güler, N., Taşdelen-Teker, G., and İlhan, M. (2019). Lisansüstü eğitim öğrencilerine yönelik istatistik kaygısı ölçeği’nin Türkçeye uyarlanması. J. Measure. Eval. Educ. Psychol. 10, 435–450. doi: 10.21031/epod.550765
Hanna, D., Shevlin, M., and Dempster, M. (2008). The structure of the statistics anxiety rating scale: a confirmatory factor analysis using UK psychology students. Pers. Individ. Differ. 45, 68–74. doi: 10.1016/j.paid.2008.02.021
Heretick, D. M., and Tanguma, J. (2021). Anxiety and attitudes toward statistics and research among younger and older nontraditional adult learners. J. Contin. High. Educ. 69, 87–99. doi: 10.1080/07377363.2020.1784690
Hoegler, S., and Nelson, M. (2018). The influence of anxiety and self-efficacy on statistics performance: a path analysis. Psi Chi J. Psychol. Res. 23, 364–375. doi: 10.24839/2325-7342.JN23.5.364
Hsiao, T. Y. (2010). The statistical anxiety rating scale: further evidence for multidimensionality. Psychol. Rep. 107, 977–982. doi: 10.2466/07.11.PR0.107.6.977-982,
Hsiao, T. Y., and Chiang, S. (2011). Gender differences in statistics anxiety among graduate students learning English as a foreign language. Soc. Behav. Pers. 39, 41–42. doi: 10.2224/sbp.2011.39.1.41
Hubbard, A. 2022 Motivation and statistics anxiety among adult online learners: The mediating role of self-regulated learning (doctoral dissertation, Walden University)
Idika, D. O., and Ojong, B. M. Demographıc Characterıstıcs and response effect of Statıstıcal Anxıety among Unıversıty students in cross Rıver state, Nıgerıa. International Journal of Scientific Research and Engineering Development, 3, 1024–1040.
Igbokwe, D. O., Oyewole, F. O., Prekake, T. S., and Agoha, B. C. E. (2017). Factor structure, reliability and validity of two statistical anxiety measures in a sample of Nigerian psychology undergraduates. J. Nigerian Psychol. Res. Abuja 5.
Jiao, Q. G., and Onwuegbuzie, A. J. (2012). The role of statıstıcs anxıety in cooperatıve learnıng groups in graduate-level research methodology courses. Int. J. Univ. Teach. Fac. Dev. 3:311.
Kafadar, T. (2022). Sosyal Bilgiler Eğitimi Lisansüstü Öğrencilerinin İstatistik Kaygılarının İncelenmesi. Sakarya Üniversitesi Eğitim Fakültesi Dergisi 22, 226–236. doi: 10.53629/sakaefd.1195896
Kangni, A. C. 2021 The influence of the instructor’s visibility in an instructional statistics video on student’s perceived social presence, anxiety, and performance (master's thesis, University of Twente)
Tonsing, K. N. (2018). Instructor immediacy and statistics anxiety in social work undergraduate students. Soc. Work. Educ. 37, 223–233. doi: 10.1080/02615479.2017.1395009
Keeley, J., Zayac, R., and Correia, C. (2008). Curvilinear relationships between statistics anxiety and performance among undergraduate students: evidence for optimal anxiety. Stat. Educ. Res. J. 7, 4–15. doi: 10.52041/serj.v7i1.477
Kesici, Ş., Baloğlu, M., and Deniz, M. E. (2011). Self-regulated learning strategies in relation with statistics anxiety. Learn. Individ. Differ. 21, 472–477. doi: 10.1016/j.lindif.2011.02.006
Lalande, D., Cantinotti, M., Williot, A., Gagnon, J., and Cousineau, D. (2019). Three pathways from achievement goals to academic performance in an undergraduate statistics course. Stat. Educ. Res. J. 18, 94–105. doi: 10.52041/serj.v18i1.152
Larwin, K., and Harvey, M. (2012). A demonstration of a systematic item-reduction approach using structural equation modeling. Pract. Assess. Res. Eval., 17, 8. doi:doi: 10.7275/0nem-w659
Lavasani, M. G., Weisani, M., and Shariati, F. (2014). The role of achievement goals, academic motivation in statistics anxiety: testing a causal model. Procedia Soc. Behav. Sci. 114, 933–938. doi: 10.1016/j.sbspro.2013.12.810
Lavidas, K., Manesis, D., and Gialamas, V. (2021). Investigation of the statistical anxiety rating scale psychometric properties with a sample of Greek students. Int. J. Educ. Psychol. 10, 116–142. doi: 10.17583/ijep.2021.6032
Levpušček, M. P., and Cukon, M. (2022). That old devil called ‘statistics’: statistics anxiety in university students and related factors. Cent. Educ. Policy Stud. J. 12, 147–168. doi: 10.26529/cepsj.826
Lin, Y. H. (2021). Clustering and profile analysis on statistics anxiety styles of undergraduate students. Int. J. Intel. Technol. Appl. Statistic. 14. doi: 10.6148/IJITAS.202112.14(4).0004
Lin, Y. J., and Tang, H. (2017). Exploring student perceptions of the use of open educational resources to reduce statistics anxiety. J. Form. Des. Learn. 1, 110–125. doi: 10.1007/s41686-017-0007-z
Liu, L. (2021). Assessing statistics anxiety in an online or hybrid setting: the adaption and development of a new instrument SASOH. Stat. Educ. Res. J. 20, 2–2. doi: 10.52041/serj.v20i1.96
Liu, S., Onwuegbuzie, A. J., and Meng, L. (2011). Examination of the score reliability and validity of the statistics anxiety rating scale. J. Educ. Enq. 11.
Lorenzo Seva, U., Vigil Colet, A., and Ferrando Piera, P. J. (2022). Development of a revised version of the statistical anxiety scale. Psicothema. doi: 10.7334/psicothema2022.231
Maat, S. M., and Rosli, M. K. (2016). The Rasch model analysis for statistical anxiety rating scale (STARS). Creat. Educ. 07, 2820–2828. doi: 10.4236/ce.2016.718261,
Maat, S. M., Mohd, N., Ahmad, C. N. C., and Puteh, M. (2022). Statistical anxiety and achievement among university students. TEM J., 11(1),:420. MacArthur, K. R. (2020). Avoiding over-diagnosis: Exploring the role of gender in changes over time in statistics anxiety and attitudes. Numeracy (1), 4. doi:doi: 10.18421/TEM111-53
Maat, S. M., Zakaria, E., and Rosli, M. K. (2017). The validation of statistical anxiety rating scale using confirmatory factor analysis among graduate students. Int. J. Novel Res. Educ. Learn. 4, 160–167.
MacArthur, K. R. (2020). Avoiding over-diagnosis: exploring the role of gender in changes over time in statistics anxiety and attitudes. Numeracy 13:4. doi: 10.5038/1936-4660.13.1.4
MacArthur, K. R., and Santo, J. B. (2023). A multi-level analysis of the effects of statistics anxiety/attitudes on trajectories of exam scores. J. Stat. Data Sci. Educ. 31, 102–112. doi: 10.1080/26939169.2022.2093805
Macher, D., Paechter, M., Papousek, I., and Ruggeri, K. (2012). Statistics anxiety, trait anxiety, learning behavior, and academic performance. Eur. J. Psychol. Educ. 27, 483–498. doi: 10.1007/s10212-011-0090-5
Macher, D., Paechter, M., Papousek, I., Ruggeri, K., Freudenthaler, H. H., and Arendasy, M. (2013). Statistics anxiety, state anxiety during an examination, and academic achievement. Br. J. Educ. Psychol. 83, 535–549. doi: 10.1111/j.2044-8279.2012.02081.x,
Mandap, C. M. (2016). Examining gender differences in statistics anxiety among college students. Int. J. Educ. Res. 4, 358–366.
Maysick, D. J. 1985 The relationship of self-concept, sex, age, major, anxiety, mathematics background, and mathematics ability to the level of mathematics and statistical anxiety among college juniors. Andrews University
McCaughey, N. J., Hill, T. G., and Mackinnon, S. P. (2022). The association of self-efficacy, anxiety sensitivity, and perfectionism with statistics and math anxiety. Pers. Sci. 3, 1–23. doi: 10.5964/ps.7091
McDonald, S., and Barnard, M. P. (2023). The influence of prior experience with mathematics and A-level science subjects on statistics anxiety in undergraduate psychology students. Psychol. Teach. Rev. 29, 37–50. doi: 10.53841/bpsptr.2023.29.1.37
McGrath, A. L., Ferns, A., Greiner, L., Wanamaker, K., and Brown, S. (2015). Reducing anxiety and increasing self-efficacy within an advanced graduate psychology statistics course. Can. J. Scholarsh. Teach. Learn. 6:5. doi: 10.5206/cjsotl-rcacea.2015.1.5
McIntee, S. E., Goulet-Pelletier, J. C., Williot, A., Deck-Léger, E., Lalande, D., Cantinotti, M., et al. (2022). (mal) adaptive cognitions as predictors of statistics anxiety. Stat. Educ. Res. J. 21:5. doi: 10.52041/serj.v21i1.364
Mji, A. (2009). Differences in university students' attitudes and anxiety about statistics. Psychol. Rep. 104, 737–744. doi: 10.2466/PR0.104.3.737-744,
Mji, A., and Onwuegbuzie, A. J. (2004). Evidence of score reliability and validity of the statistical anxiety rating scale among technikon students in South Africa. Meas. Eval. Couns. Dev. 36, 238–251. doi: 10.1080/07481756.2004.11909745
Mokhele, M. 2018 The association of statistics anxiety, attitude toward statistics and mathematics self-concept with performance in a business statistics course (doctoral dissertation, University of the Free State)
Najmi, A., Raza, S. A., and Qazi, W. (2018). Does statistics anxiety affect students' performance in higher education? The role of students' commitment, self-concept and adaptability. Int. J. Manag. Educ. 12, 95–113. doi: 10.1504/IJMIE.2018.090705
Nesbit, R. J., and Bourne, V. J. (2018). Statistics anxiety rating scale (STARS) use in psychology students: a review and analysis with an undergraduate sample. Psychol. Teach. Rev. 24, 101–110. doi: 10.53841/bpsptr.2018.24.2.101
Neumann, D. L., and Hood, M. (2009). The effects of using a wiki on student engagement and learning of report writing skills in a university statistics course. Australas. J. Educ. Technol. 25. doi: 10.14742/ajet.1141
Nielsen, T., and Kreiner, S. (2018). Measuring statistical anxiety and attitudes towards statistics: development of a comprehensive Danish instrument (HFS-R). Cogent Educ. 5:1521574. doi: 10.1080/2331186X.2018.1521574
O’Bryant, M., Natesan Batley, P., and Onwuegbuzie, A. J. (2021). Validation of an adapted version of the statistical anxiety scale in English and its relationship to attitudes toward statistics. SAGE Open 11:21582440211001378. doi: 10.1177/21582440211001378
O'Bryant, M. J. (2017). How attitudes towards statistics courses and the field of statistics predicts statistics anxiety among undergraduate social science majors: a validation of the statistical anxiety scale. Ann Arbor, MI: ProQuest LLC.
Ogbogo, S., and Amadi, G. N. (2018). Statistics self-efficacy and learning styles as predictor of statistics anxiety. Int. J. Sci. Res. Manag. 6, 240–248. doi: 10.18535/ijsrm/v6i4.el06
Onwuegbuzie, A. J. 1993 The interaction of statistics test anxiety and examination condition in statistics achievement of post-baccalaureate non-statistics majors. University of South Carolina
Onwuegbuzie, A. J. (1999). Statistics anxiety among African American graduate students: an affective filter? J. Black Psychol. 25, 189–209. doi: 10.1177/0095798499025002004
Onwuegbuzie, A. J. (2000a). I'll begin my statistics assignment tomorrow: the relationship between statistics anxiety and academic procrastination.
Onwuegbuzie, A. J. (2000b). Statistics anxiety and the role of self-perceptions. J. Educ. Res. 93, 323–330. doi: 10.1080/00220670009598724
Onwuegbuzie, A. J. (2004). Academic procrastination and statistics anxiety. Assess. Eval. High. Educ. 29, 3–19. doi: 10.1080/0260293042000160384
Onwuegbuzie, A. J., and Daley, C. E. (1999). Perfectionism and statistics anxiety. Pers. Individ. Differ. 26, 1089–1102. doi: 10.1016/S0191-8869(98)00214-1
Onwuegbuzie, A. J., DaRos, D., and Ryan, J. M. (1997). The components of statistics anxiety: a phenomenological study. Focus. Learn. Probl. Math. 19, 11–35. doi: 10.46743/2160-3715/2015.2101
Pan, W., and Tang, M. (2004). Examining the effectiveness of innovative instructional methods on reducing statistics anxiety for graduate students in the social sciences. J. Instr. Psychol. 31.
Papousek, I., Ruggeri, K., Macher, D., Paechter, M., Heene, M., Weiss, E. M., et al. (2012). Psychometric evaluation and experimental validation of the statistics anxiety rating scale. J. Pers. Assess. 94, 82–91. doi: 10.1080/00223891.2011.627959,
Paul, L., Parveen, T., Ahmed, O., and Aktar, R. (2018). Adaptation study of the statistical anxiety scale on a Bangladeshi sample. Bulg. J. Sci. Educ. Policy 12, 380–401.
Perepiczka, M., Chandler, N., and Becerra, M. (2011). Relationship between graduate students' statistics self-efficacy, statistics anxiety, attitude toward statistics, and social support. Prof. Counselor 1, 99–108. doi: 10.15241/mpa.1.2.99
Pretorius, T. B., and Norman, A. M. (1992). Psychometric data on the statistics anxiety scale for a sample of south African students. Educ. Psychol. Meas. 52, 933–937. doi: 10.1177/0013164492052004015
Primi, C., and Chiesi, F. (2016). “Statistics anxiety: a mediator in learning probability” in 13th international congress on mathematical education, 1–7.
Primi, C., and Chiesi, F. (2018). “The role of mathematics anxiety and statistics anxiety in learning statistics” in 10th international conference on teaching statistics (Kyoto, Japan). Retrieved fromhttps://iase-web.org/icots/10/proceedings/pdfs/ICOTS10_5E2.pdf
Ratanaolarn, T. (2016). The development of a structural equation model of graduate students’ statistics achievement. J. Behav. Sci. 11, 153–168. https://so06.tci-thaijo.org/index.php/IJBS/article/view/63288
Rodarte-Luna, B., and Sherry, A. (2008). Sex differences in the relation between statistics anxiety and cognitive/learning strategies. Contemp. Educ. Psychol. 33, 327–344. doi: 10.1016/j.cedpsych.2007.03.002
Rosli, M. K., Mistima, S., and Rosli, N. (2017). Students’ attitude and anxiety towards statistics a descriptive analysis. Res. Educ. Psychol. 1, 47–56.
Saidi, S. S., and Siew, N. M. (2022). Assessing secondary school students’ statistical reasoning, attitude towards statistics, and statistics anxiety. Stat. Educ. Res. J. 21, 6–6. doi: 10.52041/serj.v21i1.67
Sandoz, E. K., Butcher, G., and Protti, T. A. (2017). A preliminary examination of willingness and importance as moderators of the relationship between statistics anxiety and performance. J. Context. Behav. Sci. 6, 47–52. doi: 10.1016/j.jcbs.2017.02.002
Sesé, A. J., Jiménez López, R., Montaño Moreno, J. J., and Palmer Pol, A. L. (2015). Can attitudes toward statistics and statistics anxiety explain students' performance?. J. Psychodidactics, 2015, vol. 20, num., p. 285–304.
Sockol, L. E., Ellison, W. D., Stutts, L. A., and Knouse, L. E. (2021). Improving quantitative abilities and attitudes in clinical psychology courses: longitudinal assessment of a blended learning intervention. Teach. Psychol. 48, 316–327. doi: 10.1177/0098628320979894
Starcher, K. (2016). Reducing statistics anxiety in undergraduate business students. Curr. ACADEMIC 19.
Stefanovska–Petkovska, M., and Velikj, V. (2020). Statistics for all: analyses of attitudes and anxiety towards learning statistics in undergraduate and postgraduate students in medical and management sciences. For the publisher 114.
Steinberger, P. (2020). Assessing the statistical anxiety rating scale as applied to prospective teachers in an Israeli teacher-training college. Stud. Educ. Eval. 64:100829. doi: 10.1016/j.stueduc.2019.100829
Steinberger, P., Eshet, Y., and Grinautsky, K. (2021). No anxious student is left behind: statistics anxiety, personality traits, and academic dishonesty—lessons from COVID-19. Sustainability 13:4762. doi: 10.3390/su13094762
Teman, E. D. (2013a). A Rasch analysis of the statistical anxiety rating scale. J. Appl. Meas. 14, 414–434,
Teman, E. D. (2013b). Factorial invariance of the statistical anxiety rating scale across sex and students' classification. Compr. Psychol. 2, 02–20. doi: 10.2466/02.20.CP.2.1
Valle, N., Antonenko, P., Valle, D., Dawson, K., Huggins-Manley, A. C., and Baiser, B. (2021). The influence of task-value scaffolding in a predictive learning analytics dashboard on learners' statistics anxiety, motivation, and performance. Comput. Educ. 173:104288. doi: 10.1016/j.compedu.2021.104288
van Dijck, J. P., Abrahamse, E., Kesteloot, S., Willems, R., and Fias, W. (2022). Statistics anxiety in Flanders: exploring its level, antecedents, and performance impact across professional and academic bachelor programs in psychology. Int. Electron. J. Elem. Educ. 14, 363–385. doi: 10.26822/iejee.2022.250
Vigil-Colet, A., Lorenzo-Seva, U., and Condon, L. (2008). Development and validation of the statistical anxiety scale. Psicothema 20, 174–180,
Watson, F. S., Hess, M. R., Onwuegbuzie, A. J., Ferron, J. M., Kromrey, J. D., Hogarty, K. Y., et al. (2004). “Anxiety and attitudes in graduate statistics education: measuring and monitoring change during the semester” in Annual meeting of the eastern educational research association (Clearwater, Florida).
Watson, F. S., Lang, T. R., Kromrey, J. D., Ferron, J. M., Hess, M. R., and Hogarty, K. Y. 2003 An assessment blueprint for EncStat: a statistics anxiety intervention program
Watson, F. S., Lang, Thomas R., and Kromrey, Jeffrey D. 2002 Breaking ground for EncStat: a statistics anxiety intervention program
Welch, P. S., Jacks, M. E., Smiley, L. A., Walden, C. E., Clark, W. D., and Nguyen, C. A. (2015). A study of statistics anxiety levels of graduate dental hygiene students. Am. Dent. Hyg. Assoc. 89, 46–54.
Williams, A. (2012). Online homework vs. traditional homework: statistics anxiety and self-efficacy in an educational statistics course. Technol. Innov. Stat. Educ. 6. doi: 10.5070/T561011683
Williams, A. (2014). An exploration of preference for numerical information in relation to math self-concept and statistics anxiety in a graduate statistics course. J. Stat. Educ. 22. doi: 10.1080/10691898.2014.11889693,
Williams, A. S. (2010). Statistics anxiety and instructor immediacy. J. Stat. Educ. 18:187. doi: 10.1080/10691898.2010.11889495
Williams, A. S. (2013). Worry, intolerance of uncertainty, and statistics anxiety. Stat. Educ. Res. J. 12, 48–59. doi: 10.52041/serj.v12i1.321
Williams, A. S. (2015). Statistics anxiety and worry: the roles of worry beliefs, negative problem orientation, and cognitive avoidance. Stat. Educ. Res. J. 14, 53–75. doi: 10.52041/serj.v14i2.261
Wu, R., Chen, J., Li, Q., and Zhou, H. (2022). Reducing the influence of perfectionism and statistics anxiety on college student performance in statistics courses. Front. Psychol. 13:1011278. doi: 10.3389/fpsyg.2022.1011278,
Yaffe, Y., Bouhnik, O., and Burg, D. (2020). Further examination of the structure of the statistical anxiety rating scale (STARS) in international use. J. Appl. Meas. 21, 235–242.
Yeşildağ, A. Y. (2022). Sağlık Yönetimi Alanında Lisansüstü Eğitim Gören Öğretim Elemanlarının İstatistiksel Kaygı Düzeylerinin İncelenmesi. Avrupa Bilim Teknoloji Dergisi, (45), 88–92. doi:doi: 10.31590/ejosat.1222420
Younesi, S., Tavakoli, V., Hosseini, S., and Hashemizadeh, V. (2014). The relation between emotional self-efficacy and meta-cognitive believes with statistical anxiety. Indian J. Fundam. Appl. Life Sci. 4, 433–440.
Yulianto, A. (2022). The effect of statistics anxiety on statistics courses achievement in psychology students. Cendikia: Media Jurnal Ilmiah Pendidikan 13. http://www.iocscience.org/ejournal/index.php/Cendikia
Yunis, F. A. (2011). “Factors influencing the psychology student in dealing with statistics courses” in Avaliação Psicológica: Guia de consulta para estudantes e profissionais de psicologia. eds. R. Ambiel, et al. (Brasil–São Paulo: Casa da Psicologia).
Zahan, F. N., Kawsar, L. A., Islam, M. A., and Bhuia, M. R. (2020). Internal consistency and structure of the statistics anxiety rating scale: a multivariate analysis among undergraduate students of SUST, Sylhet.
Zhang, J. W., Kessler, E., and Braasch, J. L. (2021). Self-compassion mindsets can predict statistics course performance via intelligence mindsets and statistics anxiety. Learn. Individ. Differ. 90:102047. doi: 10.1016/j.lindif.2021.102047
Keywords: reliability generalization, meta-analysis, reliability induction, statistics anxiety, reliability
Citation: Özdemir EÖ and Yildirim I (2026) A reliability generalization meta-analysis of self-report measures of statistics anxiety. Front. Psychol. 16:1675957. doi: 10.3389/fpsyg.2025.1675957
Edited by:
Silvia Cristina da Costa Dutra, University of Zaragoza, SpainCopyright © 2026 Özdemir and Yildirim. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Ibrahim Yildirim, aXlpbGRpcmltODRAZ21haWwuY29t