- Department of Psychology, University of the Western Cape, Cape Town, South Africa
Background: Sleep is essential for physical health and psychological wellbeing, and insomnia is strongly associated with mental health difficulties, including depression, anxiety, and fatigue. Among first responders, the prevalence of insomnia is particularly high due to chronic exposure to stress, trauma, and irregular work hours.
Aim: As part of a broader study focusing on the mental health of first responders in South Africa, the current study examined the psychometric properties of the Insomnia Severity Index from three different psychometric perspectives: classical test theory, Rasch analysis and Mokken scale analysis.
Methods: Participants were first responders (n = 429) in the Western Cape province of South Africa and they included police officers (n = 309) and paramedics (n = 120). They completed the Insomnia Severity Index (ISI), the Patient Health Questionnaire-9, the Generalized Anxiety Disorder-7, and the Chalder Fatigue Questionnaire.
Results: The three psychometric paradigms converged to confirm that the ISI measures a unidimensional scale. Furthermore, all three paradigms provided evidence for the construct validity of the ISI. In addition, classical test theory indices provided evidence for convergent and discriminant validity. Lastly, the correlations between insomnia as measured by the ISI and depression, anxiety, and fatigue provided evidence for concurrent validity.
Conclusion: These findings affirm that the ISI is a stable and sound tool for assessing insomnia severity within the first responder population. The absence of measurement bias across gender and professional roles also enhances the practical utility of the ISI, as it ensures equitable assessment across subgroups within the first responder workforce. The ISI emerges from this study as a valuable resource for clinicians, researchers, and occupational health professionals working with South African first responders.
1 Introduction
Insomnia is a sleep disorder that is highly prevalent in the general population but it often remains underdiagnosed and untreated (Morin et al., 2023). It is characterized by difficulty initiating and/or maintaining sleep despite opportunities for sleep and is subjectively experienced as adversely affecting daytime functioning. Insomnia can be either acute, lasting for 3 days to a week, or it can be chronic and persist for at least 3 months (Vargas et al., 2020). Insomnia has consistently been associated with adverse physical health outcomes and is a risk factor for hypertension and cardiovascular disease owing to the elevation of cortisol levels arising from lack of sleep (Dean et al., 2023). Insomnia commonly co-occurs with mental health disorders such as anxiety and mood disorders. This has been ascribed to a bi-directional relationship between insomnia and these conditions. For instance, both generalized anxiety disorder and major depressive disorder are associated with persistent worrying, rumination about potential negative outcomes and frequent negative thoughts about oneself, the world and the future. These cognitive processes trigger autonomic arousal and emotional distress, which can impact sleep initiation and maintenance. Furthermore, it can lead to attentional bias toward internal and external cues that signal that sleep is being disrupted leading to an escalation in anxiety, which culminates in sleep deficits (Harvey, 2002; Tang et al., 2023).
Insomnia, in turn, can produce distress and adversely impact interpersonal and occupational functioning, which can aggravate psychological symptoms. Owing to the co-occurrence of insomnia and mental health disorders, interventions have typically prioritized treating the primary mental health condition, based on the assumption that the insomnia will remit once the underlying disorder is addressed. However, insomnia has been found to precede comorbid mental health disorders, aggravate the symptoms associated with them and persist despite effective treatment of the co-occurring condition (Morin et al., 2023). This has prompted the recognition of insomnia as an independent disorder, necessitating direct treatment and it has driven the development of instruments to accurately assess, diagnose and monitor sleep disturbances.
The Pittsburgh Sleep Quality Index (PSQI) and the Epworth Sleepiness Scale (ESS), for example, are frequently used instruments to assess sleep problems (Buysse et al., 2008; Salihu et al., 2022). However, these tools offer only general assessments of factors such as daytime sleepiness, sleep quality, and disturbances, and are not specifically designed to measure insomnia. The Insomnia Severity Index (ISI) is a prominent instrument for measuring insomnia severity and perceptions of the condition (Bastien et al., 2001). The ISI captures the diagnostic criteria for insomnia outlined in the Diagnostic and Statistical Manual of Mental Disorders (American Psychiatric Association, 2022). The ISI has been translated and validated into many languages including Korean (Cho et al., 2014), Portuguese (Clemente et al., 2017), Moroccan (Oneib et al., 2022), Spanish (Fernandez-Mendoza et al., 2012), Hausa (Salihu et al., 2022), and Persian (Sadeghniiat-Haghighi et al., 2014). These studies have confirmed that the ISI is a reliable instrument. The factor structure of the ISI has been examined in several studies, predominantly using Exploratory Factor Analysis (EFA) and Confirmatory Factor Analysis (CFA) and have provided mixed and inconsistent results.
The original validation study examining the psychometric properties of the ISI was conducted with a clinical population and employed EFA, which yielded a three-factor solution (Morin, 1993). This was supported by a study that used CFA in a non-clinical Spanish-speaking population (Fernandez-Mendoza et al., 2012). Subsequent research identified three distinct types of one-factor models and five different types of both two-factor and three-factor models. For instance, Gerber and colleagues validated the German version of the ISI among three independent samples and reported the scale was unidimensional (Gerber et al., 2016). Similarly, Dragioti and colleagues, analyzed the Swedish version of the ISI and found that a one factor solution had the best fit. Support for the two-factor solution has emerged from a range of studies including a validation study of the ISI in a sample of Korean university students (Lee and Kim, 2023) and research among Arabic patients diagnosed with chronic conditions (Al Maqbali et al., 2022). In contrast, the three-factor solution has been supported in research undertaken with Italian clinical patients diagnosed with insomnia (Castronovo et al., 2016) and Chinese undergraduate students (Lin et al., 2018). A systematic review and meta-analytic study on the structural validity of the ISI identified thirteen distinct models across the literature (Dilshad Manzar et al., 2021). The wide range of multifactorial models suggests that insomnia is composed of heterogeneous latent constructs that emerge from a common set of symptoms. This variability implies that individuals may experience and interpret insomnia symptoms in different ways, leading to diverse underlying factor structures. It also highlights the complexity of insomnia as a clinical phenomenon, where shared symptoms such as difficulty falling asleep, staying asleep, or experiencing non-restorative sleep, may cluster differently depending on the population studied, the context, or methodological approaches used.
The current study focused on South African first responders and examined the psychometric properties of the ISI from three different psychometric perspectives: Classical Test Theory (CTT), Rasch analysis and Mokken Scale Analysis (MSA). First responders are frontline workers responsible for emergency medical care. This group of professionals include emergency medical service personnel (e.g., paramedics and ambulance personnel), firefighters and law enforcement officials. Their primary responsibility is to provide rapid and often life-saving assistance in situations that range from medical crises and accidents to natural disasters, violent incidents, and other critical emergencies. Due to the nature of their work, first responders are frequently exposed to high-stress environments, traumatic events, and unpredictable scenarios (Arjmand et al., 2024).
Existing research has confirmed that insomnia is a highly prevalent condition among first responders. In their meta-analytical study on sleep disorders among first responders, Huang and colleagues reported a prevalence rate of 28% for insomnia (Huang et al., 2022). These sleep disturbances have been attributed to the demands of their occupation including irregular work shifts, frequent exposure to potentially traumatic events and the physical and emotional toll of responding to crisis situations. Recent evidence also supports fear of sleep as a central mechanism underlying insomnia among first responders and develops in response to certain post-traumatic stress disorder (PTSD) symptoms, including intrusive re-experiencing of trauma through nightmares (Reffi et al., 2023; Lebeaut et al., 2022). These fears drive maladaptive coping behaviors including avoidance of sleep or reluctance to sleep at night (Reffi et al., 2023; Werner et al., 2021). Chronic insomnia is associated with fatigue, mood disturbances, difficulties with alertness and concentration, cognitive impairment and reduced quality of life (Bard et al., 2023). Given the prevalence of insomnia among first responders and its serious implications for both physical and mental health, it remains essential to ensure that the tools used to assess this condition are psychometrically sound and contextually appropriate. Furthermore, existing research has highlighted a wide range of multifactorial models for the ISI, underscoring variability in how the instrument performs across different populations and settings (Dilshad Manzar et al., 2021). These inconsistencies point to the importance of examining the underlying structure and item functioning of the ISI in specific contexts rather than assuming universal applicability.
The ISI has been used in the South African context but the reporting of the psychometric properties of the scale has been limited. Bentley et al. (2025), for example, used the ISI to examine sleep deterioration in a retrospective study of the general population during the COVID-19 pandemic. The ISI has also been used to assess a sleep intervention among adolescents (Rossouw et al., 2024), sleep difficulties among youth (De Doncker and Mclean, 2022) and the impact of sleep, physical activity and sedentary behavior on symptoms of depression and anxiety before and during the COVID-19 pandemic (Lewis et al., 2021). The ISI has also been used to investigate fear of not being safe during sleep and its association with sleep quality among adults (Correia et al., 2024). However, the psychometric properties of the ISI were not reported. Phaswana-Mafuya et al. (2019) used two items of the ISI to measure insomnia symptoms among pregnant women and reported satisfactory reliability (α = 0. 82).
By employing three distinct psychometric perspectives, the current study aims to provide a more comprehensive and nuanced understanding of the instrument's performance. While CTT assesses the reliability and validity of the scale at a global level and provides information on internal consistency and factor structure, Rasch analysis provides insights into the functioning of individual items (Meijer et al., 1990). MSA is a nonparametric item response theory (IRT) method that further evaluates an instrument's scalability and monotonicity, which are important for determining whether items consistently reflect increasing levels of insomnia severity (Franco et al., 2022). Taken together, these analyses have the potential to provide robust evidence on the reliability, dimensionality, and construct validity of the ISI among South African first responders.
2 Materials and methods
2.1 Participants and procedure
Participants were first responders (n = 429) in the Western Cape province of South Africa and they included police officers (n = 309) and paramedics (n = 120). We constructed electronic versions of the instruments described in the Measures section using Google forms. With the permission of administrators of Facebook groups that consisted of first responders, we posted this electronic link together with an invitation to participate in the study on these Facebook sites. We also obtained permission from the South African Police Services (reference: 3/34/2, 27 June 2023) and the Western Cape Department of Health (reference: WC_202307_041, 15 September 2023) to conduct the study. These permissions allowed for research assistants to visit police stations and hospitals to access potential participants in person.
The sample was predominantly male (55%) and the first responders mainly worked in an urban area (92.3%). Fifty-one percent of the sample was married while 35.2% were single. Slightly less than half of the sample (49.7%) had a post-matric qualification and 49.2% indicated that they had a matric certificate. The mean age of the sample was 39 years (SD = 9.93) and the mean number of years working as a first responder was 13.24 years (SD = 9.65).
2.2 Measures
The current study was part of a broader study on the mental health of first responders in South Africa. In addition to the ISI, participants completed the Patient Health Questionnaire-9 (PHQ-9: Kroenke et al., 2001), the Generalized Anxiety Disorder-7 (GAD-7: Spitzer et al., 2006), and the Chalder Fatigue Questionnaire (CFQ: Chalder et al., 1993). The last three instruments were included for the purpose of establishing the concurrent validity of the ISI.
The ISI is a brief screening measure of insomnia and consists of seven items that are scored on a 5-point scale ranging from 0 to 4, where higher scores reflect more acute symptoms of insomnia. An example of an item of the ISI is “how noticeable to others do you think your sleeping problem is in terms of impairing the quality of your life?”. The initial validation study reported an internal consistency estimate of α = 0.74 for the scores of the ISI (Bastien et al., 2001). The authors also demonstrated that the ISI was sensitive enough to detect changes in insomnia resulting from treatment interventions. Although the ISI has previously been used in South Africa (eg., Correia et al., 2024), we could not find any study that reported on the reliability of ISI scores when used in the country.
The PHQ-9 is a nine-item measure that is used to screen for and diagnose depression. It is responded to on a four-point scale that ranges from “not at all” (0) to “nearly every day” (3), and higher scores on the PHQ-9 reflect higher levels of depression. An example of an item of the PHQ-9 is “over the last 2 weeks, how often have you been bothered by feeling tired or having little energy?” The initial validation study reported Cronbach's alphas of 0.89 and 0.86 for the scores of the PHQ-9 in two different studies and correlations with the Short-Form General Health Survey (Stewart et al., 1988) served as evidence for construct validity (Kroenke et al., 2001). The PHQ-9 has previously been used in South Africa and Cronbach's alphas greater than 0.70 have generally been reported (e.g., Bhana et al., 2015; Rakshasa-Loots et al., 2023; Kigozi, 2020).
The GAD-7 consists of seven items and is used to screen for generalized anxiety disorder. Responses to the seven items are made on a four-point scale that ranges from “not at all” (0) to “nearly every day” (3), and higher scores on the GAD-7 reflect higher levels of depression. An example of an item of the GAD-7 is “over the last 2 weeks, how often have you been bothered by being so restless that it is hard to sit still?” The authors of the GAD-7 (Spitzer et al., 2006) reported an estimate of internal consistency of 0.92 for the scores of the GAD-7 with patients in primary care facilities and also provided evidence of criterion, construct, and factorial validity. In South Africa, reported reliability estimates for the scores of GAD-7 were generally greater than 0.70 (e.g., Kigozi, 2021; Tadi et al., 2022).
The CFQ consists of 11 items and is used to measure the extent and severity of fatigue and general tiredness. It is scored on a four-point scale that ranges from “less than usual” (0) to “much more than usual” (3) and higher scores reflect greater fatigue. An example of an item of the CFQ is “do you have problems with tiredness?” The authors of the scale (Chalder et al., 1993) reported an estimate of internal consistency of 0.89 for the scores of the CFQ and factor as well as receiver operating curve analyses provided support for the validity of the CFQ. In South Africa, only one study was found that used the CFQ (Coetzee et al., 2018: α = 0.83).
2.3 Ethics
The study received ethical approval from the Humanities and Social Sciences Research Ethics Committee of the University of the Western Cape (ethics reference: HS23/2/4, May 23, 2023) and was conducted in accordance with the guidelines of the Declaration of Helsinki. Participants provided informed consent on the first page of the electronic link. Participation was voluntary and participants were informed on the landing page of the electronic survey that they may withdraw from the study at any time.
2.4 Analysis
CTT analyses were conducted with IBM SPSS for Windows version 30 and IBM SPSS Amos for Windows version 28 (IBM Corp., Armonk, NY, USA). Rasch analysis was conducted with Winsteps 5.8.0 (Linacre, 2023) and MSA with the package “Mokken” (Van Der Ark, 2012) in R software (R Development Core Team, 2020). There were no missing data as participants had to respond to every item of the survey.
2.4.1 CTT indices
CTT indices obtained with SPSS included skewness and kurtosis, descriptive statistics (means and standard deviations), inter-item correlations, Item-Total Correlations (ITC), reliability indices (alpha, omega and composite reliability—CR), average variance extracted (AVE), maximum shared variance (MSV), average shared variance (ASV), and intercorrelations between variables (for concurrent validity). We used the indices of skewness and kurtosis to examine the distribution of data and in general data is regarded as normally distributed if skewness and kurtosis values ranged between −2 and +2 (SmartPLS, 2024). The internal consistency of the ISI was assessed using inter-item correlations, ITC, and the various reliability indices. Inter-item correlations should be between 0.15 and 0.85 and the average inter-item correlations should be between 0.15 and 0.50, while ITC should be greater than 0.50 (Hajjar, 2018; Paulsen and Brckalorenz, 2017). ITC that meet these criteria also contribute to the construct validity of the ISI since this would demonstrate that the items contribute to the measurement of the latent variable, insomnia (Devon et al., 2007). With respect to the reliability coefficients, it is recommended that an acceptable reliability coefficient for scales should be at least 0.70 for research purposes and much higher for clinical purposes (Devon et al., 2007). AVE refers to the amount of item variance extracted by the latent variable and if it is greater than 0.50 and less than CR it is indicative of convergent validity (Posch et al., 2019). MSV is the square of the correlation between insomnia and the variable it is most highly related to while ASV is the mean of the squared correlations between insomnia and all variables. Discriminant validity is established if ASV is greater than MSV and ASV, as this would indicate that the latent construct has more in common with the items that contribute to its measurement than with other related variables (Almén et al., 2018). With respect to concurrent validity we hypothesized, based on the available literature (Riemann et al., 2020; Taylor et al., 2005; Vethe et al., 2018), that insomnia would be positively associated with depression, anxiety, and fatigue.
2.4.2 Factor structure and dimensionality
To examine the factor structure of the ISI, we conducted an EFA (principal components with varimax rotation) as well as a CFA. Prior to the EFA we used the Kaiser–Meyer–Olkin (KMO) measure of sampling adequacy and Bartlett's test of sphericity to determine whether the data is suitable for factor analysis. If Bartlett's test was significant and KMO was greater than 0.50, items would be considered sufficiently correlated to conduct factor analysis. When factor loadings (λ) in EFA are greater than 0.50 it also provides evidence of construct validity as it would indicate that the latent construct is well represented by the individual items (Hair et al., 2010). We used CFA in SPSS Amos to examine three models of the factor structure of the ISI, a one-factor model in which items load on a single factor, a correlated two-factor model in which items load on two factors (called severity and impact which are the subscales most commonly identified in the literature) that are correlated, and a bifactor model in which items load on a total scale (also called a general factor) as well as on the two subscales (also referred to as specific factors) of severity and impact. To examine the extent to which the models fit the data to an acceptable degree, the following are the minimum fit indices recommended (Gaskin et al., 2022): chi-squared (χ2: should ideally be non-significant), relative χ2 (χ2 divided by degrees of freedom: should be between 1 and 3), the comparative fit index (CFI: should be ≥0.95), the standardized root mean square residual (SRMR: should be ≤ 0.08), root mean square error of approximation (RMSEA: should be ≤ 0.06) and the p-value of close fit (PCLOSE: should be statistically non-significant i.e., p > 0.05). In addition to these recommended fit indices, we also included a model comparison index, Akaike's information criterion (AIC) and lower values of AIC indicate better models (Arbuckle, 2012) and an AIC difference (ΔAIC) between the baseline model and another model of greater than −2 reflects that the baseline model is the better model (Bevans, 2023).
In the case of the bifactor model we also used a freely available online Excel spreadsheet (Dueber, 2017) to obtain ancillary bifactor indices which clarifies whether the specific factors explained a sufficient amount of variance in the items over and above the variance explained by the general factor. The bifactor indices included explained common variance (ECV), omega hierarchical (omegaH for the general factor and omegaHS for specific factors) and the percentage of uncontaminated correlations (PUC). ECV is the percentage of variance explained by the general factor and the specific factors, respectively and an ECV greater than 0.70 for the general factor would indicate that the instrument is essentially unidimensional (Rodriguez et al., 2016). OmegaH is an estimate that represents the proportion of variance in observed scores that is due to the general factor. In the case of the specific factors, omegaHS refers to the proportion of systematic variance accounted for by a specific factor after the variance attributable to the general factor and other specific factors is controlled. An omegaH greater than 0.80 would indicate a very reliable factor (Schmitt et al., 2018). PUC refers to the percentage of correlations between items which reflect variance from the general factor and it is suggested that PUC should be used in conjunction with ECV and omegaH to draw conclusions about the dimensionality of an instrument. In this regard, Reise et al. (2013) suggested that when PUC is less than 0.80, ECV greater than 0.60, and omegaH greater than 0.80, any dimensionality that might exist is not strong enough to override an interpretation of the instrument in question as unidimensional.
To further examine the dimensionality of the ISI, we conducted parallel analysis using SPSS syntax that is freely available online (O'connor, 2000). Parallel analysis is regarded as one of the most accurate ways to determine the number of factors present in a set of items (Zwick and Velicer, 1986) and it involves comparing eigenvalues obtained in the current study with eigenvalues obtained in a number of simulated datasets (n = 1000). The number of eigenvalues obtained in the current dataset that are greater than the 95th percentile of the simulated eigenvalues represent significant factors (Hayton et al., 2004). If only one eigenvalue is greater than the 95th percentile of the simulated eigenvalues, the instrument in question should be regarded as unidimensional.
2.4.3 Rasch analysis
The indices obtained in the Rasch analysis included infit and outfit mean square (MnSq) as well as item and person separation reliability and index and these indices contribute to assessing the construct validity of the ISI. Infit and outfit MnSq provide an indication of the extent to which items fit the Rasch model and values between 0.5 and 1.5 indicate good fit (Linacre, 2023). Person separation reliability and index provide an indication as to whether the scale can distinguish between low and high scorers on the latent variable and it is recommended that person separation index should be greater than 2 and person separation reliability greater than 0.80. Item separation reliability and index assess whether an item hierarchy exist and an item separation index greater than 3 together with an item separation reliability greater than 0.80 are recommended (Linacre, 2023). In Rasch analysis a principal component analysis (PCA) of the residuals (after the latent trait has been removed) is used to determine dimensionality. If an additional dimension, called the first contrast, has an eigenvalue greater than 2, it would be indicative of two or more items loading on a second factor, thus suggesting that an instrument is multidimensional. We also used Rasch analysis to determine measurement invariance across gender and across type of first responder using a Rasch index called Differential Item Functioning (DIF). DIF values greater than 0.50 would indicate that the item does not measure the same construct across groups (Linacre, 2023).
2.4.4 Mokken analyses
In MSA, dimensionality is assessed using an algorithm called automated item selection procedure (AISP) which assigns a value of zero if an item is unscalable (does not load on a scale) and as many values as there are scales within a set of items. If all items are assigned the value of 1, it would indicate that all items load on a single scale (Van Der Ark, 2012). The strength of a scale is assessed by a scalability coefficient, H and an H value greater than 0.50 indicates a strong scale, while an H value less than 0.30 reflects a weak scale (Wind, 2017). MSA also provides an H value for each individual item (Hi) which indicates the extent to which the item contributes to the measurement of the latent variable and Hi values should be greater than 0.30 (Mokken, 2011). An important assumption in MSA is monotonicity which implies that the probability of endorsing an item increases as the value of the latent variable increases. Violations of this assumption is indicated by the symbol #vi (the number of violations) and the index Crit—a Crit value greater than 80 indicates a significant violation of monotonicity (Sijtsma and Van Der Ark, 2017). Lastly, MSA also provides a Mokken reliability coefficient, MSrho.
3 Results
The inter-item correlations of the ISI as well as the CTT, Rasch and Mokken indices are reported in Table 1.
Table 1 shows that the inter-item correlations ranged between 0.33 and 0.72 and all were statistically significant (p < 0.001). The average inter-item correlation was 0.50. The item-total correlations ranged between 0.59 and 0.73 and were all statistically significant (p < 0.001). KMO was greater than 0.50 (0.86) and Bartlett's test was significant, indicating that the data was suitable for factor analysis. EFA (principal components with varimax rotation) extracted one factor which accounted for 57.68% of the variance. The factor loadings ranged between 0.71 and 0.82 and were all statistically significant (p < 0.001).
With respect to the Rasch indices, infit and outfit MnSq values were at an acceptable level and ranged between 0.78 and 1.33 for infit MnSq and between 0.83 and 1.25 for outfit MnSq. The items of the ISI demonstrated measurement invariance across gender (DIF = −0.23 to 0.29) and across type of first responder (DIF = −0.31 to 0.46).
With regards to the Mokken indices, Table 1 also shows that AISP found that all of the items of the ISI loaded on one scale as denoted by the value 1 for all items. Furthermore, the Hi coefficients for the individual items of the ISI were all above 0.30 and ranged between 0.49 and 0.59. There were no violations of monotonicity as #vi and Crit values were 0 for all items of the ISI.
The three models of the factor structure of the ISI that were examined with CFA are shown in Figure 1, together with standardized factor loadings. Figures 1A–C depict the one-factor, correlated two-factor, and bifactor models, respectively.

Figure 1. Three models of the factor structure of the insomnia severity index. (A) = one-factor model, Figure (B) = correlated two-factor model, Figure (C) = bifactor model. Rectangles are observed variables, ellipses are latent variables. All regression coefficients are standardized.
The fit indices associated with the three models in Figure 1 are reported in Table 2.
The fit indices in Table 2 indicated that all three models fit the data to an acceptable degree (relative χ2 < 3, CFI > 0.95, SRMR < 0.08, RMSEA ≤ 0.06, PCLOSE non-significant). However, the model comparison index, AIC, indicated that both the bifactor model and the one-factor model (AIC difference was 2) were better fits than the correlated two-factor model (AIC difference between the bifactor model and the correlated two-factor model was 5.7). In addition, the two factors were substantially correlated (r = 0.77, p < 0.001). Also, while the bifactor model demonstrated an acceptable fit, Figure 1C shows that the regression coefficients of the severity subscale were very problematic since two of the items were very low (“Difficulty falling asleep” and “Difficulty staying asleep”) and two were negative (“Early morning awakenings” and “Dissatisfaction with sleep”). This is in stark contrast to the factor loadings associated with these four items on the same subscales in the correlated two-factor model where the factor loadings were greater than 0.60. Both factor loadings that were very low were statistically significant (i.e., 0.23 and 0.13). This is to be expected given a sample size of n = 429, but it is way below the accepted threshold of 0.50 for meaningful factor loadings.
Since CFA found that the bifactor structure fitted the data to an acceptable degree, ancillary bi-factor indices were calculated to determine whether the specific factors (two subscales) explained a sufficient amount of variance over and above that explained by the general factor (total scale). In this regard the bifactor indices indicated that the general factor, insomnia, explained 73% of the variance in the items (ECV) while the specific factors explained 27% of the variance (ECVseverity = 8.3%, ECVimpact = 18.7%). OmegaH for the general factor was 0.82, reflecting a very reliable general factor, while for the specific factors it was 0.01 and 0.39, respectively. Further, a PUC value of 0.57 together with an ECV of 0.73 and omegaH of 0.82 provides strong evidence that the ISI should be regarded as essentially unidimensional. Parallel analysis also extracted only one component with an eigenvalue (4.04) that was greater than the eigenvalue of the 95th percentile (1.25) of simulated eigenvalues. The eigenvalue of the second component (0.93) was less than the 95th percentile (1.15) of simulated eigenvalues.
The CTT, Rasch and Mokken indices at scale level for the unidimensional ISI are reported in Table 3.
Table 3 indicates that the CTT, Rasch and Mokken indices for the unidimensional ISI were all at an acceptable level. In the first instance, the eigenvalue of the first contrast was less than 2. Secondly, all the estimates of internal consistency were greater than 0.70 (α and ω = 0.87, CR = 0.90, MSrho = 0.89). Thirdly, AVE was greater than 0.50 and MSV and ASV were less than AVE. The scalability coefficient in Mokken analysis was greater than 0.50 (H = 0.55). The item separation reliability and index in the Rasch analysis were greater than 0.80 and 3, respectively, while the person separation reliability and index were greater than 0.80 and 2, respectively.
The descriptive statistics for and the intercorrelations between insomnia and the other variables that were included for the purpose of establishing concurrent validity, are reported in Table 4.
The indices of skewness in Table 4 ranged between 0.12 and 0.38, and those of kurtosis between −0.81 and −0.25. The values of skewness and kurtosis were thus within the acceptable range of −2 to +2. Table 4 also indicates that insomnia was significantly positively related to depression (r = 0.52, p < 0.001, large effect size), anxiety (r = 0.49, p < 0.001, medium effect size), and fatigue (r = 0.61, p < 0.001, large effect size). The positive relationships would indicate that higher levels of insomnia were associated with higher levels of depression, anxiety, and fatigue. The PHQ-9 contains one item related to sleep difficulty and to ensure that the correlation between depression and insomnia was not excessively influenced by this one item, we also examined the correlation between the two variables without this one item. The correlation remained a large effect (r = 0.50, p < 0.001 and the reliability of the PHQ-9 was not significantly affected (α = 0.88).
4 Discussion
This study aimed to evaluate the psychometric properties of the ISI among South African first responders, a population uniquely vulnerable to sleep disturbances due to the demanding and stressful nature of their work. Drawing on CTT, Rasch analysis and MSA, the study aimed to provide a comprehensive assessment of the instrument's reliability, validity and dimensional structure within this context.
Overall, the psychometric properties of the ISI, inclusive of reliability and validity, can be regarded as highly satisfactory. In the first instance, evidence from the three different psychometric paradigms overwhelmingly favored a unidimensional interpretation of the ISI, a finding that supports the one-factor solution reported in other studies (e.g., Gerber et al., 2016; Kaufmann et al., 2019). EFA extracted one factor and CFA fit indices found that a one-factor model and a bifactor model fit the data to an acceptable degree. However, ancillary bifactor indices found that the majority of the item variance was explained by a very reliable general factor. The unidimensionality of the ISI was confirmed with parallel analysis in that only one eigenvalue in the current dataset was greater than the 95th percentile of simulated eigenvalues. Further, a PCA of the residuals in Rasch analysis confirmed a unidimensional solution and AISP in Mokken analysis showed that all of the items of the ISI loaded on a single scale. The scalability coefficient in Mokken analysis, H was greater than.50 indicating a very strong unidimensional scale.
All the estimates of the reliability of the scores of the ISI may be considered satisfactory. The CTT reliability indices, alpha, omega and CR as well as the MSA index, Msrho exceeded 0.80 and was therefore well above the recommended threshold of 0.70 (Devon et al., 2007).
All three psychometric paradigms provided evidence for the construct validity of the ISI. In CTT the inter-item correlations, item-total correlations and factor loadings were within the ranges recommended in the literature (Hajjar, 2018; Devon et al., 2007; Paulsen and Brckalorenz, 2017; Hair et al., 2010) thus demonstrating that all the individual items of the ISI contribute to the measurement of the latent variable, insomnia. In Rasch analysis infit and outfit MnSq values were within an acceptable range (Linacre, 2023), thus confirming that all the items of the ISI fit the Rasch model. The person separation index and reliability as well as the item separation index and reliability confirmed that the ISI can distinguish between low and high scorers on the latent variable, and that an item hierarchy exists. DIF in Rasch analysis confirmed the measurement invariance across gender and type of first responder, thus indicating that the items of the ISI measure the same construct in men and women as well as in police officers and paramedics. Lastly, in MSA the scalability coefficients for the individual items of the ISI, Hi, exceeded the threshold identified in the literature (Mokken, 2011), and these scalability coefficients, similar to item-total correlations, confirmed that all the items of the ISI contributed to the measurement of insomnia and that the latent construct of insomnia was well represented by its indicators.
CTT analysis also provided evidence for other forms of construct validity. AVE was greater than 0.50 and less than CR and according to Posch et al. (2019) this serves as evidence for convergent validity. AVE was greater than MSV and ASV and since this would indicate that the latent construct had more in common with the items that contributed to its measurement than with other related variables it provides evidence for discriminant validity. Lastly the strong correlations between insomnia as measured by the ISI and depression, anxiety, and fatigue provides evidence for the concurrent validity of the ISI.
The findings of this study offer important theoretical and practical implications for the assessment of insomnia, particularly within high-risk occupational groups such as first responders. From a theoretical perspective, the results reinforce and extend the existing understanding of the latent structure of insomnia as measured by the ISI. Evidence from all three psychometric paradigms favored a strong unidimensional solution, thus supporting the conceptualization of insomnia as a unified construct rather than a multidimensional syndrome in this population. This suggests that despite symptom diversity, the experience of insomnia can be meaningfully captured along a single underlying dimension. These findings affirm that the ISI is a stable and theoretically coherent tool for assessing insomnia severity across diverse groups within the first responder population. In terms of practical implications, the findings support the use of the ISI as a screening and diagnostic tool for insomnia among South African first responders. Its unidimensional structure simplifies scoring and interpretation, making it suitable for routine clinical use, as well as for incorporation into mental health surveillance and employee wellness programs. The ISI can assist in the early identification of individuals at risk, facilitate timely referrals, and guide the implementation of evidence-based interventions to mitigate the impact of sleep disturbances on overall wellbeing and job performance. The absence of measurement bias across gender and professional roles (e.g., paramedics vs. police officers) also enhances the practical utility of the ISI, as it ensures equitable assessment across subgroups within the first responder workforce.
While the findings of this study provide strong support for the unidimensionality, reliability and validity of the ISI when used with South African first responders, several limitations should be acknowledged. The study employed a cross-sectional design, which limits the ability to draw conclusions about the temporal stability of the ISI scores. Longitudinal research is needed to assess test–retest reliability and to evaluate how insomnia symptoms and ISI scores may change over time, particularly in response to interventions or varying occupational stressors. All data, including responses to the ISI and other psychological scales, were based on self-report. While self-report is a widely accepted method in research, it is susceptible to response biases such as social desirability, recall bias, or underreporting of symptoms due to stigma, especially within occupational groups like first responders. Although the study included a diverse sample of first responders (e.g., paramedics and police officers), the findings may not be generalizable to all emergency service personnel across South Africa or to first responders in other low- and middle-income countries. Furthermore, the use of the same subject samples for EFA and CFA represents another limitation. This approach may lead to overfitting and inflated estimates of model fit. Future research is needed to address these limitations by utilizing separate samples for EFA and CFA to enhance the validity and generalizability of the findings. Finally, the survey was administered online, which limited the ability to fully validate the population. The online format, while providing broad accessibility, restricted the ability to track those recruited from specific locations. Future studies could address this limitation by using more controlled recruitment methods to enhance population validation.
5 Conclusion
This study provides comprehensive evidence for the reliability and validity of the ISI among South African first responders, using three complementary psychometric approaches, namely CTT, Rasch and Moken analyses. The ISI had a unidimensional structure across all methods, which suggests that the instrument effectively captures the core features of insomnia as a single latent construct in this high-risk population. Furthermore, the instrument demonstrated strong internal consistency, item functioning, and measurement invariance across gender and professional subgroups, indicating that it is both psychometrically robust and fair. Given the high prevalence of insomnia and its impact on the physical and psychological functioning and occupational performance of first responders, the availability of a validated, contextually appropriate instrument remains essential. The ISI emerges from this study as a valuable resource for clinicians, researchers, and occupational health professionals working with South African first responders.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by the Humanities and Social Sciences Research Ethics Committee of the University of the Western Cape. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
TP: Writing – review & editing, Methodology, Writing – original draft, Conceptualization, Investigation, Visualization, Formal analysis, Data curation. AP: Project administration, Conceptualization, Supervision, Writing – review & editing, Methodology, Writing – original draft, Investigation.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Abbreviations
ISI, Insomnia Severity Index; PHQ-9, Patient Health Questionnaire-9; GAD-7, Generalized Anxiety Disorder-7; CFQ, Chalder Fatigue Questionnaire; EFA, Exploratory factor analysis; CFA, Confirmatory factor analysis; CTT, Classical test theory; MSA, Mokken Scale Analysis; ITC, Item-total correlation; AVE, Average Variance Extracted; MSV, Maximum Shared Variance; ASV, Average Shared Variance; CR, Composite Reliability; KMO, Kaiser–Meyer–Olkin; CFI, Comparative Fit Index; SRMR, Standardized Root Mean Square Residual; RMSEA, Root Mean Square Error Of Approximation; PCLOSE, p-value of Close Fit; AIC, Akaike's Information Criterion; ECV, Explained Common Variance; PUC, Percentage of Uncontaminated Correlations; MnSq, Mean Square; PCA, Principal Component Analysis; DIF, Differential Item Functioning; AISP, Automated Item Selection Procedure.
References
Al Maqbali, M., Madkhali, N., and Dickens, G. L. (2022). Psychometric properties of the insomnia severity index among arabic chronic diseases patients. SAGE Open Nurs. 8:7278. doi: 10.1177/23779608221107278
Almén, N., Lundberg, H., Sundin, Ö., and Jansson, B. (2018). The reliability and factorial validity of the Swedish version of the recovery experience questionnaire. Nord. Psychol. 70, 324–333. doi: 10.1080/19012276.2018.1443280
American Psychiatric Association (2022). Diagnostic and Statistical Manual of Mental Disorders: DSM-5-TR, Washington, DC: American Psychiatric AssociationPublishing. doi: 10.1176/appi.books.9780890425787
Arjmand, H.-A., O'donnell, M. L., Putica, A., Sadler, N., Peck, T., Nursey, J., et al. (2024). Mental health treatment for first responders: an assessment of mental health provider needs. Psychol. Serv. 21, 489–499. doi: 10.1037/ser0000832
Bard, H. A., O'driscoll, C., Miller, C. B., Henry, A. L., Cape, J., and Espie, C. A. (2023). Insomnia, depression, and anxiety symptoms interact and individually impact functioning: a network and relative importance analysis in the context of insomnia. Sleep Med. 101, 505–514. doi: 10.1016/j.sleep.2022.12.005
Bastien, C. H., Vallières, A., and Morin, C. M. (2001). Validation of the insomnia severity index as an outcome measure for insomnia research. Sleep Med. 2, 297–307. doi: 10.1016/S1389-9457(00)00065-4
Bentley, A., Roden, L. C., Davy, J. P., Iacovides, S., Gómez-Oliv,é, F. X., Scheuermaier, K., et al. (2025). Deterioration of sleep and mental health in individuals with insomnia during South Africa's COVID-19 lockdown. S. Afr. J. Sci. 121, 1–8. doi: 10.17159/sajs.2025/16682
Bevans, R. (2023). Akaike Information Criterion: When and How To Use It (Example). SCRIBBR. Available online at: https://www.scribbr.com/statistics/akaike-information-criterion/ (Accessed March 9, 2025).
Bhana, A., Rathod, S. D., Selohilwe, O., Kathree, T., and Petersen, I. (2015). The validity of the patient health questionnaire for screening depression in chronic care patients in primary health care in South Africa. BMC Psychiatr. 15, 118–118. doi: 10.1186/s12888-015-0503-0
Buysse, D. J., Hall, M. L., Strollo, P. J., Kamarck, T. W., Owens, J., Lee, L., et al. (2008). Relationships between the Pittsburgh Sleep Quality Index (PSQI), Epworth Sleepiness Scale (ESS), and clinical/polysomnographic measures in a community sample. J. Clin. Sleep Med. 4, 563–571. doi: 10.5664/jcsm.27351
Castronovo, V., Galbiati, A., Marelli, S., Brombin, C., Cugnata, F., Giarolli, L., et al. (2016). Validation study of the Italian version of the Insomnia Severity Index (ISI). Neurol. Sci. 37, 1517–1524. doi: 10.1007/s10072-016-2620-z
Chalder, T., Berelowitz, G., Pawlikowska, T., Watts, L., Wessely, S., Wright, D., and Wallace, E. P. (1993). Development of a fatigue scale. J. Psychosom. Res. 37, 147–153. doi: 10.1016/0022-3999(93)90081-P
Cho, Y. W., Song, M. L., and Morin, C. M. (2014). Validation of a Korean version of the insomnia severity index. J. Clin. Neurol. 10, 210–215. doi: 10.3988/jcn.2014.10.3.210
Clemente, V., Gomes, A. A., Mendes, M. M., Marques, D., Russo, B., Serra, J., et al. (2017). The European Portuguese version of the Insomnia Severity Index (ISI): reliability, validity and diagnostic accuracy. Sleep Med. 40, e67–e67. doi: 10.1016/j.sleep.2017.11.191
Coetzee, B., Loades, M. E., Du Toit, S., Read, R., and Kagee, A. (2018). Fatigue among South African adolescents living with HIV: is the Chalder Fatigue Questionnaire a suitable measure and how common is fatigue? Vul. Child. Youth Stud. 13, 305–316. doi: 10.1080/17450128.2018.1510147
Correia, A. T. L., Forshaw, P. E., Roden, L. C., Lipinska, G., Rauch, H. G. L., Lambert, E. V., et al. (2024). Associations between fears related to safety during sleep and self-reported sleep in men and women living in a low-socioeconomic status setting. Sci. Rep. 14, 3609–3609. doi: 10.1038/s41598-024-54032-w
De Doncker, K., and Mclean, N. (2022). Social media, sleep difficulties and depressive symptoms: a case study of South African youth in Cape Town. Technol. Soc. 70:102038. doi: 10.1016/j.techsoc.2022.102038
Dean, Y. E., Shebl, M. A., Rouzan, S. S., Bamousa, B. A. A., Talat, N. E., et al. (2023). Association between insomnia and the incidence of myocardial infarction: a systematic review and meta-analysis. Clin. Cardiol. 46, 376–385. doi: 10.1002/clc.23984
Devon, H. A., Block, M. E., Moyle-Wright, P., Ernst, D. M., Hayden, S. J., Lazzara, D. J., et al. (2007). A psychometric toolbox for testing validity and reliability. J. Nurs. Schol. 39, 155–164. doi: 10.1111/j.1547-5069.2007.00161.x
Dilshad Manzar, M., Haitham, J., and Bahammam, A. S. (2021). Structural validity of the Insomnia Severity Index: a systematic review and meta-analysis. Sleep Med. Rev. 60:101531. doi: 10.1016/j.smrv.2021.101531
Dueber, D. M. (2017). Bifactor Indices Calculator: A Microsoft Excel-Based Tool To Calculate Various Indices Relevant To Bifactor CFA Models. Available online at: https://uknowledge.uky.edu/edp_tools/1/ (Accessed January 11, 2025).
Fernandez-Mendoza, J., Rodriguez-Muñoz, A., Vela-Bueno, A., Olavarrieta-Bernardino, S., Calhoun, S. L., Bixler, E. O., and Vgontzas, A. N. (2012). The Spanish version of the Insomnia Severity Index: a confirmatory factor analysis. Sleep Med. 13, 207–210. doi: 10.1016/j.sleep.2011.06.019
Franco, V. R., Laros, J. A., and Bastos, R. V. S. (2022). Theoretical and practical foundations of Mokken Scale analysis in psychology. Paidéia Psicol. Educ. 32. doi: 10.1590/1982-4327e3223
Gaskin, J., Lim, J., and Steed, J. (2022). “Model Fit Summary”, AMOS Plugin Available online at: http://statwiki.gaskination.com (Accessed February 8, 2025).
Gerber, M., Lang, C., Lemola, S., Colledge, F., Kalak, N., Holsboer-Trachsler, E., et al. (2016). Validation of the German version of the insomnia severity index in adolescents, young adults and adult workers: results from three cross-sectional studies. BMC Psychiatr. 16, 174–174. doi: 10.1186/s12888-016-0876-8
Hair, J. F., Black, W. C., Babin, B. J., and Anderson, R. E. (2010). Multivariate Data Analysis, Englewood Cliffs, NJ:Prentice Hall.
Hajjar, S. (2018). Statistical analysis: internal-consistency reliability and construct validity. Int. J. Quant. Qual. Res. Methods 6, 46–57.
Harvey, A. G. (2002). A cognitive model of insomnia. Behav. Res. Ther. 40, 869–893. doi: 10.1016/S0005-7967(01)00061-4
Hayton, J. C., Allen, D. G., and Scarpello, V. (2004). Factor retention decisions in exploratory factor analysis: a tutorial on parallel analysis. Organ. Res. Methods 7, 191–205. doi: 10.1177/1094428104263675
Huang, G., Lee, T.-Y., Banda, K. J., Pien, L.-C., Jen, H.-J., Chen, R., et al. (2022). Prevalence of sleep disorders among first responders for medical emergencies: a meta-analysis. J. Glob. Health 12:04092. doi: 10.7189/jogh.12.04092
Kaufmann, C. N., Orff, H. J., Moore, R. C., Delano-Wood, L., Depp, C. A., and Schiehser, D. M. (2019). Psychometric characteristics of the Insomnia Severity Index in veterans with history of traumatic brain injury. Behav. Sleep Med. 17, 12–18. doi: 10.1080/15402002.2016.1266490
Kigozi, G. (2020). Confirmatory factor analysis of the Patient Health Questionnaire-9: a study amongst tuberculosis patients in the Free State province. S. Afr. J. Infect. Dis. 35, e1–e6. doi: 10.4102/sajid.v35i1.242
Kigozi, G. (2021). Construct validity and reliability of the generalised anxiety disorder-7 scale in a sample of tuberculosis patients in the Free State Province, South Africa. S. Afr. J. Infect. Dis. 36, e1–e6. doi: 10.4102/sajid.v36i1.298
Kroenke, K., Spitzer, R. L., and Williams, J. B. W. (2001). The PHQ-9: validity of a brief depression severity measure. J. Gen. Int. Med. 16, 606–613. doi: 10.1046/j.1525-1497.2001.016009606.x
Lebeaut, A., Zegel, M., Healy, N. A., Rogers, A. H., Buser, S. J., and Vujanovic, A. A. (2022). PTSD symptom severity, pain intensity, and pain-related disability among trauma-exposed firefighters: the moderating role of mindfulness. Mindfulness 13, 786–798. doi: 10.1007/s12671-022-01836-4
Lee, B., and Kim, Y. E. (2023). Validation of the Insomnia Severity Index among university students in Korea. OBM Neurobiol. 7, 1–14. doi: 10.21926/obm.neurobiol.2304196
Lewis, R., Roden, L. C., Scheuermaier, K., Gomez-Olive, F. X., Rae, D. E., Iacovides, S., et al. (2021). The impact of sleep, physical activity and sedentary behaviour on symptoms of depression and anxiety before and during the COVID-19 pandemic in a sample of South African participants. Sci. Rep. 11:24059. doi: 10.1038/s41598-021-02021-8
Lin, R.-M., Xie, S.-S., Yan, W.-J., and Yan, Y.-W. (2018). Factor structure and psychometric properties of the Insomnia Severity Index in Mainland China. Soc. Behav. Personal. 46, 209–218. doi: 10.2224/sbp.6639
Linacre, J. M. (2023). A User's Guide to Winsteps. Portland, Oregon: Winsteps.com. Available online at: https://www.winsteps.com/winman/copyright.htm (Accessed December 11, 2024).
Meijer, R. R., Sijtsma, K., and Smid, N. G. (1990). Theoretical and empirical comparison of the Mokken and the Rasch approach to IRT. Appl. Psychol. Measur. 14, 283–298. doi: 10.1177/014662169001400306
Mokken, R. J. (2011). A Theory and Procedure of Scale Analysis, The Hague, Netherlands: De Gruyter Mouton.
Morin, C. M. (1993). Insomnia: Psychological Assessment and Management. New York, NY: Guilford press.
Morin, C. M., Bertisch, S. M., Pelayo, R., Watson, N. F., Winkelman, J. W., Zee, P. C., et al. (2023). What should be the focus of treatment when insomnia disorder is comorbid with depression or anxiety disorder? J. Clin. Med. 12:1975. doi: 10.3390/jcm12051975
O'connor, B. P. (2000). SPSS and SAS programs for determining the number of components using parallel analysis and Velicer's MAP test. Behav. Res. Methods Instrum. Comput. 32, 396–402. doi: 10.3758/BF03200807
Oneib, B., El Filali, A., and Abda, N. (2022). The Moroccan dialect version of the Insomnia Severity Index. Middle East Curr. Psychiatr. 29, 1–6. doi: 10.1186/s43045-022-00182-4
Paulsen, J., and Brckalorenz, A. (2017). Internal Consistency Statistics. FSSE Psychometric Portfolio. Available onliine at: https://scholarworks.iu.edu/dspace/items/d0ac3a09-cbc4-427d-a49b-5c3c1dbe7e22 (Accessed December 11, 2024).
Phaswana-Mafuya, N., Peltzer, K., and Pengpid, S. (2019). Maternal Tobacco use during Pregnancy in South Africa: results from a National Population-based Survey. Internat. J. Prevent. Med. 10, 99–99. doi: 10.4103/ijpvm.IJPVM_212_18
Posch, L., Bleier, A., Lechner, C., Danner, D., Flöck, F., and Strohmaier, M. (2019). Measuring motivations of crowdworkers: The Multidimensional Crowdworker Motivation Scale. ACM Transact. Soc. Comput. 2, 1–34. doi: 10.1145/3335081
R Development Core Team. (2020). R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. Available online at: http://www.R-project.org/ [Accessed Devember 12, 2023).
Rakshasa-Loots, A. M., Hamana, T., Fanqa, B., Lindani, F., Van Wyhe, K., Kruger, S., et al. (2023). isiXhosa translation of the Patient Health Questionnaire (PHQ-9) shows satisfactory psychometric properties for the measurement of depressive symptoms [Stage 2]. Brain Neurosci. Adv. 7:23982128231194452. doi: 10.1177/23982128231194452
Reffi, A. N., Kalmbach, D. A., Cheng, P., Tappenden, P., Valentine, J., Drake, C. L., et al. (2023). Fear of sleep in first responders: associations with trauma types, psychopathology, and sleep disturbances. Sleep Adv. 4:zpad053. doi: 10.1093/sleepadvances/zpad053
Reise, S. P., Scheines, R., Widaman, K. F., and Haviland, M. G. (2013). Multidimensionality and structural coefficient bias in structural equation modeling: a bifactor perspective. Educ. Psychol. Measur. 73, 5–26. doi: 10.1177/0013164412449831
Riemann, D., Krone, L. B., Wulff, K., and Nissen, C. (2020). Sleep, insomnia, and depression. Neuropsychopharmacology, 45, 74–89. doi: 10.1038/s41386-019-0411-y
Rodriguez, A., Reise, S. P., and Haviland, M. G. (2016). Applying bifactor statistical indices in the evaluation of psychological measures. J. Personal. Assess. 98, 223–237. doi: 10.1080/00223891.2015.1089249
Rossouw, J., Suliman, S., Nothling, J., Lombard, C., Bröcker, E., Hewett, M., et al. (2024). A pilot randomised control study to investigate the effect of the South African Adolescence Group Sleep Intervention (SAASI) on adolescent sleep and PTSD. Eur. J. psychotraumatol. 15:2350217. doi: 10.1080/20008066.2024.2350217
Sadeghniiat-Haghighi, K., Montazeri, A., Khajeh-Mehrizi, A., Nedjat, S., and Aminian, O. (2014). The Insomnia Severity Index: cross-cultural adaptation and psychometric evaluation of a Persian version. Qual. Life Res. 23, 533–537. doi: 10.1007/s11136-013-0489-3
Salihu, D., Wong, E. M. L., Kwan, R. Y. C., Bello, U. M., Chutiyami, M., Leung, A. Y. M., et al. (2022). Translation, cross-cultural adaptation, and psychometric properties of the Hausa version of the Insomnia Severity Index among internally displaced persons in Africa. Sleep Med. 96, 57–63. doi: 10.1016/j.sleep.2022.03.011
Schmitt, T. A., Sass, D. A., Chappelle, W., and Thompson, W. (2018). Selecting the “best” factor structure and moving measurement validation forward: an illustration. J. Personal. Assess. 100, 345–362. doi: 10.1080/00223891.2018.1449116
Sijtsma, K., and Van Der Ark, L. A. (2017). A tutorial on how to do a Mokken scale analysis on your test and questionnaire data. Br. J.f Math. Stat. Psychol. 70, 137–158. doi: 10.1111/bmsp.12078
SmartPLS (2024). How to Interpret Excess Kurtosis And Skewness. Available online at: https://www.smartpls.com/documentation/functionalities/excess-kurtosis-and-skewness (Accessed December 12, 2024).
Spitzer, R. L., Kroenke, K., Williams, J. B., and Löwe, B. (2006). A brief measure for assessing generalized anxiety disorder: The GAD-7. Arch. Int. Med. 166, 1092–1097. doi: 10.1001/archinte.166.10.1092
Stewart, A. L., Hays, R. D., and Ware, J. E. (1988). The MOS short-form general health survey: reliability and validity in a patient population. Med. Care 26, 724–735. doi: 10.1097/00005650-198807000-00007
Tadi, N. F., Pillay, K., Ejoke, U. P., and Khumalo, I. P. (2022). Sex differences in depression and anxiety symptoms: measurement invariance, prevalence, and symptom heterogeneity among university students in South Africa. Front. Psychol. 13:873292. doi: 10.3389/fpsyg.2022.873292
Tang, N. K. Y., Saconi, B., Jansson-Frojmark, M., Ong, J. C., and Carney, C. E. (2023). Cognitive factors and processes in models of insomnia: a systematic review. J. Sleep Res. 32:13923. doi: 10.1111/jsr.13923
Taylor, D. J., Lichstein, K. L., Durrence, H. H., Reidel, B. W., and Bush, A. J. (2005). Epidemiology of insomnia, depression, and anxiety. Sleep 28:1457. doi: 10.1093/sleep/28.11.1457
Van Der Ark, L. A. (2012). New developments in Mokken scale analysis in R. J. Stat. Softw. 48, 1–27. doi: 10.18637/jss.v048.i05
Vargas, I., Nguyen, A. M., Muench, A., Bastien, C. H., Ellis, J. G., and Perlis, M. L. (2020). Acute and chronic insomnia: what has time and/or hyperarousal got to do with it? Brain Sci. 10:71. doi: 10.3390/brainsci10020071
Vethe, D., Kallestad, H., Jacobsen, H. B., Landr,ø, N. I., Borchgrevink, P. C., and Stiles, T. C. (2018). The relationship between improvement in insomnia severity and long-term outcomes in the treatment of chronic fatigue. Front. Psychol. 9:1764. doi: 10.3389/fpsyg.2018.01764
Werner, G. G., Riemann, D., and Ehring, T. (2021). Fear of sleep and trauma-induced insomnia: A review and conceptual model. Sleep Med. Rev. 55:101383. doi: 10.1016/j.smrv.2020.101383
Wind, S. A. (2017). An instructional module on mokken scale analysis. Educ. Measur. Issues Pract. 36, 50–66. doi: 10.1111/emip.12153
Keywords: insomnia severity index, classical test theory, rasch analysis, mokken analysis, dimensionality, reliability, validity
Citation: Pretorius TB and Padmanabhanunni A (2025) Evaluating the Insomnia Severity Index among South African first responders: evidence from classical test theory, Rasch, and Mokken analyses. Front. Sleep 4:1635434. doi: 10.3389/frsle.2025.1635434
Received: 26 May 2025; Accepted: 31 August 2025;
Published: 23 September 2025.
Edited by:
Stuart F. Quan, Harvard Medical School, United StatesReviewed by:
Solomon Mashegoane, University of Limpopo, South AfricaPatricia L. Haynes, University of Arizona, United States
Copyright © 2025 Pretorius and Padmanabhanunni. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Tyrone B. Pretorius, dHByZXRvcml1c0B1d2MuYWMuemE=
†These authors have contributed equally to this work and share first authorship