A Note on the Eigensystem of the Covariance Matrix of Dichotomous Guttman Items

Davis-Stober, Clintin P.; Doignon, Jean-Paul; Suck, Reinhard

doi:10.3389/fpsyg.2015.01767

ORIGINAL RESEARCH article

Front. Psychol., 01 December 2015

Sec. Quantitative Psychology and Measurement

Volume 6 - 2015 | https://doi.org/10.3389/fpsyg.2015.01767

Clintin P. Davis-Stober¹*

Jean-Paul Doignon²

Reinhard Suck³

¹Department of Psychological Sciences, University of Missouri, Columbia, MO, USA
²Department of Mathematics, Université Libre de Bruxelles, Brussels, Belgium
³Universität Osnabrück, Osnabrück, Germany

We consider the covariance matrix for dichotomous Guttman items under a set of uniformity conditions, and obtain closed-form expressions for the eigenvalues and eigenvectors of the matrix. In particular, we describe the eigenvalues and eigenvectors of the matrix in terms of trigonometric functions of the number of items. Our results parallel those of Zwick (1987) for the correlation matrix under the same uniformity conditions. We provide an explanation for certain properties of principal components under Guttman scalability that were first reported by Guttman (1950).

1. Introduction

Guttman scales form the conceptual foundation for modern Item Response Theory (IRT). For example, Guttman scales underlie the Rasch model (e.g., Andrich, 1985) as well as Mokken scales (e.g., van Schuur, 2003); see Tenenhaus and Young (1985) and Lord and Novick (1968) for classic reviews and discussions of Guttman scaling. In studying the principal component structure of unidimensional scales, Guttman (1950) derived several important properties of the correlation matrix of perfect dichotomous Guttman items. Later work by Zwick (1987) showed that, under a set of uniformity conditions, the eigenvalues of this matrix can be written as simple functions of the number of items.

In this brief note, we extend the results of Zwick (1987) by considering the covariance matrix of dichotomous Guttman items under these same uniformity conditions. We derive closed-form solutions for the eigenvalues and eigenvectors of this matrix, for any number of items. In particular, we provide expressions in terms of simple trigonometric functions of the number of items. These expressions lead to a simple explanation of the signing relationships among principal components for Guttman scales first described by Guttman (1950).

2. Main Results

The core idea of a Guttman scale is that the set of items under consideration forms a unidimensional scale, i.e., if a person obtains a correct response to an item then this person would obtain a correct response to all “easier” items. Table 1 presents a matrix of response patterns conforming to a perfect Guttman scale for five items, with Item 5 being the most “difficult” and Item 1 being the “easiest.”

Table 1. An example of a perfect Guttman scale for five items.

As in Zwick (1987), we consider the following two assumptions. First, we assume that all items are distinct, i.e., no two items produce identical responses for all possible response patterns. Second, we assume that the probability of obtaining each of the n + 1 admissible response patterns is $\frac{1}{n + 1}$, where n is the number of items, i.e., a uniform distribution over response patterns. This last assumption is rather strong, given that the latent ability underlying the responses is typically modeled as normally distributed. While we assume uniformly distributed response patterns primarily for mathematical tractability, we demonstrate later via simulations that our results approximate those obtained from a normal distribution under highly discriminating items that are equally spaced by difficulty.

Under our assumptions, the covariance between any items i and j, with i ≤ j, is equal to the following:

$$\sigma_{i,j}^{cov} = \frac{i\,(n + 1 - j)}{(n + 1)^{2}}, \quad \forall\, i, j \in \{1, 2, \dots, n\} \text{ with } i \leq j. \tag{1}$$

As one would expect, Equation (1) is closely related to the Pearson product-moment correlation, which, as described by Zwick (1987), is equal to:

$$\mathrm{Corr}(i, j) = \sqrt{\frac{i\,(n + 1 - j)}{j\,(n + 1 - i)}}, \quad \forall\, i, j \in \{1, 2, \dots, n\} \text{ with } i \leq j. \tag{2}$$
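Equations (1) and (2) can be checked numerically against the response patterns themselves. The following is a minimal sketch, assuming the (n + 1) × n pattern matrix with ones strictly below the diagonal and equal weight on every pattern; the function names `guttman_patterns`, `covariance_eq1`, and `correlation_eq2` are illustrative and not taken from the paper.

```python
import numpy as np

def guttman_patterns(n):
    """(n + 1) x n matrix of perfect Guttman patterns: ones strictly below the diagonal."""
    return np.tril(np.ones((n + 1, n)), k=-1)

def covariance_eq1(n):
    """Matrix with entries i (n + 1 - j) / (n + 1)^2 for i <= j, as in Equation (1)."""
    idx = np.arange(1, n + 1)
    lo = np.minimum.outer(idx, idx)   # smaller of the two item indices
    hi = np.maximum.outer(idx, idx)   # larger of the two item indices
    return lo * (n + 1 - hi) / (n + 1) ** 2

def correlation_eq2(n):
    """Matrix with entries sqrt(i (n + 1 - j) / (j (n + 1 - i))) for i <= j, as in Equation (2)."""
    idx = np.arange(1, n + 1)
    lo = np.minimum.outer(idx, idx)
    hi = np.maximum.outer(idx, idx)
    return np.sqrt(lo * (n + 1 - hi) / (hi * (n + 1 - lo)))

n = 5
X = guttman_patterns(n)
# bias=True divides by the number of rows, i.e., weights each pattern by 1 / (n + 1).
cov_direct = np.cov(X, rowvar=False, bias=True)
corr_direct = np.corrcoef(X, rowvar=False)
print("Equation (1) matches:", np.allclose(cov_direct, covariance_eq1(n)))
print("Equation (2) matches:", np.allclose(corr_direct, correlation_eq2(n)))
```

The index helpers `lo` and `hi` simply extend the i ≤ j formulas to the full symmetric matrix.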

Parallel to Zwick (1987) and Guttman (1950), who handled the correlation matrix, we consider the n × n covariance matrix defined by Equation (1). We first provide the n distinct eigenvalues.

Proposition 1. The covariance matrix σ^cov, with entries given by Equation (1), has its eigenvalues equal to (in decreasing order)

$$\lambda_{i}^{cov} = \frac{1}{(n + 1)\left(2 - 2\cos\left(\frac{i\pi}{n + 1}\right)\right)}, \quad i = 1, 2, \dots, n. \tag{3}$$

The proof is in the Appendix.

Note how i and n determine the argument of the cosine term in the denominator of the right-hand side of Equation (3). From the same equation, the maximal eigenvalue for any fixed number of items n is equal to $\lambda_{n}^{max} = \frac{1}{(n + 1)(2 - 2\cos(\frac{\pi}{n + 1}))}$. Note that as n → ∞, $\lambda_{n}^{max} \to \infty$: for large n, $2 - 2\cos(\frac{\pi}{n + 1})$ behaves like $\frac{\pi^{2}}{(n + 1)^{2}}$, so $\lambda_{n}^{max}$ grows like $\frac{n + 1}{\pi^{2}}$. Also, the eigenvalues of the covariance matrix are very different from the eigenvalues of the Pearson product-moment correlation matrix, which, as described by Zwick (1987), are equal to $\lambda_{i}^{corr} = \frac{n + 1}{i(i + 1)}$.
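As a quick numerical illustration, the closed-form eigenvalues in Equation (3) can be compared with eigenvalues computed by a general-purpose symmetric eigensolver, alongside the correlation eigenvalues reported by Zwick (1987). This is a sketch only; the helper names are hypothetical, and the matrices are built directly from Equations (1) and (2).

```python
import numpy as np

def index_grids(n):
    """Element-wise min and max of the item indices (i, j), for i, j = 1..n."""
    idx = np.arange(1, n + 1)
    return np.minimum.outer(idx, idx), np.maximum.outer(idx, idx)

def covariance_eq1(n):
    lo, hi = index_grids(n)
    return lo * (n + 1 - hi) / (n + 1) ** 2

def correlation_eq2(n):
    lo, hi = index_grids(n)
    return np.sqrt(lo * (n + 1 - hi) / (hi * (n + 1 - lo)))

def eigenvalues_eq3(n):
    """Closed-form covariance eigenvalues, in decreasing order."""
    i = np.arange(1, n + 1)
    return 1.0 / ((n + 1) * (2 - 2 * np.cos(i * np.pi / (n + 1))))

def eigenvalues_corr(n):
    """Correlation eigenvalues (n + 1) / (i (i + 1)), in decreasing order."""
    i = np.arange(1, n + 1)
    return (n + 1) / (i * (i + 1))

for n in (4, 8, 64):
    # eigvalsh returns ascending eigenvalues; reverse to get decreasing order.
    cov_numeric = np.linalg.eigvalsh(covariance_eq1(n))[::-1]
    corr_numeric = np.linalg.eigvalsh(correlation_eq2(n))[::-1]
    print(n,
          "max |difference| covariance:", np.max(np.abs(cov_numeric - eigenvalues_eq3(n))),
          "correlation:", np.max(np.abs(corr_numeric - eigenvalues_corr(n))))
```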

The eigenvectors of the covariance matrix also have an elegant, closed-form expression.

Proposition 2. For the covariance matrix σ^cov defined by Equation (1), an eigenvector P_i of eigenvalue $\lambda_{i}^{cov}$, with i = 1, 2, …, n (as in Proposition 1), results from setting

$$P_{i,m} = \sin\left(\frac{i m \pi}{n + 1}\right), \quad m = 1, 2, \dots, n. \tag{4}$$

The proof is in the Appendix.

Guttman (1950) derived a series of relationships on the eigenvector components of correlation matrices based on perfect, “error free” scales. Let sgn(x) be the sign function of the value x. Define a sign change of an eigenvector P_i as an index j ∈ {1, 2, …, n − 1} such that sgn(P_{i, j}) ≠ sgn(P_{i, j+1}). As described in Guttman (1950), for n items there exists exactly one eigenvector with no sign changes, one eigenvector with a single sign change, one with two sign changes, and so on, with the eigenvector corresponding to the smallest eigenvalue having exactly n − 1 sign changes. This symmetry can be seen in Table 2, which presents the eigenvectors in Equation (4) for n = 5. As made explicit by Equation (4), these sign changes result from the symmetry of the sine function as the values of i and m vary.

Table 2. This table presents the eigenvector components of the covariance matrix for n = 5.
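The components given by Equation (4) for n = 5 can be tabulated and checked with a few lines of code. The sketch below makes one assumption that the text does not spell out: components that are zero up to floating-point error (for example sin(π)) are skipped when counting sign changes. The function names are illustrative.

```python
import numpy as np

def covariance_eq1(n):
    idx = np.arange(1, n + 1)
    lo = np.minimum.outer(idx, idx)
    hi = np.maximum.outer(idx, idx)
    return lo * (n + 1 - hi) / (n + 1) ** 2

def eigenvectors_eq4(n):
    """Column i - 1 holds P_i, with entries sin(i m pi / (n + 1)) for m = 1..n."""
    k = np.arange(1, n + 1)
    return np.sin(np.outer(k, k) * np.pi / (n + 1))

def sign_changes(v, tol=1e-12):
    """Number of sign changes along v, skipping components that are numerically zero (assumption)."""
    signs = np.sign(v[np.abs(v) > tol])
    return int(np.sum(signs[1:] != signs[:-1]))

n = 5
P = eigenvectors_eq4(n)
i = np.arange(1, n + 1)
lam = 1.0 / ((n + 1) * (2 - 2 * np.cos(i * np.pi / (n + 1))))   # Equation (3)

print(np.round(P, 3))                                           # one eigenvector per column
# Each column P_i should satisfy sigma P_i = lambda_i P_i.
print("eigenpairs verified:", np.allclose(covariance_eq1(n) @ P, P * lam))
print("sign changes per eigenvector:", [sign_changes(P[:, c]) for c in range(n)])
```

The eigenvectors are left unnormalized here, exactly as Equation (4) defines them; dividing each column by its Euclidean norm would give unit-length principal component loadings.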

3. Comparison to IRT Data

In this section, we illustrate how our analytic results could be used to evaluate responses conforming to modern IRT models. We consider the well-known two parameter logistic (2PL) model, where the probability of a correct response to item i is defined as follows:

$$p_{i}(\theta) = \frac{1}{1 + \exp\left(-a_{i}(\theta - b_{i})\right)}, \tag{5}$$

where $a_{i} \in ℝ^{+}$ is the item discrimination parameter, b_i ∈ ℝ is the item difficulty parameter and θ ∈ ℝ is the person-specific ability parameter.

From the perspective of the 2PL model, Guttman items are obtained by letting the a_i (item discrimination) parameter values become arbitrarily large (e.g., van Schuur, 2003), i.e., the probability that a test taker answers an item correctly tends to 1 when their latent ability is higher than the item difficulty and to 0 when it is lower. Our results provide a new perspective on the item covariance and principal component structure of 2PL items under the idealized conditions of a Guttman scale. Indeed, one could consider the eigenvalues and eigenvectors in Equations (3) and (4) as an error-free ideal for such response data, under our assumption of a uniform distribution over response patterns.
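To illustrate the limiting behavior just described, the following sketch evaluates Equation (5) at a few ability values on either side of a single item difficulty while the discrimination grows. The particular values of θ, a, and b are arbitrary choices for illustration, and `p_2pl` is a hypothetical helper name.

```python
import numpy as np

def p_2pl(theta, a, b):
    """Probability of a correct response under the 2PL model, Equation (5)."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

theta = np.array([-0.5, -0.1, 0.1, 0.5])   # abilities below and above the difficulty
b = 0.0                                     # item difficulty (illustrative)
for a in (1.0, 3.0, 30.0):                  # increasing discrimination
    print(f"a = {a:4.1f}:", np.round(p_2pl(theta, a, b), 3))
```

As a increases, the response probabilities move toward 0 below the difficulty and toward 1 above it, which is the deterministic Guttman pattern.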

In the following two subsections, we compare our results to simulated data that relax the assumption of a uniform distribution over response patterns. In the first simulation study, we compare our results to data generated from a Rasch model (Equation 5 with a_i = 1, i = 1, 2, …, n) where the person-specific ability parameter, θ, is randomly drawn from a standard normal distribution. For the second simulation study, we consider a setup nearly identical to the first, except that we use large values of a_i for each item, i.e., high discrimination among items.

3.1. Simulation Study 1

For this study, we considered six conditions comprising 4, 6, 8, 16, 32, and 64 test items. For each condition, the difficulty of the items, b_i, was equally spaced along the interval [−1, 1]. For each condition, we randomly sampled 5000 values of θ from a standard normal distribution (e.g., Anderson et al., 2007). We obtained simulated responses to the items by applying the sampled θ values and item difficulties, b_i, to Equation (5), with a_i = 1 for all test items, i.e., a Rasch model. Thus, for each condition, we have 5000 simulated responses to the test items.

For each condition, we computed the sample covariance matrix of the items from the 5000 simulated responses. We then numerically calculated the eigenvalues of this covariance matrix. Figure 1 compares the eigenvalues obtained from the simulated data to the eigenvalues obtained from Equation (3), for each condition. It is interesting to note that the largest eigenvalue for the simulated data is always larger than the maximal eigenvalue obtained via Equation (3); this is similar to results obtained by Zwick (1987) within the context of the Guttman correlation matrix. In general, moving to a probabilistic response model (the Rasch model) and sampling the θ values from a normal distribution appears to yield covariance eigenvalues that greatly differ from those obtained in Equation (3). As we show in the next study, improving item discrimination yields a much closer correspondence.

Figure 1. Each plot compares the eigenvalues obtained from Equation (3) to those obtained from simulated Rasch data under the assumption that θ ~ N(0, 1) under n = 4, 6, 8, 16, 32, and 64 items.

3.2. Simulation Study 2

In this simulation study, we consider nearly identical conditions to the first, except that the item discrimination parameters, a_i, are large, indicating excellent item discrimination. As in the previous study, we considered six conditions comprising 4, 6, 8, 16, 32, and 64 test items. For each condition, the difficulty of the items, b_i, was equally spaced along the interval [−1, 1]. As before, for each condition, we randomly sampled 5000 values of θ from a standard normal distribution. We obtained simulated responses to the items by applying the sampled θ values and item difficulties, b_i, to Equation (5), with a_i = 3, i = 1, 2, …, n. As before, for each condition, we have 5000 simulated responses to the test items.

For each condition, we computed the covariance matrix of the 5000 simulated responses and numerically calculated its eigenvalues. Figure 2 compares the eigenvalues from these simulated data to the eigenvalues obtained via Equation (3), for each condition. There is a much closer correspondence between the two sets of eigenvalues under these conditions. Further, this correspondence becomes stronger as the number of equally spaced items increases, yielding a nearly perfect match to the maximal eigenvalue as the number of items reaches 32 and 64.
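The two simulation designs can be sketched as follows. This is not the authors' MATLAB supplement, and it is not claimed to reproduce Figures 1 and 2. It is a minimal Python illustration under stated assumptions: each response is taken to be an independent Bernoulli draw with success probability given by Equation (5), the random seed is arbitrary, and the names `simulate_eigenvalues` and `eigenvalues_eq3` are hypothetical.

```python
import numpy as np

def eigenvalues_eq3(n):
    """Closed-form covariance eigenvalues of Equation (3), in decreasing order."""
    i = np.arange(1, n + 1)
    return 1.0 / ((n + 1) * (2 - 2 * np.cos(i * np.pi / (n + 1))))

def simulate_eigenvalues(n, a, n_persons=5000, seed=0):
    """Eigenvalues (decreasing) of the sample covariance of simulated 2PL responses."""
    rng = np.random.default_rng(seed)
    theta = rng.standard_normal(n_persons)        # abilities drawn from N(0, 1)
    b = np.linspace(-1.0, 1.0, n)                 # difficulties equally spaced on [-1, 1]
    p = 1.0 / (1.0 + np.exp(-a * (theta[:, None] - b[None, :])))   # Equation (5)
    # Assumption: a response is correct when a uniform draw falls below p (Bernoulli sampling).
    responses = (rng.random(p.shape) < p).astype(float)
    cov = np.cov(responses, rowvar=False)         # sample covariance of the n items
    return np.linalg.eigvalsh(cov)[::-1]

for n in (4, 16, 64):
    closed = eigenvalues_eq3(n)
    for a in (1.0, 3.0):                          # Study 1 uses a = 1, Study 2 uses a = 3
        sim = simulate_eigenvalues(n, a)
        print(f"n = {n:2d}, a = {a}: largest simulated eigenvalue {sim[0]:.3f}, "
              f"Equation (3) gives {closed[0]:.3f}")
```

Changing `a`, the difficulty interval, or the ability distribution in `simulate_eigenvalues` gives a simple way to explore how far a configuration departs from the uniform-pattern ideal.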

Figure 2. Each plot compares the eigenvalues obtained from Equation (3) to those obtained from simulated 2PL data with high item discrimination under the assumption that θ ~ N(0, 1) under n = 4, 6, 8, 16, 32, and 64 items.

This study illustrates that our analytic results, which are derived under the strong assumption of uniformly distributed response patterns, may be useful as an approximation even when the ability parameter is normally distributed. This approximation is best when the difficulty range of the items is within a single standard deviation of the mean and the items have excellent discriminability. As the range of the item difficulty increases and/or the variance of the ability parameter distribution shrinks, the approximation becomes much poorer. Our MATLAB code for generating these graphs and exploring other configurations is available as an online supplement.

4. Conclusion

We derived closed-form solutions for the eigenvalues and eigenvectors of the covariance matrix of dichotomous Guttman items, under a uniform sampling assumption. We demonstrated that these eigenvalues and eigenvectors are simple trigonometric functions of the number of items, n. Our results parallel those of Zwick (1987), who examined the eigenvalues of the correlation matrix of dichotomous Guttman items under the same uniformity assumptions. It remains an open question whether the eigenvectors of the correlation matrix, as investigated by Zwick (1987), can also be solved for explicitly.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We would like to thank Edgar Merkle, Jay Verkuilen, and David Budescu for comments on an earlier draft. Davis-Stober was supported by a National Science Foundation grant (SES-1459866, PI: Davis-Stober).

Supplementary Material

The Supplementary Material for this article can be found online at: http://journal.frontiersin.org/article/10.3389/fpsyg.2015.01767

References

Anderson, C. J., Li, Z., and Vermunt, J. K. (2007). Estimation of models in a Rasch family for polytomous items and multiple latent variables. J. Stat. Softw. 20, 1–36. doi: 10.18637/jss.v020.i06

Andrich, D. (1985). “An elaboration of Guttman scaling with Rasch models for measurement,” in Sociological Methodology 1985. Jossey-Bass Social and Behavioral Science Series, ed N. B. Tuma (San Francisco, CA: Jossey-Bass), 33–80.

Bünger, F. (2014). Inverses, determinants, eigenvalues, and eigenvectors of real symmetric Toeplitz matrices with linearly increasing entries. Linear Algebra Appl. 459, 595–619. doi: 10.1016/j.laa.2014.07.023

Elliott, J. F. (1953). The Characteristic Roots of Certain Real Symmetric Matrices. Master's thesis, University of Tennessee.

Gregory, R. T., and Karney, D. (1969). A Collection of Matrices for Testing Computational Algorithms. New York, NY: Wiley-Interscience.

Guttman, L. (1950). “The principal components of scale analysis,” in Measurement and Prediction, eds S. A. Stouffer, L. Guttman, E. A. Suchman, P. F. Lazarsfeld, S. A. Star, and J. A. Clausen (Princeton, NJ: Princeton University Press), 312–361.

Lord, F. M., and Novick, M. (1968). Statistical Theories of Mental Test Scores. Reading, MA: Addison-Wesley.

Tenenhaus, M., and Young, F. W. (1985). An analysis and synthesis of multiple correspondence analysis, optimal scaling, dual scaling, homogeneity analysis and other methods for quantifying categorical multivariate data. Psychometrika 50, 91–119. doi: 10.1007/BF02294151

van Schuur, W. H. (2003). Mokken scale analysis: between the Guttman scale and parametric item response theory. Polit. Anal. 11, 139–163. doi: 10.1093/pan/mpg002

Yueh, W.-C. (2005). Eigenvalues of several tridiagonal matrices. Appl. Math. E-Notes 5, 66–74.

Yueh, W.-C., and Cheng, S. S. (2008). Explicit eigenvalues and inverses of tridiagonal Toeplitz matrices with four perturbed corners. ANZIAM J. 49, 361–387. doi: 10.1017/S1446181108000102

Zwick, R. (1987). Some properties of the correlation matrix of dichotomous Guttman items. Psychometrika 52, 515–520. doi: 10.1007/BF02294816

Appendix

A. Proofs of Propositions 1 and 2

To prove the main results of the paper, we first derive the general inverse of the covariance matrix of Guttman items under our uniformity assumptions. This inverse has a special tridiagonal form. From this tridiagonal form, we apply known algebraic results to obtain the required eigenvalues and eigenvectors.

Define X as the (n + 1) × n matrix of perfect Guttman scores under the specified uniformity assumptions. The rows of this matrix correspond to response patterns while the columns correspond to Guttman items (see also Table 1). This matrix has zeros on the diagonal and above, and all elements below the diagonal are ones:

$$X = \begin{pmatrix} 0 & 0 & 0 & \dots & 0 \\ 1 & 0 & 0 & \dots & 0 \\ 1 & 1 & 0 & \dots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ 1 & 1 & 1 & \dots & 1 \end{pmatrix}.$$

We denote by e⁽ⁱ⁾ the i-th vector of the canonical basis of ℝ^{n + 1}, and by v⁽ⁱ⁾ the i-th column vector of X. For later use, it is convenient to introduce also v⁽⁰⁾ = (1, 1, …, 1)′ and v^{(n + 1)} = (0, 0, …, 0)′. Thus in ℝ^{n + 1} we have for i = 1, 2, …, n + 1,

$$v^{(i)} = v^{(i - 1)} - e^{(i)},$$

and then also, for i = 1, 2, …, n,

$$\begin{aligned} v^{(i)} &= \frac{1}{2}\left(v^{(i - 1)} - e^{(i)}\right) + \frac{1}{2}\left(v^{(i + 1)} + e^{(i + 1)}\right) \\ &= \frac{1}{2}\left(v^{(i - 1)} + v^{(i + 1)}\right) + \frac{1}{2}\left(e^{(i + 1)} - e^{(i)}\right). \end{aligned} \tag{A1}$$

Obtaining the eigenvalues and eigenvectors of the covariance matrix via the columns v⁽ⁱ⁾ of X can be done in five steps:

1. centering each vector v⁽ⁱ⁾ (for i = 1, 2, …, n), that is, subtracting the mean of all components of v⁽ⁱ⁾ from each component; let us denote by ṽ⁽ⁱ⁾ the resulting vector;

2. computing the element S_ij of the matrix S as the scalar product ṽ⁽ⁱ⁾ · ṽ^(j);

3. deriving the inverse of S by taking into account a special property of the rows of S (see below);

4. inferring the eigenvalues and eigenvectors of S⁻¹ (then also of S) from the special form of S⁻¹, a tridiagonal matrix;

5. finally, observing that the covariance matrix equals $\sigma^{cov} = \frac{1}{n + 1} S$, thus obtaining the eigenvalues and eigenvectors of $\sigma^{cov}$.
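The five steps above can be traced numerically for a small case. The sketch below uses n = 5 and a general-purpose eigensolver in place of the analytic argument, so it illustrates the steps rather than proving anything; variable names such as `V`, `S_inv`, and `tridiag` are illustrative.

```python
import numpy as np

n = 5
X = np.tril(np.ones((n + 1, n)), k=-1)      # columns are the vectors v^(1), ..., v^(n)

# Step 1: center each column by subtracting the mean of its components.
V = X - X.mean(axis=0)

# Step 2: S_ij is the scalar product of centered columns i and j.
S = V.T @ V

# Step 3: invert S; the result should be tridiagonal with 2 on the diagonal and -1 beside it.
S_inv = np.linalg.inv(S)
tridiag = 2 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
print("S^{-1} is tridiagonal:", np.allclose(S_inv, tridiag))

# Step 4: eigenvalues (ascending) and eigenvectors of S^{-1}.
mu, vecs = np.linalg.eigh(S_inv)

# Step 5: the covariance matrix is S / (n + 1); its eigenvalues are 1 / ((n + 1) mu).
sigma = S / (n + 1)
lam = 1.0 / ((n + 1) * mu)                  # decreasing, because mu is increasing
print("eigenpairs of the covariance matrix verified:", np.allclose(sigma @ vecs, vecs * lam))
print(np.round(lam, 4))
```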

Let us rephrase these steps in a more geometric fashion. In Step 1, ṽ⁽ⁱ⁾ is the image of v⁽ⁱ⁾ by the orthogonal projection from ℝ^{n + 1} to the hyperplane H with equation $\sum_{i = 1}^{n + 1} x_{i} = 0$ (indeed, ṽ⁽ⁱ⁾ ∈ H and furthermore ṽ⁽ⁱ⁾ − v⁽ⁱ⁾, a constant vector, is orthogonal to H). Moreover, notice that e⁽ⁱ⁾ − e^(j) belongs to H and so projects onto itself. Consequently, for i = 1, 2, …, n, we derive from Equation (A1)

$$\tilde{v}^{(i)} = \frac{1}{2}\left(\tilde{v}^{(i - 1)} + \tilde{v}^{(i + 1)}\right) + \frac{1}{2}\left(e^{(i + 1)} - e^{(i)}\right). \tag{A2}$$

In Step 2, we compute S_{i, j} as the scalar product of ṽ⁽ⁱ⁾ with ṽ^(j). Taking the scalar product of both sides of the previous equation with ṽ^(j), we get for i = 2, 3, …, n − 1 and j = 1, 2, …, n

$$S_{i,j} = \frac{1}{2}\left(S_{i - 1,j} + S_{i + 1,j}\right) + \frac{1}{2}\left(\tilde{v}_{i + 1}^{(j)} - \tilde{v}_{i}^{(j)}\right).$$

Now because

$$\tilde{v}_{i + 1}^{(j)} - \tilde{v}_{i}^{(j)} = v_{i + 1}^{(j)} - v_{i}^{(j)} = \begin{cases} 1 & \text{if } i = j, \\ 0 & \text{otherwise,} \end{cases}$$

we see that row S_{i, •} is the mean of rows S_{i − 1, •} and S_{i + 1, •} except for its diagonal element which is $\frac{1}{2}$ more than the mean. This holds for i = 2, 3, …, n − 1. By considering extraneous rows S_{0, •} = (0, 0, …, 0) and S_{n + 1, •} = (0, 0, …, 0), we can also allow i = 1 and i = n [this follows again from (A2), considered now for i = 1 and i = n, together with ṽ⁽⁰⁾ = ṽ^{(n + 1)} = (0, 0, …, 0)′]. This special property of S immediately translates into the following expression for the inverse matrix of S:

$$S^{-1} = \begin{pmatrix} 2 & -1 & 0 & 0 & 0 & \dots & 0 \\ -1 & 2 & -1 & 0 & 0 & \dots & 0 \\ 0 & -1 & 2 & -1 & 0 & \dots & 0 \\ 0 & 0 & -1 & 2 & -1 & \dots & 0 \\ \vdots & \vdots & \vdots & \ddots & \ddots & \ddots & \vdots \\ 0 & 0 & \dots & 0 & -1 & 2 & -1 \\ 0 & 0 & \dots & 0 & 0 & -1 & 2 \end{pmatrix}.$$

(indeed, the product of the above matrix with S equals the identity matrix).
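As a small worked check of this product, take n = 3. Since S equals n + 1 times the covariance matrix of Equation (1), its entries are S_{i, j} = i(n + 1 − j)/(n + 1) for i ≤ j. The display below is an illustration for this single case, not part of the original proof.

```latex
S = \frac{1}{4}
\begin{pmatrix} 3 & 2 & 1 \\ 2 & 4 & 2 \\ 1 & 2 & 3 \end{pmatrix},
\qquad
\begin{pmatrix} 2 & -1 & 0 \\ -1 & 2 & -1 \\ 0 & -1 & 2 \end{pmatrix}
\cdot \frac{1}{4}
\begin{pmatrix} 3 & 2 & 1 \\ 2 & 4 & 2 \\ 1 & 2 & 3 \end{pmatrix}
= \frac{1}{4}
\begin{pmatrix} 4 & 0 & 0 \\ 0 & 4 & 0 \\ 0 & 0 & 4 \end{pmatrix}
= I .
```

The row property is also visible here: the middle row of S, (2, 4, 2)/4, equals the mean of its neighbors, (2, 2, 2)/4, except on the diagonal, where it exceeds the mean by 1/2.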

The matrix S⁻¹ has a particular tridiagonal form that has been extensively studied in the mathematics literature. Elliott (1953) and Gregory and Karney (1969) showed that the eigenvalues λ_i of S⁻¹, and (selected) corresponding eigenvectors P_i, for i = 1, 2, …, n, are given by $\lambda_{i} = 2 - 2\cos(\frac{i\pi}{n + 1})$ (in increasing order) and $P_{i,m} = \sin(\frac{i m \pi}{n + 1})$, respectively (where m = 1, 2, …, n). These results were later extended to more general tridiagonal matrices by Yueh (2005); see also Yueh and Cheng (2008) and Bünger (2014).
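For readers who want to see why the sine vectors work, the eigenvalue equation for the tridiagonal matrix can be verified directly with a standard trigonometric identity. The short derivation below is a sketch added for illustration; the shorthand x = iπ/(n + 1) and the boundary conventions P_{i, 0} and P_{i, n + 1} are introduced here and do not appear in the cited sources.

```latex
\begin{aligned}
&\text{Let } x = \frac{i\pi}{n + 1}, \text{ so that } P_{i,m} = \sin(m x), \quad
  P_{i,0} = \sin(0) = 0, \quad P_{i,n+1} = \sin(i\pi) = 0. \\
&\text{For } m = 1, 2, \dots, n: \\
&\left(S^{-1} P_i\right)_m = -P_{i,m-1} + 2 P_{i,m} - P_{i,m+1}
  = 2\sin(m x) - \bigl[\sin((m - 1)x) + \sin((m + 1)x)\bigr] \\
&\phantom{\left(S^{-1} P_i\right)_m} = 2\sin(m x) - 2\sin(m x)\cos(x)
  = \bigl(2 - 2\cos(x)\bigr)\, P_{i,m}.
\end{aligned}
```

The two boundary conventions are what allow the first and last rows of S⁻¹, which have only one off-diagonal entry, to be handled by the same formula as the interior rows.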

Because the covariance matrix σ^cov equals $\frac{1}{n + 1} S$, the eigenvalues of σ^cov are equal to $\frac{1}{n + 1}$ times those of S, that is, $\frac{1}{n + 1}$ times the reciprocals of the eigenvalues of S⁻¹. Taking reciprocals reverses the order, so the increasing eigenvalues of S⁻¹ yield the decreasing eigenvalues in Equation (3), which proves Proposition 1.

Proposition 2 follows from the fact that the matrices σ^cov, S, and S⁻¹ have the same eigenvectors. This completes the proof. □

Keywords: Guttman scale, dichotomous items, Rasch model, principal component analysis, eigenvalues, eigenvectors

Citation: Davis-Stober CP, Doignon J-P and Suck R (2015) A Note on the Eigensystem of the Covariance Matrix of Dichotomous Guttman Items. Front. Psychol. 6:1767. doi: 10.3389/fpsyg.2015.01767

Received: 19 May 2015; Accepted: 04 November 2015;
Published: 01 December 2015.

Edited by:

Pietro Cipresso, IRCCS Istituto Auxologico Italiano, Italy

Reviewed by:

Juergen Heller, Universität Tübingen, Germany
Tomer Fekete, KU Leuven, Belgium

Copyright © 2015 Davis-Stober, Doignon and Suck. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Clintin P. Davis-Stober, stoberc@missouri.edu
