<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychol.</journal-id>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychol.</abbrev-journal-title>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2022.982137</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Overfactoring in rating scale data: A comparison between factor analysis and item response theory</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Revuelta</surname> <given-names>Javier</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x2020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/393790/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Xim&#x00E9;nez</surname> <given-names>Carmen</given-names></name>
<xref ref-type="author-notes" rid="fn002"><sup>&#x2020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/264161/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Minaya</surname> <given-names>Noelia</given-names></name>
<xref ref-type="author-notes" rid="fn002"><sup>&#x2020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1909027/overview"/>
</contrib>
</contrib-group>
<aff><institution>Department of Psychology, Autonomous University of Madrid</institution>, <addr-line>Madrid</addr-line>, <country>Spain</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Wenchao Ma, University of Alabama, United States</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Alexander Robitzsch, IPN - Leibniz Institute for Science and Mathematics Education, Germany; Steffen Gr&#x00F8;nneberg, BI Norwegian Business School, Norway</p></fn>
<corresp id="c001">&#x002A;Correspondence: Javier Revuelta, <email>javier.revuelta@uam.es</email></corresp>
<fn fn-type="other" id="fn002"><p><sup>&#x2020;</sup>ORCID: Javier Revuelta, <ext-link ext-link-type="uri" xlink:href="https://orcid.org/0000-0003-4705-6282">orcid.org/0000-0003-4705-6282</ext-link>; Carmen Xim&#x00E9;nez, <ext-link ext-link-type="uri" xlink:href="https://orcid.org/0000-0003-1337-6309">orcid.org/0000-0003-1337-6309</ext-link>; Noelia Minaya, <ext-link ext-link-type="uri" xlink:href="https://orcid.org/0000-0002-9736-899X">orcid.org/0000-0002-9736-899X</ext-link></p></fn>
<fn fn-type="other" id="fn004"><p>This article was submitted to Quantitative Psychology and Measurement, a section of the journal Frontiers in Psychology</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>30</day>
<month>11</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>13</volume>
<elocation-id>982137</elocation-id>
<history>
<date date-type="received">
<day>30</day>
<month>06</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>01</day>
<month>11</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2022 Revuelta, Xim&#x00E9;nez and Minaya.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Revuelta, Xim&#x00E9;nez and Minaya</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Educational and psychological measurement is typically based on dichotomous variables or rating scales comprising a few ordered categories. When the mean of the observed responses approaches the upper or the lower bound of the scale, the distribution of the data becomes skewed and, if a categorical factor model holds in the population, the Pearson correlation between variables is attenuated. The consequence of this correlation attenuation is that the traditional linear factor model renders an excessive number of factors. This article presents the results of a simulation study investigating the problem of overfactoring and some solutions. We compare five widely known approaches: (1) The maximum-likelihood factor analysis (FA) model for normal data, (2) the categorical factor analysis (FAC) model based on polychoric correlations and maximum likelihood (ML) estimation, (3) the FAC model estimated using a weighted least squares algorithm, (4) the mean corrected chi-square statistic by Satorra&#x2013;Bentler to handle the lack of normality, and (5) the Samejima&#x2019;s graded response model (GRM) from item response theory (IRT). Likelihood-ratio chi-square, parallel analysis (PA), and categorical parallel analysis (CPA) are used as goodness-of-fit criteria to estimate the number of factors in the simulation study. Our results indicate that the maximum-likelihood estimation led to overfactoring in the presence of skewed variables both for the linear and categorical factor model. The Satorra&#x2013;Bentler and GRM constitute the most reliable alternatives to estimate the number of factors.</p>
</abstract>
<kwd-group>
<kwd>factor analysis</kwd>
<kwd>categorical factor analysis</kwd>
<kwd>polychoric correlations</kwd>
<kwd>skewness</kwd>
<kwd>rating scales</kwd>
<kwd>Monte Carlo simulation</kwd>
<kwd>item response theory</kwd>
<kwd>graded response model</kwd>
</kwd-group>
<counts>
<fig-count count="1"/>
<table-count count="13"/>
<equation-count count="5"/>
<ref-count count="71"/>
<page-count count="17"/>
<word-count count="13106"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="intro">
<title>Introduction</title>
<p>Ordinal data have an overwhelming presence in educational and psychological measurement (<xref ref-type="bibr" rid="B27">J&#x00F6;reskog and Moustaki, 2001</xref>; <xref ref-type="bibr" rid="B15">Flora and Curran, 2004</xref>; <xref ref-type="bibr" rid="B31">Lee et al., 2012</xref>). Rating scales, Likert-type items, graded responses, and dichotomous data are the basis for the measurement of attitudes, personality traits, and abilities. By definition, this type of data is bounded between a lower and an upper limit. When the responses concentrate close to one of the boundaries, the distribution of the data presents an extreme mean and either positive or negative skewness. This may happen, for example, when the individuals show a tendency to agree to an opinion scale or when they easily pass the items of an ability test. In some circumstances, the responses are skewed in opposite directions. This pattern of mixed skewness may appear for instance in balanced Likert-type scales containing an equal number of items worded in opposite directions (positive or negative) to control for acquiescence and other response biases (<xref ref-type="bibr" rid="B13">Ferrando and Lorenzo-Seva, 2010</xref>; <xref ref-type="bibr" rid="B60">Savalei and Falk, 2014</xref>). Usually, a high score in positively worded items is associated to a high factor score whereas the reverse occurs for negatively worded items (<xref ref-type="bibr" rid="B34">Lindwall et al., 2012</xref>). Positively and negatively worded items typically have skewness with reversed signs. Another example of mixed skewness occurs when an ability test contains items of varying difficulty to evaluate individuals at different ability levels. Easy items generally have a high mean and negative skewness, and the contrary occurs for difficult items (<xref ref-type="bibr" rid="B23">Ho and Yu, 2015</xref>). The same phenomenon may occur with binary data, which is the particular case of rating scales in which responses are scored in two categories.</p>
<p>From a theoretical standpoint, ordinal variables cannot be normally distributed. This is at odds with the assumption of multivariate normality implicit in the most popular methods of statistical inference in the context of factor analysis (FA). However, the assumption of normality is a convenient one from a computational perspective and normal-based methods are still in widespread use under the presupposition that they are reasonable accounts of the data. Alternative methods have been proposed in the psychometric literature to handle non-normal variables (<xref ref-type="bibr" rid="B15">Flora and Curran, 2004</xref>). Robust methods are still based on normality and introduce corrections to deal with excess skewness and kurtosis. Models for ordinal data get rid of the normality assumption of manifest variables and explicitly assume a multinomial distribution of responses. There are still no clear answers as to what point one must switch from a normal model to a robust-normal model and to a model for multinomial data.</p>
<p>This article aims to shed some light on this problem by comparing the different inferential methods in a simulation study under conditions of varied skewness. The purpose of the article is to present recommendations to applied practitioners who have rating data and must decide which method to apply from the wide range of ready-to-use options that are available in popular computer programs. The comparison is largely based on <xref ref-type="bibr" rid="B43">Olsson (1979)</xref> study, which addressed these effects in relation to the likelihood-ratio chi-square statistic for normally distributed variables. <xref ref-type="bibr" rid="B43">Olsson (1979)</xref> investigated several conditions, including sample size, the magnitude of the factor loadings, the number of item response categories, the number of items, and the pattern of skewness. He found that mixed skewness is an important driver for the problem of overfactoring in the context of exploratory FA. Other relevant studies are those by <xref ref-type="bibr" rid="B17">Foldnes and Gr&#x00F8;nneberg (2022)</xref> and <xref ref-type="bibr" rid="B20">Gr&#x00F8;nneberg and Foldnes (2022)</xref>, who found that the dissimilarity between the distribution of the latent responses and the assumed normal distribution leads to a distortion in the polychoric correlations and, ultimately, in the results of the categorical factor model.</p>
<p><xref ref-type="bibr" rid="B43">Olsson (1979)</xref> also found that overfactoring is associated with the magnitude of the factor loadings, occurring that higher factor loadings are associated with an increased chi-square value and, consequentially, lead to a worse fit. This phenomenon has been termed the <italic>reliability paradox</italic> (<xref ref-type="bibr" rid="B22">Hancock and Mueller, 2011</xref>; <xref ref-type="bibr" rid="B38">McNeish et al., 2018</xref>), meaning that high loadings imply that the measurement is more reliable but are associated with an inflated chi-square value and overfactoring. One tentative solution is to switch from FA for normal variables to categorical factor analysis (FAC). Previous research has investigated categorical factor models under different study conditions and estimation methods. For example, <xref ref-type="bibr" rid="B11">Curran et al. (1996)</xref> investigated alternative estimation methods and recommended the use of ADF estimation (<xref ref-type="bibr" rid="B4">Browne, 1984</xref>) for models including skewed variables. However, ADF is problematic when the sample size includes only a few hundred observations (<xref ref-type="bibr" rid="B24">Hu et al., 1992</xref>), which poses a problem for applied investigators in the field of behavioral sciences who typically cannot afford samples of thousands of individuals in their studies.</p>
<p>This article compares five approaches for the FA of ordinal data. First, FA for normal data is considered because it is the first method that was developed (<xref ref-type="bibr" rid="B30">Lawley and Maxwell, 1971</xref>) and it is still in widespread use. The second approach is FAC estimated from the polychoric correlation matrix and a maximum-likelihood algorithm (<xref ref-type="bibr" rid="B9">Christoffersson, 1975</xref>; <xref ref-type="bibr" rid="B40">Muth&#x00E9;n, 1978</xref>). The third approach is weighted least squares estimation with mean and variance correction (WLSMV) of the FA model, which has proved to be appropriate for the analysis of ordinal data (<xref ref-type="bibr" rid="B15">Flora and Curran, 2004</xref>; <xref ref-type="bibr" rid="B32">Li, 2016</xref>). The fourth approach is the mean corrected chi-square by Satorra&#x2013;Bentler (<italic>T</italic><sub><italic>M</italic></sub>; <xref ref-type="bibr" rid="B57">Satorra and Bentler, 2001</xref>, <xref ref-type="bibr" rid="B58">2010</xref>; see also <xref ref-type="bibr" rid="B59">Savalei, 2018</xref>), which is a modification of the chi-square statistic of FA to correct for the lack of normality in the data. The <italic>T</italic><sub><italic>M</italic></sub> rescales the chi-square statistic by an amount that reflects the degree of kurtosis, and it is customarily used with ordinal data. Finally, the fifth approach is the Samejima graded response model (GRM; <xref ref-type="bibr" rid="B55">Samejima, 1969</xref>, <xref ref-type="bibr" rid="B56">2016</xref>) from the item response theory (IRT). The FAC and GRM share the basic form of the model (the algebraic relation between latent factors and manifest variables; <xref ref-type="bibr" rid="B64">Takane and de Leeuw, 1987</xref>) but differ in the estimation method. The GRM is not estimated from the polychoric correlations but from the individual response patterns using a marginal-likelihood/EM algorithm.</p>
<p>We have assumed that the population follows a FAC model because this approach is possibly the most common in the analysis of ordinal data. However, there is also an active line of research about populations that do not conform to FAC but to a Pearson correlation model (<xref ref-type="bibr" rid="B51">Robitzsch, 2020</xref>, <xref ref-type="bibr" rid="B52">2022</xref>). Our simulations cannot help to decide what the population model is. We investigate the performance of the different analytic approaches under the assumption of population FAC.</p>
<p>The article is organized into four sections. Section &#x201C;Factor analysis models and the effect of categorization&#x201D; presents the theoretical description of the FA models and the effect of skewness on manifest correlations and overfactoring. The section &#x201C;Simulation study&#x201D; describes a simulation study to compare the FA and IRT models under several conditions. A real data example is presented in the section &#x201C;Real data example&#x201D; to illustrate these problems in an applied setting. Finally, the article concludes with a discussion and recommendations for applied researchers.</p>
</sec>
<sec id="S2">
<title>Factor analysis models and the effect of categorization</title>
<sec id="S2.SS1">
<title>Factor analysis models for continuous and categorical data</title>
<p>The linear factor model (FA) for a vector of variables <italic><bold>y&#x002A;</bold></italic> of <italic>J</italic> elements is</p>
<disp-formula id="S2.E1">
<label>(1)</label>
<mml:math id="M1">
<mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mi>y</mml:mi>
<mml:mo>&#x002A;</mml:mo>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mi mathvariant="bold">&#x039B;</mml:mi>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>&#x03BE;</mml:mi>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:mtext mathvariant="bold">e</mml:mtext>
</mml:mrow>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where &#x03BE; is a vector of latent factor, <bold><italic>e</italic></bold> is a vector of random measurement errors<sup><xref ref-type="fn" rid="footnote1">1</xref></sup>, and <bold>&#x039B;</bold> is the matrix of factor loadings. Under the additional assumption of normality, the factors, &#x03BE;, follow a standard normal distribution, and the errors, <bold><italic>e</italic></bold>, are distributed as normal (0, &#x03C6;), where &#x03C6; is the standard deviation. In consequence, the distribution of <italic><bold>y&#x002A;</bold></italic> is multivariate normal with variance-covariance matrix:</p>
<disp-formula id="S2.E2">
<label>(2)</label>
<mml:math id="M2">
<mml:mrow>
<mml:mi mathvariant="bold">&#x03A3;</mml:mi>
<mml:mmultiscripts>
<mml:mo>=</mml:mo>
<mml:mprescripts/>
<mml:none/>
<mml:mo>&#x002A;</mml:mo>
</mml:mmultiscripts>
<mml:mi>&#x039B;</mml:mi>
<mml:mmultiscripts>
<mml:mi>&#x039B;</mml:mi>
<mml:mprescripts/>
<mml:none/>
<mml:mo>&#x2032;</mml:mo>
</mml:mmultiscripts>
<mml:mo>+</mml:mo>
<mml:mi>&#x03A8;</mml:mi>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <bold>&#x03A8;</bold> is the diagonal variance-covariance matrix of<bold> <italic>e</italic></bold>.</p>
<p>The categorical factor analysis model (FAC) assumes that <italic><bold>y&#x002A;</bold></italic> is an unobservable variable and that the manifest responses are obtained from <italic><bold>y&#x002A;</bold></italic>through a discretization process (<xref ref-type="bibr" rid="B41">Muth&#x00E9;n, 1984</xref>; <xref ref-type="bibr" rid="B14">Ferrando and Lorenzo-Seva, 2013</xref>). For example, the binary factor model assumes that the response to item <italic>j</italic> is 1 if <inline-formula><mml:math id="INEQ3"><mml:msubsup><mml:mi>y</mml:mi><mml:mi>j</mml:mi><mml:mo>&#x002A;</mml:mo></mml:msubsup></mml:math></inline-formula> is sufficiently high and 0 otherwise. More specifically:</p>
<disp-formula id="S2.E3">
<label>(3)</label>
<mml:math id="M3">
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mtable displaystyle="true" rowspacing="0pt">
<mml:mtr>
<mml:mtd columnalign="center">
<mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mo rspace="8.1pt">,</mml:mo>
<mml:mrow>
<mml:mpadded width="+2.8pt">
<mml:mi>if</mml:mi>
</mml:mpadded>
<mml:mo>&#x2062;</mml:mo>
<mml:msubsup>
<mml:mi>y</mml:mi>
<mml:mi>j</mml:mi>
<mml:mo>&#x002A;</mml:mo>
</mml:msubsup>
</mml:mrow>
</mml:mrow>
<mml:mo>&#x2264;</mml:mo>
<mml:msub>
<mml:mi>&#x03C4;</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="center">
<mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo rspace="8.1pt">,</mml:mo>
<mml:mrow>
<mml:mpadded width="+2.8pt">
<mml:mi>if</mml:mi>
</mml:mpadded>
<mml:mo>&#x2062;</mml:mo>
<mml:msubsup>
<mml:mi>y</mml:mi>
<mml:mi>j</mml:mi>
<mml:mo>&#x002A;</mml:mo>
</mml:msubsup>
</mml:mrow>
</mml:mrow>
<mml:mo>&gt;</mml:mo>
<mml:msub>
<mml:mi>&#x03C4;</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mi/>
</mml:mrow>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where <italic>&#x03C4;</italic><sub><italic>j</italic></sub> is a threshold parameter. In general, a model for <italic>K</italic> ordered response categories is based on a discretization process for <italic><bold>y&#x002A;</bold></italic> based on <italic>K</italic>-1 threshold parameters.</p>
<p>Mathematically, the skewness of the distribution of the manifest variable, <italic>y</italic><sub><italic>j</italic></sub>, depends on the placement of the thresholds. In the case of dichotomous data, the distribution will be skewed when the threshold parameters have an extreme value. On the contrary, the distribution of responses will be roughly symmetric when the threshold (the population mean of &#x03BE;) is close to zero. If the threshold is far away from zero, one of the categories will be more probable than the other. In the general case of an item with <italic>K</italic> response categories, skewness is associated to an uneven location of thresholds around the factor mean.</p>
</sec>
<sec id="S2.SS2">
<title>The attenuation of correlation</title>
<p>In general, the variance-covariance matrix of <bold><italic>y</italic></bold>, say <bold>&#x03A3;</bold>, will not conform to the structure implied by the linear model of Equation (2). However, <xref ref-type="bibr" rid="B42">Muth&#x00E9;n and Kaplan (1985)</xref> identified some circumstances where the covariance structure of (2) holds for both <italic><bold>y&#x002A;</bold></italic> and <italic>y</italic>. In particular, when a single factor model holds for <italic><bold>y&#x002A;</bold></italic> and all the items have the same slope, error variance and thresholds, all correlations between <italic><bold>y&#x002A;</bold></italic> are equal. In this case, all the pairs of <italic>y</italic><sub><italic>j</italic></sub> have the same correlation, <inline-formula><mml:math id="INEQ7"><mml:mrow><mml:msub><mml:mi>&#x03C1;</mml:mi><mml:mi>y</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mi>a</mml:mi><mml:mo>&#x2062;</mml:mo><mml:msub><mml:mi>&#x03C1;</mml:mi><mml:msup><mml:mi>y</mml:mi><mml:mo>&#x002A;</mml:mo></mml:msup></mml:msub></mml:mrow></mml:mrow></mml:math></inline-formula>, where <italic>a</italic> is the attenuation factor.</p>
<p><xref ref-type="fig" rid="F1">Figure 1</xref> provides details about the effect of the skewness on the Pearson correlation (<xref ref-type="bibr" rid="B28">Jorgensen and Johnson, 2022</xref>), showing the correlation for two variables that measure a common factor. Factor loadings are set to 0.71 and the correlation between <italic>y</italic><sub><italic>1</italic></sub> and <italic>y</italic><sub><italic>2</italic></sub> is computed as a function of the threshold.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Marginal correlation between the two dichotomous factor indicators as a function of the thresholds.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpsyg-13-982137-g001.tif"/>
</fig>
<p>The most evident result in <xref ref-type="fig" rid="F1">Figure 1</xref> is that the correlation depends on the threshold and is higher for threshold values close to zero. This is a noteworthy difference between FAC and FA, because in the later model the correlation is mathematically independent of the mean of the variables. A more subtle result shown by <xref ref-type="fig" rid="F1">Figure 1</xref> is that when the variables are skewed in opposite directions, the attenuation of the correlation is more pronounced and the overfactoring will be potentially more severe. This can be appreciated by comparing the correlations in the corners of <xref ref-type="fig" rid="F1">Figure 1</xref>. For example, the correlations in the corner (&#x2212;2, &#x2212;2) are about 0.20 whereas, in the corner (&#x2212;2, 2), they are about 0.11.</p>
<p>The maximum-likelihood estimation of the FA model consists of finding the values of the parameters that minimize the discrepancy function (<xref ref-type="bibr" rid="B5">Browne and Cudeck, 1992</xref>).</p>
<disp-formula id="S2.E4">
<label>(4)</label>
<mml:math id="M4">
<mml:mrow>
<mml:mi>F</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi mathvariant="bold">&#x03A3;</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi mathvariant="bold">&#x03A3;</mml:mi>
<mml:mmultiscripts>
<mml:mo stretchy="false">)</mml:mo>
<mml:mprescripts/>
<mml:none/>
<mml:mo>&#x002A;</mml:mo>
</mml:mmultiscripts>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mi>log</mml:mi>
<mml:mo stretchy="false">|</mml:mo>
<mml:msup>
<mml:mi>&#x03A3;</mml:mi>
<mml:mo>&#x002A;</mml:mo>
</mml:msup>
<mml:mo stretchy="false">|</mml:mo>
<mml:mo>-</mml:mo>
<mml:mi>log</mml:mi>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi>&#x03A3;</mml:mi>
<mml:mo stretchy="false">|</mml:mo>
<mml:mo>+</mml:mo>
<mml:mtext>trace</mml:mtext>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi mathvariant="normal">&#x03A3;</mml:mi>
<mml:msup>
<mml:mi mathvariant="normal">&#x03A3;</mml:mi>
<mml:mo>&#x002A;</mml:mo>
</mml:msup>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>+</mml:mo>
<mml:mi>p</mml:mi>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
</disp-formula>
<p>where &#x03A3; is a variance-covariance matrix and <italic>p</italic> is the number of manifest variables. The population value of the discrepancy is computed by setting <bold>&#x03A3;</bold> to the true variance-covariance matrix (for example, those reproduced by a FAC model) and minimizing <italic>F</italic> with respect to <bold>&#x03A3;</bold>&#x002A;. In estimation, &#x03A3; is set to the sample variance-covariance matrix (<bold>&#x03A3;</bold> = <bold>S</bold>). The likelihood-ratio chi-square goodness of fit statistic is <inline-formula><mml:math id="INEQ11"><mml:mrow><mml:msup><mml:mi>X</mml:mi><mml:mn>2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mi>N</mml:mi><mml:mi>F</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mtext mathvariant="bold">S</mml:mtext><mml:mo>,</mml:mo><mml:mi mathvariant="bold">&#x03A3;</mml:mi><mml:mmultiscripts><mml:mo stretchy="false">)</mml:mo><mml:mprescripts/><mml:none/><mml:mo>&#x002A;</mml:mo></mml:mmultiscripts></mml:mrow></mml:mrow></mml:math></inline-formula>, and E(<italic>X</italic><sup>2</sup>) = <italic>df</italic> (<xref ref-type="bibr" rid="B10">Cudeck and Browne, 1985</xref>).</p>
<p>Fitting a FA model to a population that conforms to a FAC model has two unwilling consequences. First, in the special circumstances where the FA fits the true variance-covariance matrix [that is, <inline-formula><mml:math id="INEQ13"><mml:mrow><mml:mi>F</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="bold">&#x03A3;</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold">&#x03A3;</mml:mi><mml:mmultiscripts><mml:mo stretchy="false">)</mml:mo><mml:mprescripts/><mml:none/><mml:mo>&#x002A;</mml:mo></mml:mmultiscripts></mml:mrow><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mrow></mml:math></inline-formula> in the population], the FA contains the correct number of latent factors but the slopes will be attenuated. Second, in most cases the FA do not fit the population model [the population value is <inline-formula><mml:math id="INEQ14"><mml:mrow><mml:mi>F</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mi mathvariant="bold">&#x03A3;</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold">&#x03A3;</mml:mi><mml:mmultiscripts><mml:mo stretchy="false">)</mml:mo><mml:mprescripts/><mml:none/><mml:mo>&#x002A;</mml:mo></mml:mmultiscripts></mml:mrow><mml:mo>&gt;</mml:mo><mml:mn>0</mml:mn></mml:mrow></mml:math></inline-formula>]; in that circumstances, the FA cannot reproduce the true variance-covariance matrix, the chi-square has an expectation of E(<italic>X</italic><sup>2</sup>) = &#x03B4; + <italic>df</italic> (where &#x03B4; is a positive non-centrality parameter), and the probability of rejecting the FA model with the correct number of factors will be inflated. The psychometric literature has demonstrated that this bias may lead to distortions such as overfactoring and the emergence of difficulty factors (<xref ref-type="bibr" rid="B37">McDonald and Ahlawat, 1974</xref>).</p>
<p>This article compares several widely used approaches to estimate the number of latent factors from ordinal data. These approaches consist of comparing the chi-square associated with a particular model (FA or FAC) and selecting the most parsimonious model that obtains a non-significant statistic. The alternatives are the likelihood-ratio chi-square for the FA model, the modified chi-square statistics that compensate for the lack of normality (<xref ref-type="bibr" rid="B58">Satorra and Bentler, 2010</xref>; <xref ref-type="bibr" rid="B6">Bryant and Satorra, 2012</xref>), the chi-square associated with FAC under two estimation methods [maximum likelihood (<bold>ML</bold>) and WLSMV], and the (GRM; <xref ref-type="bibr" rid="B55">Samejima, 1969</xref>) from IRT. The purpose is to evaluate the precision of the different methods and the potential bias associated with fitting a theoretically incorrect model (the FA). The comparison is conducted <italic>via</italic> simulation studies.</p>
</sec>
</sec>
<sec id="S3">
<title>Simulation study</title>
<p>The present simulation study compares the different approaches for estimating the number of latent factors in exploratory FA when an ordinal factor model with underlying normality holds in the population. Parallel analysis (PA) and categorical parallel analysis (CPA, which is PA applied to polychoric correlations) are also included because of their prominent current use as a method for estimating dimensionality (<xref ref-type="bibr" rid="B33">Lim and Jahng, 2019</xref>). Special attention is paid to the differential effect of skewness and magnitude of factor loadings because previous investigations have suggested that these are key conditions in the assessment of model fit (<xref ref-type="bibr" rid="B11">Curran et al., 1996</xref>; <xref ref-type="bibr" rid="B69">Xim&#x00E9;nez, 2006</xref>; <xref ref-type="bibr" rid="B18">Forero and Maydeu-Olivares, 2009</xref>).</p>
<sec id="S3.SS1">
<title>Simulated conditions and procedure</title>
<p>The simulated conditions include the sample size (<italic>N</italic>), the number of observed variables (<italic>p</italic>), the number of thresholds (&#x03C4;), the magnitude of the factor loadings (&#x03BB;), and the skewness as measured by the &#x03B3; coefficient (<xref ref-type="bibr" rid="B25">Joanes and Gill, 1998</xref>). The levels of these independent variables are summarized in <xref ref-type="table" rid="T1">Table 1</xref>. In summary, the number of conditions examined was 120 = 2 (sample size levels) &#x00D7; 2 (number of observed variables levels) &#x00D7; 3 (number of thresholds levels) &#x00D7; 2 (loading levels in the factors) &#x00D7; 5 (skewness levels). The number of simulated samples for each condition is 1000.</p>
<table-wrap position="float" id="T1">
<label>TABLE 1</label>
<caption><p>Conditions of the simulation study.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Code</td>
<td valign="top" align="left">Variable</td>
<td valign="top" align="left">Levels</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><italic>N</italic></td>
<td valign="top" align="left">Sample size</td>
<td valign="top" align="left">100</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">500</td>
</tr>
<tr>
<td valign="top" align="left"><italic>p</italic></td>
<td valign="top" align="left">Number of variables</td>
<td valign="top" align="left">6</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">12</td>
</tr>
<tr>
<td valign="top" align="left">&#x03C4;</td>
<td valign="top" align="left">Number of thresholds</td>
<td valign="top" align="left">1 (2 response categories)</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">2 (3 response categories)</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">4 (5 response categories)</td>
</tr>
<tr>
<td valign="top" align="left">&#x03BB;</td>
<td valign="top" align="left">Magnitude of</td>
<td valign="top" align="left">0.6 (Medium)</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">factor loadings</td>
<td valign="top" align="left">0.9 (High)</td>
</tr>
<tr>
<td valign="top" align="left">&#x03B3;</td>
<td valign="top" align="left">skewness</td>
<td valign="top" align="left">0 (None)</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">2 (strong and positive: SP)</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">&#x2212;1 and 1 (Mild and mixed: MM)</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">&#x2212;2 and 2 (Strong and mixed: SM)</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">&#x2212;2, 0 and 2 (None + Strong and<break/> mixed: N + SM)</td>
</tr>
</tbody>
</table></table-wrap>
<p>We have manipulated <italic>N</italic> because of its direct relation to chi-square and the potential effect on the results. The number of factors may have an impact on the results of FA given that a model for continuous variables might better approximate the data when the number of response categories increases. The number of variables is manipulated because the size of the fitted model is another condition that affects the population value of the discrepancy function and might influence the results of the chi-square.</p>
<p>The simulating model is a common-factor model (one latent factor) for categorical variables that realistically represents data from rating scales. We have simulated only the one-factor model to keep the number of conditions at a manageable level. Because the true model contains a single factor, we conclude that overfactoring occurs whenever the number of estimated factors is more than one.</p>
<p>The following models were fitted to each sample:</p>
<list list-type="simple">
<list-item>
<label>(1)</label>
<p><bold>Factor analysis model fitted by ML</bold>. It was estimated using the <italic>fa</italic> function of the <italic>psych</italic> package in R (<xref ref-type="bibr" rid="B46">Revelle, 2021</xref>).</p>
</list-item>
<list-item>
<label>(2)</label>
<p><bold>Categorical factor analysis model</bold>. It is the FAC model estimated using a maximum-likelihood algorithm from the polychoric correlation matrix. It was estimated using the <italic>fa</italic> function of the <italic>psych</italic> package in R (<xref ref-type="bibr" rid="B46">Revelle, 2021</xref>).</p>
</list-item>
<list-item>
<label>(3)</label>
<p>The FAC model estimated using a weighted least squares estimation method (WLSMV) from the polychoric correlation matrix. The WLSMV method has proved to be appropriate for the analysis of ordinal data (<xref ref-type="bibr" rid="B15">Flora and Curran, 2004</xref>; <xref ref-type="bibr" rid="B19">Forero et al., 2009</xref>; <xref ref-type="bibr" rid="B32">Li, 2016</xref>). This method was applied using the <italic>lavaan</italic> package in R (<xref ref-type="bibr" rid="B54">Rosseel, 2012</xref>), which provides a normal-theory likelihood-ratio chi-square statistic used to select the number of latent factors in an exploratory analysis.</p>
</list-item>
<list-item>
<label>(4)</label>
<p>The estimator WLSMV in lavaan provides two values of chi-square, the likelihood-ratio chi-square and the scaled chi-square statistic by <xref ref-type="bibr" rid="B58">Satorra and Bentler (2010)</xref>. The idea of scaling is to correct the distribution of the statistic to match the mean and variance of the theoretical chi-square distribution. The scaling correction applies to differences between non-robust chi-square statistics. The difference between the chi-square values of models with one and two factors is another chi-square. The Satorra-Bentler correction of the chi-square difference is the <bold>d<italic>T</italic><sub><italic>M</italic></sub> statistic</bold>, which is used to evaluate whether the difference between the one- and two-factor models is significant (<xref ref-type="bibr" rid="B2">Bandalos, 2014</xref>; <xref ref-type="bibr" rid="B12">DiStefano and Morgan, 2014</xref>). The d<italic>T</italic><sub><italic>M</italic></sub> was estimated in <italic>lavaan</italic> using the <italic>lavTestLRT</italic> function.</p>
</list-item>
<list-item>
<label>(5)</label>
<p>The <bold>GRM model</bold> assumes the same relation between manifest variables and latent factors as the FAC model. The specificity of GRM is that it is estimated using a marginal maximum-likelihood estimation algorithm implemented in the <italic>mirt</italic> package of R (<xref ref-type="bibr" rid="B8">Chalmers, 2012</xref>). This package uses a logit link to relate manifest variables and latent factors. The logit link is a convenient mathematical approximation to the normal ogive model, so it is based on the same assumption of normally distributed errors as the FAC model.</p>
</list-item>
</list>
<p>The R code for fitting these models appears in Appendix E of the <xref ref-type="supplementary-material" rid="DS1">Supplementary materials</xref>.</p>
<p>The procedure for simulating categorical data with a prescribed value of skewness is due to <xref ref-type="bibr" rid="B43">Olsson (1979)</xref>. We are working in the specific case of ordinal data with binomial marginals. The binominal marginal has one free parameter, &#x03C0;, which can be chosen to get the desired skewness. Let &#x03B3; be the coefficient of skewness and <italic>r</italic> be the number of thresholds. Using Equation (7) in <xref ref-type="bibr" rid="B43">Olsson (1979)</xref>, the parameter &#x03C0; of the binomial distribution with given values of <italic>r</italic> and &#x03B3; is:</p>
<disp-formula id="S3.E5">
<label>(5)</label>
<mml:math id="M5">
<mml:mrow>
<mml:mi>&#x03C0;</mml:mi>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn>
</mml:mfrac>
<mml:mo>-</mml:mo>
<mml:msqrt>
<mml:mfrac>
<mml:mrow>
<mml:msup>
<mml:mi>&#x03B3;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>16</mml:mn>
<mml:mo>+</mml:mo>
<mml:mrow>
<mml:mn>4</mml:mn>
<mml:mo>&#x2062;</mml:mo>
<mml:msup>
<mml:mi>&#x03B3;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
<mml:mo>&#x2062;</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:mrow>
</mml:mfrac>
</mml:msqrt>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
<p>Once that &#x03C0; has been computed from Equation (5), the second step of the procedure consists of computing the cumulative probabilities of the binomial (<italic>r</italic>, &#x03C0;) distribution, say <bold><italic>pr</italic></bold> = [<italic>pr</italic><sub>1</sub>, <italic>pr</italic><sub>2</sub>, &#x2026;, <italic>pr</italic><sub>(r+1)</sub>]. Finally, the thresholds &#x03C4;<sub>1</sub>, &#x2026;, &#x03C4;<sub><italic>k</italic></sub> are the standard normal deviates associated to the probabilities given by <bold><italic>pr</italic></bold>; that is, the thresholds are &#x03C4;<sub><italic>t</italic></sub> = <italic>F</italic><sup>&#x2212;1</sup>(<italic>pr</italic><sub><italic>t</italic></sub>) for <italic>t</italic> = 1, &#x2026;, <italic>r</italic>.</p>
<p><xref ref-type="table" rid="T2">Table 2</xref> shows the threshold values associated with the different values of &#x03B3; and <italic>r</italic> manipulated in the simulation study. The same thresholds were used for all the variables that have the same value of &#x03B3;. Notice that it is not guaranteed that the set of thresholds associated with a given &#x03B3; is unique. Other algorithms may remove the assumption of binomial marginals and render different thresholds, which could have an impact on the results of the simulation. The R code for implementing this procedure is given in Appendix E.</p>
<table-wrap position="float" id="T2">
<label>TABLE 2</label>
<caption><p>Threshold values for the conditions of the simulation study.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left"></td>
<td valign="top" align="center" colspan="7">Number of thresholds</td>
</tr>
<tr>
<td valign="top" align="center"></td>
<td valign="top" align="center" colspan="7"><hr/></td>
</tr>
<tr>
<td valign="top" align="left">Gamma</td>
<td valign="top" align="center">1 threshold</td>
<td valign="top" align="center" colspan="2">2 thresholds</td>
<td valign="top" align="center" colspan="4">4 thresholds</td>
</tr>
<tr>
<td valign="top" align="center"></td>
<td valign="top" align="center"><hr/></td>
<td valign="top" align="center" colspan="2"><hr/></td>
<td valign="top" align="center" colspan="4"><hr/></td>
</tr>
<tr>
<td/>
<td valign="top" align="center">&#x03C4;<sub>1</sub></td>
<td valign="top" align="center">&#x03C4;<sub>1</sub></td>
<td valign="top" align="center">&#x03C4;<sub>2</sub></td>
<td valign="top" align="center">&#x03C4;<sub>1</sub></td>
<td valign="top" align="center">&#x03C4;<sub>2</sub></td>
<td valign="top" align="center">&#x03C4;<sub>3</sub></td>
<td valign="top" align="center">&#x03C4;<sub>4</sub></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">-2</td>
<td valign="top" align="center">&#x2013;1.05</td>
<td valign="top" align="center">&#x2013;2.39</td>
<td valign="top" align="center">&#x2013;0.93</td>
<td valign="top" align="center">&#x2013;4.32</td>
<td valign="top" align="center">&#x2013;3.26</td>
<td valign="top" align="center">&#x2013;2.16</td>
<td valign="top" align="center">&#x2013;0.86</td>
</tr>
<tr>
<td valign="top" align="left">-1</td>
<td valign="top" align="center">&#x2013;0.59</td>
<td valign="top" align="center">&#x2013;1.70</td>
<td valign="top" align="center">&#x2013;0.31</td>
<td valign="top" align="center">&#x2013;3.31</td>
<td valign="top" align="center">2.28</td>
<td valign="top" align="center">&#x2013;1.25</td>
<td valign="top" align="center">&#x2013;0.08</td>
</tr>
<tr>
<td valign="top" align="left">0</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">&#x2013;0.67</td>
<td valign="top" align="center">0.67</td>
<td valign="top" align="center">&#x2013;1.53</td>
<td valign="top" align="center">&#x2013;0.49</td>
<td valign="top" align="center">0.49</td>
<td valign="top" align="center">1.53</td>
</tr>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="center">0.59</td>
<td valign="top" align="center">0.31</td>
<td valign="top" align="center">1.70</td>
<td valign="top" align="center">0.08</td>
<td valign="top" align="center">1.25</td>
<td valign="top" align="center">2.28</td>
<td valign="top" align="center">3.31</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="center">1.05</td>
<td valign="top" align="center">0.93</td>
<td valign="top" align="center">2.39</td>
<td valign="top" align="center">0.86</td>
<td valign="top" align="center">2.16</td>
<td valign="top" align="center">3.26</td>
<td valign="top" align="center">4.32</td>
</tr>
</tbody>
</table></table-wrap>
<p>In all simulated conditions, the population model is FAC. However, there are conditions where the FA is capable of reproducing the true population variance-covariance matrix and there are others where it is not. As explained in Section &#x201C;Simulation study,&#x201D; the FA can reproduce the variance-covariance matrix when all the items have the same parameters (slopes, error variance, and thresholds; that is, in the conditions with no mixed skewness because mixed skewness means that the items have different thresholds). In the conditions with no mixed skewness, the FA and the FAC models are equivalent in the population, the two models have the same number of factors, and attenuation coefficients for the Pearson correlations (<xref ref-type="bibr" rid="B42">Muth&#x00E9;n and Kaplan, 1985</xref>) take the values shown in <xref ref-type="supplementary-material" rid="DS1">Supplementary Table 1</xref>. In these cases, any discrepancy between the number of factors estimated by FA and FAC is not attributable to the population model and the simulation provides information about the relative performance of the estimation methods.</p>
<p>In the conditions with mixed skewness (mild and mixed skewness, strong and mixed, none + strong, and mixed), the maximum-likelihood discrepancy function for the FA model attains a nonzero value in the population. The population value of the discrepancy and the RMSEA (<xref ref-type="bibr" rid="B5">Browne and Cudeck, 1992</xref>) appear in <xref ref-type="supplementary-material" rid="DS1">Supplementary Table 2</xref>. The consequence is an inflated chi-square statistic and an increased probability of rejecting a FA model that contains the correct number of factors. In these cases, the simulations inform about the magnitude of the overfactoring effect due to fitting an incorrect model (the FA). Notice that the conditions of mixed skewness are more realistic and representative of practical applications, where the variables rarely are perfectly parallel measures.</p>
<p>The data were analyzed using the empirical proportion of samples (EPS) in which the one-factor model is retained for each condition. The logit of the <italic>p</italic>-value was analyzed as a function of the independent variables manipulated in the study using an ANOVA model to compute the effect size (partial-&#x03B7;<sup>2</sup>) associated with each condition. Notice that the EPS is the proportion of correct decisions, and 1&#x2014;EPS is the proportion of samples in which the latent dimensionality is overestimated. Thus, the EPS can be interpreted as the statistical specificity of the testing procedure and 1&#x2014;EPS as the Type-I statistical error.</p>
<p>All the analyses were conducted using the R programming language (<xref ref-type="bibr" rid="B45">R Core Team, 2021</xref>) and the aforementioned packages. Some other R packages were used for the analysis of the results: the skewness and kurtosis were estimated using <italic>e1071</italic> (<xref ref-type="bibr" rid="B39">Meyer et al., 2021</xref>), and the ANOVA table of the results of the simulation was computed using <italic>sjstats</italic> (<xref ref-type="bibr" rid="B35">L&#x00FC;decke, 2021</xref>). All the methods used in this article are included in the R language and the indicated packages, so they are readily available and free of charge for any practitioner.</p>
</sec>
<sec id="S3.SS2">
<title>Results of the simulation</title>
<sec id="S3.SS2.SSS1">
<title>Descriptive analysis of the empirical proportion of samples</title>
<p>The EPS for the true one-factor model computed across all simulated conditions is 0.33 for FA, 0.05 for FAC, 0.78 for WLSMV, 0.80 for d<italic>T</italic><sub><italic>M</italic></sub>, and 0.75 for GRM. Regarding parallel analysis, the EPS mean is 0.51 for PA and 0.55 for CPA. This overall result is congruent with the problems of the linear factor model analyzing ordinal data and reflects the improvement with the Satorra-Bentler chi-square and the GRM. These figures are disaggregated now to analyze the effects of the independent variables separately.</p>
<p><xref ref-type="table" rid="T3">Table 3</xref> summarizes the EPS as a function of the skewness (which determines if the FA model is equivalent to the FAC model or not) and the magnitude of the factor loadings in the condition with 6 variables. The results show a strong decrease in the EPS associated with FA and FAC in the conditions of high factor loadings and mixed skewness. However, this decrease does not occur for WLSMV and d<italic>T</italic><sub><italic>M</italic></sub>, which generally provide a good estimation of the number of factors. Interestingly, the polychoric-based methods (CPA, FA, WLSMV, and d<italic>T</italic><sub><italic>M</italic></sub>) do not provide similar results, and the estimation method applied in connection to polychoric correlations apparently is a quite decisive element. The GRM is not as precise as WLSMV or d<italic>T</italic><sub><italic>M</italic></sub> but it is clearly superior to FA and FAC.</p>
<table-wrap position="float" id="T3">
<label>TABLE 3</label>
<caption><p>Empirical proportion of samples where the one-factor model is retained depending on the status of the linear model, the skewness, the magnitude of the factor loadings, and the method.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Linear model</td>
<td valign="top" align="left">Skew</td>
<td valign="top" align="left">Factor loadings</td>
<td valign="top" align="center">PA</td>
<td valign="top" align="center">FA</td>
<td valign="top" align="center">WLSMV</td>
<td valign="top" align="center">d<italic>T</italic><sub><italic>M</italic></sub></td>
<td valign="top" align="center">CPA</td>
<td valign="top" align="center">FAC</td>
<td valign="top" align="center">GRM</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">True</td>
<td valign="top" align="left">None</td>
<td valign="top" align="left">Medium</td>
<td valign="top" align="center">0.998</td>
<td valign="top" align="center">0.867</td>
<td valign="top" align="center">0.961</td>
<td valign="top" align="center">0.931</td>
<td valign="top" align="center">0.999</td>
<td valign="top" align="center">0.431</td>
<td valign="top" align="center">0.868</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">High</td>
<td valign="top" align="center">0.908</td>
<td valign="top" align="center">0.708</td>
<td valign="top" align="center">0.981</td>
<td valign="top" align="center">0.964</td>
<td valign="top" align="center">0.712</td>
<td valign="top" align="center">0.131</td>
<td valign="top" align="center">0.839</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">SP</td>
<td valign="top" align="left">Medium</td>
<td valign="top" align="center">0.925</td>
<td valign="top" align="center">0.629</td>
<td valign="top" align="center">0.937</td>
<td valign="top" align="center">0.925</td>
<td valign="top" align="center">0.955</td>
<td valign="top" align="center">0.024</td>
<td valign="top" align="center">0.881</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">High</td>
<td valign="top" align="center">0.546</td>
<td valign="top" align="center">0.234</td>
<td valign="top" align="center">0.985</td>
<td valign="top" align="center">0.977</td>
<td valign="top" align="center">0.162</td>
<td valign="top" align="center">0.003</td>
<td valign="top" align="center">0.869</td>
</tr>
<tr>
<td valign="top" align="left">False</td>
<td valign="top" align="left">MM</td>
<td valign="top" align="left">Medium</td>
<td valign="top" align="center">0.074</td>
<td valign="top" align="center">0.807</td>
<td valign="top" align="center">0.954</td>
<td valign="top" align="center">0.934</td>
<td valign="top" align="center">0.814</td>
<td valign="top" align="center">0.160</td>
<td valign="top" align="center">0.876</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">High</td>
<td valign="top" align="center">0.787</td>
<td valign="top" align="center">0.004</td>
<td valign="top" align="center">0.979</td>
<td valign="top" align="center">0.970</td>
<td valign="top" align="center">0.447</td>
<td valign="top" align="center">0.009</td>
<td valign="top" align="center">0.881</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">SM</td>
<td valign="top" align="left">Medium</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.545</td>
<td valign="top" align="center">0.971</td>
<td valign="top" align="center">0.967</td>
<td valign="top" align="center">0.818</td>
<td valign="top" align="center">0.005</td>
<td valign="top" align="center">0.897</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">High</td>
<td valign="top" align="center">0.143</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.961</td>
<td valign="top" align="center">0.961</td>
<td valign="top" align="center">0.045</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.998</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">None + SM</td>
<td valign="top" align="left">Medium</td>
<td valign="top" align="center">0.009</td>
<td valign="top" align="center">0.760</td>
<td valign="top" align="center">0.966</td>
<td valign="top" align="center">0.955</td>
<td valign="top" align="center">0.731</td>
<td valign="top" align="center">0.025</td>
<td valign="top" align="center">0.883</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">High</td>
<td valign="top" align="center">0.582</td>
<td valign="top" align="center">0.007</td>
<td valign="top" align="center">0.994</td>
<td valign="top" align="center">0.994</td>
<td valign="top" align="center">0.132</td>
<td valign="top" align="center">0.002</td>
<td valign="top" align="center">0.916</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p>Results for the conditions with 6 variables. The linear model is true when all the variables have the same slope, error variance and thresholds (conditions of no skewness and strong positive skewness). PA is parallel analysis, CPA is Categorical parallel analysis, FA is normal factor analysis, FAC is categorical factor analysis from the tetrachoric correlation matrix, WLSMV is FA estimated using a robust diagonally weighted least square estimation method, d<italic>T<sub>M</sub></italic> is the Satorra-Bentler mean corrected chi-square difference, and GRM is the Samejima&#x2019;s graded response model.</p></fn>
</table-wrap-foot>
</table-wrap>
<p><xref ref-type="table" rid="T4">Table 4</xref> shows the PA in relation to skewness and magnitude of the factor loadings in the condition with 12 variables. The results confirm the poor performance of the ML and tetrachoric estimation methods. The WLSMV, d<italic>T</italic><sub><italic>M</italic></sub>, and GRM are stable across all conditions, although the EPS for GRM is not as optimal as in the other two methods. This stability is a crucial result since it means that, under the simulated conditions, the performance of these methods remains constant no matter what the skew of the factor indicators is.</p>
<table-wrap position="float" id="T4">
<label>TABLE 4</label>
<caption><p>Empirical proportion of samples where the one-factor model is retained depending on the status of the linear model, the skewness, the magnitude of the factor loadings, and the method.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Linear model</td>
<td valign="top" align="left">Skew</td>
<td valign="top" align="left">Factor loadings</td>
<td valign="top" align="center">PA</td>
<td valign="top" align="center">FA</td>
<td valign="top" align="center">WLSMV</td>
<td valign="top" align="center">d<italic>T</italic><sub><italic>M</italic></sub></td>
<td valign="top" align="center">CPA</td>
<td valign="top" align="center">FAC</td>
<td valign="top" align="center">GRM</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">True</td>
<td valign="top" align="left">None</td>
<td valign="top" align="left">Medium</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">0.611</td>
<td valign="top" align="center">0.910</td>
<td valign="top" align="center">0.929</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">0.084</td>
<td valign="top" align="center">0.608</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">High</td>
<td valign="top" align="center">0.977</td>
<td valign="top" align="center">0.348</td>
<td valign="top" align="center">0.984</td>
<td valign="top" align="center">0.969</td>
<td valign="top" align="center">0.682</td>
<td valign="top" align="center">0.002</td>
<td valign="top" align="center">0.525</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">SP</td>
<td valign="top" align="left">Medium</td>
<td valign="top" align="center">0.930</td>
<td valign="top" align="center">0.096</td>
<td valign="top" align="center">0.792</td>
<td valign="top" align="center">0.938</td>
<td valign="top" align="center">0.962</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.609</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">High</td>
<td valign="top" align="center">0.535</td>
<td valign="top" align="center">0.001</td>
<td valign="top" align="center">0.983</td>
<td valign="top" align="center">0.987</td>
<td valign="top" align="center">0.017</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.582</td>
</tr>
<tr>
<td valign="top" align="left">False</td>
<td valign="top" align="left">MM</td>
<td valign="top" align="left">Medium</td>
<td valign="top" align="center">0.017</td>
<td valign="top" align="center">0.431</td>
<td valign="top" align="center">0.863</td>
<td valign="top" align="center">0.915</td>
<td valign="top" align="center">0.836</td>
<td valign="top" align="center">0.002</td>
<td valign="top" align="center">0.612</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">High</td>
<td valign="top" align="center">0.823</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.960</td>
<td valign="top" align="center">0.967</td>
<td valign="top" align="center">0.334</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.580</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">SM</td>
<td valign="top" align="left">Medium</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.167</td>
<td valign="top" align="center">0.875</td>
<td valign="top" align="center">0.942</td>
<td valign="top" align="center">0.720</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.620</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">High</td>
<td valign="top" align="center">0.292</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.733</td>
<td valign="top" align="center">0.948</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.798</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">None + SM</td>
<td valign="top" align="left">Medium</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.323</td>
<td valign="top" align="center">0.875</td>
<td valign="top" align="center">0.937</td>
<td valign="top" align="center">0.707</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.610</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">High</td>
<td valign="top" align="center">0.578</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.844</td>
<td valign="top" align="center">0.997</td>
<td valign="top" align="center">0.015</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.640</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p>Results for the conditions with 12 variables. The linear model is true when all the variables have the same slope, error variance and thresholds (conditions of no skewness and strong positive skewness). PA is parallel analysis, CPA is Categorical parallel analysis, FA is normal factor analysis, FAC is categorical factor analysis from the tetrachoric correlation matrix, WLSMV is FA estimated using a robust diagonally weighted least square estimation method, d<italic>T<sub>M</sub></italic> is the Satorra-Bentler mean corrected chi-square difference, and GRM is the Samejima&#x2019;s graded response model.</p></fn>
</table-wrap-foot>
</table-wrap>
<p><xref ref-type="table" rid="T5">Table 5</xref> contains the PA in relation to skewness and sample size for the conditions with 6 variables. The increase in sample size has a small effect on PA except in the conditions of ML and PA in connection to the FA model.</p>
<table-wrap position="float" id="T5">
<label>TABLE 5</label>
<caption><p>Empirical proportion of samples where the one-factor model is retained depending on the status of the linear model, the skewness, the sample size, and the method.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Linear model</td>
<td valign="top" align="left">Skew</td>
<td valign="top" align="center"><italic>N</italic></td>
<td valign="top" align="center">PA</td>
<td valign="top" align="center">FA</td>
<td valign="top" align="center">WLSMV</td>
<td valign="top" align="center">d<italic>T</italic><sub><italic>M</italic></sub></td>
<td valign="top" align="center">CPA</td>
<td valign="top" align="center">FAC</td>
<td valign="top" align="center">GRM</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">True</td>
<td valign="top" align="left">None</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.948</td>
<td valign="top" align="center">0.788</td>
<td valign="top" align="center">0.971</td>
<td valign="top" align="center">0.962</td>
<td valign="top" align="center">0.852</td>
<td valign="top" align="center">0.276</td>
<td valign="top" align="center">0.863</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">500</td>
<td valign="top" align="center">0.959</td>
<td valign="top" align="center">0.787</td>
<td valign="top" align="center">0.971</td>
<td valign="top" align="center">0.933</td>
<td valign="top" align="center">0.859</td>
<td valign="top" align="center">0.286</td>
<td valign="top" align="center">0.845</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">SP</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.707</td>
<td valign="top" align="center">0.444</td>
<td valign="top" align="center">0.952</td>
<td valign="top" align="center">0.952</td>
<td valign="top" align="center">0.542</td>
<td valign="top" align="center">0.012</td>
<td valign="top" align="center">0.879</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">500</td>
<td valign="top" align="center">0.765</td>
<td valign="top" align="center">0.418</td>
<td valign="top" align="center">0.963</td>
<td valign="top" align="center">0.944</td>
<td valign="top" align="center">0.574</td>
<td valign="top" align="center">0.015</td>
<td valign="top" align="center">0.871</td>
</tr>
<tr>
<td valign="top" align="left">False</td>
<td valign="top" align="left">MM</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.480</td>
<td valign="top" align="center">0.448</td>
<td valign="top" align="center">0.963</td>
<td valign="top" align="center">0.958</td>
<td valign="top" align="center">0.670</td>
<td valign="top" align="center">0.085</td>
<td valign="top" align="center">0.895</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">500</td>
<td valign="top" align="center">0.381</td>
<td valign="top" align="center">0.364</td>
<td valign="top" align="center">0.967</td>
<td valign="top" align="center">0.943</td>
<td valign="top" align="center">0.591</td>
<td valign="top" align="center">0.084</td>
<td valign="top" align="center">0.862</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">SM</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.126</td>
<td valign="top" align="center">0.393</td>
<td valign="top" align="center">0.955</td>
<td valign="top" align="center">0.955</td>
<td valign="top" align="center">0.494</td>
<td valign="top" align="center">0.002</td>
<td valign="top" align="center">0.968</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">500</td>
<td valign="top" align="center">0.017</td>
<td valign="top" align="center">0.152</td>
<td valign="top" align="center">0.975</td>
<td valign="top" align="center">0.971</td>
<td valign="top" align="center">0.369</td>
<td valign="top" align="center">0.004</td>
<td valign="top" align="center">0.927</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">None + SM</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.333</td>
<td valign="top" align="center">0.442</td>
<td valign="top" align="center">0.982</td>
<td valign="top" align="center">0.980</td>
<td valign="top" align="center">0.503</td>
<td valign="top" align="center">0.011</td>
<td valign="top" align="center">0.915</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">500</td>
<td valign="top" align="center">0.258</td>
<td valign="top" align="center">0.325</td>
<td valign="top" align="center">0.977</td>
<td valign="top" align="center">0.967</td>
<td valign="top" align="center">0.359</td>
<td valign="top" align="center">0.016</td>
<td valign="top" align="center">0.884</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p>Results for the conditions with 6 variables. The linear model is true when all the variables have the same slope, error variance and thresholds (conditions of no skewness and strong positive skewness). PA is parallel analysis, CPA is Categorical parallel analysis, FA is normal factor analysis, FAC is categorical factor analysis from the tetrachoric correlation matrix, WLSMV is FA estimated using a robust diagonally weighted least square estimation method, d<italic>T<sub>M</sub></italic> is the Satorra-Bentler mean corrected chi-square difference, and GRM is the Samejima&#x2019;s graded response model.</p></fn>
</table-wrap-foot>
</table-wrap>
<p><xref ref-type="table" rid="T6">Table 6</xref> contains the PA in relation to skewness and sample size for the conditions with 12 variables. The pattern of results is similar to <xref ref-type="table" rid="T6">Table 6</xref> with a noticeable effect; the EPS for GRM reduces with the increase of the number of variables. This is because the GRM is fitted to the contingency table of the response patterns, which is sparser as the number of variables increases.</p>
<table-wrap position="float" id="T6">
<label>TABLE 6</label>
<caption><p>Empirical proportion of samples where the one-factor model is retained depending on the status of the linear model, the skewness, the sample size, and the method.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Linear model</td>
<td valign="top" align="left">Skew</td>
<td valign="top" align="center"><italic>N</italic></td>
<td valign="top" align="center">PA</td>
<td valign="top" align="center">FA</td>
<td valign="top" align="center">WLSMV</td>
<td valign="top" align="center">d<italic>T</italic><sub><italic>M</italic></sub></td>
<td valign="top" align="center">CPA</td>
<td valign="top" align="center">FAC</td>
<td valign="top" align="center">GRM</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">True</td>
<td valign="top" align="left">None</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.984</td>
<td valign="top" align="center">0.502</td>
<td valign="top" align="center">0.917</td>
<td valign="top" align="center">0.987</td>
<td valign="top" align="center">0.834</td>
<td valign="top" align="center">0.041</td>
<td valign="top" align="center">0.581</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">500</td>
<td valign="top" align="center">0.993</td>
<td valign="top" align="center">0.456</td>
<td valign="top" align="center">0.974</td>
<td valign="top" align="center">0.911</td>
<td valign="top" align="center">0.848</td>
<td valign="top" align="center">0.044</td>
<td valign="top" align="center">0.552</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">SP</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.704</td>
<td valign="top" align="center">0.052</td>
<td valign="top" align="center">0.782</td>
<td valign="top" align="center">0.974</td>
<td valign="top" align="center">0.477</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.594</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">500</td>
<td valign="top" align="center">0.762</td>
<td valign="top" align="center">0.045</td>
<td valign="top" align="center">0.926</td>
<td valign="top" align="center">0.948</td>
<td valign="top" align="center">0.502</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.597</td>
</tr>
<tr>
<td valign="top" align="left">False</td>
<td valign="top" align="left">MM</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.484</td>
<td valign="top" align="center">0.303</td>
<td valign="top" align="center">0.849</td>
<td valign="top" align="center">0.974</td>
<td valign="top" align="center">0.644</td>
<td valign="top" align="center">0.001</td>
<td valign="top" align="center">0.608</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">500</td>
<td valign="top" align="center">0.356</td>
<td valign="top" align="center">0.128</td>
<td valign="top" align="center">0.938</td>
<td valign="top" align="center">0.910</td>
<td valign="top" align="center">0.526</td>
<td valign="top" align="center">0.001</td>
<td valign="top" align="center">0.584</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">SM</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.287</td>
<td valign="top" align="center">0.165</td>
<td valign="top" align="center">0.721</td>
<td valign="top" align="center">0.941</td>
<td valign="top" align="center">0.490</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.731</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">500</td>
<td valign="top" align="center">0.005</td>
<td valign="top" align="center">0.002</td>
<td valign="top" align="center">0.792</td>
<td valign="top" align="center">0.947</td>
<td valign="top" align="center">0.230</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.687</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">None + SM</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">0.432</td>
<td valign="top" align="center">0.272</td>
<td valign="top" align="center">0.757</td>
<td valign="top" align="center">0.979</td>
<td valign="top" align="center">0.489</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.653</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">500</td>
<td valign="top" align="center">0.146</td>
<td valign="top" align="center">0.051</td>
<td valign="top" align="center">0.939</td>
<td valign="top" align="center">0.949</td>
<td valign="top" align="center">0.233</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.598</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p>Results for the conditions with 12 variables. The linear model is true when all the variables have the same slope, error variance and thresholds (conditions of no skewness and strong positive skewness). PA is parallel analysis, CPA is Categorical parallel analysis, FA is normal factor analysis, FAC is categorical factor analysis from the tetrachoric correlation matrix, WLSMV is FA estimated using a robust diagonally weighted least square estimation method, d<italic>T<sub>M</sub></italic> is the Satorra-Bentler mean corrected chi-square difference, and GRM is the Samejima&#x2019;s graded response model.</p></fn>
</table-wrap-foot>
</table-wrap>
<p>The results concerning skewness and the number of thresholds appear in <xref ref-type="table" rid="T7">Tables 7</xref>, <xref ref-type="table" rid="T8">8</xref> for the conditions with 6 and 12 items, respectively. An increase in the number of thresholds is associated with a mild increase in correct estimations of dimensionality for the FA (in the conditions without mixed skewness) and a reduction for the GRM. One tentative explanation is that more response categories mean that the normal distribution better represents the data and the contingency table of the responses (used for GRM estimation) is sparser. Once again, the WLSMV and d<italic>T</italic><sub><italic>M</italic></sub> are the best methods, and PA supersedes CPA.</p>
<table-wrap position="float" id="T7">
<label>TABLE 7</label>
<caption><p>Empirical proportion of samples where the one-factor model is retained depending on the status of the linear model, the skewness, the number of thresholds, and the method.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Linear model</td>
<td valign="top" align="left">Skew</td>
<td valign="top" align="center">Thresholds</td>
<td valign="top" align="center">PA</td>
<td valign="top" align="center">FA</td>
<td valign="top" align="center">WLSMV</td>
<td valign="top" align="center">d<italic>T</italic><sub><italic>M</italic></sub></td>
<td valign="top" align="center">CPA</td>
<td valign="top" align="center">FAC</td>
<td valign="top" align="center">GRM</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">True</td>
<td valign="top" align="left">None</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0.917</td>
<td valign="top" align="center">0.657</td>
<td valign="top" align="center">0.971</td>
<td valign="top" align="center">0.953</td>
<td valign="top" align="center">0.715</td>
<td valign="top" align="center">0.068</td>
<td valign="top" align="center">0.897</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0.966</td>
<td valign="top" align="center">0.827</td>
<td valign="top" align="center">0.971</td>
<td valign="top" align="center">0.947</td>
<td valign="top" align="center">0.894</td>
<td valign="top" align="center">0.265</td>
<td valign="top" align="center">0.861</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">4</td>
<td valign="top" align="center">0.976</td>
<td valign="top" align="center">0.879</td>
<td valign="top" align="center">0.971</td>
<td valign="top" align="center">0.941</td>
<td valign="top" align="center">0.957</td>
<td valign="top" align="center">0.510</td>
<td valign="top" align="center">0.802</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">SP</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0.667</td>
<td valign="top" align="center">0.400</td>
<td valign="top" align="center">0.970</td>
<td valign="top" align="center">0.964</td>
<td valign="top" align="center">0.503</td>
<td valign="top" align="center">0.006</td>
<td valign="top" align="center">0.898</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0.753</td>
<td valign="top" align="center">0.432</td>
<td valign="top" align="center">0.953</td>
<td valign="top" align="center">0.942</td>
<td valign="top" align="center">0.568</td>
<td valign="top" align="center">0.014</td>
<td valign="top" align="center">0.868</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">4</td>
<td valign="top" align="center">0.787</td>
<td valign="top" align="center">0.462</td>
<td valign="top" align="center">0.948</td>
<td valign="top" align="center">0.933</td>
<td valign="top" align="center">0.603</td>
<td valign="top" align="center">0.021</td>
<td valign="top" align="center">0.859</td>
</tr>
<tr>
<td valign="top" align="left">False</td>
<td valign="top" align="left">MM</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0.357</td>
<td valign="top" align="center">0.416</td>
<td valign="top" align="center">0.979</td>
<td valign="top" align="center">0.971</td>
<td valign="top" align="center">0.336</td>
<td valign="top" align="center">0.020</td>
<td valign="top" align="center">0.921</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0.433</td>
<td valign="top" align="center">0.397</td>
<td valign="top" align="center">0.963</td>
<td valign="top" align="center">0.944</td>
<td valign="top" align="center">0.728</td>
<td valign="top" align="center">0.082</td>
<td valign="top" align="center">0.864</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">4</td>
<td valign="top" align="center">0.501</td>
<td valign="top" align="center">0.405</td>
<td valign="top" align="center">0.951</td>
<td valign="top" align="center">0.930</td>
<td valign="top" align="center">0.828</td>
<td valign="top" align="center">0.153</td>
<td valign="top" align="center">0.850</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">SM</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0.037</td>
<td valign="top" align="center">0.299</td>
<td valign="top" align="center">0.948</td>
<td valign="top" align="center">0.948</td>
<td valign="top" align="center">0.486</td>
<td valign="top" align="center">0.002</td>
<td valign="top" align="center">0.960</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0.077</td>
<td valign="top" align="center">0.261</td>
<td valign="top" align="center">0.979</td>
<td valign="top" align="center">0.976</td>
<td valign="top" align="center">0.436</td>
<td valign="top" align="center">0.002</td>
<td valign="top" align="center">0.947</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">4</td>
<td valign="top" align="center">0.101</td>
<td valign="top" align="center">0.258</td>
<td valign="top" align="center">0.978</td>
<td valign="top" align="center">0.973</td>
<td valign="top" align="center">0.372</td>
<td valign="top" align="center">0.005</td>
<td valign="top" align="center">0.936</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">None + SM</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0.220</td>
<td valign="top" align="center">0.400</td>
<td valign="top" align="center">0.979</td>
<td valign="top" align="center">0.977</td>
<td valign="top" align="center">0.302</td>
<td valign="top" align="center">0.005</td>
<td valign="top" align="center">0.925</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0.318</td>
<td valign="top" align="center">0.377</td>
<td valign="top" align="center">0.983</td>
<td valign="top" align="center">0.974</td>
<td valign="top" align="center">0.486</td>
<td valign="top" align="center">0.011</td>
<td valign="top" align="center">0.909</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">4</td>
<td valign="top" align="center">0.347</td>
<td valign="top" align="center">0.373</td>
<td valign="top" align="center">0.975</td>
<td valign="top" align="center">0.967</td>
<td valign="top" align="center">0.504</td>
<td valign="top" align="center">0.024</td>
<td valign="top" align="center">0.865</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p>Results for the conditions with 6 variables. The linear model is true when all the variables have the same slope, error variance and thresholds (conditions of no skewness and strong positive skewness). PA is parallel analysis, CPA is Categorical parallel analysis, FA is normal factor analysis, FAC is categorical factor analysis from the tetrachoric correlation matrix, WLSMV is FA estimated using a robust diagonally weighted least square estimation method, d<italic>T<sub>M</sub></italic> is the Satorra-Bentler mean corrected chi-square difference, and GRM is the Samejima&#x2019;s graded response model.</p></fn>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="T8">
<label>TABLE 8</label>
<caption><p>Empirical proportion of samples where the one-factor model is retained depending on the status of the linear model, the skewness, the number of thresholds, and the method.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Linear model</td>
<td valign="top" align="left">Skew</td>
<td valign="top" align="center">Thresholds</td>
<td valign="top" align="center">PA</td>
<td valign="top" align="center">FA</td>
<td valign="top" align="center">WLSMV</td>
<td valign="top" align="center">d<italic>T</italic><sub><italic>M</italic></sub></td>
<td valign="top" align="center">CPA</td>
<td valign="top" align="center">FAC</td>
<td valign="top" align="center">GRM</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">True</td>
<td valign="top" align="left">None</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0.973</td>
<td valign="top" align="center">0.283</td>
<td valign="top" align="center">0.932</td>
<td valign="top" align="center">0.958</td>
<td valign="top" align="center">0.615</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.639</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0.996</td>
<td valign="top" align="center">0.502</td>
<td valign="top" align="center">0.957</td>
<td valign="top" align="center">0.948</td>
<td valign="top" align="center">0.920</td>
<td valign="top" align="center">0.009</td>
<td valign="top" align="center">0.593</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">4</td>
<td valign="top" align="center">0.996</td>
<td valign="top" align="center">0.652</td>
<td valign="top" align="center">0.949</td>
<td valign="top" align="center">0.938</td>
<td valign="top" align="center">0.988</td>
<td valign="top" align="center">0.119</td>
<td valign="top" align="center">0.468</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">SP</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0.659</td>
<td valign="top" align="center">0.047</td>
<td valign="top" align="center">0.897</td>
<td valign="top" align="center">0.979</td>
<td valign="top" align="center">0.462</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.627</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0.754</td>
<td valign="top" align="center">0.051</td>
<td valign="top" align="center">0.852</td>
<td valign="top" align="center">0.944</td>
<td valign="top" align="center">0.496</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.592</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">4</td>
<td valign="top" align="center">0.785</td>
<td valign="top" align="center">0.048</td>
<td valign="top" align="center">0.841</td>
<td valign="top" align="center">0.933</td>
<td valign="top" align="center">0.510</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.567</td>
</tr>
<tr>
<td valign="top" align="left">False</td>
<td valign="top" align="left">MM</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0.407</td>
<td valign="top" align="center">0.229</td>
<td valign="top" align="center">0.914</td>
<td valign="top" align="center">0.963</td>
<td valign="top" align="center">0.271</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.666</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0.400</td>
<td valign="top" align="center">0.206</td>
<td valign="top" align="center">0.886</td>
<td valign="top" align="center">0.918</td>
<td valign="top" align="center">0.656</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.565</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">4</td>
<td valign="top" align="center">0.454</td>
<td valign="top" align="center">0.212</td>
<td valign="top" align="center">0.902</td>
<td valign="top" align="center">0.912</td>
<td valign="top" align="center">0.829</td>
<td valign="top" align="center">0.002</td>
<td valign="top" align="center">0.556</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">SM</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0.148</td>
<td valign="top" align="center">0.094</td>
<td valign="top" align="center">0.590</td>
<td valign="top" align="center">0.945</td>
<td valign="top" align="center">0.430</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.759</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0.147</td>
<td valign="top" align="center">0.083</td>
<td valign="top" align="center">0.909</td>
<td valign="top" align="center">0.955</td>
<td valign="top" align="center">0.354</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.700</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">4</td>
<td valign="top" align="center">0.143</td>
<td valign="top" align="center">0.075</td>
<td valign="top" align="center">0.885</td>
<td valign="top" align="center">0.933</td>
<td valign="top" align="center">0.296</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.669</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">None + SM</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0.307</td>
<td valign="top" align="center">0.188</td>
<td valign="top" align="center">0.795</td>
<td valign="top" align="center">0.982</td>
<td valign="top" align="center">0.306</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.689</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0.283</td>
<td valign="top" align="center">0.152</td>
<td valign="top" align="center">0.914</td>
<td valign="top" align="center">0.955</td>
<td valign="top" align="center">0.394</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.626</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="center">4</td>
<td valign="top" align="center">0.277</td>
<td valign="top" align="center">0.144</td>
<td valign="top" align="center">0.900</td>
<td valign="top" align="center">0.939</td>
<td valign="top" align="center">0.384</td>
<td valign="top" align="center">0.000</td>
<td valign="top" align="center">0.561</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p>Results for the conditions with 12 variables. The linear model is true when all the variables have the same slope, error variance and thresholds (conditions of no skewness and strong positive skewness). PA is parallel analysis, CPA is Categorical parallel analysis, FA is normal factor analysis, FAC is categorical factor analysis from the tetrachoric correlation matrix, WLSMV is FA estimated using a robust diagonally weighted least square estimation method, d<italic>T<sub>M</sub></italic> is the Satorra-Bentler mean corrected chi-square difference, and GRM is the Samejima&#x2019;s graded response model.</p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="S3.SS2.SSS2">
<title>ANOVA model and effect size for the logit of <italic>p</italic>-value</title>
<p>An <italic>ANOVA</italic> model was fitted to the logit of the chi-square <italic>p</italic>-value with the purpose of analyzing the interaction between the conditions of the simulation and evaluating the importance of the different independent variables manipulated in the study. The magnitude of the effects was evaluated using the partial-&#x03B7;<sup>2</sup> as a measure of effect size. The statistical significance is unimportant here as it depends on the number of simulated samples and is easily manipulated.</p>
<p><xref ref-type="supplementary-material" rid="DS1">Supplementary Tables 3</xref>&#x2013;<xref ref-type="supplementary-material" rid="DS1">7</xref> summarize the <italic>ANOVA</italic> results. The tables are computed separately for each model and for the conditions with 6 and 12-factor indicators. Regarding the FA model (see <xref ref-type="supplementary-material" rid="DS1">Supplementary Table 3</xref>), the only interaction that appears to have some importance is &#x03BB; &#x00D7; &#x03B3;, especially in the condition with a smaller number of variables. The most relevant main effects are those of factor loading and skewness. Plots of the <italic>p</italic>-value for the FA model appear in <xref ref-type="supplementary-material" rid="DS1">Supplementary Figure 1</xref>. The <italic>p</italic>-value is closer to zero for the model with 12 variables, resulting in a decreased EPS and smaller interactions because the lines in the plots are mostly flat. The effect of the increase of the factor loadings consists of pulling the <italic>p</italic>-value toward zero as skewness increases.</p>
<p>The same analysis in relation to FAC appears in <xref ref-type="supplementary-material" rid="DS1">Supplementary Table 4</xref> and <xref ref-type="supplementary-material" rid="DS1">Supplementary Figure 2</xref>. In this case, the number of thresholds has a more important role regarding the main effects and the interactions as compared with the FA model, although the effect sizes are smaller because the <italic>p</italic>-value is generally close to zero for all conditions.</p>
<p>The results for WLSMV and d<italic>T</italic><sub><italic>M</italic></sub> are summarized in <xref ref-type="supplementary-material" rid="DS1">Supplementary Tables 5</xref>, <xref ref-type="supplementary-material" rid="DS1">6</xref>. No relevant effects appear for the short number of variables, which is an excellent result because it means that the performance of these methods is good regardless of the simulated conditions. Some relevant effect sizes appear in conditions with a larger number of variables, but the methods remain reliable also in these conditions.</p>
<p>The results for the GRM appear in <xref ref-type="supplementary-material" rid="DS1">Supplementary Table 7</xref> and <xref ref-type="supplementary-material" rid="DS1">Supplementary Figure 3</xref>. Only the skewness and its interaction with factor loading have a relevant effect size, and only in the condition of the short number of variables. <xref ref-type="supplementary-material" rid="DS1">Supplementary Figure 3</xref> shows that the condition of strong and mixed skewness pulls the <italic>p</italic>-value toward 1 but only in the condition of high factor loadings. In any case, this result does not alter the EPS because the <italic>p</italic>-values remain in the acceptance region.</p>
</sec>
</sec>
<sec id="S3.SS3">
<title>Recovery of the parameters</title>
<p>The success of a method cannot be based solely on the estimation of the number of factors. A method that correctly estimates the latent dimensionality whilst producing wildly incorrect parameter estimates is of limited usefulness. Parameter estimates under the different models were analyzed to assess their relative merit. Recovery was evaluated by the root mean squared error between the true and estimated parameters (RMSE).</p>
<p>The estimated parameters for FA have two specific sources of error that are not present in the other methods: Attenuation and structural error. As explained previously, attenuation occurs when a FA model is able to reproduce the true variance-covariance parameters (which conform to a FAC model) but the FA parameters have smaller values than the true FAC model. Attenuation occurs in conditions with no skewness and strong positive skewness, and its magnitude depends on the coefficients shown in <xref ref-type="supplementary-material" rid="DS1">Supplementary Table 1</xref>. In the conditions where the FA cannot reproduce the population variance-covariance matrix of the FAC model (those that involve mixed skewness), the FA estimates have the additional bias of structural error. These are population biases that will not vanish no matter what the sampling size is. The simulations are informative about the magnitude of the three sources of error combined (sampling error, attenuation, and structural error), and the increase of error that is specific to FA in comparison to the other methods.</p>
<p>The <italic>mirt</italic> function estimates the GRM using the IRT parameterization, whereas the true parameter values for the simulation are in the FA parameterization. Previously to computing the RMSE, the IRT parameters were converted to the FA parameterization using the equations (A15) and (A16) of <xref ref-type="bibr" rid="B44">Paek et al. (2018)</xref>.</p>
<p>The recovery of lambda in relation to the manipulated conditions appears in <xref ref-type="supplementary-material" rid="DS1">Supplementary Table 8</xref> and <xref ref-type="supplementary-material" rid="DS1">Supplementary Figures 4</xref>&#x2013;<xref ref-type="supplementary-material" rid="DS1">6</xref>. WLSMV and FAC performed similarly and obtained the higher RMSE in the conditions of a large number of thresholds, small loadings, and strong positive skewness. The results indicate that GRM outperforms these methods and has a smaller RMSE that remains stable across conditions. The FA is affected by attenuation and inconsistency biases and is generally poorer than the other methods. The results for &#x03C8; follow a similar pattern and are summarized in <xref ref-type="supplementary-material" rid="DS1">Supplementary Table 9</xref>.</p>
<p>Recovery of &#x03C4; is summarized in <xref ref-type="supplementary-material" rid="DS1">Supplementary Table 10</xref> and <xref ref-type="supplementary-material" rid="DS1">Supplementary Figures 7</xref>&#x2013;<xref ref-type="supplementary-material" rid="DS1">9</xref>. The results are similar to those found for the recovery of &#x03BB;. There are small differences between WLSMV and FAC, and the two methods render an increased RMSE in the conditions of a large number of thresholds and strong positive skewness. The GRM obtains a smaller RMSE than the other methods and is stable across conditions.</p>
</sec>
<sec id="S3.SS4">
<title>Conclusions of the simulation study</title>
<p>The results of our simulation have shown that, in the present context, the chi-square statistic for FA leads to overfactoring. However, there are remarkable differences between the models and the study conditions. The magnitude of the factor loadings and the skewness of the factor indicators have the largest effect size in relation to the logit of the <italic>p</italic>-value, but this effect varies between the models. The empirical proportion of rejection for the one-factor FA model is generally high and sharply increases in the conditions where this model is wrong in the population. The PA shows better performance than the chi-square, but the overfactoring is still high in the presence of skewness. Notice that the effect of skewness might depend on the choice of thresholds values. We have used the algorithm by <xref ref-type="bibr" rid="B43">Olsson (1979)</xref> to generate the threshold values associated with the gamma coefficient. However, other algorithms may render different thresholds associated with the same gamma and might have a different effect on the performance of the chi-square. One topic for further investigation would be the development of other algorithms for generating thresholds and the study of the differential impact of the choice of thresholds while keeping gamma fixed to a constant value. The pattern of the factor loadings also has a noticeable effect, and smaller rejection rates are found for the tau-equivalent than for the congeneric model.</p>
<p>Interestingly, dimensionality recovery for the FAC model is tightly related to the estimation method, as recovery is poor for ML and reliable for WLSMV and d<italic>T</italic><sub><italic>M</italic></sub>.</p>
<p>The effect sizes of all independent variables are negligible for the GRM, which indicates that the model is robust to the manipulated conditions. The Type-I error is about 30%, regardless of skewness, which is still far from the nominal levels, but it improves the results of the other models. The number of factor indicators also has an effect, and more rejections of a correct model are found with a larger number of factor indicators. This is an expected result because bigger samples are necessary for the likelihood-ration statistic to approximate its chi-square reference distribution when there are a large number of factor indicators. On the counter side, the small effect size associated with <italic>N</italic> means that the improvement in EPS decreases as <italic>N</italic> increases. From a theoretical point of view, the GRM is not exactly true in this simulation because data were generated assuming a normal distribution for the error (normal link) whereas the implementation of the GRM in the <italic>mirt</italic> package assumes a logit link. The logit link was developed as a convenient approximation to the normal link and the difference between them is small. However, this difference does not vanish as sample size increases, which converts the logit link to an inconsistent procedure. One problem for future research is to investigate if the results of the GRM improve when estimated with a normal link.</p>
<p>Finally, the WLSMV and the Satorra-Bentler d<italic>T</italic><sub><italic>M</italic></sub> are the best alternatives to estimate dimensionality, with rejection rates close to the nominal 5% level that remains relatively constant across all the simulated conditions. The recovery of parameters for GRM was better than for WLSML, and less affected by those conditions in which the RMSE increases for WLSMV. Thus, a good combination of methods could be to estimate the dimensionality using a Satorra-Bentler fit statistic, as in WLSMV, and estimate parameters using a marginal ML algorithm as in the GRM once the dimensionality of the model has been fixed.</p>
</sec>
</sec>
<sec id="S4">
<title>Real data example</title>
<sec id="S4.SS1">
<title>Data and procedure</title>
<p>A real data sample was analyzed to illustrate the different approaches in an applied context. The data consists of responses to the scale of attitudes toward censorship, originally published by <xref ref-type="bibr" rid="B53">Rosander and Thurstone (1931)</xref>, and reprinted by <xref ref-type="bibr" rid="B61">Shaw and Wright (1967)</xref>. The scale is composed of 20 Likert-type items with six response alternatives. The responses are labeled: <italic>Strongly Disagree</italic>, <italic>Disagree</italic>, <italic>Slightly Disagree</italic>, <italic>Slightly Agree</italic>, <italic>Agree</italic>, and <italic>Strongly Agree</italic>. The sample size is 223 observations. The items and the data are publicly available in the Georgia Tech Psychometric Research and Development Lab<sup><xref ref-type="fn" rid="footnote2">2</xref></sup> and were retrieved on 03/17/2022. We have applied the same methods used in the simulation study (FA, FAC, WLSMV, d<italic>T</italic><sub><italic>M</italic></sub>, and GRM) and some new methods for completeness. The new methods are:</p>
<list list-type="simple">
<list-item>
<label>&#x2022;</label>
<p>Descriptive statistics about the type-3 coefficient of skewness and kurtosis are run using the <italic>e1071</italic> package (<xref ref-type="bibr" rid="B39">Meyer et al., 2021</xref>). Multivariate coefficients of skewness and kurtosis were computed using the <italic>semTools</italic> package (<xref ref-type="bibr" rid="B29">Jorgensen et al., 2022</xref>).</p>
</list-item>
<list-item>
<label>&#x2022;</label>
<p>Mean corrected chi-square, <italic>T</italic><sub><italic>M</italic></sub> (<xref ref-type="bibr" rid="B57">Satorra and Bentler, 2001</xref>) and mean and variance corrected chi-square, <italic>T</italic><sub><italic>MV</italic></sub> (<xref ref-type="bibr" rid="B1">Asparouhov and Muth&#x00E9;n, 2010</xref>) computed with the maximum-likelihood estimator for the FA model. The <italic>T</italic><sub><italic>M</italic></sub> and the <italic>T</italic><sub><italic>MV</italic></sub> are modified chi-square statistics that evaluate the fit for a single model and correct for the lack of normality in the data. The <italic>T</italic><sub><italic>M</italic></sub> and the <italic>T</italic><sub><italic>MV</italic></sub> can be obtained in <italic>lavaan</italic> using the function <italic>cfa</italic> with the arguments <italic>estimator = &#x201C;MLM&#x201D;</italic> and <italic>estimator = &#x201C;MLMV,&#x201D;</italic> respectively.</p>
</list-item>
<list-item>
<label>&#x2022;</label>
<p>The <italic>T</italic><sub><italic>M</italic></sub> and <italic>T</italic><sub><italic>MV</italic></sub> statistics are also computed with WLSMV estimator and FAC model using the arguments <italic>estimator = &#x201C;WLSMV&#x201D;</italic> and <italic>ordered = TRUE</italic> to the <italic>cfa</italic> function in <italic>lavaan</italic>. The <italic>T</italic><sub><italic>M</italic></sub> and the <italic>T</italic><sub><italic>MV</italic></sub> are not to be confused with the correction of the chi-square difference, d<italic>T</italic><sub><italic>M</italic></sub>, used in the simulation study.</p>
</list-item>
<list-item>
<label>&#x2022;</label>
<p>The analysis of the normality of the latent responses (variable <italic><bold>y&#x002A;</bold></italic> in Equation 3) was evaluated using a bootstrap test implemented in the <italic>bootTest</italic> function in the R package <italic>discnorm</italic> (<xref ref-type="bibr" rid="B16">Foldnes and Gr&#x00F8;nneberg, 2020</xref>).</p>
</list-item>
<list-item>
<label>&#x2022;</label>
<p>The analysis of normality of the latent factor (that is, testing that <italic>f</italic>(&#x03BE;)is a normal distribution) was run using the empirical histogram method (<xref ref-type="bibr" rid="B67">Woods, 2007</xref>) and the Davidian Curve method (<xref ref-type="bibr" rid="B68">Woods and Lin, 2009</xref>; <xref ref-type="bibr" rid="B63">Smits et al., 2020</xref>). These methods are implemented in the <italic>mirt</italic> function using the arguments <italic>dentype = &#x201C;empiricalhist</italic>,&#x201D; and <italic>dentype</italic> = &#x201C;<italic>Davidian</italic>-#&#x201D; argument. We have estimated the one-factor FAC model with a normal distribution for <italic>f</italic>(&#x03BE;), a Davidian-curve estimate for <italic>f</italic>(&#x03BE;) with two smooth parameters, and a Davidian curve with three smooth parameters. The models were compared using the Hannan&#x2013;Quinn criterion (HQ; <xref ref-type="bibr" rid="B66">Woods, 2006</xref>) implemented in the <italic>anova</italic> method of the <italic>mirt</italic> package.</p>
</list-item>
<list-item>
<label>&#x2022;</label>
<p>Multidimensional Nominal Categories Model (MNCN; <xref ref-type="bibr" rid="B48">Revuelta, 2014</xref>; <xref ref-type="bibr" rid="B50">Revuelta et al., 2020</xref>, <xref ref-type="bibr" rid="B49">2021</xref>). The MNCM is a generalization of the GRM that removes the assumption that the responses follow an ordinal scale. The comparison between the MCNM and GRM provides a means to test the assumption of equally spaced categories implicit in the GRM.</p>
</list-item>
</list>
<p>The R codes for all the analyses appear in Appendix G.</p>
<p>In the simulation study, we have assumed that the population model is an ordinal factor model and we have manipulated the parameters of the model to investigate the performance of several estimation methods, but the correctness of the model has not been put into question. In a real data analysis, the validity of an ordinal factor model cannot be taken for granted as it might not be correct in the population, which introduces another source of bias in the analysis. Some of the above analyses (normality of latent responses, normality of latent factors, and the MNCM) evaluate part of the assumptions of the ordinal factor model to determine if they may be maintained for the present sample.</p>
</sec>
<sec id="S4.SS2">
<title>Results of the empirical study</title>
<p><xref ref-type="table" rid="T9">Table 9</xref> contains the descriptive statistics for the items, including the response frequency of the categories, the mean, the standard deviation, and the kurtosis. The variables have a pattern of mild and mixed skewness. Most of them have a mean value close to the midpoint of the scale, and a skewness value close to zero. However, there are also several variables negatively skewed because of the mean being close to the higher point of the response scale, and one of the variables has a low mean and positive skew. The multivariate skewness and kurtosis coefficient have a <italic>p</italic>-value smaller than 0.01 and the hypothesis of normality is rejected.</p>
<table-wrap position="float" id="T9">
<label>TABLE 9</label>
<caption><p>Descriptive statistics for the scale of attitudes about censorship.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Item</td>
<td valign="top" align="center">Freq 1</td>
<td valign="top" align="center">Freq 2</td>
<td valign="top" align="center">Freq 3</td>
<td valign="top" align="center">Freq 4</td>
<td valign="top" align="center">Freq 5</td>
<td valign="top" align="center">Freq 6</td>
<td valign="top" align="center">Mean</td>
<td valign="top" align="center">Std</td>
<td valign="top" align="center">skewness</td>
<td valign="top" align="center">Kurtosis</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="center">20</td>
<td valign="top" align="center">35</td>
<td valign="top" align="center">56</td>
<td valign="top" align="center">45</td>
<td valign="top" align="center">33</td>
<td valign="top" align="center">34</td>
<td valign="top" align="center">3.62</td>
<td valign="top" align="center">1.52</td>
<td valign="top" align="center">0.04</td>
<td valign="top" align="center">&#x2013;0.98</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">12</td>
<td valign="top" align="center">40</td>
<td valign="top" align="center">52</td>
<td valign="top" align="center">113</td>
<td valign="top" align="center">5.12</td>
<td valign="top" align="center">1.12</td>
<td valign="top" align="center">&#x2013;1.38</td>
<td valign="top" align="center">1.85</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="center">31</td>
<td valign="top" align="center">36</td>
<td valign="top" align="center">32</td>
<td valign="top" align="center">61</td>
<td valign="top" align="center">35</td>
<td valign="top" align="center">28</td>
<td valign="top" align="center">3.52</td>
<td valign="top" align="center">1.58</td>
<td valign="top" align="center">&#x2013;0.10</td>
<td valign="top" align="center">&#x2013;1.06</td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">24</td>
<td valign="top" align="center">39</td>
<td valign="top" align="center">73</td>
<td valign="top" align="center">43</td>
<td valign="top" align="center">27</td>
<td valign="top" align="center">3.82</td>
<td valign="top" align="center">1.39</td>
<td valign="top" align="center">&#x2013;0.32</td>
<td valign="top" align="center">&#x2013;0.57</td>
</tr>
<tr>
<td valign="top" align="left">5</td>
<td valign="top" align="center">74</td>
<td valign="top" align="center">61</td>
<td valign="top" align="center">35</td>
<td valign="top" align="center">31</td>
<td valign="top" align="center">13</td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">2.44</td>
<td valign="top" align="center">1.43</td>
<td valign="top" align="center">0.81</td>
<td valign="top" align="center">&#x2013;0.27</td>
</tr>
<tr>
<td valign="top" align="left">6</td>
<td valign="top" align="center">38</td>
<td valign="top" align="center">52</td>
<td valign="top" align="center">57</td>
<td valign="top" align="center">32</td>
<td valign="top" align="center">26</td>
<td valign="top" align="center">18</td>
<td valign="top" align="center">3.04</td>
<td valign="top" align="center">1.50</td>
<td valign="top" align="center">0.40</td>
<td valign="top" align="center">&#x2013;0.80</td>
</tr>
<tr>
<td valign="top" align="left">7</td>
<td valign="top" align="center">40</td>
<td valign="top" align="center">40</td>
<td valign="top" align="center">53</td>
<td valign="top" align="center">47</td>
<td valign="top" align="center">28</td>
<td valign="top" align="center">15</td>
<td valign="top" align="center">3.13</td>
<td valign="top" align="center">1.49</td>
<td valign="top" align="center">0.18</td>
<td valign="top" align="center">&#x2013;0.91</td>
</tr>
<tr>
<td valign="top" align="left">8</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">31</td>
<td valign="top" align="center">58</td>
<td valign="top" align="center">56</td>
<td valign="top" align="center">63</td>
<td valign="top" align="center">4.51</td>
<td valign="top" align="center">1.30</td>
<td valign="top" align="center">&#x2013;0.67</td>
<td valign="top" align="center">&#x2013;0.09</td>
</tr>
<tr>
<td valign="top" align="left">9</td>
<td valign="top" align="center">92</td>
<td valign="top" align="center">43</td>
<td valign="top" align="center">40</td>
<td valign="top" align="center">25</td>
<td valign="top" align="center">15</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">2.34</td>
<td valign="top" align="center">1.45</td>
<td valign="top" align="center">0.85</td>
<td valign="top" align="center">&#x2013;0.31</td>
</tr>
<tr>
<td valign="top" align="left">10</td>
<td valign="top" align="center">93</td>
<td valign="top" align="center">63</td>
<td valign="top" align="center">27</td>
<td valign="top" align="center">21</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">2.17</td>
<td valign="top" align="center">1.35</td>
<td valign="top" align="center">1.08</td>
<td valign="top" align="center">0.25</td>
</tr>
<tr>
<td valign="top" align="left">11</td>
<td valign="top" align="center">32</td>
<td valign="top" align="center">42</td>
<td valign="top" align="center">58</td>
<td valign="top" align="center">34</td>
<td valign="top" align="center">36</td>
<td valign="top" align="center">21</td>
<td valign="top" align="center">3.28</td>
<td valign="top" align="center">1.53</td>
<td valign="top" align="center">0.18</td>
<td valign="top" align="center">&#x2013;0.99</td>
</tr>
<tr>
<td valign="top" align="left">12</td>
<td valign="top" align="center">20</td>
<td valign="top" align="center">32</td>
<td valign="top" align="center">39</td>
<td valign="top" align="center">39</td>
<td valign="top" align="center">47</td>
<td valign="top" align="center">46</td>
<td valign="top" align="center">3.89</td>
<td valign="top" align="center">1.61</td>
<td valign="top" align="center">&#x2013;0.25</td>
<td valign="top" align="center">&#x2013;1.12</td>
</tr>
<tr>
<td valign="top" align="left">13</td>
<td valign="top" align="center">25</td>
<td valign="top" align="center">29</td>
<td valign="top" align="center">37</td>
<td valign="top" align="center">62</td>
<td valign="top" align="center">44</td>
<td valign="top" align="center">26</td>
<td valign="top" align="center">3.67</td>
<td valign="top" align="center">1.50</td>
<td valign="top" align="center">&#x2013;0.24</td>
<td valign="top" align="center">&#x2013;0.88</td>
</tr>
<tr>
<td valign="top" align="left">14</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">75</td>
<td valign="top" align="center">60</td>
<td valign="top" align="center">44</td>
<td valign="top" align="center">4.30</td>
<td valign="top" align="center">1.33</td>
<td valign="top" align="center">&#x2013;0.71</td>
<td valign="top" align="center">0.03</td>
</tr>
<tr>
<td valign="top" align="left">15</td>
<td valign="top" align="center">25</td>
<td valign="top" align="center">26</td>
<td valign="top" align="center">29</td>
<td valign="top" align="center">65</td>
<td valign="top" align="center">48</td>
<td valign="top" align="center">30</td>
<td valign="top" align="center">3.78</td>
<td valign="top" align="center">1.52</td>
<td valign="top" align="center">&#x2013;0.37</td>
<td valign="top" align="center">&#x2013;0.83</td>
</tr>
<tr>
<td valign="top" align="left">16</td>
<td valign="top" align="center">12</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">31</td>
<td valign="top" align="center">81</td>
<td valign="top" align="center">51</td>
<td valign="top" align="center">31</td>
<td valign="top" align="center">4.05</td>
<td valign="top" align="center">1.31</td>
<td valign="top" align="center">&#x2013;0.51</td>
<td valign="top" align="center">&#x2013;0.16</td>
</tr>
<tr>
<td valign="top" align="left">17</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">42</td>
<td valign="top" align="center">70</td>
<td valign="top" align="center">87</td>
<td valign="top" align="center">4.93</td>
<td valign="top" align="center">1.15</td>
<td valign="top" align="center">&#x2013;1.13</td>
<td valign="top" align="center">0.99</td>
</tr>
<tr>
<td valign="top" align="left">18</td>
<td valign="top" align="center">40</td>
<td valign="top" align="center">40</td>
<td valign="top" align="center">27</td>
<td valign="top" align="center">69</td>
<td valign="top" align="center">32</td>
<td valign="top" align="center">15</td>
<td valign="top" align="center">3.26</td>
<td valign="top" align="center">1.53</td>
<td valign="top" align="center">&#x2013;0.05</td>
<td valign="top" align="center">&#x2013;1.09</td>
</tr>
<tr>
<td valign="top" align="left">19</td>
<td valign="top" align="center">24</td>
<td valign="top" align="center">15</td>
<td valign="top" align="center">21</td>
<td valign="top" align="center">96</td>
<td valign="top" align="center">44</td>
<td valign="top" align="center">23</td>
<td valign="top" align="center">3.85</td>
<td valign="top" align="center">1.39</td>
<td valign="top" align="center">&#x2013;0.62</td>
<td valign="top" align="center">&#x2013;0.18</td>
</tr>
<tr>
<td valign="top" align="left">20</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">16</td>
<td valign="top" align="center">52</td>
<td valign="top" align="center">65</td>
<td valign="top" align="center">81</td>
<td valign="top" align="center">4.85</td>
<td valign="top" align="center">1.16</td>
<td valign="top" align="center">&#x2013;1.00</td>
<td valign="top" align="center">0.81</td>
</tr>
</tbody>
</table></table-wrap>
<p>The hypothesis of normality of the latent responses is discarded (<italic>p</italic>-value &#x003C; 0.01). This result threats the validity of the methods based on polychoric correlations since these correlations are highly sensitive to the validity of the normality assumption for the latent responses (<xref ref-type="bibr" rid="B16">Foldnes and Gr&#x00F8;nneberg, 2020</xref>, <xref ref-type="bibr" rid="B17">2022</xref>). However, as we do not have information about what the distribution of the latent responses might be, we are not in a position of estimating polychoric correlations under a different distribution.</p>
<p>Regarding the analysis of normality for the latent factor, the HQ for the normal, Davidian-2, Davidian-3, and empirical histogram models are 13782.5, 13789.3, 13788.2, and 14175.1, respectively. Similar to AIC and BIC, the HQ supports the model that minimizes it. Since the normal distribution minimizes the HQ, the Gaussian distribution can be retained.</p>
<p>The goodness-of-fit statistics for the FA models with one to four factors appear in <xref ref-type="table" rid="T10">Table 10</xref>, including the chi-square against a saturated model and the modified chi-squares: <italic>T</italic><sub><italic>M</italic></sub> and <italic>T</italic><sub><italic>MV</italic></sub>. The chi-square estimates three factors whereas the robust statistics estimates two factors. Since the data show non-negligible skewness and kurtosis, these results point in the direction of relying in robust statistics and retaining two factors. Parallel analysis gives a similar result and the number of estimated factors is two for PA and three for CPA.</p>
<table-wrap position="float" id="T10">
<label>TABLE 10</label>
<caption><p>Chi-square and modified chi-square for the FA model.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left"></td>
<td valign="top" align="center" colspan="3">Chi-square</td>
<td valign="top" align="center" colspan="3">Mean corrected</td>
<td valign="top" align="center" colspan="4">Mean and variance corrected</td>
</tr>
<tr>
<td valign="top" align="center"></td>
<td valign="top" align="center" colspan="3"><hr/></td>
<td valign="top" align="center" colspan="3"><hr/></td>
<td valign="top" align="center" colspan="4"><hr/></td>
</tr>
<tr>
<td valign="top" align="left">Factors</td>
<td valign="top" align="center"><italic>G</italic><sup>2</sup></td>
<td valign="top" align="center"><italic>df</italic></td>
<td valign="top" align="center"><italic>p</italic></td>
<td valign="top" align="center"><italic>T</italic><sub><italic>M</italic></sub></td>
<td valign="top" align="center"><italic>c</italic></td>
<td valign="top" align="center"><italic>p</italic></td>
<td valign="top" align="center"><italic>T</italic><sub><italic>MV</italic></sub></td>
<td valign="top" align="center"><italic>a</italic></td>
<td valign="top" align="center"><italic>b</italic></td>
<td valign="top" align="center"><italic>p</italic></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="center">466.2</td>
<td valign="top" align="center">170</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">405.5</td>
<td valign="top" align="center">1.15</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">301.4</td>
<td valign="top" align="center">2.06</td>
<td valign="top" align="center">75.18</td>
<td valign="top" align="center">0.00</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="center">269.3</td>
<td valign="top" align="center">151</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center"><bold>241.0</bold></td>
<td valign="top" align="center"><bold>1.12</bold></td>
<td valign="top" align="center"><bold>0.00</bold></td>
<td valign="top" align="center"><bold>206.8</bold></td>
<td valign="top" align="center"><bold>1.80</bold></td>
<td valign="top" align="center"><bold>57.42</bold></td>
<td valign="top" align="center"><bold>0.00</bold></td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="center"><bold>194.76</bold></td>
<td valign="top" align="center"><bold>133</bold></td>
<td valign="top" align="center"><bold>0.00</bold></td>
<td valign="top" align="center">174.5</td>
<td valign="top" align="center">1.12</td>
<td valign="top" align="center">0.01</td>
<td valign="top" align="center">160.5</td>
<td valign="top" align="center">1.69</td>
<td valign="top" align="center">45.03</td>
<td valign="top" align="center">0.05</td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="center">151.9</td>
<td valign="top" align="center">116</td>
<td valign="top" align="center">0.01</td>
<td valign="top" align="center">133.5</td>
<td valign="top" align="center">1.14</td>
<td valign="top" align="center">0.13</td>
<td valign="top" align="center">127.8</td>
<td valign="top" align="center">1.69</td>
<td valign="top" align="center">38.08</td>
<td valign="top" align="center">0.21</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p><italic>G</italic><sup>2</sup> is the likelihood ratio chi-square statistic against a saturated model under the assumption of multivariate normality, <italic>df</italic> is the degrees of freedom, <italic>p</italic> is the <italic>p</italic>-value. <italic>T<sub>M</sub></italic> is the mean corrected <italic>G</italic><sup>2</sup> by Satorra and Bentler. <italic>T<sub>MV</sub></italic> is the mean and variance corrected <italic>G</italic><sup>2</sup> by Asparouhov and Muth&#x00E9;n. Boldface values indicate the simplest non-rejected model using a nominal type-I error rate of 0.01.</p></fn>
</table-wrap-foot>
</table-wrap>
<p><xref ref-type="table" rid="T11">Table 11</xref> contains the goodness-of-fit statistics for FAC with the WLSMV estimator, the chi-square statistics corrected for non-normality (the <italic>T</italic><sub><italic>M</italic></sub>, the <italic>T</italic><sub><italic>MV</italic></sub>), and the Satorra-Bentler correction of the chi-square difference (d<italic>T</italic><sub><italic>M</italic></sub>). The statistics differ in their conclusions. The chi-square and the chi-square difference give support to a three-factor model. On the other hand, the modified chi-square statistics (<italic>T</italic><sub><italic>M</italic></sub>, <italic>T</italic><sub><italic>MV</italic></sub>, and d<italic>T</italic><sub><italic>M</italic></sub>) suggest models with higher dimensionality.</p>
<table-wrap position="float" id="T11">
<label>TABLE 11</label>
<caption><p>Chi-square and modified chi-square for the FAC model using WLSMV estimation.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Factors</td>
<td valign="top" align="center"><italic>G</italic><sup>2</sup></td>
<td valign="top" align="center"><italic>df</italic></td>
<td valign="top" align="center"><italic>p</italic></td>
<td valign="top" align="center">dif-<italic>G</italic><sup>2</sup></td>
<td valign="top" align="center">dif-<italic>df</italic></td>
<td valign="top" align="center"><italic>p</italic></td>
<td valign="top" align="center"><italic>T</italic><sub><italic>M</italic></sub></td>
<td valign="top" align="center"><italic>p</italic></td>
<td valign="top" align="center"><italic>T</italic><sub><italic>MV</italic></sub></td>
<td valign="top" align="center"><italic>p</italic></td>
<td valign="top" align="center">d<italic>T</italic><sub><italic>M</italic></sub></td>
<td valign="top" align="center"><italic>p</italic></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="center">642.6</td>
<td valign="top" align="center">170</td>
<td valign="top" align="center">0.00</td>
<td/>
<td/>
<td/>
<td valign="top" align="center">1057.6</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">681.8</td>
<td valign="top" align="center">0.00</td>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="center">211.3</td>
<td valign="top" align="center">151</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">431.4</td>
<td valign="top" align="center">19</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">434.6</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">331.9</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">128.0</td>
<td valign="top" align="center">0.00</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="center"><bold>125.0</bold></td>
<td valign="top" align="center"><bold>133</bold></td>
<td valign="top" align="center"><bold>0.68</bold></td>
<td valign="top" align="center"><bold>86.3</bold></td>
<td valign="top" align="center"><bold>18</bold></td>
<td valign="top" align="center"><bold>0.00</bold></td>
<td valign="top" align="center">287.2</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">237.1</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">57.4</td>
<td valign="top" align="center">0.00</td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="center">89.6</td>
<td valign="top" align="center">116</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">35.4</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">0.01</td>
<td valign="top" align="center">223.5</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">190.4</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">50.51</td>
<td valign="top" align="center">0.00</td>
</tr>
<tr>
<td valign="top" align="left">5</td>
<td valign="top" align="center">61.6</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">28.0</td>
<td valign="top" align="center">16</td>
<td valign="top" align="center">0.03</td>
<td valign="top" align="center">166.3</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center"><bold>148.4</bold></td>
<td valign="top" align="center"><bold>0.00</bold></td>
<td valign="top" align="center">260.3</td>
<td valign="top" align="center">0.00</td>
</tr>
<tr>
<td valign="top" align="left">6</td>
<td valign="top" align="center">44.3</td>
<td valign="top" align="center">85</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">17.3</td>
<td valign="top" align="center">15</td>
<td valign="top" align="center">0.30</td>
<td valign="top" align="center"><bold>127.0</bold></td>
<td valign="top" align="center"><bold>0.00</bold></td>
<td valign="top" align="center">116.3</td>
<td valign="top" align="center">0.01</td>
<td valign="top" align="center">65.5</td>
<td valign="top" align="center">0.00</td>
</tr>
<tr>
<td valign="top" align="left">7</td>
<td valign="top" align="center">30.7</td>
<td valign="top" align="center">71</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">13.6</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">0.48</td>
<td valign="top" align="center">93.3</td>
<td valign="top" align="center">0.04</td>
<td valign="top" align="center">88.4</td>
<td valign="top" align="center">0.08</td>
<td valign="top" align="center">54.7</td>
<td valign="top" align="center">0.00</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p><italic>G</italic><sup>2</sup> is the likelihood ratio statistic against a saturated model, <italic>df</italic> is the degrees of freedom and p is the <italic>p</italic>-value. dif-<italic>G</italic><sup>2</sup> is the difference between any two consecutive values of <italic>G</italic><sup>2</sup>. <italic>T<sub>M</sub></italic> is the mean corrected chi-square by Satorra and Bentler. <italic>T<sub>MV</sub></italic> is the mean and variance corrected chi-square by Asparouhov and Muth&#x00E9;n. <italic>T<sub>d</sub></italic> is the mean corrected Satorra-Bentler statistic applied to the difference between two consecutive values of <italic>G</italic><sup>2</sup>. Boldface values indicate the simplest non-rejected model using a nominal type-I error rate of 0.01.</p></fn>
</table-wrap-foot>
</table-wrap>
<p>The FAC model under ML results in five or six estimated factors, depending on the statistic used to test model fit (chi-square of absolute model testing or chi-square between pairs of nested models). The results appear in <xref ref-type="table" rid="T12">Table 12</xref>. However, the FAC method based on ML apparently is questionable in light of the simulation results.</p>
<table-wrap position="float" id="T12">
<label>TABLE 12</label>
<caption><p>Chi-square for the FAC model estimated by maximum likelihood.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Factors</td>
<td valign="top" align="center"><italic>G</italic><sup>2</sup></td>
<td valign="top" align="center"><italic>df</italic></td>
<td valign="top" align="center"><italic>p</italic></td>
<td valign="top" align="center">dif-<italic>G</italic><sup>2</sup></td>
<td valign="top" align="center"><italic>df-dif</italic></td>
<td valign="top" align="center"><italic>p-dif</italic></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="center">623.7</td>
<td valign="top" align="center">170</td>
<td valign="top" align="center">0.00</td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="center">270.9</td>
<td valign="top" align="center">151</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">253.7</td>
<td valign="top" align="center">19</td>
<td valign="top" align="center">0.00</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="center">370.0</td>
<td valign="top" align="center">133</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">99.2</td>
<td valign="top" align="center">18</td>
<td valign="top" align="center">0.00</td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="center">219.5</td>
<td valign="top" align="center">116</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">51.4</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">0.00</td>
</tr>
<tr>
<td valign="top" align="left">5</td>
<td valign="top" align="center"><bold>163.0</bold></td>
<td valign="top" align="center"><bold>100</bold></td>
<td valign="top" align="center"><bold>0.00</bold></td>
<td valign="top" align="center">56.5</td>
<td valign="top" align="center">16</td>
<td valign="top" align="center">0.00</td>
</tr>
<tr>
<td valign="top" align="left">6</td>
<td valign="top" align="center">121.9</td>
<td valign="top" align="center">85</td>
<td valign="top" align="center">0.01</td>
<td valign="top" align="center"><bold>41.1</bold></td>
<td valign="top" align="center"><bold>15</bold></td>
<td valign="top" align="center"><bold>0.00</bold></td>
</tr>
<tr>
<td valign="top" align="left">7</td>
<td valign="top" align="center">93.1</td>
<td valign="top" align="center">71</td>
<td valign="top" align="center">0.04</td>
<td valign="top" align="center">28.8</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">0.01</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p><italic>G</italic><sup>2</sup> is the likelihood ratio chi-square statistic against a saturated model, <italic>df</italic> is the degrees of freedom, <italic>p</italic> is the <italic>p</italic>-value, dif-<italic>G</italic><sup>2</sup> is the likelihood-ratio chi-square for pairs of consecutive models. Boldface values indicate the simplest non-rejected model using a nominal type-I error rate of 0.01.</p></fn>
</table-wrap-foot>
</table-wrap>
<p>The MNCM was estimated to evaluate the assumption that responses follow an ordinal scale. The difference between an ordinal and a nominal model is that there is only one slope for each item in the former model and one slope for each response category in the later model (except for one of the categories of the item, which has a slope of zero for identifiability). The results appear in <xref ref-type="table" rid="T13">Table 13</xref> and indicate that the MNCM does not significantly fit better than the GRM. Thus the ordinality assumption involved in the GRM can be maintained in light of this analysis.</p>
<table-wrap position="float" id="T13">
<label>TABLE 13</label>
<caption><p>Chi-square for the GRM and the MNCM models.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left"></td>
<td valign="top" align="center" colspan="5">GRM</td>
<td valign="top" align="center" colspan="5">MNCM</td>
<td valign="top" align="center" colspan="3">GRM vs. MNCM</td>
</tr>
<tr>
<td valign="top" align="center"></td>
<td valign="top" align="center" colspan="5"><hr/></td>
<td valign="top" align="center" colspan="5"><hr/></td>
<td valign="top" align="center" colspan="3"><hr/></td>
</tr>
<tr>
<td valign="top" align="left">Factors</td>
<td valign="top" align="center">pars.</td>
<td valign="top" align="center"><italic>LK</italic></td>
<td valign="top" align="center">dif-<italic>G</italic><sup>2</sup></td>
<td valign="top" align="center"><italic>df</italic></td>
<td valign="top" align="center"><italic>p</italic></td>
<td valign="top" align="center">pars.</td>
<td valign="top" align="center"><italic>LK</italic></td>
<td valign="top" align="center">dif-<italic>G</italic><sup>2</sup></td>
<td valign="top" align="center"><italic>df</italic></td>
<td valign="top" align="center"><italic>p</italic></td>
<td valign="top" align="center">dif-<italic>G</italic><sup>2</sup>-M</td>
<td valign="top" align="center"><italic>df</italic></td>
<td valign="top" align="center"><italic>p</italic></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="center">120</td>
<td valign="top" align="center">&#x2212;6688.7</td>
<td/>
<td/>
<td/>
<td valign="top" align="center">200</td>
<td valign="top" align="center">&#x2212;6658.1</td>
<td/>
<td/>
<td/>
<td valign="top" align="center">61.3</td>
<td valign="top" align="center">80</td>
<td valign="top" align="center">0.94</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="center">139</td>
<td valign="top" align="center">&#x2212;6573.7</td>
<td valign="top" align="center">230.1</td>
<td valign="top" align="center">19</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">219</td>
<td valign="top" align="center">&#x2212;6529.2</td>
<td valign="top" align="center">257.7</td>
<td valign="top" align="center">19</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">88.9</td>
<td valign="top" align="center">80</td>
<td valign="top" align="center">0.23</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="center"><bold>157</bold></td>
<td valign="top" align="center">&#x2212;<bold>6530.5</bold></td>
<td valign="top" align="center"><bold>86.3</bold></td>
<td valign="top" align="center"><bold>18</bold></td>
<td valign="top" align="center"><bold>0.00</bold></td>
<td valign="top" align="center">237</td>
<td valign="top" align="center">&#x2212;6500.8</td>
<td valign="top" align="center">56.7</td>
<td valign="top" align="center">18</td>
<td valign="top" align="center">0.00</td>
<td valign="top" align="center">59.3</td>
<td valign="top" align="center">80</td>
<td valign="top" align="center">0.96</td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="center">174</td>
<td valign="top" align="center">&#x2212;6514.1</td>
<td valign="top" align="center">32.9</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">0.01</td>
<td valign="top" align="center">254</td>
<td valign="top" align="center">&#x2212;<bold>6482.3</bold></td>
<td valign="top" align="center"><bold>37.1</bold></td>
<td valign="top" align="center"><bold>17</bold></td>
<td valign="top" align="center"><bold>0.00</bold></td>
<td valign="top" align="center">63.5</td>
<td valign="top" align="center">80</td>
<td valign="top" align="center">0.91</td>
</tr>
<tr>
<td valign="top" align="left">5</td>
<td valign="top" align="center">190</td>
<td valign="top" align="center">&#x2212;6508.4</td>
<td valign="top" align="center">11.2</td>
<td valign="top" align="center">16</td>
<td valign="top" align="center">0.79</td>
<td valign="top" align="center">270</td>
<td valign="top" align="center">&#x2212;6482.2</td>
<td valign="top" align="center">0.2</td>
<td valign="top" align="center">16</td>
<td valign="top" align="center">1.00</td>
<td valign="top" align="center">52.5</td>
<td valign="top" align="center">80</td>
<td valign="top" align="center">0.99</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p>pars. is the number of estimated parameters for each model, LK is the log likelihood, dif-<italic>G</italic><sup>2</sup> is the likelihood-ratio chi-square for pairs of consecutive models, df is the degrees of freedom, p is the p-value. dif-<italic>G</italic><sup>2</sup>-M is the likelihood ratio chi-square of the GRM against the MNCM with the same number of factors. Boldface values indicate the simplest non-rejected model using a nominal type-I error rate of 0.01.</p></fn>
</table-wrap-foot>
</table-wrap>
<p>This empirical example shows mixed results. The most parsimonious results are those of FA and GRM, whereas the modified chi-squares lead to models of higher dimensionality. The large number of factors resulting from polychoric-based methods (FAC in its different variants) may be due to an unprecise estimation of polychoric correlations because of the lack of normality of the latent responses. In the simulation, we considered only the case in which latent normality holds. This assumption is not tenable in the empirical application, and the psychometric literature shows that this might be a source of imprecision in the estimates of polychoric correlations (<xref ref-type="bibr" rid="B20">Gr&#x00F8;nneberg and Foldnes, 2022</xref>). All in all, we would recommend the solution with a small number of factors for parsimony and the questionability of polychoric-based methods in this sample.</p>
</sec>
</sec>
<sec id="S5" sec-type="discussion|conclusion">
<title>Discussion and conclusion</title>
<p>The purpose of this article was to investigate the behavior of the chi-square value of the FA when the data are categorical. We compare different approaches to deal with this problem and examine their performance to determine the number of factors under several design conditions in a simulation study. The article is oriented to investigators analyzing ordinal data, for example from Likert-type items, rating scales, etc. In this respect, we assume that an ordinal factor model is correct in the population and analyze the data using the most common methods available nowadays. It is important to notice that the FA (based on the normality assumption) cannot be generally correct in this situation, and it is included here mainly because of its popularity in the applied field and to illustrate some of the biases that it might produce. Appropriate alternatives to normal-FA that can be easily implemented are the FAC based on WLSMV estimation, the modified chi-square statistics, and the GRM.</p>
<p>Our results concur with those of the psychometric literature, pointing out two important drivers for overfactoring related to FA: the magnitude of factor loadings and the data skewness. We have stressed the importance of skewness because this phenomenon is inherent to ordinal data and perhaps has not received as much attention in the literature as the issue of factor loadings. More specifically, mixed skewness is of particular importance due to its effect on attenuating the correlation between factor indicators and the subsequent increase in the number of estimated factors. The results of our simulation study are congruent with <xref ref-type="bibr" rid="B43">Olsson (1979)</xref>&#x2019;s study and provide new insights into the performance of the robust methods and the FA for categorical variables.</p>
<p>Ordinal data generally present skewness because these data come from bounded response formats, such as Likert and rating scales, which are the response formats most commonly used in educational and psychological responses. Bounded scales generate asymmetric distributions when the mean of the variable approaches the lower or the upper bound. The FA model based on the normality assumption is often applied routinely to these data albeit being incorrect, which results in overfactoring and inconsistent parameter estimates. Our results show that mixed skewness and high factor loadings are especially associated to overfactoring in connection with this method. The other conditions manipulated in the simulation (including sample size, number of factor indicators, and number of response categories) were of secondary importance given the simulated conditions.</p>
<p>The problems with the FA model are not surprising since they have been documented for quite some time in the psychometric literature (<xref ref-type="bibr" rid="B42">Muth&#x00E9;n and Kaplan, 1985</xref>). However, our results showed that the factor model based on polychoric correlations (the FAC model) can be problematic if the estimation algorithm is not selected carefully. In general, the estimation operates in two stages: (1) Estimate the polychoric correlations from the categorical responses, and (2) estimate factor parameters from the polychoric correlations (using either maximum-likelihood or WLSMV in our simulations). The results show that the choice of an estimation method in stage 2 has important consequences, and WLSMV is far superior to any other method. However, the conclusions of the simulation depend on the conditions used in our study, which are ordinal data with binomial marginals and exact underlying normality. Some aspects in which our simulations can be extended are to consider other estimation methods, larger sample sizes, and different patterns of item thresholds. Apart from this, the estimation of polychoric correlations is sensitive to the normality assumption of the latent responses (<xref ref-type="bibr" rid="B16">Foldnes and Gr&#x00F8;nneberg, 2020</xref>), which is another area of active investigation.</p>
<p>The GRM model from IRT has shown better performance than FAC albeit being based on similar assumptions. One important advantage of GRM over FAC is that it avoids the use of polychoric correlations by relying on a marginal ML estimation procedure (MMLE; <xref ref-type="bibr" rid="B3">Bock and Aitkin, 1981</xref>). However, the Type-I error rate for the GRM is about 30%, still far from the nominal 5% level. GRM has two inherent limitations, the numerical integration procedure involved in estimation (the so-called <italic>curse of dimensionality</italic>) and the sparseness of the contingency table of manifest responses when the number of variables increases. MMLE cannot be applied with more than five or six factors. However, some recent advances (<xref ref-type="bibr" rid="B7">Cai, 2010</xref>) have opened the way to apply it with a larger number of factors, and constitute a viable alternative to factor-analyzing ordinal data.</p>
<p>All in all, our results suggest that the most tenable method for estimating dimensionality is the family of modified chi-square statistics (<italic>T</italic><sub><italic>M</italic></sub>, <italic>T</italic><sub><italic>MV</italic></sub>, and <italic>T</italic><sub><italic>d</italic></sub>) in connection with the WLSMV estimation method, as long as the assumption of underlying normality can be maintained. The Type-I error rate is about the nominal level for all conditions, which is also a very convenient property because researchers do not have to worry about which statistic to use and in which circumstances. However, the simulation also shows that recovery of the parameters is more precise for GRM than for the other methods. One possible explanation for this phenomenon is that GRM is estimated by a full-information marginal ML procedure that fits the model to the individual response patterns, whereas the polychoric-based methods use a limited information estimation method that fits the model to the second-order moments.</p>
<p>Parallel analysis is nowadays widely used to estimate the number of factors due to the problems associated with chi-square. The results of our simulation show that PA (based on Pearson correlations) performs better than FA but not better than GRM or <italic>T</italic><sub><italic>d</italic></sub>. The parallel analysis for categorical data inherited the problems associated with polychoric correlations and overfactoring persisted.</p>
<p>Apart from the aforementioned extensions of our simulated conditions, there are other directions in which this research can be expanded. One is to investigate other goodness-of-fit indices not considered here. Drawing on recent research by <xref ref-type="bibr" rid="B70">Xim&#x00E9;nez et al. (2022)</xref>, future research could be directed to assess the performance of the RMSEA and the unbiased SRMR index, which are consistent and asymptotically unbiased estimators of the parameter of interest and have shown good statistical properties and efficiency to provide interpretation guidelines to assess the goodness-of-fit (<xref ref-type="bibr" rid="B62">Shi et al., 2018</xref>; <xref ref-type="bibr" rid="B70">Xim&#x00E9;nez et al., 2022</xref>).</p>
<p>Future studies may also examine the consequences of distributional assumptions at different levels (<xref ref-type="bibr" rid="B71">Zygmont and Smith, 2014</xref>), which is currently a field of active psychometric research. Regarding manifest variables, the present article focuses on raking data because of the widespread use of Likert-type items in psychological and educational measurement. However, other types of variables may require a specific model. For example, continuous bounded variables may be analyzed with a factor model based on the beta distribution (<xref ref-type="bibr" rid="B47">Revuelta et al., 2022</xref>). A FA model can also be based on manifest indicators measured on a nominal scale (<xref ref-type="bibr" rid="B50">Revuelta et al., 2020</xref>, <xref ref-type="bibr" rid="B49">2021</xref>).</p>
<p>The methods for categorical data evaluated in this article are based on the assumption that manifest responses are the result of a discretization process for a latent response, where both latent factors and errors follow a normal distribution. This currently is the common setup in applied psychometrics, and this article is oriented to practitioners that analyze ordinal data with the common methods. However, there is also an active line of research that questions the normality of factors and errors (<xref ref-type="bibr" rid="B20">Gr&#x00F8;nneberg and Foldnes, 2022</xref>; <xref ref-type="bibr" rid="B26">Jobst et al., 2022</xref>; <xref ref-type="bibr" rid="B36">Manapat and Edwards, 2022</xref>). The validity of the conclusion from our simulations is contingent upon the normality and the other assumptions involved in the categorical factor model. While the literature has shown that tetrachoric correlations are distorted by violations of normality (<xref ref-type="bibr" rid="B21">Gr&#x00F8;nneberg et al., 2020</xref>), more investigation is needed to determine the consequences of this violation in the other methods, as well as the estimation of the distribution of latent responses (<xref ref-type="bibr" rid="B16">Foldnes and Gr&#x00F8;nneberg, 2020</xref>, <xref ref-type="bibr" rid="B17">2022</xref>). In the meantime, if the investigator insists on using polychoric-based methods, normality tests for latent responses shall be run before applying these methods.</p>
<p>Another generalization of the proposed methods consists of estimating the distribution of the latent factor instead of assuming a Gaussian variable. The present article has applied several options for the one-dimensional latent space (<xref ref-type="bibr" rid="B66">Woods, 2006</xref>; <xref ref-type="bibr" rid="B68">Woods and Lin, 2009</xref>), but extensions of these methods for multiple-factor models do not exist yet. One final area of investigation is the nominal factor model, which assumes that latent variables are Gaussian although it includes one latent response for each category instead of one response for the complete item as in the FAC model (<xref ref-type="bibr" rid="B48">Revuelta, 2014</xref>; <xref ref-type="bibr" rid="B49">Revuelta et al., 2021</xref>). The nominal factor model is more flexible and can fit the data in some circumstances where the FAC cannot. The development of intermediate models between the full nominal model and the FAC is another way of defining models that better represent the data and achieve an acceptable fit (<xref ref-type="bibr" rid="B65">Thissen et al., 2010</xref>).</p>
</sec>
<sec id="S6" sec-type="data-availability">
<title>Data availability statement</title>
<p>Publicly available datasets were analyzed in this study. This data can be found here: <ext-link ext-link-type="uri" xlink:href="https://prdlab.gatech.edu/unfolding/data/">https://prdlab.gatech.edu/unfolding/data/</ext-link>.</p>
</sec>
<sec id="S7">
<title>Author contributions</title>
<p>JR contributed to the conception of the manuscript, the design and planning of the simulation study, the development of the R code for the data simulation, the statistical analysis and interpretation of results, and the drafting of the manuscript. CX participated in the literature review, implementation of the robust statistical procedures, statistical analysis, and the revision of the manuscript. NM contributed to the implementation of the simulation study and the data analyses. All authors contributed to the article and approved the submitted version.</p>
</sec>
</body>
<back>
<sec id="S8" sec-type="funding-information">
<title>Funding</title>
<p>This work was supported by the Grant Nos. PGC2018-093838-B-I00 and PID2021-124885NB-I00 from the Spanish Ministerio de Ciencia, Innovaci&#x00F3;n y Universidades. Simulations have been run with the support of the Centro de Computaci&#x00F3;n Cient&#x00ED;fica at the Autonoma University of Madrid (Center for Scientific Computation, CCC-UAM).</p>
</sec>
<sec id="S9" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="S10" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="S11" sec-type="supplementary-material">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fpsyg.2022.982137/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fpsyg.2022.982137/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.docx" id="DS1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<fn-group>
<fn id="footnote1">
<label>1</label>
<p>In the original formulation of the model, <bold><italic>e</italic></bold> splits into a unique factor and a measurement error. However, such a distinction is mainly of a theoretical interest as these two elements cannot be empirically distinguished in most practical settings.</p></fn>
<fn id="footnote2">
<label>2</label>
<p><ext-link ext-link-type="uri" xlink:href="https://prdlab.gatech.edu/unfolding/data/">https://prdlab.gatech.edu/unfolding/data/</ext-link></p></fn>
</fn-group>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Asparouhov</surname> <given-names>T.</given-names></name> <name><surname>Muth&#x00E9;n</surname> <given-names>B.</given-names></name></person-group> (<year>2010</year>). <source><italic>Simple second order chi-square correction.</italic></source> Available online at: <ext-link ext-link-type="uri" xlink:href="https://www.statmodel.com/download/WLSMV_new_chi21.pdf">https://www.statmodel.com/download/WLSMV_new_chi21.pdf</ext-link> <comment>(accessed July 16, 2022)</comment>.</citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bandalos</surname> <given-names>D. L.</given-names></name></person-group> (<year>2014</year>). <article-title>Relative performance of categorical diagonally weighted least squares and robust maximum likelihood estimation.</article-title> <source><italic>Struct. Equ. Model. Multidiscip. J.</italic></source> <volume>21</volume> <fpage>102</fpage>&#x2013;<lpage>116</lpage>. <pub-id pub-id-type="doi">10.1080/10705511.2014.859510</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bock</surname> <given-names>R. D.</given-names></name> <name><surname>Aitkin</surname> <given-names>M.</given-names></name></person-group> (<year>1981</year>). <article-title>Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm.</article-title> <source><italic>Psychometrika</italic></source> <volume>46</volume> <fpage>443</fpage>&#x2013;<lpage>459</lpage>. <pub-id pub-id-type="doi">10.1007/BF02293801</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Browne</surname> <given-names>M. W.</given-names></name></person-group> (<year>1984</year>). <article-title>Asymptotic distribution free methods in the analysis of covariance structures.</article-title> <source><italic>Br. J. Math. Stat. Psychol.</italic></source> <volume>37</volume> <fpage>127</fpage>&#x2013;<lpage>141</lpage>. <pub-id pub-id-type="doi">10.1111/j.2044-8317.1984.tb00789.x</pub-id> <pub-id pub-id-type="pmid">6733054</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Browne</surname> <given-names>M. W.</given-names></name> <name><surname>Cudeck</surname> <given-names>R.</given-names></name></person-group> (<year>1992</year>). <article-title>Alternative ways of assessing model fit.</article-title> <source><italic>Sociol. Methods Res.</italic></source> <volume>21</volume> <fpage>230</fpage>&#x2013;<lpage>258</lpage>. <pub-id pub-id-type="doi">10.1177/0049124192021002005</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bryant</surname> <given-names>F. B.</given-names></name> <name><surname>Satorra</surname> <given-names>A.</given-names></name></person-group> (<year>2012</year>). <article-title>Principles and practice of scaled difference chi-square testing.</article-title> <source><italic>Struct. Equ. Model. Multidiscip. J.</italic></source> <volume>19</volume> <fpage>372</fpage>&#x2013;<lpage>398</lpage>. <pub-id pub-id-type="doi">10.1080/10705511.2012.687671</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cai</surname> <given-names>L.</given-names></name></person-group> (<year>2010</year>). <article-title>High-dimensional exploratory item factor analysis by a metropolis&#x2013;hastings robbins&#x2013;monro algorithm.</article-title> <source><italic>Psychometrika</italic></source> <volume>75</volume> <fpage>33</fpage>&#x2013;<lpage>57</lpage>. <pub-id pub-id-type="doi">10.1007/s11336-009-9136-x</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chalmers</surname> <given-names>R. P.</given-names></name></person-group> (<year>2012</year>). <article-title>mirt: A multidimensional item response theory package for the R environment.</article-title> <source><italic>J. Stat. Softw.</italic></source> <volume>48</volume> <fpage>1</fpage>&#x2013;<lpage>29</lpage>. <pub-id pub-id-type="doi">10.18637/jss.v048.i06</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Christoffersson</surname> <given-names>A.</given-names></name></person-group> (<year>1975</year>). <article-title>Factor analysis of dichotomized variables.</article-title> <source><italic>Psychometrika</italic></source> <volume>40</volume> <fpage>5</fpage>&#x2013;<lpage>32</lpage>. <pub-id pub-id-type="doi">10.1007/BF02291477</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cudeck</surname> <given-names>R.</given-names></name> <name><surname>Browne</surname> <given-names>M. W.</given-names></name></person-group> (<year>1985</year>). <article-title>Constructing a covariance matrix that yields a specified minimizer and a specified minimum discrepancy function value.</article-title> <source><italic>Psychometrika</italic></source> <volume>57</volume> <fpage>357</fpage>&#x2013;<lpage>369</lpage>. <pub-id pub-id-type="doi">10.1007/BF02295424</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Curran</surname> <given-names>P. J.</given-names></name> <name><surname>West</surname> <given-names>S. G.</given-names></name> <name><surname>Finch</surname> <given-names>J. F.</given-names></name></person-group> (<year>1996</year>). <article-title>The robustness of test statistics to nonnormality and specification error in confirmatory factor analysis.</article-title> <source><italic>Psychol. Methods</italic></source> <volume>1</volume> <fpage>16</fpage>&#x2013;<lpage>29</lpage>. <pub-id pub-id-type="doi">10.1037/1082-989X.1.1.16</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>DiStefano</surname> <given-names>C.</given-names></name> <name><surname>Morgan</surname> <given-names>G. B.</given-names></name></person-group> (<year>2014</year>). <article-title>A comparison of diagonal weighted least squares robust estimation techniques for ordinal data.</article-title> <source><italic>Struct. Equ. Model. Multidiscip. J.</italic></source> <volume>21</volume> <fpage>425</fpage>&#x2013;<lpage>438</lpage>. <pub-id pub-id-type="doi">10.1080/10705511.2014.915373</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ferrando</surname> <given-names>P. J.</given-names></name> <name><surname>Lorenzo-Seva</surname> <given-names>U.</given-names></name></person-group> (<year>2010</year>). <article-title>Acquiescence a source of bias and model and person misfit: A theoretical and empirical analysis.</article-title> <source><italic>Br. J. Math. Stat. Psychol.</italic></source> <volume>63</volume> <fpage>427</fpage>&#x2013;<lpage>448</lpage>. <pub-id pub-id-type="doi">10.1348/000711009X470740</pub-id> <pub-id pub-id-type="pmid">19840493</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ferrando</surname> <given-names>P. J.</given-names></name> <name><surname>Lorenzo-Seva</surname> <given-names>U.</given-names></name></person-group> (<year>2013</year>). <source><italic>Unrestricted item factor analysis and some relations with item response theory.</italic></source> <comment>Technical Report</comment>. <publisher-loc>Tarragona</publisher-loc>: <publisher-name>Department of Psychology, Universitat Rovira i Virgili</publisher-name>.</citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Flora</surname> <given-names>D. B.</given-names></name> <name><surname>Curran</surname> <given-names>P. J.</given-names></name></person-group> (<year>2004</year>). <article-title>An empirical evaluation of alternative methods of estimation for confirmatory factor analysis with ordinal data.</article-title> <source><italic>Psychol. Methods</italic></source> <volume>9</volume> <fpage>466</fpage>&#x2013;<lpage>491</lpage>. <pub-id pub-id-type="doi">10.1037/1082-989X.9.4.466</pub-id> <pub-id pub-id-type="pmid">15598100</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Foldnes</surname> <given-names>N.</given-names></name> <name><surname>Gr&#x00F8;nneberg</surname> <given-names>S.</given-names></name></person-group> (<year>2020</year>). <article-title>Pernicious polychorics: The impact and detection of underlying non-normality.</article-title> <source><italic>Struct. Equ. Model. Multidiscip. J.</italic></source> <volume>27</volume> <fpage>525</fpage>&#x2013;<lpage>543</lpage>. <pub-id pub-id-type="doi">10.1080/10705511.2019.1673168</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Foldnes</surname> <given-names>N.</given-names></name> <name><surname>Gr&#x00F8;nneberg</surname> <given-names>S.</given-names></name></person-group> (<year>2022</year>). <article-title>The sensitivity of structural equation modeling with ordinal data to underlying non-normality and observed distributional forms</article-title>. <source><italic>Psychol. Methods</italic></source> <volume>27</volume>, <fpage>541</fpage>&#x2013;<lpage>567</lpage>. <pub-id pub-id-type="doi">10.1037/met0000385</pub-id> <pub-id pub-id-type="pmid">33793270</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Forero</surname> <given-names>C. G.</given-names></name> <name><surname>Maydeu-Olivares</surname> <given-names>A.</given-names></name></person-group> (<year>2009</year>). <article-title>Estimation of IRT graded response models: Limited versus full information methods.</article-title> <source><italic>Psychol. Methods</italic></source> <volume>14</volume> <fpage>275</fpage>&#x2013;<lpage>299</lpage>. <pub-id pub-id-type="doi">10.1037/a0015825</pub-id> <pub-id pub-id-type="pmid">19719362</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Forero</surname> <given-names>C. G.</given-names></name> <name><surname>Maydeu-Olivares</surname> <given-names>A.</given-names></name> <name><surname>Gallardo-Pujol</surname> <given-names>D.</given-names></name></person-group> (<year>2009</year>). <article-title>Factor analysis with ordinal indicators: A monte carlo study comparing DWLS and ULS estimation.</article-title> <source><italic>Struct. Equ. Model. Multidiscip. J.</italic></source> <volume>16</volume> <fpage>625</fpage>&#x2013;<lpage>641</lpage>. <pub-id pub-id-type="doi">10.1080/10705510903203573</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gr&#x00F8;nneberg</surname> <given-names>S.</given-names></name> <name><surname>Foldnes</surname> <given-names>N.</given-names></name></person-group> (<year>2022</year>). <article-title>Factor analyzing ordinal items requires substantive knowledge of response marginals.</article-title> <source><italic>Psychol. Methods</italic></source> <pub-id pub-id-type="doi">10.1037/met0000495</pub-id> <pub-id pub-id-type="pmid">35588076</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gr&#x00F8;nneberg</surname> <given-names>S.</given-names></name> <name><surname>Moss</surname> <given-names>J.</given-names></name> <name><surname>Foldnes</surname> <given-names>N.</given-names></name></person-group> (<year>2020</year>). <article-title>Partial identification of latent correlations with binary data.</article-title> <source><italic>Psychometrika</italic></source> <volume>85</volume> <fpage>1028</fpage>&#x2013;<lpage>1051</lpage>. <pub-id pub-id-type="doi">10.1007/s11336-020-09737-y</pub-id> <pub-id pub-id-type="pmid">33346887</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hancock</surname> <given-names>G. R.</given-names></name> <name><surname>Mueller</surname> <given-names>R. O.</given-names></name></person-group> (<year>2011</year>). <article-title>The reliability paradox in assessing structural relations within covariance structure models.</article-title> <source><italic>Educ. Psychol. Meas.</italic></source> <volume>71</volume> <fpage>306</fpage>&#x2013;<lpage>324</lpage>. <pub-id pub-id-type="doi">10.1177/0013164410384856</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ho</surname> <given-names>A. D.</given-names></name> <name><surname>Yu</surname> <given-names>C. C.</given-names></name></person-group> (<year>2015</year>). <article-title>Descriptive statistics for modern test score distributions: skewness, kurtosis, discreteness, and ceiling effects.</article-title> <source><italic>Educ. Psychol. Meas.</italic></source> <volume>75</volume> <fpage>365</fpage>&#x2013;<lpage>388</lpage>. <pub-id pub-id-type="doi">10.1177/0013164414548576</pub-id> <pub-id pub-id-type="pmid">29795825</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hu</surname> <given-names>L.</given-names></name> <name><surname>Bentler</surname> <given-names>P. M.</given-names></name> <name><surname>Kano</surname> <given-names>Y.</given-names></name></person-group> (<year>1992</year>). <article-title>Can test statistics in covariance structure analysis be trusted?</article-title> <source><italic>Psychol. Bull.</italic></source> <volume>112</volume> <fpage>351</fpage>&#x2013;<lpage>362</lpage>. <pub-id pub-id-type="doi">10.1037/0033-2909.112.2.351</pub-id> <pub-id pub-id-type="pmid">1454899</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Joanes</surname> <given-names>D. N.</given-names></name> <name><surname>Gill</surname> <given-names>C. A.</given-names></name></person-group> (<year>1998</year>). <article-title>Comparing measures of sample skewness and kurtosis.</article-title> <source><italic>Statistician</italic></source> <volume>47</volume> <fpage>183</fpage>&#x2013;<lpage>189</lpage>. <pub-id pub-id-type="doi">10.1111/1467-9884.00122</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jobst</surname> <given-names>L. J.</given-names></name> <name><surname>Auerswald</surname> <given-names>M.</given-names></name> <name><surname>Moshagen</surname> <given-names>M.</given-names></name></person-group> (<year>2022</year>). <article-title>The effect of latent and error non-normality on measures of fit in structural equation modeling.</article-title> <source><italic>Educ. Psychol. Meas.</italic></source> <volume>82</volume> <fpage>911</fpage>&#x2013;<lpage>937</lpage>. <pub-id pub-id-type="doi">10.1177/00131644211046201</pub-id> <pub-id pub-id-type="pmid">35989731</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>J&#x00F6;reskog</surname> <given-names>K. G.</given-names></name> <name><surname>Moustaki</surname> <given-names>I.</given-names></name></person-group> (<year>2001</year>). <article-title>Factor analysis of ordinal variables: A comparison of three approaches.</article-title> <source><italic>Multivar. Behav. Res.</italic></source> <volume>36</volume> <fpage>347</fpage>&#x2013;<lpage>387</lpage>. <pub-id pub-id-type="doi">10.1207/S15327906347-387</pub-id> <pub-id pub-id-type="pmid">26751181</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jorgensen</surname> <given-names>T. D.</given-names></name> <name><surname>Johnson</surname> <given-names>A. R.</given-names></name></person-group> (<year>2022</year>). <article-title>How to derive expected values of structural equation model parameters when treating discrete data as continuous.</article-title> <source><italic>Struct. Equ. Model. Multidis. J.</italic></source> <volume>29</volume> <fpage>639</fpage>&#x2013;<lpage>650</lpage>. <pub-id pub-id-type="doi">10.1080/10705511.2021.1988609</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jorgensen</surname> <given-names>T. D.</given-names></name> <name><surname>Pornprasertmanit</surname> <given-names>S.</given-names></name> <name><surname>Schoemann</surname> <given-names>A. M.</given-names></name> <name><surname>Rosseel</surname> <given-names>Y.</given-names></name></person-group> (<year>2022</year>). <source><italic>semTools: Useful tools for structural equation modeling: A Multidisciplinary Journal. R package version 0.5-6.</italic></source> Available online at: <ext-link ext-link-type="uri" xlink:href="https://CRAN.R-project.org/package=semTools">https://CRAN.R-project.org/package=semTools</ext-link></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lawley</surname> <given-names>D. N.</given-names></name> <name><surname>Maxwell</surname> <given-names>A. E.</given-names></name></person-group> (<year>1971</year>). <source><italic>Factor analysis as a statistical method</italic></source>, <edition>2nd Edn</edition>. <publisher-loc>London</publisher-loc>: <publisher-name>Butterworths</publisher-name>.</citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>C.</given-names></name> <name><surname>Zhang</surname> <given-names>G.</given-names></name> <name><surname>Edwards</surname> <given-names>M. C.</given-names></name></person-group> (<year>2012</year>). <article-title>Ordinary least squares estimation of parameters in exploratory factor analysis with ordinal data.</article-title> <source><italic>Multivar. Behav. Res.</italic></source> <volume>47</volume> <fpage>314</fpage>&#x2013;<lpage>339</lpage>. <pub-id pub-id-type="doi">10.1080/00273171.2012.658340</pub-id> <pub-id pub-id-type="pmid">26734852</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>C. H.</given-names></name></person-group> (<year>2016</year>). <article-title>Confirmatory factor analysis with ordinal data: Comparing robust maximum likelihood and diagonally weighted least squares.</article-title> <source><italic>Behav. Res. Methods</italic></source> <volume>48</volume> <fpage>936</fpage>&#x2013;<lpage>949</lpage>. <pub-id pub-id-type="doi">10.3758/s13428-015-0619-7</pub-id> <pub-id pub-id-type="pmid">26174714</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lim</surname> <given-names>S.</given-names></name> <name><surname>Jahng</surname> <given-names>S.</given-names></name></person-group> (<year>2019</year>). <article-title>Determining the number of factors using parallel analysis and its recent variants.</article-title> <source><italic>Psychol. Methods</italic></source> <volume>24</volume> <fpage>452</fpage>&#x2013;<lpage>467</lpage>. <pub-id pub-id-type="doi">10.1037/met0000230</pub-id> <pub-id pub-id-type="pmid">31180694</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lindwall</surname> <given-names>M.</given-names></name> <name><surname>Barkoukis</surname> <given-names>V.</given-names></name> <name><surname>Grano</surname> <given-names>C.</given-names></name> <name><surname>Lucidi</surname> <given-names>F.</given-names></name> <name><surname>Raudsepp</surname> <given-names>L.</given-names></name> <name><surname>Liukkonen</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Method effects: The problem with negatively versus positively keyed items.</article-title> <source><italic>J. Pers. Assess.</italic></source> <volume>94</volume> <fpage>196</fpage>&#x2013;<lpage>204</lpage>. <pub-id pub-id-type="doi">10.1080/00223891.2011.645936</pub-id> <pub-id pub-id-type="pmid">22339312</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>L&#x00FC;decke</surname> <given-names>D.</given-names></name></person-group> (<year>2021</year>). <source><italic>sjstats: Statistical functions for regression models (Version 0.18.1).</italic></source> <pub-id pub-id-type="doi">10.5281/zenodo.1284472</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Manapat</surname> <given-names>P. D.</given-names></name> <name><surname>Edwards</surname> <given-names>M. C.</given-names></name></person-group> (<year>2022</year>). <article-title>Examining the robustness of the graded response and 2-parameter logistic models to violations of construct normality.</article-title> <source><italic>Educ. Psychol. Meas.</italic></source> <volume>82</volume> <fpage>967</fpage>&#x2013;<lpage>988</lpage>. <pub-id pub-id-type="doi">10.1177/00131644211063453</pub-id> <pub-id pub-id-type="pmid">35989729</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McDonald</surname> <given-names>R. P.</given-names></name> <name><surname>Ahlawat</surname> <given-names>K. S.</given-names></name></person-group> (<year>1974</year>). <article-title>Difficulty factors in binary data.</article-title> <source><italic>Br. J. Math. Stat. Psychol.</italic></source> <volume>27</volume> <fpage>82</fpage>&#x2013;<lpage>99</lpage>. <pub-id pub-id-type="doi">10.1111/j.2044-8317.1974.tb00530.x</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McNeish</surname> <given-names>D.</given-names></name> <name><surname>An</surname> <given-names>J.</given-names></name> <name><surname>Hancock</surname> <given-names>G. R.</given-names></name></person-group> (<year>2018</year>). <article-title>The thorny relation between measurement quality and fit index cutoffs in latent variable models.</article-title> <source><italic>J. Pers. Assess.</italic></source> <volume>100</volume> <fpage>43</fpage>&#x2013;<lpage>52</lpage>. <pub-id pub-id-type="doi">10.1080/00223891.2017.1281286</pub-id> <pub-id pub-id-type="pmid">28631976</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meyer</surname> <given-names>D.</given-names></name> <name><surname>Dimitriadou</surname> <given-names>E.</given-names></name> <name><surname>Hornik</surname> <given-names>K.</given-names></name> <name><surname>Weingessel</surname> <given-names>A.</given-names></name> <name><surname>Leisch</surname> <given-names>F.</given-names></name> <name><surname>Chang</surname> <given-names>C. C.</given-names></name><etal/></person-group> (<year>2021</year>). <source><italic>e1071: Misc functions of the department of statistics, probability theory group (Formerly: E1071), TU Wien [R Package Version 1.7-9].</italic></source> <comment>Comprehensive R Archive Network (CRAN)</comment>.</citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Muth&#x00E9;n</surname> <given-names>B.</given-names></name></person-group> (<year>1978</year>). <article-title>Contributions to factor analysis of dichotomous variables.</article-title> <source><italic>Psychometrika</italic></source> <volume>43</volume> <fpage>551</fpage>&#x2013;<lpage>560</lpage>. <pub-id pub-id-type="doi">10.1007/BF02293813</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Muth&#x00E9;n</surname> <given-names>B.</given-names></name></person-group> (<year>1984</year>). <article-title>A general structural equation model with dichotomous, ordered categorical, and continuous latent variable indicators.</article-title> <source><italic>Psychometrika</italic></source> <volume>49</volume> <fpage>115</fpage>&#x2013;<lpage>132</lpage>. <pub-id pub-id-type="doi">10.1007/BF02294210</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Muth&#x00E9;n</surname> <given-names>B.</given-names></name> <name><surname>Kaplan</surname> <given-names>D.</given-names></name></person-group> (<year>1985</year>). <article-title>A comparison of some methodologies for the factor analysis of non-normal Likert variables.</article-title> <source><italic>Br. J. Math. Stat. Psychol.</italic></source> <volume>38</volume> <fpage>171</fpage>&#x2013;<lpage>189</lpage>. <pub-id pub-id-type="doi">10.1111/j.2044-8317.1985.tb00832.x</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Olsson</surname> <given-names>U.</given-names></name></person-group> (<year>1979</year>). <article-title>On the robustness of factor analysis against crude classification of the observations.</article-title> <source><italic>Multivar. Behav. Res.</italic></source> <volume>14</volume> <fpage>485</fpage>&#x2013;<lpage>500</lpage>. <pub-id pub-id-type="doi">10.1207/s15327906mbr1404_7</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Paek</surname> <given-names>I.</given-names></name> <name><surname>Cui</surname> <given-names>M.</given-names></name> <name><surname>&#x00D6;zt&#x00FC;rk G&#x00FC;be&#x015F;</surname> <given-names>N.</given-names></name> <name><surname>Yang</surname> <given-names>Y.</given-names></name></person-group> (<year>2018</year>). <article-title>Estimation of an IRT model by Mplus for dichotomously scored responses under different estimation methods.</article-title> <source><italic>Educ. Psychol. Meas.</italic></source> <volume>78</volume> <fpage>569</fpage>&#x2013;<lpage>588</lpage>. <pub-id pub-id-type="doi">10.1177/0013164417715738</pub-id> <pub-id pub-id-type="pmid">30147117</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><collab>R Core Team</collab> (<year>2021</year>). <source><italic>R: A language and environment for statistical computing. R foundation for statistical computing.</italic></source> <publisher-loc>Vienna</publisher-loc>: <publisher-name>R Core Team</publisher-name>.</citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Revelle</surname> <given-names>W.</given-names></name></person-group> (<year>2021</year>). <source><italic>psych: Procedures for personality and psychological research.</italic></source> <publisher-loc>Evanston, IL</publisher-loc>: <publisher-name>Northwestern University</publisher-name>.</citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Revuelta</surname> <given-names>H.</given-names></name> <name><surname>Hidalgo</surname> <given-names>B.</given-names></name> <name><surname>Alcazar-C&#x00F3;rcoles</surname> <given-names>M. A.</given-names></name></person-group> (<year>2022</year>). <article-title>Bayesian estimation and testing of a beta factor model for bounded continuous variables.</article-title> <source><italic>Multivar. Behav. Res.</italic></source> <volume>57</volume> <fpage>57</fpage>&#x2013;<lpage>78</lpage>. <pub-id pub-id-type="doi">10.1080/00273171.2020.1805582</pub-id> <pub-id pub-id-type="pmid">32804553</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Revuelta</surname> <given-names>J.</given-names></name></person-group> (<year>2014</year>). <article-title>Multidimensional item response model for nominal variables.</article-title> <source><italic>Appl. Psychol. Meas.</italic></source> <volume>38</volume> <fpage>549</fpage>&#x2013;<lpage>562</lpage>. <pub-id pub-id-type="doi">10.1177/0146621614536272</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Revuelta</surname> <given-names>J.</given-names></name> <name><surname>Franco-Mart&#x00ED;nez</surname> <given-names>A.</given-names></name> <name><surname>Xim&#x00E9;nez</surname> <given-names>C.</given-names></name></person-group> (<year>2021</year>). <article-title>Nominal factor analysis of situational judgment tests: Evaluation of latent dimensionality and factorial invariance.</article-title> <source><italic>Educ. Psychol. Meas.</italic></source> <volume>81</volume> <fpage>1054</fpage>&#x2013;<lpage>1088</lpage>. <pub-id pub-id-type="doi">10.1177/0013164421994321</pub-id> <pub-id pub-id-type="pmid">34565816</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Revuelta</surname> <given-names>J.</given-names></name> <name><surname>Maydeu-Olivares</surname> <given-names>A.</given-names></name> <name><surname>Xim&#x00E9;nez</surname> <given-names>C.</given-names></name></person-group> (<year>2020</year>). <article-title>Factor analysis for nominal (first choice) data.</article-title> <source><italic>Struct. Equ. Model. Multidis. J.</italic></source> <volume>27</volume> <fpage>781</fpage>&#x2013;<lpage>795</lpage>. <pub-id pub-id-type="doi">10.1080/10705511.2019.1668276</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Robitzsch</surname> <given-names>A.</given-names></name></person-group> (<year>2020</year>). <article-title>Why ordinal variables can (almost) always be treated as continuous variables: Clarifying assumptions of robust continuous and ordinal factor analysis estimation methods.</article-title> <source><italic>Front. Educ.</italic></source> <volume>5</volume>:<issue>589965</issue>. <pub-id pub-id-type="doi">10.3389/feduc.2020.589965</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Robitzsch</surname> <given-names>A.</given-names></name></person-group> (<year>2022</year>). <article-title>On the bias in confirmatory factor analysis when treating discrete variables as ordinal instead of continuous.</article-title> <source><italic>Axioms</italic></source> <volume>11</volume>:<issue>162</issue>. <pub-id pub-id-type="doi">10.3390/axioms11040162</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rosander</surname> <given-names>A. C.</given-names></name> <name><surname>Thurstone</surname> <given-names>L. L.</given-names></name></person-group> (<year>1931</year>). &#x201C;<article-title>Scale of attitude toward censorship: Scale No. 28</article-title>,&#x201D; in <source><italic>The measurement of social attitudes</italic></source>, <role>ed.</role> <person-group person-group-type="editor"><name><surname>Thurstone</surname> <given-names>L. L.</given-names></name></person-group> (<publisher-loc>Chicago, IL</publisher-loc>: <publisher-name>University of Chicago Press</publisher-name>).</citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rosseel</surname> <given-names>Y.</given-names></name></person-group> (<year>2012</year>). <article-title>lavaan: An R package for structural equation modeling.</article-title> <source><italic>J. Stat. Softw.</italic></source> <volume>48</volume> <fpage>1</fpage>&#x2013;<lpage>36</lpage>. <pub-id pub-id-type="doi">10.3389/fpsyg.2014.01521</pub-id> <pub-id pub-id-type="pmid">25601849</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Samejima</surname> <given-names>F.</given-names></name></person-group> (<year>1969</year>). <source><italic>Estimation of ability using a response pattern of graded scores. Psychometrika Monograph, 17.</italic></source> <publisher-loc>Richmond, VA</publisher-loc>: <publisher-name>Psychometric Society</publisher-name>. <pub-id pub-id-type="doi">10.1007/BF03372160</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Samejima</surname> <given-names>F.</given-names></name></person-group> (<year>2016</year>). &#x201C;<article-title>Graded response models</article-title>,&#x201D; in <source><italic>Handbook of Item response theory, volume one</italic></source>, <role>ed.</role> <person-group person-group-type="editor"><name><surname>van der Linden</surname> <given-names>W. J.</given-names></name></person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Chapman and Hall/CRC</publisher-name>), <fpage>123</fpage>&#x2013;<lpage>136</lpage>.</citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Satorra</surname> <given-names>A.</given-names></name> <name><surname>Bentler</surname> <given-names>P. M.</given-names></name></person-group> (<year>2001</year>). <article-title>A scaled difference chi-square test statistic for moment structure analysis.</article-title> <source><italic>Psychometrika</italic></source> <volume>66</volume> <fpage>507</fpage>&#x2013;<lpage>514</lpage>. <pub-id pub-id-type="doi">10.1007/BF02296192</pub-id></citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Satorra</surname> <given-names>A.</given-names></name> <name><surname>Bentler</surname> <given-names>P. M.</given-names></name></person-group> (<year>2010</year>). <article-title>Ensuring positiveness of the scaled difference chi-square test statistic.</article-title> <source><italic>Psychometrika</italic></source> <volume>75</volume> <fpage>243</fpage>&#x2013;<lpage>248</lpage>. <pub-id pub-id-type="doi">10.1007/s11336-009-9135-y</pub-id> <pub-id pub-id-type="pmid">20640194</pub-id></citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Savalei</surname> <given-names>V.</given-names></name></person-group> (<year>2018</year>). <article-title>On the computation of the RMSEA and CFI from the mean-and-variance corrected test statistic with nonnormal data in SEM.</article-title> <source><italic>Multivar. Behav. Res.</italic></source> <volume>53</volume> <fpage>419</fpage>&#x2013;<lpage>429</lpage>. <pub-id pub-id-type="doi">10.1080/00273171.2018.1455142</pub-id> <pub-id pub-id-type="pmid">29624085</pub-id></citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Savalei</surname> <given-names>V.</given-names></name> <name><surname>Falk</surname> <given-names>C. F.</given-names></name></person-group> (<year>2014</year>). <article-title>Recovering substantive factor loadings in the presence of acquiescence bias: A comparison of three approaches.</article-title> <source><italic>Multivar. Behav. Res.</italic></source> <volume>49</volume> <fpage>407</fpage>&#x2013;<lpage>424</lpage>. <pub-id pub-id-type="doi">10.1080/00273171.2014.931800</pub-id> <pub-id pub-id-type="pmid">26732356</pub-id></citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shaw</surname> <given-names>M. E.</given-names></name> <name><surname>Wright</surname> <given-names>J. M.</given-names></name></person-group> (<year>1967</year>). <source><italic>Scales for the measurement of attitudes.</italic></source> <publisher-loc>New York, NY</publisher-loc>: <publisher-name>McGraw-Hill</publisher-name>.</citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shi</surname> <given-names>D.</given-names></name> <name><surname>Maydeu-Olivares</surname> <given-names>A.</given-names></name> <name><surname>DiStefano</surname> <given-names>C.</given-names></name></person-group> (<year>2018</year>). <article-title>The relationship between the standardized root mean square residual and model misspecification in factor analysis models.</article-title> <source><italic>Multivar. Behav. Res.</italic></source> <volume>53</volume> <fpage>676</fpage>&#x2013;<lpage>694</lpage>. <pub-id pub-id-type="doi">10.1080/00273171.2018.1476221</pub-id> <pub-id pub-id-type="pmid">30596259</pub-id></citation></ref>
<ref id="B63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smits</surname> <given-names>N.</given-names></name> <name><surname>&#x00D6;&#x00F0;reden</surname> <given-names>O.</given-names></name> <name><surname>Garnier-Villarreal</surname> <given-names>M.</given-names></name> <name><surname>Terwee</surname> <given-names>C. B.</given-names></name> <name><surname>Chalmers</surname> <given-names>R. P.</given-names></name></person-group> (<year>2020</year>). <article-title>A study of alternative approaches to non-normal latent trait distributions in item response theory models used for health outcome measurement.</article-title> <source><italic>Stat. Methods Med. Res.</italic></source> <volume>29</volume> <fpage>1030</fpage>&#x2013;<lpage>1048</lpage>. <pub-id pub-id-type="doi">10.1177/0962280220907625</pub-id> <pub-id pub-id-type="pmid">32156195</pub-id></citation></ref>
<ref id="B64"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Takane</surname> <given-names>Y.</given-names></name> <name><surname>de Leeuw</surname> <given-names>J.</given-names></name></person-group> (<year>1987</year>). <article-title>On the relationship between item response theory and factor analysis of discretized variables.</article-title> <source><italic>Psychometrika</italic></source> <volume>52</volume> <fpage>393</fpage>&#x2013;<lpage>408</lpage>. <pub-id pub-id-type="doi">10.1007/BF02294363</pub-id></citation></ref>
<ref id="B65"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thissen</surname> <given-names>D.</given-names></name> <name><surname>Cai</surname> <given-names>L.</given-names></name> <name><surname>Bock</surname> <given-names>R. D.</given-names></name></person-group> (<year>2010</year>). &#x201C;<article-title>The nominal categories model</article-title>,&#x201D; in <source><italic>Handbook of polytomous item response theory models</italic></source>, <role>eds</role> <person-group person-group-type="editor"><name><surname>Nering</surname> <given-names>M. L.</given-names></name> <name><surname>Ostini</surname> <given-names>R.</given-names></name></person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Routledge</publisher-name>), <fpage>43</fpage>&#x2013;<lpage>76</lpage>.</citation></ref>
<ref id="B66"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Woods</surname> <given-names>C. M.</given-names></name></person-group> (<year>2006</year>). <article-title>Ramsay-curve item response theory (RC-IRT) to detect and correct for nonnormal latent variables.</article-title> <source><italic>Psychol. Methods</italic></source> <volume>11</volume> <fpage>253</fpage>&#x2013;<lpage>270</lpage>. <pub-id pub-id-type="doi">10.1037/1082-989X.11.3.253</pub-id> <pub-id pub-id-type="pmid">16953704</pub-id></citation></ref>
<ref id="B67"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Woods</surname> <given-names>C. M.</given-names></name></person-group> (<year>2007</year>). <article-title>Empirical histograms in item response theory with ordinal data.</article-title> <source><italic>Educ. Psychol. Meas.</italic></source> <volume>67</volume> <fpage>73</fpage>&#x2013;<lpage>87</lpage>. <pub-id pub-id-type="doi">10.1177/0013164406288163</pub-id></citation></ref>
<ref id="B68"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Woods</surname> <given-names>C. M.</given-names></name> <name><surname>Lin</surname> <given-names>N.</given-names></name></person-group> (<year>2009</year>). <article-title>Item response theory with estimation of the latent density using Davidian curves.</article-title> <source><italic>Appl. Psychol. Meas.</italic></source> <volume>33</volume> <fpage>102</fpage>&#x2013;<lpage>117</lpage>. <pub-id pub-id-type="doi">10.1177/0146621608319512</pub-id></citation></ref>
<ref id="B69"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xim&#x00E9;nez</surname> <given-names>C.</given-names></name></person-group> (<year>2006</year>). <article-title>A Monte Carlo study of recovery of weak factor loadings in confirmatory factor analysis</article-title>. <source><italic>Struct. Equ. Modeling multidiscipl. J.</italic></source> <volume>13</volume>, <fpage>587</fpage>&#x2013;<lpage>614</lpage>. <pub-id pub-id-type="doi">10.1207/s15328007sem1304_5</pub-id> <pub-id pub-id-type="pmid">33486653</pub-id></citation></ref>
<ref id="B70"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xim&#x00E9;nez</surname> <given-names>C.</given-names></name> <name><surname>Maydeu-Olivares</surname> <given-names>A.</given-names></name> <name><surname>Shi</surname> <given-names>D.</given-names></name> <name><surname>Revuelta</surname> <given-names>J.</given-names></name></person-group> (<year>2022</year>). <article-title>Assessing cutoff values of SEM fit indices: Advantages of the unbiased SRMR index and its cutoff criterion based on communality.</article-title> <source><italic>Struct. Equ. Model. Multidis. J.</italic></source> <volume>29</volume> <fpage>368</fpage>&#x2013;<lpage>380</lpage>. <pub-id pub-id-type="doi">10.1080/10705511.2021.1992596</pub-id></citation></ref>
<ref id="B71"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zygmont</surname> <given-names>C.</given-names></name> <name><surname>Smith</surname> <given-names>M. R.</given-names></name></person-group> (<year>2014</year>). <article-title>Robust factor analysis in the presence of normality violations, missing data, and outliers: Empirical questions and possible solutions.</article-title> <source><italic>Quant. Methods Psychol.</italic></source> <volume>10</volume> <fpage>40</fpage>&#x2013;<lpage>55</lpage>. <pub-id pub-id-type="doi">10.20982/tqmp.10.1.p040</pub-id></citation></ref>
</ref-list>
</back>
</article>