<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychol.</journal-id>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychol.</abbrev-journal-title>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2022.980261</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Predicted as observed? How to identify empirically adequate theoretical constructs</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Witte</surname>
<given-names>Erich H.</given-names>
</name>
<xref rid="aff1" ref-type="aff"><sup>1</sup></xref>
<xref rid="fn0011" ref-type="author-notes"><sup>&#x2020;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/483872/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Stanciu</surname>
<given-names>Adrian</given-names>
</name>
<xref rid="aff2" ref-type="aff"><sup>2</sup></xref>
<xref rid="fn0011" ref-type="author-notes"><sup>&#x2020;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/703575/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Zenker</surname>
<given-names>Frank</given-names>
</name>
<xref rid="aff3" ref-type="aff"><sup>3</sup></xref>
<xref rid="c001" ref-type="corresp"><sup>&#x002A;</sup></xref>
<xref rid="fn0011" ref-type="author-notes"><sup>&#x2020;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/379981/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Institute for Psychology, University of Hamburg</institution>, <addr-line>Hamburg</addr-line>, <country>Germany</country></aff>
<aff id="aff2"><sup>2</sup><institution>Data and Research on Society, GESIS-Leibniz Institute for the Social Sciences</institution>, <addr-line>Mannheim</addr-line>, <country>Germany</country></aff>
<aff id="aff3"><sup>3</sup><institution>Department of Philosophy, Bo&#x011F;azi&#x00E7;i University</institution>, <addr-line>Istanbul</addr-line>, <country>Turkey</country></aff>
<author-notes>
<fn id="fn0002" fn-type="edited-by"><p>Edited by: Luca Tummolini, National Research Council, Italy</p></fn>
<fn id="fn0003" fn-type="edited-by"><p>Reviewed by: Jan Ketil Arnulf, BI Norwegian Business School, Norway; Jean-Marie C. Chevalier, Universit&#x00E9; Paris-Est Cr&#x00E9;teil Val de Marne, France</p></fn>
<corresp id="c001">&#x002A;Correspondence: Frank Zenker, <email>fzenker@gmail.com</email></corresp>
<fn id="fn0011" fn-type="equal"><p><sup>&#x2020;</sup>ORCID: Erich H. Witte <ext-link ext-link-type="uri" xlink:href="https://orcid.org/0000-0002-9446-3211">https://orcid.org/0000-0002-9446-3211</ext-link></p><p>Adrian Stanciu <ext-link ext-link-type="uri" xlink:href="https://orcid.org/0000-0001-8149-7829">https://orcid.org/0000-0001-8149-7829</ext-link></p><p>Frank Zenker <ext-link ext-link-type="uri" xlink:href="https://orcid.org/0000-0001-7173-7964">https://orcid.org/0000-0001-7173-7964</ext-link></p></fn>
<fn id="fn0004" fn-type="other"><p>This article was submitted to Theoretical and Philosophical Psychology, a section of the journal Frontiers in Psychology</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>01</day>
<month>12</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>13</volume>
<elocation-id>980261</elocation-id>
<history>
<date date-type="received">
<day>28</day>
<month>06</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>19</day>
<month>10</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2022 Witte, Stanciu and Zenker.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Witte, Stanciu and Zenker</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>The identification of an empirically adequate theoretical construct requires determining whether a theoretically predicted effect is sufficiently similar to an observed effect. To this end, we propose a simple similarity measure, describe its application in different research designs, and use computer simulations to estimate the necessary sample size for a given observed effect. As our main example, we apply this measure to recent meta-analytical research on precognition. Results suggest that the evidential basis is too weak for a predicted precognition effect of <italic>d</italic> =&#x2009;0.20 to be considered empirically adequate. As additional examples, we apply this measure to object-level experimental data from dissonance theory and a recent crowdsourcing hypothesis test, as well as to meta-analytical data on the correlation of personality traits and life outcomes.</p>
</abstract>
<kwd-group>
<kwd>crowdsourcing hypothesis test</kwd>
<kwd>dissonance theory</kwd>
<kwd>empirical adequacy</kwd>
<kwd>Paul Meehl</kwd>
<kwd>meta-analysis</kwd>
<kwd>personality research</kwd>
<kwd>precognition</kwd>
<kwd>theory construction</kwd>
</kwd-group>
<contract-num rid="cn1">118C257</contract-num>
<contract-sponsor id="cn1">TUBITAK</contract-sponsor>
<counts>
<fig-count count="2"/>
<table-count count="2"/>
<equation-count count="4"/>
<ref-count count="60"/>
<page-count count="15"/>
<word-count count="11851"/>
</counts>
</article-meta>
</front>
<body>
<disp-quote>
<p>&#x201C;<italic>I am deliberately setting aside statistical significance testing</italic>, <italic>or the setting up of confidence intervals</italic> [&#x2026;]&#x201D;</p>
</disp-quote>
<p>(<xref ref-type="bibr" rid="ref37">Meehl, 1990</xref>; p. 128)</p>
<sec id="sec1" sec-type="intro">
<title>Introduction</title>
<p>As classical empirical findings fail to replicate and empirical studies often prove to be poorly conducted (<xref ref-type="bibr" rid="ref17">Gervais, 2021</xref>; <xref ref-type="bibr" rid="ref43">Nosek et al., 2022</xref>), the <italic>replication crisis</italic> or <italic>confidence crisis</italic> presents a major impasse for behavioral science (<xref ref-type="bibr" rid="ref14">Fleck, 1935</xref>; <xref ref-type="bibr" rid="ref29">Kuhn, 1962</xref>). While the motives for employing questionable research practices (<xref ref-type="bibr" rid="ref16">Gelman and Carlin, 2014</xref>; <xref ref-type="bibr" rid="ref15">Gelman, 2018</xref>) and the limitations of research methods (<xref ref-type="bibr" rid="ref24">Kerr, 1998</xref>) are increasingly better understood, most reform proposals today recommend <italic>transparency measures</italic> (e.g., study pre-registration or registered replications; <xref ref-type="bibr" rid="ref13">Fiedler and Prager, 2018</xref>; <xref ref-type="bibr" rid="ref27">Klein et al., 2018</xref>). Less frequently addressed is that scientific progress requires good theoretical constructs (<xref ref-type="bibr" rid="ref36">Meehl, 1978</xref>; <xref ref-type="bibr" rid="ref18">Gigerenzer, 1998</xref>; <xref ref-type="bibr" rid="ref40">Mi&#x0142;kowski et al., 2019</xref>; <xref ref-type="bibr" rid="ref42">Muthukrishna and Henrich, 2019</xref>; <xref ref-type="bibr" rid="ref44">Oberauer and Lewandowsky, 2019</xref>; <xref ref-type="bibr" rid="ref10">Eronen and Romeijn, 2020</xref>; <xref ref-type="bibr" rid="ref57">van Rooij and Baggio, 2020</xref>; <xref ref-type="bibr" rid="ref8">Cornelissen et al., 2021</xref>; <xref ref-type="bibr" rid="ref9">Eronen and Bringmann, 2021</xref>; <xref ref-type="bibr" rid="ref17">Gervais, 2021</xref>; <xref ref-type="bibr" rid="ref23">Irvine, 2021</xref>).</p>
<p>A good theoretical construct minimally allows for an empirically adequate prediction. A theoretical construct &#x201C;is empirically adequate exactly if what it says about the observable things and events in the world is true&#x2014;exactly if it &#x201C;saves [or captures] the phenomena&#x201D; (<xref ref-type="bibr" rid="ref56">van Fraassen, 1980</xref>, p. 12). In the context of experimental research, this means that the effect that is predicted by a theoretical construct must be sufficiently <italic>similar</italic> to a relevant observed effect.</p>
<p>The development of an empirically adequate construct depends on high-quality observations. But even observations of the highest quality cannot automatically generate a theoretical construct that offers a <italic>non-circular</italic> justification for why a future event occurs as predicted.<xref rid="fn0005" ref-type="fn"><sup>1</sup></xref> Because a theoretical construct must deductively entail its prediction <italic>before</italic> observations are made, a non-circular approach to predicting a phenomenon of interest thus requires a <italic>deductive</italic> approach to the development of empirical adequate theoretical constructs (<xref ref-type="bibr" rid="ref48">Popper, 1959</xref>; <xref ref-type="bibr" rid="ref31">Lakens, 2013</xref>; <xref ref-type="bibr" rid="ref32">Lakens et al., 2018</xref>),</p>
<p>We begin by summarizing why the empirical adequacy of a theoretical construct should be evaluated <italic>independently</italic> of statistical elements (<xref ref-type="bibr" rid="ref37">Meehl, 1990</xref>; p. 128) and review the shortcomings of extant evaluative approaches. To this end, we propose a new formal measure that is independent of statistical elements, thus enabling a <italic>direct</italic> comparison between theory and observation. The intended application for this measure is theory construction. To demonstrate its use value, we exemplarily evaluate recent meta-analytical findings on <italic>precognition</italic> (<xref ref-type="bibr" rid="ref3">Bem et al., 2016</xref>). Additional examples, as well as a description of how this measure can be applied under various research designs, are provided in <xref ref-type="supplementary-material" rid="SM1">Supplementary Appendix S1</xref>, <xref ref-type="supplementary-material" rid="SM1">S2</xref>.</p>
</sec>
<sec id="sec2">
<title>Summary</title>
<p>Evaluating whether a theoretical prediction agrees with observations requires a theory-accommodating approach. But if this approach combinates theoretical and statistical aspects, then the evaluative outcome depends on the <italic>variance</italic> of error-prone observations. Consequently, one cannot be sufficiently certain about the accuracy of observations to which the theoretical prediction is compared. Since this uncertainty transfers to the evaluative outcome, the question of whether a theoretical prediction agrees with observations should be addressed <italic>independently</italic> of how observations vary (<xref ref-type="bibr" rid="ref37">Meehl, 1990</xref>, <xref ref-type="bibr" rid="ref38">1992</xref>, <xref ref-type="bibr" rid="ref39">1997</xref>).</p>
<p>Yet the opposite holds if a standardized effect size measure such as Cohen&#x2019;s <italic>d</italic>&#x2009;=&#x2009;(<italic>m</italic><sub>1</sub>&#x2009;&#x2013;&#x2009;<italic>m</italic><sub>0</sub>)&#x2009;/&#x2009;<italic>s</italic> is used to quantify the observations. This measure combines the observed <italic>mean difference</italic> (<italic>m</italic><sub>1</sub>&#x2009;&#x2013;&#x2009;<italic>m</italic><sub>0</sub>) with the statistical element of the observed <italic>standard deviation</italic> (<italic>s</italic>). A theoretical construct, however, predicts only (<italic>m</italic><sub>1</sub>&#x2013;<italic>m</italic><sub>0</sub>), yet not <italic>s</italic>. This makes a standardized effect size measure an inappropriate formal tool to evaluate the empirical adequacy of a theoretical construct.</p>
<p>A theoretical construct contrasts most starkly with an inductive generalization that states a <italic>directional</italic> hypothesis. Because a directional hypothesis is informative only relative to its inductive basis, it can merely &#x201C;predict&#x201D; the pattern of past observations it subsumes. A theoretical construct, by contrast, is informative beyond this basis (see our note 1). Moreover, the construct must predict future observations not as a directional but as a <italic>point</italic>-specific effect. Otherwise, one simply cannot evaluate whether the theoretically predicted mean agrees with the observed mean.</p>
</sec>
<sec id="sec3">
<title>Shortcomings of the inductive strategy</title>
<sec id="sec4">
<title>Standard deviation</title>
<p>The observed standard deviation (<italic>s</italic>) is a measure of the variance of observations. The observed variance depends on the extent to which an empirical setting is subject to uncontrolled (random) influences. Other things being equal, empirical settings that are more rigorously controlled for (random) influences go along with reduced observed variance, i.e., a smaller <italic>s</italic>. Compared to a less rigorously controlled setting, therefore, the value of Cohen&#x2019;s <italic>d</italic>-measure increases.</p>
<p>Since the observed standard deviation quantifies the variance of error-prone observations, an observed effect must be related to a probability distribution. This process is known as <italic>standardization</italic>. Once standardized, the observed effect becomes a statistic of an entire sample of observations that can no longer be related <italic>directly</italic> to a theoretically predicted effect. With a standardized observed effect, therefore, one cannot evaluate the <italic>similarity</italic> between what a theoretical construct predicts and what a measurement instrument records. Instead, one evaluates the relative position of statistically transformed measurement scores on a measurement scale against a random distribution.</p>
<p>A statistical test relies on the observed standard deviation to evaluate whether the observed effect differs statistically significantly from a null hypothesis. A <italic>t</italic>-test, for instance, can often show that a <italic>large</italic> difference between the observed means in the experimental and the control group is statistically significant. But the standard deviation <italic>combines</italic> several causes that contribute to the observed variance (e.g., the sample selection process, the experimental implementation, the validity and the reliability of the in- and dependent variables, and the random influences on an empirical setting). Thus, a theoretically predicted and an observed effect may well agree. But if the observed effect depends on the observed standard deviation, then its statistical significance is an <italic>insufficient</italic> criterion to evaluate a theoretical construct as empirically adequate.</p>
</sec>
<sec id="sec5">
<title>Parameter estimation</title>
<p>Parameter estimation is an <italic>inductive</italic> strategy to separate systematic patterns from non-systematic noise in data. A parameter operates at the level of statistics rather than the level of measurement. &#x2018;Parameter&#x2019; thus refers not to the properties of observations but those of data (e.g., their central tendency as measured by the mean, or the strength of associations between variables as measured by correlation or regression coefficients). Since data provide the basis for a parameter estimate, its accuracy is informed by statistical procedures that evaluate the parameter against the observed variance. The latter results from the variation of behavioral responses and measurement shortcomings. A given measurement instrument, therefore, captures both a relevant phenomenon <italic>and</italic> random influences (e.g., due to participants&#x2019; salient memories, chronic moods, or even the weather).</p>
<p>This leads to three complications in estimating a parameter accurately. First, since perfectly error-free observations are impossible, the accuracy of a parameter must be evaluated against the observed variance by using statistical procedures (that rely on a significance level <italic>&#x03B1;</italic> and an associated probability level <italic>p</italic>). Such procedures are often subjective and need not be reliable (see <italic>p</italic>-harking, <xref ref-type="bibr" rid="ref24">Kerr, 1998</xref>; <italic>p</italic>-hacking, <xref ref-type="bibr" rid="ref53">Simmons et al., 2013</xref>). Crucially, statistical procedures cannot distinguish whether the observed variance results from measurement shortcomings or rather from uncontrolled (random) influences on an empirical setting.</p>
<p>Second, what matters for scientific discovery is the size of the parameter estimate. For instance, a small observed mean difference between people&#x2019;s political orientation that varies with color preferences presumably <italic>fails</italic> to be a substantially meaningful finding. Whereas a similarly small observed difference that varies with cultural background presumably would be substantially meaningful. This finding, however, should be further explored only if it is sufficiently large. But recent <italic>meta</italic>-meta-analyses (<xref ref-type="bibr" rid="ref45">Olsson-Collentine et al., 2020</xref>; <xref ref-type="bibr" rid="ref51">Schauer and Hedges, 2020</xref>; <xref ref-type="bibr" rid="ref34">Linden and H&#x00F6;nekopp, 2021</xref>) strongly suggest that individual published studies across different behavioral science domains typically report observed object-level effects that are small and homogenous (read: small <italic>d</italic>, small <italic>s</italic>) or medium-to-large and heterogeneous (read: large(r) <italic>d</italic>, large <italic>s</italic>). A small observed variance thus tends to go along with a small observed mean effect. Whereas the findings of individual object-level studies that are sufficiently large to be further explored go along with a large observed variance. This necessarily results in a <italic>vague</italic> impression of the parameter that an empirical adequate theoretical construct would have to predict.</p>
<p>Third, a parameter estimate is useful for theory construction only if its inductive basis accurately captures an observed effect in a relevant population. Considerations of test-power and sample representativity dictate the use of sufficiently large samples to discover systematic behavioral patterns (law of large numbers). In small samples, by contrast, these patterns are likely truncated by uncontrolled (random) influences, resulting in inaccurate parameter estimates. Generally, large samples allow for more accurate parameter estimates if the underlying distribution of observations is uniform.</p>
<p>Among the widely used tools to estimate parameters are Cohen&#x2019;s <italic>d</italic>-measure, confidence intervals, and tools that rely on inductive model fitting and probabilistic distributions.</p>
<sec id="sec6">
<title>Cohen&#x2019;s <italic>d-</italic>measure</title>
<p>The goal of null-hypothesis significance testing is to determine whether an observed object-level effect differs significantly from a random effect. Relative to a predefined significance level <italic>&#x03B1;</italic> and an associated probability level <italic>p</italic>, the statistical significance of an observed effect indicates the probability of observing this effect under the null hypothesis. But this says nothing about whether the null or the alternative hypothesis is true or whether the observed object-level effect is relevant for theory construction. For theory construction, therefore, the statistical significance of an observed object-level effect is merely a necessary criterion. In addition, publications should also report the observed object-level effect&#x2019;s size.</p>
<p>Among the available tools to calculate the observed effect size, standardized effect size measures are often preferred because they weigh the observed effect by the observed variance, thus providing a robustness check for the observed effect. As one of the most widely used measures in behavioral science (<xref ref-type="bibr" rid="ref50">Sch&#x00E4;fer and Schwarz, 2019</xref>), for instance, Cohen&#x2019;s standardized <italic>d</italic>-measure <italic>d</italic> =&#x2009;(<italic>m</italic><sub>1</sub> &#x2013; <italic>m</italic><sub>0</sub>)&#x2009;/&#x2009;<italic>s</italic> (<xref ref-type="bibr" rid="ref7">Cohen, 1977</xref>) weighs the observed mean difference (<italic>m</italic><sub>1</sub> &#x2013; <italic>m</italic><sub>0</sub>) between the experimental (<italic>m</italic><sub>1</sub>) and the control group (<italic>m</italic><sub>0</sub>) by the pooled standard deviation in both groups (<italic>s</italic>). It should be easy to see that, if (<italic>m</italic><sub>1</sub> &#x2013; <italic>m</italic><sub>0</sub>) is constant, then the <italic>d</italic>-value is sensitive to the observed variance captured by <italic>s</italic>.</p>
<p>Even if an experimental study that relies on the <italic>d</italic>-measure would report a very <italic>large</italic> statistically significant effect, this is insufficient to motivate the development of a theoretical construct for it. To be theorized, after all, is the <italic>true</italic> parameter, rather than its ratio to the observed variance. The main challenge thus is to tease apart the causes that contribute to the observed variance (see above). Standardized effect size measures, however, simply cannot meet this challenge, making them inappropriate tools for theory construction research. Therefore, an additional layer of scrutiny must address the confidence that an inductive parameter estimates the true parameter.</p>
</sec>
<sec id="sec7">
<title>Confidence intervals</title>
<p>A true parameter can be estimated with perfect accuracy only in theory. In praxis, (random) influences or measurement instrument shortcomings render a perfectly accurate parameter estimate unlikely. One can nevertheless state the parameter&#x2019;s <italic>expected</italic> accuracy using a <italic>confidence interval</italic> (CI), the width of which depends on the level of significance <italic>&#x03B1;</italic>. To determine the CI, one simultaneously considers the observed mean difference, the observed variance, the level of significance, and the sample size. This is formally given as <inline-formula><mml:math id="M1"><mml:mrow><mml:mi>C</mml:mi><mml:mi>I</mml:mi><mml:mo>=</mml:mo><mml:mi>d</mml:mi><mml:mo>&#x00B1;</mml:mo><mml:mi>z</mml:mi><mml:mo>&#x00D7;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mi>s</mml:mi><mml:mo>/</mml:mo><mml:mo>&#x221A;</mml:mo><mml:mi>n</mml:mi></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>.</p>
<p>Like Cohen&#x2019;s <italic>d</italic>-measure, however, a CI cannot determine whether a true effect (e.g., the mean difference between two groups in a population) was estimated accurately because also a CI combines the mean difference with the statistical element <italic>s</italic>. Thus, the observed variance once again results in a <italic>vague</italic> impression of the parameter. Generally, unless the causes that contribute to the observed variance can be teased apart, vague observations will undermine theory construction research. And the one possible way of teasing these causes apart is to increase the sample size.</p>
</sec>
<sec id="sec8">
<title>Inductive model fitting</title>
<p>Using inductive model fitting, researchers can address the complexity of human behavior by statistically modeling the associations between two or more estimated parameters, followed by testing the statistical model against a random model. Using various indexes (e.g., the Comparative Fit Index (CFI) or the Root Mean Square Error of Approximation (RMSEA)), a finite set of observations is compared against a class of statistical models (see the special issue on model selection, <xref ref-type="bibr" rid="ref002">Myung et al., 2000</xref>; <xref ref-type="bibr" rid="ref5">Burnham and Anderson, 2004</xref>). The model that best describes the data is said to be <italic>identified</italic> in the population (<xref ref-type="bibr" rid="ref4">Bollen et al., 2010</xref>).</p>
<p>Inductive model fitting presupposes a reconstruction of the variance&#x2013;covariance structure in the data. But fitting a statistical model to data inherits all attributes of the data (including errors due to measurement instrument shortcomings, uncontrolled random influences, non-uniform distributions, or outliers). So, although inductive model fitting improves over the estimation of a single parameter, its use-value for theory construction primarily depends on the quality of the data. Even the best-fitted model, however, cannot <italic>unequivocally</italic> tell meaningful data patterns from patterns owed to measurement instrument shortcomings or uncontrolled (random) influences. This holds regardless of whether the estimated parameter is statistically significant or whether the effect size is large. All an inductively fitted model can tell is whether data are described well.</p>
<p>Since model fitting is an <italic>iterative</italic> strategy, moreover, some parameters must be estimated before others, so that the associations between parameters can be specified to obtain a data-fitting model. The identification of the parameters that are to be estimated first would ideally rely on theoretical considerations. But when researchers fit a model to data, they instead often rely on <italic>p</italic>-harking or <italic>p</italic>-hacking strategies.</p>
</sec>
<sec id="sec9">
<title>Bayesian probabilistic distributions</title>
<p>In the Bayesian approach to parameter estimation, the known probability of past observations is assumed to estimate the probability of (predicted) future observations. A theoretical construct can thus be evaluated based on the prior probability of a statistical model (<xref ref-type="bibr" rid="ref58">Wagenmakers and Farell, 2004</xref>). The observed variance is here captured by the assumption that the theoretical construct is itself subject to variation. So, rather than evaluating the agreement between data and a single statistical model, Bayesians evaluate the agreement between data and a <italic>distribution</italic> of possible statistical models.</p>
<p>A theoretical construct is thus specified not as a single parameter, but as one that is embedded in a prior probability distribution (e.g., a normal or a Cauchy distribution). Of course, if this prior probability distribution accurately captures the true parameter, then a theoretical construct that is specified as a probability distribution may be useful for theory construction. What the <italic>true</italic> probability distribution is, however, one can never know. A Bayesian parameter estimate, therefore, depends not so much on the quality of the data, but more on a researcher&#x2019;s (subjective) assumptions about the prior probability distribution (see <xref ref-type="bibr" rid="ref28">Krefeld-Schwalb et al., 2018</xref>).</p>
<p>Since the theoretical construct is more likely to be associated with an upper and a lower probability bound than with a unique probability, the Bayesian approach to parameter estimation corresponds&#x2014;except for the distribution of possible theoretical parameters&#x2014;to the specification of a theoretical construct as an <italic>interval</italic> hypothesis (i.e., a two-point-hypothesis). Because the endpoints of this interval represent two distinct theoretical parameters, each endpoint must be <italic>separately</italic> evaluated against data. But the possibility of a separate evaluation of two theoretical parameters also shows that there is no genuine need to distribute them. After all, if the (subjective) <italic>a priori</italic> probabilities of both theoretical parameters are independent, then as one parameter is assigned probability 1, the other can be assigned probability 0.</p>
</sec>
</sec>
</sec>
<sec id="sec10">
<title>Toward a deductive strategy: Paul Meehl&#x2019;s corroboration index</title>
<p>In the context of theory construction research, probably the first in behavioral science to recognize a problem in relating the theoretically predicted effect to the sample statistic <italic>s</italic> was <xref ref-type="bibr" rid="ref37">Meehl (1990)</xref>. Against the background of <xref ref-type="bibr" rid="ref30">Lakatos&#x2019; (1978)</xref> &#x201C;core vs. protective belt&#x201D;-model of empirical theories&#x2014;which recognizes that making suitable adjustments to the protective belt can (in principle forever) deflect the empirically inadequate predictions that constitute a theory&#x2019;s <italic>falsification instances</italic> away from the core&#x2014;Meehl argued that a formal measure for the empirical adequacy of a theoretical construct should <italic>ignore s</italic>.</p>
<disp-quote>
<p>&#x201C;To construct a crude [corroboration-]index of a theory&#x2019;s [predictive] track record, one first amends the earlier Popper to the later Popper by shifting emphasis from falsification to verisimilitude. [&#x2026;] Meanwhile, we require of a candidate index that it somehow reflect how bad a numerical &#x201C;miss&#x201D; the experimenter chalks up against [the theory] T. [&#x2026;] We are examining the relationship between T and its track record in predicting numerical values of [a hypothesis] H, <italic>ignoring the stochastic slippage</italic> between H and the data set that is the main concern of the statistician.&#x201D;</p>
</disp-quote>
<p>(<xref ref-type="bibr" rid="ref37">Meehl, 1990</xref>; p. 128)</p>
<p>Meehl&#x2019;s corroboration index (<italic>C<sub>i</sub></italic>) is the following:</p>
<disp-formula id="E1"><label>(1)</label><mml:math id="M2"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mi>C</mml:mi><mml:mi>l</mml:mi></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x00D7;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mi>I</mml:mi><mml:mi>n</mml:mi></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mspace width="thickmathspace"/></mml:mrow></mml:math></disp-formula>
<p>where <italic>Cl</italic>&#x2009;=&#x2009;the closeness of observed data to the theoretical prediction;</p>
<p><italic>In</italic> =&#x2009;the intolerance of the theory (e.g., the standardized precision of a prediction).</p>
<p>These terms can be expanded:</p>
<disp-formula id="E2"><label>(2)</label><mml:math id="M3"><mml:mrow><mml:mi>C</mml:mi><mml:mi>l</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x2013;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mi>D</mml:mi><mml:mo>/</mml:mo><mml:mi>S</mml:mi></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula>
<p>where <italic>D</italic> =&#x2009;the deviation of observed data from the tolerance interval of the theory;</p>
<p><italic>S</italic> =&#x2009;&#x201C;Spielraum,&#x201D; i.e., the expected range of observed data regardless of whether the theory is true; and</p>
<disp-formula id="E3"><label>(3)</label><mml:math id="M4"><mml:mrow><mml:mi>I</mml:mi><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x2013;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mi>I</mml:mi><mml:mo>/</mml:mo><mml:mi>S</mml:mi></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula>
<p>where <italic>I</italic> =&#x2009;the interval tolerated by the theory (or the raw precision of a theoretical prediction).</p>
<p>For a given experiment, the index <italic>C<sub>i</sub></italic> is the product of the <italic>closeness</italic> of the data to the theoretical prediction (<italic>Cl</italic>) and the <italic>intolerance</italic> of a theory (<italic>In</italic>). Thus, large values of <italic>C<sub>i</sub></italic> are expected for an empirically adequate theoretical construct and small values of <italic>C<sub>i</sub></italic> for an empirically inadequate one. Although several critics considered the <italic>C<sub>i</sub></italic> measure overly complex (see the special issue of <italic>Psychological Inquiry</italic>, including <xref ref-type="bibr" rid="ref37">Meehl 1990</xref>), <xref ref-type="bibr" rid="ref38">Meehl (1992)</xref> rightly replied that formal measures are needed to develop empirically adequate theoretical constructs. Yet, Meehl&#x2019;s key insight&#x2014;that a formal measure to evaluate the empirical adequacy of a theoretical construct should <italic>ignore</italic> the statistical element <italic>s</italic>&#x2014;further awaits uptake. Researchers instead continue to rely on statistical considerations (e.g., CIs, <italic>t</italic>, <italic>d</italic>, etc.) or on model-fitting approaches that combine theoretical with statistical elements.</p>
<p>Heeding Meehl&#x2019;s insight, we propose the <italic>similarity index I</italic><sub>SIM</sub> as an alternative formal measure, one far simpler than <italic>C<sub>i</sub></italic>.</p>
</sec>
<sec id="sec11">
<title>The similarity index</title>
<p>As we saw, if a parameter is induced from an interval of observations, then the parameter captures the uncontrolled (random) influences on an empirical setting that are represented by <italic>s</italic>. Although this parameter may (misleadingly) be referred to as a theoretical construct, this construct is as <italic>vague</italic> as the underlying interval of observations is wide. An inductive parameter, therefore, is at most as informative as a two-point, directional alternative hypothesis (H<sub>1</sub>). But a directional alternative hypothesis cannot stand in the <italic>one-to-one</italic> relation between prediction and observation that is required to evaluate whether a theoretical construct is empirically adequate (<xref ref-type="bibr" rid="ref26">Klein, 2014</xref>; <xref ref-type="bibr" rid="ref54">Szucs and Ioannidis, 2017</xref>; <xref ref-type="bibr" rid="ref15">Gelman, 2018</xref>). Only a point-specific theoretical construct can do so.</p>
<p>For this reason, <xref ref-type="bibr" rid="ref37">Meehl (1990)</xref> argued that the evaluation of the empirical adequacy of a theoretical construct should ignore <italic>s</italic>. Once the evaluation is independent of <italic>s</italic>, it pertains only to the <italic>similarity</italic> between a predicted and an observed mean difference in a sample. This is precisely what the similarity index <italic>I</italic><sub>SIM</sub> captures (see 4).</p>
<disp-formula id="E4"><label>(4)</label><mml:math id="M5"><mml:mrow><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi mathvariant="normal">SIM</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mfenced close="|" open="|"><mml:mrow><mml:msub><mml:mi>m</mml:mi><mml:mrow><mml:mi mathvariant="normal">THEO</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi>m</mml:mi><mml:mn>0</mml:mn></mml:msub></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mfenced close="|" open="|"><mml:mrow><mml:msub><mml:mi>m</mml:mi><mml:mn>1</mml:mn></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi>m</mml:mi><mml:mn>0</mml:mn></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mfrac><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="normal">ES</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">THEO</mml:mi></mml:mrow></mml:msub><mml:mo>/</mml:mo><mml:msub><mml:mrow><mml:mi mathvariant="normal">ES</mml:mi></mml:mrow><mml:mrow><mml:mi mathvariant="normal">OBS</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></disp-formula>
<p>ES, effect size.</p>
<p><italic>m</italic><sub>THEO</sub>, the theoretically predicted mean.</p>
<p><italic>m</italic><sub>1</sub>, the observed mean in the treatment group.</p>
<p><italic>m</italic><sub>0</sub>, the observed mean in the control group.</p>
<p><italic>m</italic><sub>THEO</sub> &#x2013; <italic>m</italic><sub>0</sub>, the theoretically predicted mean difference (ES<sub>THEO</sub>).</p>
<p><italic>m</italic><sub>1</sub> &#x2013; <italic>m</italic><sub>0</sub>, the empirically observed mean difference (ES<sub>OBS</sub>).</p>
<p>A formal measure for the empirical adequacy of a theoretical construct should satisfy several criteria that are relevant to theory construction. First, an experimentally observed phenomenon must be independent of the measurement scale that a given measurement instrument presupposes. Second, any two phenomena that are recorded on distinct measurement scales must remain comparable. Third, observations must remain stable under theoretically plausible transformations.</p>
<p>But if different measurement scales are made comparable by a transformation into <italic>z</italic>-values, then recourse to the inductive element <italic>s</italic> entails that the measurement quality of the empirical setting is retained. A <italic>z</italic>-transformation thus <italic>inherits</italic> information originating from the uncontrolled (random) influences on an empirical setting. This is problematic for theory construction research because, given that <italic>s</italic> as a property of observations <italic>lacks</italic> a theoretical counterpart, recourse to <italic>s</italic> &#x201C;blurs&#x201D; the evaluation of the empirical adequacy of a theoretical construct.</p>
<p><italic>I</italic><sub>SIM</sub> uses a transformation that avoids <italic>s</italic>. The comparability of observations that are recorded on different measurement scales is guaranteed because a ratio of differences is invariant under the addition of a constant or multiplication by some factor.<xref rid="fn0006" ref-type="fn"><sup>2</sup></xref> <italic>I</italic><sub>SIM</sub> also guarantees that the direction of the observed effect can be interpreted. This matters for evaluating whether the observed effect leans toward the experimental or the control group. If the direction of the observed effect and the theoretically predicted effect agree, then <italic>I</italic><sub>SIM</sub> is invariant concerning the order of means. That the same mathematical signs (+, &#x2212;) now appear in the numerator and the denominator of <italic>I</italic><sub>SIM</sub> can be neglected. Whereas if the direction of the observed effect and that of the theoretically predicted effect differ, then distinct mathematical signs indicate that the prediction fails to agree with observations. In this case, <italic>I</italic><sub>SIM</sub> is set to 0.</p>
<p>Using <italic>I</italic><sub>SIM</sub>, the theoretically predicted effect can thus be compared <italic>directly</italic> to the observed effect. A direct comparison should arguably also apply if a theoretically predicted effect is compared to a meta-analytically estimated population effect that is aggregated from the results of independent replication studies. But the opposite is the case if this comparison relies on a standardized effect size measure such as Cohen&#x2019;s <italic>d</italic>, which is widely used for this purpose today. Sometimes, indeed, the observed <italic>d</italic>-value simply stands in for the estimated population effect.</p>
<p>The intended application for <italic>I</italic><sub>SIM</sub> is a rigorously controlled empirical setting where participants are randomly allocated to the experimental and the control group, respectively are randomly selected as study participants in a correlational study.<xref rid="fn0007" ref-type="fn"><sup>3</sup></xref> Since the use of this kind of setting to evaluate the empirical adequacy of a <italic>directional</italic> H<sub>1</sub> undermines all efforts at controlling the setting, a rigorously controlled empirical setting should exclusively serve to evaluate the high-risk prediction that only a point-specific theoretical construct can offer.</p>
<sec id="sec12">
<title>The similarity between theory and observations</title>
<p>The agreement between a theoretical prediction and observations is <italic>perfect</italic> if the ratio between both is one, i.e., ES<sub>THEO</sub>&#x2009;/&#x2009;ES<sub>OBS</sub> =&#x2009;1.00. A perfectly empirically adequate prediction, however, is a strong idealization because even the most rigorously controlled empirical setting is subject to some uncontrolled (random) influences and errors. So, even if a theoretical construct predicts a population effect perfectly (i.e., ES<sub>THEO</sub>&#x2009;=&#x2009;ES<sub>POP</sub>), a measurement instrument with imperfect reliability or random influences on an empirical setting do entail that the observed effect will be &#x201C;blurred.&#x201D; A formal measure for the empirical adequacy of a theoretical construct, therefore, can only <italic>approximate</italic> the agreement between a theoretical prediction and observations.</p>
<p>Analytically, the agreement between a theoretical prediction and observations varies between a <italic>match</italic> (<italic>I</italic><sub>SIM</sub>&#x2009;=&#x2009;1.00) and a <italic>mismatch</italic> in one of two directions (<italic>I</italic><sub>SIM</sub> =&#x2009;0 and <italic>I</italic><sub>SIM</sub> &#x003E;&#x2009;&#x003E;&#x2009;1). The reason for a mismatch&#x2014;namely whether the theoretical construct predicts an empirically inadequate effect or whether the observed effect is subject to random influences&#x2014;can be teased out by collecting additional data, i.e., by increasing the sample size <italic>n</italic>. If the values of <italic>I</italic><sub>SIM</sub> cluster around 1 as <italic>n</italic> increases, this indicates that the theoretically predicted effect approximately matches a relevant population effect (law of large numbers). As the observed effect thus progressively converges onto the population effect (ES<sub>OBS</sub> =&#x2009;ES<sub>POP</sub>), it can eventually be excluded that random influences account for the observations. Thus, one gains evidence that the theoretically predicted effect is <italic>empirically adequate</italic>. This case is perfect for theory construction because the theoretical construct can be adopted into a theory.</p>
<p>Whereas if values of <italic>I</italic><sub>SIM</sub> never cluster around 1 as <italic>n</italic> increases, then the theoretical prediction is <italic>empirically inadequate</italic>. This means one gains evidence that the theoretically predicted effect misrepresents the population effect, wherefore the theoretical construct requires adjustment. Subsequently, a new theoretically predicted effect must be separately evaluated using new observations.</p>
</sec>
<sec id="sec13">
<title>The similarity interval</title>
<p>Defining the range of acceptable deviations from a perfect match requires an interval of the form [<italic>x</italic>&#x2009;&#x003C;&#x2009;<italic>I</italic><sub>SIM</sub>&#x2009;=&#x2009;1.00&#x2009;&#x003C;&#x2009;<italic>y</italic>]. The purpose of this <italic>similarity interval</italic> (SI) is distinct from that of a <italic>confidence interval</italic> (CI). When a population effect (ES<sub>POP</sub>) is estimated from observations, a CI handles randomly distributed &#x201C;noise&#x201D; in an empirical setting by stating the interval within which ES<sub>POP</sub> is expected to lie to some predefined probability (see the section <italic>Parameter Estimation</italic>). The SI, by contrast, differentiates between evidence for and against the empirical adequacy of a theoretical construct by stating the probability that the theoretically predicted effect is similar to observations if a study is <italic>repeated</italic> numerous times.</p>
<p>The SI is motivated by two constraints. First, an empirically adequate theoretical construct must neither grossly under- nor grossly over-predict the population effect (ES<sub>THEO</sub>&#x2009;&#x2245;&#x2009;ES<sub>POP</sub>). Second, provided the first constraint holds, if the theoretically predicted effect keeps approximating the observed effect as the number of study repetitions increases, then the theoretically predicted effect becomes increasingly more promising as a parameter for theory construction because the prediction remains empirically adequate.</p>
<p>The SI particularly facilitates the identification of a <italic>preliminary</italic> match between a theoretically predicted effect (ES<sub>THEO</sub>) and an observed effect (ES<sub>OBS</sub>), because an <italic>I</italic><sub>SIM</sub>-based evaluation is fallible&#x2014;future studies may lead to an opposite evaluation. We define a preliminary match using an SI with bounds of [0.80;1.20]. If the <italic>I</italic><sub>SIM</sub> value lies within these bounds, then the theoretical prediction is preliminarily empirically adequate. The bounds [0.80;1.20] are informed by 10,000 simulated study repetitions (see the section <italic>Simulated Data and Results</italic>). For instance, given <italic>n</italic><sub>0</sub> = <italic>n</italic><sub>1</sub> =&#x2009;1,000 participants, our simulations show that if the population effect is a <italic>medium</italic> effect, ES<sub>POP</sub> =&#x2009;0.50, then <italic>I</italic><sub>SIM</sub>-values fall within this SI in approximately 99% of 10,000 study repetitions. And, given <italic>n</italic><sub>0</sub> = <italic>n</italic><sub>1</sub> =&#x2009;100 participants in each study condition, if the population effect is a <italic>large</italic> effect, ES<sub>POP</sub> =&#x2009;1.00, then <italic>I</italic><sub>SIM</sub>-values fall within the SI in approximately 95% of 10,000 repetitions.</p>
<p>Since a <italic>small</italic> sample suffices to detect a <italic>large</italic> population effect under small error-rates, whereas detecting a <italic>small</italic> population effect requires a <italic>large</italic> sample, the application of a 99%-SI to the small to medium effects that are normally observed in behavioral science would require unrealistically large samples (<xref ref-type="bibr" rid="ref34">Linden and H&#x00F6;nekopp, 2021</xref>). Given the conventional error rate of 5%, however, already a 95%-SI can suffice as an evidence-based criterion to decide whether a theoretical construct can be accepted as empirically adequate, whether it should be improved, or whether additional data should be collected.</p>
</sec>
<sec id="sec14">
<title>Simulated data and results</title>
<p>If simulations approximate the universe of possible observed effects, they are useful to explore the stability of effects that real studies would observe (see <xref ref-type="bibr" rid="ref41">Morris et al., 2019</xref>). Real observations are made in samples drawn from some population of interest. But researchers typically cannot access the entire population, neither in real life nor in simulations. To account for the ultimately unknown observed variance, real observations are treated statistically as a <italic>t</italic>-distribution, which is sensitive to <italic>n</italic>. As <italic>n</italic> increases, a <italic>t</italic>-distribution approximates the normal distribution that is expected for a population (central limit theorem).</p>
<p>We therefore simulated data from <italic>t</italic>-distributions in a universe of study settings that comprises 10,000 repeated individual studies of the same effect. A study setting is characterized by the means observed in the control (<italic>m</italic><sub>0</sub>) and the experimental group (<italic>m</italic><sub>1</sub>) and by the sample size (<italic>n</italic><sub>0</sub> = <italic>n</italic><sub>1</sub>). All simulations were conducted in R (<xref ref-type="bibr" rid="ref49">R Core Team, 2021</xref>) using the packages tidyverse (<xref ref-type="bibr" rid="ref59">Wickham et al., 2019</xref>), dplyr (<xref ref-type="bibr" rid="ref60">Wickham et al., 2021</xref>), and effsize (<xref ref-type="bibr" rid="ref55">Torchiano, 2020</xref>).</p>
<p>In the first of two basic scenarios, where the theoretically predicted effect <italic>matches</italic> the population effect (ES<sub>THEO</sub> =&#x2009;ES<sub>POP</sub>), the sample size of a study setting was <italic>n</italic><sub>0</sub> = <italic>n</italic><sub>1</sub> =&#x2009;20, 30, 50, 100, 300, or 1,000. In the control group the observed mean was null (<italic>m</italic><sub>0</sub> =&#x2009;0) and in the experimental group <italic>m</italic><sub>1</sub> =&#x2009;0.20, 0.50, 0.80, 1.00, 1.20, 1.40, 1.60, 1.80, or 2.00. In this way, we simulated 54 study settings times 10,000 repetitions, calculating the similarity index <italic>I</italic><sub>SIM</sub> separately for each repetition of a study setting (see formula 4). For the percentages of <italic>I</italic><sub>SIM</sub>-values falling inside and outside the similarity interval SI, see <xref rid="tab1" ref-type="table">Table 1</xref> and <xref rid="fig1" ref-type="fig">Figure 1</xref>.</p>
<table-wrap position="float" id="tab1">
<label>Table 1</label>
<caption>
<p>True predictions [ES<sub>THEO</sub>&#x2009;=&#x2009;ES<sub>POP</sub> equals (<italic>m</italic><sub>THEO</sub> &#x2013; <italic>m</italic><sub>0</sub>)&#x2009;=&#x2009;(<italic>m</italic><sub>POP</sub> &#x2013; <italic>m</italic><sub>0</sub>)]: expected <italic>I</italic><sub>SIM</sub>-values for varying values of <italic>m</italic><sub>THEO</sub> and <italic>n</italic>.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top"><italic>m</italic><sub>THEO</sub> = <italic>m</italic><sub>POP</sub></th>
<th align="left" valign="top"><italic>n</italic></th>
<th align="left" valign="top">&#x003C;0.6]</th>
<th align="left" valign="top">(0.6,0.7]</th>
<th align="left" valign="top">(0.7,0.8]</th>
<th align="left" valign="top">(0.8,0.9]</th>
<th align="left" valign="top">(0.9,1.0]</th>
<th align="left" valign="top">(1.0,1.1]</th>
<th align="left" valign="top">(1.1,1.2]</th>
<th align="left" valign="top">(1.2,1.3]</th>
<th align="left" valign="top">(1.3,1.4]</th>
<th align="left" valign="top">&#x003E;(1.4</th>
</tr>
</thead>
<tbody>
<tr>
<td align="char" valign="top" char=".">0.20</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">20</td>
<td align="char" valign="top" char="&#x00B1;">52.2</td>
<td align="char" valign="top" char="&#x00B1;">5.55</td>
<td align="char" valign="top" char="&#x00B1;">5.67</td>
<td align="char" valign="top" char="&#x00B1;">5.34</td>
<td align="char" valign="top" char="&#x00B1;">4.52</td>
<td align="char" valign="top" char="&#x00B1;">3.44</td>
<td align="char" valign="top" char="&#x00B1;">2.69</td>
<td align="char" valign="top" char="&#x00B1;">2.3</td>
<td align="char" valign="top" char="&#x00B1;">1.96</td>
<td align="char" valign="top" char="&#x00B1;">16.33</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">30</td>
<td align="char" valign="top" char="&#x00B1;">45.89</td>
<td align="char" valign="top" char="&#x00B1;">6.64</td>
<td align="char" valign="top" char="&#x00B1;">6.43</td>
<td align="char" valign="top" char="&#x00B1;">5.7</td>
<td align="char" valign="top" char="&#x00B1;">4.84</td>
<td align="char" valign="top" char="&#x00B1;">4.14</td>
<td align="char" valign="top" char="&#x00B1;">3.61</td>
<td align="char" valign="top" char="&#x00B1;">2.78</td>
<td align="char" valign="top" char="&#x00B1;">1.92</td>
<td align="char" valign="top" char="&#x00B1;">18.05</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">50</td>
<td align="char" valign="top" char="&#x00B1;">35.97</td>
<td align="char" valign="top" char="&#x00B1;">7.86</td>
<td align="char" valign="top" char="&#x00B1;">7.43</td>
<td align="char" valign="top" char="&#x00B1;">7.29</td>
<td align="char" valign="top" char="&#x00B1;">6.29</td>
<td align="char" valign="top" char="&#x00B1;">5.06</td>
<td align="char" valign="top" char="&#x00B1;">4.24</td>
<td align="char" valign="top" char="&#x00B1;">3.49</td>
<td align="char" valign="top" char="&#x00B1;">2.75</td>
<td align="char" valign="top" char="&#x00B1;">19.62</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">100</td>
<td align="char" valign="top" char="&#x00B1;">20.84</td>
<td align="char" valign="top" char="&#x00B1;">8.4</td>
<td align="char" valign="top" char="&#x00B1;">9.97</td>
<td align="char" valign="top" char="&#x00B1;">9.98</td>
<td align="char" valign="top" char="&#x00B1;">9</td>
<td align="char" valign="top" char="&#x00B1;">7</td>
<td align="char" valign="top" char="&#x00B1;">5.3</td>
<td align="char" valign="top" char="&#x00B1;">4.39</td>
<td align="char" valign="top" char="&#x00B1;">3.46</td>
<td align="char" valign="top" char="&#x00B1;">21.66</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">300</td>
<td align="char" valign="top" char="&#x00B1;">3.26</td>
<td align="char" valign="top" char="&#x00B1;">5.69</td>
<td align="char" valign="top" char="&#x00B1;">11.86</td>
<td align="char" valign="top" char="&#x00B1;">15.24</td>
<td align="char" valign="top" char="&#x00B1;">15.25</td>
<td align="char" valign="top" char="&#x00B1;">12.31</td>
<td align="char" valign="top" char="&#x00B1;">9.1</td>
<td align="char" valign="top" char="&#x00B1;">6.6</td>
<td align="char" valign="top" char="&#x00B1;">4.6</td>
<td align="char" valign="top" char="&#x00B1;">16.09</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">1,000</td>
<td align="char" valign="top" char="&#x00B1;">0.04</td>
<td align="char" valign="top" char="&#x00B1;">0.71</td>
<td align="char" valign="top" char="&#x00B1;">5.62</td>
<td align="char" valign="top" char="&#x00B1;">17.65</td>
<td align="char" valign="top" char="&#x00B1;">25.46</td>
<td align="char" valign="top" char="&#x00B1;">21.84</td>
<td align="char" valign="top" char="&#x00B1;">13.84</td>
<td align="char" valign="top" char="&#x00B1;">6.94</td>
<td align="char" valign="top" char="&#x00B1;">3.87</td>
<td align="char" valign="top" char="&#x00B1;">4.03</td>
</tr>
<tr>
<td align="char" valign="top" char=".">0.50</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">20</td>
<td align="char" valign="top" char="&#x00B1;">18.04</td>
<td align="char" valign="top" char="&#x00B1;">8.61</td>
<td align="char" valign="top" char="&#x00B1;">10.18</td>
<td align="char" valign="top" char="&#x00B1;">10.67</td>
<td align="char" valign="top" char="&#x00B1;">8.86</td>
<td align="char" valign="top" char="&#x00B1;">7.93</td>
<td align="char" valign="top" char="&#x00B1;">6.07</td>
<td align="char" valign="top" char="&#x00B1;">4.77</td>
<td align="char" valign="top" char="&#x00B1;">3.71</td>
<td align="char" valign="top" char="&#x00B1;">21.16</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">30</td>
<td align="char" valign="top" char="&#x00B1;">10.25</td>
<td align="char" valign="top" char="&#x00B1;">8.03</td>
<td align="char" valign="top" char="&#x00B1;">10.77</td>
<td align="char" valign="top" char="&#x00B1;">12.29</td>
<td align="char" valign="top" char="&#x00B1;">11.78</td>
<td align="char" valign="top" char="&#x00B1;">9.09</td>
<td align="char" valign="top" char="&#x00B1;">7.05</td>
<td align="char" valign="top" char="&#x00B1;">5.74</td>
<td align="char" valign="top" char="&#x00B1;">4.36</td>
<td align="char" valign="top" char="&#x00B1;">20.64</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">50</td>
<td align="char" valign="top" char="&#x00B1;">3.32</td>
<td align="char" valign="top" char="&#x00B1;">6.03</td>
<td align="char" valign="top" char="&#x00B1;">10.57</td>
<td align="char" valign="top" char="&#x00B1;">14.8</td>
<td align="char" valign="top" char="&#x00B1;">15.65</td>
<td align="char" valign="top" char="&#x00B1;">12.71</td>
<td align="char" valign="top" char="&#x00B1;">9.16</td>
<td align="char" valign="top" char="&#x00B1;">6.33</td>
<td align="char" valign="top" char="&#x00B1;">4.28</td>
<td align="char" valign="top" char="&#x00B1;">17.15</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">100</td>
<td align="char" valign="top" char="&#x00B1;">0.33</td>
<td align="char" valign="top" char="&#x00B1;">2.58</td>
<td align="char" valign="top" char="&#x00B1;">8.69</td>
<td align="char" valign="top" char="&#x00B1;">18.15</td>
<td align="char" valign="top" char="&#x00B1;">20.52</td>
<td align="char" valign="top" char="&#x00B1;">16.48</td>
<td align="char" valign="top" char="&#x00B1;">11.87</td>
<td align="char" valign="top" char="&#x00B1;">7.1</td>
<td align="char" valign="top" char="&#x00B1;">5.31</td>
<td align="char" valign="top" char="&#x00B1;">8.97</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">300</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.04</td>
<td align="char" valign="top" char="&#x00B1;">1.96</td>
<td align="char" valign="top" char="&#x00B1;">15.78</td>
<td align="char" valign="top" char="&#x00B1;">32.69</td>
<td align="char" valign="top" char="&#x00B1;">28.31</td>
<td align="char" valign="top" char="&#x00B1;">13.69</td>
<td align="char" valign="top" char="&#x00B1;">5.02</td>
<td align="char" valign="top" char="&#x00B1;">1.51</td>
<td align="char" valign="top" char="&#x00B1;">1</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">1,000</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">3.68</td>
<td align="char" valign="top" char="&#x00B1;">46.41</td>
<td align="char" valign="top" char="&#x00B1;">42.07</td>
<td align="char" valign="top" char="&#x00B1;">7.32</td>
<td align="char" valign="top" char="&#x00B1;">0.5</td>
<td align="char" valign="top" char="&#x00B1;">0.02</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td align="char" valign="top" char=".">0.80</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">20</td>
<td align="char" valign="top" char="&#x00B1;">3.82</td>
<td align="char" valign="top" char="&#x00B1;">5.49</td>
<td align="char" valign="top" char="&#x00B1;">11.44</td>
<td align="char" valign="top" char="&#x00B1;">15.23</td>
<td align="char" valign="top" char="&#x00B1;">14.87</td>
<td align="char" valign="top" char="&#x00B1;">12.24</td>
<td align="char" valign="top" char="&#x00B1;">8.93</td>
<td align="char" valign="top" char="&#x00B1;">7.06</td>
<td align="char" valign="top" char="&#x00B1;">4.26</td>
<td align="char" valign="top" char="&#x00B1;">16.66</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">30</td>
<td align="char" valign="top" char="&#x00B1;">1.1</td>
<td align="char" valign="top" char="&#x00B1;">4.02</td>
<td align="char" valign="top" char="&#x00B1;">10.5</td>
<td align="char" valign="top" char="&#x00B1;">16.51</td>
<td align="char" valign="top" char="&#x00B1;">18.03</td>
<td align="char" valign="top" char="&#x00B1;">15.48</td>
<td align="char" valign="top" char="&#x00B1;">10.74</td>
<td align="char" valign="top" char="&#x00B1;">7.25</td>
<td align="char" valign="top" char="&#x00B1;">4.96</td>
<td align="char" valign="top" char="&#x00B1;">11.41</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">50</td>
<td align="char" valign="top" char="&#x00B1;">0.08</td>
<td align="char" valign="top" char="&#x00B1;">1.26</td>
<td align="char" valign="top" char="&#x00B1;">7.74</td>
<td align="char" valign="top" char="&#x00B1;">17.78</td>
<td align="char" valign="top" char="&#x00B1;">23.74</td>
<td align="char" valign="top" char="&#x00B1;">19.29</td>
<td align="char" valign="top" char="&#x00B1;">12.41</td>
<td align="char" valign="top" char="&#x00B1;">7.53</td>
<td align="char" valign="top" char="&#x00B1;">3.96</td>
<td align="char" valign="top" char="&#x00B1;">6.21</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">100</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.05</td>
<td align="char" valign="top" char="&#x00B1;">2.42</td>
<td align="char" valign="top" char="&#x00B1;">16.08</td>
<td align="char" valign="top" char="&#x00B1;">30.82</td>
<td align="char" valign="top" char="&#x00B1;">26.69</td>
<td align="char" valign="top" char="&#x00B1;">14.27</td>
<td align="char" valign="top" char="&#x00B1;">6.06</td>
<td align="char" valign="top" char="&#x00B1;">2.24</td>
<td align="char" valign="top" char="&#x00B1;">1.37</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">300</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.11</td>
<td align="char" valign="top" char="&#x00B1;">6.03</td>
<td align="char" valign="top" char="&#x00B1;">44.34</td>
<td align="char" valign="top" char="&#x00B1;">39.03</td>
<td align="char" valign="top" char="&#x00B1;">9.41</td>
<td align="char" valign="top" char="&#x00B1;">1.02</td>
<td align="char" valign="top" char="&#x00B1;">0.05</td>
<td align="char" valign="top" char="&#x00B1;">0.01</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">1,000</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.28</td>
<td align="char" valign="top" char="&#x00B1;">49.5</td>
<td align="char" valign="top" char="&#x00B1;">49.18</td>
<td align="char" valign="top" char="&#x00B1;">1.04</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td align="char" valign="top" char=".">1.00</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">20</td>
<td align="char" valign="top" char="&#x00B1;">1.19</td>
<td align="char" valign="top" char="&#x00B1;">3.96</td>
<td align="char" valign="top" char="&#x00B1;">10.87</td>
<td align="char" valign="top" char="&#x00B1;">16.73</td>
<td align="char" valign="top" char="&#x00B1;">17.35</td>
<td align="char" valign="top" char="&#x00B1;">15.2</td>
<td align="char" valign="top" char="&#x00B1;">10.68</td>
<td align="char" valign="top" char="&#x00B1;">6.71</td>
<td align="char" valign="top" char="&#x00B1;">4.87</td>
<td align="char" valign="top" char="&#x00B1;">12.44</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">30</td>
<td align="char" valign="top" char="&#x00B1;">0.19</td>
<td align="char" valign="top" char="&#x00B1;">1.65</td>
<td align="char" valign="top" char="&#x00B1;">8.32</td>
<td align="char" valign="top" char="&#x00B1;">18.17</td>
<td align="char" valign="top" char="&#x00B1;">22.51</td>
<td align="char" valign="top" char="&#x00B1;">17.97</td>
<td align="char" valign="top" char="&#x00B1;">12.35</td>
<td align="char" valign="top" char="&#x00B1;">7.55</td>
<td align="char" valign="top" char="&#x00B1;">4.49</td>
<td align="char" valign="top" char="&#x00B1;">6.8</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">50</td>
<td align="char" valign="top" char="&#x00B1;">0.01</td>
<td align="char" valign="top" char="&#x00B1;">0.29</td>
<td align="char" valign="top" char="&#x00B1;">4.33</td>
<td align="char" valign="top" char="&#x00B1;">17.45</td>
<td align="char" valign="top" char="&#x00B1;">28.02</td>
<td align="char" valign="top" char="&#x00B1;">22.9</td>
<td align="char" valign="top" char="&#x00B1;">13.84</td>
<td align="char" valign="top" char="&#x00B1;">7.24</td>
<td align="char" valign="top" char="&#x00B1;">3.09</td>
<td align="char" valign="top" char="&#x00B1;">2.83</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">100</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.8</td>
<td align="char" valign="top" char="&#x00B1;">11.7</td>
<td align="char" valign="top" char="&#x00B1;">36.43</td>
<td align="char" valign="top" char="&#x00B1;">32.75</td>
<td align="char" valign="top" char="&#x00B1;">13.23</td>
<td align="char" valign="top" char="&#x00B1;">3.98</td>
<td align="char" valign="top" char="&#x00B1;">0.82</td>
<td align="char" valign="top" char="&#x00B1;">0.29</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">300</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">2.75</td>
<td align="char" valign="top" char="&#x00B1;">48.25</td>
<td align="char" valign="top" char="&#x00B1;">43.03</td>
<td align="char" valign="top" char="&#x00B1;">5.77</td>
<td align="char" valign="top" char="&#x00B1;">0.2</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">1,000</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.04</td>
<td align="char" valign="top" char="&#x00B1;">49.99</td>
<td align="char" valign="top" char="&#x00B1;">49.84</td>
<td align="char" valign="top" char="&#x00B1;">0.13</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td align="char" valign="top" char=".">1.20</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">20</td>
<td align="char" valign="top" char="&#x00B1;">0.17</td>
<td align="char" valign="top" char="&#x00B1;">2.07</td>
<td align="char" valign="top" char="&#x00B1;">8.31</td>
<td align="char" valign="top" char="&#x00B1;">18.05</td>
<td align="char" valign="top" char="&#x00B1;">21.51</td>
<td align="char" valign="top" char="&#x00B1;">17.93</td>
<td align="char" valign="top" char="&#x00B1;">12.61</td>
<td align="char" valign="top" char="&#x00B1;">7.18</td>
<td align="char" valign="top" char="&#x00B1;">4.25</td>
<td align="char" valign="top" char="&#x00B1;">7.92</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">30</td>
<td align="char" valign="top" char="&#x00B1;">0.03</td>
<td align="char" valign="top" char="&#x00B1;">0.57</td>
<td align="char" valign="top" char="&#x00B1;">5.43</td>
<td align="char" valign="top" char="&#x00B1;">18.69</td>
<td align="char" valign="top" char="&#x00B1;">25.74</td>
<td align="char" valign="top" char="&#x00B1;">21.59</td>
<td align="char" valign="top" char="&#x00B1;">13.15</td>
<td align="char" valign="top" char="&#x00B1;">7.29</td>
<td align="char" valign="top" char="&#x00B1;">3.51</td>
<td align="char" valign="top" char="&#x00B1;">4</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">50</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.02</td>
<td align="char" valign="top" char="&#x00B1;">2.03</td>
<td align="char" valign="top" char="&#x00B1;">15.88</td>
<td align="char" valign="top" char="&#x00B1;">31.69</td>
<td align="char" valign="top" char="&#x00B1;">27.75</td>
<td align="char" valign="top" char="&#x00B1;">14.34</td>
<td align="char" valign="top" char="&#x00B1;">5.33</td>
<td align="char" valign="top" char="&#x00B1;">1.87</td>
<td align="char" valign="top" char="&#x00B1;">1.09</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">100</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.25</td>
<td align="char" valign="top" char="&#x00B1;">9.25</td>
<td align="char" valign="top" char="&#x00B1;">39.33</td>
<td align="char" valign="top" char="&#x00B1;">37.34</td>
<td align="char" valign="top" char="&#x00B1;">11.53</td>
<td align="char" valign="top" char="&#x00B1;">1.95</td>
<td align="char" valign="top" char="&#x00B1;">0.33</td>
<td align="char" valign="top" char="&#x00B1;">0.02</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">300</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">1.15</td>
<td align="char" valign="top" char="&#x00B1;">48.05</td>
<td align="char" valign="top" char="&#x00B1;">47.62</td>
<td align="char" valign="top" char="&#x00B1;">3.12</td>
<td align="char" valign="top" char="&#x00B1;">0.06</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">1,000</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">49.58</td>
<td align="char" valign="top" char="&#x00B1;">50.4</td>
<td align="char" valign="top" char="&#x00B1;">0.02</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td align="char" valign="top" char=".">1.40</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">20</td>
<td align="char" valign="top" char="&#x00B1;">0.02</td>
<td align="char" valign="top" char="&#x00B1;">1.03</td>
<td align="char" valign="top" char="&#x00B1;">6.74</td>
<td align="char" valign="top" char="&#x00B1;">17.44</td>
<td align="char" valign="top" char="&#x00B1;">24.80</td>
<td align="char" valign="top" char="&#x00B1;">20.73</td>
<td align="char" valign="top" char="&#x00B1;">13.02</td>
<td align="char" valign="top" char="&#x00B1;">7.28</td>
<td align="char" valign="top" char="&#x00B1;">4.04</td>
<td align="char" valign="top" char="&#x00B1;">4.90</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">30</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.21</td>
<td align="char" valign="top" char="&#x00B1;">3.40</td>
<td align="char" valign="top" char="&#x00B1;">17.22</td>
<td align="char" valign="top" char="&#x00B1;">28.40</td>
<td align="char" valign="top" char="&#x00B1;">25.32</td>
<td align="char" valign="top" char="&#x00B1;">13.83</td>
<td align="char" valign="top" char="&#x00B1;">6.60</td>
<td align="char" valign="top" char="&#x00B1;">2.90</td>
<td align="char" valign="top" char="&#x00B1;">2.12</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">50</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.01</td>
<td align="char" valign="top" char="&#x00B1;">0.95</td>
<td align="char" valign="top" char="&#x00B1;">13.18</td>
<td align="char" valign="top" char="&#x00B1;">36.19</td>
<td align="char" valign="top" char="&#x00B1;">30.6</td>
<td align="char" valign="top" char="&#x00B1;">13.48</td>
<td align="char" valign="top" char="&#x00B1;">4.04</td>
<td align="char" valign="top" char="&#x00B1;">1.15</td>
<td align="char" valign="top" char="&#x00B1;">0.4</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">100</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.01</td>
<td align="char" valign="top" char="&#x00B1;">6.21</td>
<td align="char" valign="top" char="&#x00B1;">43.14</td>
<td align="char" valign="top" char="&#x00B1;">39.94</td>
<td align="char" valign="top" char="&#x00B1;">9.67</td>
<td align="char" valign="top" char="&#x00B1;">0.99</td>
<td align="char" valign="top" char="&#x00B1;">0.03</td>
<td align="char" valign="top" char="&#x00B1;">0.01</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">300</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.46</td>
<td align="char" valign="top" char="&#x00B1;">49.71</td>
<td align="char" valign="top" char="&#x00B1;">48.36</td>
<td align="char" valign="top" char="&#x00B1;">1.47</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">1,000</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">49.94</td>
<td align="char" valign="top" char="&#x00B1;">50.06</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td align="char" valign="top" char=".">1.60</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">20</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.39</td>
<td align="char" valign="top" char="&#x00B1;">4.62</td>
<td align="char" valign="top" char="&#x00B1;">17.25</td>
<td align="char" valign="top" char="&#x00B1;">27.52</td>
<td align="char" valign="top" char="&#x00B1;">23.28</td>
<td align="char" valign="top" char="&#x00B1;">13.94</td>
<td align="char" valign="top" char="&#x00B1;">6.78</td>
<td align="char" valign="top" char="&#x00B1;">3.32</td>
<td align="char" valign="top" char="&#x00B1;">2.90</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">30</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.08</td>
<td align="char" valign="top" char="&#x00B1;">2.02</td>
<td align="char" valign="top" char="&#x00B1;">15.50</td>
<td align="char" valign="top" char="&#x00B1;">33.03</td>
<td align="char" valign="top" char="&#x00B1;">27.58</td>
<td align="char" valign="top" char="&#x00B1;">13.67</td>
<td align="char" valign="top" char="&#x00B1;">5.17</td>
<td align="char" valign="top" char="&#x00B1;">1.96</td>
<td align="char" valign="top" char="&#x00B1;">0.99</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">50</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.44</td>
<td align="char" valign="top" char="&#x00B1;">10.87</td>
<td align="char" valign="top" char="&#x00B1;">38.88</td>
<td align="char" valign="top" char="&#x00B1;">34.33</td>
<td align="char" valign="top" char="&#x00B1;">12.22</td>
<td align="char" valign="top" char="&#x00B1;">2.79</td>
<td align="char" valign="top" char="&#x00B1;">0.38</td>
<td align="char" valign="top" char="&#x00B1;">0.09</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">100</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">4.06</td>
<td align="char" valign="top" char="&#x00B1;">45.70</td>
<td align="char" valign="top" char="&#x00B1;">42.55</td>
<td align="char" valign="top" char="&#x00B1;">7.25</td>
<td align="char" valign="top" char="&#x00B1;">0.43</td>
<td align="char" valign="top" char="&#x00B1;">0.01</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">300</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.1</td>
<td align="char" valign="top" char="&#x00B1;">50.43</td>
<td align="char" valign="top" char="&#x00B1;">48.8</td>
<td align="char" valign="top" char="&#x00B1;">0.67</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">1,000</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">50.4</td>
<td align="char" valign="top" char="&#x00B1;">49.6</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td align="char" valign="top" char=".">1.80</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">20</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.19</td>
<td align="char" valign="top" char="&#x00B1;">2.83</td>
<td align="char" valign="top" char="&#x00B1;">16.56</td>
<td align="char" valign="top" char="&#x00B1;">30.69</td>
<td align="char" valign="top" char="&#x00B1;">25.19</td>
<td align="char" valign="top" char="&#x00B1;">14.10</td>
<td align="char" valign="top" char="&#x00B1;">6.30</td>
<td align="char" valign="top" char="&#x00B1;">2.43</td>
<td align="char" valign="top" char="&#x00B1;">1.71</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">30</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.01</td>
<td align="char" valign="top" char="&#x00B1;">0.94</td>
<td align="char" valign="top" char="&#x00B1;">13.79</td>
<td align="char" valign="top" char="&#x00B1;">35.09</td>
<td align="char" valign="top" char="&#x00B1;">30.50</td>
<td align="char" valign="top" char="&#x00B1;">13.77</td>
<td align="char" valign="top" char="&#x00B1;">4.43</td>
<td align="char" valign="top" char="&#x00B1;">1.06</td>
<td align="char" valign="top" char="&#x00B1;">0.41</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">50</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.1</td>
<td align="char" valign="top" char="&#x00B1;">8.07</td>
<td align="char" valign="top" char="&#x00B1;">42.21</td>
<td align="char" valign="top" char="&#x00B1;">36.61</td>
<td align="char" valign="top" char="&#x00B1;">10.85</td>
<td align="char" valign="top" char="&#x00B1;">1.83</td>
<td align="char" valign="top" char="&#x00B1;">0.28</td>
<td align="char" valign="top" char="&#x00B1;">0.05</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">100</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">2.26</td>
<td align="char" valign="top" char="&#x00B1;">48.66</td>
<td align="char" valign="top" char="&#x00B1;">43.88</td>
<td align="char" valign="top" char="&#x00B1;">4.98</td>
<td align="char" valign="top" char="&#x00B1;">0.22</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">300</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.03</td>
<td align="char" valign="top" char="&#x00B1;">49.82</td>
<td align="char" valign="top" char="&#x00B1;">49.99</td>
<td align="char" valign="top" char="&#x00B1;">0.16</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">1,000</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">49.72</td>
<td align="char" valign="top" char="&#x00B1;">50.28</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td align="char" valign="top" char=".">2.00</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">20</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.11</td>
<td align="char" valign="top" char="&#x00B1;">2.01</td>
<td align="char" valign="top" char="&#x00B1;">15.78</td>
<td align="char" valign="top" char="&#x00B1;">32.42</td>
<td align="char" valign="top" char="&#x00B1;">27.64</td>
<td align="char" valign="top" char="&#x00B1;">14.13</td>
<td align="char" valign="top" char="&#x00B1;">5.23</td>
<td align="char" valign="top" char="&#x00B1;">1.82</td>
<td align="char" valign="top" char="&#x00B1;">0.86</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">30</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.49</td>
<td align="char" valign="top" char="&#x00B1;">11.42</td>
<td align="char" valign="top" char="&#x00B1;">38.40</td>
<td align="char" valign="top" char="&#x00B1;">32.96</td>
<td align="char" valign="top" char="&#x00B1;">12.47</td>
<td align="char" valign="top" char="&#x00B1;">3.33</td>
<td align="char" valign="top" char="&#x00B1;">0.71</td>
<td align="char" valign="top" char="&#x00B1;">0.22</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">50</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.07</td>
<td align="char" valign="top" char="&#x00B1;">6.51</td>
<td align="char" valign="top" char="&#x00B1;">44.16</td>
<td align="char" valign="top" char="&#x00B1;">39.04</td>
<td align="char" valign="top" char="&#x00B1;">9.13</td>
<td align="char" valign="top" char="&#x00B1;">1</td>
<td align="char" valign="top" char="&#x00B1;">0.08</td>
<td align="char" valign="top" char="&#x00B1;">0.01</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">100</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">1.45</td>
<td align="char" valign="top" char="&#x00B1;">48.14</td>
<td align="char" valign="top" char="&#x00B1;">46.84</td>
<td align="char" valign="top" char="&#x00B1;">3.49</td>
<td align="char" valign="top" char="&#x00B1;">0.08</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">300</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">50.68</td>
<td align="char" valign="top" char="&#x00B1;">49.21</td>
<td align="char" valign="top" char="&#x00B1;">0.11</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">1,000</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">50.62</td>
<td align="char" valign="top" char="&#x00B1;">49.38</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Cells state the percentages of I<sub>SIM</sub>-values falling within specific I<sub>SIM</sub> intervals for various sample sizes (n), based on 10,000 simulations per row; (, value not included; ], value included.</p>
</table-wrap-foot>
</table-wrap>
<fig position="float" id="fig1">
<label>Figure 1</label>
<caption>
<p>Values of <italic>I</italic><sub>SIM</sub> were calculated in 10,000 simulated study-settings with <italic>n</italic><sub>0</sub> = <italic>n</italic><sub>1</sub> =&#x2009;100 under the assumption that the theoretically predicted effect matches the population effect. Each row of this graph represents different values of <italic>m</italic><sub>THEO</sub>.</p>
</caption>
<graphic xlink:href="fpsyg-13-980261-g001.tif"/>
</fig>
<p>Findings are consistent with the claim that an empirical adequate theoretical construct is associated with values of <italic>I</italic><sub>SIM</sub> that fall inside the SI [0.80;1.20]. For example, values of <italic>I</italic><sub>SIM</sub> fall inside this SI in approximately 95% of study repetitions if the sample size is <italic>n</italic><sub>0</sub> = <italic>n</italic><sub>1</sub> =&#x2009;100 and if (<italic>m</italic><sub>1</sub> &#x2013; <italic>m</italic><sub>0</sub>)&#x2009;=&#x2009;1.00. In contrast, values of <italic>I</italic><sub>SIM</sub> fall inside this SI in approximately 67% of study repetitions given the same sample size and a smaller effect of (<italic>m</italic><sub>1</sub> &#x2013; <italic>m</italic><sub>0</sub>)&#x2009;=&#x2009;0.50. This suggests that <italic>n</italic><sub>0</sub> = <italic>n</italic><sub>1</sub> =&#x2009;100 suffices to evaluate a <italic>large</italic> theoretically predicted effect as preliminarily empirically adequate, whereas evaluating a <italic>small</italic> or <italic>medium</italic> theoretically predicted effect requires a considerably larger sample.</p>
<p>The second scenario, where the theoretically predicted effect <italic>failed</italic> to match the population effect (ES<sub>THEO</sub> &#x2260;&#x2009;ES<sub>POP</sub>), examined how false positive and false negative predictions fare in our simulated universe of study repetitions. A false positive prediction occurs if the theoretically predicted effect is mistakenly identified as matching the population effect. And a false negative prediction occurs if the value of <italic>I</italic><sub>SIM</sub> falls outside the SI despite the theoretically predicted effect matching the population effect. In this scenario, we simulated four study settings where the theoretically predicted effect varied from small to large, and the population effect was either over- or underestimated. Notice that the relevant quantity to guide the identification of an empirically adequate theoretical construct here is not the <italic>absolute</italic> probability of detecting an empirically (in-)adequate prediction, but the <italic>difference</italic> between the probabilities of detecting one or the other kind of prediction.</p>
<p>In each of the four study settings, the sample size was <italic>n</italic><sub>0</sub> = <italic>n</italic><sub>1</sub> =&#x2009;20, 30, 50, 100, 300 or 1,000. In two of the four study settings, the theoretically predicted effect <italic>overestimates</italic> the population effect. Setting 1 simulated data from <italic>t</italic>-distributions representing a population effect of ES<sub>POP</sub> =&#x2009;0.20, whereas the theoretically predicted effect was ES<sub>THEO</sub> =&#x2009;0.50. Setting 2 simulated data from <italic>t</italic>-distributions representing a population effect of ES<sub>POP</sub> =&#x2009;0.80, whereas the theoretically predicted effect was ES<sub>THEO</sub> =&#x2009;1.00. In the remaining two study settings, the theoretically predicted effect <italic>underestimates</italic> the population effect. Setting 3 simulated data from <italic>t</italic>-distributions representing a population effect of ES<sub>POP</sub> =&#x2009;0.80, whereas the theoretically predicted effect was ES<sub>THEO</sub> =&#x2009;0.20. Setting 4 simulated data from <italic>t</italic>-distributions representing a population effect of ES<sub>POP</sub> =&#x2009;1.20, whereas the theoretical effect was ES<sub>THEO</sub> =&#x2009;1.00. All four study settings were repeated 10,000 times. For the percentages of <italic>I</italic><sub>SIM</sub>-values falling inside and outside the SI, see <xref rid="tab2" ref-type="table">Table 2</xref> and <xref rid="fig2" ref-type="fig">Figure 2</xref>.</p>
<table-wrap position="float" id="tab2">
<label>Table 2</label>
<caption>
<p>False predictions (ES<sub>THEO</sub>&#x2009;&#x2260;&#x2009;ES<sub>POP</sub>): expected <italic>I</italic><sub>SIM</sub>-values given discrepancies between ES<sub>THEO</sub> (<italic>m</italic><sub>THEO</sub> &#x2013; <italic>m</italic><sub>0</sub>) and ES<sub>POP</sub> (<italic>m</italic><sub>POP</sub> &#x2013; <italic>m</italic><sub>0</sub>) for varying <italic>n</italic>.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top"><italic>m</italic><sub>THEO</sub> &#x2260; <italic>m</italic><sub>POP</sub></th>
<th align="left" valign="middle"><italic>n</italic></th>
<th align="left" valign="middle">&#x003C;0.6]</th>
<th align="left" valign="middle">(0.6,0.7]</th>
<th align="left" valign="middle">(0.7,0.8]</th>
<th align="left" valign="middle">(0.8,0.9]</th>
<th align="left" valign="middle">(0.9,1.0]</th>
<th align="left" valign="middle">(1.0,1.1]</th>
<th align="left" valign="middle">(1.1,1.2]</th>
<th align="left" valign="middle">(1.2,1.3]</th>
<th align="left" valign="middle">(1.3,1.4]</th>
<th align="left" valign="middle">&#x003E;(1.4</th>
</tr>
</thead>
<tbody>
<tr>
<td align="char" valign="top" char=".">0.50&#x2009;&#x2260;&#x2009;0.20</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">20</td>
<td align="char" valign="top" char="&#x00B1;">28.08</td>
<td align="char" valign="top" char="&#x00B1;">0.84</td>
<td align="char" valign="top" char="&#x00B1;">1.47</td>
<td align="char" valign="top" char="&#x00B1;">2.70</td>
<td align="char" valign="top" char="&#x00B1;">3.38</td>
<td align="char" valign="top" char="&#x00B1;">3.86</td>
<td align="char" valign="top" char="&#x00B1;">4.18</td>
<td align="char" valign="top" char="&#x00B1;">3.98</td>
<td align="char" valign="top" char="&#x00B1;">3.56</td>
<td align="char" valign="top" char="&#x00B1;">47.95</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">30</td>
<td align="char" valign="top" char="&#x00B1;">22.41</td>
<td align="char" valign="top" char="&#x00B1;">0.40</td>
<td align="char" valign="top" char="&#x00B1;">0.94</td>
<td align="char" valign="top" char="&#x00B1;">1.37</td>
<td align="char" valign="top" char="&#x00B1;">2.53</td>
<td align="char" valign="top" char="&#x00B1;">3.09</td>
<td align="char" valign="top" char="&#x00B1;">3.95</td>
<td align="char" valign="top" char="&#x00B1;">4.19</td>
<td align="char" valign="top" char="&#x00B1;">3.88</td>
<td align="char" valign="top" char="&#x00B1;">57.24</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">50</td>
<td align="char" valign="top" char="&#x00B1;">16.11</td>
<td align="char" valign="top" char="&#x00B1;">0.03</td>
<td align="char" valign="top" char="&#x00B1;">0.11</td>
<td align="char" valign="top" char="&#x00B1;">0.54</td>
<td align="char" valign="top" char="&#x00B1;">1.19</td>
<td align="char" valign="top" char="&#x00B1;">2.21</td>
<td align="char" valign="top" char="&#x00B1;">2.99</td>
<td align="char" valign="top" char="&#x00B1;">3.86</td>
<td align="char" valign="top" char="&#x00B1;">4.15</td>
<td align="char" valign="top" char="&#x00B1;">68.81</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">100</td>
<td align="char" valign="top" char="&#x00B1;">7.47</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.01</td>
<td align="char" valign="top" char="&#x00B1;">0.13</td>
<td align="char" valign="top" char="&#x00B1;">0.45</td>
<td align="char" valign="top" char="&#x00B1;">1.04</td>
<td align="char" valign="top" char="&#x00B1;">1.93</td>
<td align="char" valign="top" char="&#x00B1;">3.29</td>
<td align="char" valign="top" char="&#x00B1;">85.68</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">300</td>
<td align="char" valign="top" char="&#x00B1;">0.59</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.09</td>
<td align="char" valign="top" char="&#x00B1;">0.39</td>
<td align="char" valign="top" char="&#x00B1;">98.93</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">1,000</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">100</td>
</tr>
<tr>
<td align="char" valign="top" char=".">0.50&#x2009;&#x2260;&#x2009;0.80</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">20</td>
<td align="char" valign="top" char="&#x00B1;">45.21</td>
<td align="char" valign="top" char="&#x00B1;">18.25</td>
<td align="char" valign="top" char="&#x00B1;">13.53</td>
<td align="char" valign="top" char="&#x00B1;">8.23</td>
<td align="char" valign="top" char="&#x00B1;">5.09</td>
<td align="char" valign="top" char="&#x00B1;">3.08</td>
<td align="char" valign="top" char="&#x00B1;">1.66</td>
<td align="char" valign="top" char="&#x00B1;">1.15</td>
<td align="char" valign="top" char="&#x00B1;">0.76</td>
<td align="char" valign="top" char="&#x00B1;">3.04</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">30</td>
<td align="char" valign="top" char="&#x00B1;">44.90</td>
<td align="char" valign="top" char="&#x00B1;">21.76</td>
<td align="char" valign="top" char="&#x00B1;">15.61</td>
<td align="char" valign="top" char="&#x00B1;">8.48</td>
<td align="char" valign="top" char="&#x00B1;">4.16</td>
<td align="char" valign="top" char="&#x00B1;">2.19</td>
<td align="char" valign="top" char="&#x00B1;">0.78</td>
<td align="char" valign="top" char="&#x00B1;">0.67</td>
<td align="char" valign="top" char="&#x00B1;">0.33</td>
<td align="char" valign="top" char="&#x00B1;">1.12</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">50</td>
<td align="char" valign="top" char="&#x00B1;">42.86</td>
<td align="char" valign="top" char="&#x00B1;">28.41</td>
<td align="char" valign="top" char="&#x00B1;">17.45</td>
<td align="char" valign="top" char="&#x00B1;">6.74</td>
<td align="char" valign="top" char="&#x00B1;">2.66</td>
<td align="char" valign="top" char="&#x00B1;">1.1</td>
<td align="char" valign="top" char="&#x00B1;">0.43</td>
<td align="char" valign="top" char="&#x00B1;">0.17</td>
<td align="char" valign="top" char="&#x00B1;">0.07</td>
<td align="char" valign="top" char="&#x00B1;">0.11</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">100</td>
<td align="char" valign="top" char="&#x00B1;">38.55</td>
<td align="char" valign="top" char="&#x00B1;">39.89</td>
<td align="char" valign="top" char="&#x00B1;">17.09</td>
<td align="char" valign="top" char="&#x00B1;">3.7</td>
<td align="char" valign="top" char="&#x00B1;">0.6</td>
<td align="char" valign="top" char="&#x00B1;">0.13</td>
<td align="char" valign="top" char="&#x00B1;">0.01</td>
<td align="char" valign="top" char="&#x00B1;">0.02</td>
<td align="char" valign="top" char="&#x00B1;">0.01</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">300</td>
<td align="char" valign="top" char="&#x00B1;">31.46</td>
<td align="char" valign="top" char="&#x00B1;">59.73</td>
<td align="char" valign="top" char="&#x00B1;">8.68</td>
<td align="char" valign="top" char="&#x00B1;">0.13</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">1,000</td>
<td align="char" valign="top" char="&#x00B1;">19.03</td>
<td align="char" valign="top" char="&#x00B1;">80.32</td>
<td align="char" valign="top" char="&#x00B1;">0.65</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td align="char" valign="top" char=".">1.00&#x2009;&#x2260;&#x2009;0.80</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">20</td>
<td align="char" valign="top" char="&#x00B1;">0.80</td>
<td align="char" valign="top" char="&#x00B1;">0.69</td>
<td align="char" valign="top" char="&#x00B1;">2.53</td>
<td align="char" valign="top" char="&#x00B1;">5.76</td>
<td align="char" valign="top" char="&#x00B1;">10.07</td>
<td align="char" valign="top" char="&#x00B1;">12.96</td>
<td align="char" valign="top" char="&#x00B1;">12.61</td>
<td align="char" valign="top" char="&#x00B1;">10.64</td>
<td align="char" valign="top" char="&#x00B1;">8.69</td>
<td align="char" valign="top" char="&#x00B1;">35.25</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">30</td>
<td align="char" valign="top" char="&#x00B1;">0.14</td>
<td align="char" valign="top" char="&#x00B1;">0.16</td>
<td align="char" valign="top" char="&#x00B1;">0.97</td>
<td align="char" valign="top" char="&#x00B1;">4.04</td>
<td align="char" valign="top" char="&#x00B1;">9.52</td>
<td align="char" valign="top" char="&#x00B1;">13.54</td>
<td align="char" valign="top" char="&#x00B1;">14.83</td>
<td align="char" valign="top" char="&#x00B1;">13.22</td>
<td align="char" valign="top" char="&#x00B1;">11.21</td>
<td align="char" valign="top" char="&#x00B1;">32.37</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">50</td>
<td align="char" valign="top" char="&#x00B1;">0.03</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.09</td>
<td align="char" valign="top" char="&#x00B1;">1.51</td>
<td align="char" valign="top" char="&#x00B1;">6.88</td>
<td align="char" valign="top" char="&#x00B1;">14.29</td>
<td align="char" valign="top" char="&#x00B1;">18.59</td>
<td align="char" valign="top" char="&#x00B1;">17.29</td>
<td align="char" valign="top" char="&#x00B1;">13.49</td>
<td align="char" valign="top" char="&#x00B1;">27.83</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">100</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.11</td>
<td align="char" valign="top" char="&#x00B1;">1.99</td>
<td align="char" valign="top" char="&#x00B1;">11.39</td>
<td align="char" valign="top" char="&#x00B1;">22.93</td>
<td align="char" valign="top" char="&#x00B1;">24.65</td>
<td align="char" valign="top" char="&#x00B1;">17.87</td>
<td align="char" valign="top" char="&#x00B1;">21.06</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">300</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.07</td>
<td align="char" valign="top" char="&#x00B1;">2.97</td>
<td align="char" valign="top" char="&#x00B1;">25.10</td>
<td align="char" valign="top" char="&#x00B1;">42.35</td>
<td align="char" valign="top" char="&#x00B1;">22.06</td>
<td align="char" valign="top" char="&#x00B1;">7.45</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">1,000</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.03</td>
<td align="char" valign="top" char="&#x00B1;">14.63</td>
<td align="char" valign="top" char="&#x00B1;">68.47</td>
<td align="char" valign="top" char="&#x00B1;">16.37</td>
<td align="char" valign="top" char="&#x00B1;">0.50</td>
</tr>
<tr>
<td align="char" valign="top" char=".">1.00&#x2009;&#x2260;&#x2009;1.20</td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">20</td>
<td align="char" valign="top" char="&#x00B1;">4.72</td>
<td align="char" valign="top" char="&#x00B1;">13.52</td>
<td align="char" valign="top" char="&#x00B1;">23.39</td>
<td align="char" valign="top" char="&#x00B1;">23.79</td>
<td align="char" valign="top" char="&#x00B1;">15.41</td>
<td align="char" valign="top" char="&#x00B1;">8.48</td>
<td align="char" valign="top" char="&#x00B1;">4.53</td>
<td align="char" valign="top" char="&#x00B1;">2.37</td>
<td align="char" valign="top" char="&#x00B1;">1.66</td>
<td align="char" valign="top" char="&#x00B1;">2.13</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">30</td>
<td align="char" valign="top" char="&#x00B1;">1.99</td>
<td align="char" valign="top" char="&#x00B1;">11.73</td>
<td align="char" valign="top" char="&#x00B1;">27.04</td>
<td align="char" valign="top" char="&#x00B1;">27.82</td>
<td align="char" valign="top" char="&#x00B1;">17.23</td>
<td align="char" valign="top" char="&#x00B1;">8.00</td>
<td align="char" valign="top" char="&#x00B1;">3.36</td>
<td align="char" valign="top" char="&#x00B1;">1.59</td>
<td align="char" valign="top" char="&#x00B1;">0.64</td>
<td align="char" valign="top" char="&#x00B1;">0.60</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">50</td>
<td align="char" valign="top" char="&#x00B1;">0.26</td>
<td align="char" valign="top" char="&#x00B1;">7.22</td>
<td align="char" valign="top" char="&#x00B1;">29.28</td>
<td align="char" valign="top" char="&#x00B1;">36.03</td>
<td align="char" valign="top" char="&#x00B1;">19.12</td>
<td align="char" valign="top" char="&#x00B1;">6.02</td>
<td align="char" valign="top" char="&#x00B1;">1.53</td>
<td align="char" valign="top" char="&#x00B1;">0.37</td>
<td align="char" valign="top" char="&#x00B1;">0.09</td>
<td align="char" valign="top" char="&#x00B1;">0.08</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">100</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">2.13</td>
<td align="char" valign="top" char="&#x00B1;">28.44</td>
<td align="char" valign="top" char="&#x00B1;">50.06</td>
<td align="char" valign="top" char="&#x00B1;">17.3</td>
<td align="char" valign="top" char="&#x00B1;">1.87</td>
<td align="char" valign="top" char="&#x00B1;">0.19</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.01</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">300</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0.02</td>
<td align="char" valign="top" char="&#x00B1;">19.69</td>
<td align="char" valign="top" char="&#x00B1;">73.84</td>
<td align="char" valign="top" char="&#x00B1;">6.41</td>
<td align="char" valign="top" char="&#x00B1;">0.04</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
<tr>
<td/>
<td align="char" valign="top" char="&#x00B1;">1,000</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">5.93</td>
<td align="char" valign="top" char="&#x00B1;">93.82</td>
<td align="char" valign="top" char="&#x00B1;">0.25</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
<td align="char" valign="top" char="&#x00B1;">0</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Cells state the percentages of <italic>I</italic><sub>SIM</sub>-values falling within specific <italic>I</italic><sub>SIM</sub> intervals for various sample sizes <italic>(n)</italic>, based on 10,000 simulations per row; (, value not included; ], value included; <italic>m</italic><sub>POP</sub>, the true value in the population (i.e., the numerator of <italic>I</italic><sub>SIM</sub>); <italic>m</italic><sub>THEO</sub>, the theoretical value in the population (i.e., the denominator of <italic>I</italic><sub>SIM</sub>).</p>
</table-wrap-foot>
</table-wrap>
<fig position="float" id="fig2">
<label>Figure 2</label>
<caption>
<p>Values of <italic>I</italic><sub>SIM</sub> were calculated in 10,000 simulated study-settings with <italic>n</italic><sub>0</sub>&#x2009;=&#x2009;<italic>n</italic><sub>1</sub>&#x2009;=&#x2009;100 under the assumption that the theoretically predicted effect does not match the population effect. Each row of this graph represents different combinations of <italic>m</italic><sub>THEO</sub> and <italic>m</italic><sub>POP</sub>.</p>
</caption>
<graphic xlink:href="fpsyg-13-980261-g002.tif"/>
</fig>
<p>We first turn to cases where the theoretically predicted effect <italic>overestimates</italic> the population effect. Given a sample size of <italic>n</italic><sub>0</sub> = <italic>n</italic><sub>1</sub> =&#x2009;100, values of <italic>I</italic><sub>SIM</sub> fall inside the SI in approximately 2% of repetitions of setting 1 (ES<sub>POP</sub> =&#x2009;0.20, ES<sub>THEO</sub> =&#x2009;0.50), compared to approximately 31% of repetitions of a study setting where the theoretically predicted effect matches the population effect (ES<sub>THEO</sub> =&#x2009;ES<sub>POP</sub> =&#x2009;0.20). The 29% difference between false positive and true positives predictions increases as <italic>n</italic> increases (see <xref rid="tab1" ref-type="table">Tables 1</xref>, <xref rid="tab2" ref-type="table">2</xref>). For the 2% of false positive predictions, the decision is clear: the theoretical construct requires adjustment. Whereas in case of the 31% true positive predictions, the identification of an empirically adequate construct would benefit from increasing <italic>n</italic>.</p>
<p>Further, given a sample size of <italic>n</italic><sub>0</sub> = <italic>n</italic><sub>1</sub> =&#x2009;100, values of <italic>I</italic><sub>SIM</sub> fall inside the SI in approximately 36% of repetitions of setting 2 (ES<sub>POP</sub> =&#x2009;0.80, ES<sub>THEO</sub> =&#x2009;1.00), compared to approximately 88% of repetitions of a study setting where the theoretically predicted effect matches the population effect (ES<sub>THEO</sub> =&#x2009;ES<sub>POP</sub> =&#x2009;0.80). The 53% difference between false positive and true positive predictions increases as <italic>n</italic> increases. In both cases, however, the decision to adjust the theoretical construct requires considerably larger samples to clearly distinguish a true positive from a false positive prediction.</p>
<p>We now turn to cases where the theoretically predicted effect <italic>underestimates</italic> the population effect. Given a sample size of <italic>n</italic><sub>0</sub> = <italic>n</italic><sub>1</sub> =&#x2009;100, values of <italic>I</italic><sub>SIM</sub> fall inside the SI in approximately 4% of repetitions of setting 1 (ES<sub>POP</sub> =&#x2009;0.80, ES<sub>THEO</sub> =&#x2009;0.20), compared to approximately 88% of repetitions of a study setting where the theoretically predicted effect matches the population effect (ES<sub>THEO</sub> =&#x2009;ES<sub>POP</sub> =&#x2009;0.80). The 84% difference between true positives and false positive predictions arguably suffices to evaluate the theoretical construct as empirically inadequate.</p>
<p>Finally, given a sample size of <italic>n</italic><sub>0</sub> = <italic>n</italic><sub>1</sub> =&#x2009;100, values of <italic>I</italic><sub>SIM</sub> fall inside the SI in approximately 69% of repetitions of setting 1 (ES<sub>POP</sub> =&#x2009;1.20, ES<sub>THEO</sub> =&#x2009;1.00), compared to approximately 97% of repetitions of a study setting where the theoretically predicted effect matches the population effect (ES<sub>THEO</sub> =&#x2009;ES<sub>POP</sub> =&#x2009;1.20). The 28% difference between false positives and true positive predictions suggests that it is more likely that values of <italic>I</italic><sub>SIM</sub> fall inside the SI if the theoretical prediction matches the population effect than otherwise.</p>
<p>We proceed to exemplify the application of <italic>I</italic><sub>SIM</sub> with a case study. Additional examples are provided in <xref ref-type="supplementary-material" rid="SM1">Supplementary Appendix S2</xref>.</p>
</sec>
<sec id="sec15">
<title>Case study: The psi-effect</title>
<p>The question of whether humans can cognize the future (aka <italic>precognition</italic> or <italic>psi-</italic>effect) has interested several scholars in psychology. The authors of the largest meta-analysis on the psi-effect to date (<xref ref-type="bibr" rid="ref3">Bem et al., 2016</xref>), comprising 90 experimental studies of which 51 are peer-reviewed (see <xref ref-type="bibr" rid="ref3">Bem et al., 2016</xref>; <xref ref-type="supplementary-material" rid="SM1">Supplementary Table S1</xref>), claim to have obtained decisive evidence <italic>for</italic> a psi-effect. Whereas some concluded from this that the psi-effect is real (e.g., <xref ref-type="bibr" rid="ref6">Cardena, 2018</xref>), others argued that Bem et al.&#x2019;s (2016) meta-analytical data leave it too unlikely that the psi-effect is real (e.g., <xref ref-type="bibr" rid="ref62">Witte and Zenker, 2017</xref>).</p>
<p>Across the 51 peer-reviewed object-level psi-studies, the observed effect ranges from <italic>d</italic> =&#x2009;0.02 to <italic>d</italic> =&#x2009;0.21 (<xref ref-type="bibr" rid="ref3">Bem et al., 2016</xref>). These two values describe a ratio of 1 : 9.7, indicating that the observed object-level effects are very heterogeneous. The heterogeneity of the observed object-level effects may suggest that the <italic>average</italic> psi-effect should be evaluated by combining a statistical inference strategy with an error account (<xref ref-type="bibr" rid="ref35">Lord and Novick, 1968</xref>). This evaluation, however, would remain sensitive to how <italic>n</italic> and <italic>s</italic> vary across individual studies. But as statistical parameters, <italic>n</italic> and <italic>s</italic> lack theoretical meaning. Particularly <italic>s</italic> is merely a normalization factor to render several object-level effects comparable.</p>
<p>To achieve an evaluation that is independent of how <italic>n</italic> and <italic>s</italic> vary across the object-level studies, one should rather compare the point-specific ES<sub>THEO</sub> <italic>directly</italic> to the point-specific ES<sub>OBS</sub> in each study, <italic>without</italic> averaging the effect. To this end, Bem et al.&#x2019;s (2016; <xref ref-type="supplementary-material" rid="SM1">Supplementary Table A1</xref>) meta-analytical findings can be re-analyze as follows:</p>
<list list-type="order">
<list-item>
<p>As Bem himself proposed (<xref ref-type="bibr" rid="ref2">Bem, 2011</xref>, p. 409, note 1), the theoretical psi-effect is specified as <italic>d</italic><sub>THEO</sub> =&#x2009;0.20 using a scale of <italic>z</italic>-values where <italic>s</italic> =&#x2009;1. Consequently, <italic>d</italic><sub>THEO</sub> =&#x2009;ES<sub>THEO</sub>. (A theoretical construct cannot reasonably predict a smaller psi-effect because it would be overlain by the standard measurement error.)</p>
</list-item>
<list-item>
<p>To control for the quality of the object-level studies, we exclude the 49 non-peer-reviewed object-level studies, retaining the 51 peer-reviewed ones (see <xref ref-type="bibr" rid="ref3">Bem et al., 2016</xref>, <xref ref-type="supplementary-material" rid="SM1">Supplementary Table S1</xref>).</p>
</list-item>
<list-item>
<p>To eliminate the variation of <italic>s</italic>, the mean difference (<italic>m</italic><sub>1</sub> &#x2212; <italic>m</italic><sub>0</sub>) is calculated by multiplying the instance of ES<sub>OBS</sub> in each peer-reviewed object-level study with that study&#x2019;s observed <italic>s</italic>. This yields ES<sub>OBS</sub> =&#x2009;(<italic>m</italic><sub>1</sub> &#x2212; <italic>m</italic><sub>0</sub>)&#x2009;/&#x2009;<italic>s</italic>, where <italic>s</italic> =&#x2009;1.</p>
</list-item>
<list-item>
<p>For each peer-reviewed object-level study, <italic>I</italic><sub>SIM</sub> is computed as follows: (a) <italic>I</italic><sub>SIM</sub> =&#x2009;0 if the mean difference is negative; (b) <italic>I</italic><sub>SIM</sub> is undefined if the between-group ES<sub>OBS</sub>-difference (treatment vs. control) is 0; otherwise, since <italic>s</italic> =&#x2009;1, (c) <italic>I</italic><sub>SIM</sub> =&#x2009;(ES<sub>THEO</sub> =&#x2009;0.20&#x2009;&#x00D7; <italic>s</italic>) / (ES<sub>OBS</sub> &#x00D7; <italic>s</italic>)&#x2009;=&#x2009;(0.20&#x2009;/&#x2009;ES<sub>OBS</sub>).</p>
</list-item>
</list>
<p>Because <italic>s</italic> has been eliminated, the 95%-SI [0.80;1.20] can be applied to each peer-reviewed object-level study individually. The two relevant parameters are ES<sub>THEO</sub> = <italic>d</italic><sub>THEO</sub> =&#x2009;0.20 relative to the sample size of an object-level study, and the percentage of ES<sub>OBS</sub>-instances that fall inside the 95%-SI given ES<sub>THEO</sub> = <italic>d</italic><sub>THEO</sub> =&#x2009;0.20.</p>
<p>The application of <italic>I</italic><sub>SIM</sub> indicates that, although each of the 51 peer-reviewed object-level studies was published as evidence <italic>for</italic> a psi-effect (<xref ref-type="bibr" rid="ref3">Bem et al., 2016</xref>), the mean difference is negative (<italic>I</italic><sub>SIM</sub> =&#x2009;0) in 16 studies (31% of 51 studies), that two studies show no difference (<italic>I</italic><sub>SIM</sub> is undefined), and that the <italic>I</italic><sub>SIM</sub>-value falls outside the 95%-SI in 22 studies (43%). This means that ES<sub>OBS</sub> is <italic>insufficiently similar</italic> to ES<sub>THEO</sub>.</p>
<p>In the remaining 11 studies (22% of 51 studies), where ES<sub>OBS</sub> is <italic>sufficiently similar</italic> to ES<sub>THEO</sub>, the percentages of <italic>I</italic><sub>SIM</sub>-values falling inside the 95%-SI (see <xref rid="tab1" ref-type="table">Table 1</xref>) are nevertheless quite low: 33% (<italic>n</italic><sub>0</sub> = <italic>n</italic><sub>1</sub> =&#x2009;100); 37% (150); 33% (99); 33% (100); 33% (100); 34% (109); 23% (49); 33% (100); 34% (111); 42% (201); 23% (50). This means that each study&#x2019;s sample is <italic>too small</italic> to generate the evidence required to consider empirically adequate a theoretical construct that predicts ES<sub>THEO</sub> = <italic>d</italic><sub>THEO</sub> =&#x2009;0.20.</p>
<p>To appreciate the sample size that is needed to consider as empirically adequate a theoretical construct that predicts ES<sub>THEO</sub> = <italic>d</italic><sub>THEO</sub> =&#x2009;0.20, a one-sided <italic>t</italic>-test under <italic>&#x03B1;</italic> =&#x2009;0.05 and test-power of (1 &#x2212; <italic>&#x03B2;</italic>)&#x2009;=&#x2009;0.80 already requires <italic>n</italic><sub>0</sub> = <italic>n</italic><sub>1</sub> =&#x2009;101. Under <italic>&#x03B1;</italic> = <italic>&#x03B2;</italic> =&#x2009;0.05, it even requires <italic>n</italic><sub>0</sub> = <italic>n</italic><sub>1</sub> =&#x2009;201. The reason for the large samples is that the theoretically predicted effect is small enough to be accounted for exclusively by random influences on the empirical setting. But random influences are independent of ES<sub>THEO</sub> and so lack theoretical meaning. Indeed, this is the reason why ES<sub>THEO</sub> = <italic>d</italic><sub>THEO</sub> =&#x2009;0.20 requires a statistical corroboration against random influences in the first place.</p>
<p>In sum, although ES<sub>OBS</sub> is sufficiently similar to ES<sub>THEO</sub> in 11 out of 51 peer-reviewed object-level studies, these 11 studies <italic>individually</italic> fail to provide the evidence required to consider as empirically adequate a theoretical construct that predicts ES<sub>THEO</sub> = <italic>d</italic><sub>THEO</sub> =&#x2009;0.20. Arguably, therefore, if the empirical adequacy of the theoretically predicted psi-effect had been evaluated before conducting additional studies, some research effort concerning the psi-effect could have been avoided.</p>
</sec>
</sec>
<sec id="sec16" sec-type="discussions">
<title>Discussion</title>
<p>Whether a theoretical construct adequately predicts future observations is a distinct question from whether a data-based parameter estimate (induced from past observations) deviates statistically significantly from a random distribution. This difference matters because behavioral science research regularly uses a data-based parameter estimate and its associated confidence bounds as a proxy for a theoretical construct. But a parameter that is estimated using a <italic>z</italic>-standardized effect size measure such as Cohen&#x2019;s <italic>d</italic> cannot distinguish whether particularly a <italic>small</italic> observed <italic>d</italic>-value points to a mean difference that is too small to be observable, or rather to a large <italic>s</italic>. Without <italic>making</italic> this distinction, however, the evaluation of the empirical adequacy of a theoretical construct is out of reach.</p>
<p>The <italic>I</italic><sub>SIM</sub> measure and the SI fare better. Both together can inform the evaluation of the empirical adequacy of a theoretical construct because, if the inductive element <italic>s</italic> that serves to <italic>z</italic>-standardize measurements is avoided, then the observed mean difference <italic>ceases</italic> to be &#x201C;blurred&#x201D; by random influences. As this enables a <italic>direct</italic> comparison between the theoretically predicted and the observed mean-difference, the evaluation of the empirical adequacy of a theoretical construct is placed within reach. On how <italic>I</italic><sub>SIM</sub> and the SI can be applied beyond a simple experimental setting, see <xref ref-type="supplementary-material" rid="SM1">Supplementary Appendix S1</xref>. For additional examples, see <xref ref-type="supplementary-material" rid="SM1">Supplementary Appendix S2</xref>. To apply <italic>I</italic><sub>SIM</sub> and the SI to extant data, we provide an online tool at <ext-link xlink:href="https://adrian-stanciu.shinyapps.io/Similarity-Index/" ext-link-type="uri">https://adrian-stanciu.shinyapps.io/Similarity-Index/</ext-link>.</p>
<sec id="sec17">
<title>Practical implications</title>
<p>As behavioral science has come under scrutiny, <italic>replication crisis</italic> denotes that few previously &#x201C;established&#x201D; findings are independently replicable and that questionable research practices are regularly employed (e.g., <xref ref-type="bibr" rid="ref24">Kerr, 1998</xref>; <xref ref-type="bibr" rid="ref26">Klein, 2014</xref>; <xref ref-type="bibr" rid="ref23">Irvine, 2021</xref>; <xref ref-type="bibr" rid="ref43">Nosek et al., 2022</xref>). A familiar response to the replication crisis is to recommend measures that improve the quality of data (e.g., study pre-registrations, multi-lab projects, or open access to materials). Such measures constitute important elements of an inductive approach to parameter estimation. But some effort must also go toward developing theoretical constructs that logically entail an empirically adequate prediction, i.e., toward a deductive approach to theory construction.</p>
<p>A central limitation of the inductive approach to parameter estimation is exemplified by meta-analytical research. To arrive at robust meta-level or population effect size estimates, observed object-level effects are regularly sought to be made comparable by weighing them to the observed <italic>s</italic> (<xref ref-type="bibr" rid="ref52">Schulze, 2004</xref>). But since <italic>s</italic> varies with the (random) influences on an empirical setting, this <italic>invites</italic> all the problems discussed above. So, if a meta-analysis retains the observed <italic>s</italic> of observed object-level effects, a robust meta-level or population effect size estimate cannot be had. For this reason, <italic>s</italic> should be avoided in both theory construction research and meta-analytical research.</p>
<p>The similarity index <italic>I</italic><sub>SIM</sub> fares better. First, <italic>I</italic><sub>SIM</sub> offers a more transparent view of observations. This can assist in improving a theoretical construct because using <italic>I</italic><sub>SIM</sub> and the associated 95%-SI allows distinguishing between an empirically adequate prediction (true positive; ES<sub>THEO</sub> =&#x2009;ES<sub>POP</sub>) and an empirically inadequate one (false positive; ES<sub>THEO</sub> &#x2260;&#x2009;ES<sub>POP</sub>). Making this distinction is required to decide whether a theoretical construct can be maintained, whether its theoretically predicted effect should be adjusted, or whether additional data should be collected. The last option particularly counts if available data indicate a small effect, which is generally not well-observable.</p>
<p>Second, the <italic>I</italic><sub>SIM</sub> measure and the 95%-SI help to evaluate whether a false positive prediction indicates that the population effect is under- or overestimated. After all, for all possible combinations of a theoretically predicted effect and a sample size, as long as the percentage of non-matching observations (ES<sub>THEO</sub> &#x2260;&#x2009;ES<sub>POP</sub>) makes it unreasonable to evaluate the ES<sub>THEO</sub>-value as a true positive prediction, an empirically adequate prediction is more probable to fall inside the 95%-SI than not.</p>
<p>Third, assume that, as <italic>n</italic> increases, also the value of ES<sub>OBS</sub> becomes increasingly more similar to the value of the true population parameter (law of large numbers). If so, then the corresponding increase in the percentage difference between a true positive and a false positive prediction goes along with an increase in the proportion of viable theoretical assumptions relative to all possible alternative theoretical assumptions. With each additional <italic>I</italic><sub>SIM</sub>-value for a point-ES<sub>THEO</sub> = <italic>x</italic> that falls inside the 95%-SI, therefore, it becomes more reasonable for researchers to develop a theoretical construct for <italic>x</italic> because &#x201C;getting something right&#x201D; about <italic>x</italic> is more probable than not.</p>
<p>Fourth, if additional <italic>independent</italic> studies happen to estimate a point-ES<sub>OBS</sub> = <italic>y</italic> that is similar to <italic>x</italic>, then <italic>I</italic><sub>SIM</sub> continues to approximate the condition for a perfect match between prediction and observations (<italic>I</italic><sub>SIM</sub> =&#x2009;1). The independence of additional studies entails that the approximation of <italic>I</italic><sub>SIM</sub> =&#x2009;1 is unlikely to occur by random. Consequently, a researcher&#x2019;s confidence that ES<sub>THEO</sub> = <italic>x</italic> is empirically adequate would increase. The same rationale underlies having confidence in a meta-analytically estimated point-ES<sub>OBS</sub> that is based on independently observed object-level effects (<xref ref-type="bibr" rid="ref22">Hunter and Schmidt, 2004</xref>).</p>
<p>The use-value of an <italic>I</italic><sub>SIM</sub>-based evaluation of a theoretical construct is perhaps most readily apparent in the context of the <italic>research program strategy</italic> (RPS) (<xref ref-type="bibr" rid="ref62">Witte and Zenker, 2017</xref>; <xref ref-type="bibr" rid="ref28">Krefeld-Schwalb et al., 2018</xref>). If the effects of several independent and topically related studies are observed under low error-rates, then RPS induces the observed mean effect as a parameter estimate (see the subsection <italic>Parameter Estimation</italic>). Next, RPS develops a theoretical construct that logically entails a theoretically predicted point-effect of identical size as this inductive parameter estimate. Provided <italic>new</italic> observations under low error-rates, finally, if the likelihood of the theoretically predicted effect sufficiently exceeds the likelihood of an alternative effect, then RPS evaluates the former as <italic>preliminarily</italic> verified, respectively as <italic>substantially</italic> verified if the likelihood of the theoretically predicted effect is sufficiently similar to the maximum likelihood of new observations. For the verification thresholds of this statistical likelihood model, see <xref ref-type="bibr" rid="ref28">Krefeld-Schwalb et al. (2018</xref>, p. 22).</p>
<p>Beyond this likelihood model, the attempt to verify a theoretically predicted effect by comparing it to observations <italic>requires</italic> an <italic>I</italic><sub>SIM</sub>-like measure. An inductive parameter estimate, after all, has uncertainty bounds that reflect the variance of observations, whereas a theoretical construct that is developed based on theoretical considerations predicts a point-specific effect. For this reason, <italic>I</italic><sub>SIM</sub> avoids comparing the theoretically predicted effect <italic>indirectly</italic> to observations, an indirectness that results from using a statistical error account and a data distribution (e.g., a <italic>t</italic>-, <italic>F</italic>-, or <italic>&#x0425;</italic><sup>2</sup>-distribution). Instead, the theoretically predicted effect is compared <italic>directly</italic> to observations (as measured), while the admissible variation of a theoretical construct is captured by the 95%-SI (see the section &#x201C;<italic>Case study&#x201D;</italic>).</p>
<p>This explains why we modeled the admissible variation of a theoretical construct by simulating random samples of possible measurements, rather than by using an inferential statistical theory (e.g., a likelihood model). In RPS, the inferential statistical evaluation of (simulated or real) observations is useful, only if the <italic>I</italic><sub>SIM</sub>-value already lies within the 95%-SI, indicating that the theoretically predicted effect is similar to observations. Thus, <italic>I</italic><sub>SIM</sub> evaluates the similarity between a theoretical construct and observations <italic>before</italic> inferentially testing the theoretically predicted effect (<xref ref-type="bibr" rid="ref61">Witte and Heitkamp, 2006</xref>). Nevertheless, for a specific theoretically predicted effect to be accepted as empirically adequate, both its point-specification and its statistical substantial verification are required. In brief, <italic>I</italic><sub>SIM</sub> assists in specifying the effect size, while RPS verifies it.</p>
</sec>
<sec id="sec18">
<title>Limitations</title>
<p>Rather than replacing standardized effect size measures such as Cohen&#x2019;s <italic>d</italic> or inductive data-evaluation tools like a model-fitting index, <italic>I</italic><sub>SIM</sub> complements them. <italic>I</italic><sub>SIM</sub> should be applied mindfully. Several limitations apply:</p>
<p>First, <italic>I</italic><sub>SIM</sub> does not offer a criterion for a data-based decision to accept or reject hypotheses. Rather than comparing two hypotheses (H<sub>0</sub>, H<sub>1</sub>) in view of data, <italic>I</italic><sub>SIM</sub> evaluates only the H<sub>1</sub>-hypothesis that states ES<sub>THEO</sub>. Therefore, <italic>I</italic><sub>SIM</sub> cannot enable a relative statistical corroboration of a theoretical construct against random influences. This continues to require statistical testing.</p>
<p>Second, if the theoretically predicted effect ES<sub>THEO</sub> = <italic>x</italic> falls outside the 95%-SI, then <italic>x</italic> appears to be empirically <italic>inadequate</italic>. This appearance may mislead researchers to prematurely abandon <italic>x</italic> as a candidate value for ES<sub>THEO</sub>. But as a rule, the decision to abandon <italic>x</italic> should squarely depend on having collected an adequately large sample.</p>
<p>Third, like all formal measures, <italic>I</italic><sub>SIM</sub> is open to &#x201C;tweaking&#x201D; the data to let ES<sub>OBS</sub> and ES<sub>THEO</sub> match artificially. With a new formal measure, therefore, additional temptation to engage in questionable research practices may arise.</p>
<p>Fourth, a simple &#x201C;recycling&#x201D; of the ES<sub>OBS</sub>-value as the ES<sub>THEO</sub>-value would <italic>trivially</italic> satisfy the perfect-match condition (<italic>I</italic><sub>SIM</sub> =&#x2009;1), known as <italic>p</italic>-harking. So, the same critical considerations apply as were stated immediately above (<xref ref-type="bibr" rid="ref24">Kerr, 1998</xref>).</p>
<p>Fifth, in the context of a confirmatory factor analysis (CFA), which relies on an explorative factor analysis (EFA) to evaluate the deviation of predetermined parameters in some complex mathematical model, several of these parameters must be determined simultaneously (e.g., the number and correlations of factors, their weights, loadings, etc.). However, <italic>I</italic><sub>SIM</sub> cannot be applied to test whether the complex mathematical model itself agrees with the abstract data deduced from it; <italic>I</italic><sub>SIM</sub> can only test whether a basic parameter (e.g., a mean or a correlation) agrees with empirical data. Given a correlation matrix, for instance, <italic>I</italic><sub>SIM</sub> can evaluate the similarity between a single predicted correlation and an empirically observed correlation (see <xref ref-type="supplementary-material" rid="SM1">Supplementary Appendix S2</xref>, personality traits and life outcomes). As a basic (non-complex) measure, <italic>I</italic><sub>SIM</sub> thus operates at the level of each element in a correlation matrix and can there compare a prediction <italic>directly</italic> with observations (see <xref ref-type="bibr" rid="ref47">Perez-Gil et al., 2000</xref>).</p>
</sec>
</sec>
<sec id="sec19" sec-type="conclusions">
<title>Conclusion</title>
<p>The identification of an empirically adequate theoretical construct requires determining whether a theoretically predicted effect is sufficiently similar to an observed effect. To this end, we proposed <italic>I</italic><sub>SIM</sub> and the 95%-SI as a simple measure to evaluate the similarity between a theoretically predicted effect and observations, a measure that avoids the statistical element of the observed standard deviation. Using computer simulations, we estimated the sample size and the observed effect size that are necessary to identify an empirically adequate theoretical construct.</p>
<p>Generally relevant for theory construction research, the <italic>I</italic><sub>SIM</sub> measure and the 95%-SI particularly serve to develop a point-specific theoretical construct, where both should be applied alongside a statistical corroboration measure (e.g., the likelihood ratio). If the <italic>I</italic><sub>SIM</sub>-value falls within the 95%-SI, then a theoretical construct postulating a theoretically predicted point-specific effect ES<sub>THEO</sub> = <italic>x</italic> can be (fallibly) maintained as empirically adequate. If independent studies subsequently observe a point-effect ES<sub>OBS</sub> = <italic>y</italic> that is similar to <italic>x</italic>, a researcher&#x2019;s confidence that <italic>x</italic> is empirically adequate would increase. Whereas if too many <italic>I</italic><sub>SIM</sub>-values fall outside the 95%-SI as the number of independent studies increases, then ES<sub>THEO</sub> = <italic>x</italic> must be corrected, or the standard error must be reduced, e.g., by restricting the experimental setting. The most direct way of reducing the standard error, of course, is to increase the sample.</p>
<p>An exemplary application of <italic>I</italic><sub>SIM</sub> to recent meta-analytical findings on the precognition effect (<xref ref-type="bibr" rid="ref3">Bem et al., 2016</xref>) indicated that 51 peer-reviewed object-level studies individually fail to provide the evidence that is required to evaluate as empirically adequate a theoretical construct that predicts a precognition effect of <italic>d</italic> =&#x2009;0.20 (additional application examples are found in <xref ref-type="supplementary-material" rid="SM1">Supplementary Appendix S2</xref>).</p>
<p>In behavioral science as elsewhere, measurement comprises an ontological aspect related to the theoretical construct under development, and an epistemological aspect related to the specific measurement procedures employed. When using Cohen&#x2019;s <italic>d</italic> measure, behavioral scientists tend to address a question that combines both aspects of measurement. This is understandable if theory-testing relies on statistical inference procedures, which simultaneously relate to both aspects of measurement. But to facilitate theory construction research and the development of measurement, the ontological and epistemological aspects are best kept separate. Otherwise, it is quite difficult to say what a measurement instance in fact measures.</p>
</sec>
<sec id="sec20" sec-type="data-availability">
<title>Data availability statement</title>
<p>The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found at: <ext-link xlink:href="https://osf.io/rgwsp/" ext-link-type="uri">https://osf.io/rgwsp/</ext-link>.</p>
</sec>
<sec id="sec21">
<title>Author contributions</title>
<p>EHW developed the <italic>I</italic><sub>SIM</sub> measure. AS coded and ran the simulations. All authors drafted the manuscript, which FZ edited. All authors approved the final submitted version.</p>
</sec>
<sec id="sec22" sec-type="funding-information">
<title>Funding</title>
<p>FZ acknowledges support from TUBITAK (No. 118C257).</p>
</sec>
<sec id="conf1" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="sec100" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<sec id="sec24" sec-type="supplementary-material">
<title>Supplementary material</title>
<p>The Supplementary material for this article can be found online at: <ext-link xlink:href="https://www.frontiersin.org/articles/10.3389/fpsyg.2022.980261/full#supplementary-material" ext-link-type="uri">https://www.frontiersin.org/articles/10.3389/fpsyg.2022.980261/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.pdf" id="SM1" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="ref001"><citation citation-type="other"><person-group person-group-type="author"><name><surname>Andreas</surname> <given-names>H.</given-names></name></person-group> (<year>2021</year>). &#x201C;<article-title>Theoretical Terms in Science</article-title>,&#x201D; in <source><italic>The Stanford Encyclopedia of Philosophy (Fall 2021 Edition)</italic></source>. ed. <person-group person-group-type="editor"><name><surname>Edward</surname> <given-names>E. N.</given-names></name></person-group> Available at: <ext-link xlink:href="https://plato.stanford.edu/archives/fall2021/entries/theoretical-terms-science/" ext-link-type="uri">https://plato.stanford.edu/archives/fall2021/entries/theoretical-terms-science/</ext-link></citation></ref>
<ref id="ref2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bem</surname> <given-names>D.</given-names></name></person-group> (<year>2011</year>). <article-title>Feeling the future: experimental evidence for anomalous retroactive influences on cognition and affect</article-title>. <source>J. Pers. Soc. Psychol.</source> <volume>100</volume>, <fpage>407</fpage>&#x2013;<lpage>425</lpage>. doi: <pub-id pub-id-type="doi">10.1037/a0021524</pub-id>, PMID: <pub-id pub-id-type="pmid">21280961</pub-id></citation></ref>
<ref id="ref3"><citation citation-type="other"><person-group person-group-type="author"><name><surname>Bem</surname> <given-names>D.</given-names></name> <name><surname>Tressoldi</surname> <given-names>P.</given-names></name> <name><surname>Rabeyron</surname> <given-names>T.</given-names></name> <name><surname>Duggan</surname> <given-names>M.</given-names></name></person-group> (<year>2016</year>). <article-title>Feeling the future: a meta-analysis of 90 experiments on the anticipation of random future events [version 2; referees: 2 approved]. F1000 research</article-title>. <volume>4</volume>:<fpage>1188</fpage>. <ext-link xlink:href="https://f1000researchdata.s3.amazonaws.com/datasets/7177/9efe17e0-4b70-4f10-9945-a309e42de2c4_TableA1.xlsx" ext-link-type="uri">https://f1000researchdata.s3.amazonaws.com/datasets/7177/9efe17e0-4b70-4f10-9945-a309e42de2c4_TableA1.xlsx</ext-link>]</citation></ref>
<ref id="ref4"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Bollen</surname> <given-names>K. A.</given-names></name> <name><surname>Bauer</surname> <given-names>D. J.</given-names></name> <name><surname>Christ</surname> <given-names>S. L.</given-names></name> <name><surname>Edwards</surname> <given-names>M. C.</given-names></name></person-group> (<year>2010</year>). &#x201C;<article-title>An overview of structural equations models and recent extensions</article-title>&#x201D; in <source>Recent developments in social science statistics</source>. eds. <person-group person-group-type="editor"><name><surname>Kolenikov</surname> <given-names>S.</given-names></name> <name><surname>Steinley</surname> <given-names>D.</given-names></name> <name><surname>Thombs</surname> <given-names>L.</given-names></name></person-group> (<publisher-loc>Hoboken</publisher-loc>: <publisher-name>Wiley</publisher-name>), <fpage>37</fpage>&#x2013;<lpage>80</lpage>.</citation></ref>
<ref id="ref5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Burnham</surname> <given-names>K. P.</given-names></name> <name><surname>Anderson</surname> <given-names>D. R.</given-names></name></person-group> (<year>2004</year>). <article-title>Multimodel inference: understanding AIC and BIC in model selection</article-title>. <source>Sociol. Methods Res.</source> <volume>33</volume>, <fpage>261</fpage>&#x2013;<lpage>304</lpage>.</citation></ref>
<ref id="ref6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cardena</surname> <given-names>E.</given-names></name></person-group> (<year>2018</year>). <article-title>The experimental evidence for parapsychological phenomena: a review</article-title>. <source>Am. Psychol.</source> <volume>73</volume>, <fpage>663</fpage>&#x2013;<lpage>677</lpage>. doi: <pub-id pub-id-type="doi">10.1037/amp0000236</pub-id>, PMID: <pub-id pub-id-type="pmid">29792448</pub-id></citation></ref>
<ref id="ref7"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Cohen</surname> <given-names>J.</given-names></name></person-group> (<year>1977</year>). <source>Statistical power analysis for the behavioral sciences</source>. <publisher-loc>New York</publisher-loc>: <publisher-name>Academic Press</publisher-name>.</citation></ref>
<ref id="ref8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cornelissen</surname> <given-names>J.</given-names></name> <name><surname>H&#x00F6;llerer</surname> <given-names>M. A.</given-names></name> <name><surname>Seidl</surname> <given-names>D.</given-names></name></person-group> (<year>2021</year>). <article-title>What theory is and can be: forms of theorizing in organizational scholarship</article-title>. <source>Organ. Theory</source> <volume>2</volume>, <fpage>263178772110203</fpage>&#x2013;<lpage>263178772110219</lpage>.</citation></ref>
<ref id="ref9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eronen</surname> <given-names>M. I.</given-names></name> <name><surname>Bringmann</surname> <given-names>L. F.</given-names></name></person-group> (<year>2021</year>). <article-title>The theory crisis in psychology: how to move forward</article-title>. <source>Perspect. Psychol. Sci.</source> <volume>16</volume>, <fpage>779</fpage>&#x2013;<lpage>788</lpage>. doi: <pub-id pub-id-type="doi">10.1177/1745691620970586</pub-id>, PMID: <pub-id pub-id-type="pmid">33513314</pub-id></citation></ref>
<ref id="ref10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eronen</surname> <given-names>M. I.</given-names></name> <name><surname>Romeijn</surname> <given-names>J. W.</given-names></name></person-group> (<year>2020</year>). <article-title>Philosophy of science and the formalization of psychological theory</article-title>. <source>Theory Psychol.</source> <volume>30</volume>, <fpage>786</fpage>&#x2013;<lpage>799</lpage>.</citation></ref>
<ref id="ref13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fiedler</surname> <given-names>K.</given-names></name> <name><surname>Prager</surname> <given-names>J.</given-names></name></person-group> (<year>2018</year>). <article-title>The regression trap and other pitfalls of replication science&#x2014;illustrated by the report of the Open Science collaboration</article-title>. <source>Basic Appl. Soc. Psychol.</source> <volume>40</volume>, <fpage>115</fpage>&#x2013;<lpage>124</lpage>. doi: <pub-id pub-id-type="doi">10.1080/01973533.2017.1421953</pub-id></citation></ref>
<ref id="ref14"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Fleck</surname> <given-names>L.</given-names></name></person-group> (<year>1935</year>). <source>Genesis and development of a scientific fact</source>. <publisher-loc>Chicago</publisher-loc>: <publisher-name>The University of Chicago Press</publisher-name>.</citation></ref>
<ref id="ref15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gelman</surname> <given-names>A.</given-names></name></person-group> (<year>2018</year>). <article-title>The failure of null hypothesis significance testing when studying incremental changes, and what to do about it</article-title>. <source>Personal. Soc. Psychol. Bull.</source> <volume>44</volume>, <fpage>16</fpage>&#x2013;<lpage>23</lpage>.</citation></ref>
<ref id="ref16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gelman</surname> <given-names>A.</given-names></name> <name><surname>Carlin</surname> <given-names>J.</given-names></name></person-group> (<year>2014</year>). <article-title>Beyond power calculations: assessing type S (sign) and type M (magnitude) errors</article-title>. <source>Perspect. Psychol. Sci.</source> <volume>9</volume>, <fpage>641</fpage>&#x2013;<lpage>651</lpage>.</citation></ref>
<ref id="ref17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gervais</surname> <given-names>W. M.</given-names></name></person-group> (<year>2021</year>). <article-title>Practical methodological reform needs good theory</article-title>. <source>Perspect. Psychol. Sci.</source> <volume>16</volume>, <fpage>827</fpage>&#x2013;<lpage>843</lpage>. doi: <pub-id pub-id-type="doi">10.1177/1745691620977471</pub-id>, PMID: <pub-id pub-id-type="pmid">33513312</pub-id></citation></ref>
<ref id="ref18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gigerenzer</surname> <given-names>G.</given-names></name></person-group> (<year>1998</year>). <article-title>Surrogates for theories</article-title>. <source>Theory Psychol.</source> <volume>8</volume>, <fpage>195</fpage>&#x2013;<lpage>204</lpage>.</citation></ref>
<ref id="ref19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hempel</surname> <given-names>C. G.</given-names></name></person-group> (<year>1988</year>). <article-title>Provisoes: a problem concerning the inferential function of scientific theories</article-title>. <source>Erkenntnis</source> <volume>28</volume>, <fpage>147</fpage>&#x2013;<lpage>164</lpage>. doi: <pub-id pub-id-type="doi">10.1007/BF00166441</pub-id></citation></ref>
<ref id="ref20"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Henderson</surname> <given-names>L.</given-names></name></person-group> (<year>2020</year>). <article-title>The problem of induction</article-title>. in <person-group person-group-type="editor"><name><surname>Zalta</surname> <given-names>E. N.</given-names></name></person-group> (Ed.), <source><italic>The Stanford encyclopedia of philosophy</italic> (online book)</source>. <publisher-loc>Stanford</publisher-loc>: <publisher-name>Metaphysics Research Lab, Stanford University</publisher-name>. Available at: <ext-link xlink:href="https://plato.stanford.edu/entries/induction-problem/" ext-link-type="uri">https://plato.stanford.edu/entries/induction-problem/</ext-link></citation></ref>
<ref id="ref21"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Hume</surname> <given-names>D.</given-names></name></person-group> (<year>1739</year>). <source>A treatise of human nature</source>. <publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation></ref>
<ref id="ref22"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Hunter</surname> <given-names>J. E.</given-names></name> <name><surname>Schmidt</surname> <given-names>F. L.</given-names></name></person-group> (<year>2004</year>). <source>Methods of meta-analysis: Correcting error and bias in research findings</source>. <edition>2nd Edn</edition>. <publisher-loc>Thousand Oaks</publisher-loc>: <publisher-name>Sage Publications</publisher-name>.</citation></ref>
<ref id="ref23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Irvine</surname> <given-names>E.</given-names></name></person-group> (<year>2021</year>). <article-title>The role of replication studies in theory building</article-title>. <source>Perspect. Psychol. Sci.</source> <volume>16</volume>, <fpage>844</fpage>&#x2013;<lpage>853</lpage>. doi: <pub-id pub-id-type="doi">10.1177/1745691620970558</pub-id>, PMID: <pub-id pub-id-type="pmid">33440125</pub-id></citation></ref>
<ref id="ref24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kerr</surname> <given-names>N. L.</given-names></name></person-group> (<year>1998</year>). <article-title>HARKing: hypothesizing after the results are known</article-title>. <source>Personal. Soc. Psychol. Rev.</source> <volume>2</volume>, <fpage>196</fpage>&#x2013;<lpage>217</lpage>.</citation></ref>
<ref id="ref25"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Kish</surname> <given-names>L.</given-names></name></person-group> (<year>1965</year>). <source>Survey sampling</source>. <publisher-loc>New York</publisher-loc>: <publisher-name>Wiley</publisher-name>.</citation></ref>
<ref id="ref26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Klein</surname> <given-names>S. B.</given-names></name></person-group> (<year>2014</year>). <article-title>What can recent replication failures tell us about the theoretical commitments of psychology?</article-title> <source>Theory Psychol.</source> <volume>24</volume>, <fpage>326</fpage>&#x2013;<lpage>338</lpage>.</citation></ref>
<ref id="ref27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Klein</surname> <given-names>R. A.</given-names></name> <name><surname>Vianello</surname> <given-names>M.</given-names></name> <name><surname>Hasselman</surname> <given-names>F.</given-names></name> <name><surname>Adams</surname> <given-names>B. G.</given-names></name> <name><surname>Adams</surname> <given-names>R. B.</given-names> <suffix>Jr.</suffix></name> <name><surname>Alper</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Many labs 2: investigating variation in replicability across sample and setting</article-title>. <source>Adv. Methods Pract. Psychol. Sci.</source> <volume>1</volume>, <fpage>443</fpage>&#x2013;<lpage>490</lpage>.</citation></ref>
<ref id="ref28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krefeld-Schwalb</surname> <given-names>A.</given-names></name> <name><surname>Witte</surname> <given-names>E. H.</given-names></name> <name><surname>Zenker</surname> <given-names>F.</given-names></name></person-group> (<year>2018</year>). <article-title>Hypothesis-testing demands trustworthy data&#x2014;a simulation approach to statistical inference advocating the research program strategy</article-title>. <source>Front. Psychol.</source> <volume>9</volume>:<fpage>460</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fpsyg.2018.00460</pub-id>, PMID: <pub-id pub-id-type="pmid">29740363</pub-id></citation></ref>
<ref id="ref29"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Kuhn</surname> <given-names>T.</given-names></name></person-group> (<year>1962</year>). <source>The structure of scientific revolutions</source>. <publisher-loc>Chicago</publisher-loc>: <publisher-name>The University of Chicago Press</publisher-name>.</citation></ref>
<ref id="ref30"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Lakatos</surname> <given-names>I.</given-names></name></person-group> (<year>1978</year>). <source>The methodology of scientific research Programmes</source>. <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation></ref>
<ref id="ref31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lakens</surname> <given-names>D.</given-names></name></person-group> (<year>2013</year>). <article-title>Calculating and reporting effect sizes to facilitate cumulative science: a practical primer for t-tests and ANOVAs</article-title>. <source>Front. Psychol.</source> <volume>4</volume>:<fpage>863</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fpsyg.2013.00863</pub-id>, PMID: <pub-id pub-id-type="pmid">24324449</pub-id></citation></ref>
<ref id="ref32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lakens</surname> <given-names>D.</given-names></name> <name><surname>Scheel</surname> <given-names>A. M.</given-names></name> <name><surname>Isager</surname> <given-names>P. M.</given-names></name></person-group> (<year>2018</year>). <article-title>Equivalence testing for psychological research: a tutorial</article-title>. <source>Adv. Methods Pract. Psychol. Sci.</source> <volume>1</volume>, <fpage>259</fpage>&#x2013;<lpage>269</lpage>.</citation></ref>
<ref id="ref34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Linden</surname> <given-names>A. H.</given-names></name> <name><surname>H&#x00F6;nekopp</surname> <given-names>J.</given-names></name></person-group> (<year>2021</year>). <article-title>Heterogeneity of research results: a new perspective from which to assess and promote progress in psychological science</article-title>. <source>Perspect. Psychol. Sci.</source> <volume>16</volume>, <fpage>358</fpage>&#x2013;<lpage>376</lpage>. doi: <pub-id pub-id-type="doi">10.1177/1745691620964193</pub-id>, PMID: <pub-id pub-id-type="pmid">33400613</pub-id></citation></ref>
<ref id="ref35"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Lord</surname> <given-names>F. M.</given-names></name> <name><surname>Novick</surname> <given-names>M. R.</given-names></name></person-group> (<year>1968</year>). <source>Statistical theories of mental test scores</source>. <publisher-loc>Reading</publisher-loc>: <publisher-name>Addison-Wesley</publisher-name>.</citation></ref>
<ref id="ref36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meehl</surname> <given-names>P. E.</given-names></name></person-group> (<year>1978</year>). <article-title>Theoretical risks and tabular asterisks: sir Karl, sir Ronald, and the slow progress of soft psychology</article-title>. <source>J. Consult. Clin. Psychol.</source> <volume>46</volume>, <fpage>806</fpage>&#x2013;<lpage>834</lpage>. doi: <pub-id pub-id-type="doi">10.1037/0022-006X.46.4.806</pub-id></citation></ref>
<ref id="ref37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meehl</surname> <given-names>P. E.</given-names></name></person-group> (<year>1990</year>). <article-title>Appraising and amending theories: the strategy of Lakatosian defense and two principles that warrant it</article-title>. <source>Psychol. Inq.</source> <volume>1</volume>, <fpage>108</fpage>&#x2013;<lpage>141</lpage>. doi: <pub-id pub-id-type="doi">10.1207/s15327965pli0102_1</pub-id></citation></ref>
<ref id="ref38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meehl</surname> <given-names>P. E.</given-names></name></person-group> (<year>1992</year>). <article-title>Cliometric metatheory: the actuarial approach to empirical, history-based philosophy of science</article-title>. <source>Psychol. Rep.</source> <volume>91</volume>, <fpage>339</fpage>&#x2013;<lpage>404</lpage>. doi: <pub-id pub-id-type="doi">10.2466/pr0.2002.91.2.339</pub-id></citation></ref>
<ref id="ref39"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Meehl</surname> <given-names>P. E.</given-names></name></person-group> (<year>1997</year>). &#x201C;<article-title>The problem is epistemology, not statistics: replace significance tests by confidence intervals and quantify accuracy of risky numeral predictions</article-title>&#x201D; in <source>What if there were no significance tests?</source> eds. <person-group person-group-type="editor"><name><surname>Harlow</surname> <given-names>L. L.</given-names></name> <name><surname>Mulaik</surname> <given-names>S. A.</given-names></name> <name><surname>Steiger</surname> <given-names>J. H.</given-names></name></person-group> (<publisher-loc>Mahwah</publisher-loc>: <publisher-name>Erlbaum</publisher-name>), <fpage>393</fpage>&#x2013;<lpage>425</lpage>.</citation></ref>
<ref id="ref40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mi&#x0142;kowski</surname> <given-names>M.</given-names></name> <name><surname>Hohol</surname> <given-names>M.</given-names></name> <name><surname>Nowakowski</surname> <given-names>P.</given-names></name></person-group> (<year>2019</year>). <article-title>Mechanisms in psychology: the road towards unity?</article-title> <source>Theory Psychol.</source> <volume>29</volume>, <fpage>567</fpage>&#x2013;<lpage>578</lpage>.</citation></ref>
<ref id="ref41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Morris</surname> <given-names>T. P.</given-names></name> <name><surname>White</surname> <given-names>I. R.</given-names></name> <name><surname>Crowther</surname> <given-names>M. J.</given-names></name></person-group> (<year>2019</year>). <article-title>Using simulation studies to evaluate statistical methods</article-title>. <source>Stat. Med.</source> <volume>38</volume>, <fpage>2074</fpage>&#x2013;<lpage>2102</lpage>. doi: <pub-id pub-id-type="doi">10.1002/sim.8086</pub-id>, PMID: <pub-id pub-id-type="pmid">30652356</pub-id></citation></ref>
<ref id="ref42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Muthukrishna</surname> <given-names>M.</given-names></name> <name><surname>Henrich</surname> <given-names>J.</given-names></name></person-group> (<year>2019</year>). <article-title>A problem in theory</article-title>. <source>Nat. Hum. Behav.</source> <volume>3</volume>, <fpage>221</fpage>&#x2013;<lpage>229</lpage>. doi: <pub-id pub-id-type="doi">10.1038/s41562-018-0522-1</pub-id></citation></ref>
<ref id="ref002"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Myung</surname> <given-names>I. J.</given-names></name></person-group> (<year>2000</year>). <article-title>The importance of complexity in model selection</article-title>. <source>Journal of Mathematical Psychology</source> <volume>44</volume>, <fpage>190</fpage>&#x2013;<lpage>204</lpage>.</citation></ref>
<ref id="ref43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nosek</surname> <given-names>B. A.</given-names></name> <name><surname>Hardwicke</surname> <given-names>T. E.</given-names></name> <name><surname>Moshontz</surname> <given-names>H.</given-names></name> <name><surname>Allard</surname> <given-names>A.</given-names></name> <name><surname>Corker</surname> <given-names>K. S.</given-names></name> <name><surname>Dreber</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2022</year>). <article-title>Replicability, robustness, and reproducibility in psychological science</article-title>. <source>Annu. Rev. Psychol.</source> <volume>73</volume>, <fpage>719</fpage>&#x2013;<lpage>748</lpage>. doi: <pub-id pub-id-type="doi">10.1146/annurev-psych-020821-114157</pub-id>, PMID: <pub-id pub-id-type="pmid">34665669</pub-id></citation></ref>
<ref id="ref44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Oberauer</surname> <given-names>K.</given-names></name> <name><surname>Lewandowsky</surname> <given-names>S.</given-names></name></person-group> (<year>2019</year>). <article-title>Addressing the theory crisis in psychology</article-title>. <source>Psychon. Bull. Rev.</source> <volume>26</volume>, <fpage>1596</fpage>&#x2013;<lpage>1618</lpage>. doi: <pub-id pub-id-type="doi">10.3758/s13423-019-01645-2</pub-id>, PMID: <pub-id pub-id-type="pmid">31515732</pub-id></citation></ref>
<ref id="ref45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Olsson-Collentine</surname> <given-names>A.</given-names></name> <name><surname>Wicherts</surname> <given-names>J. M.</given-names></name> <name><surname>van Assen</surname> <given-names>M. A.</given-names></name></person-group> (<year>2020</year>). <article-title>Heterogeneity in direct replications in psychology and its association with effect size</article-title>. <source>Psychol. Bull.</source> <volume>146</volume>, <fpage>922</fpage>&#x2013;<lpage>940</lpage>. doi: <pub-id pub-id-type="doi">10.1037/bul0000294</pub-id>, PMID: <pub-id pub-id-type="pmid">32700942</pub-id></citation></ref>
<ref id="ref46"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Peirce</surname> <given-names>C. S.</given-names></name></person-group> (<year>1931&#x2013;1958</year>). in <source>Collected papers of Charles Sanders Peirce</source>. eds. <person-group person-group-type="editor"><name><surname>Weiss</surname> <given-names>P.</given-names></name> <name><surname>Hartshorne</surname> <given-names>C.</given-names></name> <name><surname>Burks</surname> <given-names>A. W.</given-names></name></person-group>, <volume>vol. 1&#x2013;8</volume>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>Harvard University Press</publisher-name>.</citation></ref>
<ref id="ref47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Perez-Gil</surname> <given-names>J. A.</given-names></name> <name><surname>Moscoso</surname> <given-names>S. C.</given-names></name> <name><surname>Rodriguez</surname> <given-names>R. M.</given-names></name></person-group> (<year>2000</year>). <article-title>Validez de constructo: el uso de analisis factorial exploratorio-confirmatorio Para obtener evidencias de validez [construct validity: the use of exploratory-confirmatory factor analysis in determining validity evidence]</article-title>. <source>Psicothema</source> <volume>12</volume>, <fpage>442</fpage>&#x2013;<lpage>446</lpage>.</citation></ref>
<ref id="ref48"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Popper</surname> <given-names>K. R.</given-names></name></person-group> (<year>1959</year>). <source>Logic of discovery</source>. <publisher-loc>London</publisher-loc>: <publisher-name>Routledge</publisher-name>.</citation></ref>
<ref id="ref49"><citation citation-type="book"><person-group person-group-type="author"><collab id="coll1">R Core Team</collab></person-group>. (<year>2021</year>). <source>R: A language and environment for statistical computing</source>. <publisher-name>R Foundation for Statistical Computing</publisher-name>. <publisher-loc>Vienna, Austria</publisher-loc>. Available at: <ext-link xlink:href="https://www.R-project.org/" ext-link-type="uri">https://www.R-project.org/</ext-link></citation></ref>
<ref id="ref50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sch&#x00E4;fer</surname> <given-names>T.</given-names></name> <name><surname>Schwarz</surname> <given-names>M. A.</given-names></name></person-group> (<year>2019</year>). <article-title>The meaningfulness of effect sizes in psychological research: differences between sub-disciplines and the impact of potential biases</article-title>. <source>Front. Psychol.</source> <volume>10</volume>:<fpage>813</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fpsyg.2019.00813</pub-id>, PMID: <pub-id pub-id-type="pmid">31031679</pub-id></citation></ref>
<ref id="ref51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schauer</surname> <given-names>J. M.</given-names></name> <name><surname>Hedges</surname> <given-names>L. V.</given-names></name></person-group> (<year>2020</year>). <article-title>Assessing heterogeneity and power in replications of psychological experiments</article-title>. <source>Psychol. Bull.</source> <volume>146</volume>, <fpage>701</fpage>&#x2013;<lpage>719</lpage>. doi: <pub-id pub-id-type="doi">10.1037/bul0000232</pub-id>, PMID: <pub-id pub-id-type="pmid">32271029</pub-id></citation></ref>
<ref id="ref52"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Schulze</surname> <given-names>R.</given-names></name></person-group> (<year>2004</year>). <source>Meta-Analysis</source>. <publisher-loc>A comparison of approaches</publisher-loc>. <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Hogrefe &#x0026; Huber</publisher-name>.</citation></ref>
<ref id="ref53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Simmons</surname> <given-names>J. P.</given-names></name> <name><surname>Nelson</surname> <given-names>L. D.</given-names></name> <name><surname>Simonsohn</surname> <given-names>U.</given-names></name></person-group> (<year>2013</year>). <article-title>Life after p-hacking</article-title>. <source>Meet. Soc. Pers. Soc. Psychol.</source> doi: <pub-id pub-id-type="doi">10.2139/ssrn.2205186</pub-id></citation></ref>
<ref id="ref54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Szucs</surname> <given-names>D.</given-names></name> <name><surname>Ioannidis</surname> <given-names>J. P. A.</given-names></name></person-group> (<year>2017</year>). <article-title>Empirical assessment of published effect sizes and power in the recent cognitive neuroscience and psychology literature</article-title>. <source>PLoS Biol.</source> <volume>15</volume>:<fpage>e2000797</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pbio.2000797</pub-id>, PMID: <pub-id pub-id-type="pmid">28253258</pub-id></citation></ref>
<ref id="ref55"><citation citation-type="other"><person-group person-group-type="author"><name><surname>Torchiano</surname> <given-names>M.</given-names></name></person-group> (<year>2020</year>). <article-title>Effsize: efficient effect size computation (package version 0.8.1.)</article-title>. doi: <pub-id pub-id-type="doi">10.5281/zenodo.1480624</pub-id></citation></ref>
<ref id="ref56"><citation citation-type="book"><person-group person-group-type="author"><name><surname>van Fraassen</surname> <given-names>B.</given-names></name></person-group> (<year>1980</year>). <source>The scientific image</source>. <publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>. doi: <pub-id pub-id-type="doi">10.1093/0198244274.001.0001</pub-id></citation></ref>
<ref id="ref57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>van Rooij</surname> <given-names>I.</given-names></name> <name><surname>Baggio</surname> <given-names>G.</given-names></name></person-group> (<year>2020</year>). <article-title>Theory before the test: how to build high-verisimilitude explanatory theories in psychological science</article-title>. <source>Perspect. Psychol. Sci.</source> <volume>16</volume>, <fpage>682</fpage>&#x2013;<lpage>697</lpage>. doi: <pub-id pub-id-type="doi">10.1177/1745691620970604</pub-id>, PMID: <pub-id pub-id-type="pmid">33404356</pub-id></citation></ref>
<ref id="ref58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wagenmakers</surname> <given-names>E.</given-names></name> <name><surname>Farrell</surname> <given-names>S.</given-names></name></person-group> (<year>2004</year>). <article-title>AIC model selection using Akaike weights</article-title>. <source>Psychon. Bull. Rev.</source> <volume>11</volume>, <fpage>192</fpage>&#x2013;<lpage>196</lpage>. doi: <pub-id pub-id-type="doi">10.3758/BF03206482</pub-id>, PMID: <pub-id pub-id-type="pmid">15117008</pub-id></citation></ref>
<ref id="ref59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wickham</surname> <given-names>H.</given-names></name> <name><surname>Averick</surname> <given-names>M.</given-names></name> <name><surname>Bryan</surname> <given-names>J.</given-names></name> <name><surname>Chang</surname> <given-names>W.</given-names></name> <name><surname>D&#x2019;Agostino McGowan</surname> <given-names>L.</given-names></name> <name><surname>Francois</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Welcome to the tidyverse</article-title>. <source>J. Open Source Softw.</source> <volume>4</volume>:<fpage>1686</fpage>. doi: <pub-id pub-id-type="doi">10.21105/joss.01686</pub-id></citation></ref>
<ref id="ref60"><citation citation-type="other"><person-group person-group-type="author"><name><surname>Wickham</surname> <given-names>H.</given-names></name> <name><surname>Francois</surname> <given-names>R.</given-names></name> <name><surname>Henry</surname> <given-names>L.</given-names></name> <name><surname>M&#x00FC;ller</surname> <given-names>K.</given-names></name></person-group> (<year>2021</year>). <article-title>Dplyr: a grammar of data manipulation (package version 1.0.7.)</article-title>. Available at: <ext-link xlink:href="https://CRAN.R-project.org/package=dplyr" ext-link-type="uri">https://CRAN.R-project.org/package=dplyr</ext-link></citation></ref>
<ref id="ref61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Witte</surname> <given-names>E. H.</given-names></name> <name><surname>Heitkamp</surname> <given-names>I.</given-names></name></person-group> (<year>2006</year>). <article-title>Quantitative Rekonstruktionen (Retrognosen) als Instrument der Theorienbildung und Theorienpr&#x00FC;fung in der Sozialpsychologie</article-title>. <source>Z. Sozialpsychol.</source> <volume>37</volume>, <fpage>205</fpage>&#x2013;<lpage>214</lpage>. doi: <pub-id pub-id-type="doi">10.1024/0044-3514.37.3.205</pub-id></citation></ref>
<ref id="ref62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Witte</surname> <given-names>E. H.</given-names></name> <name><surname>Zenker</surname> <given-names>F.</given-names></name></person-group> (<year>2017</year>). <article-title>From discovery to justification: outline of an ideal research program in empirical psychology</article-title>. <source>Front. Psychol.</source> <volume>8</volume>:<fpage>1847</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fpsyg.2017.01847</pub-id>, PMID: <pub-id pub-id-type="pmid">29163256</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="fn0005"><p><sup>1</sup>The creative process of developing a theoretical construct is what C.S. Peirce called an <italic>abduction</italic> over past observations (<xref ref-type="bibr" rid="ref46">Peirce, 1931&#x2013;1958</xref>). Jointly with initial and boundary conditions, a theoretical construct allows for a <italic>deduction</italic> of a theoretical prediction about possible future observations. And a (dis-)confirmation of this prediction by <italic>new</italic> observations that are (in-)consistent with it relies on a testing process that the late C.S. Peirce called <italic>induction</italic>. Abduction thus is "the process of forming an explanatory hypothesis [and is] the only logical operation which introduces any new idea," whereas "deduction merely evolves the necessary consequences of a pure hypothesis," while induction "does nothing but determine a [truth] value" (Peirce, 5.171).</p><p>Importantly, the information content of a theoretical construct exceeds that of an <italic>inductive generalization</italic> (e.g., a mathematical function stating an observed law-like regularity) that <italic>descriptively</italic> subsumes past observations. Theoretical constructs acquire this excess content by featuring at least one <italic>theoretical entity</italic> that is not presupposed by the observational theory employed to make past observations (see <xref ref-type="bibr" rid="ref001">Andreas, 2021</xref>, and our <xref ref-type="supplementary-material" rid="SM1">Supplementary Appendix S2</xref>, dissonance theory). Behavioral scientist, however, who likewise develop theoretical constructs based on past observations, regularly fail to acknowledge, and to perform, what <xref ref-type="bibr" rid="ref19">Hempel (1988)</xref> called <italic>theoretical (or inductive) ascent</italic>, i.e., "[&#x2026;] a transition from a data sentence expressed in [an antecedent vocabulary] <italic>V</italic><sub>A</sub> to a theoretical hypothesis [&#x2026;]" (p. 150) that is "formulated with the help of a theoretical vocabulary, <italic>V</italic><sub>C</sub>, whose terms refer to the kinds and characteristics of the <italic>theoretical</italic> entities and processes in question" (p. 147, <italic>italics added</italic>) which themselves are the products of <italic>abduction</italic>.</p><p>Without theoretical ascent, therefore, the information content of a <italic>non</italic>-genuine theoretical construct is at most as large as that of an inductive generalization. The main consequence is that the act of predicting <italic>future</italic> observations based on the inductive generalization that a <italic>non</italic>-genuine theoretical construct is, runs straight into Hume&#x2019;s problem of induction (<xref ref-type="bibr" rid="ref21">Hume, 1739</xref>): a non-pragmatic justification for a prediction of future observations based on an inductive generalization of past observations (<italic>sans</italic> theoretical ascent) presupposes that induction is a <italic>valid</italic> mode of reasoning. But this inference is circular (<xref ref-type="bibr" rid="ref20">Henderson, 2020</xref>).</p></fn>
<fn id="fn0006"><p><sup>2</sup>With &#x2018;<italic>a</italic>&#x2019; for the <italic>origin</italic> of the scale (normalization), &#x2018;<italic>u</italic>&#x2019; for the <italic>unit</italic> of the scale (standardization), and &#x2018;<italic>x</italic>, <italic>y</italic>, <italic>z</italic>&#x2019; for arbitrary <italic>measurement values</italic>, the ratio [(<italic>x</italic>&#x2009;+&#x2009;<italic>a</italic>) <italic>u</italic> &#x2212; (<italic>y</italic>&#x2009;+&#x2009;<italic>a</italic>) <italic>u</italic>] / [(<italic>z</italic>&#x2009;+&#x2009;<italic>a</italic>) <italic>u</italic> &#x2212; (<italic>y</italic>&#x2009;+&#x2009;<italic>a</italic>) <italic>u</italic>]&#x2009;=&#x2009;(<italic>x</italic> &#x2212; <italic>y</italic>) / (<italic>z</italic> &#x2212; <italic>y</italic>) is invariant for all values of <italic>a</italic> and <italic>u</italic>.</p></fn>
<fn id="fn0007"><p><sup>3</sup>Such rigor often cannot be achieved. Researchers in personality psychology, for instance, typically cannot randomly allocate study participants according to their personality characteristics. The object of inquiry, therefore, are not treatment effects but correlations between variables. These correlations can nevertheless be generalized to a population if a sample is representative of it (<xref ref-type="bibr" rid="ref25">Kish, 1965</xref>).</p></fn></fn-group></back></article>