ORIGINAL RESEARCH article

Front. Psychol., 31 March 2021
Sec. Quantitative Psychology and Measurement
This article is part of the Research Topic Moving Beyond Non-Informative Prior Distributions: Achieving the Full Potential of Bayesian Methods for Psychological Research

Assessing the Impact of Precision Parameter Prior in Bayesian Non-parametric Growth Curve Modeling

  • 1Department of Psychology, University of Virginia, Charlottesville, VA, United States
  • 2Department of Psychology, Sun Yat-sen University, Guangzhou, China

Bayesian non-parametric (BNP) modeling has been developed and proven to be a powerful tool for analyzing messy data with complex structures. Despite its increasing popularity, BNP modeling also faces challenges. One challenge is the estimation of the precision parameter in Dirichlet process mixtures. In this study, we focus on a BNP growth curve model and investigate how a non-informative prior, a weakly informative prior, an accurate informative prior, and an inaccurate informative prior for the precision parameter affect model convergence, parameter estimation, and computation time. A simulation study is conducted. We conclude that the non-informative prior for the precision parameter is less preferred because it yields a much lower convergence rate, and that the growth curve parameter estimates are not sensitive to the choice among informative priors.

1. Introduction

Bayesian non-parametric (BNP) modeling, also called semiparametric Bayesian modeling in the literature, has been recognized as a valuable data analytic technique because of its great flexibility and adaptivity (e.g., Müller and Mitra, 2004; Gershman and Blei, 2012). It is rapidly gaining popularity among methodologists and practitioners and has been applied to a variety of models, including regressions, latent variable models with complex structures, and sequential models. BNP models are defined on an infinite-dimensional parameter space, and their complexity adapts to the data. One of the most popular BNP models is the Dirichlet process (DP) mixture. Because they can adapt the number of latent classes to the complexity of the data, DP mixtures are powerful for modeling empirical data. However, they also face technical challenges. One challenge is the estimation of the precision parameter in the DP mixture. In this study, we focus on the prior of the precision parameter and investigate how it affects model convergence, parameter estimation, and computation time in BNP growth curve modeling.

Growth curve models are broadly used in longitudinal research (e.g., Meredith and Tisak, 1990; McArdle and Nesselroade, 2014). Many popular longitudinal models in the social and behavioral sciences, such as multilevel models, some mixed-effects models, and linear hierarchical models, can be written in the form of growth curve models. In growth curve models, dependent variables are repeatedly measured and explained as a function of time and possible control variables. The mean function relating the dependent variables to time describes the mean growth. Random effects and measurement errors cause the individual growth trajectories to deviate from the mean growth curve. Traditional growth curve modeling is typically based on the normality assumption; that is, both the random effects and the measurement errors are assumed to follow normal distributions. However, empirical data often violate the normality assumption (Micceri, 1989; Cain et al., 2017). Non-normal population distributions and data contamination are two common causes of non-normality. Although standard errors and test statistics have been corrected to reduce the adverse effect of violating the distributional assumption (e.g., Chou et al., 1991; Curran et al., 1996), normal-distribution-based maximum likelihood estimation may still yield inefficient or inaccurate parameter estimates and thus misleading statistical inferences (e.g., Yuan and Bentler, 2001; Maronna et al., 2006). Therefore, researchers have developed robust methods to obtain accurate parameter estimates and statistical inferences.

The ideas behind robust methods can be divided into two types. For the first type, the key idea is to downweight extreme cases. To do so, this type of robust method assigns a weight to each subject in a dataset according to its distance from the center of the majority of the data (e.g., Pendergast and Broffitt, 1985; Singer and Sen, 1986; Silvapulle, 1992; Yuan and Bentler, 1998; Zhong and Yuan, 2010). For the second type, the key idea is to use non-normal distributions that are mathematically tractable when building the statistical model. For example, latent variables and/or measurement errors are assumed to follow a t or skew-t distribution (Tong and Zhang, 2012; Zhang, 2016) or a mixture of certain distributions (Muthén and Shedden, 1999; Lu and Zhang, 2014). While useful, these methods have limitations under certain conditions. For example, the downweighting method does not perform well when latent variables contain extreme scores (see Zhong and Yuan, 2011), and using a t distribution or a mixture of normal distributions still imposes restrictions on the shape of the data distribution.

The aforementioned issues are automatically resolved by BNP methods. BNP modeling relies on a building block, DP, to handle the non-normality issue. DP is a distribution over probability measures that can be used to estimate unknown distributions. Consequently, the non-normality issue can be addressed by directly estimating the unknown random distributions of latent variables or measurement errors (i.e., obtaining the posteriors of the distributions).

The advantages of using BNP methods with DP priors have been discussed in the literature (e.g., Ghosal et al., 1999; MacEachern, 1999; Hjort, 2003; Müller and Mitra, 2004; Fahrmeir and Raach, 2007; Hjort et al., 2010). They do not constrain models to a specific parametric form, which may limit the scope and type of statistical inferences in many situations, especially when data are not normally distributed. Thus, a typical motivation for using BNP methods is that one is unwilling to make the somewhat arbitrary and unverified assumptions about latent variable or error distributions required in parametric modeling. Meanwhile, BNP methods provide full probability models for the data-generating process and lead to analytically tractable posterior distributions.

BNP methods have been applied to complex models. For example, Bush and MacEachern (1996), Kleinman and Ibrahim (1998), and Brown and Ibrahim (2003) used DP mixtures to handle non-normal random effects. Burr and Doss (2005) used a conditional DP to handle heterogeneous effect sizes in the context of meta-analysis. Ansari and Iyengar (2006) included Dirichlet components to build a semiparametric recurrent choice model. Dunson (2006) used dynamic mixtures of DPs to estimate the distributions of a latent variable that change non-parametrically across groups. Si and Reiter (2013) and Si et al. (2015) used DP mixtures of multinomial distributions for categorical data with missing values. The BNP approach has also been adapted to structural equation modeling to relax the normality assumption on the latent variables (e.g., Lee et al., 2008; Yang and Dunson, 2010). Tong and Zhang (2019) directly used a DP mixture to model non-normal data in growth curve modeling.

Although the application of BNP modeling has increased dramatically since the theoretical properties of BNP methods became better understood and their computational hurdles were removed (e.g., Neal, 2000), BNP modeling is still unfamiliar to the majority of researchers in the social and behavioral sciences. Additionally, there are technical issues that have not yet been fully addressed (Sharif-Razavian and Zollmann, 2009). The convergence issue is one such unanswered question. Non-convergence can occur when BNP methods are applied to complex models. Tong and Zhang (2019) found that non-convergence was largely caused by the precision parameter of the mixing DP. The precision parameter is a critical hyperparameter that governs the expected number of mixture components. When a non-informative prior was used for the precision parameter, non-convergence occurred or a longer computation time was observed (Tong and Zhang, 2019). Informative priors may help solve this issue. However, only a few studies have noticed and discussed the effect of the precision parameter in DP mixtures (e.g., West, 1992; Ohlssen et al., 2007; Jara et al., 2011). Ishwaran (2000) was among the few who studied informative priors for the precision parameter and suggested using the Gamma(2, 2) prior to encourage both small and large values of the precision parameter. In sum, despite its impact on model convergence, no study has systematically investigated how the prior for the precision parameter should be specified.

Therefore, in this study, we evaluate and compare non-informative, weakly informative, accurate informative, and inaccurate informative priors for the precision parameter of DP mixtures. We study how these priors influence model convergence, model estimation, and computation time in BNP growth curve modeling. In the next section, we introduce BNP growth curve modeling. After providing the conditional posterior distribution of the precision parameter, we use a simulation study to assess the impact of four types of priors for the precision parameter. Recommendations are provided at the end of the article. We also provide a guideline about the implementation of BNP growth curve modeling using R (R Core Team, 2019) in the Appendix.

2. Bayesian Non-parametric Growth Curve Modeling

We now introduce a typical growth curve model and a BNP method based on this model. Consider a longitudinal dataset with N subjects and T measurement occasions. Let yi=(yi1,...,yiT) be a T × 1 random vector with yij being a measurement from individual i at time j (i = 1, . . . , N; j = 1, . . . , T). A growth curve model without covariates can be written as

$$y_i = \Lambda b_i + e_i, \qquad b_i = \beta + u_i,$$

where Λ is a T × q factor loading matrix that determines the growth curves, bi is a q × 1 vector of random effects, and ei is a vector of measurement errors. The vector of random effects bi varies around its mean β. The residual vector ui represents the deviation of bi from β. When

$$\Lambda = \begin{pmatrix} 1 & 0 \\ 1 & 1 \\ \vdots & \vdots \\ 1 & T-1 \end{pmatrix}, \quad b_i = \begin{pmatrix} L_i \\ S_i \end{pmatrix}, \quad \text{and} \quad \beta = \begin{pmatrix} \beta_L \\ \beta_S \end{pmatrix},$$

the model is reduced to a linear growth curve model with random intercept Li and random slope Si. The mean intercept and slope are denoted as βL and βS, respectively.

Traditionally, ei and ui are assumed to follow multivariate normal distributions with zero mean vectors and covariance matrices Φ and Ψ, respectively, so ei ~ MNT(0, Φ) and ui ~ MNq(0, Ψ). Here, MN denotes a multivariate normal distribution and its subscript indicates the dimension. Measurement errors are often assumed to be uncorrelated with each other and to have equal variances across time. Statistically, this simplification means that the measurement error covariance matrix reduces to Φ = σe2 I, where σe2 is a scale parameter. In linear growth curve models, ui = (uLi, uSi)', and its covariance matrix is
$$\Psi = \mathrm{cov}(u_i) = \begin{pmatrix} \sigma_L^2 & \sigma_{LS} \\ \sigma_{LS} & \sigma_S^2 \end{pmatrix}.$$
Here, σL2 and σS2 represent the variances of the random intercept and slope across individuals, respectively, and σLS represents the covariance between the random intercept and slope.
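
For illustration, the following is a minimal R sketch (not the authors' code) of simulating data from this linear growth curve model under normality, using the population values adopted later in the simulation study (βL = 6.2, βS = 0.3, variances of the random intercept and slope of 1 and 0.1, σLS = 0, σe2 = 0.5, and T = 4).

```r
# Minimal sketch (not the authors' code): simulate the linear growth curve model
# under normality with the population values used in the simulation study.
library(MASS)  # for mvrnorm()

set.seed(1)
N  <- 200; Tt <- 4
Lambda <- cbind(1, 0:(Tt - 1))                  # T x 2 factor loading matrix
beta   <- c(6.2, 0.3)                           # mean intercept and slope
Psi    <- matrix(c(1, 0, 0, 0.1), 2, 2)         # covariance of random effects
sigma2_e <- 0.5                                 # measurement error variance

b <- mvrnorm(N, mu = beta, Sigma = Psi)         # random effects b_i
e <- matrix(rnorm(N * Tt, 0, sqrt(sigma2_e)), N, Tt)  # normal errors e_i
y <- b %*% t(Lambda) + e                        # N x T matrix of observations y_i
```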

BNP methods do not make arbitrary distributional assumptions as in the parametric modeling and thus are more flexible in handling non-normal data (e.g., Lee et al., 2008; Tong and Zhang, 2019). Unlike conventional non-parametric methods such as permutation tests, BNP methods use full probability models to describe the data-generating process and thus can derive posterior distributions for model parameters.

Within the scope of BNP modeling, the parametric distributions of latent variables and measurement errors in traditional methods are replaced by unknown random distributions. To estimate these unknown distributions, the DP is frequently used as the prior (Ferguson, 1973, 1974). Specifically, a random “sample” from a DP is itself a random distribution, which we denote as G. A DP has two hyperparameters, α and G0. The base distribution, G0, represents the central tendency or “mean” distribution in the distribution space. The precision parameter, α, quantifies how far realizations of G deviate from G0. According to Ferguson (1973), the DP is a conjugate prior that has two desirable properties: (1) a sufficiently large support, and (2) analytically manageable posterior distributions. Ferguson further derived the posterior of G, $DP(\tilde{\alpha}, \tilde{G}_0)$, where $\tilde{\alpha} = \alpha + N$ and

$$\tilde{G}_0 = \frac{\alpha}{\alpha + N}\, G_0 + \frac{N}{\alpha + N}\, G_N$$

with GN being the empirical distribution of the data. Notably, the posterior point estimate of G, $E(G \mid \text{data}) = \tilde{G}_0$, is a weighted average of the base distribution (or prior mean) G0 and the empirical distribution (or data) GN. When α = 0, the posterior point estimate reduces to the empirical distribution GN, which is purely non-parametric. When α approaches infinity, the posterior point estimate approaches G0, which is parametric. A common practice is to specify a gamma prior for α, which yields a posterior estimate that is neither 0 nor infinite.
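
As a small numerical illustration (not from the paper), the weight that the posterior point estimate places on the base distribution G0 is α/(α + N), which can be computed directly in R:

```r
# Numerical illustration (not from the paper): weight that E(G | data) places
# on the base distribution G0 for several values of the precision parameter.
weight_on_G0 <- function(alpha, N) alpha / (alpha + N)

round(weight_on_G0(alpha = c(0, 1, 10, 1000), N = 200), 3)
# 0.000 0.005 0.048 0.833 -- larger alpha pulls the estimate toward G0
```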

In BNP growth curve modeling, latent variables and/or measurement errors can be modeled non-parametrically. In this article, we focus on the distributional assumption for the measurement errors. When the normality of the measurement errors is in doubt, we assume that ei ~ Ge, where Ge is an unknown random distribution determined by the data. In the BNP framework, a DP is typically adopted to specify Ge. Because the distribution of ei is continuous but the DP is essentially discrete, a DP mixture (DPM) can be used to model the measurement errors such that

$$G_e = \begin{cases} D\big(\mu_e^{(1)}, \Phi^{(1)}\big) & \text{with } p = p_1 \\ D\big(\mu_e^{(2)}, \Phi^{(2)}\big) & \text{with } p = p_2 \\ \quad \vdots & \\ D\big(\mu_e^{(k)}, \Phi^{(k)}\big) & \text{with } p = p_k, \end{cases}$$

where D represents a predetermined multivariate distribution (e.g., multivariate normal, t, multinomial, etc.), and μe(k) and Φ(k), k = 1, ..., are the means and covariance matrices of the multivariate distribution in the kth component, which has probability pk. Theoretically, for an arbitrary distributional shape, there could be an infinite number of mixture components as k goes to infinity. In practice, a finite number of mixture components can often describe a distribution well, and the number of mixture components is determined by the DP precision parameter α. A smaller α yields a smaller number of mixture components; if α approaches infinity, there would be N mixture components, one associated with each subject. In other words, the precision parameter α determines the complexity of the model and how well the model fits the data, and thus may affect the convergence of the model. For the intraindividual measurement errors in the typical linear growth curve model, Tong and Zhang (2019) proposed that

$$e_i \mid \Phi_i \sim MN_T(0, \Phi_i), \qquad \Phi_i \mid G \sim G, \qquad G \sim DP(\alpha, G_0).$$

That is, the unknown distribution Ge is approximated by a mixture of multivariate normal distributions whose mixing measure has a DP prior, Ge ~ DPM. The DP prior DP(α, G0) can be obtained using the truncated stick-breaking construction (e.g., Sethuraman, 1994; Lunn et al., 2013). Specifically, $DP(\cdot) = \sum_{j=1}^{C} p_j \delta_{z_j}(\cdot)$, $1 \le C < \infty$, where C (1 ≤ C ≤ N, often set at a large number) is the maximum possible number of mixture components, $\delta_{z_j}(\cdot)$ denotes a point mass at zj, and zj ~ G0 independently. The random weights pj can be generated through the following procedure. With q1, q2, ..., qC ~ Beta(1, α), define

$$p_j' = q_j \prod_{k=1}^{j-1} (1 - q_k), \qquad j = 1, \dots, C.$$

Then, pj is obtained by

$$p_j = \frac{p_j'}{\sum_{k=1}^{C} p_k'}, \qquad (1)$$

to satisfy $\sum_{j=1}^{C} p_j = 1$. In practice, the updating of ei can proceed as in a typical DP mixture model and its distribution is an infinite mixture distribution1.
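
The truncated stick-breaking construction can be sketched in a few lines of R (an illustration, not the authors' code): the qj are drawn from Beta(1, α), the raw weights are formed as in the display above, and Equation (1) normalizes them so that they sum to one.

```r
# Minimal sketch (not the authors' code) of the truncated stick-breaking
# construction of the DP weights, followed by the normalization in Equation (1).
set.seed(1)
alpha <- 1        # DP precision parameter
C     <- 50       # truncation level (maximum number of mixture components)

q       <- rbeta(C, 1, alpha)
p_prime <- q * c(1, cumprod(1 - q)[-C])   # q_j * prod_{k<j} (1 - q_k)
p       <- p_prime / sum(p_prime)         # normalized weights; sum(p) equals 1
round(head(p, 5), 3)
```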

In general, the distribution of ei obtained through the truncated stick-breaking construction is

$$G_e = \begin{cases} D\big(\mu_e^{(1)}, \Phi^{(1)}\big) & \text{with } p = p_1 \\ D\big(\mu_e^{(2)}, \Phi^{(2)}\big) & \text{with } p = p_2 \\ \quad \vdots & \\ D\big(\mu_e^{(C)}, \Phi^{(C)}\big) & \text{with } p = p_C, \end{cases}$$

where D represents a predetermined multivariate distribution, μe(j) and Φ(j), j = 1, ..., C, are the means and covariance matrices of the multivariate distribution in the jth component, and pj is obtained using Equation (1). Given that the mean of ei is 0, we constrain $\sum_{j=1}^{C} p_j \mu_e^{(j)} = 0$. For simplicity, in this study, we follow Tong and Zhang (2019), use multivariate normal distributions for the mixing components, and constrain each μe(j) to be 0. We use inverse Wishart priors p(Φ(j)) = IW(n0, W0) for the covariance matrices of the mixture components, Φ(j), j = 1, ..., C. Following Lunn et al. (2013, p. 294), we fix the shape parameter n0 at a specific number and assign an inverse Wishart prior to the scale matrix W0. With this specification, the measurement error vector of individual i, ei, has probability pj of coming from the mixing component MN(0, Φ(j)); the measurement errors of other individuals may also come from the same component. Let K denote the number of occupied mixing components MN(0, Φ(j)), j = 1, ..., C. In other words, K is the number of latent classes for the ei, and K can be smaller than C (K ≤ C). Within each class, the ei come from the same distribution.
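
Continuing the sketch above (again an illustration with assumed toy values, not the authors' code), each ei can be drawn by first sampling a class label with probabilities pj and then drawing from the class-specific normal component; K is then the number of occupied components.

```r
# Continuing the stick-breaking sketch (toy values, not the authors' code):
# draw class labels with probabilities p, draw e_i from MN(0, Phi_j),
# and count the number of occupied components K (K <= C).
library(MASS)

Tt    <- 4
Phi   <- lapply(1:C, function(j) diag(runif(1, 0.3, 3), Tt))   # toy Phi^(j)
label <- sample(1:C, size = 200, replace = TRUE, prob = p)     # latent classes
e     <- t(sapply(label, function(j) mvrnorm(1, rep(0, Tt), Phi[[j]])))

K <- length(unique(label))   # number of occupied mixture components
K
```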

We would like to note that a similar approach to BNP modeling is finite mixture modeling (FMM). FMM estimates, or equivalently approximates, an unknown distribution using a mixture of known distributions. A key difference between FMM and BNP modeling is that the number of mixture components is treated as known in FMM, whereas this number is treated as unknown and is freely estimated in BNP modeling. As a result, when FMM is used to handle non-normality, additional analyses such as model comparison are needed to determine the unknown number of mixture components. BNP modeling is therefore believed to be more objective and data-driven, because it avoids additional analyses, such as model comparison, that may be vulnerable to subjectivity.

Bayesian methods are applied to estimate BNP growth curve models. Bayesian methods have become increasingly popular in recent years because of their flexibility and power in estimating models with complex structures (e.g., Lee and Shi, 2000; Lee and Song, 2004; Zhang et al., 2007; Lee and Xia, 2008; Tong and Zhang, 2012; Serang et al., 2015). The key idea of Bayesian methods is to compute the posterior distributions of the model parameters by combining the likelihood function and the priors. As introduced previously, β, Φ, and Ψ are the model parameters in the traditional growth curve model. In a BNP growth curve model, β and Ψ remain model parameters. In contrast, the measurement error covariance matrix Φ is not directly estimated; instead, we obtain the ei, from which we can get Φ. Another important parameter in BNP growth curve modeling is the precision parameter α. Let p(β, Ψ, α) be the joint prior distribution of the model parameters, and let L be the likelihood function. The joint posterior distribution of the model parameters is

$$p(\beta, \Psi, \alpha \mid y) \propto \int p(\beta, \Psi, \alpha) \times L \; db,$$

where b = (b1', ..., bN')'. This integral is difficult to evaluate analytically in practice. Instead, Markov chain Monte Carlo (MCMC) methods (e.g., Gibbs sampling; Robert and Casella, 2004) are often used to obtain parameter estimates and statistical inferences. Specifically, we first derive the conditional posterior distribution of each parameter. We then iteratively draw samples from the derived conditional posteriors to obtain empirical marginal distributions of the model parameters. Finally, statistical inferences are made based on the empirical marginal distributions (Geman and Geman, 1984).

3. Precision Parameter in BNP Models

The convergence issue in BNP growth curve modeling is likely related to the precision parameter (Tong and Zhang, 2019). Here, we provide a theoretical discussion on how the prior of the precision parameter can influence the number of latent classes for ei.

The DP precision parameter α is the key parameter governing the expected number of latent classes. It directly determines the distribution of K, the number of latent classes of ei. With a larger K, measurement errors of different individuals are more likely to have different distributions. West (1992) found that K asymptotically follows a Poisson distribution,

$$K = 1 + x, \qquad x \sim \text{Poisson}\big(\alpha(\gamma + \log N)\big), \qquad (2)$$

where γ is Euler's constant. Several percentiles of the distribution of K are given in Table 1. As shown in the table, K increases as α and N increase.

Table 1. Different percentiles (5, 50, 95%) of the distribution of the number of clusters K, given different values of the precision parameter α and sample size N.
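
Percentiles of this approximate distribution of K, such as those tabulated in Table 1, can be computed with a short R sketch (an illustration under the approximation in Equation 2, not the authors' code):

```r
# Sketch (assuming the shifted-Poisson approximation in Equation 2):
# percentiles of the number of clusters K for given alpha and N.
euler_gamma <- 0.5772157

K_percentiles <- function(alpha, N, probs = c(0.05, 0.50, 0.95)) {
  1 + qpois(probs, lambda = alpha * (euler_gamma + log(N)))
}

K_percentiles(alpha = 1,  N = 200)   # 5th, 50th, and 95th percentiles of K
K_percentiles(alpha = 10, N = 600)
```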

As discussed previously, a gamma prior Gamma(a1, a2) is often used for the hyperparameter α. Given such a prior, West (1992) derived the posterior of α as a mixture of two gamma densities

$$\alpha \mid \cdot \;\sim\; \pi_x\, \text{Gamma}(a_1 + K,\; a_2 - \log x) + (1 - \pi_x)\, \text{Gamma}(a_1 + K - 1,\; a_2 - \log x),$$

where x is an augmented variable, x | · ~ Beta(α + 1, N), and the weight πx is defined by $\pi_x / (1 - \pi_x) = (a_1 + K - 1) / \{N(a_2 - \log x)\}$. Although West (1992) also provided an approximation to the posterior of α, p(α | ·) ≈ Gamma(a1 + K − 1, a2 + γ + logN), how good this approximation is has not been investigated.
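
For illustration, one Gibbs update of α based on the mixture-of-gammas conditional posterior above can be sketched in R as follows (a sketch under the stated formulas, not the authors' sampler):

```r
# Sketch (not the authors' sampler) of one Gibbs update for alpha, using the
# augmented variable x and the mixture-of-gammas conditional posterior above,
# with a Gamma(a1, a2) prior on alpha (shape-rate parameterization).
update_alpha <- function(alpha, K, N, a1, a2) {
  x    <- rbeta(1, alpha + 1, N)                 # augmented variable x | .
  odds <- (a1 + K - 1) / (N * (a2 - log(x)))     # pi_x / (1 - pi_x)
  pi_x <- odds / (1 + odds)
  shape <- if (runif(1) < pi_x) a1 + K else a1 + K - 1
  rgamma(1, shape = shape, rate = a2 - log(x))
}

set.seed(1)
update_alpha(alpha = 1, K = 5, N = 200, a1 = 2, a2 = 2)
```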

A non-informative prior for α seems reasonable, especially when information about the number of latent classes is not available. However, a non-informative prior may cause non-convergence of the Markov chains. Therefore, it is worth evaluating different priors for the precision parameter.

4. A Simulation Study

We now present a simulation study to evaluate the influence of the prior for the precision parameter in BNP growth curve modeling when data are normally distributed and when they contain outliers2. The linear growth curve model introduced in the previous section is used, and the measurement errors are modeled non-parametrically to address the non-normality. Based on the results of previous studies, the number of time points (T), the covariance between the random intercept and slope (σLS), and the measurement error variance (σe2) have trivial effects on the performance of BNP growth curve modeling (e.g., Tong and Zhang, 2019). Therefore, we only consider one set of values for these parameters in this study. We follow the empirical data analysis results in Tong and Zhang (2019) to select the population parameter values: the fixed effects are β = (βL, βS) = (6.2, 0.3); the number of measurement occasions is T = 4; the measurement error variance is σe2 = 0.5; the variances of the random intercept and slope are 1 and 0.1, respectively; and the covariance between the random intercept and slope is σLS = 0.

Three potentially influential factors are manipulated in the simulation study: sample size, data distribution, and the precision parameter prior. First, two sample sizes are considered, N = 200 or 600, representing small and large samples. Second, data are either normally distributed or contaminated by outliers. When generating outliers, three outlier proportions are considered: r% = 5, 10, or 20%. To generate outliers, we randomly select r% of the observations at each measurement occasion and replace them with extreme values. The extreme values are generated from 10 different distributions with a large mean of Li + Si(j − 1) + mσe, where m ≥ 5 is generated from a truncated Poisson distribution, and a variance of σe2, the same as that of the normal data. As a result, the true distribution of the data is a mixture of 11 distributions. Outliers generated in this way conform to the definition of outliers (Yuan and Zhong, 2008; Tong and Zhang, 2017). See Supplementary Figures 1, 2 for the shapes of the generated normal data and data with outliers. Third, four priors for the precision parameter are investigated (see Figure 1): a diffuse prior Gamma(0.001, 0.001), a weakly informative prior Gamma(2, 2) suggested by Ishwaran (2000), an accurate informative prior Gamma(100, 100), and an inaccurate informative prior Gamma(10, 100). Gamma(10, 100) is an inaccurate informative prior because its mean is 0.1 and its variance is as small as 0.001; according to Table 1, the resulting number of latent classes ranges from 1 to 3, whereas the true number of underlying mixture distributions is 11. For all other model parameters, conventional non-informative priors such as those in Zhang et al. (2013) are used. Specifically, the fixed effects β have non-informative diffuse priors N(0, 10^6). The covariance matrix of the random intercept and slope, Ψ, has an inverse Wishart prior with an identity scale matrix and 2 degrees of freedom.
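
The four priors can be visualized and summarized with a short R sketch (for illustration only; Gamma(a1, a2) is parameterized by shape a1 and rate a2, so the prior mean is a1/a2 and the prior variance is a1/a2^2):

```r
# Illustration (not the authors' code): densities, means, and variances of the
# four precision parameter priors, parameterized as Gamma(shape, rate).
curve(dgamma(x, shape = 2, rate = 2), from = 0, to = 3,
      xlab = expression(alpha), ylab = "density")                   # weakly informative
curve(dgamma(x, shape = 100,   rate = 100),   add = TRUE, lty = 2)  # accurate informative
curve(dgamma(x, shape = 10,    rate = 100),   add = TRUE, lty = 3)  # inaccurate informative
curve(dgamma(x, shape = 0.001, rate = 0.001), add = TRUE, lty = 4)  # diffuse

shape <- c(0.001, 2, 100, 10); rate <- c(0.001, 2, 100, 100)
rbind(mean = shape / rate, variance = shape / rate^2)
```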

Figure 1. Density curves for the four precision parameter priors used in the simulation study.

In each simulation condition, 500 datasets are generated. BNP growth curve modeling is applied to each dataset using JAGS through the rjags package in R (Plummer, 2017; R Core Team, 2019). The total length of the Markov chains is set at 50,000 iterations, and the first half of the iterations is discarded as the burn-in period3. We assess how the different priors affect the model convergence rate, parameter estimation, and computation time.
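
The estimation workflow can be sketched as follows (an illustration only: the model file name "bnp_gcm.jags", the data list jags_data, and the monitored parameter names are placeholders; the authors' actual model and estimation code are available on their GitHub site):

```r
# Sketch of the rjags workflow (placeholder model file, data list, and monitored
# parameter names; the authors' actual code is on their GitHub repository).
library(rjags)

model <- jags.model("bnp_gcm.jags", data = jags_data,
                    n.chains = 1, n.adapt = 1000)
update(model, n.iter = 25000)                       # burn-in: first half of 50,000
samples <- coda.samples(model,
                        variable.names = c("beta", "Psi", "alpha"),
                        n.iter = 25000)             # retained draws
summary(samples)
```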

Geweke tests (Geweke, 1991) are used for convergence diagnostics. After the burn-in period, if the parameter values are sampled from the stationary distribution of the chain, the means of the first and last parts of the Markov chain (by default, the first 10% and the last 50%) should be equal, and Geweke's statistic asymptotically follows a standard normal distribution. A Markov chain is considered converged when Geweke's statistic is between −1.96 and 1.96. If none of the convergence diagnostics (i.e., Geweke tests) for the model parameters suggests non-convergence, the model is said to have converged. In each simulation condition, the convergence rate is defined as the proportion of converged models out of the 500 generated replications.
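
This check can be carried out with the coda package, for example as in the sketch below (assuming samples is the mcmc.list produced by coda.samples in the sketch above):

```r
# Sketch of the convergence check (assumes `samples` from the rjags sketch):
# Geweke z-scores per monitored parameter; |z| < 1.96 counts as converged.
library(coda)

z <- geweke.diag(samples, frac1 = 0.1, frac2 = 0.5)[[1]]$z
converged <- all(abs(z) < 1.96)   # model counted as converged only if all pass
converged
```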

For the assessment of model estimation, we obtain the bias, average standard error (ASE), empirical standard error (ESE), mean squared error (MSE), and coverage probability (CP) of the 95% highest posterior density (HPD) credible interval for each parameter, based on the converged simulation replications4.

In addition, the estimation time (in seconds) is recorded for each replication. The average estimation time (AET) is the average of the estimation time for all the converged replications.
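
A minimal sketch of these evaluation criteria (not the authors' code), assuming per-replication vectors of posterior means, posterior standard deviations, and HPD interval bounds for a single parameter with known true value:

```r
# Sketch (not the authors' code) of the evaluation criteria defined above and
# in footnote 4, for one parameter across converged replications.
evaluate <- function(est, se, lower, upper, truth) {
  bias <- mean(est) - truth
  ase  <- mean(se)                               # average standard error
  ese  <- sd(est)                                # empirical standard error
  mse  <- bias^2 + ese^2                         # squared bias + squared ESE
  cp   <- mean(lower <= truth & truth <= upper)  # coverage of 95% HPD intervals
  c(bias = bias, ASE = ase, ESE = ese, MSE = mse, CP = cp)
}
```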

All program code and detailed results for the simulation study are available on our GitHub site: https://github.com/CynthiaXinTong/PrecisionParPrior_BNP_GCM.

4.1. Main Results

Figure 2 shows the convergence rates of BNP growth curve modeling with the different precision parameter priors when the sample size is 200. The figure clearly shows that outliers harm model convergence. Note that the convergence rate for data with 5% outliers is the lowest. This may be because a small proportion of outliers (e.g., 5%) creates a steep, high-curvature region that the Markov chain must enter, making convergence more difficult. As the outlier proportion increases, the curvature becomes smoother, so the convergence rate is higher. Among the four studied priors, the non-informative prior for the precision parameter always leads to the lowest convergence rate, less than 30% across all simulation conditions. Informative priors substantially increase the model convergence rate. Specifically, the convergence rate doubles when we switch from the non-informative prior to the weakly informative prior suggested by Ishwaran (2000) in the condition with normal data, and the increase is about 30% of the original convergence rate in the conditions with outliers. Both accurate and inaccurate informative priors lead to higher convergence rates, and the importance of using informative priors is more salient when data are not normal. Note that the inaccurate informative prior yields slightly higher convergence rates than the accurate informative prior because the variance of the inaccurate prior is lower and thus its precision is higher. When N = 600, the model convergence results for BNP growth curve models follow the same pattern and thus are not reported here.

Figure 2. Convergence rate for different priors when N = 200.

For the converged replications, we evaluate the impact of the precision parameter priors on parameter estimation and computation time. Results for N = 200 are summarized in Tables 2–5. The relative performance of the four priors in the conditions with a larger sample size (N = 600) shows a similar pattern; detailed results for N = 600 are available in the Supplementary Document.

Table 2. Model estimation for BNP growth curve modeling with different precision parameter priors when data are normal and N = 200.

Table 3. Model estimation for BNP growth curve modeling with different precision parameter priors when data contain 5% of outliers and N = 200.

Table 4. Model estimation for BNP growth curve modeling with different precision parameter priors when data contain 10% of outliers and N = 200.

Table 5. Model estimation for BNP growth curve modeling with different precision parameter priors when data contain 20% of outliers and N = 200.

From Tables 2–5, we obtain the following findings. First, the estimates of the growth curve parameters (βL, βS, σL2, σS2, σLS, σe2) are not affected by the different priors. The estimation bias, standard errors, MSE, and coverage probabilities of the 95% HPD credible intervals are very close to each other across the different precision parameter prior conditions. Note that when outliers exist (see Tables 3–5), the true population value of the measurement error variance σe2 is unknown, so the bias, MSE, and CP for this parameter cannot be calculated.

Second, the estimation of the hyperparameter α is greatly affected by the different priors. When the non-informative prior is used, the estimated α can be very large (e.g., 28.284 in Table 3) or very small (e.g., 0.019 in Table 5) and is associated with a large standard error. When Gamma(2, 2) or Gamma(100, 100) is used, the estimated α is almost always close to 1, and when Gamma(10, 100) is used, the estimated α is around 0.1. Different α values imply different numbers of latent classes K; in general, a larger α yields a larger number of latent classes. Since the estimated α has a large standard error when the non-informative diffuse prior is used, the corresponding estimated K can also be large or small. For the weakly informative and accurate informative priors, the estimated number of latent classes ranges from 4 to 6 across data conditions, whereas for the inaccurate informative prior, the estimated number of latent classes is about 2 or 3. It is interesting that, although distinctly different hyperparameter estimates are obtained, leading to different numbers of latent classes, the estimated growth curve parameters are essentially the same. This is because, although the outliers are generated from 10 different distributions, these distributions are not separated far apart. With such low class separation, one distribution may be enough to describe outliers generated from several different distributions. Thus, even the inaccurate informative prior can yield a precision parameter that is adequate for modeling the measurement errors.

Third, BNP growth curve modeling with the inaccurate informative prior Gamma(10, 100) requires the shortest computation time. This is because the inaccurate informative prior has the smallest variance and thus is the most “informative” among the four priors.

Fourth, outliers affect the performance of BNP growth curve modeling. When data contain a large proportion of outliers (e.g., 20%), the estimation biases for the mean of the random intercepts βL and the variance of the random intercepts σL2 are much larger than when the outlier proportion is low. In addition, outliers influence computation time. It is worth mentioning that computation is most time consuming when the outlier proportion is 5%. A possible reason is that a small proportion of outliers creates a steep, high-curvature region that the Markov chains must enter, and the chains thus take longer to converge. With more outliers, the curvature is smoother, so the computation is faster.

5. Discussion

Restricting analyses to a parametric probability family can delude investigators and create a false illusion of posterior certainty (Müller and Mitra, 2004). In contrast, BNP methods are adaptive and powerful for discovering complex patterns in real data. Although BNP growth curve modeling has been proposed, the effect of the precision parameter had not been fully studied. In this article, we conducted a simulation study to investigate how different types of precision parameter priors affect the convergence rate, model estimation, and computation time in BNP growth curve modeling. We found that the non-informative prior suffered from the lowest convergence rates, while the inaccurate informative prior with the smallest prior variance yielded the highest convergence rates and the fastest computation. Furthermore, we found that the estimation of the growth curve parameters was not affected by the prior of the precision parameter. Based on these results, we recommend using informative priors with high precision in practice.

We would like to note that although it may seem counterintuitive that the inaccurate informative prior for the precision parameter performed best, such findings have been reported in the literature. For example, Finch and Miller (2019) found that slightly informative priors can be advantageous in small samples even when these priors are incorrect. Depaoli (2013) showed that growth mixture model estimates obtained with inaccurate priors were still more accurate than maximum likelihood estimates or Bayesian estimates with diffuse priors. Zitzmann et al. (2020) explicitly discussed this issue for small samples. Our simulation results also support the argument that the amount of information in the prior can be more important than the accuracy of the prior under certain circumstances.

We also want to point out that the estimation bias was relatively large in our simulation study compared to that in previous studies (Tong and Zhang, 2019). This is because we considered much higher outlier proportions. When the outlier proportion is low (i.e., 5%), the parameter estimates are very close to the true population values; as the outlier proportion increases, the bias increases. One possible way to improve the performance of BNP growth curve modeling when the outlier proportion is high is to use a non-normal base distribution. In our simulation study, for simplicity, we used normal distributions with zero means as the mixing components of the BNP model. This specification cannot handle asymmetric non-normal distributions, which may partly explain the less satisfactory performance of BNP modeling in the conditions with high outlier proportions. BNP methods in general, however, are very flexible, and a non-normal base distribution may overcome this limitation. While future studies may continue along this path, we want to emphasize that BNP modeling as implemented in our study still outperforms traditional growth curve modeling and is recommended in general when data are suspected to be non-normal (Tong and Zhang, 2019), regardless of whether the non-normality is caused by a non-normal population distribution or by data contamination.

The convergence rate of BNP growth curve modeling was found to be higher in previous studies, i.e., close to one (Tong and Zhang, 2019). We would like to note that the difference is likely due to the set of parameters counted during convergence assessment. In Tong and Zhang (2019), the convergence rate was computed only for the growth curve parameters. When only the growth curve parameters are considered, non-convergence rarely occurred in our study either; the major problem is the precision parameter. As shown in the simulation study, non-convergence frequently arose for this parameter (detailed Geweke test results for each parameter are available on our GitHub site: https://github.com/CynthiaXinTong/PrecisionParPrior_BNP_GCM). Another possible reason why the convergence rates were relatively low (below 70%) in our simulation is that Geweke tests often yield lower convergence rates than other diagnostic methods (e.g., Jang and Cohen, 2020). However, as pointed out by Jang and Cohen, the pattern of convergence rates across models was similar for different diagnostic tests; thus, our conclusions about which precision parameter priors to use in BNP growth curve modeling should not be affected by the choice of diagnostic test. We further discuss the use of Geweke tests in the next paragraph. Notably, although the non-convergence of the precision parameter did not appear to affect the estimates of the growth curve parameters, it may mislead model fit assessment. Although model assessment and model comparison methods have been proposed for various models, sample sizes, and data structures (e.g., Celeux et al., 2006), their performance in BNP analysis has not been studied. Therefore, future studies on how different precision parameter priors affect model fit assessment are encouraged.

In our study, model convergence diagnostics were conducted using Geweke tests. Although Geweke tests are commonly used in the Bayesian literature, it is impossible to say with certainty that a finite sample from an MCMC algorithm is representative of the underlying stationary distribution, and a combination of strategies for evaluating and accelerating MCMC sampler convergence is recommended (Cowles and Carlin, 1996). For our simulation study, Geweke tests were relatively easy to implement systematically. In empirical studies, we recommend using multiple strategies (e.g., trace plots, multiple chains) to check model convergence. In addition, since Zitzmann and Hecht (2019) pointed out that the approximation of the Bayesian estimates may not be optimal even when a chain converges, we recommend that substantive researchers conduct sensitivity analyses and evaluate how the length of the Markov chains affects the model estimation results.

Our study echoes the previous literature in that using informative priors may help reduce computation time in Bayesian modeling. We would like to note that there are other approaches that can be used to further increase computational efficiency. For example, Berger et al. (2020) and Daniels and Kass (1999) proposed shrinkage priors, and Hecht et al. (2020) proposed a model reformulation approach in which the sample covariance matrix is modeled instead of the individual observations. The latter approach has been applied to the Bayesian continuous-time model (Hecht and Zitzmann, 2020) as well as the Bayesian STARTS model (Ludtke et al., 2018). Future research on BNP growth curve modeling could incorporate this approach and other potentially efficient approaches to reduce computation time.

The application of BNP growth curve modeling is still in its early stages, and new DP variants and generalizations are proposed every year to cater to specific applications. In our study, BNP modeling was used only to handle non-normality in the intraindividual measurement errors; a similar strategy can be used for the random effects, such as the random intercepts and slopes. Also, although we worked with balanced data, BNP growth curve modeling should be able to handle unbalanced data (e.g., individually varying time points). However, as implied by previous studies (Tong, 2014), the convergence issue may be more challenging in that setting and awaits future studies.

Data Availability Statement

The raw data supporting the conclusions of this article can be generated by running the data generation R code on our GitHub site: https://github.com/CynthiaXinTong/PrecisionParPrior_BNP_GCM.

Author Contributions

All authors listed have made a substantial, direct and intellectual contribution to the work, and approved it for publication.

Funding

This paper is based upon work supported by the National Science Foundation under grant no. SES-1951038.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2021.624588/full#supplementary-material

Footnotes

1. ^In practice, infinite-dimension means finite but unbounded dimension.

2. ^Note that non-normal data may be caused by non-normal population distributions or by data contamination. We work with outliers in this simulation study because BNP methods are essentially infinite mixture modeling procedures, and generating and handling outliers from multiple different distributions is more manageable because the true number of underlying classes is easily known. It is worth verifying the conclusions of this paper for non-normal population distributions in the future.

3. ^Multiple lengths of Markov chains were tested before the current setting was selected. The convergence results with 50,000 iterations were about the same as those for longer chains.

4. ^ASE is the mean estimated standard error across replications. ESE is the standard deviation of the parameter estimates across replications. MSE is computed as the squared bias plus the squared ESE. A posterior credible interval, also simply called a credible interval, is the Bayesian counterpart of the frequentist confidence interval. An HPD interval is the narrowest interval on a posterior that covers a given proportion of the most probable posterior values.

References

Ansari, A., and Iyengar, R. (2006). Semiparametric Thurstonian models for recurrent choices: A Bayesian analysis. Psychometrika 71, 631–657. doi: 10.1007/s11336-006-1233-5

Berger, J. O., Sun, D., and Song, C. (2020). Bayesian analysis of the covariance matrix of a multivariate normal distribution with a new class of priors. Ann. Stat. 48, 2381–2403. doi: 10.1214/19-AOS1891

Brown, E. R., and Ibrahim, J. G. (2003). A Bayesian semiparametric joint hierarchical model for longitudinal and survival data. Biometrics 59, 221–228. doi: 10.1111/1541-0420.00028

Burr, D., and Doss, H. (2005). A Bayesian semiparametric model for random-effects meta-analysis. J. Am. Stat. Assoc. 100, 242–251. doi: 10.1198/016214504000001024

Bush, C. A., and MacEachern, S. N. (1996). A semiparametric Bayesian model for randomised block designs. Biometrika 83, 275–285. doi: 10.1093/biomet/83.2.275

Cain, M. K., Zhang, Z., and Yuan, K.-H. (2017). Univariate and multivariate skewness and kurtosis for measuring nonnormality: prevalence, influence and estimation. Behav. Res. Methods 49, 1716–1735. doi: 10.3758/s13428-016-0814-1

Celeux, G., Forbes, F., Robert, C. P., and Titterington, D. M. (2006). Deviance information criteria for missing data models. Bayesian Anal. 1, 651–673. doi: 10.1214/06-ba122

Chou, C.-P., Bentler, P. M., and Satorra, A. (1991). Scaled test statistics and robust standard errors for non-normal data in covariance structure analysis: a monte carlo study. Br. J. Math. Stat. Psychol. 44, 347–357. doi: 10.1111/j.2044-8317.1991.tb00966.x

Cowles, M. K., and Carlin, B. P. (1996). Markov chain monte carlo convergence diagnostics: a comparative review. J. Am. Stat. Assoc. 91, 883–904.

Curran, P. J., West, S. G., and Finch, J. F. (1996). The robustness of test statistics to nonnormality and specification error in confirmatory factor analysis. Psychol. Methods 1, 16–29. doi: 10.1037/1082-989x.1.1.16

Daniels, M. J., and Kass, R. E. (1999). Nonconjugate Bayesian estimation of covariance matrices and its use in hierarchical models. J. Am. Stat. Assoc. 94, 1254–1263. doi: 10.2307/2669939

Depaoli, S. (2013). Mixture class recovery in GMM under varying degrees of class separation: frequentist versus Bayesian estimation. Psychol. Methods 18, 186–219. doi: 10.1037/a0031609

Dunson, D. B. (2006). Bayesian dynamic modeling of latent trait distributions. Biostatistics 7, 551–568. doi: 10.1093/biostatistics/kxj025

Fahrmeir, L., and Raach, A. (2007). A Bayesian semiparametric latent variable model for mixed responses. Psychometrika 72, 327–346. doi: 10.1007/s11336-007-9010-7

Ferguson, T. (1973). A Bayesian analysis of some nonparametric problems. Ann. Stat. 1, 209–230. doi: 10.1214/aos/1176342360

Ferguson, T. (1974). Prior distributions on spaces of probability measures. Ann. Stat. 2, 615–629. doi: 10.1214/aos/1176342752

Finch, W. H., and Miller, J. E. (2019). The use of incorrect informative priors in the estimation of MIMIC model parameters with small sample sizes. Struct. Equat. Model. 26, 497–508. doi: 10.1080/10705511.2018.1553111

Geman, S., and Geman, D. (1984). Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Mach. Intell. 6, 721–741. doi: 10.1016/b978-0-08-051581-6.50057-x

Gershman, S. J., and Blei, D. M. (2012). A tutorial on Bayesian nonparametric models. J. Math. Psychol. 56, 1–12. doi: 10.1016/j.jmp.2011.08.004

Geweke, J. (1991). “Evaluating the accuracy of sampling-based approaches to calculating posterior moments,” in Bayesian Statistics 4, eds J. M. Bernado, J. O. Berger, A. P. Dawid, and A. F. M. Smith (Oxford: Clarendon Press), 169–193. doi: 10.21034/sr.148

Ghosal, S., Ghosh, J., and Ramamoorthi, R. (1999). Posterior consistency of Dirichlet mixtures in density estimation. Ann. Stat. 27, 143–158. doi: 10.1214/aos/1018031105

Hecht, M., Gische, C., Vogel, D., and Zitzmann, S. (2020). Integrating out nuisance parameters for computationally more efficient Bayesian estimation - An illustration and tutorial. Struct. Equat. Model. 27, 483–493. doi: 10.1080/10705511.2019.1647432

Hecht, M., and Zitzmann, S. (2020). A computationally more efficient Bayesian approach for estimating continuous-time models. Struct. Equat. Model. 27, 829–840. doi: 10.1080/10705511.2020.1719107

Hjort, N. L. (2003). "Topics in nonparametric Bayesian statistics," in Highly Structured Stochastic Systems, eds P. Green, N. L. Hjort, and S. Richardson (Oxford: Oxford University Press), 455–487.

Hjort, N. L., Holmes, C., Müller, P., and Walker, S. G. (2010). Bayesian Nonparametrics. Cambridge: Cambridge University Press. doi: 10.1093/acprof:oso/9780199695607.003.0013

Ishwaran, H. (2000). “Inference for the random effects in Bayesian generalized linear mixed models,” in ASA Proceedings of the Bayesian Statistical Science Section, ed A. S. Association (London), 1–10.

Jang, Y., and Cohen, A. S. (2020). The impact of Markov chain convergence on estimation of mixture IRT model parameters. Educ. Psychol. Meas. 80, 975–994. doi: 10.1177/0013164419898228

Jara, A., Hanson, T., Quintana, F., Müller, P., and Rosner, G. (2011). DPpackage: Bayesian semi- and nonparametric modeling in R. J. Stat. Softw. 40, 1–30. doi: 10.18637/jss.v040.i05

Kleinman, K. P., and Ibrahim, J. G. (1998). A semiparametric Bayesian approach to the random effects model. Biometrics 54, 921–938. doi: 10.2307/2533846

Lee, S. Y., Lu, B., and Song, X. Y. (2008). Semiparametric Bayesian analysis of structural equation models with fixed covariates. Stat. Med. 27, 2341–2360. doi: 10.1002/sim.3098

Lee, S. Y., and Shi, J. Q. (2000). Joint Bayesian analysis of factor scores and structural parameters in the factor analysis model. Ann. Inst. Stat. Math. 52, 722–736. doi: 10.1023/a:1017529427433

Lee, S. Y., and Song, X. Y. (2004). Bayesian model comparison of nonlinear structural equation models with missing continuous and ordinal categorical data. Br. J. Math. Stat. Psychol. 57, 131–150. doi: 10.1348/000711004849204

Lee, S. Y., and Xia, Y. M. (2008). A robust bayesian approach for structural equation models with missing data. Psychometrika 73, 343–364. doi: 10.1007/s11336-008-9060-5

Lu, Z., and Zhang, Z. (2014). Robust growth mixture models with non-ignorable missingness: models, estimation, selection, and application. Comput. Stat. Data Anal. 71, 220–240. doi: 10.1016/j.csda.2013.07.036

Ludtke, O., Robitzsch, A., and Wagner, J. (2018). More stable estimation of the STARTS model: a Bayesian approach using Markov Chain Monte Carlo techniques. Psychol. Methods 23, 570–593. doi: 10.1037/met0000155

Lunn, D., Jackson, C., Best, N., Thomas, A., and Spiegelhalter, D. (2013). The BUGS Book: A Practical Introduction to Bayesian Analysis. Boca Raton, FL: CRC Press.

MacEachern, S. (1999). “Dependent nonparametric processes,” in ASA Proceedings of the Section on Bayesian Statistical Science, ed A. S. Association (Alexandria, VA).

Maronna, R. A., Martin, R. D., and Yohai, V. J. (2006). Robust Statistics: Theory and Methods. New York, NY: John Wiley & Sons, Inc. doi: 10.1002/0470010940

McArdle, J. J., and Nesselroade, J. R. (2014). Longitudinal Data Analysis Using Structural Equation Models. Washington, DC: American Psychological Association. doi: 10.1037/14440-000

Meredith, W., and Tisak, J. (1990). Latent curve analysis. Psychometrika 55, 107–122. doi: 10.1007/bf02294746

Micceri, T. (1989). The unicorn, the normal curve, and other improbable creatures. Psychol. Bull. 105, 156–166. doi: 10.1037/0033-2909.105.1.156

Müller, P., and Mitra, R. (2004). Bayesian nonparametric inference - why and how. Bayesian Anal. 1, 1–33. doi: 10.1214/13-ba811

Muthén, B., and Shedden, K. (1999). Finite mixture modeling with mixture outcomes using the EM algorithm. Biometrics 55, 463–469. doi: 10.1111/j.0006-341X.1999.00463.x

Neal, R. M. (2000). Markov chain sampling methods for Dirichlet process mixture models. J. Comput. Graph. Stat. 9, 249–265. doi: 10.2307/1390653

Ohlssen, D. I., Sharples, L. D., and Spiegelhalter, D. (2007). Flexible random-effects models using Bayesian semi-parametric models: applications to institutional comparisons. Stat. Med. 26, 2088–2112. doi: 10.1002/sim.2666

Pendergast, J. F., and Broffitt, J. D. (1985). Robust estimation in growth curve models. Commun. Stat. Theor. Methods 14, 1919–1939. doi: 10.1080/03610928508829021

Plummer, M. (2017). Jags Version 4.3. 0 User Manual.

R Core Team (2019). R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing.

Robert, C. P., and Casella, G. (2004). Monte Carlo Statistical Methods. New York, NY: Springer. doi: 10.1007/978-1-4757-4145-2

Serang, S., Zhang, Z., Helm, J., Steele, J. S., and Grimm, K. J. (2015). Evaluation of a Bayesian approach to estimating nonlinear mixed-effects mixture models. Struct. Equat. Model. 22, 202–215. doi: 10.1080/10705511.2014.937322

Sethuraman, J. (1994). A constructive definition of Dirichlet priors. Stat. Sin. 4, 639–650.

Sharif-Razavian, N., and Zollmann, A. (2009). An Overview of Nonparametric Bayesian Models and Applications to Natural Language Processing. Available online at: http://www.cs.cmu.edu/~zollmann/publications/nonparametric.pdf

Si, Y., and Reiter, J. P. (2013). Nonparametric Bayesian multiple imputation for incomplete categorical variables in large-scale assessment surveys. J. Educ. Behav. Stat. 38, 499–521. doi: 10.3102/1076998613480394

Si, Y., Reiter, J. P., and Hillygus, D. S. (2015). Semi-parametric selection models for potentially non-ignorable attrition in panel studies with refreshment samples. Polit. Anal. 23, 92–112. doi: 10.1093/pan/mpu009

Silvapulle, M. J. (1992). On M-methods in growth curve analysis with asymmetric errors. J. Stat. Plan. Inference 32, 303–309. doi: 10.1016/0378-3758(92)90013-i

Singer, J. M., and Sen, P. K. (1986). M-methods in growth curve analysis. J. Stat. Plan. Inference 13, 251–261. doi: 10.1016/0378-3758(86)90137-0

Tong, X. (2014). Robust semiparametric bayesian methods in growth curve modeling (Unpublished doctoral dissertation). University of Notre Dame, Notre Dame, IN, United States.

Tong, X., and Zhang, Z. (2012). Diagnostics of robust growth curve modeling using Student's t distribution. Multivariate Behav. Res. 47, 493–518. doi: 10.1080/00273171.2012.692614

Tong, X., and Zhang, Z. (2017). Outlying observation diagnostics in growth curve modeling. Multivariate Behav. Res. 52, 768–788. doi: 10.1080/00273171.2017.1374824

Tong, X., and Zhang, Z. (2019). Robust Bayesian approaches in growth curve modeling: using Student's t distributions versus a semiparametric method. Struct. Equat. Model. 27, 544–560. doi: 10.1080/10705511.2019.1683014

West, M. (1992). Hyperparameter Estimation in Dirichlet Process Mixture Models. Technical Report 92-A03, Duke University, ISDS.

Yang, M., and Dunson, D. B. (2010). Bayesian semiparametric structural equation models with latent variables. Psychometrika 75, 675–693. doi: 10.1007/s11336-010-9174-4

Yuan, K.-H., and Bentler, P. M. (1998). Structural equation modeling with robust covariances. Sociol. Methodol. 28, 363–396. doi: 10.1111/0081-1750.00052

Yuan, K.-H., and Bentler, P. M. (2001). Effect of outliers on estimators and tests in covariance structure analysis. Br. J. Math. Stat. Psychol. 54, 161–175. doi: 10.1348/000711001159366

Yuan, K.-H., and Zhong, X. (2008). Outliers, high-leverage observations and influential cases in factor analysis: Minimizing their effect using robust procedures. Sociol. Methodol. 38, 329–368. doi: 10.1111/j.1467-9531.2008.00198.x

Zhang, Z. (2016). Modeling error distributions of growth curve models through Bayesian methods. Behav. Res. Methods 48, 427–444. doi: 10.3758/s13428-015-0589-9

Zhang, Z., Hamagami, F., Wang, L., Grimm, K. J., and Nesselroade, J. R. (2007). Bayesian analysis of longitudinal data using growth curve models. Int. J. Behav. Dev. 31, 374–383. doi: 10.1177/0165025407077764

Zhang, Z., Lai, K., Lu, Z., and Tong, X. (2013). Bayesian inference and application of robust growth curve models using Student's t distribution. Struct. Equat. Model. 20, 47–78. doi: 10.1080/10705511.2013.742382

Zhong, X., and Yuan, K.-H. (2010). “Weights,” in Encyclopedia of Research Design, ed N. J. Salkind (Thousand Oaks, CA: Sage), 1617–1620. doi: 10.4135/9781412961288

Zhong, X., and Yuan, K.-H. (2011). Bias and efficiency in structural equation modeling: maximum likelihood versus robust methods. Multivariate Behav. Res. 46, 229–265. doi: 10.1080/00273171.2011.558736

Zitzmann, S., and Hecht, M. (2019). Going beyond convergence in Bayesian estimation: why precision matters too and how to assess it. Struct. Equat. Model. 26, 646–661. doi: 10.1080/10705511.2018.1545232

Zitzmann, S., Ludtke, O., Robitzsch, A., and Hecht, M. (2020). On the performance of Bayesian approaches in small samples: a comment on Smid, McNeish, Miocevic, and van de Schoot. Struct. Equat. Model. 28, 40–50. doi: 10.1080/10705511.2020.1752216

Keywords: non-parametric Bayesian, robust method, growth curve modeling, Dirichlet process mixture, prior, precision parameter

Citation: Tong X and Ke Z (2021) Assessing the Impact of Precision Parameter Prior in Bayesian Non-parametric Growth Curve Modeling. Front. Psychol. 12:624588. doi: 10.3389/fpsyg.2021.624588

Received: 31 October 2020; Accepted: 09 February 2021;
Published: 31 March 2021.

Edited by:

Haiyan Liu, University of California, Merced, United States

Reviewed by:

Sonja Winter, University of California, Merced, United States
Steffen Zitzmann, University of Tübingen, Germany
Kenneth Wilcox, University of Notre Dame, United States

Copyright © 2021 Tong and Ke. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zijun Ke, keziyun@mail.sysu.edu.cn

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.