# Assessing the Impact of Precision Parameter Prior in Bayesian Non-parametric Growth Curve Modeling

^{1}Department of Psychology, University of Virginia, Charlottesville, VA, United States^{2}Department of Psychology, Sun Yat-sen University, Guangzhou, China

Bayesian non-parametric (BNP) modeling has been developed and proven to be a powerful tool to analyze messy data with complex structures. Despite the increasing popularity of BNP modeling, it also faces challenges. One challenge is the estimation of the precision parameter in the Dirichlet process mixtures. In this study, we focus on a BNP growth curve model and investigate how non-informative prior, weakly informative prior, accurate informative prior, and inaccurate informative prior affect the model convergence, parameter estimation, and computation time. A simulation study has been conducted. We conclude that the non-informative prior for the precision parameter is less preferred because it yields a much lower convergence rate, and growth curve parameter estimates are not sensitive to informative priors.

## 1. Introduction

Bayesian non-parametric (BNP) modeling, also called semiparametric Bayesian modeling in the literature, has been recognized as a valuable data analytical technique due to its great flexibility and adaptivity (e.g., Müller and Mitra, 2004; Gershman and Blei, 2012). It is rapidly gaining popularity among methodologists and practitioners and has been applied to a variety of models including regressions, latent variable models with complex structures, sequential models, etc. BNP models are on an infinite dimensional parameter space and the complexity of the models adapts to the data. One of the most popular BNP models is Dirichlet process (DP) mixtures. Being able to adapt the number of latent classes to the complexity of the data, DP mixtures are powerful in modeling empirical data. However, they also face technical challenges. One challenge is the estimation of the precision parameter in the DP mixture. In this study, we focus on the prior of precision parameter and investigate how it affects model convergence, parameter estimation, and computation time in BNP growth curve modeling.

Growth curve models are broadly used in longitudinal research (e.g., Meredith and Tisak, 1990; McArdle and Nesselroade, 2014). Many popular longitudinal models in social and behavioral sciences, such as multilevel models, some mixed-effects models, and linear hierarchical models, can be written as a form of growth curve models. In growth curve models, dependent variables are repeatedly measured and explained as a function of time and possible control variables. The mean function between the dependent variables and time is the mean growth. Random effects and measurement errors cause the individual growth trajectories to deviate from the mean growth curve. Traditional growth curve modeling is typically based on the normality assumption. That is, both the random effects and measurement errors are assumed to follow normal distributions. However, empirical data often violate the normality assumption (Micceri, 1989; Cain et al., 2017). Non-normal population distributions and data contamination are two common causes of non-normality. Although standard errors and test statistics have been corrected to reduce the adverse effect of distributional assumption violation (e.g., Chou et al., 1991; Curran et al., 1996), normal-distribution-based maximum likelihood estimation may still yield inefficient or inaccurate parameter estimates, and thus misleading statistical inferences (e.g., Yuan and Bentler, 2001; Maronna et al., 2006). Therefore, researchers have developed robust methods to obtain accurate parameter estimation and statistical inference.

The ideas of robust methods can be divided into two types. For the first type, the key idea is to downweight extreme cases. To do so, this type of robust methods assigns a weight to each subject in a dataset according to its distance from the center of the majority of the data (e.g., Pendergast and Broffitt, 1985; Singer and Sen, 1986; Silvapulle, 1992; Yuan and Bentler, 1998; Zhong and Yuan, 2010). For the second type, the key idea is to use non-normal distributions that are mathematically tractable while building the statistical model. For example, latent variables and/or measurement errors are assumed to follow a *t* or skew-*t* distribution (Tong and Zhang, 2012; Zhang, 2016) or a mixture of certain distributions (Muthén and Shedden, 1999; Lu and Zhang, 2014). While being useful, these methods have limitations under certain conditions. For example, the downweighting method does not perform well when latent variables contain extreme scores (see Zhong and Yuan, 2011). Using a *t* distribution or a mixture of normal distributions still imposes restrictions on the shape of the data distribution.

The aforementioned issues are automatically resolved by BNP methods. BNP modeling relies on a building block, DP, to handle the non-normality issue. DP is a distribution over probability measures that can be used to estimate unknown distributions. Consequently, the non-normality issue can be addressed by directly estimating the unknown random distributions of latent variables or measurement errors (i.e., obtaining the posteriors of the distributions).

The advantages of using BNP methods with DP priors have been discussed in the literature (e.g., Ghosal et al., 1999; MacEachern, 1999; Hjort, 2003; Müller and Mitra, 2004; Fahrmeir and Raach, 2007; Hjort et al., 2010). They do not constrain models to a specific parametric form which may limit the scope and type of statistical inferences in many situations, especially when data are not normally distributed. Thus, a typical motivation of using BNP methods is that one is unwilling to make somewhat arbitrary and unverified assumptions for latent variables or error distributions as in the parametric modeling. Meanwhile, BNP methods can provide full probability models for the data-generating process and lead to analytically tractable posterior distributions.

BNP methods have been applied to complex models. For example, Bush and MacEachern (1996), Kleinman and Ibrahim (1998), and Brown and Ibrahim (2003) used DP mixtures to handle non-normal random effects. Burr and Doss (2005) used a conditional DP to handle heterogeneous effect sizes in the context of meta-analysis. Ansari and Iyengar (2006) included Dirichlet components to build a semiparametric recurrent choice model. Dunson (2006) used dynamic mixtures of DP to estimate the varied distributions of a latent variable, which change non-parametrically across groups. Si and Reiter (2013), Si et al. (2015) used DP mixtures of multinomial distributions for categorical data with missing values. BNP approach has also been adapted to structural equation modeling to relax the normality assumption of the latent variables (e.g., Lee et al., 2008; Yang and Dunson, 2010). Tong and Zhang (2019) directly used a DP mixture to model non-normal data in growth curve modeling.

Although the application of BNP modeling has increased dramatically since the theoretical properties of BNP methods were better understood and their computational hurdles were removed (e.g., Neal, 2000), BNP modeling is still unfamiliar to the majority of researchers in social and behavioral sciences. Additionally, there are technical issues that have not yet been fully addressed (Sharif-Razavian and Zollmann, 2009). The convergence issue is one of such unanswered questions. Non-convergence can occur when BNP method is applied to complex models. Tong and Zhang (2019) found that non-convergence was largely caused by the precision parameter of the mixing DP. The precision parameter is a critical hyperparameter that governs the expected number of mixture components. When a non-informative prior was used for the precision parameter, non-convergence occurred or a longer computation time was observed (Tong and Zhang, 2019). Informative priors may help solve this issue. However, only a few studies have noticed and discussed the effect of the precision parameter in DP mixtures (e.g., West, 1992; Ohlssen et al., 2007; Jara et al., 2011). Ishwaran (2000) was among the few that studied the informative prior for the precision parameter. Ishwaran (2000) suggested to use the *Gamma*(2, 2) prior to encourage both small and large values of the precision parameter. In sum, despite its impact on the model convergence issue, no study has systematically investigated how the prior for the precision parameter should be specified.

Therefore, in this study, we evaluate and compare non-informative, weakly informative, accurate informative, and inaccurate informative priors for the precision parameter of DP mixtures. We study how these priors influence model convergence, model estimation, and computation time in BNP growth curve modeling. In the next section, we introduce BNP growth curve modeling. After providing the conditional posterior distribution of the precision parameter, we use a simulation study to assess the impact of four types of priors for the precision parameter. Recommendations are provided at the end of the article. We also provide a guideline about the implementation of BNP growth curve modeling using R (R Core Team, 2019) in the Appendix.

## 2. Bayesian Non-parametric Growth Curve Modeling

We now introduce a typical growth curve model and a BNP method based on this model. Consider a longitudinal dataset with *N* subjects and *T* measurement occasions. Let ${\text{y}}_{i}={({y}_{i1},...,{y}_{iT})}^{\prime}$ be a *T* × 1 random vector with *y*_{ij} being a measurement from individual *i* at time *j* (*i* = 1, . . . , *N*; *j* = 1, . . . , *T*). A growth curve model without covariates can be written as

where **Λ** is a *T* × *q* factor loading matrix that determines the growth curves, **b**_{i} is a *q* × 1 vector of random effects, and **e**_{i} is a vector of measurement errors. The vector of random effects **b**_{i} varies around its mean * β*. The residual vector

**u**

_{i}represents the deviation of

**b**

_{i}from

*. When*

**β**the model is reduced to a linear growth curve model with random intercept *L*_{i} and random slope *S*_{i}. The mean intercept and slope are denoted as *β*_{L} and *β*_{S}, respectively.

Traditionally, **e**_{i} and **u**_{i} are assumed to follow multivariate normal distributions with mean vectors of zero and covariance matrices **Φ** and **Ψ**, respectively, so **e**_{i} ~ *MN*_{T}(**0, Φ**) and **u**_{i} ~ *MN*_{q}(**0, Ψ**). Here, *MN* denotes a multivariate normal distribution and its subscript indicates its dimension. Measurement errors are often assumed to be uncorrelated with each other and have equal variances across time. Statistically, this simplification means the covariance matrix of measurement error **Φ** is reduced to $\Phi ={\sigma}_{e}^{2}\text{I}$ where ${\sigma}_{e}^{2}$ is a scale parameter. In linear growth curve models, ${\text{u}}_{i}={({u}_{Li},{u}_{Si})}^{\prime}$. Its covariance matrix is then $\text{\Psi}=cov({\text{u}}_{i})=\left(\begin{array}{cc}{\sigma}_{L}^{2}& {\text{\hspace{0.05em}}}{\sigma}_{LS}\\ {\sigma}_{LS}& {\text{\hspace{0.05em}}}{\sigma}_{S}^{2}\end{array}\right)$. Here, ${\sigma}_{L}^{2}$ and ${\sigma}_{S}^{2}$ represent the variances of the random intercept and slope across individuals, respectively, and *σ*_{LS} represents the covariance between the random intercept and slope.

BNP methods do not make arbitrary distributional assumptions as in the parametric modeling and thus are more flexible in handling non-normal data (e.g., Lee et al., 2008; Tong and Zhang, 2019). Unlike conventional non-parametric methods such as permutation tests, BNP methods use full probability models to describe the data-generating process and thus can derive posterior distributions for model parameters.

Within the BNP modeling scope, the parametric distributions of latent variables and measurement errors in traditional methods are replaced by unknown random distributions. To estimate these unknown distributions, DP is frequently used as the prior (Ferguson, 1973, 1974). Specifically, a random “sample” from a DP is a random distribution. Here, we denote it as *G*. A DP has two hyperparameters, *α* and *G*_{0}. The base distribution, *G*_{0}, represents the central tendency or “mean” distribution in the distribution space. The precision parameter, *α*, quantifies how far away realizations of *G* deviate from *G*_{0}. According to Ferguson (1973), DP is a conjugate prior that has two desirable properties: (1) a sufficiently large support, and (2) analytically manageable posterior distributions. Ferguson further derived the posterior of *G*, $DP(\stackrel{~}{\alpha},\stackrel{~}{{G}_{0}})$. Here, $\stackrel{~}{\alpha}=\alpha +N$ and

with *G*_{N} being the empirical distribution of the data. Notably, the posterior point estimate of *G*, $E(G|data)=\stackrel{~}{{G}_{0}}$, is a weighted average of the base distribution or prior mean *G*_{0} and the empirical distribution or data *G*_{N}. When *α* = 0, the posterior point estimate is reduced to the empirical distribution *G*_{N}, which is pure non-parametric. When *α* approaches to infinity, the posterior point estimate gradually approximates *G*_{0}, which is parametric. A common practice is to specify a gamma prior for *α*, which would yield a posterior estimate that is neither 0 nor infinity.

In BNP growth curve modeling, latent variables and/or measurement errors can be modeled non-parametrically. In this article, we focus on the distributional assumption of measurement errors. When the normality of measurement errors is suspected, we assume that **e**_{i} ~ *G*_{e} where *G*_{e} is an unknown random distribution that is determined by the data. In the BNP framework, DP is typically adopted to specify *G*_{e}. Because the distribution of **e**_{i} is continuous but DP is essentially discrete, a DP mixture (DPM) can be used to model the measurement errors such that

where *D* represents a predetermined multivariate distribution (e.g., multivariate normal, *t*, multinomial, etc.), and ${\mu}_{e}^{(k)}\text{}\mathrm{\text{and}}\text{}{\Phi}^{(k)},k=1,...,\infty $ are means and covariances of the multivariate distribution in the *k*th component with probability *p*_{k}. Theoretically, given an arbitrary distributional shape, there could be infinite number of mixture components as *k* goes to infinity. In practice, a finite number of mixture components often can describe a distribution well and the number of mixture components is determined by the DP precision parameter *α*. Smaller *α* yields a smaller number of mixture components. If *α* approaches infinity, there would be *N* mixture components, one associated with each subject. Namely, the precision parameter *α* is an important parameter that can determine the complexity of the model and how well the model fits the data, and thus may affect the convergence of the model. For the intraindividual measurement errors in the typical linear growth curve model, Tong and Zhang (2019) proposed that

That is, the unknown distribution *G*_{e} is approximated by a mixture of multivariate normal distributions where the mixing measure has a DP prior, *G*_{e} ~ *DPM*. The DP prior *DP*(*α*, *G*_{0}) can be obtained using the truncated stick-breaking construction (e.g., Sethuraman, 1994; Lunn et al., 2013). Specifically, $DP(\xb7)=\sum _{j=1}^{C}{p}_{j}{\delta}_{{z}_{j}}(\xb7),1\le C<\infty $, where *C* (1 ≤ *C* ≤ *N*, often set at a large number) is a possible maximum number of mixture components, δ_{zj}(·) denotes a point mass at *z*_{j} and *z*_{j} ~ *G*_{0} independently. The random weights *p*_{j} can be generated through the following procedure. With *q*_{1}, *q*_{2}, . . . , *q*_{C} ~ *Beta*(1, *α*), define

Then, *p*_{j} is obtained by

to satisfy that $\sum _{j=1}^{C}{p}_{j}=1$. In practice, the updating of **e**_{i} can proceed as in a typical DP mixture model and its distribution is an infinite mixture distribution^{1}.

In general, the distribution of **e**_{i} through the truncated stick-breaking construction is

where *D* represents a predetermined multivariate distribution, ${\mu}_{e}^{(j)}\text{}\mathrm{\text{and}}\text{}{\Phi}^{(j)},j=1,...,C$ are means and covariances of the multivariate distribution in the *j*th component, and *p*_{j} is obtained using Equation (1). Given that the mean of **e**_{i} is **0**, we constrain $\sum _{j=1}^{C}{p}_{j}{\mu}_{e}^{(j)}=\text{0}$. For simplicity, in this study, we follow Tong and Zhang (2019) and use multivariate normal distributions for the mixing components and constrain ${\mu}_{e}^{(j)}$ to be 0. We use inverse Wishart priors $p({\Phi}^{(j)})=IW({n}_{0},{W}_{0})$ for the covariance matrices of the mixture components, **Φ**^{(j)}, *j* = 1, . . . , *C*. Following Lunn et al. (2013, p. 294), we fix the shape parameter *n*_{0} at a specific number and assign an inverse Wishart prior to the scale matrix *W*_{0}. With such a specification, the measurement error for individual *i*, **e**_{i}, has a *p*_{j} probability of coming from the mixing component *MN*(**0, Φ**^{(j)}). The measurement errors for other individuals may also come from the same mixing component. Let *K* denotes the number of mixing components or *MN*(**0, Φ**^{(j)}) with *j* = 1, . . . , *C*. In other words, *K* is the number of latent classes for **e**_{i} and *K* can be smaller than *C*, *K* ≤ *C*. Within each class, **e**_{i}s come from the same distribution.

We would like to note that a similar approach to BNP modeling is finite mixture modeling (FMM). FMM estimates or equivalently approximates an unknown distribution using a mixture of known distributions. A key difference between FMM and BNP modeling is that the number of mixture components is treated as known in FMM, whereas this number is treated as unknown and is freely estimated in BNP modeling. As a result, when FMM is used to handle non-normality, additional analyses such as model comparison are needed to determine the unknown number of mixture components. BNP modeling therefore is believed to have the advantage of being more objective and data-driven, given that additional analyses such as model comparison that may be vulnerable to subjectivity are avoided.

Bayesian methods are applied to estimate BNP growth curve models. Bayesian methods are becoming increasingly popular in recent years because of their flexibility and powerfulness in estimating models with complex structures (e.g., Lee and Shi, 2000; Lee and Song, 2004; Zhang et al., 2007; Lee and Xia, 2008; Tong and Zhang, 2012; Serang et al., 2015). The key idea of Bayesian methods is to compute the posterior distributions for model parameters by combining the likelihood function and the priors. As introduced previously, * β*,

**Φ**, and

**Ψ**are the model parameters in traditional growth curve model. In a BNP growth curve model,

*and*

**β****Ψ**remain model parameters. In contrast, the measurement error covariance matrix

**Ψ**is not directly estimated. Instead, we obtain

**e**

_{i}based on which we can get

**Φ**. Another important parameter in BNP growth curve modeling is the precision parameter

*α*. Let

*p*(

*,*

**β****Ψ**,

*α*) be the joint prior distribution of model parameters, and let

*L*be the likelihood function. The joint posterior distribution of model parameters is

where $b={({b}_{1}^{\text{'}},\text{\hspace{0.17em}}...,{b}_{N}^{\text{'}})}^{\text{'}}.$. It is difficult to solve for this integral in practice. Instead, Markov chain Monte Carlo (MCMC) methods (e.g., Gibbs sampling; Robert and Casella, 2004) are often used to obtain parameter estimates and statistical inferences. Specifically, we first derive the conditional posterior distribution for each of the parameters. We then iteratively draw samples from the derived conditional posteriors to obtain empirical marginal distributions of the model parameters. Finally, statistical inferences are made based on the empirical marginal distributions (Geman and Geman, 1984).

## 3. Precision Parameter in BNP Models

The convergence issue in BNP growth curve modeling is likely related to the precision parameter (Tong and Zhang, 2019). Here, we provide a theoretical discussion on how the prior of the precision parameter can influence the number of latent classes for **e**_{i}.

The DP precision parameter *α* is the key to govern the expected number of latent classes. It directly determines the distribution of *K*, the number of latent classes of **e**_{i}. With a larger *K*, measurement errors of different individuals are more likely to have different distributions. West (1992) found that *K* asymptotically follows a Poisson distribution

where *γ* is Euler's constant. Several percentiles of the distribution of *K* are given in Table 1. As shown in the table, *K* increases as *α* and *N* increases.

**Table 1**. Different percentiles (5, 50, 95%) of the distribution of the number of clusters *K*, given different values of precision parameter *α*, and sample size *N*.

As discussed previously, a gamma prior *Gamma*(*a*_{1}, *a*_{2}) is often used for the hyperparameter *α*. Given such a prior, West (1992) derived the posterior of *α* as a mixture of two gamma densities

where *x* is an augmented variable *x*|· ~ *Beta*(*α* + 1, *N*) and the weights π_{x} is defined by ${\pi}_{x}/(1-{\pi}_{x})=\frac{{a}_{1}+K-1}{N({a}_{2}-logx)}$. Although West (1992) also provided an approximation to the posterior of *α*, *p*(*α*|·) ≈ *Gamma*(*a*_{1} + *K* − 1, *a*_{2} + *γ* + *logN*), how good the approximation was has not been investigated.

A non-informative prior for *α* seems to be reasonable, especially when the information about number of latent classes are not available. However, a non-informative prior may cause non-convergence of Markov chains. Therefore, it is worth evaluating different priors for the precision parameter.

## 4. A Simulation Study

We now present a simulation study to evaluate the influence of the prior for the precision parameter in BNP growth curve modeling when data are normally distributed and contain outliers^{2}. The linear growth curve model in the previous section is used. Measurement errors are modeled non-parametrically to address the non-normality. Based on the results of previous studies, the number of times points (*T*), the covariance between the random intercept and slope (*σ*_{LS}), and the measurement error variance (${\sigma}_{e}^{2}$) have trivial effects on the performance of BNP growth curve modeling (e.g., Tong and Zhang, 2019). Therefore, we only consider a set of values for these parameters in this study. We follow the empirical data analysis results in Tong and Zhang (2019) to select the population parameter values: the fixed effects are fixed at $\beta ={({\beta}_{L},{\beta}_{S})}^{\prime}={(6.2,0.3)}^{\prime}$; the number of measurement occasion is *T* = 4; measurement error variance ${\sigma}_{e}^{2}=0.5$; variances of the random intercept and slope are 1 and 0.1, respectively, and the covariance between the random intercept and slope *σ*_{LS} = 0.

Three potentially influential factors are manipulated in the simulation study, including sample size, data distribution, and precision parameter prior. First, two sample sizes are considered, *N* =200 or 600, representing small and large sample sizes. Second, data are either normal or containing outliers. When generating outliers, three proportions of outliers are considered, *r%* =5, 10, or 20%. To generate outliers, we randomly select *r%* observations at each measurement occasion and replace them by extreme values. The extreme values are generated from 10 different distributions with a large mean of *L*_{i} + *S*_{i}(*j* − 1) +* m σ*

_{e}where

*m*≥ 5 is generated from a truncated Poisson distribution, and a variance of ${\sigma}_{e}^{2}$ which is the same as that of the normal data. As a result, the true distribution of the data is a mixture of 11 distributions. Outliers generated in this way conform to the definition of outliers (Yuan and Zhong, 2008; Tong and Zhang, 2017). See Supplementary Figures 1, 2 to aid the understanding of the shape of generated normal data and data with outliers. Third, four priors for the precision parameter are investigated (see Figure 1): a diffuse prior

*Gamma*(0.001, 0.001), a weakly informative prior

*Gamma*(2, 2) suggested by Ishwaran (2000), an accurate informative prior

*Gamma*(100, 100), and an inaccurate informative prior

*Gamma*(10, 100). Gamma(10,100) is an inaccurate informative prior because its mean is 0.1 and its variance is as small as 0.001. According to Table 1, the resulting number of latent classes ranges from 1 to 3, whereas the true number of mixed underlying distribution is 11. For all the other model parameters, conventional non-informative priors such as those in Zhang et al. (2013) are used. Specifically, fixed effects

*β*have non-informative diffuse priors

*N*(0, 10

^{6}). The covariance matrix of the random intercept and slope

**Ψ**has an inverse-Wishart prior with an identity scale matrix and degrees of freedom being 2.

In each simulation condition, 500 datasets are generated. BNP growth curve modeling is applied for each dataset using JAGS with the rjags package in software R (Plummer, 2017; R Core Team, 2019). The total length of Markov chains is set at 50,000 and the first half of iterations is the burn-in period^{3}. We assess how different priors affect model convergence rate, parameter estimation, and computation time.

Geweke tests (Geweke, 1991) are used to perform the convergence diagnostics. After the burn-in period, if parameter values are sampled from the stationary distribution of the chain, the means of the first and last parts of the Markov chain (by default the first 10% and the last 50%) should be equal and Geweke's statistic asymptotically follows a standard normal distribution. A Markov chain converges when the Geweke's statistic is between –1.96 and 1.96. If none of the convergence diagnostics (i.e., Geweke tests) for all model parameters suggest non-convergence, the model is said to have converged. In each simulation condition, the convergence rate is defined as the proportion of converged models out of the total 500 generated replications.

For the assessment of model estimation, we obtain the parameter estimate bias, average standard error (ASE), empirical standard error (ESE), mean squared error (MSE), and coverage probability (CP) of the 95% highest posterior density (HPD) credible intervals for each parameter based on converged simulation replications^{4}.

In addition, the estimation time (in seconds) is recorded for each replication. The average estimation time (AET) is the average of the estimation time for all the converged replications.

All program code and detailed results for the simulation study are available on our GitHub site: https://github.com/CynthiaXinTong/PrecisionParPrior_BNP_GCM.

### 4.1. Main Results

Figure 2 shows the convergence rate for BNP growth curve modeling with different precision parameter priors when sample size is 200. This figure clearly shows that outliers harm model convergence. Note that the convergence rate for data with 5% outliers is the lowest. This may be because a small proportion of outliers (e.g., 5%) creates a steep and high-curvature region for the Markov chain to enter and thus more difficult to converge. As the outlier proportion increases, the curvature becomes smoother so the convergence rate is higher. Among the four studied priors, the non-informative prior for the precision parameter always leads to the lowest convergence rate, i.e., less than 30% across all the simulation conditions. Informative priors substantially increase the model convergence rate. Specifically, the convergence rate doubles when we switch from the non-informative prior to the weakly informative prior suggested by Ishwaran (2000) in the condition with normal data. The incremental amount is about 30% of the original convergence rate in the conditions with outliers. Both accurate informative priors and inaccurate informative priors lead to higher convergence rates. The importance of using informative priors is more salient when data are not normal. Note that inaccurate informative priors yield slightly higher convergence rates than accurate informative priors because the variance of the inaccurate prior is lower and thus its precision is higher. When *N* = 600, model convergence results for BNP growth curve models follow the same pattern, and thus are not reported here.

For converged replications, we evaluate the impact of precision parameter priors on parameter estimation and computation time. Results for *N* = 200 are summarized in Tables 2–5. The relative performance of the four priors in conditions with a larger sample size (*N* = 600) has a similar pattern. Detailed results for *N* = 600 are available in the Supplementary Document.

**Table 2**. Model estimation for BNP growth curve modeling with different precision parameter priors when data are normal and *N* = 200.

**Table 3**. Model estimation for BNP growth curve modeling with different precision parameter priors when data contain 5% of outliers and *N* = 200.

**Table 4**. Model estimation for BNP growth curve modeling with different precision parameter priors when data contain 10% of outliers and *N* = 200.

**Table 5**. Model estimation for BNP growth curve modeling with different precision parameter priors when data contain 20% of outliers and *N* = 200.

From Tables 2–5, we obtain the following findings. First, the estimates of growth curve parameters (${\beta}_{L},{\beta}_{S},{\sigma}_{L}^{2},{\sigma}_{S}^{2},{\sigma}_{LS},{\sigma}_{e}^{2}$) are not affected by different priors. Estimation bias, standard errors, MSE, and coverage probability of the 95% HPD credible interval across different precision parameter prior conditions are very close to each other, respectively. Note that when outliers exist (see Tables 3–5), the true population parameter value of the measurement error variance ${\sigma}_{e}^{2}$ is unknown. So, bias, MSE, and CP for this parameter cannot be calculated.

Second, the estimation of the hyperparameter *α* is greatly affected by different priors. When the non-informative prior is used, the estimated *α* can be very large (e.g., 28.284 in Table 3) or small (e.g., 0.019 in Table 5), associating with a large standard error. When *Gamma*(2, 2) or *Gamma*(100, 100) is used, estimated *α* is almost always close to 1. When *Gamma*(10, 100) is used, estimated *α* is around 0.1. Different *α* values indicate a different total number of classes *K*. In general, a larger *α* value may yield a larger number of latent classes. Since the estimated *α* has a large standard error when the non-informative diffuse prior is used, the corresponding estimated *K* can also be large or small. For the weakly informative and accurate informative priors, the estimated number of latent classes ranges from 4 to 6 for different data conditions, whereas for the inaccurate informative prior, the estimated number of latent classes is about 2 or 3. It is interesting to see that although distinctively different hyperparameter estimates are obtained leading to different number of latent classes, the estimated growth curve parameters are essentially similar. This is because although outliers are generated from 10 different distributions, the 10 different distributions are not separated far apart. With a low class separation, one distribution may be enough to describe several outliers generated from different distributions. Thus, even the inaccurate informative prior can yield a precision parameter that is adequate to model the measurement errors.

Third, BNP growth curve modeling with the inaccurate informative prior *Gamma*(10, 100) requires the shortest computation time. This is because the inaccurate informative prior here has the smallest variance and thus is most “informative” among the four priors.

Fourth, outliers affect the performance of BNP growth curve modeling. When data contain a large proportion of outliers (e.g., 20%), estimation bias for the average of random intercepts *β*_{L} and variance of random intercepts ${\sigma}_{L}^{2}$ are much larger than those when outlier proportion is low. In addition, outliers influence computation time. It is worth mentioning that it is most time consuming when the outlier proportion is 5%. A possible reason is that a small proportion of outliers creates a steep and high-curvature region for Markov chains to enter and thus takes longer time to converge. With more outliers, the curvature is smoother so the computation is faster.

## 5. Discussion

Restricting to a parametric probability family can delude investigators and falsely make an illusion of posterior certainty (Müller and Mitra, 2004). On the contrary, BNP methods are adaptive and powerful to discover complex patterns in real data. Although BNP growth curve modeling has been proposed, the effect of the precision parameter was not fully studied. In this article, we have conducted a simulation study to investigate how different types of precision parameter priors impact the convergence rate, model estimation, and computation time in BNP growth curve modeling. We found that the non-informative prior suffered from the lowest convergence rates while the inaccurate informative prior with the smallest prior variance yielded the highest convergence rates and the fastest computations. Furthermore, we found that the estimation of growth curve parameters was not affected by the prior of the precision parameter. Based on these results, we recommend to use informative priors with high precision in practice.

We would like to note that although it seems counterintuitive that the inaccurate informative prior for the precision parameter performed the best, such findings have been observed in the literature. For example, Finch and Miller (2019) found that slightly informative priors can be advantageous in small samples even when these priors are incorrect. Depaoli (2013) showed that growth mixture model estimations obtained with inaccurate priors were still more accurate than maximum likelihood or Bayesian estimation with diffuse priors. Zitzmann et al. (2020) explicitly discussed this issue for small samples. Our simulation results also supported the argument that the amount of information in the prior can be more important than the accuracy of the prior under certain circumstances.

We also want to point out that the estimation bias was relatively large in our simulation study, when compared to that in previous studies (Tong and Zhang, 2019). This is because we consider much higher outlier proportions. When the outlier proportion is low (i.e., 5%), parameter estimates are very close to the true population values. As the outlier proportion increases, the bias increases. One possible way to improve the performance of BNP growth curve modeling when the outlier proportion is high is to use a non-normal base distribution. In our simulation study, for simplicity, we used normal distributions with zero mean as the mixing components of BNP modeling. This cannot handle asymmetric non-normal distributions, which may partly explain the less satisfactory performance of BNP modeling in the conditions with high outlier proportions. But BNP methods in general are very flexible. A non-normal base distribution may overcome this limitation. While future studies may continue along this path, we want to emphasize that BNP modeling as in our study still outperforms traditional growth curve modeling and is recommended to use in general when data are suspected to be non-normal (Tong and Zhang, 2019) no matter the non-normality is caused by non-normal population distribution or data contamination.

The convergence rate of BNP growth curve modeling was found to be higher in previous studies, i.e., close to one (Tong and Zhang, 2019). We would like to note that the difference is likely due to the list of parameters counted during convergence assessment. In Tong and Zhang (2019), the convergence rate was computed only for growth curve parameters. When only growth curve parameters are considered, non-convergence rarely occurred in our study. The major problem is the precision parameter. As shown in the simulation study, non-convergence frequently arose for this parameter (detailed Geweke tests results for each parameter are available on our GitHub site: https://github.com/CynthiaXinTong/PrecisionParPrior_BNP_GCM). Another possible reason why convergence rates were relatively low (below 70%) in our simulation is that Geweke tests often yield lower rates of convergence than other diagnostic methods (e.g., Jang and Cohen, 2020). However, as pointed out in Jang and Cohen, the pattern of convergence rates for model comparison was similar for different diagnostic tests. Namely, our conclusions about which precision parameter priors to use in BNP growth curve modeling will not be affected by the diagnostic tests. We further discuss the use of Geweke tests in the next paragraph. Notably, although the non-convergence for the precision parameter seemed not to impact parameter estimates for the growth curve parameters, such issue may mislead model fit assessment. Although model assessment and model comparison methods have been proposed for various models, samples of different sizes, and data structures (e.g., Celeux et al., 2006), their performance in BNP analysis has not been studied. Therefore, future studies on how different precision parameter priors affect model fit assessment are encouraged.

In our study, model convergence diagnostics were conducted using Geweke tests. Although Geweke tests are commonly used in the Bayesian literature, it is impossible to say with certainty that a finite sample from an MCMC algorithm is representative of an underlying stationary distribution and a combination of strategies aiming at evaluating and accelerating MCMC sampler convergence is recommended (Cowles and Carlin, 1996). For our simulation study, Geweke tests were relatively easy to systematically implement. In empirical studies, we recommend using multiple strategies (e.g., trace plots, multiple chains) to check model convergence. In addition, since Zitzmann and Hecht (2019) pointed out that it is possible that the approximation of the Bayesian estimates is still not optimal even when a chain converges, we recommend substantive researchers conducting sensitivity analysis and evaluating how the length of the Markov chains affects the model estimation results.

Our study echoed the previous literature in that using informative priors may help reduce computation time in Bayesian modeling. We would like to note that there are other approaches that can be used to further increase the computation efficiency. For example, Berger et al. (2020) and Daniels and Kass (1999) proposed shrinkage priors, and Hecht et al. (2020) proposed a model reformulation approach in which the sample covariance matrix was modeled instead of individual observations. This latter approach has been applied to the Bayesian continuous-time model (Hecht and Zitzmann, 2020) as well as the Bayesian STARTS model (Ludtke et al., 2018). Future research on BNP growth curve modeling could incorporate this approach and other potentially efficient approaches to reduce computation time.

The employment of BNP growth curve modeling is a field still in its early stage. New DP variants and generalizations are being proposed every year to cater to specific applications. BNP modeling was only used to handle the non-normality in intraindividual measurement errors in our study. The similar strategy can be used for random effects, such as random intercepts and slopes. Also, although we worked with balanced data, BNP growth curve modeling should be able to handle unbalanced data (e.g., individually varying time points). However, as implied by previous studies (Tong, 2014), the convergence issue may be more challenging, thereby awaiting future studies.

## Data Availability Statement

The raw data supporting the conclusions of this article are available by running the data generation R code on our GitHub site: https://github.com/CynthiaXinTong/PrecisionParPrior_BNP_GCM.

## Author Contributions

All authors listed have made a substantial, direct and intellectual contribution to the work, and approved it for publication.

## Funding

This paper is based upon work supported by the National Science Foundation under grant no. SES-1951038.

## Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

## Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2021.624588/full#supplementary-material

## Footnotes

1. ^In practice, infinite-dimension means finite but unbounded dimension.

2. ^Note that non-normal data may be caused by non-normal population distributions or data contaminations. We work with outliers in this simulation study because BNP methods are essentially infinite mixture modeling procedures. Generating and dealing with outliers from multiple different distributions are more manageable as we easily know the true number of underlying classes. It is worth verifying the conclusions of this paper for non-normal population distributions in the future.

3. ^Multiple lengths of Markov chains were tested before the current setting was selected. The convergence results with 50,000 iterations were about the same as those for longer chains.

4. ^ASE is the mean estimated standard error across replications. ESE is the standard deviation of the parameter estimates from all replications. MSE is computed as squared bias plus squared ESE. Posterior credible interval, also called credible interval, is the Bayesian counterpart of the frequentist confidence interval. A HPD interval is essentially the narrowest interval on a posterior that covers a given proportion of the probable posterior values.

## References

Ansari, A., and Iyengar, R. (2006). Semiparametric Thurstonian models for recurrent choices: A Bayesian analysis. *Psychometrika* 71, 631–657. doi: 10.1007/s11336-006-1233-5

Berger, J. O., Sun, D., and Song, C. (2020). Bayesian analysis of the covariance matrix of a multivariate normal distribution with a new class of priors. *Ann. Stat.* 48, 2381–2403. doi: 10.1214/19-AOS1891

Brown, E. R., and Ibrahim, J. G. (2003). A Bayesian semiparametric joint hierarchical model for longitudinal and survival data. *Biometrics* 59, 221–228. doi: 10.1111/1541-0420.00028

Burr, D., and Doss, H. (2005). A Bayesian semiparametric model for random-effects meta-analysis. *J. Am. Stat. Assoc.* 100, 242–251. doi: 10.1198/016214504000001024

Bush, C. A., and MacEachern, S. N. (1996). A semiparametric Bayesian model for randomised block designs. *Biometrika* 83, 275–285. doi: 10.1093/biomet/83.2.275

Cain, M. K., Zhang, Z., and Yuan, K.-H. (2017). Univariate and multivariate skewness and kurtosis for measuring nonnormality: prevalence, influence and estimation. *Behav. Res. Methods* 49, 1716–1735. doi: 10.3758/s13428-016-0814-1

Celeux, G., Forbes, F., Robert, C. P., and Titterington, D. M. (2006). Deviance information criteria for missing data models. *Bayesian Anal.* 1, 651–673. doi: 10.1214/06-ba122

Chou, C.-P., Bentler, P. M., and Satorra, A. (1991). Scaled test statistics and robust standard errors for non-normal data in covariance structure analysis: a monte carlo study. *Br. J. Math. Stat. Psychol.* 44, 347–357. doi: 10.1111/j.2044-8317.1991.tb00966.x

Cowles, M. K., and Carlin, B. P. (1996). Markov chain monte carlo convergence diagnostics: a comparative review. *J. Am. Stat. Assoc.* 91, 883–904.

Curran, P. J., West, S. G., and Finch, J. F. (1996). The robustness of test statistics to nonnormality and specification error in confirmatory factor analysis. *Psychol. Methods* 1, 16–29. doi: 10.1037/1082-989x.1.1.16

Daniels, M. J., and Kass, R. E. (1999). Nonconjugate Bayesian estimation of covariance matrices and its use in hierarchical models. *J. Am. Stat. Assoc.* 94, 1254–1263. doi: 10.2307/2669939

Depaoli, S. (2013). Mixture class recovery in GMM under varying degrees of class separation: frequentist versus Bayesian estimation. *Psychol. Methods* 18, 186–219. doi: 10.1037/a0031609

Dunson, D. B. (2006). Bayesian dynamic modeling of latent trait distributions. *Biostatistics* 7, 551–568. doi: 10.1093/biostatistics/kxj025

Fahrmeir, L., and Raach, A. (2007). A Bayesian semiparametric latent variable model for mixed responses. *Psychometrika* 72, 327–346. doi: 10.1007/s11336-007-9010-7

Ferguson, T. (1973). A Bayesian analysis of some nonparametric problems. *Ann. Stat.* 1, 209–230. doi: 10.1214/aos/1176342360

Ferguson, T. (1974). Prior distributions on spaces of probability measures. *Ann. Stat.* 2, 615–629. doi: 10.1214/aos/1176342752

Finch, W. H., and Miller, J. E. (2019). The use of incorrect informative priors in the estimation of MIMIC model parameters with small sample sizes. *Struct. Equat. Model.* 26, 497–508. doi: 10.1080/10705511.2018.1553111

Geman, S., and Geman, D. (1984). Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. *IEEE Trans. Pattern Anal. Mach. Intell.* 6, 721–741. doi: 10.1016/b978-0-08-051581-6.50057-x

Gershman, S. J., and Blei, D. M. (2012). A tutorial on Bayesian nonparametric models. *J. Math. Psychol.* 56, 1–12. doi: 10.1016/j.jmp.2011.08.004

Geweke, J. (1991). “Evaluating the accuracy of sampling-based approaches to calculating posterior moments,” in *Bayesian Statistics 4*, eds J. M. Bernado, J. O. Berger, A. P. Dawid, and A. F. M. Smith (Oxford: Clarendon Press), 169–193. doi: 10.21034/sr.148

Ghosal, S., Ghosh, J., and Ramamoorthi, R. (1999). Posterior consistency of Dirichlet mixtures in density estimation. *Ann. Stat.* 27, 143–158. doi: 10.1214/aos/1018031105

Hecht, M., Gische, C., Vogel, D., and Zitzmann, S. (2020). Integrating out nuisance parameters for computationally more efficient Bayesian estimation - An illustration and tutorial. *Struct. Equat. Model.* 27, 483–493. doi: 10.1080/10705511.2019.1647432

Hecht, M., and Zitzmann, S. (2020). A computationally more efficient Bayesian approach for estimating continuous-time models. *Struct. Equat. Model.* 27, 829–840. doi: 10.1080/10705511.2020.1719107

Hjort, N. L. (2003). "Topics in nonparametric Bayesian statistics," in *Highly Structured Stochastic Systems*, eds P. Green, N. L. Hjort, and S. Richardson (Oxford: Oxford University Press), 455–487.

Hjort, N. L., Holmes, C., Müller, P., and Walker, S. G. (2010). *Bayesian Nonparametrics*. Cambridge: Cambridge University Press. doi: 10.1093/acprof:oso/9780199695607.003.0013

Ishwaran, H. (2000). “Inference for the random effects in Bayesian generalized linear mixed models,” in *ASA Proceedings of the Bayesian Statistical Science Section*, ed A. S. Association (London), 1–10.

Jang, Y., and Cohen, A. S. (2020). The impact of Markov chain covergence on estimation of mixture IRT model parameters. *Educ. Psychol. Meas.* 80, 975–994. doi: 10.1177/0013164419898228

Jara, A., Hanson, T., Quintana, F., Müller, P., and Rosner, G. (2011). DP-package: Bayesian semi and nonparametric modeling in R. *J. Stat. Softw.* 40, 1–30. doi: 10.18637/jss.v040.i05

Kleinman, K. P., and Ibrahim, J. G. (1998). A semiparametric Bayesian approach to the random effects model. *Biometrics* 54, 921–938. doi: 10.2307/2533846

Lee, S. Y., Lu, B., and Song, X. Y. (2008). Semiparametric Bayesian analysis of structural equation models with fixed covariates. *Stat. Med.* 27, 2341–2360. doi: 10.1002/sim.3098

Lee, S. Y., and Shi, J. Q. (2000). Joint Bayesian analysis of factor scores and structural parameters in the factor analysis model. *Ann. Inst. Stat. Math.* 52, 722–736. doi: 10.1023/a:1017529427433

Lee, S. Y., and Song, X. Y. (2004). Bayesian model comparison of nonlinear structural equation models with missing continuous and ordinal categorical data. *Br. J. Math. Stat. Psychol.* 57, 131–150. doi: 10.1348/000711004849204

Lee, S. Y., and Xia, Y. M. (2008). A robust bayesian approach for structural equation models with missing data. *Psychometrika* 73, 343–364. doi: 10.1007/s11336-008-9060-5

Lu, Z., and Zhang, Z. (2014). Robust growth mixture models with non-ignorable missingness: models, estimation, selection, and application. *Comput. Stat. Data Anal.* 71, 220–240. doi: 10.1016/j.csda.2013.07.036

Ludtke, O., Robitzsch, A., and Wagner, J. (2018). More stable estimation of the STARTS model: a Bayesian approach using Markov Chain Monte Carlo techniques. *Psychol. Methods* 23, 570–593. doi: 10.1037/met0000155

Lunn, D., Jackson, C., Best, N., Thomas, A., and Spiegelhalter, D. (2013). *The BUGS Book: A Practical Introduction to Bayesian Analysis*. Boca Raton, FL: CRC Press.

MacEachern, S. (1999). “Dependent nonparametric processes,” in *ASA Proceedings of the Section on Bayesian Statistical Science*, ed A. S. Association (Alexandria, VA).

Maronna, R. A., Martin, R. D., and Yohai, V. J. (2006). *Robust Statistics: Theory and Methods*. New York, NY: John Wiley & Sons, Inc. doi: 10.1002/0470010940

McArdle, J. J., and Nesselroade, J. R. (2014). *Longitudinal Data Analysis Using Structural Equation Models*. Washington, DC: American Psychological Association. doi: 10.1037/14440-000

Meredith, W., and Tisak, J. (1990). Latent curve analysis. *Psychometrika* 55, 107–122. doi: 10.1007/bf02294746

Micceri, T. (1989). The unicorn, the normal curve, and other improbable creatures. *Psychol. Bull.* 105, 156–166. doi: 10.1037/0033-2909.105.1.156

Müller, P., and Mitra, R. (2004). Bayesian nonparametric inference - why and how. *Bayesian Anal.* 1, 1–33. doi: 10.1214/13-ba811

Muthén, B., and Shedden, K. (1999). Finite mixture modeling with mixture outcomes using the EM algorithm. *Biometrics* 55, 463–469. doi: 10.1111/j.0006-341X.1999.00463.x

Neal, R. M. (2000). Markov chain sampling methods for Dirichlet process mixture models. *J. Comput. Graph. Stat.* 9, 249–265. doi: 10.2307/1390653

Ohlssen, D. I., Sharples, L. D., and Spiegelhalter, D. (2007). Flexible random-effects models using Bayesian semi-parametric models: applications to institutional comparisons. *Stat. Med.* 26, 2088–2112. doi: 10.1002/sim.2666

Pendergast, J. F., and Broffitt, J. D. (1985). Robust estimation in growth curve models. *Commun. Stat. Theor. Methods* 14, 1919–1939. doi: 10.1080/03610928508829021

R Core Team (2019). *R: A Language and Environment for Statistical Computing*. Vienna: R Foundation for Statistical Computing.

Robert, C. P., and Casella, G. (2004). *Monte Carlo Statistical Methods*. New York, NY: Springer. doi: 10.1007/978-1-4757-4145-2

Serang, S., Zhang, Z., Helm, J., Steele, J. S., and Grimm, K. J. (2015). Evaluation of a Bayesian approach to estimating nonlinear mixed-effects mixture models. *Struct. Equat. Model.* 22, 202–215. doi: 10.1080/10705511.2014.937322

Sharif-Razavian, N., and Zollmann, A. (2009). *An Overview of Nonparametric Bayesian Models and Applications to Natural Language Processing*. Available online at: http://www.cs.cmu.edu/?simzollmann/publications/nonparametric.pdf

Si, Y., and Reiter, J. P. (2013). Nonparametric Bayesian multiple imputation for incomplete categorical variables in large-scale assessment surveys. *J. Educ. Behav. Stat.* 38, 499–521. doi: 10.3102/1076998613480394

Si, Y., Reiter, J. P., and Hillygus, D. S. (2015). Semi-parametric selection models for potentially non-ignorable attrition in panel studies with refreshment samples. *Polit. Anal.* 23, 92–112. doi: 10.1093/pan/mpu009

Silvapulle, M. J. (1992). On M-methods in growth curve analysis with asymmetric errors. *J. Stat. Plan. Inference* 32, 303–309. doi: 10.1016/0378-3758(92)90013-i

Singer, J. M., and Sen, P. K. (1986). M-methods in growth curve analysis. *J. Stat. Plan. Inference* 13, 251–261. doi: 10.1016/0378-3758(86)90137-0

Tong, X. (2014). *Robust semiparametric bayesian methods in growth curve modeling* (Unpublished doctoral dissertation). University of Notre Dame, Notre Dame, IN, United States.

Tong, X., and Zhang, Z. (2012). Diagnostics of robust growth curve modeling using Student's t distribution. *Multivariate Behav. Res.* 47, 493–518. doi: 10.1080/00273171.2012.692614

Tong, X., and Zhang, Z. (2017). Outlying observation diagnostics in growth curve modeling. *Multivariate Behav. Res.* 52, 768–788. doi: 10.1080/00273171.2017.1374824

Tong, X., and Zhang, Z. (2019). Robust Bayesian approaches in growth curve modeling: using Student's t distributions versus a semiparametric method. *Struct. Equat. Model.* 27, 544–560. doi: 10.1080/10705511.2019.1683014

West, M. (1992). *Hyperparameter Estimation in Dirichlet Process Mixture Models*. Technical Report 92-A03, Duke University, ISDS.

Yang, M., and Dunson, D. B. (2010). Bayesian semiparametric structural equation models with latent variables. *Psychometrika* 75, 675–693. doi: 10.1007/s11336-010-9174-4

Yuan, K.-H., and Bentler, P. M. (1998). Structural equation modeling with robust covariances. *Sociol. Methodol.* 28, 363–396. doi: 10.1111/0081-1750.00052

Yuan, K.-H., and Bentler, P. M. (2001). Effect of outliers on estimators and tests in covariance structure analysis. *Br. J. Math. Stat. Psychol.* 54, 161–175. doi: 10.1348/000711001159366

Yuan, K.-H., and Zhong, X. (2008). Outliers, high-leverage observations and influential cases in factor analysis: Minimizing their effect using robust procedures. *Sociol. Methodol.* 38, 329–368. doi: 10.1111/j.1467-9531.2008.00198.x

Zhang, Z. (2016). Modeling error distributions of growth curve models through Bayesian methods. *Behav. Res. Methods* 48, 427–444. doi: 10.3758/s13428-015-0589-9

Zhang, Z., Hamagami, F., Wang, L., Grimm, K. J., and Nesselroade, J. R. (2007). Bayesian analysis of longitudinal data using growth curve models. *Int. J. Behav. Dev.* 31, 374–383. doi: 10.1177/0165025407077764

Zhang, Z., Lai, K., Lu, Z., and Tong, X. (2013). Bayesian inference and application of robust growth curve models using Student's t distribution. *Struct. Equat. Model.* 20, 47–78. doi: 10.1080/10705511.2013.742382

Zhong, X., and Yuan, K.-H. (2010). “Weights,” in *Encyclopedia of Research Design*, ed N. J. Salkind (Thousand Oaks, CA: Sage), 1617–1620. doi: 10.4135/9781412961288

Zhong, X., and Yuan, K.-H. (2011). Bias and efficiency in structural equation modeling: maximum likelihood versus robust methods. *Multivariate Behav. Res.* 46, 229–265. doi: 10.1080/00273171.2011.558736

Zitzmann, S., and Hecht, M. (2019). Going beyond convergence in Bayesian estimation: why precision matters too and how to assess it. *Struct. Equat. Model.* 26, 646–661. doi: 10.1080/10705511.2018.1545232

Keywords: non-parametric Bayesian, robust method, growth curve modeling, Dirichlet process mixture, prior, precision parameter

Citation: Tong X and Ke Z (2021) Assessing the Impact of Precision Parameter Prior in Bayesian Non-parametric Growth Curve Modeling. *Front. Psychol.* 12:624588. doi: 10.3389/fpsyg.2021.624588

Received: 31 October 2020; Accepted: 09 February 2021;

Published: 31 March 2021.

Edited by:

Haiyan Liu, University of California, Merced, United StatesReviewed by:

Sonja Winter, University of California, Merced, United StatesSteffen Zitzmann, University of Tübingen, Germany

Kenneth Wilcox, University of Notre Dame, United States

Copyright © 2021 Tong and Ke. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zijun Ke, keziyun@mail.sysu.edu.cn