Consistent Partial Least Squares Path Modeling via Regularization

Jung, Sunho; Park, JaeHong

doi:10.3389/fpsyg.2018.00174

ORIGINAL RESEARCH article

Front. Psychol., 19 February 2018

Sec. Quantitative Psychology and Measurement

Volume 9 - 2018 | https://doi.org/10.3389/fpsyg.2018.00174

Consistent Partial Least Squares Path Modeling via Regularization

Sunho Jung

JaeHong Park^*

Kyung Hee University, Seoul, South Korea

Partial least squares (PLS) path modeling is a component-based structural equation modeling that has been adopted in social and psychological research due to its data-analytic capability and flexibility. A recent methodological advance is consistent PLS (PLSc), designed to produce consistent estimates of path coefficients in structural models involving common factors. In practice, however, PLSc may frequently encounter multicollinearity in part because it takes a strategy of estimating path coefficients based on consistent correlations among independent latent variables. PLSc has yet no remedy for this multicollinearity problem, which can cause loss of statistical power and accuracy in parameter estimation. Thus, a ridge type of regularization is incorporated into PLSc, creating a new technique called regularized PLSc. A comprehensive simulation study is conducted to evaluate the performance of regularized PLSc as compared to its non-regularized counterpart in terms of power and accuracy. The results show that our regularized PLSc is recommended for use when serious multicollinearity is present.

Introduction

Structural equation modeling (SEM) has become a common tool in social and psychological research, including business research fields such as marketing and information systems. In no small part, this is due to its ability to provide a flexible measurement and testing framework for investigating interrelationships among observed and latent variables (Kaplan, 2009). Covariance structure analysis (CSA) (Jöreskog, 1973) and partial least squares (PLS) path modeling (Wold, 1975) represent two technically distinctive approaches to SEM (Fornell and Bookstein, 1982; Reinartz et al., 2009). Recently, a new consistent PLS estimator (PLSc) has been introduced as another alternative approach that bridges the gap between CSA and PLS (Dijkstra, 2010; Dijkstra and Henseler, 2015a). This technique rests on the idea that when PLS represents latent variables through factors, correcting for a measurement error is required to obtain consistent PLS estimates.

With the introduction of PLSc, some interest exists in evaluating its relative performance, when compared to CSA and PLS. A recent simulation study by Dijkstra and Henseler (2015b) showed that PLSc is recommended for use over traditional PLS, if the common factor model holds true for the theoretical construct. This finding is expected, given that PLSc explicitly takes the reliability of construct scores into account, and therefore, corrects the structural paths between the latent variables for attenuation, thereby enabling consistent estimates to be produced. The ability of PLSc to perform well with common factors is an important result, because SEM is frequently conducted with reflectively measured constructs.

However, Dijkstra and Henseler (2015b) clearly pointed out the potential weakness of PLSc in their simulation study, as it exhibited relatively lower statistical power and larger standard deviations under multicollinearity, as compared to other techniques. This tendency was particularly evident and problematic with small sample sizes. In practice, a high level of correlations among the latent variables is known to be quite common in the applied research (Grewal et al., 2004). In addition, because PLSc employs inter-construct correlations corrected for attenuation as input data for parameter estimations, it would likely encounter multicollinearity problems due to the possibly high correlation between independent variables. The major problem with multicollinearity is that the least squares estimators of the coefficients can produce inflated standard errors, often leading to the loss of statistical power.

Despite potential multicollinearity problems, no attempt has been made to provide methods for mitigating the problems in PLSc. Therefore, in this paper, we propose a new approach, a ridge-type of regularization, to solve multicollinearity issues in PLSc. Ridge regression (Hoerl and Kennard, 1970) is one of the possible remedies for multicollinearity in the statistical learning literature, by intentionally trading a small amount of bias for greater efficiency. Derived as an alternative to the ordinary least squares (OLS) regression estimator in the PLSc procedure, we propose a ridge least squares estimator by adding a small positive constant, called the regularization parameter, to the estimation in a straightforward manner.

The major purpose of this paper is to propose a regularized model of PLSc which handles multicollinearity problems effectively. By doing so, we believe that we can contribute to the related literature. As some researchers have already acknowledged that multicollinearity in PLSc can arouse problems in the estimation, we believe it is necessary for other researchers to consider our new approach, a ridge-type of regularization, to solve the multicollinearity issues in PLSc. The second goal of this paper is to present a comprehensive evaluation of the proposed method, relative to its non-regularized counterpart, under a variety of experimentally manipulated conditions using a Monte Carlo simulation study. With a comprehensive Monte Carlo simulation, our proposed regularized PLSc is better in dealing with a severe multicollinearity problem with common factors than ordinary PLSc.

In the next section, we discuss the previous PLSc and then propose our theoretical concept of regularized PLSc in a structural equation model. We then suggest a simulation study to confirm the newly proposed model's performance, as compared to the previous method.

Consistent Partial Least Squares via Regularization

A Consistent Reliability Coefficient for PLS

Traditional PLS approximates common factors with weighted composites of observed variables. Since the composites serve as proxies for the reflective constructs, PLS construct scores are inevitably contaminated with measurement errors. Measurement errors attenuate the relationship between any two constructs, resulting in biased and inconsistent estimates of structural relationships (e.g., Bollen, 1989; Cassel et al., 2000). Correcting for measurement error attenuation would be worthwhile, as a structural equation model typically contains one or more common factors.

To achieve this purpose, Dijkstra and Henseler (2015b) have recently proposed a consistent reliability coefficient term ρ_A, based on the estimation of the indicator weights under Mode A, suitable for reflective indicators. This plays a pivotal role in mitigating PLS' consistency problems in SEM with reflective measurement models. PLSc employs the coefficient of reliability to correct the latent variable correlations for attenuation, thereby adjusting the estimates to make them consistent.

The reliability measure for PLS' construct scores is determined as the squared correlation between composite scores for each latent variable and the corresponding true scores. A consistent estimator of ρ_A can be obtained so as to minimize the sums of squares of the discrepancies between the off-diagonal elements of S and $\hat{Σ}$ , in which S is the sample covariance matrix of a latent variable's indicators and $\hat{Σ}$ is the implied covariance matrix based on a underlying common factor model. The coefficient of reliability can be consistently estimated using the indicator weights as follows (Dijkstra and Henseler, 2015b):

\begin{array}{l} {\hat{ρ}}_{A} = {(\hat{w}' \hat{w})}^{2} \times \frac{\hat{w}' (S - diag (S)) \hat{w}}{\hat{w}' (\hat{w} \hat{w}' - diag (\hat{w} \hat{w}')) \hat{w}}, & (1) \end{array}

where $\hat{w}$ is the estimated weight vector for a block of indicators for the latent variable. In particular, the second part of this equation simply represents a scaling factor corresponding to the constant of proportionality between the indicator weights and the factor loadings. It plays a role in rescaling the former to the latter to adjust for an overestimation.

Regularized Consistent PLS

PLS involves two distinct models: a structural model and a measurement model. As the structural model of PLS includes a series of linear regression models for each endogenous latent variable, we begin by describing the path coefficient estimation procedures for the PLSc. The estimation procedure comprises three main steps: (1) estimate the iteratively updated indicator weights to obtain the latent variable correlations; (2) correct these correlations for attenuation using the consistent reliability estimates; and (3) perform the OLS regression to estimate the path coefficients based on the consistent construct correlations.

Step 1: The first step is to attempt to create latent variable proxies as linear composites of the associated observed indicators, which requires the estimation of indicator weights. This stage involves an iterative algorithm for the estimation of the weights. Accordingly, each latent variable explains as much variance as possible, with adjacent latent variables that are connected to the same latent variable. This step produces the indicator weights and correlations between the latent variable scores as inputs for the next step.

Step 2: Due to the presence of measurement errors, proxy correlations typically tend to underestimate the true factor correlations. As correlations among the proxies are mainly used for the estimation of the path coefficient in PLSc, a conventional attenuation correction factor can be applicable (e.g., Muchinsky, 1996). Specifically, for every pair of composite scores, a consistent construct correlation may be expressed in terms of the original proxy correlations and the two reliabilities obtained in Equation (1). That is, it is calculated by the ratio of the correlation between the construct scores to the square root of their respective reliabilities. Consequently, the correlations between the proxies associated with large measurement errors may be given greater weight than the correlations associated with smaller measurement errors.

Step 3: By correcting for the attenuation, due to the unreliability, as in the previous step, we are able to determine the underlying latent relationships without the distraction of measurement errors. The third step estimates the path coefficients in the structural model by means of an OLS regression. In other words, the PLSc estimator is obtained by regressing each endogenous latent variable on its causally related latent variables as follows:

\begin{array}{l} \hat{β} = R_{X}^{- 1} r_{X y}, & (2) \end{array}

where $\hat{β}$ indicates a vector of path coefficients, R_X is the consistent correlation matrix of the predictor variables of the structural equation, and r_Xy is the vector of consistent correlations between the outcome variable and the predictor variables. This illustrates that the PLSc estimator stems from the OLS regression and consistent correlation estimation.

Several variance-based SEM techniques exist, but PLSc seems to be the preferred choice of researchers for evaluating the structural model with common factors. However, although a consistent reliability coefficient helps to establish consistent estimations in the model involving factors, rather ironically, the correction for attenuation is likely to lead to a multicollinearity problem, which can give rise to spurious results. Multicollinearity is considered a major application problem in SEM, because it reduces statistical power and increases the variances for the estimated coefficients, making them unstable (Grewal et al., 2004). The more variance the coefficients have, the more difficult it is to interpret them.

To address this issue, a regularized extension of the PLSc is proposed that integrates a ridge-type of regularization into PLSc. Estimating the path parameters through regularization is straightforward. A ridge least squares estimator for $\hat{β}$ is given by:

\begin{array}{l} \hat{β} (λ) = {(R_{X} + λ I)}^{- 1} r_{X y}, & (3) \end{array}

where: λ denotes the regularization parameter (or tuning parameter). When λ = 0, the ridge estimates are equivalent to those obtained using ordinary PLSc (Equation 2). As with PLSc, the ridge estimator can be used as a tool for recursive models that only include unidirectional effects. The proposed regularized PLSc initially entails finding an appropriate value of the regularization parameter. It then estimates the path parameters using Equation (3), for which an optimal value of λ is included in the analysis.

A significant number of studies emphasize the practical utility of regularization in many multivariate data analysis techniques (Hastie et al., 2001; Tenenhaus and Tenenhaus, 2011; Srivastava et al., 2014). In general, the regularization parameter plays a crucial role in controlling the degree of regularization imposed on the parameters. It has the effect of shrinking the least squares estimates toward zero, thereby enabling more accurate solutions to be produced. A regularized estimator intentionally trades bias for reduction in variance. As such, it will certainly be biased (albeit slightly), but will still exhibit a much smaller variability. Therefore, the ridge estimates of parameters tend to be, on average, closer to the true population values than their least squares counterparts (see Groß, 2003, pp. 118–120). In particular, this positive effect of regularization is more pronounced under multicollinearity and/or small sample sizes (Takane and Jung, 2008).

The proposed method utilizes the K-fold cross-validation method to select the value of λ, which is typically a small positive constant. In the cross validation, the entire dataset is randomly divided into K subsets (typically, either 5 or 10). One of the K subsets is set aside as a validation sample, while the remaining K-1 subsets are used as a training sample for fitting a single structural equation model for each endogenous construct from which the estimates of the path coefficients are obtained. These resultant estimates are then applied to the validation sample to calculate the prediction error of the structural model. This procedure is repeated k times, changing a single group set aside systematically. The cross-validation estimate of the prediction error is accumulated over all K validation samples. The cross validation procedure also systematically varies the values of λ and the value that yields the lowest prediction error is finally chosen. When K is equal to N (sample size in the original data), the cross validation procedure is also known as the leaving-one-out cross validation, which appears to work reasonably well with small sample sizes (e.g., Molinaro et al., 2005).

As in its ordinary counterpart, the proposed regularized PLSc uses the bootstrap method (Efron, 1982) to estimate the standard errors of the parameter estimates. More specifically, their standard errors are calculated non-parametrically based on 5,000 bootstrap samples (Hair et al., 2011). Furthermore, the bootstrap standard errors can be used to test whether a structural parameter is statistically different from zero, based on a confidence interval approach (Aguirre-Urreta and Rönkkö, forthcoming). For instance, if the 95% confidence interval of a parameter does not include zero, then the observed effect may be considered statistically significant.

A Simulation Study

The primary goal of the present simulation study is to compare the performance of the proposed regularized PLSc (hereafter referred to as RegPLSc) with that of the non-regularized PLSc. A secondary goal is to evaluate the impact of a comprehensive set of design factors and their interactions on the performance of these two estimation methods. This study builds on earlier work (Grewal et al., 2004), examining the role of multicollinearity and measurement errors on parameter recovery and inference errors in SEM. All computations for this study were carried out using MATLAB R2009a (The MathWorks, Inc.).

Design Factors

The Monte Carlo simulation involved manipulating four experimental conditions: multicollinearity (ϕ), measurement error (θ), coefficient of determination (R²), and sample size (N). These design factors are essentially the same as those that Grewal et al. (2004) considered in their simulations within the framework of covariance-based SEM. Prior simulation studies have shown them to be meaningful conditions in evaluating the performance of various SEM techniques (e.g., Hwang et al., 2010; Lu et al., 2011). In particular, we employed R² as a design factor as a high R² has the potential to improve the quality of parameter estimation in the presence of multicollinearity (e.g., Mason and Perreault, 1991; Grewal et al., 2004).

The levels of the design factors should be chosen, such that they would represent the range of values encountered in substantive studies using SEM. The selected ranges for the first three factors (ϕ, θ, R²) are basically the same as those considered in Grewal et al. (2004). First, the level of multicollinearity was varied by systematically altering the correlation between ξ₁ and ξ₂. The moderate condition (ϕ = 0.4) was included, plus strong (ϕ = 0.6) and extreme (ϕ = 0.8) correlation levels. The amount of random measurement error was then varied at two levels. Specifically, the composite reliability of each latent variable was set at 0.6 or 0.8 that can be considered as weak and strong, respectively. For the coefficient of determination, the value of R² for each latent endogenous variable was set to 0.25 or 0.50, corresponding to the medium and large effect sizes, respectively, according to Fritz et al. (2012). Finally, the value of N was set to 30, 60, 120, or 200. These sample sizes are identical to the sizes Lu et al. (2011) considered in their simulations. Various approaches for SEM exist, but PLS path modeling has typically been recommended for use in the case of small samples (e.g., Henseler et al., 2009). Prior studies have found that PLS provides a better quality of solution in small samples (e.g., Chin and Newsted, 1999). Small sample sizes may be the rule, rather than the exception, in an empirical application of PLS (e.g., Haenlein and Kaplan, 2004).

We specified a structural equation model which consisted of six latent variables and four reflective indicators per latent variable (Figure 1). We adapted this model from Grewal et al. (2004), in which all unstandardized path coefficients were originally fixed at 0.28. Variance-based SEM, such as partial least squares, typically provides standardized parameter estimates and their standard errors. Thus, we calculated different sets of standardized parameter values based on varying levels of ϕ and R² (Table 1).

FIGURE 1

Figure 1. The specified model for the simulation study. Dashed line represents a path whose true value is zero.

TABLE 1

Table 1. The standardized parameters.

Data Generation

The full factorial design for the simulation leads to a total of 48 factor combinations (4 Sample Sizes × 3 Multicollinearity × 2 Measurement Error × 2 Coefficients of Determination). For each of the 48 different combinations, individual-level multivariate normal data were drawn from N(0, Σ), where Σ is the implied population covariance matrix derived from a CSA formulation using the unstandardized parameter values. During the data generation process, in some rare situations, a consistent correlation matrix is found not to be positive definite. The least squares estimator (Equation 2) fails with such a matrix. Any simulated sample was removed that failed to produce a consistent non-singular correlation matrix from further consideration to compare the two methods in an impartial manner. The first 500 replications with proper solutions were maintained for each of the combinations of the design factors.

Simulation Results

In this section, we report the ability of RegPLSc and PLSc to recover the true parameter values for the path coefficients, as well as conduct a statistical inference. The practical benefit of PLS, in empirical applications, may depend on its ability to determine the significance of a parameter estimate from the statistical power perspective. Although achieving accurate statistical inferences enables researchers to perform reliable hypothesis tests, they also put equal emphasis on the magnitude of the structural parameter to interpret the substantive significance of a result or for predictive purposes. Accordingly, evaluating the ability to recover the true parameters is important for applied researchers who would consider using PLS techniques.

Recovery of Path Parameters

To assess the recovery of the parameters under the two estimation procedures, we calculated the mean absolute differences (MAD) between the parameter values and their estimates as follows:

\begin{array}{l} MAD = \frac{\sum_{j = 1}^{P} | {\hat{θ}}_{j} - θ_{j} |}{P}, & (4) \end{array}

where: ${\hat{θ}}_{j}$ and θ_j denote the parameter estimates and population parameter values, respectively, and P is the number of parameters (e.g., Mason and Perreault, 1991).

For MAD, we conducted the full-factorial five-way mixed ANOVA. A single within-subjects method factor is the estimation method (M, where M = RegPLSc or PLSc). The between-subject data factors are the above-described four experimental conditions of the study. Table 2 presents the results about the capability of the two estimation methods. As illustrated in Table 2, most of the main and interaction effects were statistically significant, due to the large number of observations, in addition to fitting all possible interactions in the ANOVA. For this reason, it is crucial to also examine the effect size (e.g., Paxton et al., 2001). Following the accepted practice for identifying a substantial effect, we will focus on the main and interaction effects, having a partial eta-squared (η²) greater than 2%, which deserves further examination (see Reinartz et al., 2009). According to Cohen's (1988) guidelines regarding effect sizes, a value of 0.02 represents a small effect, 0.06 a medium effect, and 0.14 or greater a large effect.

TABLE 2

Table 2. The results of ANOVA test for the mean absolute differences (MAD) of parameter estimates.

The analysis method (η² = 0.24) had a sufficiently large main effect. This suggests meaningful differences in the average MAD between the two methods (RegPLSc = 0.14 and PLSc = 0.19). The ANOVA for MAD in the parameter recovery revealed that all two-way interaction effects were statistically significant and achieved effect sizes larger than 6%, reflecting a medium effect: Method × Multicollinearity (η² = 0.10), Method × Measurement Error (η² = 0.13), Method × R² (η² = 0.06), and Method × Sample Size (η² = 0.15). Two additional interactions, Method × Measurement Error × Sample Size (η² = 0.03) and Method × R² × Sample Size (η² = 0.02) were selected for further examination, because they were theoretically and practically related to multicollinearity problems and had an effect size above the cut-off point. We first discuss the two way interaction of Method × Multicollinearity. The remaining two-way interactions are then described below in the context of the three-way interactions that include them.

Figure 2 displays the average values of MAD for each method under the three levels of multicollinearity. Overall, we could confirm that RegPLSc is notably superior across different degrees of multicollinearity. As the level of multicollinearity increases, the superiority of RegPLSc over PLSc becomes larger. A closer look at the performance of RegPLSc reveals that the method appears to be an effective tool to deal with multicollinearity in structural equation models. The average MAD value for RegPLSc under extreme conditions remains similar to that under moderate conditions. In contrast, for PLSc, such a stable tendency in the values of MAD cannot be observed, implying that PLSc is highly susceptible to multicollinearity problems.

FIGURE 2

Figure 2. Two-way interactions of method × multicollinearity with MAD as dependent variable. Dashed line = PLSc, solid line = regularized PLSc.

The three-way interaction of Method × Measurement Error × Sample Size is presented in Figure 3. This three-way interaction includes the two-way interaction of Method × Sample Size, which can be seen in each of the two blocks included in the figure. In general, the average MAD values for both methods tended to decrease as the sample sizes increased. We find two intriguing characteristics, depending on the level of measurement error. First, when reliability is weak, RegPLSc yields uniformly lower MAD than PLSc across all sample sizes. Second, in contrast, when measures are highly reliable, the differences in the values of MAD of the estimates become negligible, except for the smallest sample size. This implies that the adverse effects of multicollinearity may be largely offset by the measurement properties, such as reliability. The similar pattern was replicated in a simulation study by Grewal et al. (2004).

FIGURE 3

Figure 3. Three-way interactions of method × reliability × sample size with MAD as dependent variable. Dashed line = PLSc, solid line = regularized PLSc.

Another three-way interaction (Figure 4) is produced by the interaction of R² with the Method and Sample Size. Our findings show that R² is another meaningful factor that can mitigate the damaging effects of multicollinearity on the estimation accuracy. In general, PLSc and RegPLSc perform similarly for a large R² of 0.50, while the difference becomes more markedly with a medium R² of 0.25. Consistent with the findings of Mason and Perreault (1991), the adverse effects of multicollinearity can be markedly attenuated with a greater portion of explained variance in the dependent variable. Overall, reliability and R² are likely to have an important impact on the good recovery of parameters in the presence of multicollinearity.

FIGURE 4

Figure 4. Three-way interactions of method × R² × sample size with MAD as dependent variable. Dashed line = PLSc, solid line = regularized PLSc.

Statistical Inference

The above ANOVA test results suggest that all the experimental conditions (multicollinearity, sample size, reliability, and R²) are meaningful in differentiating the performance of the two methods in parameter recovery. To gain an additional understanding of the performance of these techniques, the statistical power was further investigated under those four experimental conditions. We estimated the standard errors of path coefficients estimates on the basis of the bootstrap method with 200 bootstrap samples (e.g., Reinartz et al., 2009).

Table 3 shows the empirically obtained statistical power of each estimation method for each combination of the experimental conditions. The numbers in the table indicate the proportion of simulation trials for which a 95% confidence interval for a path coefficient rejected the null hypothesis that path coefficient equals zero.

TABLE 3

Table 3. Statistical power of PLSc and regularized PLSc.

The results suggest that under multicollinearity, RegPLSc has an advantage over PLSc with respect to detecting statistical significance, given that the hypothesized effect actually exists in the population. This pattern of results replicates the findings for path coefficient estimation accuracy. RegPLSc can maintain very similar levels of statistical power, regardless of the degrees of multicollinearity, whereas PLSc is highly hampered by severe multicollinearity. For RegPLSc, it is apparent that the statistical power varies as a function of the sample size, reliability, and R². More specifically, the minimum reasonable size of the sample (N = 30) in this particular study can lead to unacceptably low levels of statistical power. However, even under extreme multicollinearity, a small sample size of N = 60 is adequate for satisfactory statistical power (close to or above 80%), if R² is large and reliability is high. With the larger sample sizes, RegPLSc is able to achieve appropriate statistical power (above 80%) in almost all cases, if reflective measures are highly reliable. This highlights the importance of reliable measurements in the presence of multicollinearity. Conversely, when multicollinearity is extreme, PLSc still fails to achieve a sufficient statistical power for γ₁₁ and γ₁₂, which are substantially affected by high correlations between ξ₁ and ξ₂, even if reliability is high, R² is large, and the sample size is relatively large (N = 200).

Although researchers often pay more attention to the control of Type II error for theory testing in the SEM literature, they also need to consider whether an estimation method shows good control of Type I error rate (e.g., α = 0.05). Variance-based SEM techniques sometimes tends to favor less parsimonious models as they might fail to control Type I error rate (e.g., Henseler, 2012; Dijkstra and Henseler, 2015b). In general, a ridge type estimator produces more stable estimates of parameters, for which we have to pay with bias, making them prone to inflated Type I errors (Erickson, 1981). It is therefore important to evaluate the ability of RegPLSc to control the Type I error rate under multicollinearity. For the effect γ₂₂ = 0, PLSc maintained an overall Type I error rate of 5%. This result is in agreement with simulation results already obtained by Dijkstra and Henseler (2015b). Although RegPLSc seems to maintain marginally acceptable levels of Type I error (average = 0.085, minimum = 0.02, maximum = 0.164), Table 3 suggests that it can have inflated Type I error rates, even in relatively large samples, in the case of severe multicollinearity. PLSc adequately controls Type I error under all conditions, whereas RegPLSc provides greater power. If prior research and theory are sufficient to hypothesize structural model relationships, then we recommend using RegPLSc for theory testing.

Conclusions

A recently developed PLSc is regarded as a viable alternative to traditional PLS if the common factor model holds true. However, in practice, PLSc may suffer from multicollinearity. In this paper, PLSc was combined with ridge-type regularization in order to deal with potential multicollinearity problems. The ridge least squares estimates of the path coefficients can be found by adding the regularization parameter into the OLS estimation. The optimal value of the parameter may be chosen through cross-validation. Our overall conclusion is that the proposed regularized PLSc is successful while dealing with a severe multicollinearity problem in structural equation models with common factors.

A comprehensive Monte Carlo study was conducted which systematically compared the relative performance of the regularized PLSc with non-regularized PLSc in the presence of multicollinearity. In so doing, it provides a greater understanding of the capability of these two estimation methods in terms of parameter recovery and inference errors. The primary goal of this section is to briefly discuss the implications of the simulation study and provide guidelines for choosing between the two methods.

The following summarizes the major findings for each performance measure.

1. Mean absolute differences (MAD): both methods behave similarly in terms of MAD under moderate multicollinearity. If multicollinearity is from strong to extreme, the regularized PLSc generally recovers the path coefficients much better than non-regularized PLSc. The superior parameter recovery of the regularized PLSc over its non-regularized counterpart is found in most sample sizes considered, particularly when reliability is weak or when R² is moderate. When the sample size is very small (N = 30), the regularized PLSc has smaller MAD than the ordinary PLSc, regardless of the levels of reliability and R².

2. Power: as long as multicollinearity is around 0.4, reliability is high, and the sample size is more than 100, researchers should not be overly concerned about the estimation accuracy and statistical power of PLSc. However, if a higher level of multicollinearity is present in the data, the regularized PLSc should be the preferred choice of researchers. Under high or extreme multicollinearity, it has the adequate statistical power, even with relatively small sample sizes, as long as the reliability is high.

These findings have important implications for researchers who use PLS path modeling to inform substantive hypotheses. First, if researchers are ensured that no serious multicollinearity is present, there may be little reason to choose the regularized PLSc over the non-regularized PLSc, since PLSc generally resulted in similarly accurate parameter estimates and reliable statistical inference. However, this is true only when a measure is reasonably reliable. Otherwise, our results suggest that the regularized PLSc should be the method of choice. Second, when assessing structural models under conditions of multicollinearity, the regularized PLSc is highly recommended for use over non-regularized PLSc in all situations involving sample size, reliability, and R².

Despite these important contributions, the present study has a few limitations. First, this study was designed to generate synthetic data within a continuous variable framework. Covariance structural models are often fitted to the data measured on ordinal categorical scales.

Thus, it might be interesting to investigate the relative performance of ordinal PLSc (Schuberth et al., 2018) vs. its regularized extension with the sample matrix of ordinal-scale variables. More methodological work is needed on how to accommodate ordinal data within the framework of regularized PLSc. Second, as is the case with all Monte Carlo simulation studies, the relative performance of each method is conditioned on the specific levels chosen for the experimental conditions. Although the current simulation took into account important experimental conditions frequently used in Monte Carlo simulation studies within the framework of SEM, it is necessary to contemplate a wider range of models and conditions for more careful investigations of the relative performance of the two approaches in future research.

Author Contributions

SJ contributed to conducing all research activities including technical development, empirical analyses, and manuscript writing; JP contributed to technical development and manuscript writing.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

This work was supported by the Ministry of Education of the Republic of Korea and the National Research Foundation of Korea (NRF-2014S1A5B8060940).

References

Aguirre-Ureta, M. I., and Rönkkö, M. (forthcoming). Statistical inference with PLSc using bootstrap confidence intervals. MIS Quart. Available online at: https://misq.org/forthcoming/?SID=kjm9t850mj6ah2v00hgutnc6l4

Google Scholar

Bollen, K. A. (1989). Structural Equations with Latent Variables. New York, NY: Wiley.

Cassel, C. M., Hackl, P., and Westlund, A. H. (2000). On measurement of intangible assets: a study of robustness of partial least squares. Total Qual. Manag. 11, 897–907. doi: 10.1080/09544120050135443

CrossRef Full Text | Google Scholar

Chin, W. W., and Newsted, P. R. (1999). “Structural equation modeling analysis with small samples using partial least squares,” in Statistical Strategies for Small Sample Research, ed R. Hoyle (Beverly Hills, CA: Sage Publications), 307–341.

Cohen, J. (1988). Statistical Power Analysis for the Behavioral Sciences, 2nd Edn. Hillsdale, NJ: Lawrence Earlbaum Associates.

Dijkstra, T. K. (2010). “Latent variables and indices: Herman Wold' s basic design and partial least squares,” in Handbook of Partial Least Squares: Concepts, Methods and Applications in Marketing and Related Fields, eds V. E. Vinzi, W. W. Chin, J. Henseler, and H. Wang (Berlin: Springer), 23–46.

Google Scholar

Dijkstra, T. K., and Henseler, J. (2015a). Consistent and asymptotically normal PLS estimators for linear structural equations. Comput. Stat. Data Anal. 81, 10–23. doi: 10.1016/j.csda.2014.07.008

CrossRef Full Text | Google Scholar

Dijkstra, T. K., and Henseler, J. (2015b). Consistent partial least squares path modeling. MIS Q. 39, 297–316. doi: 10.25300/MISQ/2015/39.2.02

CrossRef Full Text | Google Scholar

Efron, B. (1982). The Jackknife, the Bootstrap and Other Resampling Plans. Philadelphia, PA: SIAM.

Google Scholar

Erickson, G. M. (1981). Using ridge regression to estimate directly lagged effects in marketing. J. Am. Stat. Assoc. 76, 766–773. doi: 10.1080/01621459.1981.10477719

CrossRef Full Text | Google Scholar

Fornell, C., and Bookstein, F. (1982). Two structural equation models: LISREL and PLS applied to consumer exit-voice theory. J. Market. Res. 19, 440–452. doi: 10.2307/3151718

CrossRef Full Text | Google Scholar

Fritz, C. O., Morris, P. E., and Richler, J. J. (2012). Effect size estimates: current use, calculations, and interpretation. J. Exp. Psychol. 141, 2–18. doi: 10.1037/a0024338

PubMed Abstract | CrossRef Full Text | Google Scholar

Grewal, R., Cote, J. A., and Baumgartner, H. (2004). Multicollinearity and measurement error in structural equation models: implications for theory testing. Market. Sci. 23, 519–529. doi: 10.1287/mksc.1040.0070

CrossRef Full Text | Google Scholar

Groß, J. (2003). Linear Regression. Berlin: Springer.

Haenlein, M., and Kaplan, A. M. (2004). A beginner's guide to partial least squares analysis. Unders. Stat. 3, 283–297. doi: 10.1207/s15328031us0304_4

CrossRef Full Text | Google Scholar

Hair, J. F., Ringle, C. M., and Sarstedt, M. (2011). PLS-SEM: indeed a silver bullet. J. Market. Theory Pract. 19, 139–151. doi: 10.2753/MTP1069-6679190202

CrossRef Full Text | Google Scholar

Hastie, T., Tibshirani, R., and Friedman, J. (2001). The Elements of Statistical Learning; Data Mining, Inference, and Prediction. New York, NY: Springer-Verlag.

Henseler, J. (2012). Why generalized structured component analysis is not universally preferable to structural equation modeling. J. Acad. Market. Sci. 40, 402–413. doi: 10.1007/s11747-011-0298-6

CrossRef Full Text | Google Scholar

Henseler, J., Ringle, C. M., and Sinkovics, R. R. (2009). “The use of partial least squares path modeling in international marketing,” in Advances in International Marketing, Vol. 20, eds R. R. Sinkovics and P. N. Ghauri (Bingley: Emerald), 277–320.

Google Scholar

Hoerl, A. F., and Kennard, R. W. (1970). Ridge regression: biased estimation for nonorthogonal problems. Technometrics 12, 55–67. doi: 10.1080/00401706.1970.10488634

CrossRef Full Text | Google Scholar

Hwang, H., Malhotra, N. K., Kim, Y., Marc, A., Tomiuk, M. A., and Hong, S. (2010). A comparative study on parameter recovery of three approaches to structural equation modeling. J. Market. Res. 37, 699–712. doi: 10.1509/jmkr.47.4.699

CrossRef Full Text | Google Scholar

Jöreskog, K. G. (1973). “A general method for estimating a linear structural equation system,” in Structural Equation Models in the Social Sciences, eds A. S. Goldberger and O. D. Duncan (New York, NY: Academic Press), 85–112.

Kaplan, D. (2009). Structural Equation Modeling: Foundations and Extensions, 2nd Edn. Newbury Park, CA: Sage.

Google Scholar

Lu, I. R. R., Kwan, E., Thomas, D. R., and Cedzynski, M. (2011). Two new methods for estimating structural equation models: an illustration and a comparison with two established methods. Int. J. Res. Market. 28, 258–268. doi: 10.1016/j.ijresmar.2011.03.006

CrossRef Full Text | Google Scholar

Mason, C. H., and Perreault, W. D. (1991). Collinearity, power, and interpretation of multiple regression analysis. J. Market. Res. 28, 268–280. doi: 10.2307/3172863

CrossRef Full Text | Google Scholar

Molinaro, A. M., Simon, R., and Pfeiffer, R. M. (2005). Prediction error estimation: a comparison of resampling methods. Bioinformatics 21, 3301–3307. doi: 10.1093/bioinformatics/bti499

PubMed Abstract | CrossRef Full Text | Google Scholar

Muchinsky, P. M. (1996). The correction for attenuation. Educ. Psychol. Meas. 56, 63–75. doi: 10.1177/0013164496056001004

CrossRef Full Text | Google Scholar

Paxton, P., Curran, P. J., Bollen, K., Kirby, J. B., and Chen, F. (2001). Monte Carlo experiments: design and implementation. Struct. Equat. Model. 8, 287–312. doi: 10.1207/S15328007SEM0802_7

CrossRef Full Text | Google Scholar

Reinartz, W. J., Haenlein, M., and Henseler, J. (2009). An empirical comparison of the efficacy of covariance-based and variance-based SEM. Int. J. Res. Market. 26, 332–344. doi: 10.1016/j.ijresmar.2009.08.001

CrossRef Full Text | Google Scholar

Schuberth, F., Henseler, J., and Dijkstra, T. K. (2018). Partial least squares path modeling using ordinal categorical indicators. Qual. Quant. 52, 9–35. doi: 10.1007/s11135-016-0401-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., and Salakhutdinov, R. (2014). Dropout: a simple way to prevent neural networks from overfitting. J. Machine Learn. Res. 15, 1929–1958. http://jmlr.org/papers/v15/srivastava14a.html

Google Scholar

Takane, Y., and Jung, S. (2008). Regularized partial and/or constrained redundancy analysis. Psychometrika 73, 671–690. doi: 10.1007/s11336-008-9067-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Tenenhaus, A., and Tenenhaus, M. (2011). Regularized generalized canonical correlation analysis. Psychometrika 76, 257–284. doi: 10.1007/s11336-011-9206-8

CrossRef Full Text | Google Scholar

Wold, H. (1975). “Path models with latent variables: the NIPALS approach,” in Quantitative Sociology: International Perspectives on Mathematical Statistical Model Building, eds H. M. Blalock, A. Aganbegian, F. M. Borodkin, R. Boudon, and V. Capecchi (New York, NY: Academic Press), 307–357.

Google Scholar

Keywords: consistent partial least squares, structural equation modeling, ridge-type regularization, multicollinearity, Monte Carlo simulation

Citation: Jung S and Park J (2018) Consistent Partial Least Squares Path Modeling via Regularization. Front. Psychol. 9:174. doi: 10.3389/fpsyg.2018.00174

Received: 02 December 2017; Accepted: 01 February 2018;
Published: 19 February 2018.

Edited by:

Maicon Rodrigues Albuquerque, Universidade Federal de Minas Gerais, Brazil

Reviewed by:

Jörg Henseler, University of Twente, Netherlands
James Gaskin, Brigham Young University, United States

Copyright © 2018 Jung and Park. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: JaeHong Park, amFlaHBAa2h1LmFjLmty

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.