- Department of Clinical Neuroscience, Karolinska Institutet (KI), Stockholm, Sweden
Background: Based on findings from analyses with cross-lagged panel models, Yang and Lei suggested reciprocal causal effects between rumination and sleep problems. However, it is well known that findings from cross-lagged panel models may be spurious.
Method: We simulated data to resemble the data used by Yang and Lei. We used triangulation and fitted complementary models to the simulated data.
Results: We found contradicting increasing and decreasing effects of initial rumination on subsequent change in sleep problems and vice versa.
Conclusion: The divergent findings indicated that it is premature to assume causal effects between rumination and sleep problems and the suggestions by Yang and Lei in this regard can be challenged. It is important for researchers to be aware that correlations, including adjusted cross-lagged effects, do not prove causality in order not to overinterpret findings, something that appears to have happened to Yang and Lei. We recommend researchers to triangulate by fitting complementary models to their data in order to evaluate if analyzed data could be used to support contradicting conclusions, in which case the data should not be used to support any of those conclusions.
1 Introduction
Yang and Lei (2025) analyzed longitudinal data (two waves of measurement in 2019 and 2021, respectively) on rumination, i.e., repetitive thinking about causes and consequences of negative events, and sleep problems in a sample of university students in China (N = 373, mean age = 18.7 years (SD = 0.8 years), 77% females) with cross-lagged panel models. Yang and Lei (2025) reported statistically significant positive cross-lagged effects of initial rumination on subsequent sleep problems when adjusting for initial sleep problems and vice versa. Yang and Lei (2025) interpreted these findings in causal terms (e.g., in the title of their article).
However, it is well established that adjusted cross-lagged effects may be spurious due to correlations with residuals and regression to the mean (Campbell and Kenny, 1999; Castro-Schilo and Grimm, 2018; Sorjonen et al., 2019). For example, there appears to be a positive correlation between self-reported rumination and sleep problems (Yang and Lei, 2025), which could be due to a confounding impact by a third variable, e.g., general positivity. Therefore, we should expect that among individuals with the same initial score on sleep problems, those with a higher rumination score have received a lower score on sleep problems compared with their true sleep problems, i.e., a more negative residual, while those with a lower rumination score have received a higher sleep problems score compared with their true score, i.e., a more positive residual. However, residuals tend to regress towards a mean value of zero between measurements. Consequently, we should expect a more positive, but spurious, change in the sleep problems score to a subsequent measurement among those with a higher initial rumination score compared with those with the same initial sleep problems score but with a lower initial rumination score.
For the sake of argument and in agreement with Yang and Lei (2025), let us assume that a positive effect of initial rumination on subsequent sleep problems when adjusting for initial sleep problems can be interpreted as a causal increasing effect (not an interpretation we endorse). Then, for consistency, a positive effect of initial rumination on initial sleep problems when adjusting for subsequent sleep problems could be interpreted as a causal decreasing effect. This positive effect would suggest that high initial rumination had decreased high initial degree of sleep problems to the same subsequent level as among individuals with lower initial degree of sleep problems and lower initial degree of rumination. Moreover, a positive, a negative, and a null effect of initial rumination on the subsequent - initial sleep problems difference score could be seen to suggest an increasing, a decreasing, and no causal effect, respectively.
We estimated the effects outlined above, as well as the reversed effects of sleep problems on rumination, in data simulated to resemble the empirical data used by Yang and Lei (2025), with the same sample size and correlations between variables (these correlations are included in our analytic script available at the Open Science Framework, see link below). We used simulated data as the empirical data were not available to us. The corresponding author of the study by Yang and Lei (2025) did not respond to our request for the data. It should be noted that both the standardized effect of X on Y2 when adjusting for Y1 (Equation 1, Cohen et al., 2003) and on the Y2-Y1 difference (Equation 2, Guilford, 1965) are functions of correlations between the variables. Consequently, these effects will be the same in data, empirical or simulated, with the same correlations between variables and simulated data can be used to estimate what the effects would have been in corresponding empirical data.
Our simulated data did, by design, satisfy assumptions of regression analyses, e.g., normally distributed residuals and lack of influential outliers. We do not know if this was the case for the empirical data used by Yang and Lei (2025) as well. However, we do not believe that it would be tenable to argue that due to potential violation of regression assumptions in the data used by Yang and Lei (2025), we should trust their conclusion of causal increasing effects between rumination and sleep problems and ignore our warning (see below) against causal inference.
Analyses and the simulation for the present study were conducted with R 4.4.0 statistical software (R Core Team, 2025) using the MASS package (Venables and Ripley, 2002). The analytic script, which also generates the simulated data, is available at the Open Science Framework at https://osf.io/cwx7s/.
Initial rumination had a positive effect on subsequent sleep problems when adjusting for initial sleep problems (β = 0.198 [0.085; 0.312], p < 0.001) and initial sleep problems had a positive effect on subsequent rumination when adjusting for initial rumination (β = 0.163 [0.049; 0.277], p = 0.005). These effects could be seen to suggest that initial rumination had an increasing causal effect on subsequent change in sleep problems and vice versa. However, initial rumination also had a positive effect on initial sleep problems when adjusting for subsequent sleep problems (β = 0.496 [0.408; 0.584], p < 0.001) and initial sleep problems had a positive effect on initial rumination when adjusting for subsequent rumination (β = 0.494 [0.407; 0.581], p < 0.001). These effects could be seen to suggest, paradoxically, that initial rumination had a decreasing causal effect on subsequent change in sleep problems and vice versa. Moreover, initial rumination had a negative effect on the subsequent-initial difference score on sleep problems (β = −0.199 [−0.299; −0.099], p < 0.001) and initial sleep problems had a negative effect on the subsequent-initial difference score on rumination (β = −0.218 [−0.317; −0.118], p < 0.001). These effects could be seen to suggest that initial rumination had a decreasing causal effect on subsequent change in sleep problems and vice versa. According to proposed benchmarks (Orth et al., 2022), all effect sizes reported above were large (|β| ≥ 0.12).
In summary, the present findings showed that the data analyzed by Yang and Lei (2025) could be used to support contradicting conclusions of increasing and decreasing effects of initial rumination on subsequent change in sleep problems and vice versa. Hence, the data cannot support any of those conclusions and the conclusions of causal increasing effects by Yang and Lei (2025) appear premature and questionable. It is important for researchers to bear in mind that correlations, including adjusted cross-lagged effects, do not prove causality in order not to overinterpret findings, something that appears to have happened to Yang and Lei (2025). We recommend researchers to, as we did here, use triangulation by fitting complementary models to their data. If results from the models converge, conclusions of causality are corroborated (although never finally proven). If, on the other hand and as in the present case, findings diverge, conclusions of causality would appear premature. It should be noted that here, we do not use the term “triangulation” in a broad sense, involving analyses of multiple data sources, multiple theoretical perspectives, etc. Rather, in line with some recent studies (e.g., Sorjonen et al., 2025a; Sorjonen et al., 2025b; Sorjonen and Melin, 2024, 2025a, 2025b), we use the term to refer to fitting different statistical models to the same data.
Data availability statement
Publicly available datasets were analyzed in this study. This data can be found at: the analytic script, which also generates the simulated data, is available at the Open Science Framework at https://osf.io/cwx7s/.
Author contributions
KS: Conceptualization, Formal analysis, Investigation, Methodology, Project administration, Writing – original draft, Writing – review & editing. BM: Conceptualization, Investigation, Supervision, Validation, Writing – review & editing.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Campbell, D. T., and Kenny, D. A. (1999). A primer on regression artifacts. New York: Guilford Press.
Castro-Schilo, L., and Grimm, K. J. (2018). Using residualized change versus difference scores for longitudinal research. J. Soc. Pers. Relat. 35, 32–58. doi: 10.1177/0265407517718387
Cohen, J., Cohen, P., West, S. G., and Aiken, L. S. (2003). Applied multiple regression/correlation analysis for the behavioral sciences. 3rd Edn. Mahwah, NJ: Lawrence Erlbaum Associates.
Guilford, J. P. (1965). Fundamental statistics in psychology and education. 4th Edn. New York: McGraw-Hill.
Orth, U., Meier, L. L., Bühler, J. L., Dapp, L. C., Krauss, S., Messerli, D., et al. (2022). Effect size guidelines for cross-lagged effects. Psychol. Methods 29, 421–433. doi: 10.1037/met0000499
R Core Team. 2025. R: a language and environment for statistical computing Vienna, Austria: R Foundation for Statistical Computing.
Sorjonen, K., Ghaderi, A., and Melin, B. (2025a). Meta-analytic prospective associations between self-esteem and eating disorders appear to be spurious: a reanalysis and comment on Krauss et al. (2023). Front. Psychol. 16:1463701. doi: 10.3389/fpsyg.2025.1463701,
Sorjonen, K., and Melin, B. (2024). Spurious prospective effect of physical activity on problematic smartphone use: a simulated reanalysis and comment on Zhao et al. (2024). Front. Psychol. 15:1485660. doi: 10.3389/fpsyg.2024.1485660,
Sorjonen, K., and Melin, B. (2025a). Inconclusive evidence for a prospective effect of sense of coherence on subjective burden and mental health: a simulated reanalysis and comment on Gonçalves-Pereira et al. (2025). J. Affect. Disord. 391:119939. doi: 10.1016/j.jad.2025.119939
Sorjonen, K., and Melin, B. (2025b). Inconclusive effects between executive functions and symptoms of psychiatric disorders in random-intercept cross-lagged panel models: a simulated reanalysis and comment on Halse et al. (2022). Front. Psychol. 16:1500200. doi: 10.3389/fpsyg.2025.1500200,
Sorjonen, K., Melin, B., and Ingre, M. (2019). Predicting the effect of a predictor when controlling for baseline. Educ. Psychol. Meas. 79, 688–698. doi: 10.1177/0013164418822112,
Sorjonen, K., Melin, B., and Nilsonne, G. (2025b). Cross-lagged network models do not prove causality and may be evaluated through triangulation. Acta Psychol. 260:105562. doi: 10.1016/j.actpsy.2025.105562,
Venables, W. N., and Ripley, B. D. (2002). Modern applied statistics with S (Fourth Ed.). New York: Springer.
Keywords: contradicting support, cross-lagged panel model, rumination, simulation, sleep problems, triangulation
Citation: Sorjonen K and Melin B (2025) Inconclusive evidence for causal effects between rumination and sleep problems: a simulated reanalysis and comment on Yang and Lei (2025). Front. Psychol. 16:1609290. doi: 10.3389/fpsyg.2025.1609290
Edited by:
Baojuan Ye, Jiangxi Normal University, ChinaReviewed by:
Fausto Coutinho-Lourenço, Universidade Federal de São Paulo, BrazilZheng Zhang, South China Normal University, China
Copyright © 2025 Sorjonen and Melin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Kimmo Sorjonen, a2ltbW8uc29yam9uZW5Aa2kuc2U=