Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Microbiomes

Sec. Omics and Bioinformatics for Microbiomes

Volume 4 - 2025 | doi: 10.3389/frmbi.2025.1584516

Analysis of Microbiome High Dimensional Experimental Design Data using Generalized Linear Models and ANOVA Simultaneous Component Analysis

Provisionally accepted
  • 1University of Amsterdam, Amsterdam, Netherlands
  • 2Wageningen University and Research, Wageningen, Netherlands

The final, formatted version of the article will be published soon.

In microbiome studies, addressing the unique characteristics of sequence data such as compositionality, zero-inflation, overdispersion, high dimensionality, and non-normality is crucial for accurate analysis. In addition, it is worthwhile to integrate the experimental design elements into the microbiome data analysis for understanding how factors like treatment, time, and interactions affect microbial abundance. To achieve these objectives, we developed a new method that combines Generalized Linear Models (GLMs) with ANOVA Simultaneous Component Analysis (ASCA), which we coined GLM-ASCA. This method aims to improve microbiome analysis by providing a more comprehensive understanding of differential abundance patterns in response to experimental conditions. This is accomplished by modeling the unique characteristics of microbiome sequence data with GLMs and using ASCA to effectively separate the effects of different experimental factors on microbial abundance. GLM-ASCA is evaluated using simulated data and subsequently used on real data to analyze the effect of nitrogen deficiency on root microbiome recruitment in tomato. Simulation studies demonstrated the effectiveness of GLM-ASCA to analyze microbiome data in complex experimental designs, and the real data application revealed valuable insights into the dynamics of microbial communities under nitrogen starvation, as well as identifying beneficial bacterial species that promote the growth and health of tomato through nitrogen fixation.

Keywords: Generalized Linear Models, ANOVA simultaneous component analysis, experimental design, High dimensional microbiomedata, differential abundance analysis, Tweedie model

Received: 27 Feb 2025; Accepted: 19 Sep 2025.

Copyright: © 2025 Abegaz, Abedini, Dong, Westerhuis, Van Eeuwijk, Bouwmeester and Smilde. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Fentaw Abegaz, fentawabegaz@gmail.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.