ORIGINAL RESEARCH article
Front. Microbiol.
Sec. Systems Microbiology
Volume 16 - 2025 | doi: 10.3389/fmicb.2025.1656016
Multi-cohort metagenomics reveals strain functional heterogeneity and demonstrates fecal microbial load correction improves colorectal cancer diagnostic models
Provisionally accepted- 1The Eighth Affiliated Hospital of Sun Yat-Sen University, Shenzhen, China
- 2Hunan University, Changsha, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Colorectal cancer (CRC) is strongly associated with the gut microbiome, yet most studies focus on genus/species-level taxonomy, overlooking strain-level functional heterogeneity. Here, we integrated 1,123 metagenomic samples from seven global CRC cohorts to perform multi-level metagenome-wide association studies (MWAS) and evaluate the impact of fecal microbial load (FML) correction on disease classification. Strain-level analysis revealed conspecific strains with opposing CRC associations, such as Bacteroides thetaiotaomicron strains exhibiting both protective and risk-increasing effects across cohorts:,::::: with:::::::: genomic:::::::::: functional :::::::::: annotation:::::::::: suggesting::::::::: potential :::::::::: underlying::::::::: biological::::::::::: mechanisms. FML correction mitigated confounds, significantly improving within-cohort and cross-cohort model performance for CRC prediction. Notably, genus/species-level models outperformed strain-resolved counterparts in predictive robustness, likely due to higher abundance and cross-population conservation. Our findings highlight the biological value of strain-level analysis for resolving microbiome functional diversity while demonstrating that higher taxonomic levels offer more clinically translatable diagnostic markers. Integrating FML correction and multi-level taxonomic profiling enhances mechanistic insight into microbiome-CRC interactions and improves diagnostic model generalizability across populations.
Keywords: colorectal cancer, gut microbiome, Strain, Metagenomics, Fecal microbial load, Classification models
Received: 29 Jun 2025; Accepted: 01 Sep 2025.
Copyright: © 2025 Li, Liu, Zhong, Fang, Zhang, Xiong, Li and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Guangyi Li, Hunan University, Changsha, China
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.