Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Microbiol.

Sec. Systems Microbiology

Volume 16 - 2025 | doi: 10.3389/fmicb.2025.1656016

Multi-cohort metagenomics reveals strain functional heterogeneity and demonstrates fecal microbial load correction improves colorectal cancer diagnostic models

Provisionally accepted
Qiucheng  LiQiucheng Li1Fang  LiuFang Liu1Jianfeng  ZhongJianfeng Zhong1Xiaoling  FangXiaoling Fang1Xinyi  ZhangXinyi Zhang1Huizhen  XiongHuizhen Xiong1Guangyi  LiGuangyi Li2*Honglei  ChenHonglei Chen1
  • 1The Eighth Affiliated Hospital of Sun Yat-Sen University, Shenzhen, China
  • 2Hunan University, Changsha, China

The final, formatted version of the article will be published soon.

Colorectal cancer (CRC) is strongly associated with the gut microbiome, yet most studies focus on genus/species-level taxonomy, overlooking strain-level functional heterogeneity. Here, we integrated 1,123 metagenomic samples from seven global CRC cohorts to perform multi-level metagenome-wide association studies (MWAS) and evaluate the impact of fecal microbial load (FML) correction on disease classification. Strain-level analysis revealed conspecific strains with opposing CRC associations, such as Bacteroides thetaiotaomicron strains exhibiting both protective and risk-increasing effects across cohorts:,::::: with:::::::: genomic:::::::::: functional :::::::::: annotation:::::::::: suggesting::::::::: potential :::::::::: underlying::::::::: biological::::::::::: mechanisms. FML correction mitigated confounds, significantly improving within-cohort and cross-cohort model performance for CRC prediction. Notably, genus/species-level models outperformed strain-resolved counterparts in predictive robustness, likely due to higher abundance and cross-population conservation. Our findings highlight the biological value of strain-level analysis for resolving microbiome functional diversity while demonstrating that higher taxonomic levels offer more clinically translatable diagnostic markers. Integrating FML correction and multi-level taxonomic profiling enhances mechanistic insight into microbiome-CRC interactions and improves diagnostic model generalizability across populations.

Keywords: colorectal cancer, gut microbiome, Strain, Metagenomics, Fecal microbial load, Classification models

Received: 29 Jun 2025; Accepted: 01 Sep 2025.

Copyright: © 2025 Li, Liu, Zhong, Fang, Zhang, Xiong, Li and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Guangyi Li, Hunan University, Changsha, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.