The gut microbiome is of utmost importance to human health. While a healthy microbiome can be represented by a variety of structures, its functional capacity appears to be more important. Gene content of the community can be assessed by “shotgun” metagenomics, but this approach is still too expensive. High-throughput amplicon-based surveys are a method of choice for large-scale surveys of links between microbiome, diseases, and diet, but the algorithms for predicting functional composition need to be improved to achieve good precision. Here we show how feature engineering based on microbial phenotypes, an advanced method for functional prediction from 16S rRNA sequencing data, improves identification of alterations of the gut microbiome linked to the disease. We processed a large collection of published gut microbial datasets of inflammatory bowel disease (IBD) patients to derive their community phenotype indices (CPI)—high-precision semiquantitative profiles aggregating metabolic potential of the community members based on genome-wide metabolic reconstructions. The list of selected metabolic functions included metabolism of short-chain fatty acids, vitamins, and carbohydrates. The machine-learning approach based on microbial phenotypes allows us to distinguish the microbiome profiles of healthy controls from patients with Crohn's disease and from ones with ulcerative colitis. The classifiers were comparable in quality to conventional taxonomy-based classifiers but provided new findings giving insights into possible mechanisms of pathogenesis. Feature-wise partial dependence plot (PDP) analysis of contribution to the classification result revealed a diversity of patterns. These observations suggest a constructive basis for defining functional homeostasis of the healthy human gut microbiome. The developed features are promising interpretable candidate biomarkers for assessing microbiome contribution to disease risk for the purposes of personalized medicine and clinical trials.
Cancer can be generally defined as a cluster of systematic diseases triggered by abnormal cell proliferation and growth. With the development of biological sciences and biotechnologies, the etiology of cancer is partially revealed, including some of the most substantial pathogenic factors [either endogenous (genetics) or exogenous (environmental)]. However, some remaining factors that contribute to the tumorigenesis but have not been analyzed and discussed in detail remain. For instance, some typical correlations between microorganisms and tumorigenesis have been reported already, but previous studies are just sporadic studies on single microorganism–cancer subtype pairs and do not explain and validate the specific contribution of microbiome on tumorigenesis. On the basis of the systematic microbiome analyses of blood and cancer-associated tissues in cancer patients/controls in public domain, we performed interpretable analyses. We identified several core regulatory microorganisms that contribute to the classification of multiple tumor subtypes and established quantitative predictive models for interpretable prediction by using multiple machine learning methods. We also compared the optimal features (microorganisms) and rules identified from microbiome profiles processed using the Kraken and the SHOGUN. Collectively, our study identified new microbiome signatures and their interpretable classification rules for cancer discrimination and carried out reliable methodological comparison for robust cancer microbiome analyses, thereby promoting the development of tumor etiology at the microbiome level.
The human gut microbiome is a reservoir for antibiotic resistance gene (ARG). Therefore, characterizing resistome distribution and potential disease markers can help manage antibiotics at the clinical level. While much population-level research has highlighted the strong effect of donor geographic origin on ARG prevalence in the human gut, little is known regarding the effects of other properties, such as age, sex, and disease. Here we employed 2,037 fecal metagenomes from 12 countries. By quantifying the known resistance genes for 24 types of antibiotics in each community, we showed that tetracycline, aminoglycoside, beta-lactam, macrolide-lincosamide-streptogramin (MLS), and vancomycin resistance genes were the dominant ARG types in the human gut. We then compared the ARG profiles of 1427 healthy individuals from the 2,037 samples and observed significant differences across countries. This was consistent with expectations that regional antibiotic usage and exposure in medical and food production contexts affect distribution. Although no specific uniform pattern of ARG was observed, a significant increase in resistance potential among multiple disease groups implied that the disease condition may be another source of ARG variance. In particular, the co-occurrence pattern of some enriched bacterial species and ARGs that were obtained in type 2 diabetes (T2D) and liver cirrhosis patients implied that some disease-associated species may be potential hosts of enriched ARGs, which could be potential biomarkers for the prediction and intervention of such diseases. Overall, our study identifies factors associated with the human gut resistome, including substantial effects of region and heterogeneous effects of disease status, and highlights the value of ARG analysis in disease research and clinical applications.
Frontiers in Molecular Biosciences
Microenvironmental Evolution of Chronic Diseases and Tumors in the Digestive Tract, Screening for New Diagnostic and Therapeutic Targets