- 1Department of Medical Oncology, The First Affiliated Hospital of Zhejiang Chinese Medical University (Zhejiang Provincial Hospital of Chinese Medicine), Hangzhou, Zhejiang, China
- 2Longyou Branch of Zhejiang Provincial Hospital of Chinese Medicine, Quzhou, Zhejiang, China
- 3College of Basic Medical Science, Zhejiang Chinese Medical University, Hangzhou, Zhejiang, China
- 4Academy of Chinese Medical Science, Zhejiang Chinese Medical University, Hangzhou, Zhejiang, China
- 5Key Laboratory of Neuropharmacology and Translational Medicine of Zhejiang Province, School of Pharmaceutical Sciences, Zhejiang Chinese Medical University, Hangzhou, Zhejiang, China
Background: Fecal and salivary microbiota dysbiosis play a crucial role in the pathogenesis of colorectal cancer (CRC). We investigated whether the fecal and salivary microbiota were altered during colorectal tumorigenesis and evaluated their diagnostic performance.
Methods: We enrolled 30 metastatic CRC patients, 30 nonmetastatic CRC patients, and 30 healthy controls between October 2023 and September 2024. Fecal and salivary samples were collected for microbial profiling via 16S rDNA sequencing and bioinformatics analysis.
Results: Fecal and salivary microbiota composition differed during CRC progression, with salivary microbiota progressively enriched in the gut. In addition, fecal and salivary microbial co-occurrence networks dynamically altered during CRC progression. The natural connectivity of fecal microbial community networks exhibited decreased stability, whereas salivary microbial community networks showed increased stability as CRC progressed. Finally, specific fecal microbial amplicon sequence variants (ASVs) associated with colorectal carcinogenesis enabled precise stage-specific diagnosis of CRC, outperforming salivary ASVs classifiers.
Conclusion: This study elucidates stage-specific microbial dynamics in CRC, providing novel insights into clinical diagnostic strategies.
Introduction
Colorectal cancer (CRC) is the second leading cause of cancer-related death in both men and women, with an estimated 1.92 million new CRC cases and 0.90 million CRC-related deaths in 2022, of which approximately 0.24 million deaths occur in China (Bray et al., 2024). Screening for CRC among people aged 50–75 years has substantial net benefits (Patel and Dominitz, 2024). However, only 14% of high-risk population in China undergo CRC screening due to the high cost, invasiveness, and cumbersome preparation required for colonoscopy (Chen et al., 2019). In addition, nonspecific symptoms and absence of clinical biomarkers hinder early CRC detection and intervention. Consequently, up to 60% of patients exhibit local or distant metastasis at the time of diagnosis, correlating with poor prognosis with a 5-year survival rate of less than 20% (Biller and Schrag, 2021). Therefore, a valuable means of predicting the onset of CRC and CRC metastasis is needed urgently.
Although the gut microbiota is not a static entity, it serves as a critical interface dynamically interacting with environmental factors and host health (Zmora et al., 2019), contributing to disease pathogenesis through genetic toxicity, signal transduction, inflammation, immunity, and metabolism (Wong and Yu, 2023). CRC remains one of the most life-threatening malignancies, and unhealthily alterations in fecal microbiota composition in high-risk individuals have been proven to contribute to colorectal carcinogenesis (Cui et al., 2024; Liu et al., 2024; Zepeda-Rivera et al., 2024). Fecal microbiota is highly dynamic, with substantial variation in taxonomic diversity across hosts and health states, reflecting intrahost microbial evolution driven by natural selection. For instance, convergent evidence confirms that specific alterations in fecal microbial composition occur in digestive system diseases, such as in CRC (Coker et al., 2022), gastric cancer (Yu et al., 2024), liver cancer (Rajapakse et al., 2023), and precancerous adenomas (Gao et al., 2023), as well as in other non-neoplastic diseases (Wu et al., 2024), Currently, fecal microbiota has been implicated in the diagnosis of CRC. Coker et al. (2022) established a panel of six fecal bacterial species that distinguished CRC from colorectal adenomas (CRA) with satisfactory diagnostic performance upon validation, indicating the fecal microbial dynamics during CRC progression. The oral cavity acts as the entrance to the gastrointestinal tract and harbors a complex salivary microbiome that is closely linked to the fecal microbiota through oral-to-gut microbiota translocation (Schmidt et al., 2019). Significant alterations in the salivary microbiota of CRC patients highlight its potential as a noninvasive biomarker (Zhang et al., 2020). This positions the readily accessible salivary microbiota as a powerful tool for understanding CRC pathogenesis and distinguishing CRC patients from the healthy population. Recent studies have identified that specific oral bacteria, enriched in the intestinal dysbiotic microbiota of CRC patients, can promote the carcinogenic process and possess significant potential for antitumor therapy (Bergsten et al., 2023; Chen L. et al., 2024; Zepeda-Rivera et al., 2024). However, previous studies have overlooked the critical role of oral-to-gut microbiota translocation during CRC progression. Furthermore, the associations of fecal and salivary microbiota with metastatic CRC remain uninvestigated, along with their potential as diagnostic biomarkers for early metastasis detection.
We hypothesize that stage-specific fecal and salivary microbiota signatures evolve during colorectal carcinogenesis, serving as noninvasive biomarkers for detecting early resectable CRC and monitoring metastasis progression. In addition, given the robust evidence of specific oral bacteria colonizing intestinal dysbiotic microbiota in CRC patients, we hypothesize that oral-to-gut microbiota translocation intensifies throughout colorectal carcinogenesis. Fecal and matching salivary samples were concurrently collected from 30 metastatic CRC patients, 30 nonmetastatic CRC patients, and 30 healthy adult volunteers to characterize the microbial profiles using 16S v3v4 region sequencing approach. We profiled fecal and salivary microbiota and constructed co-occurrence networks to elucidate the ecological dynamics of microbial dysbiosis during CRC progression. Simultaneously, Source Tracker analysis was used to evaluate oral–gut microbial translocation dynamics during CRC progression. Based on bioinformatics analysis, we identified the fecal microbial signatures with acceptable prediction accuracy.
Results
Participants
Based on strict inclusion criteria, 30 metastatic CRC patients (M group), 30 nonmetastatic CRC patients (NM group), and 30 healthy volunteers as normal controls (NC group) were included. Clinical features involving sex; age; BMI; comorbidities; location of primary tumor lesion; histological type; and serum CEA, CA-199, and CA-125 level were presented in the Table 1. The differences in demographic characteristics among the three groups were statistically insignificant. As for CRC patients, the specific distribution of their primary tumor location was as follows: NM group: 7 left colon cancer (LCC), 10 right colon cancer (RCC), and 13 rectal cancer (RC) as well as M group: 10 LCC, 7 RCC, and 13 RC. A total of 58 of patients were diagnosed with adenocarcinoma in terms of pathological classification, while only two of the M group patients were diagnosed with neuroendocrine neoplasms, thus showing consistency with the distribution characteristics of CRC histological types. In addition, serum tumor markers exhibited a significant difference between the M group and the NM group.
Fecal microbiota composition differed in CRC patients with distinct disease stages
Intestinal dysbacteriosis is closely associated with colorectal tumorigenesis. Therefore, healthy subjects as well as CRC patients with and without metastasis were enrolled to mirror the specific fecal microbial alterations during CRC progression. Principal coordinates analysis (PCoA) (Figure 1a) and non-metric multidimensional scaling (NMDS) (Figure 1b) analyses revealed significant β-diversity shifts (Bray–Curtis, p = 0.01) in fecal microbiota, with distinct clustering among the three groups. Moreover, we performed an ASV-based taxonomic analysis at the phylum and genus levels to identify dominant fecal microbiota taxa. At the phylum level, Firmicutes, Bacteroidota, Proteobacteria, Actinobacteriota, Verrucomicrobiota, and Fusobacteria dominated the three groups. In addition, the stacked bar plot of six predominant phyla indicated that the M group had a considerably lower average Firmicute composition (45.68%) than the NM (51.18%) and NC group (52.94%), accompanied by the increasing proportion of the Actinobacteriota (NC 5.73% vs. NM 5.83% vs. M 12.05%) (Figure 1c). The top 30 genera were plotted in the stacked bar at the genus level, while the rest were merged as others. At the genus level, Bacteroides accounted for an increasing proportion along the NC-NM-M sequence (NC 10.92% vs. NM 11.44% vs. M 16.45%) and dominated the NM and M groups. On the contrary, Prevotella_9 was significantly more abundant in the NC group compared with the CRC group (NC 13.24% vs. NM 0.97% vs. M 5.12%). Furthermore, we observed a progressive decline in the relative abundance of Escherichia–Shigella along the NC-NM-M sequence (NC 4.70% vs. NM 4.58% vs. M 2.68%) (Figure 1d). We then shared the top 6 phyla and the top 30 genera in the form of bubble plot and Sankey plot. Bubble plot showed the microbial distribution and changes through the size and location of the bubble among the groups (Figure 1e). The Sankey plot intuitively revealed the branch association between phylum and genus of three groups with different colors (Figure 1f). The average composition of Bacteroides, Prevotella_9, Bifidobacterium, Faecalibacterium, Escherichia-Shigella, Akkermansia, Klebsiella, UCG-002, Ruminococcus_gnavus_group, and Roseburia showed the statistically predominant abundance at the genus level, and Bacteroidota showed the largest proportion at the phylum level (Figures 1e,f).

Figure 1. Fecal microbiota composition differed in CRC patients with distinct disease stages. (a,b) PCoA (a) and NMDS (b) showed beta diversity of fecal microbiota across groups with distinct disease stages. p-value was calculated by the analysis of similarities (ANOSIM) (weighted Unifrac, R = 0.049, p = 0.01). (c,d) Stacked bar plot showed mean fecal microbiota proportions at the phylum (c) and genus (d) levels across groups with distinct disease stages. (e) Bubble plot showed fecal microbial genera colored by phylum and sized by relative abundance across groups with distinct disease stages. (f) Sankey plot indicated the fecal taxonomic flow with the breadth of the branch at the genus (right side) and corresponding phylum (middle side) levels during the disease progression (left side). NC, normal controls; NM, non-metastatic CRC; M, metastatic CRC.
Specific fecal microbial taxa altered in CRC patients with distinct disease stages
Particularly, the heatmap showed difference in microbial distribution at the phylum level (Figure 2a). We observed that the M group featured Eubacterium eligens group, UGG-002, Bifidobacterium, Bacteroides, Sutterella, Veillonella, and Ruminococcus, while the NM group featured Streptococcus, Akkermansia, Fusobacterium, Ruminococcus gnavus group, Ruminococcus torques group, Collinsella, Phascolarctobacterium, Fusicatenibacter, and Klebsiella at the genus level visualized by heatmap (Figure 2b). Box-plots showed the top 10 specific taxa decreased and increased constantly along the NC-NM-M sequence at the genus (Figure 2c) and species (Figure 2d) levels. Specific genera, including Proteus, Shuttleworthia, Pelomonas, Parabacteroides, Leifsonia, Megasphaera, Catabacter, Gracilibacter, Lactovum, and Prevotella_7 showed increased relative abundance during CRC progression, while others including Megamonas, Neisseria, Aggregatibacter, Lactiplantibacillus, Agathobacter, Erwinia, Lachnospiraceae UCG−004, CAG−56, Romboutsia, and UCG−003 decreased (Figure 2c). At the species level, Proteus mirabilis and Bacteroides thetaiotaomicron exhibited an upward trend (Figure 2d).

Figure 2. Specific fecal microbial taxa altered in CRC patients with distinct disease stages. (a,b) Heatmaps showed fecal microbial phyla (a) and genera (b) enriched in CRC patients with distinct disease stages. Color gradient indicated relative abundance. (c,d) Box plots depicted the specific fecal microbial taxa at the genus (c) and species (d) levels with progressive relative abundance changes along the NC-NM-M sequence (p < 0.05). Species named as unclassified indicated no further classification available. p-values were calculated using the Kruskal–Wallis test. The box represented the interquartile range (IQR) between the first and the third quartiles, and the midline represented the median. Whiskers extended to values within 1.5 times IQR. Circles indicated outliers beyond the whiskers. NC, normal controls; NM, non-metastatic CRC; M, metastatic CRC.
Salivary microbiota composition differed in CRC patients with distinct disease stages
A total of 90 salivary samples were available for 16S rDNA sequencing-based taxonomic analysis. Both PCoA (Figure 3a) and NMDS (Figure 3b) analyses demonstrated statistically significant dissimilarities (Bray–Curtis, p = 0.001) in microbial community composition among the groups. At the phylum level (Figure 3c), Proteobacteria, Bacteroidota, Firmicutes, Fusobacteriota, Actinobacteriota, and Campylobacterota dominated the three groups. Statistically, Bacteroidota and Campylobacterota were enriched in the M group relative to the NM and NC groups, whereas phyla such as Proteobacteria showed high abundance in the NC group. Furthermore, Neisseria, Haemophilus, Prevotella_7, Porphyromonas, Fusobacterium, Prevotella, Veillonella, Streptococcus, Leptotrichia, and Alloprevotella dominated the three groups at the genus level (Figure 3d). The stacked bar plot depicting the top 30 genera revealed a progressive decline in Neisseria abundance during CRC progression, contrasting with increased relative abundances of Fusobacterium and Prevotella (Figure 3d). To compressively visualize 16S rDNA sequencing data, bubble and Sankey plots were employed to reveal the six most abundant phyla and 30 most abundant genera across the three groups (Figures 3e,f).

Figure 3. Salivary microbiota composition differed in CRC patients with distinct disease stages. (a,b) PCoA (a) and NMDS (b) showed beta diversity of salivary microbiota across groups with distinct disease stages (weighted Unifrac, R = 0.0797, p = 0.001). (c,d) Stacked bar plot showed mean salivary microbiota proportions at the phylum (c) and genus (d) levels across groups with distinct disease stages. (e) Bubble plot showed salivary microbial genera across groups with distinct disease stages. (f) Sankey plot indicated the salivary taxonomic flow across groups with distinct disease stages. NC, normal controls; NM, non-metastatic CRC; M, metastatic CRC.
Specific salivary microbial taxa altered in CRC patients with distinct disease stages
Particularly, heatmaps illustrated the microbial distribution differences at the phylum (Figure 4a) and genus levels (Figure 4b). Box-plots revealed the top 10 specific taxa that decreased or increased consistently along the NC-NM-M sequence at the genus (Figure 4c) and species (Figure 4d) levels. The relative abundance of specific genera, including Prevotella, Atopobium, Lactobacillus, Escherichia−Shigella, Akkermansia, Klebsiella, Solobacterium, Lachnospiraceae NK4A136 group, Subdoligranulum, and Scardovia increased during CRC progression, while others including Vibrio, Bosea, Simonsiella, Succinivibrionaceae UCG−001, and F0332 decreased (Figure 4c).

Figure 4. Specific salivary microbial taxa altered in CRC patients with distinct disease stages. (a,b) Heatmaps showed salivary microbial phyla (a) and genera (b) enriched in CRC patients with distinct disease stages. (c,d) Box plots depicted the specific salivary microbial taxa at the genus (c) and species (d) levels with progressive relative abundance changes along the NC-NM-M sequence. NC, normal controls; NM, non-metastatic CRC; M, metastatic CRC.
Microbiota co-occurrence networks dynamically altered in CRC patients with distinct disease stages
To assess CRC stage-related changes in gut/oral network complexity, we identified 139 fecal and 281 salivary ASVs (shared by ≥1/3 of samples across groups) and analyzed community structure via SparCC correlation coefficients. We analyzed the interactions among fecal and salivary microorganisms along the NC-NM-M sequence, across 6 networks. Fecal and salivary microbial networks exhibited stage-specific co-occurrence patterns during CRC progression (Figure 5a). In addition, we calculated the natural connectivity to evaluate the resilience in complex networks. Fecal microbial networks demonstrated reduced constancy when the disease progressed, while salivary microbial networks demonstrated increased constancy (Figure 5b). Relative enrichment of oral bacteria in fecal samples has been linked to several digestive system diseases, including Crohn’s disease, ulcerative colitis, irritable bowel syndrome, and even CRC. To investigate whether CRC-induced fecal microbiota changes resulted from oral-to-gut microbiota translocation and intensified with disease progression, we linked salivary and fecal microbiota via Venn diagrams and Source Tracker analysis based on 16S rDNA sequencing. Compared to the NC group, the NM group had more shared ASVs between the salivary and fecal samples as shown in the Venn diagram, suggesting that its salivary and fecal microbiota were more similar (Supplementary Figure S1b). Interestingly, shared ASVs were fewer in the M group than in the NM group (Supplementary Figure S1c). To explore whether oral microbiota colonize the gut during CRC progression, we performed Source Tracker analysis. Source Tracker analysis detected salivary bacteria in the fecal microbiota of 63.3% of M group patients, 50.0% of NM group patients, and 46.7% of NC group patients (Figure 5c), with the highest detection rate observed in the CRC group, particularly the M group. Salivary bacteria constituted 14.9% of the fecal microbiota in the M group, progressively decreasing to 3.2% in the NM group and 1.7% in the NC group (Figure 5d), indicating relative enrichment of salivary bacteria in the gut during CRC progression. Source Tracker details were provided in Supplementary Figures S2a–c. A conceivable circumstance was that oral-to-gut microbiota translocation could drive gut dysbiosis during CRC progression, thereby exacerbating fecal microbial network instability.

Figure 5. Microbiota co-occurrence networks dynamically altered in CRC patients with distinct disease stages. (a) Co-occurrence networks of fecal (n = 139) and salivary (n = 281) ASVs of CRC patients with distinct disease stages, with edges denoting significant correlations (p < 0.05) between co-abundant ASVs. The SparCC algorithm was used to examine statistical significance. (b) The network natural connectivity test assessed the robustness changes in two network topologies during node removal. (c,d) The source tracker analysis showed both the detection rate and detailed proportion of saliva-derived microbiota in the fecal samples of CRC patients with distinct disease stages. In the pie diagrams of detection rate (c), “Saliva (+)” represented the positive proportion of each group that detected saliva-derived bacteria in their fecal microbiota. “Saliva (−)” represented the negative proportion of each group that did not detect saliva-derived bacteria in their fecal microbiota. In the pie diagrams of the detailed proportion of saliva-derived microbiota in the fecal samples (d), “Saliva” implied the percentage of saliva-derived bacteria in the fecal samples and “unknown” implied the percentage of other bacteria except saliva-derived bacteria (n = 30 in the NC, NM, and M groups). NC, normal controls; NM, non-metastatic CRC; M, metastatic CRC.
ASVs-based fecal biomarkers achieved precise stage-specific diagnosis of CRC, outperforming salivary ASV classifiers
Differential analysis revealed the number of differential fecal and salivary bacterial species in pairwise group comparisons. Compared to the NC group, the NM group exhibited 39 differential fecal species (16 enriched, 23 depleted) (Figure 6a), while the M group exhibited 128 differential fecal species (77 enriched, 51 depleted) (Figure 6b). Moreover, compared to the M group, the NM group exhibited 109 differential fecal species (43 enriched, 66 depleted) (Figure 6c). Compared to the NC group, the NM group demonstrated 34 differential salivary species (15 enriched, 19 depleted) (Figure 6d), whereas the M group demonstrated 179,179 differential salivary species (141enriched, 38 depleted) (Figure 6e). Furthermore, compared to the M group, the NM group exhibited 180 differential salivary species (33 enriched, 147 depleted) (Figure 6f). Notably, the differences between the M group and the NC group were greater than those between the NM group and the NC group. The results indicated distinct fecal and salivary microbiota differences between CRC patients at different disease stages.

Figure 6. ASVs-based fecal biomarkers achieved precise stage-specific diagnosis of CRC, outperforming salivary ASV classifiers. (a–f) Volcano plots revealed significantly altered fecal (a–c) and salivary (d–f) bacterial species (p < 0.05, |log2 FC| > 1) in pairwise comparisons among the NM, M, and NC groups. “Enriched:” specific fecal/salivary species were significantly enriched in the front group relative to the rear group; “Depleted:” specific fecal/salivary species were significantly depleted in the front group relative to the rear group; (g,h) ASV-based fecal (g) and salivary (h) biomarkers importance ranking based on random forest scores. (i,j) ROC analysis of top eight ASVs-based fecal (i) and salivary (j) ASVs biomarkers (selected by MeandecreaseGini) for distinguishing among NC, NM, and M groups. NC, normal controls; NM, non-metastatic CRC; M, metastatic CRC; FC, fold change; ASV, amplicon sequence variant; ROC, receiver operating characteristic; AUC, area under the curve.
Using the random forest model with five-fold cross-validation, we next ranked fecal (Figure 6g) and salivary (Figure 6h) bacterial ASVs by their diagnostic importance for NC/NM/M three-class discrimination. We observed substantial overlap between the top 30 fecal and salivary ASV biomarkers selected by Mean Decrease Gini score and Mean Decrease Accuracy score. Subsequently, we constructed three-class random forest models using the top 8 fecal and salivary ASV biomarkers (selected by Mean Decrease Gini) to discriminate among NC, NM, and M groups. The fecal model achieved robust discrimination, attaining an overall test accuracy of 78.95%, Cohen’s Kappa statistic of 0.68, Log Loss of 0.18, Macro-averaged F1-score of 0.80, and Micro-averaged F1-score of 0.78. Notably, receiver operating characteristic (ROC) analysis (one-vs-rest) yielded a macro-averaged area under curve (AUC) of 93.16% (NC-specific AUC = 92.71%, NM-specific AUC = 90.15%, M-specific AUC = 96.63%) and a micro-averaged AUC of 93.40% (Figure 6i), confirming balanced performance across classes. The salivary model attained an overall test accuracy of 61.54%, Cohen’s Kappa statistic of 0.41, Log Loss of 0.52, Macro-averaged F1-score of 0.65, and Micro-averaged F1-score of 0.61. ROC analysis (One-vs-Rest) yielded a macro-averaged AUC of 76.15% (NC-specific AUC = 67.92%, NM-specific AUC = 67.43%, M-specific AUC = 93.10%) and a micro-averaged AUC of 81.30% (Figure 6j).
Discussion
As an aggressive malignancy characterized by insidious onset, rapid metastasis, and high mortality, CRC urgently requires early detection biomarkers to alleviate the disease burden. Recent research has deepened our understanding about the intrinsic links between oral/gut microbiota and digestive tract diseases, particularly CRC (Chen et al., 2022; Baker et al., 2024; Kunath et al., 2024).
Our study identified fecal genera implicated in CRC pathogenesis, including Bacteroides (Zafar and Saier, 2021), Megasphaera (Chen Q. et al., 2024), Veillonella (Ubachs et al., 2021), and Alistipes (Gu et al., 2023). In addition, opportunistic pathobionts (e.g., toxigenic Bacteroides strains) progressively expanded along the NC-NM-M sequence, whereas commensal probiotics like Agathobacter showed significant depletion (Lavelle et al., 2022). Bacteroides promoted intestinal barrier permeability in metastatic CRC, thereby shaping a premetastatic niche (Genua et al., 2021). Moreover, Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis identified key metabolic pathways in the M group, notably AA biosynthesis and metabolism (Supplementary Figure S3). This finding was consistent with the prior evidence that AA metabolism accelerated CRC progression (Tian et al., 2020; Tran et al., 2020; Liu et al., 2021). Specifically, western diets depleted beneficial bacteria that produced anti-inflammatory metabolites. In contrast, Mediterranean diet interventions promoted the abundance of Agathobacter and enhanced intestinal barrier function via increased production of short chain fatty acids (SCFAs) (Tan et al., 2023; Merra et al., 2020). Intriguingly, these microbial shifts also presented potential therapeutic targets for traditional Chinese medicine. For instance, berberine inhibited pathogenic species such as Alistipes and increased the abundance of SCFA-producing bacteria (Yang et al., 2023). We therefore speculated that CRC-associated dysbiosis is both a consequence of deleterious dietary patterns and a malleable “therapeutic interface” that can be modulated through precision interventions. Further research into intervening these microbial imbalances to halt or reverse disease progression holds significant clinical potential.
We tracked CRC stage-dependent co-occurrence network dynamics of fecal and salivary microbiota. Consistent with prior evidence, microbial co-occurrence networks in healthy population exhibited higher stability and complexity than diseased cohorts (Cheng et al., 2022). Structurally integrated networks confer functional robustness and ecosystem resilience (Wagg et al., 2019; Du et al., 2025). Based on the concepts of connectivity and robustness in network theory (Shi et al., 2020), we calculated the natural connectivity of microbial networks. We observed that fecal microbiota networks showed progressive decline in natural connectivity during CRC progression, whereas salivary networks exhibited increasing connectivity. Undoubtedly, an increasing fragility of the fecal microbial network during CRC progression was observed (Figure 5b). However, what we did not expect was that salivary microbial networks exhibited strengthening co-occurrence stability over the same progression (Figure 5b). The results indicated that tumor progression not only altered microbial diversity and composition but also the interaction network stability of both fecal and salivary microbiota. We hypothesized that enhanced robustness in salivary microbiota networks may contribute to fecal network destabilization through oral-to-gut microbiota translocation, potentially influencing host metabolism and tumor promotion. The Source Tracker analysis demonstrated increasing relative abundance of salivary microbiota in feces during CRC progression (Figure 5d), indicating oral-to-gut microbiota translocation-induced dysbiosis. Oral-to-gut microbiota translocation drives colorectal carcinogenesis through pathobiont-mediated barrier disruption and metabolic dysregulation (Park et al., 2021), while simultaneously offering novel theranostic opportunities: stage-specific translocation signatures enable noninvasive liquid biopsy for early metastasis risk stratification, and targeted interventions against key oral pathobionts may halt CRC progression by restoring gut ecological balance.
We elucidated the fecal and salivary microbiota associated with different CRC stages. In addition, we developed a three-classification diagnostic model based on fecal bacterial ASV biomarkers to distinguish among the NC, NM, and M groups, which demonstrated excellent performance. The model achieved AUC values exceeding 90% across all classifications (NC-specific AUC = 92.71%, NM-specific AUC = 90.15%, M-specific AUC = 96.63%, macro-averaged AUC = 93.16%, micro-averaged AUC = 93.40%) (Figure 6i). These results indicated that fecal microbial communities harbored highly informative biomarkers capable of achieving high discrimination accuracy among the three groups, suggesting strong potential for clinical application as a noninvasive auxiliary diagnostic tool. The salivary bacterial ASV-based model showed acceptable but comparatively lower discriminatory power, with a macro-averaged AUC of 76.15% (NC-specific AUC = 67.92%, NM-specific AUC = 67.43%, M-specific AUC = 93.10%) and a micro-averaged AUC of 81.30% (Figure 6j). Notably, the model exhibited relatively low discriminatory power between the NC and NM groups, while performing better in identifying the M group. This suggested that although salivary microbiota may be indicative of advanced or overt pathological states, it held limited value for detecting early or subtle differences. From a clinical inference perspective, the fecal microbiome demonstrated clear advantages over salivary microbiota in this specific classification task, indicating that gut microbial biomarkers may be more suitable for early screening and stratified management of the disease. Although saliva sampling is noninvasive and convenient, its moderate classification performance suggested that it may be better utilized as a supplementary approach or in combination with other biomarkers to enhance overall diagnostic efficacy. Future studies could focus on developing multimodal integrated models that combine microbial information from diverse sources, such as feces and saliva, to build more robust and generalizable clinical prediction tools.
As a preliminary exploratory study, this research identified CRC progression-associated fecal/salivary microbiota biomarkers and established foundational groundwork for future larger scale research. Although the Source Tracker analysis provided insight into potential oral–gut microbiota translocation, the study did not track specific salivary taxa colonizing the gut and lacked direct experimental validation, as it was not originally designed to examine salivary–fecal microbiota associations during CRC progression. In addition, limitation included the relatively small sample size and lack of external validation for predictive models because of the stringent inclusion criteria and the single-center recruitment. We have implemented five-fold cross-validation to enhance model performance and will conduct external validation in multicenter cohorts and prospective studies to assess clinical utility in the future.
Materials and methods
Subjects and specimen collection
From March 2024 to January 2025, we recruited 90 participants (30 patients with metastatic CRC, 30 patients with nonmetastatic CRC, and 30 normal control participants) from the First Affiliated Hospital of Zhejiang Chinese Medical University. All metastatic CRC and nonmetastatic CRC subjects were diagnosed by standard colorectal surgery and the control individuals were considered without CRC-related high-risk factors. The inclusion and exclusion criteria for all patients and healthy control participants were the same as described in detail in Supplementary materials. The 16S rDNA gene amplicon of fecal and salivary microbiota and clinical features were obtained. All fecal samples were further analyzed by metagenomic sequencing to acquire fecal microbiota information comprehensively. Fresh fecal and matching salivary samples were collected from 90 participants in sterile plastic pots containing DNA-stabilizing buffer (DNA Shield; LC-Bio Technology Co., Ltd). Participants were strictly refrained from eating, drinking, smoking, or chewing gum for 30 min before saliva collection. The samples were stored at −20 °C for 4 h and subsequently were stored at −80 °C for 24 h for long-term storage until DNA extraction and taxonomic analysis. In addition, clinical data pertaining to age, sex, BMI, and follow-up information were acquired from the same hospital-structured questionnaires. All subjects provided written informed consent. The Clinical Research Ethics Committee of the First Affiliated Hospital of Zhejiang Chinese Medical University approved the study protocol (2024-KLS-015-01).
DNA extraction, amplification, sequence procession, library construction, and 16S rDNA sequencing analysis
Fecal microbial DNA was extracted using Fecal Genome DNA Extraction Kit (AU46111-96, BioTeke, China) and salivary DNA was extracted using Salivary Genome DNA Extraction Kit (AU46111-96, BioTeke, China), respectively, according to manufacturer’s instructions. The DNA integrity, sizes, and concentrations were generated by Qubit (Invitrogen, United States). The 16S rDNA gene V3–V4 region was amplified using TransStart FastPfu Polymerase with barcode-indexed primers (341F5′-CCTACGGGNGGCWGCAG-3′;805R:5′-GACTACHVGGG TATCTAATCC-3′) in a two-step PCR protocol. The amplification products were then purified by AMPure XT Beads (Beckman Coulter Genomics, Danvers, MA, United States) and were quantified using Qubit (Invitrogen, United States). Quality-filtered reads were obtained and then evaluated using an Agilent 2100 Bioanalyzer (Agilent, United States) and Illumina library quantitative kits (Kapa Biosciences, Woburn, MA, United States), which were further sequenced on the Illumina NovaSeq 6000 (PE250) platform with the default settings for paired-end reads supported by Lc-Bio Technologies Co., Ltd. (Hangzhou, China). Sequencing primer were removed from de-multiplexed raw sequences using cutadapt (v_1.9). Subsequently, paired end reads were merged using FLASH (v_1.2.8). Low-quality reads (quality scores<20), short reads (<100 bp), and reads containing more than 5% “N” records were trimmed by using the sliding-window algorithm method in fqtrim (v_0.94). Quality filtering was performed to obtain high-quality clean tags according to fqtrim. Chimeric sequences were filtered using Vsearch software (v_2.3.4). DADA2 was applied for denoising and generating ASVs.
Co-abundance ASVs
The ASVs present in one-third of fecal or salivary samples were considered as key ASVs. The correlations among 139 key ASVs of fecal samples and 281 key ASVs of salivary samples were identified by Sparse Correlations for Compositional Data (SparCC) algorithm (p-value < 0.05 was considered statistically significant) using R package SpiecEasi (v_1.1.3). Weak correlations (correlation coefficient <0.3 or >−0.3, p-value < 0.05) were removed. After filtering, Gephi (v_0.9.2) was used for the visualization of significant co-occurrence interactions.
Natural connectivity
Co-abundance ASVs in the corresponding networks, represented as the N nodes, were removed one by one. The natural connectivity statistics (y axis) reflect to invulnerability were calculated when each node (x axis) was removed. The natural connectivity value ( was calculated by R package igraph (v_4.4.3), it is related to a function , which represents the eigenvalues of the network adjacency matrix and can be used to calculate the decrease in redundancy of alternative pathways as nodes are removed, i.e.:
Where i represented the ith co-abundance ASVs.
Random forest classifier
The machine learning analysis utilized the normalized relative abundance of fecal and salivary ASVs. We implemented a three-class Random Forest Classifier, with model performance rigorously quantified through five-fold cross-validation. Evaluation metrics included: classification accuracy, Cohen’s Kappa statistic, logarithmic loss (Log Loss), macro-averaged F1-score, and micro-averaged F1-score. Feature selection and model optimization were performed using R packages randomForest (v_4.4.3) and caret (v_4.4.3) packages. Only the top eight ASVs ranked by meandecreaseGini impurity were retained as predictive features. Diagnostic efficacy was further evaluated using ROC analysis (one-vs-rest) implemented in the R package pROC (v_4.4.3). Model performance was assessed by macro-averaged area under the curve (macro-AUC) and micro-averaged area under the curve (micro-AUC).
Statistical analysis
Statistical analyses were carried out in R version 4.4.2 to identify microbial differences among groups. Wilcoxon rank-sum test was performed in the pairwise comparisons, while Kruskal–Wallis test was used for multiple group comparisons. Benjamini–Hochberg false-discovery rate (FDR) was used to control the false positive rate in multiple comparisons and ensure the credibility of the statistical results. Analysis of variance (ANOVA) was used to compare age and BMI among M, NM, and NC groups; Pearson’s chi-squared test was used to compare gender distribution differences among the three groups. Fisher’s exact test was used to compare the distribution difference of tumor location, tumor type, and tumor markers between the M group and the NM group. p < 0.05 was considered significant.
Data availability statement
The datasets presented in this study are publicly available. This data can be found here: https://ngdc.cncb.ac.cn/gsa, accession number CRA025003.
Ethics statement
The studies involving humans were approved by Ethics Committees of the First Affiliated Hospital of Zhejiang Chinese Medical University (Zhejiang Provincial Hospital of Chinese Medicine). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.
Author contributions
ZW: Data curation, Formal analysis, Writing – original draft. ZZ: Data curation, Formal analysis, Writing – original draft. CZ: Formal analysis, Writing – original draft. ZL: Data curation, Formal analysis, Writing – original draft. HY: Formal analysis, Writing – original draft. GJ: Conceptualization, Writing – original draft. SD: Writing – review & editing. JY: Methodology, Writing – review & editing. LS: Methodology, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This research was supported by National Natural Science Foundation of China (grant number: 82204824), Medical Health Science and Technology Project of Zhejiang Provincial Health Commission of China (grant number: 2024KY1226) and the Young Elite Scientists Sponsorship Program by CACM (grant number: 2021-QNRC2-B13). Zhejiang Provincial Traditional Chinese Medicine Science and Technology Program (grant number: 2024ZL1222).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2025.1658693/full#supplementary-material
References
Baker, J. L., Mark Welch, J. L., Kauffman, K. M., McLean, J. S., and He, X. (2024). The oral microbiome: diversity, biogeography and human health. Nat. Rev. Microbiol. 22, 89–104. doi: 10.1038/s41579-023-00963-6
Bergsten, E., Mestivier, D., Donnadieu, F., Pedron, T., Barau, C., Meda, L. T., et al. (2023). Parvimonas micra, an oral pathobiont associated with colorectal cancer, epigenetically reprograms human colonocytes. Gut Microbes 15:2265138. doi: 10.1080/19490976.2023.2265138
Biller, L. H., and Schrag, D. (2021). Diagnosis and treatment of metastatic colorectal Cancer: a review. JAMA 325, 669–685. doi: 10.1001/jama.2021.0106
Bray, F., Laversanne, M., Sung, H., Ferlay, J., Siegel, R. L., Soerjomataram, I., et al. (2024). Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 74, 229–263. doi: 10.3322/caac.21834
Chen, F., Dai, X., Zhou, C. C., Li, K. X., Zhang, Y. J., Lou, X. Y., et al. (2022). Integrated analysis of the faecal metagenome and serum metabolome reveals the role of gut microbiome-associated metabolites in the detection of colorectal cancer and adenoma. Gut 71, 1315–1325. doi: 10.1136/gutjnl-2020-323476
Chen, Q., Huang, X., Zhang, H., Jiang, X., Zeng, X., Li, W., et al. (2024). Characterization of tongue coating microbiome from patients with colorectal cancer. J. Oral Microbiol. 16:2344278. doi: 10.1080/20002297.2024.2344278
Chen, L., Kang, Z., Shen, J., Zhao, R., Miao, Y., Zhang, L., et al. (2024). An emerging antibacterial nanovaccine for enhanced chemotherapy by selectively eliminating tumor-colonizing bacteria. Sci Bull (Beijing) 69, 2565–2579. doi: 10.1016/j.scib.2024.06.016
Chen, H., Li, N., Ren, J., Feng, X., Lyu, Z., Wei, L., et al. (2019). Participation and yield of a population-based colorectal cancer screening programme in China. Gut 68, 1450–1457. doi: 10.1136/gutjnl-2018-317124
Cheng, R., Wang, L., Le, S., Yang, Y., Zhao, C., Zhang, X., et al. (2022). A randomized controlled trial for response of microbiome network to exercise and diet intervention in patients with nonalcoholic fatty liver disease. Nat. Commun. 13:2555. doi: 10.1038/s41467-022-29968-0
Coker, O. O., Liu, C., Wu, W. K. K., Wong, S. H., Jia, W., Sung, J. J. Y., et al. (2022). Altered gut metabolites and microbiota interactions are implicated in colorectal carcinogenesis and can be non-invasive diagnostic biomarkers. Microbiome 10:35. doi: 10.1186/s40168-021-01208-5
Cui, W., Guo, M., Liu, D., Xiao, P., Yang, C., Huang, H., et al. (2024). Gut microbial metabolite facilitates colorectal cancer development via ferroptosis inhibition. Nat. Cell Biol. 26, 124–137. doi: 10.1038/s41556-023-01314-6
Du, Y., Yang, Y., Wu, S., Gao, X., He, X., and Dong, S. (2025). Core microbes regulate plant-soil resilience by maintaining network resilience during long-term restoration of alpine grasslands. Nat. Commun. 16:3116. doi: 10.1038/s41467-025-58080-2
Gao, W., Gao, X., Zhu, L., Gao, S., Sun, R., Feng, Z., et al. (2023). Multimodal metagenomic analysis reveals microbial single nucleotide variants as superior biomarkers for early detection of colorectal cancer. Gut Microbes 15:2245562. doi: 10.1080/19490976.2023.2245562
Genua, F., Raghunathan, V., Jenab, M., Gallagher, W. M., and Hughes, D. J. (2021). The role of gut barrier dysfunction and microbiome Dysbiosis in colorectal Cancer development. Front. Oncol. 11:626349. doi: 10.3389/fonc.2021.626349
Gu, K., Wu, A., Yu, B., Zhang, T., Lai, X., Chen, J., et al. (2023). Iron overload induces colitis by modulating ferroptosis and interfering gut microbiota in mice. Sci. Total Environ. 905:167043. doi: 10.1016/j.scitotenv.2023.167043
Kunath, B. J., De Rudder, C., Laczny, C. C., Letellier, E., and Wilmes, P. (2024). The oral-gut microbiome axis in health and disease. Nat. Rev. Microbiol. 22, 791–805. doi: 10.1038/s41579-024-01075-5
Lavelle, A., Nancey, S., Reimund, J. M., Laharie, D., Marteau, P., Treton, X., et al. (2022). Fecal microbiota and bile acids in IBD patients undergoing screening for colorectal cancer. Gut Microbes 14:2078620. doi: 10.1080/19490976.2022.2078620
Liu, C., Wang, L., Liu, X., Tan, Y., Tao, L., Xiao, Y., et al. (2021). Cytoplasmic SHMT2 drives the progression and metastasis of colorectal cancer by inhibiting β-catenin degradation. Theranostics 11, 2966–2986. doi: 10.7150/thno.48699
Liu, J. L., Xu, X., Rixiati, Y., Wang, C. Y., Ni, H. L., Chen, W. S., et al. (2024). Dysfunctional circadian clock accelerates cancer metastasis by intestinal microbiota triggering accumulation of myeloid-derived suppressor cells. Cell Metab. 36, 1320–1334.e9. doi: 10.1016/j.cmet.2024.04.019
Merra, G., Noce, A., Marrone, G., Cintoni, M., Tarsitano, M. G., Capacci, A., et al. (2020). Influence of Mediterranean diet on human gut microbiota. Nutrients 13. doi: 10.3390/nu13010007
Park, S. Y., Hwang, B. O., Lim, M., Ok, S. H., Lee, S. K., Chun, K. S., et al. (2021). Oral-gut microbiome Axis in gastrointestinal disease and Cancer. Cancers (Basel) 13. doi: 10.3390/cancers13092124
Patel, S. G., and Dominitz, J. A. (2024). Screening for colorectal Cancer. Ann. Intern. Med. 177:Itc49-itc64. doi: 10.7326/aitc202404160
Rajapakse, J., Khatiwada, S., Akon, A. C., Yu, K. L., Shen, S., and Zekry, A. (2023). Unveiling the complex relationship between gut microbiota and liver cancer: opportunities for novel therapeutic interventions. Gut Microbes 15:2240031. doi: 10.1080/19490976.2023.2240031
Schmidt, T. S., Hayward, M. R., Coelho, L. P., Li, S. S., Costea, P. I., Voigt, A. Y., et al. (2019). Extensive transmission of microbes along the gastrointestinal tract. eLife 8. doi: 10.7554/eLife.42693
Shi, Y., Zhang, K., Li, Q., Liu, X., He, J. S., and Chu, H. (2020). Interannual climate variability and altered precipitation influence the soil microbial community structure in a Tibetan plateau grassland. Sci. Total Environ. 714:136794. doi: 10.1016/j.scitotenv.2020.136794
Tan, J. K., Macia, L., and Mackay, C. R. (2023). Dietary fiber and SCFAs in the regulation of mucosal immunity. J. Allergy Clin. Immunol. 151, 361–370. doi: 10.1016/j.jaci.2022.11.007
Tian, Q., Yuan, P., Quan, C., Li, M., Xiao, J., Zhang, L., et al. (2020). Phosphorylation of BCKDK of BCAA catabolism at Y246 by Src promotes metastasis of colorectal cancer. Oncogene 39, 3980–3996. doi: 10.1038/s41388-020-1262-z
Tran, T. Q., Hanse, E. A., Habowski, A. N., Li, H., Ishak Gabra, M. B., Yang, Y., et al. (2020). Α-Ketoglutarate attenuates Wnt signaling and drives differentiation in colorectal cancer. Nat Cancer 1, 345–358. doi: 10.1038/s43018-020-0035-5
Ubachs, J., Ziemons, J., Soons, Z., Aarnoutse, R., van Dijk, D. P. J., Penders, J., et al. (2021). Gut microbiota and short-chain fatty acid alterations in cachectic cancer patients. J. Cachexia. Sarcopenia Muscle 12, 2007–2021. doi: 10.1002/jcsm.12804
Wagg, C., Schlaeppi, K., Banerjee, S., Kuramae, E. E., and van der Heijden, M. G. A. (2019). Fungal-bacterial diversity and microbiome complexity predict ecosystem functioning. Nat. Commun. 10:4841. doi: 10.1038/s41467-019-12798-y
Wong, C. C., and Yu, J. (2023). Gut microbiota in colorectal cancer development and therapy. Nat. Rev. Clin. Oncol. 20, 429–452. doi: 10.1038/s41571-023-00766-x
Wu, X., Zhang, T., Zhang, T., and Park, S. (2024). The impact of gut microbiome enterotypes on ulcerative colitis: identifying key bacterial species and revealing species co-occurrence networks using machine learning. Gut Microbes 16:2292254. doi: 10.1080/19490976.2023.2292254
Yang, F., Gao, R., Luo, X., Liu, R., and Xiong, D. (2023). Berberine influences multiple diseases by modifying gut microbiota. Front. Nutr. 10:1187718. doi: 10.3389/fnut.2023.1187718
Yu, X., Ou, J., Wang, L., Li, Z., Ren, Y., Xie, L., et al. (2024). Gut microbiota modulate CD8(+) T cell immunity in gastric cancer through butyrate/GPR109A/HOPX. Gut Microbes 16:2307542. doi: 10.1080/19490976.2024.2307542
Zafar, H., and Saier, M. H. Jr. (2021). Gut Bacteroides species in health and disease. Gut Microbes 13, 1–20. doi: 10.1080/19490976.2020.1848158
Zepeda-Rivera, M., Minot, S. S., Bouzek, H., Wu, H., Blanco-Míguez, A., Manghi, P., et al. (2024). A distinct Fusobacterium nucleatum clade dominates the colorectal cancer niche. Nature 628, 424–432. doi: 10.1038/s41586-024-07182-w
Zhang, S., Kong, C., Yang, Y., Cai, S., Li, X., Cai, G., et al. (2020). Human oral microbiome dysbiosis as a novel non-invasive biomarker in detection of colorectal cancer. Theranostics 10, 11595–11606. doi: 10.7150/thno.49515
Keywords: colorectal cancer, fecal microbiota, salivary microbiota, 16S rDNA gene sequencing, microbial co-occurrence networks
Citation: Wu Z, Zhu Z, Zhou C, Lin Z, Yang H, Jiang G, Ding S, Yu J and Sun L (2025) Stage-specific fecal and salivary microbiota signatures for noninvasive detection of colorectal cancer. Front. Microbiol. 16:1658693. doi: 10.3389/fmicb.2025.1658693
Edited by:
Babak Pakbin, Texas A&M University Department of Food Science and Technology, United StatesReviewed by:
Arash Shahsavari, Texas A&M University, United StatesAndrea Urrutia, Texas A&M University, United States
Copyright © 2025 Wu, Zhu, Zhou, Lin, Yang, Jiang, Ding, Yu and Sun. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Shuning Ding, c25kaW5nQHpjbXUuZWR1LmNu; Jieru Yu, amllcnV5dUB6Y211LmVkdS5jbg==; Leitao Sun, c3VubnlsdEB6Y211LmVkdS5jbg==
†These authors have contributed equally to this work and share first authorship