- 1Department of Oral and Maxillofacial Surgery, Suining Central Hospital, Suining, Sichuan, China
- 2Department of Respiratory Medicine and Critical Care Medicine, Suining Central Hospital, Suining, Sichuan, China
Background: The oral microbiome has been increasingly recognized for its role in systemic health through the oral–lung axis. However, population-level evidence linking oral microbial diversity and composition with chronic respiratory diseases (CRD) remains limited.
Methods: We analyzed data from 4,384 adults in the 2009–2012 National Health and Nutrition Examination Survey (NHANES), defining CRD by self-reported chronic obstructive pulmonary disease (COPD), asthma, emphysema, or chronic bronchitis. Oral rinse samples underwent 16S ribosomal RNA (16S rRNA) V1–V3 sequencing. Alpha diversity, including observed amplicon sequence variants (ASVs), Faith’s phylogenetic diversity (Faith’s PD), Shannon–Weiner index, and Simpson index, and beta diversity, including Bray–Curtis, weighted UniFrac, and unweighted UniFrac distances, were assessed. Associations with CRD were examined using weighted logistic regression and restricted cubic splines (RCS). Differential genus abundance was identified by Wilcoxon tests with false discovery rate correction. A random forest model integrated microbial and clinical features. An independent hospital cohort was additionally profiled by 16S rRNA sequencing, and genus-level differences were assessed with linear discriminant analysis effect size (LEfSe) to validate NHANES findings.
Results: Higher alpha diversity was inversely associated with CRD risk; each standard deviation increase in observed ASVs and Faith’s PD reduced CRD odds by 19 and 17%, respectively (p < 0.05). Beta diversity showed significant community-level separation by CRD status (p = 0.01). Several genera, including Rothia and Veillonella, were enriched in CRD, whereas Prevotella, Haemophilus, and Neisseria were more abundant in non-CRD individuals. The random forest model achieved an area under the curve (AUC) of 0.65. In the hospital cohort, compositional shifts were consistent with NHANES findings, and LEfSe confirmed the depletion of Alloprevotella and Peptostreptococcus in CRD patients.
Conclusion: Oral microbial diversity and composition were significantly associated with CRD across both a representative U. S. population and a hospital cohort. Select genera and diversity indices may serve as non-invasive biomarkers for respiratory health, warranting further validation in longitudinal and mechanistic studies.
1 Introduction
The oral cavity harbors a complex ecosystem of microorganisms—including bacteria, fungi, protozoa, mycoplasmas, and viruses—alongside teeth, gingiva, tongue, mucosa, and saliva (1). These microbes often form biofilms that support immune regulation, epithelial defense, and microbial homeostasis (2–4).
When this balance is disrupted, pathogenic species may cross local barriers and spread via the airway, bloodstream, or digestive tract, influencing systemic health (5). Oral pathogens have been implicated in cardiovascular disease, diabetes, endocarditis, atherosclerosis, rheumatoid arthritis, and cancers (6–10). In recent years, the concept of an “oral–lung axis” has drawn attention. Oral microorganisms can reach the lower respiratory tract through aspiration or mucosal migration and may contribute to chronic respiratory diseases (CRD), such as chronic obstructive pulmonary disease (COPD) and asthma (11). Indeed, genera such as Veillonella, Prevotella, and Rothia—common in the oral cavity—are frequently enriched in sputum and bronchoalveolar lavage fluid of COPD patients (12).
CRD, including COPD, asthma, chronic bronchitis, and emphysema, are major causes of morbidity and mortality worldwide, with disproportionate impact in low- and middle-income countries (13, 14). Known risk factors such as smoking, air pollution, and occupational exposures play important roles, but their predictive value for early detection remains limited (15). Novel biomarkers, particularly those linked to the oral microbiome, may enhance CRD risk assessment and prevention. However, despite growing evidence, population-based studies directly linking oral microbial diversity to aggregated CRD outcomes remain scarce.
Previous studies have suggested enrichment of oral bacteria in lung samples from CRD patients. However, most focused on single diseases, yielding inconsistent findings, while in clinical practice these conditions frequently overlap. Our study uniquely addresses this gap by aggregating COPD, asthma, chronic bronchitis, and emphysema into a unified CRD outcome, thereby reflecting real-world comorbidity and improving statistical power. To our knowledge, this is the first study to integrate a nationally representative cohort (NHANES) with a hospital-based cohort to investigate the association between oral microbial diversity and aggregated CRD outcomes. This two-stage design strengthens epidemiologic evidence and highlights potential microbial biomarkers with implications for both public health and clinical care.
2 Materials and methods
2.1 NHANES study
NHANES is a continuous, nationally representative cross-sectional study that uses a stratified, multistage probability sampling method to assess the health of the non-institutionalized U. S. population (16). The study protocol was approved by the National Center for Health Statistics Ethics Review Board, and all participants provided written informed consent. This study followed the STROBE reporting guidelines (17). For the present analysis, we combined data from the 2009–2010 and 2011–2012 NHANES cycles. Among 20,293 participants initially enrolled, those without oral microbiome sequencing data (n = 10,945), missing CRD status (n = 11), or incomplete covariate information (n = 4,953) were excluded. Missing covariate data included periodontal measures (n = 4,053), poverty income ratio (n = 437), alcohol use (n = 401), diabetes status (n = 34), body mass index (BMI) (n = 18), marital status (n = 4), smoking status (n = 2), education level (n = 3), and hypertension diagnosis (n = 1). A total of 4,384 participants were included in the final analytic sample (Figure 1).
Figure 1. Flowchart depicting participant inclusion and exclusion criteria for the study in the NHANES.
Oral rinse samples collected during the 2009–2010 and 2011–2012 cycles were used for oral microbiome profiling. Genomic DNA was extracted and amplified targeting the V1–V3 regions of the 16S ribosomal RNA (rRNA) gene, followed by high-throughput sequencing. Raw sequence reads underwent standard quality control procedures to generate amplicon sequence variants (ASVs), with taxonomic classification assigned from phylum to genus levels using curated reference databases. Alpha diversity was assessed using Observed ASVs, Faith’s phylogenetic diversity (Faith’s PD), Shannon–Weiner index, and Simpson index, calculated from rarefied datasets normalized to 10,000 reads per sample. Each metric was averaged across 10 independent subsamplings to enhance stability (18). Beta diversity was evaluated using Bray–Curtis dissimilarity (19), unweighted UniFrac, and weighted UniFrac distances to capture between-sample variation.
The CRD were defined as having at least one of the following conditions: COPD, asthma, emphysema, or chronic bronchitis. COPD was identified based on either a self-reported medical diagnosis or a post-bronchodilator FEV₁/FVC ratio <0.70 from spirometry testing. Asthma, emphysema, and chronic bronchitis were defined using participants’ responses to structured questionnaire items indicating whether a healthcare professional had ever diagnosed them with these conditions. Covariates included both continuous (age, BMI) and categorical variables (sex, race/ethnicity, education level, marital status, and poverty income ratio [<1.3, 1.3–3.5, ≥3.5]). Smoking status was categorized as never, former, or current; alcohol use was classified as never, former, mild, moderate, or heavy; and physical activity was divided into low (<500 MET-min/week) and high (≥500 MET-min/week), according to NHANES recommendations (20). Hypertension, diabetes, and hyperlipidemia were defined based on a combination of self-reported diagnoses, physical examination findings, and laboratory results (Supplementary Table S1). Oral health–related covariates included periodontitis severity, categorized using CDC/AAP criteria based on clinical attachment loss (CAL) and probing depth (PD) (21), as well as oral hygiene practices such as dental floss and mouthwash use in the past 7 days (yes/no).
2.2 Clinical research
Consecutive adult outpatients were recruited from the Department of Respiratory Medicine at Suining Central Hospital between August and September 2025. Eligible participants were aged 30 years or older and provided written informed consent. Participants were divided into two groups: the CRD group (n = 49), comprising patients with a clinical diagnosis of COPD, asthma, chronic bronchitis, or emphysema; and the control group (n = 46), consisting of outpatients without a history of chronic respiratory diseases. The study was approved by the Ethics Committee of Suining Central Hospital (approval number KYLLKS20250140). Exclusion criteria were: (1) use of antibiotics within the past month; (2) acute oral disease at the time of recruitment; and (3) presence of severe systemic illnesses.
The following data were recorded: (1) Demographic data: age, sex, occupation, height, weight, and BMI; (2) Lifestyle factors: smoking and drinking history, oral hygiene behaviors (e.g., use of dental floss and mouthwash); (3) Medical history: comorbidities and medication use; (4) Clinical examination: pulmonary function testing, disease diagnosis, and periodontal status. Saliva samples (10–15 mL) were collected from each participant and immediately stored at −80 °C. Microbial DNA was extracted, and the V3–V4 regions of the bacterial 16S rRNA gene were amplified and subjected to high-throughput sequencing. Sequencing data were processed using standard quality control pipelines to generate microbial taxonomic profiles and diversity measures.
2.3 Statistical analysis
All analyses were conducted using R software (version 4.4.3).
For the NHANES data, complex survey design weights were incorporated following analytic guidelines. A two-sided p-value < 0.05 was considered statistically significant. Continuous variables were summarized as means ± standard deviations (SD) and compared between CRD and non-CRD groups using weighted t-tests. Categorical variables were reported as frequencies with weighted percentages and compared using the Rao–Scott chi-square test. Weighted logistic regression was applied to examine associations between alpha diversity indices and prevalence of CRD, with results expressed as odds ratios (ORs) and 95% confidence intervals (CIs) per SD increase. Tertile-based analyses and tests for trend were also conducted. Restricted cubic spline (RCS) models were used to evaluate dose–response relationships and potential nonlinearity. Subgroup analyses were performed for Observed ASVs and Faith’s PD across age, sex, race/ethnicity, BMI, and smoking status, with interaction terms to assess effect modification. Beta diversity was evaluated using principal coordinates analysis (PCoA) and PERMANOVA based on Bray–Curtis, unweighted UniFrac, and weighted UniFrac distances. At the genus level, genera with <5% prevalence were excluded, and differentially abundant genera were identified using Wilcoxon tests with false discovery rate (FDR) correction. Key results were visualized with heatmaps and boxplots. A random forest model was constructed using the top 10 differentially abundant genera, alpha diversity indices, and selected clinical variables to classify CRD status, with performance assessed by receiver operating characteristic (ROC) curves and area under the curve (AUC). Sensitivity analyses excluded participants who had recently used antibiotics, and an additional analysis was performed incorporating HEI-2015 (diet quality) as a covariate in the regression models.
For the hospital cohort, baseline characteristics between CRD and non-CRD participants were compared using the independent sample t-test for continuous variables and the chi-square test for categorical variables. α-diversity indices (Observed species, Chao1, Shannon, and Simpson) were calculated in R (v4.4.3) based on rarefied ASV tables. Group differences in α-diversity were evaluated using the Wilcoxon rank-sum test. β-diversity was assessed using R (v4.4.3) based on Bray–Curtis and Jaccard distance metrics. Principal coordinates analysis (PCoA) was performed to visualize community dissimilarities, and PERMANOVA (999 permutations) was applied to test for significance between CRD and non-CRD groups. At the genus level, taxa with a prevalence <5% across samples were excluded from analysis. Relative abundances were compared between groups using the Wilcoxon rank-sum test, with false discovery rate (FDR) correction applied for multiple testing. Differentially abundant taxa were further identified using linear discriminant analysis effect size (LEfSe) with an LDA score threshold of 2.0. Taxonomic cladograms were generated to illustrate taxa enriched in CRD or non-CRD participants. Microbial co-occurrence networks were constructed at multiple taxonomic levels using SparCC correlation analysis, and network modules were visualized to explore potential ecological interactions.
3 Results
3.1 Baseline characteristics
In the NHANES analysis, among an estimated 93,587,279 U. S. adults, the prevalence of CRD was 19.7% (Table 1). Compared with those without CRD, affected individuals were slightly older (49.12 ± 11.03 vs. 47.56 ± 10.61), more likely to be non-Hispanic White (73.89% vs. 67.56%), and had higher rates of divorce, widowhood, or separation (22.89% vs. 17.42%). They also showed greater obesity prevalence (BMI ≥ 30: 43.24% vs. 36.63%), lower income (PIR < 1.3: 20.43% vs. 17.71%), and higher smoking rates (29.39% vs. 16.70%). The CRD group additionally exhibited a higher prevalence of moderate-to-severe periodontitis (35.26% vs. 30.08%), hypertension (43.23% vs. 34.41%), and hyperlipidemia (78.44% vs. 72.96%). No significant differences were found in sex, education, diabetes, physical activity, or oral hygiene behaviors (all p > 0.05). Importantly, participants with CRD had significantly lower oral microbial diversity across Observed ASVs, Faith’s phylogenetic diversity, and the Shannon index (all p < 0.05).
In the hospital-based cohort, 95 participants were enrolled, including 49 with CRD and 46 without CRD (Table 2). Patients with CRD were older (62.94 ± 9.58 vs. 50.63 ± 12.11), more likely to be current smokers (46.94% vs. 21.74%), and more frequently lived in rural areas (73.47% vs. 47.83%). They also had a higher prevalence of periodontitis (48.98% vs. 17.39%), whereas no significant group differences were observed for BMI, sex, alcohol use, marital status, education, diabetes, hypertension, hyperlipidemia, flossing, or mouthwash behaviors (all p > 0.05). Similar to NHANES, the CRD group demonstrated significantly lower alpha diversity indices, including Observed species richness, Chao1, Shannon, and Simpson indices (all p < 0.001).
Table 2. Associations between oral microbial alpha diversity indices and CRD in unadjusted and fully adjusted logistic regression models.
3.2 Association of oral microbial alpha diversity with CRD
In the NHANES cohort (Table 3), individuals with higher Observed ASVs had lower odds of CRD. Specifically, each SD increase in Observed ASVs was associated with OR = 0.82 (95% CI: 0.74–0.90, p < 0.001) in the unadjusted model and OR = 0.81 (95% CI: 0.70–0.93, p = 0.009) after adjusting for covariates. Participants in the highest tertile (Q3) had OR = 0.65 (95% CI: 0.46–0.92, p = 0.023) compared with the lowest tertile (Q1), with a significant trend across tertiles (p for trend = 0.022). Faith’s Phylogenetic Diversity showed similar associations (fully adjusted OR per SD = 0.83, 95% CI: 0.71–0.96, p = 0.019; Q3 vs. Q1 OR = 0.65, 95% CI: 0.46–0.93, p = 0.025, p for trend = 0.020). For the Shannon–Weiner index, the fully adjusted OR per SD was 0.89 (95% CI: 0.79–1.01, p = 0.071), and no significant association was observed for the Simpson index (OR per SD = 0.97, 95% CI: 0.88–1.08, p = 0.574). Dose–response curves using restricted cubic splines confirmed linear inverse associations for Observed ASVs and Faith’s PD, while Shannon and Simpson indices exhibited non-linear patterns (Figures 2A–D). Subgroup analyses showed consistent inverse associations across age, sex, race/ethnicity, education, smoking status, physical activity, BMI, and periodontitis severity (Figures 3A,B).
Table 3. Baseline characteristics of participants in the hospital-based cohort according to CRD status.
Figure 2. Dose–response relationships between oral microbial alpha diversity indices and CRD risk assessed by restricted cubic spline models in the NHANES. (A) Observed ASVs; (B) Faith’s Phylogenetic Diversity; (C) Shannon index; (D) Simpson index.
Figure 3. Subgroup analyses of associations between alpha diversity indices and CRD risk in the NHANES. (A) Observed ASVs; (B) Faith’s phylogenetic diversity.
In the hospital cohort, rarefaction curves confirmed sufficient sequencing depth (Figure 4A). Alpha diversity was significantly lower in CRD patients compared with non-CRD participants, including Chao1 (p = 1.1 × 10−5), Observed species (p = 7.9 × 10−6), Shannon (p = 6.7 × 10−6), and Simpson (p = 6.2 × 10−5) indices (Figure 4B). RCS analysis, adjusted for age, smoking status, and periodontitis, indicated linear negative associations between CRD and the Observed species, Chao 1, and Shannon indices (all p < 0.05), while no significant association was observed for the Simpson index (p = 0.105) (Figure 4C).
Figure 4. Alpha diversity between individuals with and without CRD in the hospital cohort. (A) Rarefaction curves showing observed ASVs by sequencing depth. (B) α-diversity indices (Chao1, Observed ASVs, Shannon, Simpson) comparing CRD and non-CRD groups. (C) Alpha diversity and CRD risk based on RCS models.
3.3 Beta diversity analysis
Beta diversity analyses were conducted to compare overall microbial community composition between participants with and without CRD in both the NHANES and hospital-based cohorts. In the NHANES cohort, principal coordinates analysis (PCoA) and PERMANOVA using Bray–Curtis dissimilarity, unweighted UniFrac, and weighted UniFrac distances showed significant differences in community structure after adjusting for demographic and lifestyle factors (Bray–Curtis: R2 = 7.95%, p = 0.01; Unweighted UniFrac: R2 = 5.34%, p = 0.01; Weighted UniFrac: R2 = 5.93%, p = 0.01) (Figures 5A–C). Similarly, in the hospital cohort, Bray–Curtis and Jaccard-based PCoA revealed partial separation between CRD and non-CRD participants, with PC1 and PC2 explaining 13.6 and 11.0% of variance for Bray–Curtis, and 9.0 and 7.3% for Jaccard (Figures 5D,E). PERMANOVA confirmed significant differences in microbial community structure (Bray–Curtis: R2 = 2.38%, p = 0.003; Jaccard: R2 = 1.98%, p = 0.002).
Figure 5. Principal coordinates analysis (PCoA) plots of beta diversity metrics comparing oral microbial community structure between CRD and non-CRD groups. (A) Bray–Curtis dissimilarity; (B) unweighted UniFrac distance; (C) weighted UniFrac distance. (D) PCoA based on Bray–Curtis distances. (E) PCoA based on Jaccard distances.
3.4 Genus-level differential abundance and predictive modeling of CRD in the NHANES
To identify microbial features linked to CRD, we conducted genus-level differential abundance analysis followed by predictive modeling. After FDR correction (FDR < 0.05), 385 genera showed significant differences in relative abundance between CRD and non-CRD groups. To ensure biological relevance, we further selected genera present in at least 5% of participants, yielding 42 representative genera for hierarchical clustering. The heatmap revealed distinct microbial composition patterns between the two groups (Figure 6A). Among the top 10 differentially abundant genera, Rothia, Veillonella, and Atopobium were enriched in the CRD group. In contrast, Haemophilus, Prevotella, Neisseria, Alloprevotella, Porphyromonas, Aggregatibacter, and Peptostreptococcus were more abundant in the non-CRD group. These genera spanned major phyla such as Actinobacteria, Firmicutes, Bacteroidetes, and Proteobacteria. Boxplots clearly showed the distinct abundance patterns of these genera between groups (Figure 6B).
Figure 6. Genus-level differential abundance and predictive modeling of CRD in the NHANES. (A) Boxplots of top 10 differentially abundant genera between CRD and non-CRD groups. (B) Variable importance plot from random forest classification model. (C) Receiver operating characteristic (ROC) curve assessing model performance. (D) Receiver operating characteristic (ROC) curve assessing model performance.
We then incorporated the 10 genera, two alpha diversity indices (Observed ASVs and Faith’s PD), and seven key clinical variables (age, sex, smoking status, hypertension, alcohol use, BMI, and periodontal status) into a random forest classification model. Variable importance analysis showed that both microbial genera and diversity indices played significant roles in model accuracy (Figure 6C). The receiver operating characteristic (ROC) curve of this combined model yielded an area under the curve (AUC) of 0.652, indicating moderate ability to distinguish between CRD and non-CRD participants (Figure 6D).
3.5 Genus-level compositional differences in the hospital cohort
At the genus level, distinct shifts in microbial composition were observed between CRD and non-CRD groups (Figures 7A,B). In the non-CRD group, genera such as Alloprevotella, Prevotella, and Veillonella were more abundant, whereas the CRD group was enriched in potential pathogenic taxa including Fusobacterium, Leptotrichia, and Porphyromonas.
Figure 7. Genus-level composition and differential analysis of the oral microbiota in CRD and non-CRD groups in the hospital cohort. (A,B) Stacked bar plots of the relative abundance of predominant genera in non-CRD and CRD participants. (C) Linear discriminant analysis (LDA) scores of taxa identified by LEfSe (LDA > 2). (D) Boxplots showing relative abundance of representative differential genera. (E) Cladogram illustrating the phylogenetic relationships of taxa with significant differences between groups.
LEfSe analysis (LDA score > 2) identified taxa that discriminated between the two groups (Figure 7C). Genera such as Alloprevotella and Peptostreptococcus were enriched in the non-CRD group, while Fusobacterium and Leptotrichia were significantly associated with CRD.
Boxplot analysis confirmed significant differences in the relative abundances of the identified genera (Figure 7D). Alloprevotella showed higher relative abundance in the non-CRD group (p < 0.05), whereas Fusobacterium and Leptotrichia were markedly enriched in the CRD group (p < 0.01).
Phylogenetic analysis further demonstrated that taxa enriched in the non-CRD group clustered into coherent modules, including genera such as Alloprevotella, Megasphaera, Parvimonas, and Peptostreptococcus (Figure 7E). In contrast, CRD-enriched taxa, represented by Escherichia–Shigella, were relatively isolated within the phylogenetic tree.
3.6 Sensitivity analysis in the NHANES
In the NHANES cohort, sensitivity analyses excluding participants who reported recent antibiotic use, with HEI-2015 additionally included as a covariate, showed results consistent with the main analyses (Supplementary Figures S1–S4 and Supplementary Table S2).
4 Discussion
Across both the population-based NHANES analysis and our hospital cohort, higher oral microbial alpha diversity was consistently associated with lower odds of CRD. This association remained significant after adjusting for multiple potential confounders, and the relationship appeared linear across different diversity metrics. Beta diversity analyses further revealed clear separation between CRD and non-CRD participants, suggesting global alterations in community composition. At the genus level, specific taxa were differentially enriched in CRD versus non-CRD individuals, reflecting disease-related microbial dysbiosis. Collectively, the concordant findings from two independent cohorts strengthen the evidence for a robust association between reduced oral microbial diversity and increased CRD risk.
In the NHANES analysis, weighted logistic regression models showed that higher oral α-diversity was associated with lower CRD risk. Each standard deviation increase in Observed ASVs and Faith’s PD corresponded to 19 and 17% reductions in CRD risk, respectively, while the Shannon index was only weakly associated with reduced risk. These findings indicate that greater microbial richness and evenness might be linked to resilience against chronic respiratory inflammation. Importantly, the inverse associations remained robust in subgroup analyses, particularly among non-Hispanic White and Mexican American populations. This observation is consistent with prior reports of racial differences in oral microbial diversity (22–27). Regarding β-diversity, both PCoA and PERMANOVA analyses revealed significant structural differences in the oral microbiome between CRD and non-CRD individuals using Bray–Curtis, weighted UniFrac, and unweighted UniFrac distances (all p < 0.01), underscoring the presence of microbial dysbiosis (28–31).
At the genus level, Rothia and Veillonella were enriched in CRD cases, whereas Haemophilus, Prevotella, and Neisseria were more common in non-CRD participants. These distribution patterns align with prior studies showing that common oral genera, including Veillonella, Prevotella, Fusobacterium, and Actinomyces, can migrate to the lower respiratory tract and potentially influence respiratory health (24, 32, 33), which is consistent with the hypothesis that the oral cavity serves as a microbial reservoir for respiratory disease. Consistent with NHANES, the hospital cohort also showed that Fusobacterium, Leptotrichia, and Rothia were enriched in CRD patients, whereas Prevotella, Haemophilus, Neisseria, and Alloprevotella were more abundant in non-CRD individuals. These overlapping patterns reinforce the robustness of these taxa as potential microbial markers associated with CRD. However, the LEfSe analysis yielded partially different results. Only two genera in the non-CRD group—Alloprevotella and Peptostreptococcus—overlapped with the NHANES findings. This discrepancy may reflect methodological differences: NHANES emphasized abundant and clearly differentially expressed genera, whereas LEfSe integrates features across multiple taxonomic ranks (phylum, class, order, family, and genus), thereby attenuating genus-level signals.
Additionally, the random forest model showed moderate predictive performance. This likely reflects the study population, which is drawn from a general, mostly healthy cohort. In such population-based settings, differences between CRD and non-CRD individuals are subtler than in hospital cohorts or case–control studies, making prediction inherently more challenging. Despite this, the model still identifies relevant microbial and clinical features, supporting the epidemiological relevance of the findings. Moreover, predictive performance could potentially be improved in future studies by incorporating additional features, such as lifestyle factors or multi-omics data.
Nevertheless, both methods consistently highlighted Alloprevotella and Peptostreptococcus as being depleted in the CRD group across cohorts. This convergence suggests their possible involvement in respiratory health rather than a definitive protective role. The oral microbiome serves as an important reservoir of respiratory pathogens. Bacteria from dental plaque, periodontal pockets, and saliva can be aspirated into the lower respiratory tract, where they may trigger or exacerbate conditions such as aspiration pneumonia and COPD. The pathogenic mechanisms involve immune modulation, particularly the balance between Th1 and Th2 responses. Oral pathogens can stimulate airway epithelial cells to produce pro-inflammatory cytokines (e.g., TNF-α, IL-1β, and IL-6) and regulate mucus secretion (34, 35). In addition, microbial enzymes and cellular products from the oral microbiome can disrupt the respiratory mucosal barrier, facilitating pathogen colonization and increasing the risk of infection (36, 37). Emerging evidence also highlights complex interactions between commensal oral microbes and respiratory pathogens, with some species enhancing virulence and others producing inhibitory substances that limit pathogen growth (38, 39).
In this context, Alloprevotella may contribute to mucosal barrier maintenance and inflammation modulation through the production of short-chain fatty acids (SCFAs), such as acetate and succinate. SCFAs have also been shown to regulate Th17-mediated pathways, which are strongly implicated in COPD and asthma (40). Similarly, Peptostreptococcus may regulate microbial community balance and modulate host immunity via metabolic cross-feeding interactions with other commensals. By stabilizing the oral ecosystem, it may indirectly limit the expansion of pathobionts such as Rothia and Fusobacterium, which were enriched in CRD patients (41). Their reduction may reflect ecological shifts that compromise mucosal defense and promote inflammation, but causality remains uncertain, ultimately increasing susceptibility to chronic respiratory inflammation. These mechanistic explanations are speculative and should be further examined in longitudinal and experimental studies. Moreover, recent evidence suggests that the interaction between the microbiome and host vitamin D metabolism plays an important role in modulating immune responses, including autoimmunity and chronic inflammation. Vitamin D influences both innate and adaptive immunity by regulating antimicrobial peptide expression and promoting immune tolerance. Alterations in oral microbial composition could therefore affect vitamin D–mediated mucosal immunity along the oral–lung axis. Conversely, vitamin D deficiency has been associated with dysbiosis and impaired epithelial barrier function, which may exacerbate respiratory inflammation. These findings, as discussed by Murdaca et al. (42–44), highlight the complex bidirectional interplay between vitamin D signaling and the microbiome in shaping systemic and respiratory immune responses.
This study, based on a nationally representative NHANES sample and an independent hospital cohort, is the first to systematically assess the association between the oral microbiome and CRD, identifying key genera and exploring potential biological mechanisms. The use of two complementary cohorts enhances the robustness and generalizability of the findings.
However, several limitations should be noted. First, both cohorts were cross-sectional in nature, which limits causal inference and precludes assessment of temporal changes in the oral microbiome during disease progression. Second, although the NHANES sample offers broad population representativeness, it was restricted to only two cycles, while the hospital cohort—although valuable for validation—had a more limited sample size and may be subject to selection bias. Third, although mouthwash samples are widely used in oral microbiome studies and can capture overall microbial diversity (45, 46), they may not fully reflect microbial communities in specific oral niches such as subgingival or tongue dorsum areas. Finally, due to the lack of lung microbiome data, this study could not directly validate the biological pathways linking the oral and pulmonary systems (“oral–lung axis”) in CRD, and mechanistic interpretations remain largely based on prior evidence.
In addition, the moderate predictive performance of the random forest model and the reliance on LEfSe for differential abundance analysis should be acknowledged when interpreting the findings. LEfSe was chosen due to its wide use in oral microbiome studies and its suitability for validating NHANES findings in our hospital cohort, whereas methods such as DESeq2 or ANCOM may be limited by the smaller sample size. Future studies could enhance prediction by incorporating additional variables, complementary differential abundance methods, or multi-omics data.
5 Conclusion
In this study, using both a nationally representative NHANES sample and an independent hospital cohort, we observed a consistent association between the oral microbiome and CRD. CRD patients exhibited reduced α-diversity, distinct β-diversity patterns, and differential enrichment of specific bacterial genera. Notably, the depletion of Alloprevotella and Peptostreptococcus was consistent across cohorts, highlighting robust microbial signatures associated with CRD and supporting the relevance of the oral microbiome and the “oral–lung axis” in respiratory health.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary material.
Ethics statement
The studies involving humans were approved by Ethics Committee of Suining Central Hospital, Suining, Sichuan Province, China. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
BJ: Software, Validation, Visualization, Writing – original draft, Writing – review & editing. XW: Investigation, Visualization, Writing – review & editing. GH: Software, Writing – original draft. QW: Validation, Writing – original draft. LG: Validation, Writing – original draft. JR: Validation, Writing – original draft. GL: Resources, Writing – original draft. XZ: Resources, Writing – original draft. SY: Funding acquisition, Visualization, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This research was funded by the Sichuan Provincial Department of Science and Technology, grant number 2022SNZY001; the Sichuan Provincial Health and Health Commission, grant number 2022JDXM021; the Chinese Stomatological Association of Western Stomatology, grant number CSA-W2023-03; and the Suining Municipal Health and Science Technology Program-Guiding Project, grant number 25ZDJB08. The APC was funded by internal resources of the corresponding author’s institution.
Acknowledgments
We acknowledge the National Health and Nutrition Examination Survey (NHANES) program for providing access to invaluable public data resources. We also thank the participants and clinical staff involved in the hospital cohort for their contributions to this study.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2025.1696041/full#supplementary-material
SUPPLEMENTARY FIGURE S1 | Dose–response relationships between oral microbial alpha diversity indices and CRD risk assessed by restricted cubic spline models. (A) Observed ASVs; (B) Faith’s phylogenetic diversity; (C) Shannon index; (D) Simpson index.
SUPPLEMENTARY FIGURE S2 | Subgroup analyses of associations between alpha diversity indices and CRD risk. (A) Observed ASVs; (B) Faith’s phylogenetic diversity.
SUPPLEMENTARY FIGURE S3 | Principal coordinates analysis (PCoA) plots of beta diversity metrics comparing oral microbial community structure between CRD and non-CRD groups. (A) Bray–Curtis dissimilarity; (B) unweighted UniFrac distance; (C) weighted UniFrac distance.
SUPPLEMENTARY FIGURE S4 | Genus-level differential abundance and predictive modeling of CRD. (A) Boxplots of top 10 differentially abundant genera between CRD and non-CRD groups. (B) Variable importance plot from random forest classification model. (C) Receiver operating characteristic (ROC) curve assessing model performance.
References
1. Kilian, M, Chapple, IL, Hannig, M, Marsh, PD, Meuric, V, Pedersen, AM, et al. The oral microbiome – an update for oral healthcare professionals. Br Dent J. (2016) 221:657–66. doi: 10.1038/sj.bdj.2016.865
2. Dewhirst, FE, Chen, T, Izard, J, Paster, BJ, Tanner, AC, Yu, WH, et al. The human oral microbiome. J Bacteriol. (2010) 192:5002–17. doi: 10.1128/JB.00542-10
3. Zhang, Y, Wang, X, Li, H, Ni, C, Du, Z, and Yan, F. Human oral microbiota and its modulation for oral health. Biomed Pharmacother. (2018) 99:883–93. doi: 10.1016/j.biopha.2018.01.146
4. Jhajharia, K, Parolia, A, Shetty, KV, and Mehta, LK. Biofilm in endodontics: a review. J Int Soc Prev Community Dent. (2015) 5:1–12. doi: 10.4103/2231-0762.151956
5. Gao, L, Xu, T, Huang, G, Jiang, S, Gu, Y, and Chen, F. Oral microbiomes: more and more importance in oral cavity and whole body. Protein Cell. (2018) 9:488–500. doi: 10.1007/s13238-018-0548-1
6. Genco, RJ, Grossi, SG, Ho, A, Nishimura, F, and Murayama, Y. A proposed model linking inflammation to obesity, diabetes, and periodontal infections. J Periodontol. (2005) 76:2075–84. doi: 10.1902/jop.2005.76.11-S.2075
7. Ogrendik, M. Rheumatoid arthritis is linked to oral bacteria: etiological association. Mod Rheumatol. (2009) 19:453–6. doi: 10.1007/s10165-009-0194-9
8. Parahitiyawa, NB, Jin, LJ, Leung, WK, Yam, WC, and Samaranayake, LP. Microbiology of odontogenic bacteremia: beyond endocarditis. Clin Microbiol Rev. (2009) 22:46–64. doi: 10.1128/CMR.00028-08
9. Seedorf, H, Griffin, NW, Ridaura, VK, Reyes, A, Cheng, J, Rey, FE, et al. Bacteria from diverse habitats colonize and compete in the mouse gut. Cell. (2014) 159:253–66. doi: 10.1016/j.cell.2014.09.008
10. Chhibber-Goel, J, Singhal, V, Bhowmik, D, Vivek, R, Parakh, N, Bhargava, B, et al. Linkages between oral commensal bacteria and atherosclerotic plaques in coronary artery disease patients. NPJ Biofilm Microb. (2016) 2:7. doi: 10.1038/s41522-016-0009-7
11. Budden, KF, Gellatly, SL, Wood, DL, Cooper, MA, Morrison, M, Hugenholtz, P, et al. Emerging pathogenic links between microbiota and the gut-lung axis. Nat Rev Microbiol. (2017) 15:55–63. doi: 10.1038/nrmicro.2016.142
12. Wei, Y, Lu, X, and Liu, C. Gut microbiota and chronic obstructive pulmonary disease: a mendelian randomization study. Front Microbiol. (2023) 14:1196751. doi: 10.3389/fmicb.2023.1196751
13. Prevalence and attributable health burden of chronic respiratory diseases, 1990-2017: a systematic analysis for the global burden of disease study 2017. Lancet Respir Med. (2020) 8:585–96. doi: 10.1016/S2213-2600(20)30105-3
14. Adeloye, D, Song, P, Zhu, Y, Campbell, H, Sheikh, A, and Rudan, I. Global, regional, and national prevalence of, and risk factors for, chronic obstructive pulmonary disease (COPD) in 2019: a systematic review and modelling analysis. Lancet Respir Med. (2022) 10:447–58. doi: 10.1016/S2213-2600(21)00511-7
15. Fallahzadeh, A, Sharifnejad Tehrani, Y, Sheikhy, A, Ghamari, SH, Mohammadi, E, Saeedi Moghaddam, S, et al. The burden of chronic respiratory disease and attributable risk factors in North Africa and Middle East: findings from global burden of disease study (GBD) 2019. Respir Res. (2022) 23:268. doi: 10.1186/s12931-022-02187-3
16. Prevention CfDCa. National Health and nutrition examination survey (2025). Available online at: https://www.cdc.gov/nchs/nhanes/?CDC_AAref_Val=https://www.cdc.gov/nchs/nhanes/index.htm (Accessed June 10, 2025).
17. von Elm, E, Altman, DG, Egger, M, Pocock, SJ, Gøtzsche, PC, and Vandenbroucke, JP. The strengthening the reporting of observational studies in epidemiology (STROBE) statement: guidelines for reporting observational studies. Lancet. (2007) 370:1453–7. doi: 10.1016/S0140-6736(07)61602-X
18. Bolyen, E, Rideout, JR, Dillon, MR, Bokulich, NA, Abnet, CC, Al-Ghalith, GA, et al. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat Biotechnol. (2019) 37:852–7. doi: 10.1038/s41587-019-0209-9
19. Lozupone, C, Lladser, ME, Knights, D, Stombaugh, J, and Knight, R. UniFrac: an effective distance metric for microbial community comparison. ISME J. (2011) 5:169–72. doi: 10.1038/ismej.2010.133
20. Wang, X, Yan, X, Zhang, J, Pan, S, Li, R, Cheng, L, et al. Associations of healthy eating patterns with biological aging: national health and nutrition examination survey (NHANES) 1999-2018. Nutr J. (2024) 23:112. doi: 10.1186/s12937-024-01017-0
21. Ortigara, GB, Mário Ferreira, TG, Tatsch, KF, Romito, GA, Ardenghi, TM, Sfreddo, CS, et al. The 2018 EFP/AAP periodontitis case classification demonstrates high agreement with the 2012 CDC/AAP criteria. J Clin Periodontol. (2021) 48:886–95. doi: 10.1111/jcpe.13462
22. Dzidic, M, Abrahamsson, TR, Artacho, A, Collado, MC, Mira, A, and Jenmalm, MC. Oral microbiota maturation during the first 7 years of life in relation to allergy development. Allergy. (2018) 73:2000–11. doi: 10.1111/all.13449
23. Leitao Filho, FS, Alotaibi, NM, Ngan, D, Tam, S, Yang, J, Hollander, Z, et al. Sputum microbiome is associated with 1-year mortality after chronic obstructive pulmonary disease hospitalizations. Am J Respir Crit Care Med. (2019) 199:1205–13. doi: 10.1164/rccm.201806-1135OC
24. Pragman, AA, Knutson, KA, Gould, TJ, Isaacson, RE, Reilly, CS, and Wendt, CH. Chronic obstructive pulmonary disease upper airway microbiota alpha diversity is associated with exacerbation phenotype: a case-control observational study. Respir Res. (2019) 20:114. doi: 10.1186/s12931-019-1080-4
25. Kou, Z, Liu, K, Qiao, Z, Wang, Y, Li, Y, Li, Y, et al. The alterations of oral, airway and intestine microbiota in chronic obstructive pulmonary disease: a systematic review and meta-analysis. Front Immunol. (2024) 15:1407439. doi: 10.3389/fimmu.2024.1407439
26. Pragman, AA, Hodgson, SW, Wu, T, Zank, A, Reilly, CS, and Wendt, CH. Sputum microbiome α-diversity is a key feature of the COPD frequent exacerbator phenotype. ERJ Open Res. (2024) 10:00595–2023. doi: 10.1183/23120541.00595-2023
27. Chaturvedi, AK, Vogtmann, E, Shi, J, Yano, Y, Blaser, MJ, Bokulich, NA, et al. Oral microbiome profile of the US population. JAMA Netw Open. (2025) 8:e258283. doi: 10.1001/jamanetworkopen.2025.8283
28. Zhou, Y, Mihindukulasuriya, KA, Gao, H, La Rosa, PS, Wylie, KM, Martin, JC, et al. Exploration of bacterial community classes in major human habitats. Genome Biol. (2014) 15:R66. doi: 10.1186/gb-2014-15-5-r66
29. Wang, Z, Bafadhel, M, Haldar, K, Spivak, A, Mayhew, D, Miller, BE, et al. Lung microbiome dynamics in COPD exacerbations. Eur Respir J. (2016) 47:1082–92. doi: 10.1183/13993003.01406-2015
30. Wu, BG, Sulaiman, I, Tsay, JJ, Perez, L, Franca, B, Li, Y, et al. Episodic aspiration with oral commensals induces a MyD88-dependent, pulmonary T-helper cell type 17 response that mitigates susceptibility to Streptococcus pneumoniae. Am J Respir Crit Care Med. (2021) 203:1099–111. doi: 10.1164/rccm.202005-1596OC
31. Zhang, X, Li, X, Xu, H, Fu, Z, Wang, F, Huang, W, et al. Changes in the oral and nasal microbiota in pediatric obstructive sleep apnea. J Oral Microbiol. (2023) 15:2182571. doi: 10.1080/20002297.2023.2182571
32. Tan, L, Wang, H, Li, C, and Pan, Y. 16S rDNA-based metagenomic analysis of dental plaque and lung bacteria in patients with severe acute exacerbations of chronic obstructive pulmonary disease. J Periodontal Res. (2014) 49:760–9. doi: 10.1111/jre.12159
33. Wu, X, Chen, J, Xu, M, Zhu, D, Wang, X, Chen, Y, et al. 16S rDNA analysis of periodontal plaque in chronic obstructive pulmonary disease and periodontitis patients. J Oral Microbiol. (2017) 9:1324725. doi: 10.1080/20002297.2017.1324725
34. Petelin, M, Naruishi, K, Shiomi, N, Mineshiba, J, Arai, H, Nishimura, F, et al. Systemic up-regulation of sTNFR2 and IL-6 in Porphyromonas gingivalis pneumonia in mice. Exp Mol Pathol. (2004) 76:76–81. doi: 10.1016/j.yexmp.2003.09.002
35. Schaub, B, Lauener, R, and von Mutius, E. The many faces of the hygiene hypothesis. J Allergy Clin Immunol. (2006) 117:969–77; quiz 78. doi: 10.1016/j.jaci.2006.03.003
36. Gomes-Filho, IS, Passos, JS, and Seixas da Cruz, S. Respiratory disease and the role of oral bacteria. J Oral Microbiol. (2010):2. doi: 10.3402/jom.v2i0.5811
37. Pathak, JL, Yan, Y, Zhang, Q, Wang, L, and Ge, L. The role of oral microbiome in respiratory health and diseases. Respir Med. (2021) 185:106475. doi: 10.1016/j.rmed.2021.106475
38. Whiley, RA, Fleming, EV, Makhija, R, and Waite, RD. Environment and colonisation sequence are key parameters driving cooperation and competition between Pseudomonas aeruginosa cystic fibrosis strains and oral commensal streptococci. PLoS One. (2015) 10:e0115513. doi: 10.1371/journal.pone.0115513
39. Baty, JJ, Stoner, SN, McDaniel, MS, Huffines, JT, Edmonds, SE, Evans, NJ, et al. An oral commensal attenuates Pseudomonas aeruginosa-induced airway inflammation and modulates nitrite flux in respiratory epithelium. Microbiol Spectr. (2023) 11:e0219823. doi: 10.1128/spectrum.02198-23
40. Ma, J, Piao, X, Mahfuz, S, Long, S, and Wang, J. The interaction among gut microbes, the intestinal barrier and short chain fatty acids. Anim Nutr. (2022) 9:159–74. doi: 10.1016/j.aninu.2021.09.012
41. Hirmas, B, Gasaly, N, Orellana, G, Vega-Sagardía, M, Saa, P, Gotteland, M, et al. Metabolic Modeling and bidirectional culturing of two gut microbes reveal cross-feeding interactions and protective effects on intestinal cells. mSystems. (2022) 7:e0064622. doi: 10.1128/msystems.00646-22
42. Murdaca, G, Gerosa, A, Paladin, F, Petrocchi, L, Banchero, S, and Gangemi, S. Vitamin D and microbiota: is there a link with allergies? Int J Mol Sci. (2021) 22:4288. doi: 10.3390/ijms22084288
43. Murdaca, G, Greco, M, Borro, M, and Gangemi, S. Hygiene hypothesis and autoimmune diseases: a narrative review of clinical evidences and mechanisms. Autoimmun Rev. (2021) 20:102845. doi: 10.1016/j.autrev.2021.102845
44. Murdaca, G, Tagliafico, L, Page, E, Paladin, F, and Gangemi, S. Gender differences in the interplay between vitamin D and microbiota in allergic and autoimmune diseases. Biomedicine. (2024) 12:1023. doi: 10.3390/biomedicines12051023
45. Willis, JR, González-Torres, P, Pittis, AA, Bejarano, LA, Cozzuto, L, Andreu-Somavilla, N, et al. Citizen science charts two major "stomatotypes" in the oral microbiome of adolescents and reveals links with habits and drinking water composition. Microbiome. (2018) 6:218. doi: 10.1186/s40168-018-0592-3
Keywords: oral microbiome, chronic respiratory disease, alpha diversity, beta diversity, NHANES, linear discriminant analysis effect size (LEfSe), 16s ribosomal RNA (rRNA) sequencing
Citation: Jia B, Wu X, He G, Wang Q, Guan L, Ren J, Li G, Zheng X and Yang S (2025) Oral microbiome dysbiosis is associated with chronic respiratory diseases: evidence from a population-based study and a hospital cohort. Front. Public Health. 13:1696041. doi: 10.3389/fpubh.2025.1696041
Edited by:
Roberto Giovanni Carbone, University of Genoa, ItalyReviewed by:
Giuseppe Murdaca, University of Genoa, ItalyRajesh Shigdel, University of Bergen, Norway
Copyright © 2025 Jia, Wu, He, Wang, Guan, Ren, Li, Zheng and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Sen Yang, eXMxMzg4MDQzNTQxM0AxNjMuY29t
†These authors have contributed equally to this work
Gaoyan He2†