Skip to main content

ORIGINAL RESEARCH article

Front. Microbiol., 10 December 2021
Sec. Microbial Symbioses
This article is part of the Research Topic The Oral Microbiome is a Key factor in Oral and Systemic Health View all 12 articles

Can the Salivary Microbiome Predict Cardiovascular Diseases? Lessons Learned From the Qatari Population

  • 1Mother and Child Health Department, Sidra Medicine, Doha, Qatar
  • 2Clinical Research Center Department, Sidra Medicine, Doha, Qatar

Background: Many studies have linked dysbiosis of the gut microbiome to the development of cardiovascular diseases (CVD). However, studies assessing the association between the salivary microbiome and CVD risk on a large cohort remain sparse. This study aims to identify whether a predictive salivary microbiome signature is associated with a high risk of developing CVD in the Qatari population.

Methods: Saliva samples from 2,974 Qatar Genome Project (QGP) participants were collected from Qatar Biobank (QBB). Based on the CVD score, subjects were classified into low-risk (LR < 10) (n = 2491), moderate-risk (MR = 10–20) (n = 320) and high-risk (HR > 30) (n = 163). To assess the salivary microbiome (SM) composition, 16S-rDNA libraries were sequenced and analyzed using QIIME-pipeline. Machine Learning (ML) strategies were used to identify SM-based predictors of CVD risk.

Results: Firmicutes and Bacteroidetes were the predominant phyla among all the subjects included. Linear Discriminant Analysis Effect Size (LEfSe) analysis revealed that Clostridiaceae and Capnocytophaga were the most significantly abundant genera in the LR group, while Lactobacillus and Rothia were significantly abundant in the HR group. ML based prediction models revealed that Desulfobulbus, Prevotella, and Tissierellaceae were the common predictors of increased risk to CVD.

Conclusion: This study identified significant differences in the SM composition in HR and LR CVD subjects. This is the first study to apply ML-based prediction modeling using the SM to predict CVD in an Arab population. More studies are required to better understand the mechanisms of how those microbes contribute to CVD.

GRAPHICAL ABSTRACT
www.frontiersin.org

Graphical Abstract. Salivary microbiome marker selection: CVD risk score using ML model in the Qatari population.

Introduction

Non-communicable Diseases (NCDs) are the leading cause of death globally (Allen et al., 2017). According to the World Health Organization [WHO] (2013) report, the global burden of non-communicable diseases (NCDs) raised to 82% by 2020. The most common NCDs are cardiovascular diseases (CVD), cancer, respiratory disorders, and diabetes (Balakumar et al., 2016).

CVD comprises coronary heart disease, heart failure, stroke, rheumatic heart disease, and cardiomyopathies among others (Caldwell et al., 2019). CVD is the leading cause of death, claiming about 17.9 million deaths annually and increasing worldwide (Lear et al., 2017; Al-Shamsi et al., 2019).

In Qatar, NCDs are the leading cause of death for the past 10 years (Al-Kaabi and Atherton, 2015) with the CVD mortality rates reaching 8.3 per 100000 MOPH (2020). In addition, the 2006-World-Health-Survey revealed that the Qatari population suffers from various predisposing factors to CVD such as obesity (28.8%), high cholesterol (24.7%), diabetes (16.7%), and hypertension (14.4%) Haj Bakri and Al-Thani (2012).

In the past decade, advances in the multi-omics technologies have enhanced our chances to discover novel biomarkers (Olivier et al., 2019). Blood-based biomarkers are considered invasive, there is an urgent need to use non-invasive samples such as saliva to develop new disease biomarkers. In addition, the advance in Next-Generation Sequencing platforms (NGS) has enabled us to assess the human microbiome with an unprecedented resolution and depth. Using the human microbiome composition to identify disease biomarkers is the next chapter of precision medicine (Morganti et al., 2019; Zhong et al., 2021).

The human microbiome (HM) comprises trillions of bacteria, viruses, protozoa, and fungi that reside in and on our body surfaces (Amon and Sanderson, 2017). The HM is complex, dynamic, ubiquitous, and shows striking variability from one individual to another and between various body sites (Ursell et al., 2012; Aagaard et al., 2013). The HM has a wide array of roles ranging from digestion, protection from pathogens, immune-regulation, and metabolites production (Marchesi et al., 2016). The oral cavity harbors more than 700 diverse microorganisms and is considered the second most diverse site after the gut (Deo and Deshmukh, 2019). In healthy subjects, the core salivary microbiome (SM) includes genera Streptococcus, Veillonella, Neisseria, and Actinomyces (Zaura et al., 2009, 2014). In a large-scale population-based Japanese study, the authors showed that the SM is dominated by Streptococcus, Neisseria, Rothia, Prevotella, Actinomyces, Granulicatella, Haemophilus, and Porphyromonas (Yamashita and Takeshita, 2017). Our previous study aiming to characterize the salivary microbiome composition in the Qatari population (Murugesan et al., 2020) showed that Bacteroidetes, Firmicutes, Actinobacteria, and Proteobacteria were the common phyla, with Bacteroidetes being the most predominant (Murugesan et al., 2020). Dysbiosis in the SM is associated with oral diseases (Mashima et al., 2017; Davis et al., 2020) and systemic diseases like obesity, diabetes, and CVD (Wade, 2013; Kholy et al., 2015; Cortez et al., 2019).

Advances in Machine Learning (ML) technologies, an essential branch of artificial intelligence, have enabled researchers to build prediction biomarker models for various diseases such as arthritis, diabetes, and inflammatory bowel disease (Jamshidi et al., 2019; Aryal et al., 2020; Kohli et al., 2020). On the other hand, few studies have trained ML models using the gut microbiome profiles to identify predictors of atherosclerosis and CVD (Aryal et al., 2020; Liu et al., 2020) and none have used the SM so far.

This study aims to identify whether a predictive salivary microbiome signature is associated with a high risk of developing CVD in the Qatari population. We integrated the phenotypic, clinical, and microbiome data, and we identified SM-biomarkers associated with an increased risk to CVD using ML models.

Materials and Methods

Ethics Statement

The study was approved by the Institutional Review Board (IRB) of Sidra Medicine under (protocol #1510001907) and by Qatar Biobank (QBB) (protocol #E/2018/QBB-RES-ACC-0063/0022. All study participants signed an informed consent before sample collection. All experiments were performed under the approved guidelines.

Clinical Data

We collected de-identified saliva samples, phenotypic and clinical data from a total of 2,974 participants enrolled in the Qatar genome project (QGP). QGP included any adult who is either a Qatari national or long-term resident (lived in Qatar for at least 15 years) and can contribute to QBB around 3 h of their time for answering all the questionnaires, complete measurements, imaging and fitness assessments, in addition to providing all the samples required including saliva. In the pilot phase, the cohort consisted of 1,432 males and 1,542 females (Table 1). Each subject’s anthropometric and blood parameters were established by analyzing body mass index (BMI), total protein content, hemoglobin, albumin, ferritin, calcium, iron, vitamin-D, high or low-density lipoprotein cholesterol (HDL, LDL), triglycerides, and glucose levels.

TABLE 1
www.frontiersin.org

Table 1. Clinical parameters of the study cohort.

Calculation of Cardiovascular Diseases Risk Score

Cox proportional-hazards regression has been used to evaluate the risk of developing CVD over 10-years. The CVD-risk score for 2974 patients was estimated using sex-specific multivariable factors consisting of age, total-Cholesterol, HDL, systolic blood pressure (BP), hypertension treatment, smoking, and diabetes status (HbA1C ≥ 6.5%, and participants who confirmed having diabetes). D’Agostino et al. (2013) adapted the regression coefficient for the functions from earlier analysis. This method uses the following equation:

p^= 1-S0(t)exp(i=1pβixi-i=1pβix¯i)

Where S0(t), baseline survival at follow-up time t (here t = 10 years); βi, estimated regression coefficient (log hazard ratio that is measured for all risk functions and sex-specific); xi, log-transformed value of the ith risk factor; i, corresponding mean, p, number of risk factors.

Sample Collection

Qatar Biobank collected saliva samples according to standard procedure. They organized to collect 5 mL of spontaneous, whole, unstimulated saliva into a 50 mL sterile DNA-free Falcon tube from each participant by spitting. The samples were divided into 0.4 mL aliquots and stored at −80 C until further analysis. The aliquots were received from QBB for total salivary DNA extraction.

DNA Extraction and 16S rRNA Gene Sequencing

The total salivary DNA was extracted using automated QIAsymphony protocol (Qiagen, Hilden, Germany), following the Manufacturer’s instructions. DNA purity was evaluated by the A260/A280 ratio using a NanoDrop 7000 Spectrophotometer (Thermo Fisher Scientific, Waltham, MA, United States), and the DNA integrity was checked on a 1% agarose by gel electrophoresis.

The V1–V3 regions of the 16S rRNA gene were amplified using Illumina NextEra XT library preparation Kit (FC-131-1002). Step 1 PCR is performed using 10 ng of template DNA for 50 μL PCR reaction using 2X Phusion Hot Start Ready mix (Thermo Fisher Scientific™). The following thermal cycling conditions were used: 5 min of initial denaturation at 94°C; 25 cycles of denaturation at 94 C for 30 s, annealing at 55°C for 30 s, extension at 72 C for 30 s; and a final extension at 72 C for 5 min. According to the Manufacturer’s instructions, the amplified PCR products of approximately 550 bp in size was purified using AgenCourt AMPure XP magnetic beads (Beckman Coulter). Purified PCR products of STEP 1 was used as template for amplification of STEP 2 NextEra index PCR using thermocycling conditions of 5 min of initial denaturation at 94°C; 8 cycles of denaturation at 94 C for 30 s, annealing at 55°C for 30 s, extension at 72 C for 30 s; and a final extension at 72 C for 5 min. These PCR products were purified using AgenCourt AMPure XP magnetic beads and purified products were pooled in equimolar concentrations. High-throughput sequencing was performed using an Illumina MiSeq 2 × 300 platform following the manufacturer’s instructions.

16S rRNA Sequencing Data Analysis

Demultiplexed sequence data were revised for quality control using FastQC (Andrews, 2010). PEAR tool was used to merge both forward and reverse sequence reads of respective samples (Zhang et al., 2014), and sequence reads of quality score <20 were discarded. All merged reads were trimmed to 160 bp > Reads < 500 bp using the Trimmomatic tool (Bolger et al., 2014). Trimmed FASTQ files were converted into FASTA files. Demultiplexed FASTA files were analyzed using Quantitative Insights Into Microbial Ecology (QIIME) v1.9.0 pipeline (Caporaso et al., 2010; Murugesan et al., 2020). Operational taxonomic units (OTUs) were generated by aligning against the Greengenes database (Version: 13_8) with a confidence threshold of 97% (DeSantis et al., 2006).

Statistical Taxonomic and Diversity Analyses

Linear Discriminant Analysis Effect Size (LEfSe) (Segata et al., 2011) was used to find differentially abundant taxa between the studied categories. Alpha diversity measures including Chao1, Observed, Shannon, and Simpson indices were calculated with R-phyloseq package (McMurdie and Holmes, 2013). The alpha diversity statistical significance was calculated using Mann–Whitney test through Minitab-17 (2010). P-values less than 0.05 were considered statistically significant. Differences in the beta diversity were presented as principal coordinate analysis using QIIME. Analysis of similarities (ANOSIM) was used to calculate the distance matrix difference between the categories using Bray-Curtis metric (Caporaso et al., 2010).

Supervised Machine Learning Modeling

We applied four statistical ML methods for regularization and feature selection based on penalized least squares (Figure 1B). The methods are the Least Absolute Shrinkage and Selection Operator (Lasso), Smoothly Clipped Absolute Deviation Penalty (Zou and Li, 2008) (SCAD), Elastic Net (Zou and Hastie, 2005) (ENet), and Minimax concave penalty (Zhang, 2010) (MCP). The methods differ by the mathematical properties of the corresponding penalties: Lasso and ENet use convex penalties, while MCP and Scad use concave penalties. We applied two transformations to the abundance-counts as in: a binary transformation (Binary), and a variance-stability transformation (Arcsin), while the CVD-score outcome was log-transformed (Dong et al., 2020). Analyses were performed using the R-packages glmnet (Hastie and Qian, 2014) and ncvreg (Breheny, 2020). The graphics were generated using the R-packages ggplot2, RVenn, and ggpubr (Wickham, 2011; Akyol, 2019; Kassambara, 2020). We randomly split the data 50-times into a training set (80%) on which the predictive-models were build and a test-set (20%) on which we tested the performance of each model. Optimal tuning parameters were chosen via 10-fold cross-validation.

FIGURE 1
www.frontiersin.org

Figure 1. Overall study design from participant recruitment to SM-based CVD marker selection. (A) The study workflow. (B) Strategies applied in Supervised machine learning (ML) to select pertinents.

Results

Demographic and Clinical Parameters of the Study Population

The study population was composed of 2,974 Qatari participants. The cohort was classified into three CVD groups as low-risk (LR) (CVD score < 10), moderate-risk (MR) (CVD score: 10–20), and high-risk (HR) (>20), as described in the section “Materials and Methods.” As a result, 2491 participants were LR, 320 were MR, and 163 were HR (Table 1). The average participant’s age in the HR group (55.87 ± 8.14 years) was significantly higher than those in the MR (50.89 ± 7.15 years) and LR (35.11 ± 10.22 years) groups (Table 1). Moreover, the BMI was significantly higher in the HR group than in the MR and LR groups (Table 1). In addition, among the blood parameters tested, Alkaline phosphatase, Calcium, Total-Cholesterol, LDL, Creatinine, Ferritin, Fibrinogen, Folate, Glucose, HbA1C, Urea, and Triglycerides were significantly higher in the HR group (Table 1).

The Salivary Microbiome Composition Reveals Signatures for Cardiovascular Diseases

After stratifying the study cohort based on the CVD risk score, we assessed the SM composition in all subjects. Then, we compared the compositional changes between different study groups. A diagram that summarizes the study design is shown in Figures 1A,B. The microbial sequence data generated from all the participants revealed 22 bacterial phyla, 46 classes, 87 orders, 173 families, and 390 genera. Bacteroidetes, Firmicutes, Actinobacteria, and Proteobacteria were the most abundant phyla observed in the saliva samples collected from the Qatari subjects, covering approximately 90% of total microbial abundance (Figure 2A). In addition, our data showed that Streptococcus, Prevotella, Porphyromonas, Granulicatella, and Veillonella represent the salivary core microbiome members at the genus level (Figure 2B).

FIGURE 2
www.frontiersin.org

Figure 2. The salivary microbiome composition of CVD risk groups. Y-axis shows % of relative abundance of the microbiome; X-axis indicates the microbial abundance in LR, MR, and HR groups; (A) phylum level; (B) genus level.

Differential Salivary Microbial Taxa Between the High-Risk and Low-Risk-Cardiovascular Diseases Groups

After assessing the study cohort’s SM, LEfSe analysis compared the salivary microbiome compositions in the LR, MR, and HR (Figure 3). Our data indicated that Capnocytophaga and Clostridiaceae were significantly abundant in the LR group compared to the HR group (p < 0.0001). In contrast, Lactobacillus and Rothia were significantly enriched in the HR group (p < 0.0001) (Figure 3A) in comparison to the LR group. Clostridiaceae and Porphyromonas were significantly increased in the LR group than MR group. Neisseria and Capnocytophaga were greatly enriched in the MR group than HR group (Figures 3B,C).

FIGURE 3
www.frontiersin.org

Figure 3. Graphs of linear discriminant analysis (LDA) scores for differentially enriched bacterial genera among the groups. (A) LR (green) vs. HR (red) groups; (B) LR (green) vs. MR (Yellow) groups; (C) HR (red) vs MR (Yellow) groups.

Alpha and beta diversity measures were calculated to assess the changes in diversity among groups (Supplementary Figure 1). Alpha diversity parameters revealed no significant differences observed between all groups (Supplementary Figure 1A). We then performed beta diversity analysis to assess the divergence in the community composition between the groups using the Bray-Curtis distance metric (Supplementary Figure 1B). We showed that the salivary microbiome in HR and MR were not significantly dissimilar from the LR group (p = 0.085) (Supplementary Figure 1B).

Identification of Pertinent Salivary Microbial Markers Associated With the Cardiovascular Diseases Score Using Machine Learning Models

The apparent differences between the study groups using alpha and beta diversity measures were not identified due to the significant sample size differences and imbalance. In this study, the participants were selected from the QGP Cohort, who provided saliva samples exclusively. QBB collected the biosamples from all volunteers as a sampling of Qatari population without focusing on CVD risk-based recruitment. We decided to use regression-based ML selection of pertinent SM biomarkers to avoid bias based on the sample size. The data were split 50-times randomly, using the four feature selection techniques, and the whole dataset was used without any exclusion (Figure 1B).

To search for pertinent variables, we focused on the abundances of SM selected at least 80% of the time among the 50-random splits of the data and the four feature selection techniques as described in the section “Materials and Methods.” Our results are shown in Figure 4. Seven microbes were selected at least 80% of the time using the binary and Arcsin transformations by all the ML methods (Lasso, SCAD, ENet, and MCP) (Figures 4A,B). Three microbes were presented at all the tested models and both transformations (Figures 4C,D). In comparison, four microbes were specific to the binary transformation and four were particular to the Arcsin transformation (Figure 4D). The common microbes were Prevotella, Tissierellaceae, and Desulfobulbus (Figure 4D). To better understand how these microbes affect the CVD-score, we counted the sign of the regression coefficients number of times, Positive, Negative, or Zero (Figure 4E). From this analysis, the three microbes mentioned above contribute to an increase in the CVD score (Figure 4E). At the same time, our data showed that an increase in Clostridiaceae level contributed to a decrease in CVD-score (Figure 4F). Assessment using the Mean squared error (MSE) method disclosed that binary transformation has better prediction accuracy than Arcsin (Figure 4G).

FIGURE 4
www.frontiersin.org

Figure 4. Machine learning models. Barplots representing the selection percentages of the microbes selected at least 80% of the time by the four methods over the 50 random splits of the data. (A) Binary transformation. (B) Arcsin transformation. (C) Venn Diagram showing the number of microbes. (D) Heatmap [presence (green)/absence (red)] of selected microbes using Binary and Arcsin transformations. (E) Balloon plot representing sign counts of the regression coefficients: Binary transformation (F) Arcsin transformation. The size of circles represents the number of splits. The color represents the number of counts. (G) Box plots of the MSE for the four-methods and the two transformations applied to the microbiome abundance data. Each point of the boxplot represents the MSE on the test-set.

Discussion

The need for practical, non-invasive tools for predicting and preventing CVD risk has led to concerted research efforts in recent years to identify and characterize biomarkers associated with the disease as a step forward toward precision medicine. In addition, recent studies on the microbiome have enlightened its role in human health and disease (Solbiati and Frias-Lopez, 2018). Despite that, the diversity of the gut microbiome is affected by several factors like gender, ethnicity, age, and environmental factors; it was found to be associated with many diseases, including CVD and IBD using ML-models (Gulden, 2018; Chang and Kao, 2019). However, the potential use of the SM composition in assessing CVD is still lacking.

This study evaluated whether the SM composition can predict a high risk for developing CVD in a diverse Qatari population. Using a large cohort of 2,974 Qatari participants and based on the CVD risk score, we showed for the first time that the SM composition in LR and HR individuals is different (LefSe analysis). A significant SM alteration was observed between LR, MR, and HR groups (Figures 3A–C). Furthermore, Capnocytophaga and Clostridiaceae were significantly enriched in the LR group (Figure 3A). While no studies are addressing the role of Capnocytophaga in health and disease, a study among Japanese patients showed that non-ischemic heart failure is associated with lower levels of Clostridiaceae (Katsimichas et al., 2018). In line with our findings, a significant reduction of Clostridiaceae was observed in the HR-CVD group in the Qatari population (Figures 3A,4D,F).

Moreover, our data showed that Lactobacillus and Rothia were enriched in the HR group compared to the LR group (Figure 3A). Similarly, a study aiming to utilize the gut microbiome as a diagnostic marker of coronary artery disease (CAD) in the Japanese population has revealed that Lactobacilli were more abundant in patients with CAD than their matching controls (Emoto et al., 2017). On the other hand, Rothia, a nitrate-reducing bacterium, was enriched in hypertensive patients (Wang et al., 2021).

Next, we employed a novel approach of regression-based machine learning by combining the entire dataset of 16S rDNA sequencing data with ML models to identify the potential predictors of HR CVD without stratifying the cohort to mask the bias due to sample size differences among groups. We found that three microbes (Prevotella, Tissierellaceae, and Desulfobulbus) were represented by binary and Arcsin transformations and different training model techniques. Those were associated with high CVD-score (Figure 4). The Bogalusa Heart Study aimed to associate the lifetime CVD risk among the participants using the gut microbes revealed that the genus Prevotella was significantly enriched in the CVD HR participants (Kelly et al., 2016). Also, the role of gut microbiome in Chinese CVD patients with cardiac valve calcification revealed that Prevotella is a potential pathogen that is positively correlated with LDL (Liu et al., 2019). Moreover, hypertensive rats had a significant increase of Tissierellaceae in the gut microbiome (Sherman et al., 2018). Furthermore, Tissierella soehngenia was more abundant in rats with acute myocardial infarction than in the control groups (Wu et al., 2017). Tissierellaceae produces trimethyl amino N-oxide (TMAO), a known microbial metabolite associated with heart attack, stroke, and chronic kidney disease (Al-Obaide et al., 2017). Our study showed that Desulfobulbus – sulfidogenic bacterium (Devkota et al., 2012) has a positive regression coefficient with CVD scores in both trained models (Figures 4C,D). The elevated level of Desulfobulbus is known to trigger proinflammatory cytokines in patients with rheumatoid arthritis and periodontitis (Eriksson et al., 2019). Moreover, its abundance is positively correlated with age rendering it an excellent predictor to diagnose systemic diseases like diabetes and CVD (Tomas et al., 2012).

To our knowledge, this study is the first to demonstrate the promising potential of artificial intelligence via ML modeling for a convenient prediction screening of CVD based on the SM composition in the Arab population. While most ML strategies based on the health records (including age, sex, smoking habit, systolic BP, total cholesterol, HDL, cholesterol, BP treatment, and diabetes), fewer studies used gut microbiome profiles to predict IBD and CVD with an AUC of ≈0.70 and 0.90, respectively (Masetic and Subasi, 2016; Weng et al., 2017; Aryal et al., 2020; Tsoi et al., 2020; Manandhar et al., 2021). A pilot study of Japanese patients with atherosclerotic cardiovascular disease (ACVD) revealed that SM could be used as an optimal marker of ACVD with an AUC of 0.933 (Kato-Kogoe et al., 2021). It is a promising finding to enable the discovery of non-invasive biomarkers that can predict the risk of the disease before it occurs. This study is novel, and the outcomes will be a step toward developing new biomarkers for early non-invasive testing aiming to reduce the CVD burden. The main limitation of this study is the single time point recruitment of the participants without any follow-up on the participants, in addition to the imbalance in the sample size between the groups. This study mainly focuses on the SM shift with a change in CVD-score. In this study, we did not consider the other confounding factors such as chronic diseases like diabetes, arthritis, and hypertension and their treatment, which can also influence the SM shift.

Further studies are warranted to confirm our findings and the potential use of these microbial signatures as diagnostic or prognostic markers. In addition, more investigation of these biomarkers for their mechanistic and pathophysiological evidence could be helpful in the personalized approach to treat CVD.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: NCBI (accession: PRJNA781451).

Ethics Statement

The studies involving human participants were reviewed and approved by the Institutional Review Board (IRB) of Sidra Medicine under (protocol #1510001907) and Qatar Biobank (QBB) (protocol #E/2018/QBB-RES-ACC-0063/0022). The patients/participants provided their written informed consent to participate in this study.

Author Contributions

SAK designed the study, obtained funds for the project, reviewed the data, and finalized the manuscript. SM processed the samples, analyzed the data, and wrote the initial draft. AT and DB calculated the CVD scores and reviewed the data and the manuscript. ME analyzed the data using ML techniques. All authors reviewed and accepted the final version of the manuscript.

Funding

This project was financially supported by funds from Qatar National Research Fund, project # PPM2-0216-170012 and Qatar Genome Program to SAK.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

Samples were collected from QGP participants. Data and samples used in this study were obtained from QBB (www.qatarbiobank.org.qa). Sample processing for DNA extraction was performed by members from the Omics Core at Sidra Medicine.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2021.772736/full#supplementary-material

Supplementary Figure 1 | (A) Alpha diversity measures for the LR, MR, and HR groups. (B) Principal Coordinates Analysis (PCoA) based on Bray-Curtis distances of SM.

References

Aagaard, K., Petrosino, J., Keitel, W., Watson, M., Katancik, J., Garcia, N., et al. (2013). The Human Microbiome Project strategy for comprehensive sampling of the human microbiome and why it matters. FASEB J. 27, 1012–1022. doi: 10.1096/fj.12-220806

PubMed Abstract | CrossRef Full Text | Google Scholar

Akyol, T. Y. (2019). RVenn: An R Package for Set Operationson Multiple Sets.

Google Scholar

Allen, L., Williams, J., Townsend, N., Mikkelsen, B., Roberts, N., Foster, C., et al. (2017). Socioeconomic status and non-communicable disease behavioural risk factors in low-income and lower-middle-income countries: a systematic review. Lancet Glob. Health 5, e277–e289. doi: 10.1016/S2214-109X(17)30058-X

CrossRef Full Text | Google Scholar

Al-Kaabi, S. K., and Atherton, A. (2015). Impact of noncommunicable diseases in the State of Qatar. Clinicoecon. Outcomes Res. 7, 377–385. doi: 10.2147/CEOR.S74682

PubMed Abstract | CrossRef Full Text | Google Scholar

Al-Obaide, M. A. I., Singh, R., Datta, P., Rewers-Felkins, K. A., Salguero, M. V., Al-Obaidi, I., et al. (2017). Gut microbiota-dependent trimethylamine-N-oxide and serum biomarkers in patients with T2DM and advanced CKD. J. Clin. Med. 6:86. doi: 10.3390/jcm6090086

PubMed Abstract | CrossRef Full Text | Google Scholar

Al-Shamsi, S., Regmi, D., and Govender, R. D. (2019). Incidence of cardiovascular disease and its associated risk factors in at-risk men and women in the United Arab Emirates: a 9-year retrospective cohort study. BMC Cardiovasc. Disord. 19:148. doi: 10.1186/s12872-019-1131-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Amon, P., and Sanderson, I. (2017). What is the microbiome? Arch. Dis. Child. Educ. Pract. Ed. 102, 257–260.

Google Scholar

Andrews, S. (2010). FastQC: A Quality Control Tool for High Throughput Sequence Data.

Google Scholar

Aryal, S., Alimadadi, A., Manandhar, I., Joe, B., and Cheng, X. (2020). Machine learning strategy for gut microbiome-based diagnostic screening of cardiovascular disease. Hypertension 76, 1555–1562. doi: 10.1161/HYPERTENSIONAHA.120.15885

PubMed Abstract | CrossRef Full Text | Google Scholar

Balakumar, P., Maung, U. K., and Jagadeesh, G. (2016). Prevalence and prevention of cardiovascular disease and diabetes mellitus. Pharmacol. Res. 113, 600–609. doi: 10.1016/j.phrs.2016.09.040

PubMed Abstract | CrossRef Full Text | Google Scholar

Bolger, A. M., Lohse, M., and Usadel, B. (2014). Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120. doi: 10.1093/bioinformatics/btu170

PubMed Abstract | CrossRef Full Text | Google Scholar

Breheny, P. J. (2020). Regularization Paths for SCAD and MCP Penalized Regression Models. CRAN 3.12.0.

Google Scholar

Caldwell, M., Martinez, L., Foster, J. G., Sherling, D., and Hennekens, C. H. (2019). Prospects for the primary prevention of myocardial infarction and stroke. J. Cardiovasc. Pharmacol. Ther. 24, 207–214. doi: 10.1177/1074248418817344

PubMed Abstract | CrossRef Full Text | Google Scholar

Caporaso, J. G., Kuczynski, J., Stombaugh, J., Bittinger, K., Bushman, F. D., Costello, E. K., et al. (2010). QIIME allows analysis of high-throughput community sequencing data. Nat. Methods 7, 335–336. doi: 10.1038/nmeth.f.303

PubMed Abstract | CrossRef Full Text | Google Scholar

Chang, C. S., and Kao, C. Y. (2019). Current understanding of the gut microbiota shaping mechanisms. J. Biomed. Sci. 26:59. doi: 10.1186/s12929-019-0554-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Cortez, R. V., Taddei, C. R., Sparvoli, L. G., Angelo, A. G. S., Padilha, M., Mattar, R., et al. (2019). Microbiome and its relation to gestational diabetes. Endocrine 64, 254–264. doi: 10.1007/s12020-018-1813-z

PubMed Abstract | CrossRef Full Text | Google Scholar

D’Agostino, R. B. Sr., Pencina, M. J., Massaro, J. M., and Coady, S. (2013). Cardiovascular disease risk assessment: insights from Framingham. Glob. Heart 8, 11–23. doi: 10.1016/j.gheart.2013.01.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Davis, E., Bakulski, K. M., Goodrich, J. M., Peterson, K. E., Marazita, M. L., and Foxman, B. (2020). Low levels of salivary metals, oral microbiome composition and dental decay. Sci. Rep. 10:14640. doi: 10.1038/s41598-020-71495-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Deo, P. N., and Deshmukh, R. (2019). Oral microbiome: unveiling the fundamentals. J. Oral Maxillofac. Pathol. 23, 122–128. doi: 10.4103/jomfp.JOMFP_304_18

CrossRef Full Text | Google Scholar

DeSantis, T. Z., Hugenholtz, P., Larsen, N., Rojas, M., Brodie, E. L., Keller, K., et al. (2006). Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl. Environ. Microbiol. 72, 5069–5072. doi: 10.1128/AEM.03006-05

PubMed Abstract | CrossRef Full Text | Google Scholar

Devkota, S., Wang, Y., Musch, M. W., Leone, V., Fehlner-Peach, H., Nadimpalli, A., et al. (2012). Dietary-fat-induced taurocholic acid promotes pathobiont expansion and colitis in Il10–/– mice. Nature 487, 104–108. doi: 10.1038/nature11225

PubMed Abstract | CrossRef Full Text | Google Scholar

Dong, M., Li, L., Chen, M., Kusalik, A., and Xu, W. (2020). Predictive analysis methods for human microbiome data with application to Parkinson’s disease. PLoS One 15:e0237779. doi: 10.1371/journal.pone.0237779

PubMed Abstract | CrossRef Full Text | Google Scholar

Emoto, T., Yamashita, T., Kobayashi, T., Sasaki, N., Hirota, Y., Hayashi, T., et al. (2017). Characterization of gut microbiota profiles in coronary artery disease patients using data mining analysis of terminal restriction fragment length polymorphism: gut microbiota could be a diagnostic marker of coronary artery disease. Heart Vessels 32, 39–46. doi: 10.1007/s00380-016-0841-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Eriksson, K., Fei, G., Lundmark, A., Benchimol, D., Lee, L., Hu, Y. O. O., et al. (2019). Periodontal health and oral microbiota in patients with rheumatoid arthritis. J. Clin. Med. 8:630. doi: 10.3390/jcm8050630

PubMed Abstract | CrossRef Full Text | Google Scholar

Gulden, E. (2018). Lifestyle factors affecting the gut microbiota’s relationship with type 1 diabetes. Curr. Diab. Rep. 18:111. doi: 10.1007/s11892-018-1098-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Haj Bakri, A., and Al-Thani, A. (2012). Chronic Disease Risk Factor Surveillance. Qatar STEPwsie Report 2012. Doha: The Supreme Council of Health.

Google Scholar

Hastie, T., and Qian, J. (2014). Glmnet Vignette, 1–30.

Google Scholar

Jamshidi, A., Pelletier, J. P., and Martel-Pelletier, J. (2019). Machine-learning-based patient-specific prediction models for knee osteoarthritis. Nat. Rev. Rheumatol. 15, 49–60. doi: 10.1038/s41584-018-0130-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Kassambara, A. (2020). ggpubr: ‘ggplot2’ Based Publication Ready Plots. R Package Version 0.3.0.

Google Scholar

Kato-Kogoe, N., Sakaguchi, S., Kamiya, K., Omori, M., Gu, Y. H., Ito, Y., et al. (2021). Characterization of salivary microbiota in patients with atherosclerotic cardiovascular disease: a case-control study. J. Atheroscler. Thromb. doi: 10.5551/jat.60608

PubMed Abstract | CrossRef Full Text | Google Scholar

Katsimichas, T., Ohtani, T., Motooka, D., Tsukamoto, Y., Kioka, H., Nakamoto, K., et al. (2018). Non-ischemic heart failure with reduced ejection fraction is associated with altered intestinal microbiota. Circ. J. 82, 1640–1650. doi: 10.1253/circj.CJ-17-1285

PubMed Abstract | CrossRef Full Text | Google Scholar

Kelly, T. N., Bazzano, L. A., Ajami, N. J., He, H., Zhao, J., Petrosino, J. F., et al. (2016). Gut microbiome associates with lifetime cardiovascular disease risk profile among Bogalusa heart study participants. Circ. Res. 119, 956–964. doi: 10.1161/CIRCRESAHA.116.309219

PubMed Abstract | CrossRef Full Text | Google Scholar

Kholy, K. E., Genco, R. J., and Van Dyke, T. E. (2015). Oral infections and cardiovascular disease. Trends Endocrinol. Metab. 26, 315–321.

Google Scholar

Kohli, A., Holzwanger, E. A., and Levy, A. N. (2020). Emerging use of artificial intelligence in inflammatory bowel disease. World J. Gastroenterol. 26, 6923–6928.

Google Scholar

Lear, S. A., Hu, W., Rangarajan, S., Gasevic, D., Leong, D., Iqbal, R., et al. (2017). The effect of physical activity on mortality and cardiovascular disease in 130 000 people from 17 high-income, middle-income, and low-income countries: the PURE study. Lancet 390, 2643–2654. doi: 10.1016/S0140-6736(17)31634-3

CrossRef Full Text | Google Scholar

Liu, S., Zhao, W., Liu, X., and Cheng, L. (2020). Metagenomic analysis of the gut microbiome in atherosclerosis patients identify cross-cohort microbial signatures and potential therapeutic target. FASEB J. 34, 14166–14181. doi: 10.1096/fj.202000622R

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, Z., Li, J., Liu, H., Tang, Y., Zhan, Q., Lai, W., et al. (2019). The intestinal microbiota associated with cardiac valve calcification differs from that of coronary artery disease. Atherosclerosis 284, 121–128. doi: 10.1016/j.atherosclerosis.2018.11.038

PubMed Abstract | CrossRef Full Text | Google Scholar

Manandhar, I., Alimadadi, A., Aryal, S., Munroe, P. B., Joe, B., and Cheng, X. (2021). Gut microbiome-based supervised machine learning for clinical diagnosis of inflammatory bowel diseases. Am. J. Physiol. Gastrointest. Liver Physiol. doi: 10.1152/ajpgi.00360.2020

PubMed Abstract | CrossRef Full Text | Google Scholar

Marchesi, J. R., Adams, D. H., Fava, F., Hermes, G. D., Hirschfield, G. M., Hold, G., et al. (2016). The gut microbiota and host health: a new clinical frontier. Gut 65, 330–339. doi: 10.1136/gutjnl-2015-309990

PubMed Abstract | CrossRef Full Text | Google Scholar

Masetic, Z., and Subasi, A. (2016). Congestive heart failure detection using random forest classifier. Comput. Methods Programs Biomed. 130, 54–64. doi: 10.1016/j.cmpb.2016.03.020

PubMed Abstract | CrossRef Full Text | Google Scholar

Mashima, I., Theodorea, C. F., Thaweboon, B., Thaweboon, S., Scannapieco, F. A., and Nakazawa, F. (2017). Exploring the salivary microbiome of children stratified by the oral hygiene index. PLoS One 12:e0185274. doi: 10.1371/journal.pone.0185274

PubMed Abstract | CrossRef Full Text | Google Scholar

McMurdie, P. J., and Holmes, S. (2013). phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data. PLoS One 8:e61217. doi: 10.1371/journal.pone.0061217

PubMed Abstract | CrossRef Full Text | Google Scholar

Minitab-17 (2010). Minitab 17 Statistical Software [Computer Software]. State College, PA: Minitab, Inc.

Google Scholar

Morganti, S., Tarantino, P., Ferraro, E., D’Amico, P., Viale, G., Trapani, D., et al. (2019). Complexity of genome sequencing and reporting: next generation sequencing (NGS) technologies and implementation of precision medicine in real life. Crit. Rev. Oncol. Hematol. 133, 171–182. doi: 10.1016/j.critrevonc.2018.11.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Murugesan, S., Al Ahmad, S. F., Singh, P., Saadaoui, M., Kumar, M., and Al Khodor, S. (2020). Profiling the salivary microbiome of the Qatari population. J. Transl. Med. 18:127. doi: 10.1186/s12967-020-02291-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Olivier, M., Asmis, R., Hawkins, G. A., Howard, T. D., and Cox, L. A. (2019). The need for multi-omics biomarker signatures in precision medicine. Int. J. Mol. Sci. 20:4781. doi: 10.3390/ijms20194781

PubMed Abstract | CrossRef Full Text | Google Scholar

Segata, N., Izard, J., Waldron, L., Gevers, D., Miropolsky, L., Garrett, W. S., et al. (2011). Metagenomic biomarker discovery and explanation. Genome Biol. 12:R60. doi: 10.1186/gb-2011-12-6-r60

PubMed Abstract | CrossRef Full Text | Google Scholar

Sherman, S. B., Sarsour, N., Salehi, M., Schroering, A., Mell, B., Joe, B., et al. (2018). Prenatal androgen exposure causes hypertension and gut microbiota dysbiosis. Gut Microbes 9, 400–421. doi: 10.1080/19490976.2018.1441664

PubMed Abstract | CrossRef Full Text | Google Scholar

Solbiati, J., and Frias-Lopez, J. (2018). Metatranscriptome of the oral microbiome in health and disease. J. Dent. Res. 97, 492–500. doi: 10.1177/0022034518761644

PubMed Abstract | CrossRef Full Text | Google Scholar

Tomas, I., Diz, P., Tobias, A., Scully, C., and Donos, N. (2012). Periodontal health status and bacteraemia from daily oral activities: systematic review/meta-analysis. J. Clin. Periodontol. 39, 213–228. doi: 10.1111/j.1600-051X.2011.01784.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Tsoi, K. K. F., Chan, N. B., Yiu, K. K. L., Poon, S. K. S., Lin, B., and Ho, K. (2020). Machine learning clustering for blood pressure variability applied to Systolic Blood Pressure Intervention Trial (SPRINT) and the Hong Kong community cohort. Hypertension 76, 569–576. doi: 10.1161/HYPERTENSIONAHA.119.14213

PubMed Abstract | CrossRef Full Text | Google Scholar

Ursell, L. K., Clemente, J. C., Rideout, J. R., Gevers, D., Caporaso, J. G., and Knight, R. (2012). The interpersonal and intrapersonal diversity of human-associated microbiota in key body sites. J. Allergy Clin. Immunol. 129, 1204–1208. doi: 10.1016/j.jaci.2012.03.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Wade, W. G. (2013). The oral microbiome in health and disease. Pharmacol. Res. 69, 137–143.

Google Scholar

Wang, P., Dong, Y., Zuo, K., Han, C., Jiao, J., Yang, X., et al. (2021). Characteristics and variation of fecal bacterial communities and functions in isolated systolic and diastolic hypertensive patients. BMC Microbiol. 21:128. doi: 10.1186/s12866-021-02195-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Weng, S. F., Reps, J., Kai, J., Garibaldi, J. M., and Qureshi, N. (2017). Can machine-learning improve cardiovascular risk prediction using routine clinical data? PLoS One 12:e0174944. doi: 10.1371/journal.pone.0174944

PubMed Abstract | CrossRef Full Text | Google Scholar

Wickham, H. (2011). ggplot2. Wiley Interdiscip. Rev. Comput. Stat. 3, 180–185.

Google Scholar

World Health Organization [WHO] (2013). Global Action Plan for the Prevention and Control of Noncommunicable Diseases 2013-2020. Geneva: WHO, 55.

Google Scholar

Wu, Z. X., Li, S. F., Chen, H., Song, J. X., Gao, Y. F., Zhang, F., et al. (2017). The changes of gut microbiota after acute myocardial infarction in rats. PLoS One 12:e0180717. doi: 10.1371/journal.pone.0180717

PubMed Abstract | CrossRef Full Text | Google Scholar

Yamashita, Y., and Takeshita, T. (2017). The oral microbiome and human health. J. Oral Sci. 59, 201–206.

Google Scholar

Zaura, E., Keijser, B. J., Huse, S. M., and Crielaard, W. (2009). Defining the healthy “core microbiome” of oral microbial communities. BMC Microbiol. 9:259. doi: 10.1186/1471-2180-9-259

PubMed Abstract | CrossRef Full Text | Google Scholar

Zaura, E., Nicu, E. A., Krom, B. P., and Keijser, B. J. (2014). Acquiring and maintaining a normal oral microbiome: current perspective. Front. Cell. Infect. Microbiol. 4:85. doi: 10.3389/fcimb.2014.00085

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, C.-H. (2010). Nearly unbiased variable selection under minimax concave penalty. Ann. Stat. 38, 894–942.

Google Scholar

Zhang, J., Kobert, K., Flouri, T., and Stamatakis, A. (2014). PEAR: a fast and accurate Illumina paired-end reAd mergeR. Bioinformatics 30, 614–620. doi: 10.1093/bioinformatics/btt593

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhong, Y., Xu, F., Wu, J., Schubert, J., and Li, M. M. (2021). Application of next generation sequencing in laboratory medicine. Ann. Lab. Med. 41, 25–43. doi: 10.3343/alm.2021.41.1.25

PubMed Abstract | CrossRef Full Text | Google Scholar

Zou, H., and Hastie, T. (2005). Regularization and variable selection via the elastic net. J. R. Stat. Soc. Ser. B Stat. Methodol. 67, 301–320. doi: 10.1093/brain/awv075

PubMed Abstract | CrossRef Full Text | Google Scholar

Zou, H., and Li, R. (2008). One-step sparse estimates in nonconcave penalized likelihood models. Ann. Stat. 36, 1509–1533. doi: 10.1214/009053607000000802

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: CVD, salivary microbiome, precision medicine, machine learning, QGP

Citation: Murugesan S, Elanbari M, Bangarusamy DK, Terranegra A and Al Khodor S (2021) Can the Salivary Microbiome Predict Cardiovascular Diseases? Lessons Learned From the Qatari Population. Front. Microbiol. 12:772736. doi: 10.3389/fmicb.2021.772736

Received: 08 September 2021; Accepted: 17 November 2021;
Published: 10 December 2021.

Edited by:

Florence Carrouel, Université Claude Bernard Lyon 1, France

Reviewed by:

Ina Saliasi, Université Claude Bernard Lyon 1, France
Frédéric Denis, Université de Nantes, France

Copyright © 2021 Murugesan, Elanbari, Bangarusamy, Terranegra and Al Khodor. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Souhaila Al Khodor, salkhodor@sidra.org

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.