Original Research ARTICLE
What's Normal? Microbiomes in Human Milk and Infant Feces Are Related to Each Other but Vary Geographically: The INSPIRE Study
- 1Margaret Ritchie School of Family and Consumer Sciences, University of Idaho, Moscow, ID, United States
- 2Department of Animal and Veterinary Science, University of Idaho, Moscow, ID, United States
- 3Department of Anthropology, Washington State University, Pullman, WA, United States
- 4Statistical Programs, College of Agricultural and Life Sciences, University of Idaho, Moscow, ID, United States
- 5Department of Biological Sciences, University of Idaho, Moscow, ID, United States
- 6Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
- 7Department of Human Nutrition, Egerton University, Nakuru, Kenya
- 8Department of Women and Children's Health, King's College London, London, United Kingdom
- 9MRC Unit The Gambia at the London School of Hygiene and Tropical Medicine, Fajara, Gambia
- 10MRC International Nutrition Group, London School of Hygiene and Tropical Medicine, London, United Kingdom
- 11Department of Anthropology, Hawassa University, Hawassa, Ethiopia
- 12Faculty of Medicine, Lund University, Lund, Sweden
- 13Department of Nutrition and Food Science, University of Ghana, Accra, Ghana
- 14Probisearch, Tres Cantos, Spain
- 15Department of Microbiology and Biochemistry of Dairy Products, Instituto de Productos Lácteos de Asturias (IPLA-CSIC), Villaviciosa, Spain
- 16Department of Nutrition, Food Science, and Food Technology, Complutense University of Madrid, Madrid, Spain
- 17Nutrition Research Institute, Lima, Peru
- 18Larsson-Rosenquist Foundation Mother-Milk-Infant Center of Research Excellence, University of California, San Diego, La Jolla, CA, United States
- 19Department of Pediatrics, University of California, San Diego, La Jolla, CA, United States
Background: Microbial communities in human milk and those in feces from breastfed infants vary within and across populations. However, few researchers have conducted cross-cultural comparisons between populations, and little is known about whether certain “core” taxa occur normally within or between populations and whether variation in milk microbiome is related to variation in infant fecal microbiome. The purpose of this study was to describe microbiomes of milk produced by relatively healthy women living at diverse international sites and compare these to the fecal microbiomes of their relatively healthy infants.
Methods: We analyzed milk (n = 394) and infant feces (n = 377) collected from mother/infant dyads living in 11 international sites (2 each in Ethiopia, The Gambia, and the US; 1 each in Ghana, Kenya, Peru, Spain, and Sweden). The V1-V3 region of the bacterial 16S rRNA gene was sequenced to characterize and compare microbial communities within and among cohorts.
Results: Core genera in feces were Streptococcus, Escherichia/Shigella, and Veillonella, and in milk were Streptococcus and Staphylococcus, although substantial variability existed within and across cohorts. For instance, relative abundance of Lactobacillus was highest in feces from rural Ethiopia and The Gambia, and lowest in feces from Peru, Spain, Sweden, and the US; Rhizobium was relatively more abundant in milk produced by women in rural Ethiopia than all other cohorts. Bacterial diversity also varied among cohorts. For example, Shannon diversity was higher in feces from Kenya than Ghana and US-California, and higher in rural Ethiopian than Ghana, Peru, Spain, Sweden, and US-California. There were limited associations between individual genera in milk and feces, but community-level analyses suggest strong, positive associations between the complex communities in these sample types.
Conclusions: Our data provide additional evidence of within- and among-population differences in milk and infant fecal bacterial community membership and diversity and support for a relationship between the bacterial communities in milk and those of the recipient infant's feces. Additional research is needed to understand environmental, behavioral, and genetic factors driving this variation and association, as well as its significance for acute and chronic maternal and infant health.
Although long thought to be sterile, human milk is now known to contain myriad bacteria, and growing evidence suggests that the composition and profiles of these microbiomes differ among geographically distinct populations of women. The microbiome of milk produced by healthy women is of scientific and public health interest because these microbes may, at least in part, determine which microbial communities are in the gastrointestinal (GI) tracts of their infants (1–5). The infant GI microbiome (often assessed through the analysis of feces) is of substantial interest because its variation has been associated with a variety of human diseases, both in early and later life [reviewed in (6)].
In the first report of a complex bacterial community in human milk using high-throughput methodology, the milk microbiome of healthy women (n = 16) in the Moscow, ID/Pullman, WA region of the United States was found to be dominated by Streptococcus, Staphylococcus, Serratia, and Corynebacterium (7). Bacterial communities appeared to be somewhat unique for each woman, although 9 “core” genera were common all samples. Since the publication of this paper, additional studies have suggested that the primary bacterial taxa in milk vary across populations (Supplementary Table 1). For example, Cabrera-Rubio et al. (8) found that Leuconostoc, Weisella, and Lactococcus were the most predominant genera in milk produced by women living in Finland (n = 18), while Davé et al. (9) reported Streptococcus, Staphylococcus, Xanthomonadaceae, and Sediminibacterium were the most abundant taxa in milk produced by Mexican-American mothers (n = 10). In Chinese and Taiwanese women (n = 133), family-level analysis revealed Streptococcaceae, Pseudomonadaceae, Staphylococcaceae, Lactobacillaceae, and Oxalobacteraceae as the most abundant taxa (10). Other reports suggest additional differences among populations (11–16), although some similarities are notable, such as the dominance of members of the Streptococcaceae and Staphylococcaceae families. It is unknown, however, if this variation is due to genuine differences among populations, differences in location of milk collection (e.g., hospital vs. home), or differences in sample collection methods, storage, processing, and analyses. This is a persistent problem in microbiome research and can only be solved with rigorously controlled studies of representative cohorts of women from diverse populations, including standardized milk collection protocols.
To this end and to help address other potential confounders, Kumar et al. (17) investigated the influence of geographic location on the milk microbiome by collecting and analyzing milk produced by 80 healthy women (20 each from Spain, Finland, South Africa, and China) at 1 mo postpartum. Substantial differences were found among cohorts, and variation was related to a variety of factors, such as delivery mode. For example, milk produced by Chinese women contained relatively more Streptococcus than milk produced by women in all other cohorts, while milk produced by Spanish women had relatively more Propionibacterium and Pseudomonas than that produced in other locations. This study also demonstrated that milk produced by Spanish and South African women was characterized by relatively higher proportion of bacterial genes involved in lipid, amino acid, and carbohydrate metabolism than that of Finnish women. As such, not only do there appear to be genuine compositional differences in the milk microbiome around the world, but there may also be differences in microbial function.
Similarly, a substantial and growing literature exists regarding the human fecal microbiome during infancy [for example, (2, 18–22)]. These studies also report differences across global populations. For instance, in a study of 6-mo-old Malawian and Finnish infants, Bifidobacterium was the most common genus despite other distinct population differences such as a higher relative abundance of Bacteroidetes-Prevotella and Clostridium histolyticum in the Malawian infants (23). Murphy et al. (12) found that Staphylococcus, Escherichia-Shigella, and Veillonella were the most abundant microbial genera in infant feces in Ireland. This contrasts with some earlier work. For example, Backhëd et al. (20) reported that at 4 mo, Bifidobacterium, Lactobacillus, Collinsella, Granulicatella, and Veillonella dominated microbial communities in feces of vaginally delivered infants in Sweden, while Bifidobacterium, Ruminococcus, and Bacteroidetes dominated feces of infants born via cesarean section in the same location. (24) also reported a high relative abundance of Bifidobacterium in feces of Gambian infants over the first 6 mo of life, followed by Streptococcus and members of the Enterobacteriaceae family. The common taxa in many of these results suggest that there may be some shared patterns in the bacterial community in infant feces (e.g., Bifidobacterium and Bacteroidetes as the most abundant genera across cohorts). However, it remains unclear whether these commonalities and differences are genuine or are simply due to methodological differences. True differences, as opposed to biases introduced by varying methodology, might indicate that microbial communities are shaped (at least in part) by some combination of genetics, environment, and behavior, and that there may not be a universal or “normal” infant fecal microbiome representative of health or disease.
The primary purpose of the study described here was to, using standardized collection and analysis procedures, characterize and compare human milk and infant fecal microbiomes in selected global regions. Our hypotheses were that: (1) human milk and infant fecal microbiomes vary among cohorts representing populations from selected geographical regions; (2) there exists a “core” group of bacteria common to milk across cohorts; (3) there exists a “core” group of bacteria common to infant feces across cohorts; (4) variation in the milk microbiome is related to variation in the infant fecal microbiome; and (5) milk and fecal microbiomes of mothers and their own infants are more similar to each other than to maternal/infant dyads in other cohorts. Relationships of milk and fecal microbiomes with other important factors such as delivery mode, other milk components, maternal and infant diets, and household composition and childcare parameters will be addressed in subsequent publications.
Materials and Methods
Aim, Design, and Setting
All study procedures were approved by the Washington State University Institutional Review Board (#13264) and at each study location. Sample collection took place between May 2014 and April 2016 and was carried out as a cross-sectional, epidemiological, multi-cohort study. Informed written or verbal consent was obtained in the local language from each participating woman in her primary language. Informed verbal consent was obtained when a subject's literacy level prevented traditional written consent. Verbal consent, approved and required by the overarching and site-specific IRB boards, was also obtained by team members fluent in the local language. Samples were collected from 11 populations (cohorts), including Ethiopia (a rural population denoted ETR; and an urban population denoted ETU); Kenya (KE), Ghana (GN), The Gambia (a rural population denoted GBR; and an urban population denoted GBU), Peru (PE), Spain (SP), Sweden (SW), and the United States (including a self-identified Hispanic population recruited from California and denoted USC; and an ethnically heterogenous population living in Washington/Idaho, primarily composed of women of northern European descent, denoted USW). Details about these populations have been published previously (25, 26), and subject/sample disposition is summarized in Figure 1. A total of 413 mothers and their infants were enrolled.
Figure 1. Flowchart depicting the disposition of infant feces and milk included in this study. ETR, rural Ethiopia; ETU, urban Ethiopia; GBR, rural Gambia; GBU, urban Gambia; GN, Ghana; KE, Kenya; SP, Spain; SW, Sweden; PE, Peru; USC, California (United States); USW, Washington (United States).
For inclusion, women had to be breastfeeding or pumping ≥ 5 times/d and be ≥ 18 y of age. Our goal was to enroll women between 1 and 3 mo (± 7 d) postpartum, although 17 women were outside this target, resulting in a range from 20 to 161 d postpartum. Exclusion criteria for women included (1) current indication of a breast infection or breast pain that the woman did not consider normal for lactation, (2) illness (including fever, vomiting, severe cough, or diarrhea) in the last 7 d, and/or (3) antibiotic use in the previous 30 d. Women did not need to be exclusively breastfeeding to participate. To be included, infants had to be described as healthy by their mothers, have no signs and/or symptoms of acute illness (fever, vomiting, severe cough, diarrhea, or rapid breathing) in the previous 7 d, and have not taken antibiotics in the previous 30 d.
Anthropometric, Demographic, and Anthropologic Information
Women's height and weight were measured, and body mass index (BMI) calculated. Infants were weighed, and their length measured. Infant weight-for-length z-scores were calculated using the restricted analysis function in the World Health Organization's Anthro igrowup macro (27) using R (version 3.4.1). Weight-for-length z-scores flagged as biologically implausible (< −5 or > 5) were removed from the analysis (n = 3). Extensive in-person survey data were collected on aspects of delivery, maternal, and infant characteristics; household composition; maternal and infant diet; and other lifestyle variables. For this study, “exclusively breastfed” was defined as never having received liquids (e.g., water, formula) or semi-solid or solid foods. If an infant received oral medications or non-nutritive dietary supplements (e.g., gripe water, vitamin drops) at any point in his/her lifetime, but was not fed other liquids or foods, he/she was still considered exclusively breastfed. Selected anthropometric and demographic data of the women and infants for whom milk and/or fecal samples were successfully collected and characterized are provided in Table 1.
Table 1. Description of the subjects for whom milk and/or fecal samples were successfully collected and characterized.
Infant Fecal Sample Collection
Fecal samples (~1 g) were collected from 406 infants. When possible, fecal samples were collected by study personnel at the same time the milk was collected; when this was not possible, mothers collected the next fecal sample available. Samples were collected from provided diapers (Parent's Choice; Walmart, Bentonville, AR) or directly from the infant's skin using a sterile scoop (Sarstedt AG & Co., Nümbrecht, Germany); the sample was then placed in the sterile polypropylene container accompanying the scoop and frozen within 30 min of collection (except ETR) at −20°C. In ETR, because of unreliable electricity, RNAlater® (Ambion) was added to each fecal sample in a ~1:4 ratio (feces: preservative) and frozen within 6 d. All samples were shipped on dry ice to the University of Idaho, Moscow, ID, USA, where they were immediately frozen at −20°C.
Extraction of DNA From Feces
After thawing feces at room temperature, 0.2 g of each sample was transferred into a sterile tube, 0.5 mL TE50 (10 mM Tris-HCl, 50 mM EDTA, pH 8) added, the mixture vortexed until homogeneous, and then frozen at −80°C until DNA extraction. If < 0.2 g of sample was available, 0.5 mL TE50 was added to the collection tube, the mixture vortexed, and the entire volume transferred to a sterile tube and frozen at −80°C until DNA extraction. Frozen, homogenized fecal samples were quick-thawed on a dry heat block at 37°C and vortexed to re-homogenize. DNA was extracted using the QIAamp® Fast DNA Stool Mini Kit (Qiagen, Germantown, MD) with an additional bead beating step at the beginning using 0.1 mm diameter zirconia/silica beads (BioSpec Products, Inc., Bartlesville, OK) and a FastPrep FP120A-115 (Qbiogene, Carlsbad, CA). For all rounds of DNA extractions, 500 μL TE50 taken from the same aliquot used previously to prepare the samples and 500 μL nuclease-free water (Ambion, Waltham, MA) were extracted as negative controls. Samples were eluted in 200 μL ATE buffer supplied in the kit and stored at −80°C until amplified.
Milk Sample Collection
Milk was collected from 412 of the women using methods described previously (26). Briefly, both participant and researcher wore nitrile gloves, and milk (~30 mL) was expressed using an electric pump (USW, USC, PE, and SW) and sterile collection kits (Medela, Baar, Switzerland), or hand-expressed into a sterile collection container (all other sites). Except for those collected in ETR where there was unreliable electricity for cold storage, samples were immediately placed on ice, aliquoted into polypropylene cryotubes (Simport Scientific, Saint-Mathieu-de-Beloeil, Quebec) within 30 min, and frozen at −20°C. Milk collected in ETR was preserved in a 1:1 ratio with Milk Preservation Solution (Norgen Biotek, Thorold, Ontario) and frozen within 6 d. We have shown previously that this method can maintain bacterial DNA integrity in human milk held at 37°C for at least 2 wk (28). All samples were shipped on dry ice to the University of Idaho, where they were immediately frozen at −20°C.
Extraction of DNA From Milk
For all sites except ETR, 1 mL milk was thawed on ice and centrifuged (13,000 x g) for 10 min at 4°C. After removing the lipid and supernatant layers, the cell pellet was resuspended in 500 μL TE50. Samples were subjected to enzymatic lysis by adding 100 μL of a mixture containing 50 μL lysozyme (10 mg/mL in nuclease-free water) (Sigma-Aldrich), 6 μL mutanolysin (25 KU/mL in nuclease-free water) (Sigma-Aldrich), 3 μL lysostaphin (4,000 U/mL in 20 mM sodium acetate) (Sigma-Aldrich), and 41 μL TE50 for 1 h at 37°C. Following the enzymatic lysis, samples were subjected to physical disruption by bead beating with 0.1 mm zirconia/silica beads (BioSpec Products) for 1 min on setting 5 using a FastPrep FP120 (Qbiogene). DNA was extracted using a modified protocol of the DNA Mini Kit (Qiagen), whereby 100 μL 3 M sodium acetate, pH 5.5, was added to the lysate prior to addition to the spin column. DNA was eluted in 50 μL nuclease-free water (Ambion). For each set of milk samples processed in this way, 500 μL TE50 was extracted as a negative control.
DNA in 0.25 mL of each milk sample collected in ETR was extracted using the kit accompanying the Milk Preservation Solution (Norgen Biotek, Thorold, Ontario) as per manufacturer's instructions, including the 2 h enzymatic lysis (20 mg/mL lysozyme). DNA was eluted in 100 μL elution buffer (included with the kit) and stored at −20°C until amplification. Nuclease-free water (500 μL; Ambion) was extracted as a negative control.
Amplification of Bacterial DNA
For both milk and feces, a dual-barcoded, two-step 30-cycle polymerase chain reaction (PCR) was conducted to amplify the V1-V3 hypervariable region of the 16S rRNA bacterial gene. For the first step, a 7-fold degenerate forward primer targeting position 27 [modified from Frank et al. (29)] and a reverse primer targeting position 534 (positions numbered according to the Escherichia coli rRNA gene) were used as described previously (30). The reaction mixture included 12.5 μL Q5® Hot Start High-Fidelity 2X Master Mix (New England Biolabs®, Inc., Ipswich, MA); 0.25 μL each forward and reverse primers (10 μM each); 2 μL template DNA; and 8 μL nuclease-free, sterile water (Ambion) to bring the reaction volume for the first PCR to 23 μL. Nuclease-free water (2 μL) and Escherichia coli DNA (2 μL; 221 ng/mL) were used as PCR negatives and positives, respectively. The first PCR was conducted in 96-well plates (USA Scientific, Ocala, FL), using the Veriti model thermal cycler (Applied Biosystems, Foster City, CA) under the following conditions: initial denaturation at 98°C for 30 s; followed by 15 cycles of denaturation at 98°C for 10 s, annealing at 51°C for 20 s, and extension at 72°C for 20 s. After the 15th extension cycle, the thermal cycler was paused at 72°C and the samples removed. For the second step, 2 μL of a unique barcoded primer pair with Illumina adaptors attached [2 μM; obtained from the University of Idaho's Institute for Bioinformatics and Evolutionary Studies (IBEST) Genomics Core facility] were then added to each sample. The plate was vortexed, briefly centrifuged, placed in the thermal cycler, heated to 98°C for 30 s, amplified for an additional 15 cycles as described above (except with an annealing temperature of 60°C), subjected to a 2-min final extension step at 72°C, and held at 4°C until the plate was removed from the thermal cycler.
DNA Quality, Quantification, and Pooling
PCR products for all negative and positive controls were electrophoresed at 80 V on a 1% agarose gel for 30 min in tris-acetate-ethylenediamine tetraacetic acid (EDTA) buffer (TAE; 40 mM Tris, 20 mM acetic acid, 1 mM EDTA); stained with GelRedTM (10X, Biotium, Fremont, CA); and run alongside a 1-kb ladder (Thermo Fisher Scientific, Grand Island, NY). Gels were visualized using an UltraCam Digital Imaging System (BioRad, Hercules, CA). Amplicon quality was assessed using the QIAxcel DNA Screening cartridge (Qiagen). Briefly, 2 μL PCR product generated in the second amplification was added to 8 μL QX DNA dilution buffer and visualized using high-resolution capillary electrophoresis on a QIAxcel Advanced System (Qiagen). Samples with a peak at the appropriate amplicon size and minimum levels of primer dimers were deemed as having adequate quality. DNA (2 μL amplicon combined with 198 μL of a solution Qubit dsDNA HS reagent and Qubit HS buffer in a 1:200 dilution) was quantified using the Qubit® 2.0 Fluorometer and the QubitTM dsDNA High Sensitivity Assay (Thermo Fisher Scientific, Waltham, MA). Samples were pooled to contain 50 ng DNA from each sample. If samples amplified poorly, an attempt at re-amplification was made. Subsequently, amplicons were combined if necessary to obtain sufficient amounts of DNA, or the entire volume from the PCR was used.
Sequencing and Identification of Microbial DNA
Amplicon pools were size-selected using AMPure beads (Beckman Coulter, Indianapolis, IN); quality checked on a Fragment Analyzer (Advanced Analytical Technologies, Inc., Ankeny, IA); and quantified using the KAPA Biosciences Illumina library quantification kit and Applied Biosystems StepOne Plus real-time PCR system. Pools of PCR amplicons for milk and feces were sequenced in two separate sequencing runs by sample type. Sequences were obtained using an Illumina MiSeq (San Diego, CA) v3 paired-end 300-bp protocol for 600 cycles at the IBEST Genomics Core.
Sequence reads were demultiplexed using dbcAmplicons (a custom python application; https://github.com/msettles/dbcAmplicons). During preprocessing, barcodes were allowed ≤ 1 mismatch (hamming distance), and primers were allowed ≤ 4 mismatches (Levenshtein distance) as long as the final 4 bases of the primer perfectly matched the target sequence. Sequence reads without a corresponding barcode and primer sequence were discarded. Sequence reads were also trimmed of their primer sequence. Reads were split into separate sample R1 and R2 files using a custom python script (splitReadsBySample.py; https://github.com/msettles/dbcAmplicons/blob/master/scripts/python/splitReadsBySample.py). Sequence reads were evaluated for quality, trimmed, and filtered using the DADA2 sequence process pipeline (version 1.2.2; (31)). The output from DADA2 infers amplicon sequence variants (ASV) after modeling the errors from a subset of reads (1 × 106) from the sequencing run. Each MiSeq run was analyzed separately for error estimation. Briefly, sequence reads were truncated to 270 bases with a maxEE setting of 4. Reads were also truncated if the base call reached Q2. Reads with < 270 bases were discarded. Because of loss of reads and little overlap between forward and reverse reads following quality filtering and trimming, only forward reads were used in subsequent analyses. ASV were then assigned taxonomies using the Ribosomal Database Project (RDP) Bayesian classifier (32) and the SILVA 16S rRNA database version 123 formatted for DADA2 (33–35). Relative abundances of bacterial taxa at various taxonomic levels were calculated from sequence read count data in R (version 3.4.1) using the phyloseq package (36).
Designation of Core, Unique, and Aggregated Taxa
A set of “core” genera were characterized for each sample type both in the overall dataset and within each cohort. To be included in the core taxa, a genus must have been present in ≥ 90% of the samples and represent ≥ 0.1% of all identified taxa. We also characterized the relatively “unique” taxa across all taxa identified in the sequencing run for each sample type, defined as genera present in only one cohort and in ≥ 10% of samples within that cohort.
The 10 most-abundant genera from each cohort and for each sample type (based on relative prevalence) were identified and combined to create a set of 28 genera for infant fecal data and a set of 29 genera for the milk data. An “other” category was created at both the phylum and genus levels in these datasets, which is a sum of all other identified taxa within each dataset. These datasets are hereafter referred to as “aggregations” in subsequent text and figures.
Calculation of Diversity Indices
Diversity indices (richness, Shannon diversity, inverse Simpson diversity, and Fisher diversity) were calculated using phyloseq (36) and sequence read count data. Richness measures the absolute number of taxa present in a population (37), whereas, Shannon diversity is a compound measure of richness and evenness (38, 39). Inverse Simpson diversity is the inverse of the probability that two randomly chosen taxa belong to different genera (38, 40), and Fisher diversity describes the mathematical relationship between the number of genera and the number of individuals within each genus (41). Because some diversity indices (specifically, richness) are linked to the number of reads per sample, we rarefied the infant fecal and milk data to 1,000 reads prior to calculation of all diversity indices.
All analyses using SAS software were conducted in version 9.3 (SAS Institute Inc., Cary, NC); all other analyses were performed in R [version 3.4.1; (42)]. Significance for all statistical tests were declared at P ≤ 0.05. Prior to performing inferential statistical tests, all relative abundance data were rounded to the tenth decimal place. Any taxa denoted as having a relative abundance of zero for all sites after rounding were excluded from further analyses and were not included in tables. For the remaining taxa, all zero values in the dataset were replaced with 1 × 10−6. One-way analysis of variance (ANOVAs) were carried out using a generalized linear mixed model (GLIMMIX; SAS) assuming distributions appropriate for the response types. In the case of continuous descriptive variables presented in Table 1, all data except for time postpartum, maternal height, and weight-for-length infant z-scores (which were all normally distributed) were assumed to be log-normally distributed; for binary variables, we assumed a binomial distribution, and for relative abundance (proportional data) we utilized a beta distribution. For infant fecal diversity indices, data were assumed a log-normal transformation; for milk, richness, inverse Simpson, and Fisher diversity values were assumed a log-normal transformation; Shannon diversity values were untransformed. For all ANOVA and associated pairwise comparisons, a Bonferroni correction for multiple comparisons was applied. In the case of multiple comparisons, P-values presented are Bonferroni's adjusted P.
Hierarchical clustering was performed on relative abundance data using the vegan package (43), specifically the vegdist function and hclust function, in R using a Bray-Curtis dissimilarity matrix and average linkage hierarchical clustering. Non-metric multidimensional scaling (NMDS) plots were created in R using the vegan and ggplot2 (44) packages using rounded data and the aggregate taxa lists for milk and infant feces.
For analyses exploring associations between microbial communities of milk and infant fecal samples, only matched-dyad samples were included (n = 360). Heatmaps based upon Spearman rank correlations were constructed in R using the stats (42) and gplots (45) packages to evaluate relationships between the 28 aggregated genera for infant fecal samples and the 29 aggregated genera in milk. Canonical correlation analysis was performed in SAS to explore communities in the data set as a whole (all cohorts combined) using the aggregated infant fecal and aggregated milk genera. For canonical correlations, relative abundance data within each observation were first transformed using a logit transformation. To compare the within- and between-group similarity of bacterial communities, an analysis of similarity (ANOSIM) was performed in R with the vegan package using Bray-Curtis distance and 999 permutations. To evaluate maternal/infant dyad similarity, Bray-Curtis dissimilarity and Jaccard indices were calculated for all matched dyads and all combinations of non-matched dyads, and Wilcoxon test was used to determine differences between these values (14). Associations between diversity indices in milk and infant feces were evaluated using Spearman rank correlations.
Description of Subjects
There were many notable differences among the cohorts, as summarized in Table 1. For example, Spanish women were older than women in ETR, ETU, GBR, GBU, GN, KE, PE, and USW (P ≤ 0.0017); women in SW were breastfeeding younger infants than women in ETR, KE, SP, and USW (P ≤ 0.0328); and there were fewer vaginally-delivered infants in PE than in ETR, ETU, GBR, GBU, GN, and SP (P < 0.0001). Women in GBR and ETR had a higher parity than women in all cohorts except GBU and KE (P = 0.0411). Maternal BMI was higher in women from PE and USC than women from all African cohorts and SP (P ≤ 0.0113), and women in SW were taller than women in ETR, ETU, GBR, GN, KE, PE, and USC (P ≤ 0.0174). In addition, exclusive breastfeeding was more common in ETR, ETU, and GBR than in GN, KE, and PE (P ≤ 0.0052). Kenyan infants were heavier than infants in GBR, GBU, GN, SP, SW, and USW (P ≤ 0.0231); accordingly, infants in KE had a higher weight-for-length z-score than infants in ETR, ETU, GBU, GN, SP, SW, and USW (P ≤ 0.0157), though notably, the average z-score for all cohorts was within the normal range (-2 < z < 2).
The sequencing run for the 398 infant fecal samples generated 4,385,982 reads, with a mean (± standard deviation, SD) of 11,020 ± 6,632 reads and a range of 10 to 40,267 reads following initial processing using the DADA2 workflow. After additional filtering of any read that could not be classified to the genus level, and omitting any sample with < 1,000 reads, the infant fecal dataset analyzed here contained 4,314,551 reads across 377 samples, with a mean (± SD) of 11,444 ± 6,198; and a range of 1,662–40,255 reads. For the 409 milk samples, sequencing generated 7,528,193 reads, with a mean (± SD) of 18,406 ± 18,389 reads and a range of 12−141,620 reads following initial processing using the DADA2 workflow. Using the same filtering criteria as the infant fecal dataset, the milk dataset used here contained 6,709,277 reads across 394 samples, with mean (± SD) of 17,029 ± 16,783, and a range of 1,302–130,700 reads. These curated datasets were used for all further analyses.
Infant Fecal Microbiome: Individual Phyla Analysis
Pie charts illustrating the mean relative abundances of bacterial phyla in infant feces are provided in Figure 2A (mean values available in Supplementary Table 2). Overall, Firmicutes, Proteobacteria, Bacteroidetes, and Actinobacteria together composed > 99.5% of the bacteria identified, though notably only 5 phyla were identified in infant feces; the “other” category, defined as all identified phyla besides the top 4 described above, was entirely composed of Verrucomicrobia (~0.5%). To analyze differences in the relative abundances of these phyla among cohorts, ANOVA was performed and indicated an effect of cohort existed for Firmicutes and Actinobacteria. The relative abundance of Firmicutes was higher in feces from ETR than feces from GN, SP, and USW (P ≤ 0.0374) and Actinobacteria was lower in feces from ETR, KE, PE, and USC than feces from GN (P ≤ 0.0360).
Figure 2. Mean relative abundances of the bacterial (A) phyla and (B) an aggregation of the 10 most-abundant bacterial genera in each cohort in infant feces. ETR, rural Ethiopia; ETU, urban Ethiopia; GBR, rural Gambia; GBU, urban Gambia; GN, Ghana; KE, Kenya; SP, Spain; SW, Sweden; PE, Peru; USC, California (United States); USW, Washington (United States).
Infant Fecal Microbiome: Individual Genera Analysis
There was also variation in the relative abundance of bacterial genera both among (Figure 2B; Table 2) and within (Supplementary Figures 1–11) cohorts. There was a statistical effect of cohort on 8 of the 28 aggregate genera, 7 of which were among the most abundant taxa (Table 2). Differences between cohorts varied by genera. For example, feces from GN, SP, SW, USC, and USW had lower relative abundance of Streptococcus than feces from GBR and GBU (P ≤ 0.0012). Feces from infants in USW had lower relative abundance of Escherichia/Shigella than those in ETU and PE (P ≤ 0.0399). Feces from infants in GN had a lower relative abundance of Veillonella than infants in ETR, GBR, KE, PE, and SP (P ≤ 0.0105). Conversely, GN infants' feces contained relatively more Bifidobacterium than feces from ETR, PE, and USC infants (P ≤ 0.0310). Relative abundance of Bacteroides was higher in feces from infants in USW than GBR, GBU, and KE (P ≤ 0.0403). Feces from infants in the two rural populations (ETR and GBR) had higher relative abundance of Lactobacillus than feces from PE, SP, SW, USC, and USW (P ≤ 0.0122). Relative abundance of Clostridium sensu stricto 1 was higher in SW than GBU (P = 0.0350), and feces from infants in GN had a higher relative abundance of Enterococcus than feces from infants in ETR, ETU, GBR, KE, PE, USC, and USW (P ≤ 0.0347). For more details, see Table 2.
Table 2. Effect of cohort on relative abundances (%) of the aggregate 28 bacterial genera in infant feces.
Infant Fecal Microbiome: Community Analysis
Considering similarities and differences among the collective relative abundances of the aggregate genera, hierarchical clustering patterns (Figure 3A) suggest that fecal bacterial community structure in USC and SW, as compared to African cohorts, clustered together and were characterized by relatively high amounts of Bacteroides, Clostridium sensu stricto 1, and Parabacteroides, but relatively low amounts of Lactobacillus. Fecal microbial communities in GBU were closely related to those in GBR and GN, a clade characterized by high relative abundance of Streptococcus, Bifidobacterium, and Lactobacillus, and low abundance of Bacteroides. GN differed from GBU and GBR in that Veillonella was low in GN.
Figure 3. Hierarchical clustering (vertical axis) of the mean relative abundances for an aggregation of the 10 most-abundant bacterial genera from each cohort in (A) infant feces and (B) milk. ETR, rural Ethiopia; ETU, urban Ethiopia; GBR, rural Gambia; GBU, urban Gambia; GN, Ghana; KE, Kenya; SP, Spain; SW, Sweden; PE, Peru; USC, California (United States); USW, Washington (United States).
NMDS plots suggested no clear clustering by cohort (Supplementary Figure 12A). Although there was considerable variation in fecal bacterial composition among infants within a cohort (Supplementary Figures 1–11), there was more similarity within a cohort than among a random subsampling across cohorts (ANOSIM R = 0.1318, P < 0.001). In other words, samples within a cohort were more similar to each other than would be expected by random chance.
Infant Fecal “Core” and “Unique” Bacteria
Overall, Streptococcus, Escherichia/Shigella, and Veillonella were identified as core taxa in infant feces, being present in 98.4, 91.7, and 90.2% (respectively) of all samples (Table 3; Supplementary Figure 13). When considered within each cohort, there were sometimes different sets of core taxa. For example, Lactobacillus was part of the core taxa for ETR, GBR, GN, and KE, and Bacteroides was part of the core taxa for USW. There were no unique bacterial genera identified within any cohort.
Table 3. Percent of infant feces and milk samples with quantifiable amounts of niche-specific “core” genera, defined as being present in ≥ 90% of the samples in a single cohort (cohort core) or in the overall datasets (overall core).
Infant Fecal Microbiome: Diversity Measures
Microbial diversity of infant feces also varied by cohort (Table 4). Richness and Fisher diversity of KE feces were higher than those of feces from ETU, GN, PE, SP, and USC (P ≤ 0.0127 and ≤ 0.0111, respectively). Conversely, richness and Fisher's diversity score of fecal samples collected in USC were the lowest of all cohorts (P ≤ 0.0126, P ≤ 0.0164, respectively). This finding, however, may be due to the small sample size in the USC cohort and should therefore be interpreted cautiously. Feces from infants in KE had higher Shannon diversity than those collected in GN and USC (P ≤ 0.0448). Feces from GN had a lower inverse Shannon diversity score than those from ETR, ETU, GBR, GBU, GN, KE, and PE (P ≤ 0.0354).
Table 4. Effect of cohort on microbial diversity indices (rarefied to 1000 reads) of infant feces and milk. Values are model estimates and, except for Shannon diversity which was normally distributed, were back transformed from estimated log values.
Milk Microbiome: Individual Phyla Analysis
A total of 15 phyla were identified in milk, with Firmicutes, Proteobacteria, Actinobacteria, and Bacteroidetes collectively representing 97.7% of those identified (Figure 4A; Supplementary Table 3). There was an effect of cohort on Firmicutes, Proteobacteria, Actinobacteria, Bacteroidetes, and “other.” ANOVA analysis indicated that the relative abundance of Firmicutes was lower in ETR than in all cohorts besides GBR and USC (P ≤ 0.0139). Proteobacteria was relatively more abundant in milk collected in ETR than all other cohorts (P ≤ 0.0077). Actinobacteria in milk was more abundant in ETR, ETU, GBR, and GBU than in GN, USC, SP, and PE (P ≤ 0.0014). Relative abundance of Bacteroidetes was higher in KE than GN (P = 0.0060). There was also an effect of cohort on the “other” category; relative abundance in GBR was higher than in ETR, ETU, GN, PE, SP, SW, and USC (P ≤ 0.0036).
Figure 4. Mean relative abundances of bacterial (A) phyla and (B) an aggregation of the 10 most-abundant genera in milk in each cohort. ETR, rural Ethiopia; ETU, urban Ethiopia; GBR, rural Gambia; GBU, urban Gambia; GN, Ghana; KE, Kenya; SP, Spain; SW, Sweden; PE, Peru; USC, California (United States); USW, Washington (United States).
Milk Microbiome: Individual Genera Analysis
There was also variation in milk microbiome at the genus level both within and among cohorts (Supplementary Figures 14–24 and Figure 4B, respectively). Of the 29 genera evaluated, ANOVA indicated that there was an effect of cohort on 19 of them (Table 5). Examples of differences among cohorts include a higher relative abundance of Rhizobium, Achromobacter, and Psychrobacter in milk collected in ETR than all other cohorts (P < 0.0001). Except for ETU, milk collected in ETR also had a higher relative abundance of Corynebacterium 1 than all other cohorts (P ≤ 0.0417). Peruvian milk bacterial communities, on average, were comprised of 50% Streptococcus, which was relatively higher than all African (ETR, ETU, GBR, GBU, GN, and KE) and US (USW and USC) samples (P ≤ 0.0386). Milk from women in USW had relatively more Dyella than all sites except The Gambia (GBR and GBU) (P ≤ 0.0040).
Milk Microbiome: Community Analysis
Hierarchical clustering of the complex bacterial community structures in milk (Figure 3B) suggested that samples collected in GBR and GBU milk clustered together with high abundance of Streptococcus, and intermediate abundances of Staphylococcus, Corynebacterium 1, Dyella, and Bacillus. PE and SP were relatively similar, characterized by high relative abundance of Streptococcus, Staphylococcus, and Rothia, and low abundance of Corynebacterium 1. It is noteworthy that ETR formed an outgroup characterized by relatively high abundance of Rhizobium and Achromobacter; intermediate abundances of Streptococcus and Staphylococcus; and very little Propionibacterium, Dyella, and Rothia. Samples from GN also formed a unique clade with relatively high levels of Lactobacillus, Klebsiella, and Enterococcus.
Like feces, there was more similarity in the milk bacterial composition within a cohort than across cohorts (ANOSIM R = 0.2244, P = 0.001). NMDS plots were created to evaluate the similarity of cohorts, but no clear clustering was observed (Supplementary Figure 12B).
Milk Microbiome: Core and Unique Bacteria
For milk, Staphylococcus and Streptococcus were identified as core genera, being present in 98.7 and 97.7% of all samples, respectively (Table 3; Supplementary Figure 13). However, as with feces, there were sometimes different sets of core taxa within each cohort: Propionibacterium was also present in 90.5% of milk from KE, Dyella in 94.9% of milk collected in USW, and Corynebacterium 1 in 95.0 and 94.1% of milk samples collected in ETR and ETU, respectively. In addition, it is noteworthy that milk collected in ETR contained 8 core genera, including Rhizobium, Brevundimonas, and Achromobacter, which were present in every sample from this location. There were also several unique bacteria identified in some cohorts in milk: only milk collected in ETR contained Acidothermus, Demequina, Flaviflexus, and Pediococcus; milk from GBU uniquely contained Chroococcidiopsis and Isoptericola; and milk from GN uniquely contained Akkermansia and Butyricicoccus.
Milk Microbiome: Diversity Measures
There was an effect of cohort on all indices of milk microbial diversity considered (P < 0.0001; Table 4). Milk collected in ETR had a higher richness and a higher Fisher diversity scores than milk from ETU, GN, PE, SP, SW, USC, and USW (P ≤ 0.0181; P ≤ 0.0106, respectively). The mean Shannon diversity score of milk from ETR was higher than those of milk from GN, PE, SP, SW, and USC (P ≤ 0.0182). Inverse Simpson diversity scores followed similar trends as Shannon diversity.
Relationships Between Milk and Fecal Microbiomes
A few simple relationships (P ≤ 0.01, −0.3 ≥ rs ≥ 0.3) were observed between relative abundances of the 28 aggregated genera in infant fecal and 29 aggregated genera in milk samples (Figure 5). Relative abundances of Psychrobacter and Achromobacter in milk were positively correlated with Leuconostoc in feces, and the relative abundance of Lactobacillus in milk positively correlated with Lactobacillus in feces.
Figure 5. Spearman rank correlations between an aggregation of the 10 most-abundant bacterial genera in infant feces and an aggregation of the 10 most-abundant bacterial genera in milk. Stars indicate P < 0.01 and rs < −0.3 or rs > 0.3.
On a multivariate basis, canonical correlations between the aggregated milk genera and the aggregated infant fecal genera support a strong relationship between these varied and complex communities (r1 = 0.663 P < 0.0001; Figure 6). In this analysis, the first canonical component for infant feces were largely driven by Lactobacillus (rs = 0.508) and Leuconostoc (rs = 0.537), and the first canonical component in milk was driven by Lactobacillus (rs = −0.498). When plotted by cohort, the relationships among the infant fecal and milk taxa appear to be specific to cohort; for example, the relationship (using the first canonical component) between milk and infant fecal microbiomes was strong in PE but diminished in SW. Correlations between milk and infant fecal diversity indices did not reveal significant correlations in any cohort except for KE where richness and Fisher diversity of infant feces and milk were positively correlated with each other (r = 0.31, P = 0.0453; Supplementary Table 4).
Figure 6. Canonical correlations between aggregations of the 10 most-abundant bacterial genera in milk (x-axes) and aggregations of the 10 most-abundant bacterial genera in infant feces (y-axes) in each cohort. The overall correlation is plotted in the upper-left panel; individual cohorts are illustrated in subsequent plots.
Microbial Relationships Between Women and Their Infants
Upon evaluating the relationship between a mother's milk microbiome and her infant's fecal microbiome using both the Jaccard and Bray-Curtis distance metrics, the mother/infant dyads' samples were found to be more similar or tended to be more similar to each other than to all other combinations of mothers and infants (with Jaccard index: P = 0.0258; with Bray-Curtis dissimilarity: P = 0.0936). When evaluated within each cohort, microbial communities of a mother's milk and her infant's feces were not more similar to each other than to all other combinations of mother/infant dyads (all P > 0.05 for both matrices), except for in mother/infant samples from ETU in which the Bray-Curtis distance metric showed that bacterial communities between mothers and their infants were more similar to each other (P = 0.0185) than to other random combinations within the cohort.
Data from this study support our hypotheses that: (1) the human milk and infant fecal microbiomes vary among global cohorts; (2) there exists a small core group of bacteria common to milk across all cohorts (although some cohorts had additional taxa which composed their own unique core); (3) there exists a small core group of bacteria common to infant feces across all cohorts (although some cohorts had their own cores more different from the overall core); and (4) variation in the milk microbiome is related to variation in the infant fecal microbiome (although this was more apparent at the community level than the individual taxa level). Our hypothesis that fecal microbiomes of infants and the milk microbiomes of their mothers would be more similar within a cohort than to other cohorts was also supported, though the variation among both women and infants within a cohort was only slightly less than the variation among cohorts. This individual variation should be noted, because it is possible that milk and infant fecal microbiomes are tailored to a given environment. Additionally, the possibility exists that both the membership and structure of milk and fecal microbial communities may also be tailored to lifestyle and behavioral factors associated with individual maternal/infant dyads (16, 46, 47). For this reason, these results highlight the need for additional work comparing populations of women and infants globally if we are ever to understand whether a “healthy” or “normal” human milk or infant fecal microbiome exists.
Based on the summary statistics calculated from anthropometric and health data evaluated, we believe our data are representative of generally healthy women and their infants in the locations studied. However, due to the high level of individual microbial variation present among these samples, even within a cohort, these data may not be representative of neighboring populations or countries. Indeed, Meehan et al. (16) determined that among foragers and horticulturalist women in the Central African Republic who spend considerable time in in proximity to each other, milk microbial communities vary significantly both within populations and between ethnic groups. In the present study, even though Kenya and Ethiopia are geographical neighbors, substantial differences in microbial communities existed. For example, Veillonella and Lactobacillus were members of the core fecal genera in KE and ETR, but not ETU. Bacterial richness in feces from KE was greater than that from ETU, while diversity of milk from ETR and ETU were generally similar to that of KE. Additionally, milk collected from women in ETR contained relatively more Achromobacter and Rhizobium than all other sites. While these differences are not comprehensive among these cohorts, these examples illustrate that even in close geographical proximity (KE to ETR and ETU) and with genetic similarity (ETR and ETU, and GBR and GBU), there are substantial differences between milk and infant fecal microbial community structures that cannot be ignored, particularly in the framework of recommendations for healthy breastfeeding women/breastfed infants.
Our results are generally congruent with the limited number of other studies of the infant fecal microbiome among similarly-aged infants. For example, Bifidobacterium, Streptococcus, and the family Enterobacteriacae, which includes both Enterobacter and Escherichia/Shigella, were previously reported to be the most abundant taxa in feces from Gambian infants (24). Our data show that infant feces from Gambian infants also contain substantial amounts of Lactobacillus and Veillonella. Consistent with our results, Backhëd et al. (20) found Bacteroides, Bifidobacteria, Prevotella, Streptococcus, Veillonella, and Enterobacteriacae to be abundant in feces of Swedish infants.
It is important to note that feces collected in ETR were preserved with RNAlater, a method previously used for this purpose (48–50). However, RNAlater may introduce biases, such as an increased observation of members of the Bacteroidetes phylum, and a decrease in the members of Actinobacteria relative to unpreserved control samples; this bias has been demonstrated in soil samples and fecal samples (51–55). Despite this methodological difference, feces collected in ETR—at least with respect to phyla and the most abundant genera—were similar to those collected at its closest neighboring site, ETU. However, there were differences with respect to the core taxa, and samples from ETU formed a cluster with feces from KE and PE, while ETR was an outgroup in this clade. Further work will be needed to determine if the use of RNAlater does or does not influence bacterial communities identified in infant fecal samples collected across various cohorts.
Milk microbiomes characterized here were also relatively similar to those described in the literature previously. For instance, Kumar et al. (17) reported that relative abundance of Proteobacteria was highest in milk produced by women in South Africa; that produced by Finnish women was highest in Firmicutes; that of Chinese women had the highest Streptococcus; and that produced by Spanish women had the highest Propionibacterium and Pseudomonas. In our study, milk produced in ETR had the most Proteobacteria and Rhizobium, whereas that produced in PE had the highest level of Firmicutes, and that produced in SP the highest Propionibacterium. Despite these differences, most of the dominant taxa present in the study of Kumar and colleagues were also well represented in our study. Nonetheless, it remains unclear as to whether the differences in relative abundance between our present study and that of Kumar are genuine or due to methodologic differences. For example, Kumar and colleagues used the same sequencing platform, but chose primers targeting the V4 hypervariable region of the 16S rRNA gene, whereas our primers targeted the V1–V3 region. In this case, we also were only able to use the forward read, which could have been a source of bias in elucidating the microbial genera present.
Of note are the compositional differences in the ETR milk as compared to all other cohorts. Unlike all the others which were frozen upon collection, these samples were chemically preserved. The use of chemical preservatives at this site was necessary because freezing the samples was logistically difficult. While Milk Preservation Solution performed well (as compared to other preservatives) in a test of utility in preserving bacterial DNA for microbial analysis (28), the methods used for extracting the ETR samples are different than those employed on all other milk samples in this study. For this reason, results from ETR should be interpreted cautiously. For example, one notable difference we found was the high richness and diversity in the ETR milk samples as compared to the other sites. While this could be biologically relevant, this also could be an artifact of the method used for these samples.
Important for further interpretation of the findings in this study is the fact that, in addition to its microbial communities, milk's micronutrient and macronutrient compositions vary globally. Whether milk's microbiome and nutrient compositions are related has not been studied, although data from our group clearly suggests that maternal nutrient intake (which can be related to milk's nutrient composition) is related to milk microbiome (56). One example particularly relevant to the data presented here is related to human milk oligosaccharides (HMO), which vary across populations—including those reported here (26, 57). Because HMO can act as prebiotics, it is possible that variation in HMO profiles might drive variation in both milk and infant fecal microbiomes. This possibility will be explored thoroughly in subsequent publications. We have also analyzed the milk samples described in this report for their complex immune factor profiles, which also vary (25); as with HMO, subsequent publications will evaluate potential relationships among milk's immune factors, the milk microbiome, and the infant fecal microbiome. While the etiology of these population-level differences has not been completely elucidated, the potential importance of these multi-faceted differences in shaping milk's microbial communities should continue to be evaluated.
Importantly, our data provide evidence of relationships between the milk microbiome and the fecal microbiome of breastfed infants. For example, the relative abundance of Lactobacillus in milk was positively correlated with the relative abundance of Lactobacillus in infant feces. Given the fact that this genus was also determined to be the primary factor that distinguished the first and second canonical axes in our multivariate correlation analyses, it is plausible that human milk may be an important source of Lactobacillus for the developing infant's GI tract, as has been demonstrated through culture-dependent analyses (3–5). Future work focusing on the species and strains of Lactobacillus at both body sites is needed to understand these complex communities. Our data support a handful of studies published previously on this topic. For example, Murphy et al. (12) collected milk and infant fecal samples from 10 mother-infant pairs at 1, 3, 6, and 12 wk postpartum. Approximately 70–88% of the genera identified in infant feces were also identified in human milk. The most abundant genera in infant feces were Streptococcus, Escherichia/Shigella, Bifidobacterium, and Veillonella, which were the 4 most abundant genera in infant fecal samples in our study. Murphy and colleagues also observed that human milk bacterial communities exhibited greater diversity than infant fecal bacterial communities collected at the same time. With the exception of GN, more taxa were identified in milk than infant feces here as well. However, except for KE where bacterial richness of infant feces was positively correlated with that of milk, we found no correlations between the richness of milk and richness of infant feces in the populations we studied.
There are several important limitations that should be considered when interpreting the findings reported here. As previously discussed, we chemically preserved samples collected only in ETR. In addition, for practical and logistic reasons, we also used a combination of methods for collecting the milk (electric pump vs. hand expression). The impact of these methodological differences on milk's microbial composition is unknown. Additionally, limited cohort sample sizes (particularly with respect to USC), restricts the ability to conduct some statistical analyses. Furthermore, because this study is the first of its kind to this scale, we do not know if our results are comparable to other studies on milk and fecal microbial compositions, particularly as it relates to methodological differences. For example, some genera, such as Bifidobacterium, Ruminococcus, and Coprococcus, are present in higher relative abundances when mechanical lysis (via bead-beating) is used (58) due to the difficulty in cell membrane disruption for these taxa. It is important to note that characterization and analyses of the bacteria at a level lower than their genera here were not performed. Future studies should evaluate these communities at the species (and perhaps sub-species) level. The selection of a hypervariable region to target for 16S rRNA analysis is also a source of potential bias in our study (59). We chose V1–V3 because it has been previously used in our laboratory to categorize and classify the bacterial community structure of both milk and infant feces (16, 30, 56). However, future studies should be designed to determine if this hypervariable region is optimal for this application. Additionally, one important limitation to 16S-based sequencing techniques is the inability to distinguish live bacteria from dead bacteria or residual bacterial DNA that may be present in a sample. As more is understood about the microbial communities in various populations globally and at various body sites, more targeted, culture- or RNA-based approaches can be used in conjunction with 16S sequencing to more thoroughly address these questions of live contributors to the milk and infant fecal microbiomes.
Nonetheless, this research represents the largest, cross-cultural, international study to date using standardized methods of collection and analysis for characterizing the microbial compositions of human milk and infant feces. Data from this study clearly indicate that what is “normal” in terms of milk and fecal microbiomes of healthy breastfeeding mothers and their infants, respectively, varies around the world. In addition, we provide compelling evidence that the milk microbiome might, at least in part, play a role in shaping the infant's GI microbiome. It is likely that there are many additional factors that play into these findings, such as antibiotic usage, infant age, parity, infant sex, and exclusive breastfeeding status. These factors are likely important to understanding what are normal for milk and infant fecal microbial community structures. The goal of the current study was to present the microbial communities across a broad range of populations, and subsequent studies will focus on parsing the relationships among these factors and the milk and infant fecal microbiomes. For these findings to have impact, however, researchers must strive to better understand the genesis of microbial differences within and among cohorts, and if this variation is related to maternal and infant health.
In conclusion, this study is the largest of its kind to use standardized methodologies to characterize and compare the milk and infant fecal microbiomes of maternal/infant dyads worldwide. Substantial differences in both the composition and relative abundance of specific taxa were present among cohorts. We also found substantial variation among women/infants within a cohort, suggesting that environment alone does not drive variation in milk and fecal microbial community structure. Additionally, relationships present among the genera in milk and the genera in infant feces which vary among cohorts suggest that milk may be tailored not only to infants in a given environment, but also specifically to the needs of individuals. As such, we conclude that what is “normal” in terms of the fecal microbiome of healthy infants and milk microbiome of healthy lactating women varies by culture and/or location. Further, we posit that bacterial compositions of human milk and feces of breastfed infants are likely specific to a dyad within a culture and location. Future studies should evaluate the bacterial species and sub-species encompassed in these genera, their functionality, and whether or not variation is related to maternal and infant health.
This study was carried out in accordance with the recommendations of the Washington State University Institutional Review Board and the ethics approval committees at each participating location with informed consent from all subjects. All subjects gave informed consent in accordance with the Declaration of Helsinki. The protocol was approved by the Washington State University Institutional Review Board (approval #13264).
MKM, CM, MAM, JW, JF, DS, LK, JR, LB, RP, and AP designed the study. KL, JZ, EB, CM, EK-M, EK, DK, SM, SEM, GO, CG-C, EJ, and LR collected the samples, KL, EB, and JZ performed the laboratory analyses, KL, WP, and JW performed the data analyses, KL and MKM wrote the manuscript. All authors read, contributed to, and approved the final manuscript.
This study was supported by the National Science Foundation (award 1344288), the Ministry of Economy and Competitiveness, Spain (project AGL2013-4190-P), and the European Commission [grant 624773 (FP-7-PEOPLE-2013-IEF)]. Sterile, single-use milk-collection kits were generously provided by Medela Inc. This work was supported in part by NIH COBRE Phase III grant P30GM103324.
Conflict of Interest Statement
CG-C and EJ are employed by company Probisearch.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
We sincerely thank Andrew Doel (Medical Research Council Unit, The Gambia) for field supervision and logistics planning and Alansan Sey for questionnaire administration and taking anthropometric measurements in The Gambia; Jane Odei (University of Ghana) for supervising field data collection in Ghana; Katherine Flores (Washington State University), Dubale Gebeyehu (Hawassa University), Haile Belachew (Hawassa University), and Birhanu Sintayehu for planning, logistics, recruiting, and data collection and the administration and staff at Adare Hospital in Hawassa for assistance with logistics in Ethiopia; Catherine O Sarange (Egerton University) for field supervision and logistics planning and Milka W. Churuge and Minne M Gachau for recruiting, questionnaire administration, and taking anthropometric measurements in Kenya; Gisella Barbagelatta (Instituto de Investigación Nutricional) for field supervision and logistics planning, Patricia Calderon (Instituto de Investigación Nutricional) for recruiting, questionnaire administration, and taking anthropometric measurements, and Roxana Barrutia (Instituto de Investigación Nutricional) for the management and shipping of samples in Peru; Leonides Fernandez, and Irene Espinosa (Complutense University of Madrid) for technical assistance, expertise, and review of the manuscript and M Ángeles Checa (Zaragoza, Spain), Katalina Legarra (Guernica, Spain), and Julia Mínguez (Huesca, Spain) for participation in the collection of samples in Spain; Kirsti Kaski and Maije Sjöstrand (both Helsingborg Hospital) for participation in the collection of samples, questionnaire administration, and anthropometric measurements in Sweden; Renee Bridge and Kara Sunderland (both University of California, San Diego); Janae Carrothers and Shelby Hix (Washington State University) for logistics planning, recruiting, questionnaire administration, sample collection, and taking anthropometric measurements in California and Washington; Romana J. Hyde and Morgan M. Potton for their help in sample analysis; Glenn Miller (Washington State University) for his expertise and critical logistic help that were needed for shipping samples and supplies worldwide; and Ryan Pace (University of Idaho) for his assistance in bioinformatics and analyses and review of the manuscript.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnut.2019.00045/full#supplementary-material
ANOSIM, analysis of similarity; ANOVA, analysis of variance; GI, gastrointestinal; GLIMMIX, generalized linear mixed model; HMO, human milk oligosaccharide; ASV, amplicon sequence variants; ETR, Ethiopia, rural site; ETU, Ethiopia, urban site; GBR, The Gambia, rural site; GBU, The Gambia, urban site; GN, Ghana; KE, Kenya; PE, Peru; SP, Spain; SW, Sweden; USC, United States, California site; USW, United States, Washington site.
1. Fernández L, Langa S, Martín V, Maldonado A, Jiménez E, Martín R, et al. The human milk microbiota: origin and potential roles in health and disease. Pharmacol Res. (2013) 69:1–10. doi: 10.1016/j.phrs.2012.09.001
2. Milani C, Duranti S, Bottacini F, Casey E, Turroni F, Mahony J, et al. The first microbial colonizers of the human gut: composition, activities, and health implications of the infant gut microbiota. Microbiol Mol Biol Rev. (2017) 81:e00036–17. doi: 10.1128/MMBR.00036-17
4. Martín R, Heilig GH, Zoetendal EG, Smidt H, Rodríguez JM. Diversity of the Lactobacillus group in breast milk and vagina of healthy women and potential role in the colonization of the infant gut. J Appl Microbiol. (2007) 103:2638–44. doi: 10.1111/j.1365-2672.2007.03497.x
5. Martín V, Maldonado-Barragán A, Moles L, Rodriguez-Baños M, Campo RD, Fernández L, et al. Sharing of bacterial strains between breast milk and infant feces. J Hum Lact. (2012) 28:36–44. doi: 10.1177/0890334411424729
7. Hunt KM, Foster JA, Forney LJ, Schütte UM, Beck DL, Abdo Z, et al. Characterization of the diversity and temporal stability of bacterial communities in human milk. PLoS ONE. (2011) 6:e21313. doi: 10.1371/journal.pone.0021313
8. Cabrera-Rubio R, Collado MC, Laitinen K, Salminen S, Isolauri E, Mira A. The human milk microbiome changes over lactation and is shaped by maternal weight and mode of delivery. Am J Clin Nutr. (2012) 96:544–51. doi: 10.3945/ajcn.112.037382
9. Davé V, Street K, Francis S, Bradman A, Riley L, Eskenazi B, et al. Bacterial microbiome of breast milk and child saliva from low-income Mexican-American women and children. Pediatr Res. (2016) 79:846–54. doi: 10.1038/pr.2016.9
10. Li SW, Watanabe K, Hsu CC, Chao SH, Yang ZH, Lin YJ, et al. Bacterial composition and diversity in breast milk samples from mothers living in Taiwan and mainland China. Front Microbiol. (2017) 8:965. doi: 10.3389/fmicb.2017.00965
12. Murphy K, Curley D, O'Callaghan TF, O'Shea CA, Dempsey EM, O'Toole PW, et al. The composition of human milk and infant faecal microbiota over the first three months of life: a pilot study. Sci Rep. (2017) 7:40597. doi: 10.1038/srep40597
14. Pannaraj PS, Li F, Cerini C, Bender JM, Yang S, Rollie A, et al. Association between breast milk bacterial communities and establishment and development of the infant gut microbiome. JAMA Pediatr. (2017) 171:647–54. doi: 10.1001/jamapediatrics.2017.0378
15. Patel SH, Vaidya YH, Patel RJ, Pandit RJ, Joshi CG, Kunjadiya AP. Culture independent assessment of human milk microbial community in lactational mastitis. Sci Rep. (2017) 7:7804. doi: 10.1038/s41598-017-08451-7
16. Meehan CL, Lackey KA, Hagen EH, Williams JE, Roulette J, Helfrecht C, et al. Social networks, cooperative breeding, and the human milk microbiome (HMM). Am J Hum Biol. (2018) 30:e23131. doi: 10.1002/ajhb.23131
17. Kumar H, du Toit E, Kulkarni A, Aakko J, Linderborg KM, Zhang Y, et al. Distinct patterns in human milk microbiota and fatty acid profiles across specific geographic locations. Front Microbiol. (2016) 7:1619. doi: 10.3389/fmicb.2016.01619
19. Azad MB, Konya T, Maughan H, Guttman DS, Field CJ, Chari RS, et al. Gut microbiota of healthy Canadian infants: profiles by mode of delivery and infant diet at 4 months. CMAJ. (2013) 85:385–94. doi: 10.1503/cmaj.121189
20. Bäckhed F, Roswall J, Peng Y, Feng Q, Jia H, Kovatcheva-Datchary P, et al. Dynamics and stabilization of the human gut microbiome during the first year of life. Cell Host Microbe. (2015) 17:852. doi: 10.1016/j.chom.2015.05.012
21. Rodríguez JM, Murphy K, Stanton C, Ross RP, Kober OI, Juge N, et al. The composition of the gut microbiota throughout life, with an emphasis on early life. Microb Ecol Health Dis. (2015) 26:26050. doi: 10.3402/mehd.v26.26050
22. Martin R, Makino H, Cetinyurek Yavuz A, Ben-Amor K, Roelofs M, Ishikawa E, et al. Early-life events, including mode of delivery and type of feeding, siblings and gender, shape the developing gut microbiota. PLoS ONE. (2016) 11:e0158498. doi: 10.1371/journal.pone.0158498
23. Grześkowiak Ł, Collado MC, Mangani C, Maleta K, Laitinen K, Ashorn P, et al. Distinct gut microbiota in southeastern African and northern European infants. J Pediatr Gastroenterol Nutr. (2012) 54:812–6. doi: 10.1097/MPG.0b013e318249039c
24. Davis JC, Lewis ZT, Krishnan S, Bernstein RM, Moore SE, Prentice AM, et al. Growth and morbidity of Gambian infants are influenced by maternal milk oligosaccharides and infant gut microbiota. Sci Rep. (2017) 7:40466. doi: 10.1038/srep40466
25. Ruiz L, Espinosa-Martos I, García-Carral C, Manzano S, McGuire MK, Meehan CL, et al. What's Normal? Immune profiling of human milk from healthy women living in different geographical and socioeconomic settings. Front Immunol. (2017) 8:696. doi: 10.3389/fimmu.2017.00696
26. McGuire MK, Meehan CL, McGuire MA, Williams JE, Foster J, Sellen DW, et al. What's normal? Oligosaccharide concentrations and profiles in milk produced by healthy women vary geographically. Am J Clin Nutr. (2017) 105:1086–100. doi: 10.3945/ajcn.116.139980
27. World Health Organization. Child Growth Standards: Anthro and Macros. R (version 3.2.2). (2011). Available online at: http://www.who.int/childgrowth/software/en/
28. Lackey KA, Williams JE, Price WJ, Carrothers JM, Brooker SL, Shafii B, et al. Comparison of commercially-available preservatives for maintaining the integrity of bacterial DNA in human milk. J Microbiol Methods. (2017) 141:73–81. doi: 10.1016/j.mimet.2017.08.002
29. Frank JA, Reich CI, Sharma S, Weisbaum JS, Wilson BA, Olsen GJ. Critical evaluation of two primers commonly used for amplification of bacterial 16S rRNA genes. Appl Environ Microbiol. (2008) 74:2461–70. doi: 10.1128/AEM.02272-07
30. Carrothers JM, York MA, Brooker SL, Lackey KA, Williams JE, Shafii B, et al. Fecal microbial community structure is stable over time and related to variation in macronutrient and micronutrient intakes in lactating women. J Nutr. (2015) 145:2379–88. doi: 10.3945/jn.115.211110
32. Wang Q, Garrity GM, Tiedje JM, Cole JR. Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Appl Environ Microbiol. (2007) 73:5261–7. doi: 10.1128/AEM.00062-07
34. Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, et al. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res. (2013) 41(Database issue):D590–6. doi: 10.1093/nar/gks.1219
35. Yilmaz P, Parfrey LW, Yarza P, Gerken J, Pruesse E, Quast C, et al. The SILVA and “All-species Living Tree Project (LTP)” taxonomic frameworks. Nucleic Acids Res. (2014) 42(Database issue):D643–8. doi: 10.1093/nar/gkt1209
38. Morris EK, Caruso T, Buscot F, Fischer M, Hancock C, Maier TS, et al. Choosing and using diversity indices: insights for ecological applications from the German Biodiversity Exploratories. Ecol Evol. (2014) 4:3514–24. doi: 10.1002/ece3.1155
41. Fisher RA, Corbet AS, Williams CB. The relation between the number of species and the number of individuals in a random sample of an animal population. J Anim Ecol. (1943) 12:42–58. doi: 10.2307/1411
42. R Core Team. R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing, (2016). Available online at: https://www.R-project.org/
43. Oksanen J, Blanchet FG, Friendly M, Kindt R, Legendre P, McGlinn D, et al. vegan: Community Ecology Package. R Package Version 2.5-2. (2018). Available online at: https://CRAN.R-project.org/package=vegan
45. Warnes GR, Bolker B, Bonebakker L, Gentleman R, Huber W, Liaw A, et al. gplots: Various R Programming Tools for Plotting Data. R Package Version 3.0.1. (2016). Available online at: https://CRAN.R-project.org/package=gplots
46. McGuire MK, Meehan CL, Brooker SL, Williams JE, Foster JA, and McGuire MA. An evolutionary, biosocial perspective on variation in human milk microbes and oligosaccharides: An example of eco-homeorhesis? In: McGuire MK, McGuire MA, Bode L, editors. Prebiotics and Probiotics in Human Milk. Amsterdam: Elselvier, Inc. (2017).
48. De Filippo C, Cavalieri D, Di Paola M, Ramazzotti M, Poullet JB, Massart S, et al. Impact of diet in shaping gut microbiota revealed by a comparative study in children from Europe and rural Africa. Proc Natl Acad Sci USA. (2010) 107:14691–6. doi: 10.1073/pnas.1005963107
49. Vlčková K, Mrázek J, Kopečný J, and Petrželková KJ. Evaluation of different storage methods to characterize the fecal bacterial communities of captive western lowland gorillas (Gorilla gorilla). J Microbiol Methods. (2012) 91:45–51. doi: 10.1016/j.mimet.2012.07.015
50. Flores R, Shi J, Yu G, Ma B, Ravel J, Goedert JJ, et al. Collection media and delayed freezing effects on microbial composition of human stool. Microbiome. (2015) 3:33. doi: 10.1186/s40168-015-0092-7
52. McCarthy A, Chiang E, Schmidt ML, Denef VJ. RNA preservation agents and nucleic acid extraction method bias perceived bacterial community composition. PLoS ONE. (2015) 10:e0121659. doi: 10.1371/journal.pone.0121659
53. Gorzelak MA, Gill SK, Tasnim N, Ahmadi-Vand Z, Jay M, Gibson DL. Methods for improving human gut microbiome data by reducing variability through sample processing and storage of stool. PLoS ONE. (2015) 10:e0134802. doi: 10.1371/journal.pone.0134802
54. Song SJ, Amir A, Metcalf JL, Amato KR, Xu ZZ, Humphrey G, et al. Preservation methods differ in fecal microbiome stability, affecting suitability for field studies. mSystems. (2016) 1:e00021–16. doi: 10.1128/mSystems.00021-16
56. Williams JE, Carrothers JM, Lackey KA, Beatty NF, York MA, Brooker SL, et al. Human milk microbial community structure is relatively stable and related to variations in macronutrient and micronutrient intakes in healthy lactating women. J Nutr. (2017) 147:1739–48. doi: 10.3945/jn.117.248864
57. Erney RM, Malone WT, Skelding MB, Marcon AA, Kleman-Leyer KM, O'Ryan ML, et al. Variability of human milk neutral oligosaccharides in a diverse population. J Pediatr Gastroenterol Nutr. (2000) 30:181–92. doi: 10.1097/00005176-200002000-00016
60. Jiménez E, de Andrés J, Manrique M, Pareja-Tobes P, Tobes R, Martínez-Blanch JF, et al. Metagenomic analysis of milk of healthy and mastitis-suffering women. J Hum Lact. (2015) 31:406–15. doi: 10.1177/0890334415585078
61. Jost T, Lacroix C, Braegger C, Chassard C. Assessment of bacterial diversity in breast milk using culture-dependent and culture-independent approaches. Br J Nutr. (2013) 110:1253–62. doi: 10.1017/S0007114513000597
62. Biagi E, Aceti A, Quercia S, Beghetti I, Rampelli S, Turroni S, et al. Microbial community dynamics in mother's milk and infant's mouth and gut in moderately preterm infants. Front Microbiol. (2018) 9:2512. doi: 10.3389/fmicb.2018.02512
65. Boix-Amorós A, Collado MC, Mira A. Relationship between milk microbiota, bacterial load, macronutrients, and human cells during lactation. Front Microbiol. (2016) 7:492. doi: 10.3389/fmicb.2016.00492
66. Cabrera-Rubio R, Mira-Pascual L, Mira A, Collado MC. Impact of mode of delivery on the milk microbiota composition of healthy women. J Dev Orig Health Dis. (2016) 7:54–60. doi: 10.1017/S2040174415001397
67. Tuominen H, Rautava S, Collado MC, Syrjänen S, Rautava J. HPV infection and bacterial microbiota in breast milk and infant oral mucosa. PLoS ONE. (2018) 13:e0207016. doi: 10.1371/journal.pone.0207016
68. Urbaniak C, McMillan A, Angelini M, Gloor GB, Sumarah M, Burton JP, et al. Effect of chemotherapy on the microbiota and metabolome of human milk, a case report. Microbiome. (2014) 2:24. doi: 10.1186/2049-2618-2-24
69. Cabrera-Rubio R, Kunz C, Rudloff S, Garcia-Mantrana I, Crehuá-Gaudiza E, Martinez-Costa C, et al. Association of maternal secretor status and human milk oligosaccharides with milk microbiota: an observational pilot study. J Pediatr Gastroenterol Nutr. (2018) 68:256–63. doi: 10.1097/MPG.0000000000002216
Keywords: human milk, breastmilk, feces, microbiome, international, infant, breastfeeding, maternal
Citation: Lackey KA, Williams JE, Meehan CL, Zachek JA, Benda ED, Price WJ, Foster JA, Sellen DW, Kamau-Mbuthia EW, Kamundia EW, Mbugua S, Moore SE, Prentice AM, K. DG, Kvist LJ, Otoo GE, García-Carral C, Jiménez E, Ruiz L, Rodríguez JM, Pareja RG, Bode L, McGuire MA and McGuire MK (2019) What's Normal? Microbiomes in Human Milk and Infant Feces Are Related to Each Other but Vary Geographically: The INSPIRE Study. Front. Nutr. 6:45. doi: 10.3389/fnut.2019.00045
Received: 20 December 2018; Accepted: 27 March 2019;
Published: 17 April 2019.
Edited by:Aldo Corsetti, University of Teramo, Italy
Reviewed by:Arianna Aceti, University of Bologna, Italy
Veronique Demers-Mathieu, Oregon State University, United States
Copyright © 2019 Lackey, Williams, Meehan, Zachek, Benda, Price, Foster, Sellen, Kamau-Mbuthia, Kamundia, Mbugua, Moore, Prentice, K., Kvist, Otoo, García-Carral, Jiménez, Ruiz, Rodríguez, Pareja, Bode, McGuire and McGuire. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Michelle K. McGuire, email@example.com