Metagenomics Approaches to Investigate the Neonatal Gut Microbiome

Early infancy is critical for the development of an infant's gut flora. Many factors can influence microbiota development during the pre- and postnatal periods, including maternal factors, antibiotic exposure, mode of delivery, dietary patterns, and feeding type. Therefore, investigating the connection between these variables and host and microbiome interactions in neonatal development would be of great interest. As the “unculturable” era of microbiome research gives way to an intrinsically multidisciplinary field, microbiome research has reaped the advantages of technological advancements in next-generation sequencing, particularly 16S rRNA gene amplicon and shotgun sequencing, which have considerably expanded our knowledge about gut microbiota development during early life. Using omics approaches to explore the neonatal microbiome may help to better understand the link between the microbiome and newborn diseases. Herein, we summarized the metagenomics methods and tools used to advance knowledge on the neonatal microbiome origin and evolution and how the microbiome shapes early and late individuals' lives for health and disease. The way to overcome limitations in neonatal microbiome studies will be discussed.


INTRODUCTION
The human microbiome is a set of microorganisms (bacteria, viruses, protozoa, and fungi) residing in our body in different types of commensal relationships. The gut microbiota is the most important and consequential of them, with approximately 1,000 species (1) and 100 trillion microorganisms, which is ten times more than the number of eukaryotic cells that make up our body (2). It plays a role in regulating nutrient intake, intestinal motility, and metabolic and immunological development (3). However, how the newborn gut microbiota forms remains an open question. The combination of neonatal (gestational age, genetic history), maternal (delivery mode, diet), and environmental (e.g., antibiotic exposures) variables are considered to impact microbial colonization (4,5), although the precise processes remain unknown. Several studies have shown that the infant gut microbiome can be influenced by different factors, leading to many diseases in the infant or child, including necrotizing enterocolitis, obesity, and inflammatory diseases (6,7). Thus it is critical to understand the mechanism of microbiome colonization in newborns and how various factors may influence this process.
With the development, sophistication, and sensitivity of metagenomics and culturomics, identifying fecal microbes is improving our current knowledge about gut microbiota development, particularly during the early days' post-delivery (8,9). Two methods have been widely used to examine microbial diversity: 16S rRNA gene amplicon sequencing and whole-genome shotgun metagenomics sequencing (WGS). WGS metagenomics sequencing allows the characterization of microbial whole genomes, while 16S rRNA gene amplicon gives a depth description of the diversity of certain taxonomic groups (10,11).
The purpose of this review is to summarize and discuss the available amplicon-based and WGS metagenomics methodologies and tools used to profile the newborn microbiome and study the in utero transmission of microorganisms from mother to newborn.

THE STERILE UTERUS AND THE IN UTERO COLONIZATION HYPOTHESIS
Conflicting results from various studies have unsettled the notion of the in utero gut microbiome. The "sterile uterus" theory maintains that the embryo grows in a sterile environment in utero, and except for intrauterine infections during pregnancy, microbial colonization begins after birth (9,12). However, this theory of the sterile uterus has been called into question by recent studies using both metagenomics and culture techniques that revealed the presence of microbial community in the meconium, placenta, blood umbilical cord, and amniotic fluid (13,14). In addition, other researchers have reported the presence of a unique microbiota in the placenta and amniotic fluid, as well as in healthy women at the time of elective cesarean section, associated with low diversity and a prevalence of Proteobacteria (15). Likewise, other studies have discovered microbes in the umbilical cord blood and amniotic fluid in healthy women and those with pregnancy complications (16,17). Several studies have found microorganisms in the meconium, lending credence to the in utero colonization of the baby's gut theory. Staphylococcus was reported as the most frequent bacterium in meconium samples, followed by Enterobacteriaceae family, Enterococcus, Lactobacillus, and Bifidobacterium genera (18).
In contrast, supporters of the sterile uterus theory attribute the identified microbial components to external contamination (19) because there is no indication of bacterial colony survival (20). Likewise, attempts to grow viable bacteria from placental samples in healthy pregnancies have thus far been unsuccessful (21). Perez-Muñoz et al. (12), on the other hand, support both the "sterile uterus" theory and "in utero colonization" as a methodology-associated artifact (22). They concluded that the data stating a sterile uterine environment was more robust. They claim that methodological techniques in which contamination is fairly easy are responsible for the observed in utero colonization (12). In well-controlled studies, oral, vaginal, and placental samples were compared to contaminated controls. They determined that while there are unique microbial profiles in vaginal and oral samples, they did not discover a distinct placental microbiome, supporting the sterile environment theory (23). Therefore, firm conclusions remain elusive, and further research in this field is still required.

IMPACT OF PRE AND POST NATAL FACTORS ON THE NEONATAL MICROBIOME
Several studies have shown that the gut microbiota has a role in the programming and development of the fetal immune system, metabolic programming, and preventing pathogen colonization of the gut, all of which have long-term consequences in infancy, early childhood, and adulthood (24)(25)(26). Colonization of the gut microbiome is a complicated process controlled by many variables (Figure 1) (27). Despite recent research on in utero colonization, the birth canal is still considered the baby's first bacterial encounter (28). The neonatal microbiota is dominated by Enterobacteria, Escherichia, and Shigella throughout the first few days (9,29,30). According to a new study (9), while the Firmicutes phylum dominates the meconium, Proteobacteria species dominate fecal samples from newborns throughout their 1st months of life. The newborn gut microbiota shows more significant fluctuations and minor variations in the early days of life. Nevertheless, as the infant ages, the bacterial communities increase in diversity and stability. It has been proven that by the age of two, the neonatal microbiota has stabilized to a level equivalent to that of adults (31). However, other scientists suggest that this process might take up to 5 years (32).

Delivery Mode
It is well established that the delivery mode, either vaginal or cesarean section (CS), impacts gut microbiome colonization. Lactobacillaceae (Firmicutes) and Proteobacteria were initially characterized in vaginal-birth infants. Conversely, the gut of cesarean-delivered infants is dominated by Streptococcaceae and Staphylococcaceae (Firmicutes) (33,34), with a low diversity during the first years of life (35). Interestingly, this difference in colonization disappears with age, and the microbiota of both vaginally and cesarean section-delivered infants become similar (36). Several studies have found that vaginal birth seeds have a more favorable and healthier microbiota than CS delivery seeds. Furthermore, it has been observed that the mode of delivery impacts the development of immunological responses that may lead to allergies and autoimmune diseases (37,38). Extensive other studies demonstrated a link between cesarean delivery and a higher risk of asthma, obesity, and autoimmune disorders. However, more research needs to be done to provide a final answer to determine the relationship between mode of delivery and disease in later life (32).

Gestational Age
Gestational age is an essential factor influencing the first colonization of an infant's gut. Several studies revealed differences in microbiota composition between term and preterm infants. The gut microbiota of full-term infants is generally dominated by Bifidobacterium and Lactobacillus, which are considered healthier bacteria (32,39,40), while preterm infants were found to have retarded colonization of Bifidobacterium and Lactobacillus in their early life. Instead, they are more likely to be colonized by potentially pathogenic bacteria, particularly E. coli, Staphylococcus, Enterobacteriaceae, and Bacteroides (41,42). Premature infants are vulnerable to diseases including necrotizing enterocolitis (NEC) and sepsis, which are commonly caused by antimicrobial-resistant bacteria (38,43,44) due to their aberrant microbiota and immature gut immune systems. These diseases are rare in full-term babies, but they are severe and sometimes deadly in preterm babies. Moreover, gestational age has been revealed to influence milk composition, altering the metabolites that impact colonization, such as human milk oligosaccharides (HMOs) (15,45).

Antibiotics Exposure
Antibiotics are one of the most studied factors affecting microbiota, and they are the most prescribed treatment for infants. Antibiotic exposure and bacterial infections significantly affect microbiota composition in the postnatal and prenatal periods (46)(47)(48). Antibiotic treatment during pregnancy could affect the neonatal microbiota, leading to dysbiosis (48). Various studies have found a link between early gut microbiome dysbiosis and many diseases, such as asthma, immunological disorders (49), obesity, diabetes, and developmental disorders, such as autism, in later life. Early antibiotic exposure in infancy affects the composition and diversity of the infants' intestinal microbiota, with a reduction in Bifidobacteria and a marked increase in Proteobacteria. Furthermore, infants of mothers who received antibiotics before delivery showed the same microbiome changes as those seen in antibiotic-treated infants (48,49). In addition, amoxicillin/clavulanic acid antibiotic medication was linked to a fourfold increased risk of newborn NEC (50).

Nutrition
Nutrition during infant development plays a significant role in early microbiota colonization. Healthy full-term infants born through vaginal delivery and exclusively breastfed are thought to have the most beneficial gut microbiota composition. Human milk (HM) represents the optimal natural food (51) and contains a mix of nutrients, commensal bacteria, and functional groups such as oligosaccharides. HM also contains antimicrobials (such as lactoferrin) that can prevent the colonization of enteropathogens and stimulate the growth of Bifidobacterium (52), which may promote neonatal health by decreasing the risk of obesity and NEC and promoting mental development in preterm infants (53). Based on recent data, HM contains an average of 106 bacterial cells/ml (54), dominated by Proteobacteria and Firmicutes phyla, and a minor component of Pseudomonas, Staphylococcus, and Streptococcus genera (51,53).
The exact mechanism through which bacteria reach the mammary glands and are excreted into breast milk is still debated (55). One hypothesis is that HM mainly contains bacteria derived from the mother's skin and/or the infant's mouth (56). The other hypothesis, called the "Enteromammary pathway", postulates that some bacteria migrate from the maternal gastrointestinal tract to the mammary glands during late gestation and lactation (57). Formula-fed infants are codominated by Bifidobacterium and Bacteroides with a low percentage of Escherichia coli and Clostridia (52,58). A high level of Firmicutes with a low level of Bifidobacteria has been associated with a predisposition to obesity (59).
The second shift in the microbial colonization process after breastfeeding, is the introduction of solid food. During this period, the microbiota is rapidly changing, and it is characterized by bacteria such as the Ruminococcus, Blautia, Lachnospira, and Faecalibacterium genera that can digest mucin and glycans and produce bioactive molecules such as short-chain fatty acids (SCFAs) (32).

Genetics
Genetics also plays a role in microbial colonization, but the mechanisms are still poorly understood. Ethnicity has been recognized as one of the factors affecting neonatal microbial colonization, even in infants from the same geographic locations (60). Studies showed that the impact of ethnicity on newborn microbiota was visible 3 months after birth, and that was before the introduction of supplemental foods. A longitudinal study assessed the gut microbiota composition of 106 infants of three Asian ethnicities (Malay, Indian, and Chinese) who lived in the same geographical region (Singapore) and revealed that ethnic impacts were visible at 3 months post-birth and remained significant until 12 months in Chinese and Indian infants. The microbiota of Indian newborns was characterized by increased abundances of Lactobacillus and Bifidobacterium, while Bacteroides and Akkermansia were more abundant in Chinese infants (60). As a result of these ethnic variations in microbiota composition, human genetics may have a role in the establishment of gut microbiota.
Other studies on twins have discovered heritable bacteria in the gut microbiota, including the Christensenellaceae family and methanogenic archaealarchaea (60). MZ twins share more microorganisms than DZ twins or non-twin pairs, according to a study of ten healthy twins, five monozygotic (MZ) and five dizygotic (DZ), whose ages ranged from 0 to 6 years (61).

METHODS FOR INVESTIGATING THE HUMAN MICROBIOME
Prior to the microbiome era, bacterial characterization was primarily focused on identifying pathogenic species. However, the revelation of the importance of the microbiome-host interaction in human health has resulted in the creation of novel microbe investigation technologies. These so-called metagenomics techniques have helped researchers better understand the microbial diversity contained in a sample. This approach directly uses the genetic material existing in an environmental sample without the requirement for culture. Table 1 lists the most commonly used methodologies for studying the microbiome, as well as its benefits and limitations.
The most common methods to explore the human microbiome are 16S rRNA gene amplicon sequencing and shotgun metagenomics sequencing (WGS) (10). The 16S rRNA genes consist of both highly conserved and variable regions used for taxonomic classification. At the same time, WGS presents all genome sequences found in a given sample (10,11). Meta-omic studies have been increasingly used to study the gut microbiome community to better understand taxonomic classification, metabolic pathways, and the essential proteins and metabolites implicated in a specific host phenotype (64). One of these methods is metabolomics, which is used to profile the metabolites generated by the gut microbiome and define metabolites and biochemical pathways (62). Metatranscriptomics is another method for identifying and measuring gut microbial mRNA that reveals which genes and pathways are active and play essential roles in health and diseases (62). Another powerful approach to identifying and quantifying proteins from microbial communities is metaproteomics (63). Using a single omics approach to explore the gut microbiota has its own set of limitations (62), which can be addressed by combining various omics techniques. Consequently, researchers may better understand the relationship between the microbiome and diseases (65,66).
Previous studies have contrasted these two techniques and concluded that OTUs give poorer taxonomic resolution than ASVs and that choosing between the two can have an influence on alpha diversity estimations (73,74). Nevertheless, data quality and PCR errors have a significant impact on ASV approaches, resulting in the loss of a significant quantity of relevant information. As a result, when the data quality isn't good enough, an OTU-based strategy is required (75).
Prodan et al. compared the most common bioinformatics pipelines for 16S rRNA gene amplicon sequencing (including MOTHUR, QIIME-UCLUST, USEARCH-UPARSE, DADA2, USEARCH-UNOISE3, Qiime2, and Deblur) (76). They found that DADA2 had the best resolution and sensitivity, but USEARCH-UNOISE3 had the best overall performance, combining high sensitivity with excellent specificity. However, to produce more robust data, research in this field needs to move toward improved methods (77). Another study by the Almeida group found that when comparing QIIME, QIIME 2, MAPseq, and Mothur, QIIME 2 was the best tool for composition prediction, while MAPseq was more precise, with few genera being misallocated (78).
The reference database utilized, in addition to selecting the appropriate bioinformatics tool, is a crucial aspect in ensuring the most significant classification performance. The Ribosomal

USE OF METAGENOMIC TECHNIQUES FOR CHARACTERIZING THE NEONATAL GUT MICROBIOME
The main metagenomic techniques and pipelines that have been used in neonatal microbiome studies are listed in Table 2.

16S RRNA Gene Amplicon Sequencing
Previous studies using 16S rRNA gene sequencing to explore the gut microbiome have allowed the identification and functional characterization of microbial communities and highlighted the associations between microbial composition wellbeing and illness (6,7,49). Many studies have found that different factors, such as delivery mode, antibiotic exposure, and milk diet, significantly impact the formation of gut microbiota. In a birth-cohort study in Finland, 697 children showed that early exposure to two or more antibiotics was linked to a higher risk of asthma (83). Another study by Pan et al. examined the impact of the delivery mode on the structure and predicted function of intestinal microbiota in neonates and early infants in the Chinese population and discovered that the genera Bifidobacterium, Lactobacillus, and Bacteroides were more common in the vaginal delivery group than in the cesarean section group, which had a predominance of Staphylococcus, Streptococcus, and Corynebacterium in the 3-day-old infants' samples. Additionally, the Bacteroides level was lower in the cesarean section group than that in the vaginal birth group, suggesting that the latter is associated with an increased risk of obesity (91).
16S rRNA gene amplicon sequencing represents the principal tool for the characterization of bacteria in tissues with low bacterial biomass (i.e., placenta and meconium samples). This approach is limited by challenges associated with polymerase chain reaction (PCR)-based short read length sequencing, including GC bias, sequencing errors, and difficulty identifying OTUs (66,67). Although it is unable to distinguish viable taxa, it continues to have technical difficulties in achieving species-level precision in taxonomic profiling.

Shotgun Metagenomics
Shotgun metagenomics is the most informative technique for assessing taxonomic diversity present in a fecal sample (94). The findings of this analysis can be utilized to predict biological functions. The protein-coding sequences from the metagenomic readings are selected and compared to protein-coding sequences in a database to obtain functional profiling. This method may be used to provide a profile that describes the likely biological functions discovered in the sequenced metagenome (94,95). For example, a recent study found that the HM virome may alter the makeup of an infant's gut microbiome early in infancy, thereby affecting both short-and long-term health (82).
Despite its numerous benefits, this technology has some limitations during DNA preparation, and post-analytical processing techniques suggest that this technology can yet be improved. The next point to consider is the high degree of expertise and high cost required to analyze such massive amounts of data (64,69). Another technological limitation is that it is unable to make the difference between living and dead cells and thus unable to provide actual functional information (64,69). Additionally, there are numerous incompletely annotated bacterial genomic sequences, as well as concerns regarding database correctness and coverage (96,97).
Because metagenomics bioinformatics tools depend on the availability of annotated genomes, they are affected by reference sequence database restrictions. When evaluating metabolic potential, the absence of annotations for a large number of

QIIME2
Proportion of regulatory T cells (Tregs) increases at birth and 3 weeks of age. It is nearly 2 fold higher in exclusively breastfed neonates than those who only received formula milk. Breastfed neonates have a specific and Treg-dependent decrease in proliferative T-cell responses to non-inherited maternal antigens (NIMA), which is associated with a reduction in inflammatory cytokine production enrichment of short-chain fatty acid-producing taxa (Veillonella and Gemella) in stool samples of exclusively breastfed neonates.
16S rRNA gene sequencing Predicted functions of the vaginal delivery group revealed higher metabolic and biodegradation rates of carbohydrates, vitamins, and xenobiotics than those in the CS group, which led to the stability of the microbiota.

DADA2
The amount of fruits and vegetables consumed during pregnancy is linked to different alterations in the newborn gut microbiota at 2 months of age. Abundance of unhealthy infant gut microbiomes, such as Erysipelatoclostridium, Betaproteobacteria, and Lachnospiraceae, was negatively linked with higher maternal nutritional intake of fructose, dietary fiber, folic acid, and ascorbic acid.
16S rRNA gene sequencing Gut microbiota development during infancy: Impact of introducing allergenic foods (93) Fecal samples of 288 exclusively breast-fed infants

DADA2
Exclusively breastfed infant at 3 months, gut microbiota was highly heterogeneous, forming three distinct groups: Bifidobacterium-rich, Bacteroides-rich, and Escherichia/Shigella-rich. Increased abundance of Clostridium sensu stricto at 3 months was linked to the presence of atopic dermatitis on examination at age 3 and 12 month. Introduction of allergenic solids promoted a significant increase in Shannon diversity and representation of particular microbes, such as Prevotellaceae and Proteobacteria (e.g., Escherichia/Shigella) as compared with infants exclusively breast-feed.
microbial species leads to a bias toward highly conserved pathways (housekeeping genes), even if there are major changes in taxonomic composition (64). Additionally, the lack of host DNA depletion kits makes metagenomics unreliable in tissues with low biomass such as placenta and meconium samples. Shotgun metagenomics (non-targeted sequencing) can quickly resolve species-and strain-level categorization, as well as reveal genome content, functional potential and partial genome assembly for organisms with low abundance. It is, however, still more costly than amplicon sequencing, is less tolerant of low biomass or contaminated materials, and requires more complicated and expensive analytic techniques.

Metabolomics
Metabolomics studies on feces provide an essential analysis of the microbiome, including information on the host's metabolic profile, nutrition, and gut microbiota (62). This technique elucidates metabolites that mediate microbe-host interactions. In infancy, diet and mode of delivery have been found to have a significant connection with fecal metabolite composition (98). Human breast milk includes high levels of HMOs, which function as selective nutrients for particular bacterial groups (e.g., Bifidobacterium) in the production of SCFAs whose quantities change in relation to breastfeeding or formula diet. A recent study compared the gut microbiota samples of breastfed vs. formula-fed infants born by cesarean section (86). Breastfed neonates revealed a reduction in proliferative T-cell responses to non-inherited maternal antigens (NIMA) that was specific and Treg-dependent, as well as a decrease in inflammatory cytokine production (86). However, the fecal metabolome is thought to represent a functional output of the microbiome (99). Nevertheless, some of the metabolites will be shared by the gut microbiome and the host since feces include a combined metabolite output of both (62). However, because a large spectrum of metabolites is shared by the human host and intestinal microbes, metabolomics approaches are unable to differentiate them (100). Although metabolomics is an excellent tool for investigating the role of bacteria in a variety of pathologies, including human intestinal disorders, and neurodegenerative diseases, it is limited by the inability to distinguish between host and bacterial metabolites (64), the complexity in associating the relative phylogenetic origin, and the lack of adequate reference databases (69,70). Therefore, selecting suitable analytical methods and pipelines (equipment selection, sample processing, and statistical analysis) is a critical step in allowing relevant biological interpretation (64,69,70). Due to the large number and variety of metabolites discovered in a stool sample, as well as the heterogeneity of existing databases, data analysis may need manual integration of different databases by the user (100), which is a time-consuming and difficult phase that involves a significant chance of user-related mistakes.

CHALLENGES OF LOW MICROBIAL BIOMASS SAMPLES
Most microbiome research has focused on the gut, leading to the development of technologies that are better suited to samples with high microbial biomass (101). Samples from other body sites or, in comparison to the gut, have lower microbial biomass and are technically more difficult to explore. The main challenge with these samples is the amount of DNA, which comes from the environment. Microbial contamination has been presented as a real problem due to a lack of vigilance and understanding of technical problems while working with low microbial biomass samples (101)(102)(103). The publication of placental microbiota is an excellent illustration of this (101). This resulted in a false-positive finding, and unfortunately, other researchers have followed the same path.
Microbiome analysis has previously had problems due to a lack of controls. Initial research on the placental microbiota, for example, aimed to solve many issues about baby development and premature birth during pregnancy (13). Aagaard et al., on the other hand, did not take the essential procedures to monitor and reduce contamination. During the extractions, nontemplate controls were used, but only a fraction was sequenced. The sequences obtained in the negative controls were not compared to those found in the biological samples in any way. Furthermore, no environmental controls were collected or examined, preventing environmental contaminants from being detected.
Finally, the detection limit was not specified, prohibiting the researchers from determining whether a credible signal could be identified. Contaminant microbial species could not be reliably identified or analyzed in placenta samples, making it impossible to verify whether the placental microbial signature was genuinely endogenous. Lauder et al. (23) discovered that the microbial placenta profiles were similar to those of extraction blank control (EBC) and air samples, indicating that the placenta was likely sterile and lacked a varied microbial signature. Many other well-controlled investigations have now shown the absence of a placental microbiome (19)(20)(21). These studies emphasize the necessity of having solid processes and controls in place to avoid biased results.
As awareness of the aforementioned challenges of low microbial biomass sample examination, more researchers are now including controls in their data analysis. Nevertheless, this practice is still not standardized across the board, and researchers urgently need to learn from these past errors to improve microbiome research in the future, in particular for newborns. More researchers are increasingly integrating controls from sampling to data analysis as knowledge of the limitations of low microbial biomass sample inspection has increased. Nonetheless, this method is not uniform across the board, and researchers must learn from previous mistakes if they are to enhance microbiome research in the future.

RECOMMENDATIONS FOR AVOIDING POSSIBLE CONTAMINATION IN LOW-BIOMASS SAMPLES
Common microbiome procedures are not optimal for low microbial biomass samples, even in today's common microbiome research. Low biomass laboratory and analytical techniques have improved as a result of recent research, but there is always a potential for improvement (103,104). To prevent contamination and biases, greater knowledge of when and how they arise is needed. DNA contamination and biases can occur at any step during the sample preparation and analysis process. Therefore, it is recommended to include controls from the samples of the professionals who collect and process samples, the collection room, the laboratory environment, equipment, and reagents. Moreover, all sample collectors and laboratory technicians should wear clothes that cover exposed skin, such as gloves and face masks (102,103). Additionally, ultraviolet radiation can be used to minimize reagent and equipment contamination (101). It is also recommended that low biomass samples should be sequenced at a higher depth to capture a sufficient number of unique sequences (105).
Several bioinformatic approaches have been developed to monitor and remove contaminated DNA that affects samples with low microbial biomass. First, as previously mentioned, the limit of detection can be achieved using positive and negative controls (101)(102)(103). Individual contaminated species may also be traced using their specific sequence from their source (i.e., equipment, reagents, environmental controls, etc.) and then eliminated or identified using publicly accessible software such as SourceTracker and Decontam (101,106). While programs can be used to eliminate or track contaminants, they do not replace the need to monitor and investigate contaminants throughout the sampling and laboratory processes.
As the number of investigations into neonatal microbiome samples grows, it is critical that new protocols and methods are developed to reduce the impact of contamination and biases on these samples, as well as to fully understand the limitations when developing diagnostic tests based on the findings (101)(102)(103). To reduce some of these difficulties and better detect disease causes and consequences, new methods based on sensitive, highthroughput techniques that explore undiscovered microbes and microbial populations as a whole are necessary.

CONCLUSION
With the advent of metagenomics, we can now define the structure and function of the microbiome community at the pre-and post-delivery stages with high precision. Due to the metagenomics' potential, dogmas such as the "sterile womb" hypothesis are being challenged by the discovery of microbes in previously sterile tissues. However, if these bacteria are rucked as real occurrences and not just experimental artifacts/contaminants, it is unknown if they colonize the embryo or are only present for the purpose of priming the fetal immune system.
Our review of maternal and environmental factors influencing intestinal microbiota highlights the need to focus on these aspects during pregnancy. The gut microbiome plays a significant role in human growth and development, as well as in the establishment of immunological responses, indicating that balanced gut microbiota is essential for good health. Emerging data suggesting early-life gut microbiome establishment as a protective factor against gut dysbiosis-related diseases later in life supports the need for targeted therapies to restore the gut microbiome in early-life.
To keep up with the latest sequencing technology and Metagenomic methods, new bioinformatics tools are constantly being developed and updated. The most difficult issue for the user is not being confused on offered alternatives. Indeed, comparing the outcomes obtained through several Bioinformatic tools remains the best strategy. Furthermore, using frequent comparison studies and reviews to help the user is beneficial and should be read and well analyzed before making a final decision on which tool to use. Therefore, to understand the neonatal microbiome and the newborn-health link, it is necessary to choose efficient computational tools and methodologies to analyze the neonatal microbiome.

FUTURE DIRECTIONS AND PERSPECTIVES
Even though microbiological techniques have advanced in the last years, certain aspects still need to be enhanced such as reducing contamination, which is a critical issue in microbiome studies, particularly for samples with low microbial mass, as well as the differences in sample collection, storage, DNA extraction protocols, sequencing methods, and bioinformatics tools, which could all be responsible for introducing biases into the results and variability between microbiome studies. Additionally, other sections of the infant gut microbiota composed of fungi and viruses require additional investigation. Furthermore, using fecal samples to study neonatal gut microbiota has limitations. It is not always representative of the gut microbiome and leaves out gut-adherent bacteria that affect colon epithelial physiology and functions.
Although dysbiosis has been associated with several diseases, such as inflammatory diseases, atopic diseases, and NEC, other conditions have also been linked to disruptions in the microbiome. Nevertheless, in most cases causal correlations have yet to be demonstrated. Moreover, is there a critical period of microbiome development that, if disturbed, leads to a disease state in cases where illness states have been connected to microbiome alterations? Are there some microorganisms that can protect you from diseases? What human host variables and/or environmental interactions are crucial for the development of a healthy newborn microbiome? Scientists working to determine the likely microbial etiologies of infant diseases must address these and other problems. By obtaining these responses, potential protective and therapeutic strategies can be created to modulate initial microbial colonization and decrease the risk of adverse health outcomes associated with unbalanced microbiota evolution.

AUTHOR CONTRIBUTIONS
HG, ZB, and ND contributed to conceptualization and writing. SS, SE, and NA contributed to writing. SH, HB, NA, ML-P, SA, and CN reviewed the manuscript. All authors read and approved the submitted version.

FUNDING
The Article Processing Charge (APC) paid by Mohammed VI University of Health Sciences (UM6SS).