REVIEW article

Front. Immunol., 07 January 2019

Sec. Mucosal Immunity

Volume 9 - 2018 | https://doi.org/10.3389/fimmu.2018.02868

Exploring the Human Microbiome: The Potential Future Role of Next-Generation Sequencing in Disease Diagnosis and Treatment

  • 1. Department of Zoology, Dr. Harisingh Gour Central University, Sagar, India

  • 2. Metagenomics and Secretomics Research Laboratory, Department of Botany, Dr. Harisingh Gour Central University, Sagar, India

  • 3. Department of Botany and Microbiology, College of Science, King Saud University, Riyadh, Saudi Arabia

  • 4. Mycology and Plant Disease Survey Department, Plant Pathology Research Institute, Agriculture Research Center, Giza, Egypt

  • 5. Department of Plant Production, College of Food and Agricultural Sciences, King Saud University, Riyadh, Saudi Arabia

Article metrics

View details

323

Citations

54,3k

Views

13,6k

Downloads

Abstract

The interaction between the human microbiome and immune system has an effect on several human metabolic functions and impacts our well-being. Additionally, the interaction between humans and microbes can also play a key role in determining the wellness or disease status of the human body. Dysbiosis is related to a plethora of diseases, including skin, inflammatory, metabolic, and neurological disorders. A better understanding of the host-microbe interaction is essential for determining the diagnosis and appropriate treatment of these ailments. The significance of the microbiome on host health has led to the emergence of new therapeutic approaches focused on the prescribed manipulation of the host microbiome, either by removing harmful taxa or reinstating missing beneficial taxa and the functional roles they perform. Culturing large numbers of microbial taxa in the laboratory is problematic at best, if not impossible. Consequently, this makes it very difficult to comprehensively catalog the individual members comprising a specific microbiome, as well as understanding how microbial communities function and influence host-pathogen interactions. Recent advances in sequencing technologies and computational tools have allowed an increasing number of metagenomic studies to be performed. These studies have provided key insights into the human microbiome and a host of other microbial communities in other environments. In the present review, the role of the microbiome as a therapeutic agent and its significance in human health and disease is discussed. Advances in high-throughput sequencing technologies for surveying host-microbe interactions are also discussed. Additionally, the correlation between the composition of the microbiome and infectious diseases as described in previously reported studies is covered as well. Lastly, recent advances in state-of-the-art bioinformatics software, workflows, and applications for analysing metagenomic data are summarized.

Introduction

Microbes are ubiquitous in nature, inhabiting almost every conceivable environment, and play an important role in human life. Microbes, though generally invisible, play an essential role in ecosystem functioning (1, 2), modulating key ecosystem processes such as plant growth, soil nutrient cycling, and marine biogeochemical cycling (36). An innumerable number of symbiotic, pathogenic, and commensal microbes colonized the human body; collectively constituting the human microbiota. Interactions between the human body and gut-microbiota are widely recognized as influencing several aspects of human health (7). A functioning microbiome is obligatory for host organisms, as it contributes to the smooth functioning of important physiological processes. In fact, host organisms have co-evolved with their microbiota; with some commensals having evolved as pathobionts while others as symbionts (8, 9). The presence of certain commensals in the human gut induces signals that drive proper functioning and maturation of the immune system. Microbial communities take on a specific structure within different hosts and physical environments (10). Consequently, identification and characterization of the microbes inhabiting a host, their distinct host phenotypes, and the biochemical pathways by which microbes impact their hosts are the major focus of host-microbiome research.

Analyses of host-microbe interactions can reveal the core characteristics of the interaction, including their identification, classification, profile prediction, and mechanisms of interaction. Although the structure, function, dynamics, and interactions of these microorganisms play an essential role in human metabolism; their identification, quantification and characterization can be problematic. The majority of microbial communities are extremely diverse and most of the individual organisms have not yet been cultured (11). Secondly, their interaction with each other and tendency to form intricate networks makes it difficult to predict their behavior (3). Establishing mechanistic connections between gut-microbiota and its functioning adds an extra challenge especially in understanding the biology of intricate microbial consortia (12). Classic approaches to microbial ecology have relied on cultivation-dependent techniques to study host-microbe interactions. Although these culture-dependent techniques have generated interesting data sets, they have also resulted in a spurious view of microbiota. Recently, however, a number of culture-independent techniques, mainly PCR-based methods, have evolved for the qualitative and quantitative identification of microbes. These techniques have entirely changed the perception of the human microbiome and have paved the way for the establishment of metagenomics. Metagenomic studies are increasing our knowledge of host-pathogenic interactions by revealing the genes that potentially allow microbes to influence their hosts in unexpected ways. Metagenomic studies of host-microbe interactions can provide useful information applicable to a wide array of disciplines; including pathogen surveillance, biotechnology, host-microbe interactions, functional dysbiosis, and evolutionary biology (13). Recent studies of host microbiomes using metagenomic approaches have offered key insights into host-microbe interactions.

In addition to allowing researchers to characterize the composition of microbial communities, metagenomic studies have also provided novel information on other aspects of the biological sciences. For example, metagenomic studies on the human microbiome have revealed possible links between the gut microbiome and human diseases such as depression (14), rheumatoid arthritis (15) and diabetes (16). Several studies have utilized materials from ancient communities to trace changes in the microbiome. These studies have conducted metagenomic studies of coprolites (17), teeth (18), and other tissues (19). Provided that nucleic acids can be extracted from the sample, almost any material from an environment can be used in metagenomic analyses. One of the largest metagenomic studies to date is the Global Ocean Sampling. Metagenomics is also being applied to the field of medicine. Figure 1 illustrates the timeline of sequence-based metagenomic studies and shows the range of environments that have been sampled and analyzed between 2003 and 2017. Several articles have been published that have focused on metagenomic methodology and analysis software (2027). The present review attempts to provide an overview of the high-throughput sequencing technologies and analytical software currently available for studying host-microbe interactions. Moreover, there is an attempt to also highlight the advancement of sequencing techniques over time and provide information regarding the appropriateness for applications in exploring the human microbiome and the metagenomes of other diverse environments. Lastly, a discussion is provided of the various bioinformatic options that are available to successfully meet both de novo sequencing and sequence alignment challenges.

Figure 1

Figure 1

Timeline of the sequence-based metagenomic projects showing the variety of the environmental samples.

Unseen Microbial Diversity And Its Global Implications

Microbes conduct significant functions that greatly benefit the health of planet, as well as its inhabitants. Microbes help to regulate, modulate, and maintain earth's atmosphere (28), support the growth of plants and help to suppress plant diseases (29), contribute to human health (30), breakdown harmful chemicals present in contaminated environments (31, 32), support sustainable farming (33), modulate greenhouse gases (34), are primary components of various biogeochemical cycles (35) and greatly contribute to ecological processes, including climate change (36). In addition to remediating contaminated environments and modulating the atmosphere, the combined activity of these invisible microbial communities shape the face of the biosphere and represent untapped reservoirs of novel biomolecules; including pharmacological drugs and industrial enzymes (37). Microbes coexisting in the human body offer a variety of benefits by modulating fundamental metabolic processes, immunity, and signal transduction. Increasing evidence suggests that there is a significant association between the human gut microbiome and the development of human diseases (38).

Previously, it was difficult to study microbes in their natural environment and thus microbiologists were limited to studying individual species in the laboratory. This approach, however, has limited the data that can be obtained on microbial communities inhabiting diverse ecosystems. Metagenomics has helped to resolve this limitation and has greatly increased our understanding of entire microbial communities, thus significantly advancing our knowledge of microbial ecology and microbiology in general. Metagenomics, supported by next-generation-sequencing (NGS) has literally removed the limitations and boundaries associated with classic culture-based approaches (3941). NGS technology has enabled the comprehensive study of diverse microbiomes in their native environments, including the ocean microbiome (42), human skin microbiome (43), human microbiome (44) and the Saragossa Sea microbiome (45). Some of the novel findings enabled by metagenomics involve the identification of endosymbiotic bacterial phyla (46), nitrification processes (47, 48), human disease pathogens associated with epidemics (49), bacteria (50), and viruses (51) associated with inflammatory bowel diseases, and the identification of commensal gut bacteria (52).

Microbiome in Human Health And Disease: a Mechanistic Link

The human body serves as a host to a networked community of microorganisms (microbiome) that outnumber the body's own cells. Research on the human microbiome has been the area of immense interest over the past few years due to intimate linkage of the microbiome with human health. The human microbiome “our second genome” has intimately co-evolved with humans for millions of years and plays a critical role in human health. Deciphering the composition and function of the human microbiome can provide a deeper understanding of its' structural and functional properties. In the future, our understanding of the human microbiome and the application of metagenomic analyses will greatly enhance our understanding of human health and disease in specific individuals. The exploration of human microbiome and metagenome is considered to represent a frontier in human genetics.

The majority of research on the human microbiome has focused on the microbes colonizing the human digestive system, as these microorganisms are believed to influence human health in a number of ways. The digestive system microbiome is extremely diverse, with significant variations in its constituents across individuals (44). Modulation of the microbiome by extraneous factors, such as fecal transplantation and dietary intervention, has been shown to be a potential therapeutic approach to addressing a number of health-related problems (53). The gastrointestinal tract (GIT) harbors a vast diversity of microbes, comprising the intrinsic networks of both microbe-microbe and host-microbe interactions (54). Microbial guilds (species that exploit the same resources) have been found to provide an interesting feature that can be used to help understand processes taking place at both a single cell and community level. Microbes under normal physiological conditions are commensal and mediate digestion, strengthen the immune system and inhibit or prevent pathogens from invading the body. The linkage between the human microbiome and human health remains largely unknown and unexplored, however, a number of epidemiological studies have found that the overall reduction in the diversity of digestive system microbiota is linked to diseases such as eczema (55), asthma and inflammatory diseases (56), diabetes and obesity (57), allergies (58), digestive tract disorders such as IBD (inflammatory bowel disease) (59) and IBS (irritable bowel syndrome) (60). Dysbiosis (microbial imbalance) has also been linked with the genesis and evolution of a plethora of other diseases, including chronic fatigue syndrome (61), cancer (62), colitis (63) bacterial vaginosis (64), and anxiety and depression (65). Several recent studies have highlighted the critical role that the gut microbiome plays in modulating different immune responses, including immune tolerance, via Treg (T regulatory) cell modulation. Studies carried out by Geuking et al. (66), indicated that short-chain fatty acids (SCFA) can promote the development of Treg cells in the gut. Gut-inhabiting microbes facilitate the breakdown of complex carbohydrate (67) and help in the utilization of polysaccharides (68). Other examples of the health-supporting functions of the gut-microbiome are protection against diseases via immune modulation (69), fecal microbiome transplantation (70), metabolism, xenobiotic toxicity and pharmacokinetics (71).

The Microbiome as a Therapeutic Agent

As mentioned, the human body is teeming with trillions of microbes, collectively called the “human microbiome.” Microbiome studies have now become a prominent field of research by offering potential and novel methods of disease diagnosis, prognosis, and treatment. Microbial ecology within an ecosystem involves a cross-talk among its inhabitants. The growth and survival of microbes in any ecosystem are largely governed by their chemical environments, and microbes have evolved the ability to adapt and utilize different chemicals through specific genes (72, 73). Alterations (good and bad) in the microbial equilibrium of the gut microbiome do occur. Science has developed medications that have a significant impact on the microbial equilibrium. Beneficial microbes colonizing the gut produce a variety of chemicals, including analgesics, vitamins, antioxidants and anti-inflammatory factors that protect and support the normal functioning of the human body. Dysbiosis (disruptions in microbiota) has been associated with different diseases. Therefore, maintaining a beneficial gut microbiome, in terms of both composition and function, is important for human health (74, 75). The gut microbiome has an active relationship with its human host and exhibits a regulatory role in cognition, mood, pain, and anxiety, exerted through a gut-brain axis. Drastic changes in the maternal microbiome that occur during pregnancy influence the maturation and immunity of neonates. Studies carried out by Ng et al. (76) indicated that increased levels of salicylic acid in the intestines contribute to the proliferation of pathogenic bacteria in the GIT when patients are treated with antibiotics. Roberfroid et al. (77) reported that the consumption of prebiotics (indigestible plant fiber) induces specific changes in the gut microbiome, elevating levels of SCFA (short chain fatty acid). Studies reported by Cani et al. (78) stated that fermentation activity carried out by the gut microbiome results in reduced hunger and increased satiety levels, which as a result, decreases total energy inputs. Similarly, studies carried out by Archer et al. (79) and Whelan et al. (80) confirmed that fermentation of non-digestible carbohydrates by the gut microbiome controls food intake activity and reduces energy intake. According to Parnell et al. (81), prebiotic-induced changes in the gut microbiome of obese patients decreases the circulation of lenomorelin or ghrelin (a hunger hormone) and increases the peptide, tyrosine or PYY. In contrast, however, studies carried out by Peters et al. (82) and Hess et al. (83) indicated that prebiotic treatments do not influence the appetite. A recent study by Tarini et al. (84) demonstrated that a single dose of insulin significantly decreases levels of lenomorelin blood plasma and augments post-prandial plasma levels of Glucagon-like peptide-1. In short, there is a growing body of evidence on the contribution of the microbiome on human health and increased understanding that the microbiome can serve as a potential therapeutic agent.

Dissecting the Host-Pathogen Microbiome

Host-pathogen interactions have profound consequences in human biology and can be viewed as a battle between two systems. Pathogens, which are the invaders, can seize host cells and use them for their own advantage (8), and they can evolve so quickly that they overpower the human immune system, as with HIV infection (85). The conflict between the interacting partners results in phenotypic changes and is believed to be the main driving force for a number of phenomena, such as speciation and the evolution of sex (86). Detailed mechanistic analyses of host-pathogen interactions are varied with most still in need of further study. Notably, little is known about the molecular level dynamics of host-pathogen interactions and the need for more studies on this topic are critical, especially those dealing with the molecular events that regulate phenotypic changes in the host. Advancements in Next Generation Sequencing (NGS) technologies and bioinformatic tools have offered new approaches for studying host-pathogen systems. Researchers are now able to construct the genomes of both model and non-model organisms. The use of these newly-developed tools allows researchers to not only study the behavior of a single gene under different conditions but also study the extensive impact of these host-pathogen interactions on molecular environments (global gene expression). Several open-source, standalone R packages and web-based software programs have been developed to help and acquire key insights in understanding the host-pathogen microbiome (Figure 2). A more detailed account of metagenomic software and resources are given in a separate section of this review wherein we mentioned some of the standard software used for quality control, taxonomic classification, diversity metrics, annotation and functional information, sequence classification, metabolic pathway reconstruction, and statistical analyses.

Figure 2

Figure 2

Tools and web servers related to gut microbiome studies.

Microbial Census

Culture-independent methods are the most appropriate for ascertaining the abundance of microbes that are present within a community. DNA re-association kinetics provide information on both community structure and diversity (87). 16S rRNA gene sequencing is one of the main methods used for identifying the microbial taxa present in a community (88). The utility of this approach is based on the fact that the DNA sequence of regions between conserved areas of 16S rRNA vary among different bacterial species and can be species specific. Two different sequencing approaches used for studying microbial communities are (i) the targeted sequencing (16Sr RNA) and (ii) shotgun sequencing of the metagenome. Each of these methods can provide strikingly different results when used in metagenomic analyses. Shotgun sequencing methods are generally considered superior for the identification and characterization of microbial communities, as they typically provide a greater level of diversity compared to amplicon sequencing (89). Amplicon-based sequencing matches the DNA sequence amplified using a set of universal primers based on the highly-conserved 16S rRNA to sequences of known bacterial taxa. In contrast to amplicon sequencing, shotgun sequencing engages a genome-wide approach, utilizing random strings of genomic DNA sequences obtained by breaking total genomic DNA and matching the obtained sequences to an annotated database of known DNA sequences using clade-specific marker genes or common sequences. Shotgun metagenomics is often used for gene cataloging and functional inference (10). Deep sequencing of metagenomic samples, as was used in the Human Microbiome Project and Metagenomics of the Human Intestinal Tract, provides extensive sequence information even of minor components (taxa) present in the metagenome. This allows for the identification and characterization of the genes present within a given microbial community. The obtained sequences reads can either be used directly or first assembled into contigs, which are then compared to an available database for the identification of specific genes. De novo gene prediction is also possible (90), which may identify motifs with functional inference. Gene catalogs can also be compared with databases such as KEGG (the Kyoto Encyclopedia of Genes and Genomes) (91), which arranges the gene products into biological processes and pathways (Figure 3).

Figure 3

Figure 3

Microbiome analysis workflow.

Metagenomics and Microbial Studies

Metagenomics is expected to play a major role in advancing our understanding of microbes and microbial communities. It is tempting to suggest that metagenomics can serve as a “universal test” for pathogens, eliminating the need to perform lengthy serial testing involving specific assays. Recent advances in sequencing techniques allow almost the entire genome of individual microorganisms to be assembled directly from environmental samples. Metagenomic analyses are playing a decisive role in the characterization of human microbial communities, as well as in determining the relationship between the resident microbiome and invasive pathogens. The accumulation of sequencing data has enhanced our recognition and understanding of the changing nature of microbial populations and their impact on the environment (92) and on human health (93). Metagenomics is not only helping to identify and characterize the human gut microbiome but is also identifying novel genes and microbial pathways, as well as functional dysbiosis. Clearly, metagenomics has become an indispensable and fast-growing discipline in modern science. Advances in NGS has led to a substantial increase in the number of metagenomic studies listed in the Genomes Online Database (GOLD) (https://gold.jgi.doe.gov). These studies span a broad environmental spectrum, including natural communities; as well as engineered and clinical environments (94, 95).

Study of Microbiome Prior To NGS

Prior to the advent of NGS technologies, the accurate profiling of microbial communities was challenging. The same was true for characterizing the human gut microbiome, a highly dense and diverse community containing only a small proportion of microbes that could be cultured (96). Early studies of the human gut microbiome involved the culturing of the microbes present in samples (97) and studying the interactions between co-cultured microbial taxa (98). These techniques, however, provided information on only a limited set of microbial taxa and microbial interactions. They failed to provide information about the composition of the entire community and the dynamics occurring between the taxa comprising the total community. The emergence of NGS technologies has overcome the limitations characteristic of studies based on culturing techniques.

Deciphering Host-Pathogen Interactions In The Era Of NGS Technologies

The advent of NGS technologies have greatly enhanced the ability to identify and characterize metabolic and regulatory mechanisms through which hosts and microbes interact with each other to define a healthy or diseased state in the host organism. NGS technologies are invaluable for the exploration of the composition of the microbiome and exploring the genetic, functional, and metabolic properties of the microbial community. Sanger sequencing (99), the first generation of DNA sequencing technology, was one of the widely used sequencing method for more than three decades and is still used today for low-throughput DNA sequencing or sequencing of single DNA entities. Sanger DNA sequencing is based on the principle of the selective incorporation of chain-terminating dideoxynucleotides by DNA polymerase. This technique was the major approach used in the Human Genome Project in 2001. The high cost of Sanger sequencing and volume (number of sequences) limitations reduced its potential for high-throughput sequencing.

Exploring Host-Pathogen Interactions

Advances in NGS technologies now provide a fast, cost-effective approach to delivering large volumes of highly-accurate data that has resulted in a major paradigm shift over the past few decades (100, 101). Time and cost were originally the main stumbling blocks associated with sequencing technology. The advantages of NGS over classic Sanger sequencing are that it is cost-effective, devoid of a cloning step, offers highthroughput, and requires minimal technical expertise. A major challenge with NGS data, however, is the analysis of millions of sequences that allows one to achieve statistically and scientifically meaningful conclusions (Table 1).

Table 1

Sequencing reactionLimitationAdvantagesInstrumentsRead length in base pairs (bp)ThroughputTotal number of readsRuntime
Sequencing by ligation or SOLiD sequencing
This sequencing method has been reported to have problems in sequencing particularly palindromic sequences and relatively slower than other methods.Relatively cheapSOLiD 5500 Wildfire50 (SES)80 Gb~700 M+6 days
75 (SES)120 Gb
50 (SES)+160 Gb
SOLiD 5500xl50 (SES)160 Gb~1.4 bn+10 days+
75 (SES)240 Gb
50 (SES)+320 Gb
BGISEQ-500 FCS155*50–100 (SES/PES)+8–40 Gb+NA24 h
BGISEQ-500 FCL15550–100 (SES/PES)+40–200 Gb*NA24 h+
Sequencing by synthesis:CRT
Equipment are very expensive. Requires high concentration of DNA.Potential for high sequencing yield, depending upon sequencer model and desired applicationIllumina MiniSeq Mid Output150 (SES)+2.1–2.4 Gb+14–16 M+17 h+
Illumina MiniSeq High output75 (SES)1.6–1.8 Gb22–25 M(SES)+7 h
75 (PES)3.3–3.7 Gb44– 50 M(PES)+13 h
150 (PES)+6.6–7.5 Gb+24 h+
Illumina MiSeq v236 (SES)540–610 Mb12–15M (SES)4 h
25 (PES)750–850 Mb24–30 M (PES)+5.5 h
150 (PES)4.5–5.1 Gb24 h
250 (PES)*7.5–8.5 Gb+39 h
Illumina MiSeq v375 (PES)3.3–3.8 Gb44–50 M (PES)+21–56 h+
300 (PES)+13.2–15 Gb+
Illumina NextSeq 500/550 Mid output75 (PES)16–20 GbUp to 260 M (PES)+15 h
150 (PES)+32–40 Gb+26 h+
Illumina NextSeq 500/550 High output75 (SES)25–30 Gb400 M(SES)+11 h
75 (PES)50–60 Gb800 M(PES)+18 h
150 (PES)+100–120 Gb+29 h+
Illumina HiSeq2500v2 Rapid run36 (SES)9–11 Gb300 M(SES)+7 h
50 (PES)25–30 Gb600 M(PES)+16 h
100 (PES)50–60 Gb27 h
150 (PES)75–90 Gb40 h
250 (PES)+125–150 Gb+60 h+
Illumina HiSeq2500 v336 (SES)47–52 Gb1.5 bn (SE)2 days
50 (PES)135–150 Gb3 bn(PES)+5.5 days
100 (PES)+270–300 Gb11 days+
Illumina HiSeq2500 v436 (SES)64–72 Gb2 bbn(SES)29 h
50 (PES)180–200 Gb4 B (PES)+2.5 days
100 (PES)360–400 Gb5 days
125 (PES)+450–500 Gb+6 days
Illumina HiSeq 3000/400050 (SES)105–125 Gb2.5 bn (SES)+1–3.5 days+
75 (PES)325–375 Gb
150 (PES)+650–750 Gb+
Illumina HiseqX150 (PES)+800–900 Gb per flow cell*2.6–3 bn (PES)+< 3 days+
Qiagen Gene ReaderNA12 genes; 1,250 mutationsNASeveral days
Sequencing by synthesis: SBS
Homopolymer errorsLess expensive and relatively fast454 GS JuniorUpto 600;400 average (SES,PES)*35 Mb+~ 0.1 M+10 h+
454 GS Junior+Upto 1,000;700 average (SES,PES)+70 Mb+~ 0.1 M+18 h+
454GSFLX TitaniumXLR70Upto 600;450 mode (SES,PES)+450 Mb+~1 M*10 h+
454 GS FLX Titanium XL+Up to 1,000; 700 mode (SE, PE)+700 Mb+~1 M+23 h+
Ion PGM 314200 (SES)30–50400,000–23 h
400 (SES)60–100 Mb+550,000+3.7 h+
Ion PGM 316200 (SES)300–500 Mb2–3 M+3 h
400 (SES)+600 Mb−1 Gb+4.9 h+
Ion PGM 318200 (SES)600 Mb−1 Gb4–5.5 M+4 h
400 (SES)+1–2 Gb+7.3 h+
Ion ProtonUp to 200 (SES)Up to 10 Gb+60–80 M+2–4 h+
Ion S5 520200 (SES)600 Mb−1 Gb3–5 M+2.5 h
400 (SES)+1–2 Gb+4 h+
Ion S5 530200 (SES)3–4 Gb15–20 M+2.5 h
400 (SES)+6–8 Gb+4 h+
Ion S5 540200 (SES)+10–15 Gb+60–80 M+2.5 h+
Single-moleculereal-time long reads or (PacificBioSciences)
Moderate throughput and equipment are very expensiveFast detectionPacific BioSciences RSII~20 Kb500 Mb−1 Gb+~55,000+4 h+
Pacific BioSciences Sequel8–12 Kb3.5–7 Gb+~350,000+0.5–6 h+
Oxford Nanopore MK1MinIONUp to 200 KbUp to 1.5 Gb>100,000Up to 48 h
Oxford Nanopore PromethIONNAUpto 4 Tb+NANA

Advantages and limitations of available Next generation sequencing (NGS) platforms.

+

Manufacturer's data;

*

Rounded from Field Guide to next-generation DNA sequencers and 2014 update;

Information is not available, as this product has been developed recently.

CRT, Cyclic Reversible Termination; NA, Not Available; PES, Paired End Sequencing; SBS, Sequencing by Synthesis; SES, Single End Sequencing.

Several different NGS platforms have been developed (Figure 4) and are commonly used. These include the Roche 454 GS FLX, Illumina (MiSeq and HiSeq), Ion Torrent/IonProton/Ion Proton, SOLiD 5500 series, and Oxford Nanopore. At present, the majority of microbial studies using high-throughput sequencing have focused on either functional metagenomics (103) or amplicon sequencing (104).

Figure 4

Figure 4

Timeline of the introduction of the next-generation DNA sequencing technologies and platforms.

Roche 454 Genome Sequencer

This sequencing platform is based on the principle of pyrophosphate release, which was originally described by Nyrén et al. (105) in 1985 and reported by Hyman (106) in 1988. Roche 454 was produced and made commercially available in 2005, and advertised as the first available high-throughput sequencing system. The system utilizes sequencing by synthesis (SBS), in which adapters are ligated to DNA fragments that cause the binding of the fragments to microbeads in a Pico Titer Plate (https://www.roche.com/). Amplification of the DNA fragments is carried out by Emulsion PCR, in which water droplets containing a single bead and PCR reagents are immersed in oil. The long read length (400–500 bases with paired-ends), along with its high efficiency, were more advantageous than what other NGS platforms could provide at that time; and thus was used for genome sequencing. The system generates 20 Mb of sequences per run with an average read length of approximately 100 bp (107). One of the notable applications of the Roche 454 system was the identification of the agents responsible for the epidemic disease of honeybees (108). Additional information about Roche 454 Genome Sequencer can be obtained at http://www.roche.com.

Illumina Genome Sequencer

The Illumina sequencing platform first emerged in 2006, and was followed by the acquisition of Solexa by Illumina in 2007. Illumina possesses an array of the most commonly sequencing systems and has rapidly been adopted by many researchers throughout the world. This is due to its' cost-effectiveness, and longer read length (although a limitation in the earlier version of the Illumina, which was subsequently improved in the newer version, MiSeq 2 × 300 bp). This led to a major shift by the scientific community from using the Roche 454 platform to Illumina technology (109). Illumina follows the principle of SBS chemistry, by incorporating reversible chain terminator nucleotides for all four bases, the labeling of each base with a different fluorescent dye, and the use of a DNA polymerase (110). Sequencing involves the ligation of specific adapters to both ends of short DNA fragments, and the immobilization of one of the adapters by binding to a solid support. The adapters hybridize with specific oligonucleotides bound to a proprietary substrate within a micro fluid flow cell. Fluorophore-bearing nucleotides are then introduced one by one and incorporated into the growing complementary strand by a DNA polymerase. Sequential images are captured and analyzed to identify the nucleotide that is incorporated in the growing strand and the cycle is repeated with different nucleotide species. The resulting reads have a final length of 35 nucleotides (111).

Illumina, however, introduced an upgraded version of their technology, the Genome Analyser II, which tripled the output relative to earlier versions of the Genome Analyser. Presently, the IlluminaMiSeq offers one of the longest (300 bp) read lengths of all of the Illumina products; facilitating the sequencing of paired-end reads (104). Another Illumina platform, the Illumina HiSeq, however, is able to generate approximately 200 Gbp of sequences with a single read of 2 × 100 bp (paired-end) per run (112). Additional information about the various Illumina sequencers can be obtained at http://solexaqa.sourceforge.net/ (113).

Qiagen Gene Reader

In 2012, Qiagen introduced the Intelligent BioSystems cyclic reversible termination platform, which was commercialized in 2015 under the name Gene Reader (114). In contrast to other next-generation platforms, the Qiagen Gene Reader is the first all-in-one platform that can execute all of the steps required for sequencing DNA, from sample preparation to analysis. To achieve this goal, the Gene Reader sequencer was combined with the QIA cube sample preparation system and the Qiagen Clinical Insight platform for variant analysis. Gene Reader virtually utilizes the same approach as Illumina, apart from the fact that only a small fraction of the added nucleotides incorporate fluorophore-labeled dNTP (115). Qiagen's Gene Reader usually runs up to four flow cells at a time, with each flow cell running up to ten samples. The flow cells can be added in mid-run via a “turntable” within the instrument. Additional information on the Qiagen Gene reader can be obtained at http://www.qiagen.com.

ABI SOLiD (Sequencing by Oligonucleotide Ligation and Detection) System

Applied Biosystems, through the Life Technology subsidiary, introduced the SOLiD platform in 2007. The system employs a unique chemistry for sequencing by ligating oligonucleotide adapters to DNA fragments and immobilizing the ligation products on beads, which are then placed on a water-oil emulsion (116). The beads on which DNA amplification occurs are deposited on glass slides and subjected to sequential hybridization with a universal PCR primer complementary to the adapters. The ligation step is then followed by fluorescence detection.

Ion Torrent Sequencing Technology (PGM, Proton, S5 Series)

Ion Torrent introduced the personal genome machine (PGM) in 2010 as a cost effective platform for DNA sequencing (117). Unlike other sequencing technology, Ion Torrent does not make use of optical signals (118) but rather utilizes an enzymatic cascade to generate a signal. The Ion Torrent system utilizes high-density micro-machined wells to carry out nucleotide additions in a massively parallel approach. Each micro-well contains a different DNA template. There is an ion-sensitive layer and an ion-sensor located under each well. The technology works on the principle of detecting the proton (H+) released during the incorporation of each dNTP in a growing DNA template. The release of H+ ion results in a change in pH that is detected by an integrated ion-sensitive, field-effect transistor (ISFET) (117). In the case of two identical bases, the output voltage is doubled. Ion torrent platform can generate upto 10 Gb of sequence data in a single run, with a maximum of 50 million reads having an average read length of 200 bases. The PGM can also provide 5.5 million reads having an average read length of 400 bp, producing a maximum of 2 Gb of sequence data from 318 V2 chip. A notable aspect of this technology is the size-selection step in which sequencing of longer fragments is omitted (https://www.thermofisher.com/in/en/home/brands/ion-torrent.html). Additional information about Ion Torrent technology can be obtained from https://www.thermofisher.com/in/en/home/brands/ion-torrent.html.

The Third Generation of Sequencing Technology

At present, the described sequencing technologies are the most commonly used for metagenome projects, however, sequencing technologies have undergone rapid advances during the past few years to attempt to resolve the biases associated with the current methods and to obtain a better balance between data yield, read length, and cost. These efforts have resulted in third generation sequencing technologies, such as Oxford Nanopore (119), and PacBiosequencing platforms (120) which are single-molecule and real-time technologies that reduce amplification bias, as well as short read length problems. The reduction in the cost and time presented by these sequencing methods are valuable asset. Although the error rate with the newer technologies is much higher relative to the other described sequencing technologies, this problem can be addressed by increasing the sequencing depth.

Pacific Biosciences

Pacific Biosciences established the first DNA sequencer that utilizes a single-molecule, real-time sequencing (SMRT) approach. This sequencing platform has become one of the most widely used third-generation sequencing technologies (121). The platform is based on the sequencing by synthesis principle. Pacific Biosciences makes use of the same fluorescent dyes as other NGS technologies, however, instead of carrying out the cycles of nucleotide amplification in the same manner as other sequencing technologies, the signals emitted upon the incorporation of the nucleotides are detected in real time. Sequencing is carried out on a chip (SMRT cell) that contains several zero mode wave (ZMW) guides. A single DNA polymerase is immobilized to the bottom of each ZMW guide with a molecule of single stranded DNA template (122). Four phospholinked nucleotides, each labeled with a different fluorescent dye producing a distinct emission spectrum, are also added to SMRT cells. Once the nucleotide is incorporated by the DNA polymerase, a light signal is produced and a base call is made and recorded (122).

Helicos Biosciences

Heliscope was released by Helicos Biosciences in 2007. It is also a single-molecule sequencing device. Sequencing is carried out in a glass flow cell with 25 channels for samples. The samples can either be replicates of the same sample or different samples. The Heliscope platform utilizes emulsion PCR amplification of DNA fragments in order to obtain significantly higher signals for reliable base detection by multiple charge-coupled device cameras. Single-molecule sequencing methods have the potential to deliver consistently low error rates by eliminating amplification-related bias, intensity averaging, and synchronization problems (123, 124). In the Heliscope platform, 100–200 oligonucleotide fragments are initially immobilized on a proprietary substrate within a microfluidics flow cell. Fluorescence-labeled nucleotides are then introduced individually and are incorporated by DNA polymerase into the growing complementary strand. The fluorophore-bearing nucleotide increases detectability and eliminates the need for amplification of the DNA template. Images are recorded and analyzed to identify the nucleotide that has been incorporated into the growing strand before the cycle begins with a different fluorescently-labeled nucleotide. At present, the Heliscope can only provide a read length of 35 nucleotides (111). Additional information can be obtained at http://www.helicosbio.com.

Oxford Nanopore Sequencing

Oxford Nanopore Technologies (ONT) is at the forefront of developing nanopore sequencing technology (http://www.nanoporetech.com/). The Nanopore platform does not require an amplification step as a part of library preparation. The novelty of this approach is that the DNA strand to be sequenced can be directly analyzed. Oxford Nanopore Technologies introduced the MinION (125) device in 2014. It has the potential to provide longer reads with better resolution of repeated sequence elements and structural genomic variants (126). MinION is a mobile, single-molecule Nanopore sequencer measuring four inches in length and is connected to a laptop with USB 3.0. Nanopore sequencing technology is based on the principle of modulation of the ionic current as a DNA molecule traverses through the nanopore, revealing characteristics of the molecule such as conformation, length and diameter. The pore consists of a protein within a conductive electrolytic solution which creates a small potential gradient across the protein pore (127). MinION mk1B is a pocket-sized portable sequencing device, containing 512 nanopore channels, and can be directly linked to a computer for data collection. More recently, a more advanced device, “PromethION,” has been commercialized (127). PromethION is a benchtop sequencer possessing 48 individual flow cells, each consisting of 3,000 pores that are equivalent to 48 MinIONs processing 500 bp/s (128). The capabilities of this instrument provide sequencing power that is sufficient to conduct sequencing of large genomes, such as the human genome. Additional information on Oxford Nanopore sequencing technologies can be obtained at https://www.nanoporetech.com.

So far, the present review has provided an overview of the first through third generations of sequencing technology that have provided significant improvements in the ability to conduct microbiome research. Metagenomic and other omic approaches are the most effective methods that can be used to characterize microbial communities, as well as their metabolic activity. It is now feasible to obtain information on the composition (taxa), diversity, pathogenesis, evolution, and drug resistance of microbes. The selection of any of the above mentioned platforms, however, should be mainly dependent on the aim, design, and purpose of the study. Illumina sequencing technology has made tremendous advances in data output and cost efficiency over the past few years and as a result, presently dominates the NGS market (129, 130). Illumina sequencing technology has been used extensively in numerous microbiome research projects (131133), including the Human Microbiome Project (44). While both Ion Torrent and Illumina sequencers provide a number of advantages in terms of their cost and efficiency, the short read lengths they provide make them less appropriate for addressing a number of scientific questions, including detection of gene isoforms, methylation detection, and genome assembly (118). SMRT (single-molecule real-time) sequencing platforms offer approaches that are more suited for these research objectives. Since PacBiosequencing generates longer reads that provide longer scaffolds (134136), it is well suited for denovo genome assembly. The commercial availability of MinION sequencers by Oxford Nanopore Technologies, which resemble a USB flash drive in appearance, has also enabled applications that require long-read sequencing (137, 138). The efficiency, long read lengths, and single-base sensitivity make nanopore sequencing technology a promising approach for high-throughput sequencing. The MinION system has been used for sequencing the genomes of infectious agents, including the analysis of bacterial antibiotic resistance islands (137), the influenza virus (139) and genome surveillance of the Ebola virus (140). The advancements in high-throughput sequencing technologies now provide the opportunity to choose different sequencing platforms to conduct microbiome research. In a comparative analysis of the Illumina MiSeq, Ion Torrent PGM, and 454 GS Junior sequencing platforms, Loman et al. (141) reported that Illumina provided the highest output per run (1.6Gb/run, 60Mb/h) and the lowest error rates. In a study comparing different sequencing platforms (Ion Torrent PGM, Illumina MiSeq and HiSeq) for the shotgun sequencing of six human stool samples, Clooney et al. (142) concluded that the best assembly values were obtained using the Illumina HiSeq platform, in which 10 million reads per sample were produced. In contrast, the Illumina MiSeq and Ion Torrent PGM did not produce a sufficient number of reads to produce an adequate genome assembly (143).

Correlation Between the Microbiome and Infectious Diseases

Human gut microbiome signatures exhibit individual specificity. There is a high degree of inter individual variation that is based on both host genetics and environmental factors (144, 145). The high degree of individual specificity, however, has hampered our understanding of function of the gut microbiome and its importance in health and disease. The human gut microbiome exhibits a high degree of plasticity, mainly in response to dietary changes that support a healthy gut ecosystem and minimize disease risk (146). The onset of new methodologies, including NGS and bioinformatic pipelines, have resulted in a paradigm shift in the fields of clinical microbiology and infectious diseases due to the realization of the complex interactions that occur within the microbiome. The relationship between human pathogens, infectious diseases, and the gut microbiome are slowly being revealed. Several studies have examined the correlation between the human gut microbiome and health status (141, 142). Reports have indicated that while the gut microbiome appears to be relatively stable under healthy conditions, any qualitative or quantitative changes in the gut microbiome can result in functional modifications and disease as reported (144, 147149). A rich level of bacterial diversity is considered to be an indicator of a healthy status, while a low level of bacterial diversity is correlated with inflammatory, immune, and obesity-related diseases (58, 144, 147153). Several studies have indicated that the human microbiota plays a crucial role in human health and disease (68, 154168). Studies have also revealed that microbial symbiosis plays a central role in the development of a number of diseases, including liver diseases (156), metabolic disorders (154), gastrointestinal (GI) malignancy (157), respiratory diseases (158), autoimmune diseases (160), and mental or psychological diseases (160). Johnson et al. (169) discussed the Bacteroidetes, one of main components of the microbiome, their genetic variability and contrasting effect on metabolic diseases such as obesity and type II diabetes (169). Yiu et al. (170) proposed that body weight, metabolism, and diseases such as obesity are affected by the interplay between the immune system, metabolism and microbiome (170). In discussing chronic IBD, Frick and Wehkamp (171) outlined some of the available therapeutic interventions that can be used to alter mucosal immunity and the composition of the microbiome. While studying the molecular aspects of human gut-brain interactions, Lee et al. (172) demonstrated how the microbiota influences host physiology and neurodegenerative and neurological developmental diseases (172).

Bioinformatic Pipelines for Metagenomic Data Analysis

The advances in NGS have resulted in the production of massive datasets that are increasingly difficult to analyse (128). As larger datasets are generated, more sophisticated computational resources and bioinformatic tools are required. The interpretation and understanding of metagenomic studies depend on the computational tools that can be used to analyse enormous data sets and mine valuable, useful, and valid information regarding the microbial communities being studied. Bioinformatic tools used for metagenomic analysis, especially for translating raw sequences into meaningful data, are continually developing with the aim of providing the ability to examine both the taxonomic and the functional composition of diverse metagenomes (173, 174). A number of the specialized software programs available for analysing the metagenomic data are listed (Table 2). Based on the list provided, an example of a comparative analysis pipeline is presented in the present review that takes into consideration user friendliness, ease of access, open source availability, ability to analyse metagenomic datasets, and ability to provide graphical representations of the analyzed data (Figure 5). A description of the software (MG-RAST, EBI, QIIME and Mothur) used in the different pipelines is described in Table 3, which provides a detailed summary of the functionality and features of the mentioned software programs. The four pipelines share several steps during the analysis such as quality control, clustering, and annotation (Figure 5).

Table 2

SoftwareApplicationLink (website)References
FastQCFastQC, a java based application is performed via a series of analysis modules.
FastQC can either run in a non-interactive mode or in a standalone interactive mode.
FastQC is a quality control tool used for high-throughput sequence data via a series of modular options and giving graphical results of length distribution, quality per base sequence, N numbers, GC content, over representation and duplication.
http://www.bioinformatics.babraham.ac.uk/projects/fastqc/(175)
Fastx-ToolkitFastx is a command based tool kit for the quality control of short-reads and allows processing, format conversion, collapsing and cutting on the basis of sequence identity and length.http://hannonlab.cshl.edu/fastx_toolkit/index.html(176)
PRINSEQA standalone tool allows integration and analysis into the existing data processing pipelines. PRINSEQ as a tool offers a computational resource that is able to handle huge amount of data generated by next-generation sequencers. It is used for sequences trimming based on in the di-nucleotides occurrences and the sequence duplication (mainly 5′/3′).http://prinseq.sourceforge.net/(177)
NGS QC ToolkitNGS QC Toolkit encompasses user-friendly standalone tools for the quality control of the sequence data generated by next-generation sequencing platforms. The analysis is performed in a parallel environment.http://www.nipgr.res.in/ngsqctoolkit.html(178)
Meta-QC-ChainMeta-QC-Chain is a tool for the quality control analysis performed in parallel environment. Performs mapping against 18S rRNA databases in order to remove the eukaryotic contaminant sequences.http://www.computationalbioenergy.org/qc- chain.html(179)
MothurMothur is an open-source, expandable software used for the quality analysis of reads to taxonomic classification, ribosomal gene meta-profiling comparison and calculus of diversity estimators.http://www.mothur.org/(180)
QIIMEQIIME pipeline is designed for the task of analyzing microbial communities sampled via a marker gene (16S or 18S rRNA) amplicon sequencing. In its heart pipeline QIIME performs quality pre-treatment of raw-reads, calculate estimates diversity estimates, taxonomic annotation and comparison of metagenomic data.http://qiime.org/(181)
MEGANMEGAN is a graphical interface tool that allows both taxonomic as well as functional analysis of metagenomic reads. It is based on the BLAST output of short reads and performs comparative metagenomics.http://ab.inf.uni-tuebingen.de/software/megan/(20)
CARMACARMA provides a clear quantitative and statistical characterization of phylogenetic classification of the reads based on Pfam conserved domains.http://omictools.com/carma-s1021.html(182)
PICRUStPICRUSt is a tool that serves in the field of metagenomic analysis where the prediction of the metabolic potential is done from the taxonomic information obtained via 16S rRNA meta-profiling projects. PICRUSt could be thought of as an automated substitute to manually mining the gene families that are believed to be present in organisms whose sequences are found in a 16S ribosomal RNA.http://picrust.github.io/picrust/(183)
TETRATERTA is a web-based stand alone program used for the Taxonomic classification and comparison of tetra nucleotide patterns with in a DNA sequence.http://www.megx.net/tetra(184)
PhylophytiaSComposition-based classifier of sequences based on reference genomes signatureshttps://omictools.com/pps-tool(185)
MOCATMOCAT is a highly configurable and modular pipeline that includes the quality treatment of metagenomic reads based on single copy marker genes classification and gene-coding prediction. The pipeline makes use of a state-of-the-art program to map quality control and assemble reads from metagenome samples sequenced at a very high depth (several billion base pairs).http://www.bork.embl.de/mocat/(186)
Parallel-metaParallel-meta is a comprehensive and automotive software package that offers fast data mining and metabolic function across large number of metagenomic datasets. The functional annotation is based on BLAST best hit results.http://www.computationalbioenergy.org/parallel-meta.html(187)
MetaclusterTAMetaclusterTA is a tool used for the Taxonomic annotation that is based on the binning of reads and contigs. Dependent on reference genomes.http://i.cs.hku.hk/~alse/MetaCluster/(188)
MaxBinMaxBin software is used for the unsupervised binning of metagenomic sequences based on an Expectation-maximization algorithm. For user's expediency MaxBin reports genome-related statistics including GC content, genome size and completeness.http://bowtie-bio.sourceforge.net/index.shtml(189)
Amphora and Amphora2Amphora and Amphora2 is used for the Metagenomic phylotyping via single copy phylogenetic marker genes classification.http://pitgroup.org/amphoranet/(102, 190)
BWABWA is an algorithm used for the mapping of short-low-divergent sequences to large references. It is based on Burrows–Wheeler transform.http://bio-bwa.sourceforge.net/(191)
BowtieBowtie is a fast short read aligner to long reference sequences based on Burrows–Wheeler transform.http://bowtiebio.sourceforge.net/index.shtml(192)
GenometaGenometa is a graphical interface applied for taxonomic and functional annotation of short-reads metagenomic data.http://genomics1.mhhannover.de/genometa/(193)
SOrt-ITEMSSOrt-Items is a tool used for taxonomic annotation via alignment-based orthology of metagenomic reads.https://omictools.com/sort-items-tool(194)
DiScRIBinATETaxonomic assignment by BLASTx best hits classification of reads.https://www.westgrid.ca/support/software/discribinate(195)
IDBA-UDIDBA-UD is a denovo assembler of metagenomic sequences with uneven depth.http://i.cs.hku.hk/~alse/hkubrg/projects/idba_ud/(196)
MetaVelvetMetaVelvet is a denovo assembler of metagenomic short reads.http://metavelvet.dna.bio.keio.ac.jp/(197)
RayMetaRayMeta, a denovo assembler of metagenomic reads and taxonomy profiler by Ray Communities.http://denovoassembler.sourceforge.net/(198)
MetaGeneMarkMetaGeneMark is a gene coding sequences predictor from metagenomic sequences by heuristic model.http://exon.gatech.edu/index.html(199)
GlimmerMGGlimmerMG is a gene coding sequences predictor from metagenomic sequences by unsupervised clustering.http://www.cbcb.umd.edu/software/glimmer-mg/(200)
FragGeneScanFragGeneScan is a gene coding sequences predictor from short reads.http://sourceforge.net/projects/fraggenescan/(201)
CD-HITCD-HIT is a tool used for clustering and comparing of sequences of nucleotides or protein.http://weizhongli-lab.org/cd-hit/(202)
HMMER3HMMER3 is a free and commonly used software package for sequence analysis. It is a Hidden Markov based model used to perform sequences alignments. Used for the identification of the homologus nucleotide and protein sequenceshttp://hmmer.janelia.org/(203)
BLASTXBasic local alignment of translated sequenceshttp://blast.ncbi.nlm.nih.gov/blast/Blast.cgi(203)
MetaORFAMetaORFA is applied for the assembly of peptides obtained from predicted ORFs.Website not available(204)
MinPathMinPath is a tool used for reconstruction of pathways from protein family predictions.http://omics.informatics.indiana.edu/MinPath/(205)
MetaPathMetaPath is used for the identification of metabolic pathways that are differentially abundant within the metagenomic samples.http://metapath.cbcb.umd.edu/(206)
GhostKOALAGhostKOALA is KEGG's internal annotator of metagenomes by k-number assignment by GHOSTX searches against a non-redundant database of KEGG genes.http://www.kegg.jp/ghostkoala/(207)
RAMMCAPRAMMCAP is used for the metagenomic functional annotation and data clustering.http://weizhong-lab.ucsd.edu/rammcap/cgi-bin/rammcap_2d.cgi(208)
ProViDEProViDE is a tool for the analysis of viral diversity in metagenomic samples.https://omictools.com/provide-tool(209)
PhyloseqPhyloseq is a tool-kit for raw reads pre-processing, diversity analysis and graphics production. It is an R, Bioconductor package.https://joey711.github.io/phyloseq/(210)
Metagenome SeqMetagenomeSeq is designed to determine the analysis of differential abundance of 16S rRNA gene in metaprofiling data. It is also designed to address the effects of both under-sampling and normalization of microbial communities on the basis of disease association detection.http://bioconductor.org/packages/release/bioc/html/metagenomeSeq.html(211)
Shotgun Functionalize RShotgun Functionalize is an R-Package for the functional assessment of metagenomic data. The package includes tools designed for importing, annotating and visualizing metagenomic data generated via high-throughput sequencing.http://shotgun.math.chalmers.se/(212)
Galaxy portalGalaxy portal is a web repository of computational tools that can be run without informatics expertise. It is a graphical interface and free service.https://usegalaxy.org/(213)
MG-RASTMG-RAST an open source web application is used for the automatic phylogenetic and functional analysis of metagenomes. MG-RAST is one of the biggest repositories for metagenomic data. It is a Graphical interface, web portal and free service.http://metagenomics.anl.gov/(214)
IMG/MIMG (Integrated Microbial Genomes) system serves as a community resource for the analysis, functional annotation and phylogenetic distribution of genes and comparative metagenomics. It is a graphical interface, web portal and free in service.https://img.jgi.doe.gov/(215)
PhinchPhinch is an open source, interactive exploratory data visualizing tool intended to alleviate the analysis of meta-omic datasets. The main features of this software are streamlined visualization workflow, sleek user interface, novel exploration of larger datasets. Accessible via web browser.http://phinch.org(215)
CAMERACAMERA is an important tool that aims to bridge the gaps and to develop methods so as to monitor microbial communities of the oceans. CAMERA's databases incorporate both the genomic and metagenomic datasets, metadata, results from the pre computed analysis and softwares that endorse commanding cross-analysis of the environmental metagenomes.https://omictools.com/camera-2-tool(216)
Meta CompMeta Comp is a graphical inclusive analysis tool that encompasses a series of statistical analysis approaches along with visualized results for comparative analysis of metagenomics as well as other meta-omics data sets. The software has the features to read files generated via different upstream analysis programs. It has also got the features to automatically choose two-group sample test.http://cqb.pku.edu.cn/ZhuLab/MetaComp/(216)

Lists of software's used in metagenomics analysis.

Figure 5

Figure 5

Overview of the workflow used by metagenomic analysis tools (QIIME, Mothur, EBI and MG-RAST).

Table 3

EBIMGRASTQIIME/QIIME 2MOTHUR
LicenseFree open-sourceFree open-sourceFree open-sourceFree open-source
Implementation (release candidate)PythonPythonPythonC++
Current Version available (March 2018)4.14.0.31.9.1 and 2017.6.0, respectively1.39.5
Websitehttp://www.ebi.ac.uk/metagenomicshttp://metagenomics.anl.gov/http://qiime.org/ and https://qiime2.org/http://www.mothur.org/
Primary UsageGUIGUICL and GUI, respectivelyCL
Amplicon data AnalysisYesYesYesYes
Whole genome shotgun analysisYesYesYes but only experimentalNo
Sequencing technology compatibilitySanger, PacBio, Ion Torrent, Illumina, NanoporeSanger, PacBio, Ion Torrent, Illumina, NanoporeSanger, PacBio, Ion Torrent, Illumina, NanoporeSanger, PacBio, Ion Torrent, Illumina, Nanopore
Quality controlYesYesYesYes
16S rRNA gene Databases searchedSilva, Rfam, MAPSeq, Pfam, TIGRFAM, Prints, Prosite patterns, Gene 3dSilva, M5RNA, RDP and GreengenesGreengenes, RDP, Siva and UniteRDP, Greengenes Silva and Unite
Alignment methodPyNAST, MUSCLE, INFERNALBLATPyNAST, MUSCLE, INFERNALNeedleman-Wunsch, Blastn, Gotoh
Taxonomic assignmentUCLUST, BLAST, Mothur, RDPBLATUCLUST, BLAST, Mothur, RDPWang/RDP approach
Clustering algorithmUCLUST, BLAST Mothur, CD-HITUCLUSTUCLUST, BLAST Mothur, CD-HITMothur, CD-HIT and adapts DOTUR
Diversity analysisAlpha and betaAlphaAlpha and betaAlpha and beta
Phylogenetic TreeYESYESFastTreeClear cut algorithm
VisualizationT, BC, PC, HM, SC, PCA, Krona and CircosT, BC, PC, HM, SC, PCA, Krona and CircosT, BC, PC, HM, SC, PCAT, BC, PC, HM, SC, PCA, Dendrograms, Venn diagrams
Submitted projects as on March 2018Total: 1,653
Public: 1,503
Private:151
Total: 3,24,846
Public:52,615
Private:272,231
NANA

Comparative workflow of the four most commonly used bioinformatics pipeline for analyzing metagenomic datasets.

BC, Bar-Charts; BLAT, Blast like Alignment Tool; CL, Command Line; EBI, European Bioinformatics Institute; GUI, Graphical User Interface; HM, Heat Map; MGRAST, Metagenomic Rapid Annotations using Subsystems Technology; OUT, Operational Taxonomic Unit; PC, Pie-Charts; PCA, Principal Component Analysis; QIIME, Quantitative Insights into Microbial Ecology; RDP, Ribosomal Database Project; SC, Stacked Columns; T, Tabulation.

Metagenomic Data Analysis Software: Command Based Vs. Graphical User Interface

As comprehensive metagenomic studies are becoming more common, they are yielding novel and important insights into the microbial communities in diverse environments; from terrestrial to aquatic ecosystems and from human skin to the human gastrointestinal tract. Advances in NGS have made it more possible than ever for researchers to conduct whole genome sequencing. The analysis of the datasets obtained from NGS is complex and require an intelligent and systematic approach to process the data efficiently. The results obtained from any metagenomic study relies on in silico computational tools that can analyse large data sets and can mine and highlight various aspects about the community being examined. Although the tools and databases developed to investigate the taxonomic composition of a microbial community and provide information on the functional aspects of the community are becoming more elaborate and complex, though CLC microbial genomic package offered by Qiagen are good for these analysis. Nanopore sequencing technology has presented an option for an analysis pipeline, with novel options for assembly and annotation. Figure 6, presents the workflow involved in metagenomic analysis, and indicates all the steps and tools used for analyzing the data generated from metagenomic sequencing. The metagenomic pipeline can utilize any of the presented approaches, based on type of sequencing data (targeted metagenomics or shotgun metagenomics). The flowchart summarizes the basic steps that are followed in the analysis pipeline starting from preprocessing of the sequencing data to the final extraction, storage, and presentation of the data. The most popular tools, along with the databases and algorithms employed for the analysis, are indicated.

Figure 6

Figure 6

Flow chart of basic metagenomics steps and tools currently in use.

Technology and the Changing Landscape of Metagenomic Research

Over the past decade, advancements in NGS have led to a significant reduction in the cost of genome sequencing. These technological advances have enabled the sequencing of several genomes in a day at a cost of approximately $1,000 per genome (Figure 7). The cost estimates presented in Figure 7 represent (A) cost in U.S, dollars per Mb of sequence data from 2001 to 2009, (B) cost in U.S, dollars per Mb from 2009 to 2017, (C) cost in U.S, dollars per Genome from 2001 to 2009, and (D) cost in U.S, dollars per Genome from 2009 to 2017. Although sequencing is now relatively easy and straight forward, NGS technology is not perfect and errors in the data do occur. Moreover, some regions of the DNA have not been successfully sequenced. The underlying costs associated with different approaches to sequencing genomes are of great importance because they impact the scope and scale of genomic projects. Decreases in sequencing costs have led to the establishment of large collaborative projects with broad goals and individual laboratories targeting more specific questions.

Figure 7

Figure 7

Timeline showing the sequencing cost (A) per Mb until year 2009, (B) per Mb between year 2009 and 2017, (C) per genome until year 2009, (D) per genome between year 2009 and 2017.

The decreasing cost structure of DNA sequencing has had and will continue to have an impact on genomics and bio-computing. With the size of databases expanding continuously, the translation of data into biological insight is becoming more and more important. As a result, data analysis a more prominent aspect in obtaining information and value from the data (217). Significant analytical efforts are needed to gain useful insights from the generated data. The fields of microbiology, biotechnology, and medicine are already benefiting from genome sequencing efforts, and as costs continue to decrease, the practice of genome sequencing is expected to become almost routine. For example, the Sanger Institute is sequencing the genomes of patients suffering from cancer and rare diseases as part of the 100,000 Genomes Project organized by Genomics England.

Some patients have already benefitted from metagenomic-based diagnoses and treatments, and researchers are continuing to gain more knowledge about the genetic variations that cause a variety of diseases. Sequencing, however, is not the only option for genetic analysis. An important part of the Precision Medicine Initiative, organized by the US National Institute of Health, is to develop a more predictable and possibly less technically complex method of genetic analysis. Sequencing, however, appears to be the only way to comprehensively explore the complex features of DNA that guide the initiation and progression of a number of diseases. Additionally, comprehensive sequencing also helps determine how our DNA keeps us healthy (218).

Future Perspectives of Metagenomics and Human Health

Though the field of metagenomics pre-dates NGS, modern high-throughput sequencing technologies have greatly transformed this promising field by enabling a comprehensive characterization of all microorganisms present in a sample. As metagenomic approaches become more developed and clinically corroborated, it is expected that metagenomics will be at the forefront as a method for diagnosing infectious diseases. When a complex or unknown infectious disease is encountered, the use of multiple conventional diagnostic tests can potentially lead to unnecessary expenses; more importantly, this can also result in the delay of a diagnosis. Metagenomics can be used to identify potential pathogens, both known and novel, and can also be used to assess the state of an individual's microbiome. As sequencing become easier, faster, and more cost-effective, it will be possible to serially characterize the human microbiota to explore changes that occur in the human microbiome over time. This knowledge could lead to the development of novel medicines and approaches for treating infectious diseases. Indeed, metagenomic studies may become so standard that DNA sequencers could be used in homes to monitor changes in the stool microbiome of an individual to guide the maintenance of health.

Conclusions

All forms of life on this planet are dependent on microbes. They define an environment and are in turn defined by it. Our understanding of host-pathogen systems, however, is only in its infancy. Over the past two decades, sequencing technology, along with bioinformatic tools, have improved significantly; making it feasible to explore microbial communities residing within diverse hosts. There is a strong recognition that the microbial diversity existing in extreme habitats has largely been unexplored. To gain insight into this “latent” microbial flora, novel methodologies are required. NGS technologies have provided a rapid, cost-efficient means of generating sequencing data and provided sequencing platforms that can be used in large genome-sequencing centers, as well as individual laboratories. Illumina, PacBio, and Applied Biosystems, have all announced upgraded versions of their respective DNA-sequencing platforms. These upgrades will increase high-throughput ability and read length, while at the same time significantly reduce the cost of sequencing per base. These developments will significantly contribute to and provide exciting new opportunities to microbiologists. The integration of several approaches to biological studies will be necessary to answer questions about the diversity and ecology of microbial flora. It is the opinion of the authors of the present review that the development of better bioinformatic tools for analysing metagenomic data is urgently needed. The vast amounts of metagenomic data that will be forthcoming will bring new challenges for analysing, storing, and transferring data. Genome-sequencing centers and laboratories are going to become more dependent on information technology and bioinformatics. Bioinformatic expertise will increasingly be necessary to analyse large amounts of data and to mine the data for useful information about microbial diversity. Metagenomics will play an increasing role in the fields of medicine, biotechnology, and environmental science. The authors hope that this review provides a clear overview of the sequencing platforms and bioinformatic analysis of software that are available, including their high value and limitations.

Statements

Author contributions

MM prepared the draft of the manuscript under the guidance of AK and SY. AD prepared the illustrations. EA and AH edited the manuscript.

Funding

MM (Y15465008) was supported by the University Ph.D. Fellowship for this study. AD had financial support through the DST Inspire Ph.D. Fellowship (IF160797) from the Department of Science and Technology, New Delhi, India. AH and EA would like to extend their sincere appreciation to the Deanship of Scientific Research at King Saud University for funding the Research Group No. (RGP-271).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

    Abbreviations

  • PCR

    Polymerase chain reaction

  • NGS

    Next generation sequencing

  • GIT

    Gastrointestinal tract

  • IBD

    Inflammatory bowel disease

  • IBS

    Irritable bowel syndrome

  • SCFA

    Short-chain fatty acids

  • PYY

    Peptide YY

  • HIV

    Human immunodeficiency virus

  • KEGG

    Kyoto Encyclopaedia of genes and genomes

  • GOLD

    Genomes Online Database

  • SBS

    Sequencing by synthesis

  • dNTP

    Deoxyribonucleotide triphosphate

  • PGM

    Personal genome machine

  • ISFET, Ion-sensitive

    field-effect transistor

  • SMRT, Single-molecule

    real-time sequencing

  • ZMW

    Zero mode wave

  • ONT

    Oxford Nanopore Technologies

  • MG-RAST

    Metagenomics Rapid Annotation using Subsystem Technology

  • EBI

    European Bioinformatics Institute

  • QIIME

    Quantitative Insights Into Microbial Ecology.

References

  • 1.

    FiererN. Embracing the unknown: disentangling the complexities of the soil microbiome. Nat Rev Microbiol. (2017) 15:579. 10.1038/nrmicro.2017.87

  • 2.

    BardgettRDvander Putten WH. Belowground biodiversity and ecosystem functioning. Nature (2014) 515:505. 10.1038/nature13855

  • 3.

    FuhrmanJA. Microbial community structure and its functional implications. Nature (2009) 459:193. 10.1038/nature08058

  • 4.

    VanDer Heijden MGABardgettRDVanStraalen NM. The unseen majority: soil microbes as drivers of plant diversity and productivity in terrestrial ecosystems. Ecol Lett. (2008) 11:296310. 10.1111/j.1461-0248.2007.01139.x

  • 5.

    GrahamEBKnelmanJESchindlbacherASicilianoSBreulmannMYannarellAet al. Microbes as engines of ecosystem function: when does community structure enhance predictions of ecosystem processes?Front Microbiol. (2016) 7:214. 10.3389/fmicb.2016.00214

  • 6.

    HamadyMKnightR. Microbial community profiling for human microbiome projects: tools, techniques, and challenges. Genome Res. (2009) 19:114152. 10.1101/gr.085464.108

  • 7.

    QinJLiRRaesJArumugamMBurgdorfKSManichanhCet al. A human gut microbial gene catalogue established by metagenomic sequencing. Nature (2010) 464:5965. 10.1038/nature08821

  • 8.

    KahnRAFuHRoyCR. Cellular hijacking: a common strategy for microbial infection. Trends Biochem Sci. (2002) 27:30814. 10.1016/S0968-0004(02)02108-4

  • 9.

    VieiraSMPagovichOEKriegelMA. Diet, microbiota and autoimmune diseases. Lupus (2014) 23:518526. 10.1177/0961203313501401

  • 10.

    WeinstockGM. Genomic approaches to studying the human microbiota. Nature (2012) 489:250610.1038/nature11553

  • 11.

    KallmeyerJPockalnyRAdhikariRRSmithDCD'HondtS. Global distribution of microbial abundance and biomass in subseafloor sediment. Proc Natl Acad Sci USA. (2012) 109:162136. 10.1073/pnas.1203849109

  • 12.

    ZhaoSWangJBuDLiuKZhuYDongZet al. Novel glycoside hydrolases identified by screening a Chinese Holstein dairy cow rumen-derived metagenome library. Appl Environ Microbiol. (2010) 76:67015. 10.1128/AEM.00361-10

  • 13.

    RosarioKBreitbartM. Exploring the viral world through metagenomics. Curr Opin Virol. (2011) 1:28997. 10.1016/j.coviro.2011.06.004

  • 14.

    FosterJANeufeldK-AM. Gut–brain axis: how the microbiome influences anxiety and depression. Trends Neurosci. (2013) 36:30512. 10.1016/j.tins.2013.01.005

  • 15.

    ScherJUAbramsonSB. The microbiome and rheumatoid arthritis. Nat Rev Rheumatol. (2011) 7:569. 10.1038/nrrheum.2011.121

  • 16.

    DevarajSHemarajataPVersalovicJ. The human gut microbiome and body metabolism: implications for obesity and diabetes. Clin Chem. (2013) 59:61728. 10.1373/clinchem.2012.187617

  • 17.

    TitoRYKnightsDMetcalfJObregon-TitoAJCleelandLNajarFet al. Insights from characterizing extinct human gut microbiomes. PLoS ONE (2012) 7:e51146. 10.1371/journal.pone.0051146

  • 18.

    AdlerCJDobneyKWeyrichLSKaidonisJWalkerAWHaakWet al. Sequencing ancient calcified dental plaque shows changes in oral microbiota with dietary shifts of the Neolithic and Industrial revolutions. Nat Genet. (2013) 45:450. 10.1038/ng.2536

  • 19.

    D'CostaVMKingCEKalanLMorarMSungWWLSchwarzCet al. Antibiotic resistance is ancient. Nature (2011) 477:45761. 10.1038/nature10388

  • 20.

    HusonDHWeberN. Microbial community analysis using MEGAN. Met. Enzymol. (2013) 531:46585. 10.1016/B978-0-12-407863-5.00021-6

  • 21.

    MarkowitzVMChenI-MAPalaniappanKChuKSzetoEPillayMet al. IMG 4 version of the integrated microbial genomes comparative analysis system. Nucleic Acids Res. (2013) 42:D56067. 10.1093/nar/gkt963

  • 22.

    MetzkerML. Sequencing technologies—the next generation. Nat Rev Genet. (2010) 11:31. 10.1038/nrg2626

  • 23.

    GlassEMMeyerF. The metagenomics RAST server: a public resource for the automatic phylogenetic and functional analysis of metagenomes. In: de BruijnFJ (editor). Handbook of Molecular Microbial Ecology I: Metagenomics and Complementary Approaches. New York, NY. (2011). p. 32531. 10.1002/9781118010518.ch37

  • 24.

    OulasAPavloudiCPolymenakouPPavlopoulosGAPapanikolaouNKotoulasGet al. Iliopoulos loannis. Metagenomics: tools and insights for analyzing next-generation sequencing data derived from biodiversity studies. Bioinform Biol Insights (2015) 9:BBI-S12462. 10.4137/BBI.S12462

  • 25.

    PandeyRVNolteVSchlöttererC. CANGS: a user-friendly utility for processing and analyzing 454 GS-FLX data in biodiversity studies. BMC Res Notes (2010) 3:3. 10.1186/1756-0500-3-3

  • 26.

    RothbergJMLeamonJH. The development and impact of 454 sequencing. Nat Biotechnol. (2008) 26:1117. 10.1038/nbt1485

  • 27.

    ThomasTGilbertJMeyerF. Metagenomics-a guide from sampling to data analysis. Microb Inform Exp. (2012) 2:3. 10.1186/2042-5783-2-3

  • 28.

    RouxAPayneSMGilmoreMS. Microbial telesensing: probing the environment for friends, foes, and food. Cell Host Microbe (2009) 6:11524. 10.1016/j.chom.2009.07.004

  • 29.

    DubeyAKumarAAbd_AllahEFHashemAKhanML. Growing more with less: Breeding and developing drought resilient soybean to improve food security. Ecol Indic. (2018). 10.1016/j.ecolind.2018.03.003

  • 30.

    KumarAChordiaN. Role of microbes in human health. Appl Microbiol. (2017) 3:24. 10.4172/2471-9315.1000131

  • 31.

    AhmadIKhanMSAAqilFSinghM. Microbial applications in agriculture and the environment: a broad perspective. In: AhmadIAhmadFPichtelJ (editors). Microbes and Microbial Technology: Agricultural and Environmental Applications. New York, NY: Springer (2011). p. 127. 10.1007/978-1-4419-7931-5_1

  • 32.

    MallaMADubeyAYadavSKumarAHashemAAbd-AllahEF. Understanding and designing the strategies for the microbe-mediated remediation of environmental contaminants using omics approaches. Front Microbiol. (2018) 9:1132. 10.3389/fmicb.2018.01132

  • 33.

    HartmanKvander Heijden MGAWittwerRABanerjeeSWalserJ-CSchlaeppiK. Cropping practices manipulate abundance patterns of root and soil microbiome members paving the way to smart farming. Microbiome (2018) 6:14. 10.1186/s40168-017-0389-9

  • 34.

    SinghRR. Next generation sequencing technologies. ComprehMed Chem III. 35461. 10.1016/B978-0-12-409547-2.12327-3

  • 35.

    MadsenEL. Microorganisms and their roles in fundamental biogeochemical cycles. Curr Opin Biotechnol. (2011) 22:45664. 10.1016/j.copbio.2011.01.008

  • 36.

    ZhouJHeZYangYDengYTringeSGAlvarez-CohenL. High-throughput metagenomic technologies for complex microbial community analysis: open and closed formats. MBio (2015) 6:e02288-14. 10.1128/mBio.02288-14

  • 37.

    TripathiMSinghDVikramSSinghVKumarS. Metagenomic approach towards bioprospection of novel biomolecule (s) and environmental bioremediation. Annu Res Rev Biol. (2018) 22:112. 10.9734/ARRB/2018/38385

  • 38.

    WuGDLewisJD. Analysis of the human gut microbiome and association with disease. Clin Gastroenterol Hepatol. (2013) 11:7747. 10.1016/j.cgh.2013.03.038

  • 39.

    CocolinLDolciPRantsiouK. Biodiversity and dynamics of meat fermentations: the contribution of molecular methods for a better comprehension of a complex ecosystem. Meat Sci. (2011) 89:296302. 10.1016/j.meatsci.2011.04.011

  • 40.

    ErcoliniD. PCR-DGGE fingerprinting: novel strategies for detection of microbes in food. J Microbiol Methods (2004) 56:297314. 10.1016/j.mimet.2003.11.006

  • 41.

    QuigleyLO'SullivanOBeresfordTPRossRPFitzgeraldGFCotterPD. Molecular approaches to analysing the microbial composition of raw milk and raw milk cheese. Int J Food Microbiol. (2011) 150:8194. 10.1016/j.ijfoodmicro.2011.08.001

  • 42.

    SunagawaSCoelhoLPChaffronSKultimaJRLabadieKSalazarGet al. Structure and function of the global ocean microbiome. Science (2015) 348:1261359. 10.1126/science.1261359

  • 43.

    OhJByrdALDemingCConlanSBarnabasBBlakesleyRet al. Biogeography and individuality shape function in the human skin metagenome. Nature (2014) 514:59. 10.1038/nature13786

  • 44.

    HuttenhowerCGeversDKnightRAbubuckerSBadgerJHChinwallaATet al. Structure, function and diversityof the healthy human microbiome. Nature (2012) 486:20714. 10.1038/nature11234

  • 45.

    VenterJCRemingtonKHeidelbergJFHalpernALRuschDEisenJAet al. Environmental genome shotgun sequencing of the Sargasso Sea. Science (2004) 304:6674. 10.1126/science.1093857

  • 46.

    BrownCTHugLAThomasBCSharonICastelleCJSinghAet al. Unusual biology across a group comprising more than 15% of domain Bacteria. Nature (2015) 523:208. 10.1038/nature14486

  • 47.

    DaimsHLebedevaEVPjevacPHanPHerboldCAlbertsenMet al. Complete nitrification by Nitrospira bacteria. Nature (2015) 528:504. 10.1038/nature16461

  • 48.

    vanKessel MAHJSpethDRAlbertsenMNielsenPHdenCamp HJMOKartalBet al. Complete nitrification by a single microorganism. Nature (2015) 528:555. 10.1038/nature16459

  • 49.

    LomanNJConstantinidouCChristnerMRohdeHChanJZ-MQuickJet al. A culture-independent sequence-based metagenomics approach to the investigation of an outbreak of Shiga-toxigenic Escherichia coli O104: H4. JAMA (2013) 309:150210. 10.1001/jama.2013.3231

  • 50.

    GeversDKugathasanSDensonLAVázquez-BaezaYVanTreuren WRenBet al. The treatment-naive microbiome in new-onset Crohn's disease. Cell Host Microbe (2014) 15:38292. 10.1016/j.chom.2014.02.005

  • 51.

    NormanJMHandleySABaldridgeMTDroitLLiuCYKellerBCet al. Disease-specific alterations in the enteric virome in inflammatory bowel disease. Cell (2015) 160:44760. 10.1016/j.cell.2015.01.002

  • 52.

    DoniaMSCimermancicPSchulzeCJBrownLCWMartinJMitrevaMet al. A systematic analysis of biosynthetic gene clusters in the human microbiome reveals a common family of antibiotics. Cell (2014) 158:140214. 10.1016/j.cell.2014.08.032

  • 53.

    LarsenOFAClaassenE. The mechanistic link between health and gut microbiota diversity. Sci Rep. (2018) 8:610. 10.1038/s41598-018-20141-6

  • 54.

    SungJKimSCabatbatJJTJangSJinYSJungGYet al. Global metabolic interaction network of the human gut microbiota for context-specific community-scale analysis. Nat Commun. (2017) 8:15393. 10.1038/ncomms15393

  • 55.

    WangMKarlssonCOlssonCAdlerberthIWoldAEStrachanDPet al. Reduced diversity in the early fecal microbiota of infants with atopic eczema. J Allergy Clin Immunol. (2008) 121:12934. 10.1016/j.jaci.2007.09.011

  • 56.

    AbrahamssonTRJakobssonHEAnderssonAFBjörksténBEngstrandLJenmalmMC. Low gut microbiota diversity in early infancy precedes asthma at school age. Clin Exp Allergy (2014) 44:84250. 10.1111/cea.12253

  • 57.

    KarlssonFTremaroliVNielsenJBäckhedF. Assessing the human gut microbiota in metabolic diseases. Diabetes (2013) 62:33419. 10.2337/db13-0844

  • 58.

    BisgaardHLiNBonnelykkeKChawesBLKSkovTPaludan-MüllerGet al. Reduced diversity of the intestinal microbiota during infancy is associated with increased risk of allergic disease at school age. J Allergy Clin Immunol. (2011) 128:64652. 10.1016/j.jaci.2011.04.060

  • 59.

    FerreiraCMVieiraATVinoloMAROliveiraFACuriRMartinsF dos S. The central role of the gut microbiota in chronic inflammatory diseases. J Immunol Res. (2014) 2014:689492. 10.1155/2014/689492

  • 60.

    KennedyPJCryanJFDinanTGClarkeG. Irritable bowel syndrome: a microbiome-gut-brain axis disorder?World J Gastroenterol. (2014) 20:14105. 10.3748/wjg.v20.i39.14105

  • 61.

    LakhanSEKirchgessnerA. Gut inflammation in chronic fatigue syndrome. Nutr Metab (Lond) (2010) 7:79. 10.1186/1743-7075-7-79

  • 62.

    CastellarinMWarrenRLFreemanJDDreoliniLKrzywinskiMStraussJet al. Fusobacterium nucleatum infection is prevalent in human colorectal carcinoma. Genome Res. (2012) 22:299306. 10.1101/gr.126516.111

  • 63.

    KauALAhernPPGriffinNWGoodmanALGordonJI. Human nutrition, the gut microbiome and the immune system. Nature (2011) 474:327. 10.1038/nature10213

  • 64.

    AfricaCWJNelJStemmetM. Anaerobes and bacterial vaginosis in pregnancy: virulence factors contributing to vaginal colonisation. Int J Environ Res Public Health (2014) 11:69797000. 10.3390/ijerph110706979

  • 65.

    LachGSchellekensHDinanTGCryanJF. Anxiety, depression, and the microbiome: a role for gut peptides. Neurotherapeutics (2018) 15:3659. 10.1007/s13311-017-0585-0

  • 66.

    GeukingMBMcCoyKDMacphersonAJ. Metabolites from intestinal microbes shape Treg. Cell Res. (2013) 23:133940. 10.1038/cr.2013.125

  • 67.

    TremaroliVBäckhedF. Functional interactions between the gut microbiota and host metabolism. Nature (2012) 489:2429. 10.1038/nature11552

  • 68.

    GillSRPopMDeboyRTEckburgPBTurnbaughPJSamuelBSet al. Metagenomic analysis of the human distal gut microbiome. Science (2006) 312:13559. 10.1126/science.1124234

  • 69.

    MedinaMIzquierdoEEnnaharSSanzY. Differential immunomodulatory properties of Bifidobacterium logum strains: relevance to probiotic selection and clinical applications. Clin Exp Immunol. (2007) 150:5318. 10.1111/j.1365-2249.2007.03522.x

  • 70.

    GuptaSAllen-VercoeEPetrofEO. Fecal microbiota transplantation: in perspective. Therap Adv Gastroenterol. (2016) 9:22939. 10.1177/1756283X15607414

  • 71.

    SpanogiannopoulosPBessENCarmodyRNTurnbaughPJ. The microbial pharmacists within us: a metagenomic view of xenobiotic metabolism. Nat Rev Microbiol. (2016) 14:27387. 10.1038/nrmicro.2016.17

  • 72.

    ShankEAKolterR. New developments in microbial interspecies signaling. Curr Opin Microbiol. (2009) 12:20514. 10.1016/j.mib.2009.01.003

  • 73.

    CornforthDMFosterKR. Antibiotics and the art of bacterial war. Proc Natl Acad Sci USA (2015) 112:108278. 10.1073/pnas.1513608112

  • 74.

    SobhaniITapJRoudot-ThoravalFRoperchJPLetulleSLangellaPet al. Microbial dysbiosis in colorectal cancer (CRC) patients. PLoS ONE (2011) 6:e16393. 10.1371/journal.pone.0016393

  • 75.

    CardingSVerbekeKVipondDTCorfeBMOwenLJ. Dysbiosis of the gut microbiota in disease. Microb Ecol Heal Dis. (2015) 26:26191. 10.3402/mehd.v26.26191

  • 76.

    NgKMFerreyraJAHigginbottomSKLynchJBKashyapPCGopinathSet al. post-antibiotic expansion of enteric pathogens. Nature (2013) 502:969. 10.1038/nature12503

  • 77.

    RoberfroidMGibsonGRHoylesLMcCartneyALRastallRRowlandIet al. Prebiotic effects: metabolic and health benefits. Br J Nutr. (2010) 104(Suppl. 2):S163. 10.1017/S0007114510003363

  • 78.

    CaniPDPossemiersSVanDe Wiele TGuiotYEverardARottierOet al. Changes in gut microbiota control inflammation in obese mice through a mechanism involving GLP-2-driven improvement of gut permeability. Gut (2009) 58:1091103. 10.1136/gut.2008.165886

  • 79.

    ArcherBJJohnsonSKDevereuxHMBaxterAL. Effect of fat replacement by inulin or lupin-kernel fibre on sausage patty acceptability, post-meal perceptions of satiety and food intake in men. Br J Nutr. (2004) 91:5919. 10.1079/BJN20031088

  • 80.

    WhelanKEfthymiouLJuddPAPreedyVRTaylorMA. Appetite during consumption of enteral formula as a sole source of nutrition: the effect of supplementing pea-fibre and fructo-oligosaccharides. Br J Nutr (2006) 96:3506. 10.1079/BJN20061791

  • 81.

    ParnellJAReimerRA. Weight loss during oligofructose supplementation is associated with decreased ghrelin and increased peptide YY in overweight and obese adults. Am J Clin Nutr. (2009) 89:17519. 10.3945/ajcn.2009.27465

  • 82.

    PetersHPFBoersHMHaddemanEMelnikovSMQvyjtF. No effect of added β-glucan or of fructooligosaccharide on appetite or energy intake. Am J Clin Nutr. (2009) 89:5863. 10.3945/ajcn.2008.26701

  • 83.

    HessJRBirkettAMThomasWSlavinJL. Effects of short-chain fructooligosaccharides on satiety responses in healthy men and women. Appetite (2011) 56:12834. 10.1016/j.appet.2010.12.005

  • 84.

    TariniJWoleverTMS. The fermentable fibre inulin increases postprandial serum short-chain fatty acids and reduces free-fatty acids and ghrelin in healthy subjects. Appl Physiol Nutr Metab. (2010) 35:916. 10.1139/H09-119

  • 85.

    FrostSD. Dynamics and evolution of HIV-1 during structured treatment interruptions. AIDS Rev. (2002) 4:11927.

  • 86.

    PoulinRThomasF. Epigenetic effects of infection on the phenotype of host offspring: parasites reaching across host generations. Oikos (2008) 117:3315. 10.1111/j.2007.0030-1299.16435.x

  • 87.

    GansJWolinskyMDunbarJ. Computational improvements reveal great bacterial diversity and high metal toxicity in soil. Science (2005) 309:138790. 10.1126/science.1112665

  • 88.

    BentSJPiersonJDForneyLJ. Measuring species richness based on microbial community fingerprints: the emperor has no clothes. Appl Environ Microbiol. (2007) 73:2399401. 10.1128/AEM.02383-06

  • 89.

    TesslerMNeumannJSAfshinnekooEPinedaMHerschRVelhoLFMet al. Large-scale differences in microbial biodiversity discovery between 16S amplicon and shotgun sequencing. Sci Rep. (2017) 7:114. 10.1038/s41598-017-06665-3

  • 90.

    MethéBANelsonKEPopMCreasyHHGiglioMGHuttenhowerCet al. A framework for human microbiome research. Nature (2012) 486:215. 10.1038/nature11209

  • 91.

    KanehisaMGotoSFurumichiMTanabeMHirakawaM. KEGG for representation and analysis of molecular networks involving diseases and drugs. Nucleic Acids Res. (2009) 38:D35560. 10.1093/nar/gkp896

  • 92.

    NealAL. Metagenomics: Current Advances and Emerging Concepts. Poole: Caister Academic Press (2017).

  • 93.

    LaskenRS. Genomic sequencing of uncultured microorganisms from single cells. Nat Rev Microbiol. (2012) 10:631. 10.1038/nrmicro2857

  • 94.

    MardisER. Next-generation DNA sequencing methods. Annu Rev Genomics Hum Genet. (2008) 9:387402. 10.1146/annurev.genom.9.081307.164359

  • 95.

    RoossinckMJ. Plant virus metagenomics: biodiversity and ecology. Annu Rev Genet. (2012) 46:35969. 10.1146/annurev-genet-110711-155600

  • 96.

    EckburgPBBikEMBernsteinCNPurdomEDethlefsenLSargentMet al. Diversity of the human intestinal microbial flora. Science (2005) 308:16358. 10.1126/science.1110591

  • 97.

    GibbonsRSocranskySdeAraujo WvanHoute J. Studies of the predominant cultivable microbiota of dental plaque. Arch Oral Biol. (1964) 9:36570. 10.1016/0003-9969(64)90069-X

  • 98.

    ParkerRBSnyderML. Interactions of the oral microbiota I. A system for the defined study of mixed cultures. Exp Biol Med. (1961) 108:74952. 10.3181/00379727-108-27055

  • 99.

    SangerFCoulsonAR. A rapid method for determining sequences in DNA by primed synthesis with DNA polymerase. J Mol Biol. (1975) 94:441-6. 10.1016/0022-2836(75)90213-2

  • 100.

    HutchisonCA. DNA sequencing: bench to bedside and beyond. Nucleic Acids Res. (2007) 35:622737. 10.1093/nar/gkm688

  • 101.

    SangerF. Sequences, sequences, and sequences. Annu Rev Biochem. (1988) 57:129. 10.1146/annurev.bi.57.070188.000245

  • 102.

    GoodwinSMcPhersonJDMcCombieWR. Coming of age: ten years of next-generation sequencing technologies. Nat Rev Genet. (2016) 17:33351. 10.1038/nrg.2016.49

  • 103.

    PesterMRatteiTFlechlSGröngröftARichterAOvermannJet al. amoA-based consensus phylogeny of ammonia-oxidizing archaea and deep sequencing of amoA genes from soils of four different geographic regions. Environ Microbiol. (2012) 14:52539. 10.1111/j.1462-2920.2011.02666.x

  • 104.

    CaporasoJGLauberCLWaltersWABerg-LyonsDHuntleyJFiererNet al. Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms. ISME J (2012) 6:16214. 10.1038/ismej.2012.8

  • 105.

    NyrénPLundinA. Enzymatic method for continuous monitoring of inorganic pyrophosphate synthesis. Anal Biochem. (1985) 151:5049. 10.1016/0003-2697(85)90211-8

  • 106.

    HymanED. A new method of sequencing DNA. Anal Biochem. (1988) 174:42336. 10.1016/0003-2697(88)90041-3

  • 107.

    ThakkarJRSabaraPHKoringaPG. Exploring metagenomes using next-generation sequencing. In: SinghRKothariRKoringaPSinghS (editors). Understanding Host-Microbiome Interactions–An Omics Approach: Omics of Host-Microbiome Association. Singapore: Springer (2017). p. 2940. 10.1007/978-981-10-5050-3_3

  • 108.

    GranbergFVicente-RubianoMRubio-GuerriCKarlssonOEKukielkaDBelákSet al. Metagenomic detection of viral pathogens in Spanish honeybees: co-infection by aphid lethal paralysis, Israel acute paralysis and Lake Sinai viruses. PLoS ONE (2013) 8:e57459. 10.1371/journal.pone.0057459

  • 109.

    QuailMASmithMCouplandPOttoTDHarrisSRConnorTRet al. A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers. BMC Genomics (2012) 13:341. 10.1186/1471-2164-13-341

  • 110.

    AnsorgeWJ. Next-generation DNA sequencing techniques. N Biotechnol. (2009) 25:195203. 10.1016/j.nbt.2008.12.009

  • 111.

    MacLeanDJonesJDGStudholmeDJ. Application of'next-generation'sequencing technologies to microbial genetics. Nat Rev Microbiol. (2009) 7:287. 10.1038/nrmicro2122

  • 112.

    ZhangJChiodiniRBadrAZhangG. The impact of next-generation sequencing on genomics. J Genet Genom. (2011) 38:95109. 10.1016/j.jgg.2011.02.003

  • 113.

    SchusterSCChiKRRuskNKiermerVWoldBMyersRM. Method of the year, next-generation DNA sequencing. Funct Genomics Med Appl Nat Methods (2008) 5:1121.

  • 114.

    KarowJ.Qiagen Launches GeneReader NGS System at AMP; Presents Performance Evaluation by Broad. GenomeWeb (2015). Available online at: https://www.genomeweb.com/molecular-diagnostics/qiagen-launches-genereader-ngs-system-amp-presents-performance-evaluation

  • 115.

    DouglasR. Smith Kevin McKernan. Methods of producing and sequencing modified polynucleotides. Appl Biosyst. (2010) 2.

  • 116.

    MarguliesMEgholmMAltmanWEAttiyaSBaderJSBembenLAet al. Corrigendum: genome sequencing in microfabricated high-density picolitre reactors. Nature (2006) 441:120. 10.1038/nature04726

  • 117.

    MerrimanBTorrentIRothbergJMTeamR. Progress in ion torrent semiconductor chip based sequencing. Electrophoresis (2012) 33:3397417. 10.1002/elps.201200424

  • 118.

    RothbergJMHinzWRearickTMSchultzJMileskiWDaveyMet al. An integrated semiconductor device enabling non-optical genome sequencing. Nature (2011) 475:348. 10.1038/nature10242

  • 119.

    KasianowiczJJBrandinEBrantonDDeamerDW. Characterization of individual polynucleotide molecules using a membrane channel. Proc Natl Acad Sci U.S.A. (1996) 93:137703. 10.1073/pnas.93.24.13770

  • 120.

    FichotEBNormanRS. Microbial phylogenetic profiling with the Pacific Biosciences sequencing platform. Microbiome (2013) 1:10. 10.1186/2049-2618-1-10

  • 121.

    KchoukMGibratJFElloumiM. Generations of sequencing technologies: from first to next generation. Biol Med. (2017) 9:395. 10.4172/0974-8369.1000395

  • 122.

    EidJFehrAGrayJLuongKLyleJOttoGet al. Real-time DNA sequencing from single polymerase molecules. Science (2009) 323:1338. 10.1016/j.yofte.2016.04.005

  • 123.

    BraslavskyIHebertBKartalovEQuakeSR. Sequence information can be obtained from single DNA molecules. Proc Natl Acad Sci USA (2003) 100:39604. 10.1073/pnas.0230489100

  • 124.

    HarrisTDBuzbyPRBabcockHBeerEBowersJBraslavskyIet al. Single-molecule DNA sequencing of a viral genome. Science (2008) 320:1069. 10.1126/science.1150427

  • 125.

    MikheyevASTinMMY. A first look at the Oxford Nanopore MinION sequencer. Mol Ecol Resour. (2014) 14:1097102. 10.1111/1755-0998.12324

  • 126.

    LaehnemannDBorkhardtAMcHardyAC. Denoising DNA deep sequencing data—high-throughput sequencing errors and their correction. Brief Bioinform. (2015) 17:15479. 10.1093/bib/bbv029

  • 127.

    CarterJMHussainS. Robust long-read native DNA sequencing using the ONT CsgG N1. Wellcome Open Res. (2017) 2:23. 10.12688/wellcomeopenres.11246.1

  • 128.

    LuHGiordanoFNingZ. Oxford Nanopore MinION sequencing and genome assembly. Genomics Proteomics Bioinformatics (2016) 14:26579. 10.1016/j.gpb.2016.05.004

  • 129.

    DohmJCLottazCBorodinaTHimmelbauerH. Substantial biases in ultra-short read data sets from high-throughput DNA sequencing. Nucleic Acids Res. (2008) 36:e105. 10.1093/nar/gkn425

  • 130.

    ReuterJASpacekDVSnyderMP. High-throughput sequencing technologies. Mol Cell (2015) 58:58697. 10.1016/j.molcel.2015.05.004

  • 131.

    EvansCCLePardKJKwakJWStancukasMCLaskowskiSDoughertyJet al. Exercise prevents weight gain and alters the gut microbiota in a mouse model of high fat diet-induced obesity. PLoS ONE (2014) 9:e92193. 10.1371/journal.pone.0092193

  • 132.

    LambethSMCarsonTLoweJRamarajTLeffJWLuoLet al. Composition, diversity and abundance of gut microbiome in prediabetes and type 2 diabetes. J Diabetes Obes. (2015) 2:1. 10.15436/2376-0949.15.031

  • 133.

    YasirMAngelakisEBibiFAzharEIBacharDLagierJCet al. Comparison of the gut microbiota of people in France and Saudi Arabia. Nutr Diabetes (2015) 5:e153. 10.1038/nutd.2015.3

  • 134.

    TraversKJChinC-SRankDREidJSTurnerSW. A flexible and efficient template format for circular consensus sequencing and SNP detection. Nucleic Acids Res. (2010) 38:e159. 10.1093/nar/gkq543

  • 135.

    CarneiroMORussCRossMGGabrielSBNusbaumCDePristoMA. Pacific biosciences sequencing technology for genotyping and variation discovery in human data. BMC Genomics (2012) 13:375. 10.1186/1471-2164-13-375

  • 136.

    RhoadsAAuKF. PacBio sequencing and its applications. Genomics Proteomics Bioinformatics (2015) 13:27889. 10.1016/j.gpb.2015.08.002

  • 137.

    AshtonPMNairSDallmanTRubinoSRabschWMwaigwisyaSet al. MinION nanopore sequencing identifies the position and structure of a bacterial antibiotic resistance island. Nat Biotechnol. (2015) 33:296. 10.1038/nbt.3103

  • 138.

    JainMFiddesITMigaKHOlsenHEPatenBAkesonM. Improved data analysis for the MinION nanopore sequencer. Nat Methods (2015) 12:3516. 10.1038/nmeth.3290

  • 139.

    WangJMooreNEDengY-MEcclesDAHallRJ. MinION nanopore sequencing of an influenza genome. Front Microbiol. (2015) 6:766. 10.3389/fmicb.2015.00766

  • 140.

    QuickJLomanNJDuraffourSSimpsonJTSeveriECowleyLet al. Real-time, portable genome sequencing for Ebola surveillance. Nature (2016) 530:228. 10.1038/nature16996

  • 141.

    BäckhedFFraserCMRingelYSandersMESartorRBShermanPMet al. Defining a healthy human gut microbiome: current concepts, future directions, and clinical applications. Cell Host Microbe (2012) 12:61122. 10.1016/j.chom.2012.10.012

  • 142.

    HollisterEBGaoCVersalovicJ. Compositional and functional features of the gastrointestinal microbiome and their effects on human health. Gastroenterology (2014) 146:144958. 10.1053/j.gastro.2014.01.052

  • 143.

    ClooneyAGFouhyFSleatorRDO'DriscollAStantonCCotterPDet al. Comparing apples and oranges? Next generation sequencing and its impact on microbiome analysis. PLoS ONE (2016) 11:e0148028. 10.1371/journal.pone.0148028

  • 144.

    OlivaresMNeefACastillejoGDePalma GVareaVCapillaAet al. The HLA-DQ2 genotype selects for early intestinal microbiota composition in infants at high risk of developing coeliac disease. Gut (2014) 64:40617. 10.1136/gutjnl-2014-306931

  • 145.

    SalonenALahtiLSalojärviJHoltropGKorpelaKDuncanSHet al. Impact of diet and individual variation on intestinal microbiota composition and fermentation products in obese men. ISME J. (2014) 8:2218. 10.1038/ismej.2014.63

  • 146.

    MaslowskiKMMackayCR. Diet, gut microbiota and immune responses. Nat Immunol. (2010) 12:5. 10.1038/ni0111-5

  • 147.

    KosticADXavierRJGeversD. The microbiome in inflammatory bowel disease: current status and the future ahead. Gastroenterology (2014) 146:148999. 10.1053/j.gastro.2014.02.009

  • 148.

    SchnablBBrennerDA. Interactions between the intestinal microbiome and liver diseases. Gastroenterology (2014) 146:151324. 10.1053/j.gastro.2014.01.020

  • 149.

    SeveranceEGYolkenRHEatonWW. Autoimmune diseases, gastrointestinal disorders and the microbiome in schizophrenia: more than a gut feeling. Schizophr Res. (2016) 176:2335. 10.1016/j.schres.2014.06.027

  • 150.

    WangYHoenigJDMalinKJQamarSPetrofEOSunJet al. 16S rRNA gene-based analysis of fecal microbiota from preterm infants with and without necrotizing enterocolitis. ISME J. (2009) 3:944. 10.1038/ismej.2009.37

  • 151.

    CaniPD. Gut microbiota and obesity: lessons from the microbiome. Brief Funct Genom. (2013) 12:3817. 10.1093/bfgp/elt014

  • 152.

    ClarkeSFMurphyEFNilaweeraKRossPRShanahanFCotterPWet al. The gut microbiota and its relationship to diet and obesity: new insights. Gut Microbes (2012) 3:117. 10.4161/gmic.20168

  • 153.

    CoxAJWestNPCrippsAW. Obesity, inflammation, and the gut microbiota. Lancet Diabetes Endocrinol. (2015) 3:20715. 10.1016/S2213-8587(14)70134-2

  • 154.

    LeyRETurnbaughPJKleinSGordonJI. Microbial ecology: human gut microbes associated with obesity. Nature (2006) 444:1022. 10.1038/4441022a

  • 155.

    SartorRB. Microbial influences in inflammatory bowel diseases. Gastroenterology (2008) 134:57794. 10.1053/j.gastro.2007.11.059

  • 156.

    LiuQDuanZPHaDKBengmarkSKurtovicJRiordanSM. Synbiotic modulation of gut flora: effect on minimal hepatic encephalopathy in patients with cirrhosis. Hepatology (2004) 39:14419. 10.1002/hep.20194

  • 157.

    ScanlanPDShanahanFCluneYCollinsJKO'sullivanGCO'riordanMet al. Culture-independent analysis of the gut microbiota in colorectal cancer and polyposis. Environ Microbiol. (2008) 10:78998. 10.1111/j.1462-2920.2007.01503.x

  • 158.

    VerhulstSLVaelCBeunckensCNelenVGoossensHDesagerK. A longitudinal analysis on the association between antibiotic use, intestinal microflora, and wheezing during the first year of life. J Asthma (2008) 45:82832. 10.1080/02770900802339734

  • 159.

    FinegoldSMMolitorisDSongYLiuCVaisanenM-LBolteEet al. Gastrointestinal microflora studies in late-onset autism. Clin Infect Dis. (2002) 35:S6S16. 10.1086/341914

  • 160.

    WenLLeyREVolchkovPYStrangesPBAvanesyanLStonebrakerACet al. Innate immunity and intestinal microbiota in the development of Type 1 diabetes. Nature (2008) 455:1109. 10.1038/nature07336

  • 161.

    RoberfroidMBBornetFBouleyCCummingsJH. Colonic microflora: nutrition and health0. summary and conclusions of an International Life Sciences Institute (ILSI)[Europe] Workshop held in Barcelona, Spain. Nutr Rev. (1995) 53:12730. 10.1111/j.1753-4887.1995.tb01535.x

  • 162.

    CashHLWhithamC VBehrendtCLHooperL V. Symbiotic bacteria direct expression of an intestinal bactericidal lectin. Science (2006) 313:112630. 10.1126/science.1127119

  • 163.

    HooperLVStappenbeckTSHongCVGordonJI. Angiogenins: a new class of microbicidal proteins involved in innate immunity. Nat Immunol. (2003) 4:269. 10.1038/ni888

  • 164.

    SchauberJSvanholmCTermenSIfflandKMenzelTScheppachWet al. Expression of the cathelicidin LL-37 is modulated by short chain fatty acids in colonocytes: relevance of signalling pathways. Gut (2003) 52:73541. 10.1136/gut.52.5.735

  • 165.

    BouskraDBrézillonCBérardMWertsCVaronaRBonecaIGet al. Lymphoid tissue genesis induced by commensals through NOD1 regulates intestinal homeostasis. Nature (2008) 456:507. 10.1038/nature07450

  • 166.

    Rakoff-NahoumSMedzhitovR. Innate immune recognition of the indigenous microbial flora. Mucosal Immunol. (2008) 1:S10. 10.1038/mi.2008.49

  • 167.

    MacphersonAJHarrisNL. Interactions between commensal intestinal bacteria and the immune system. Nat Rev Immunol (2004) 4:478. 10.1038/nri1373

  • 168.

    SekirovIRussellSLAntunesLCMFinlayBB. Gut microbiota in health and disease. Physiol Rev. (2010) 90:859904. 10.1152/physrev.00045.2009

  • 169.

    JohnsonELHeaverSLWaltersWALeyRE. Microbiome and metabolic disease: revisiting the bacterial phylum Bacteroidetes. J Mol Med. (2017) 95:18. 10.1007/s00109-016-1492-2

  • 170.

    YiuJHCDorweilerBWooCW. Interaction between gut microbiota and toll-like receptor: from immunity to metabolism. J Mol Med. (2017) 95:1320. 10.1007/s00109-016-1474-4

  • 171.

    WehkampJFrickJ-S. Microbiome and chronic inflammatory bowel diseases. J Mol Med. (2017) 95:218. 10.1007/s00109-016-1495-z

  • 172.

    LeeHUMcPhersonZETanBKoreckaAPetterssonS. Host-microbiome interactions: the aryl hydrocarbon receptor and the central nervous system. J Mol Med. (2017) 95:2939. 10.1007/s00109-016-1486-0

  • 173.

    BrantonDDeamerDWMarzialiABayleyHBennerSAButlerTet al. The potential and challenges of nanopore sequencing. Nat Biotechnol. (2008) 26:1146. 10.1038/nbt.1495

  • 174.

    LadoukakisEKolisisFNChatziioannouAA. Integrative workflows for metagenomic analysis. Front Cell Dev Biol. (2014) 2:70. 10.3389/fcell.2014.00070

  • 175.

    AndrewsS.FastQC: a Quality Control Tool for High Throughput Sequence Data. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc

  • 176.

    GordonAHannonGJ. Fastx-Toolkit. FASTQ/A Short-Reads Pre-processing Tools (2010). Available online at: http//hannonlabcshledu/fastx_toolkit

  • 177.

    SchmiederREdwardsR. Quality control and preprocessing of metagenomic datasets. Bioinformatics (2011) 27:8634. 10.1093/bioinformatics/btr026

  • 178.

    PatelRKJainM. NGS QC Toolkit: a toolkit for quality control of next generation sequencing data. PLoS ONE (2012) 7:e30619. 10.1371/journal.pone.0030619

  • 179.

    ZhouQSuXJingGNingK. Meta-QC-Chain: comprehensive and fast quality control method for metagenomic data. Genomics Proteomics Bioinformatics (2014) 12:526. 10.1016/j.gpb.2014.01.002

  • 180.

    SchlossPDWestcottSLRyabinTHallJRHartmannMHollisterEBet al. Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl Environ Microbiol. (2009) 75:753741. 10.1128/AEM.01541-09

  • 181.

    CaporasoJGKuczynskiJStombaughJBittingerKBushmanFDCostelloEKet al. QIIME allows analysis of high-throughput community sequencing data. Nat Methods (2010) 7:335. 10.1038/nmeth.f.303

  • 182.

    KrauseLDiazNNGoesmannAKelleySNattkemperTWRohwerFet al. Phylogenetic classification of short environmental DNA fragments. Nucleic Acids Res. (2008) 36:22309. 10.1093/nar/gkn038

  • 183.

    LangilleMGIZaneveldJCaporasoJGMcDonaldDKnightsDReyesJAet al. Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences. Nat Biotechnol. (2013) 31:814. 10.1038/nbt.2676

  • 184.

    TeelingHWaldmannJLombardotTBauerMGlöcknerFO. TETRA: a web-service and a stand-alone program for the analysis and comparison of tetranucleotide usage patterns in DNA sequences. BMC Bioinformatics (2004) 5:163. 10.1186/1471-2105-5-163

  • 185.

    McHardyACMartínHGTsirigosAHugenholtzPRigoutsosI. Accurate phylogenetic classification of variable-length DNA fragments. Nat Methods (2007) 4:63. 10.1038/nmeth976

  • 186.

    KultimaJRSunagawaSLiJChenWChenHMendeDRet al. MOCAT: a metagenomics assembly and gene prediction toolkit. PLoS ONE (2012) 7:e47656. 10.1371/journal.pone.0047656

  • 187.

    SuXPanWSongBXuJNingK. Parallel-META 2.0: enhanced metagenomic data analysis with functional annotation, high performance computing and advanced visualization. PLoS ONE (2014) 9:e89323. 10.1371/journal.pone.0089323

  • 188.

    WangYLeungHCMYiuSMChinFYL. MetaCluster-TA: taxonomic annotation for metagenomic data based on assembly-assisted binning. in BMC Genomics15(Suppl. 1):S12. 10.1186/1471-2164-15-S1-S12

  • 189.

    WuY-WTangY-HTringeSGSimmonsBASingerSW. MaxBin: an automated binning method to recover individual genomes from metagenomes using an expectation-maximization algorithm. Microbiome (2014) 2:26. 10.1186/2049-2618-2-26

  • 190.

    WuMEisenJA. A simple, fast, and accurate method of phylogenomic inference. Genome Biol. (2008) 9:R151. 10.1186/gb-2008-9-10-r151

  • 191.

    LiHDurbinR. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics (2009) 25:175460. 10.1093/bioinformatics/btp324

  • 192.

    LangmeadBSalzbergSL. Fast gapped-read alignment with Bowtie 2. Nat Methods (2012) 9:357. 10.1038/nmeth.1923

  • 193.

    DavenportCFTümmlerB. Advances in computational analysis of metagenome sequences. Environ Microbiol. (2013) 15:15. 10.1111/j.1462-2920.2012.02843.x

  • 194.

    MonzoorulHaque MGhoshTSKomanduriDMandeSS. SOrt-ITEMS: sequence orthology based approach for improved taxonomic estimation of metagenomic sequences. Bioinformatics (2009) 25:172230. 10.1093/bioinformatics/btp317

  • 195.

    GhoshTSHaqueMMandeSS. DiScRIBinATE: a rapid method for accurate taxonomic classification of metagenomic sequences. BMC Bioinformatics11(Suppl. 7):S14. 10.1186/1471-2105-11-S7-S14

  • 196.

    PengYLeungHCMYiuS-MChinFYL. IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth. Bioinformatics (2012) 28:14208. 10.1093/bioinformatics/bts174

  • 197.

    NamikiTHachiyaTTanakaHSakakibaraY. MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads. Nucleic Acids Res. (2012) 40:e155. 10.1093/nar/gks678

  • 198.

    BoisvertSRaymondFGodzaridisÉLavioletteFCorbeilJ. Ray Meta: scalable de novo metagenome assembly and profiling. Genome Biol. (2012) 13:R122. 10.1186/gb-2012-13-12-r122

  • 199.

    ZhuWLomsadzeABorodovskyM. Ab initio gene identification in metagenomic sequences. Nucleic Acids Res. (2010) 38:e132. 10.1093/nar/gkq275

  • 200.

    KelleyDRLiuBDelcherALPopMSalzbergSL. Gene prediction with Glimmer for metagenomic sequences augmented by classification and clustering. Nucleic Acids Res. (2011) 40:e9. 10.1093/nar/gkr1067

  • 201.

    RhoMTangHYeY. FragGeneScan: predicting genes in short and error-prone reads. Nucleic Acids Res. (2010) 38:e191. 10.1093/nar/gkq747

  • 202.

    LiWGodzikA. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics (2006) 22:16589. 10.1093/bioinformatics/btl158

  • 203.

    EddySR. Accelerated profile HMM searches. PLoS Comput Biol. (2011) 7:e1002195. 10.1371/journal.pcbi.1002195

  • 204.

    YeYTangH. An ORFome assembly approach to metagenomics sequences analysis. J Bioinform Comput Biol. (2009) 7:45571. 10.1142/S0219720009004151

  • 205.

    YeYDoakTG. A parsimony approach to biological pathway reconstruction/inference for genomes and metagenomes. PLoS Comput Biol. (2009) 5:e1000465. 10.1371/journal.pcbi.1000465

  • 206.

    LiuBPopM. MetaPath: identifying differentially abundant metabolic pathways in metagenomic datasets. BMC Proc.5(Suppl. 2):S9. 10.1186/1753-6561-5-S2-S9

  • 207.

    KanehisaMSatoYMorishimaK. BlastKOALA and GhostKOALA: KEGG tools for functional characterization of genome and metagenome sequences. J Mol Biol. (2016) 428:72631. 10.1016/j.jmb.2015.11.006

  • 208.

    LiW. Analysis and comparison of very large metagenomes with fast clustering and functional annotation. BMC Bioinformatics (2009) 10:359. 10.1186/1471-2105-10-359

  • 209.

    GhoshTSMohammedMHKomanduriDMandeSS. ProViDE: a software tool for accurate estimation of viral diversity in metagenomic samples. Bioinformation (2011) 6:91. 10.6026/97320630006091

  • 210.

    McMurdiePJHolmesS. phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data. PLoS ONE (2013) 8:e61217. 10.1371/journal.pone.0061217

  • 211.

    PaulsonJNStineOCBravoHCPopM. Differential abundance analysis for microbial marker-gene surveys. Nat Methods (2013) 10:1200. 10.1038/nmeth.2658

  • 212.

    KristianssonEHugenholtzPDaleviD. ShotgunFunctionalizeR: an R-package for functional comparison of metagenomes. Bioinformatics (2009) 25:27378. 10.1093/bioinformatics/btp508

  • 213.

    GoecksJNekrutenkoATaylorJ. Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol. (2010) 11:R86. 10.1186/gb-2010-11-8-r86

  • 214.

    MeyerFPaarmannDD'SouzaMOlsonRGlassEMKubalMet al. The metagenomics RAST server–a public resource for the automatic phylogenetic and functional analysis of metagenomes. BMC Bioinformatics (2008) 9:386. 10.1186/1471-2105-9-386

  • 215.

    MarkowitzVMChenI-MAChuKSzetoEPalaniappanKGrechkinYet al. IMG/M: the integrated metagenome data management and comparative analysis system. Nucleic Acids Res. (2011) 40:D1239. 10.1093/nar/gkr975

  • 216.

    SeshadriRKravitzSASmarrLGilnaPFrazierM. CAMERA: a community resource for metagenomics. PLoS Biol. (2007) 5:e75. 10.1371/journal.pbio.0050075

  • 217.

    TruongDTTettAPasolliEHuttenhowerCSegataN. Microbial strain-level population structure and genetic diversity from metagenomes. Genome Res. (2017) 27:62638. 10.1101/gr.216242.116

  • 218.

    ScholzMWardDVPasolliETolioTZolfoMAsnicarFet al. Strain-level microbial epidemiology and population genomics from shotgun metagenomics. Nat Methods (2016) 13:435. 10.1038/nmeth.3802

Summary

Keywords

microbes, human microbiome, host-microbe interactions, metagenomics, next generation sequencing, bioinformatics, dysbiosis, diseases

Citation

Malla MA, Dubey A, Kumar A, Yadav S, Hashem A and Abd_Allah EF (2019) Exploring the Human Microbiome: The Potential Future Role of Next-Generation Sequencing in Disease Diagnosis and Treatment. Front. Immunol. 9:2868. doi: 10.3389/fimmu.2018.02868

Received

04 July 2018

Accepted

21 November 2018

Published

07 January 2019

Volume

9 - 2018

Edited by

Ashutosh K. Mangalam, University of Iowa, United States

Reviewed by

Dimitry N. Krementsov, University of Vermont, United States; Rajesh Kumar Mondal, National Institute of Research in Tuberculosis (ICMR), India

Updates

Copyright

*Correspondence: Ashwani Kumar

This article was submitted to Mucosal Immunity, a section of the journal Frontiers in Immunology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics