Next Generation Quantitative Genetics in Plants

Jiménez-Gómez, José  M

doi:10.3389/fpls.2011.00077

REVIEW article

Front. Plant Sci., 15 November 2011

Sec. Plant Physiology

Volume 2 - 2011 | https://doi.org/10.3389/fpls.2011.00077

Next Generation Quantitative Genetics in Plants

JM
José M. Jiménez-Gómez ^*

Department of Plant Breeding and Genetics, Max Planck Institute for Plant Breeding Research Köln, Germany

Abstract

Most characteristics in living organisms show continuous variation, which suggests that they are controlled by multiple genes. Quantitative trait loci (QTL) analysis can identify the genes underlying continuous traits by establishing associations between genetic markers and observed phenotypic variation in a segregating population. The new high-throughput sequencing (HTS) technologies greatly facilitate QTL analysis by providing genetic markers at genome-wide resolution in any species without previous knowledge of its genome. In addition HTS serves to quantify molecular phenotypes, which aids to identify the loci responsible for QTLs and to understand the mechanisms underlying diversity. The constant improvements in price, experimental protocols, computational pipelines, and statistical frameworks are making feasible the use of HTS for any research group interested in quantitative genetics. In this review I discuss the application of HTS for molecular marker discovery, population genotyping, and expression profiling in QTL analysis.

Introduction

For almost one century scientists have dissected the genetic architecture of quantitative traits in plants using Quantitative trait loci (QTL) analysis (Fisher, 1918). These analyses establish associations between genetic markers and the phenotypic variation of a quantitative trait in a segregating population. The techniques used to obtain markers and physiological phenotypes have been constantly improved through history (Schlotterer, 2004; Montes et al., 2007). Recently, the price drop of high-throughput technologies have allowed plant researchers to quantify the general abundance of transcripts, proteins, or metabolites in segregating populations (Kirst et al., 2005; Vuylsteke et al., 2005, 2006; Decook et al., 2006; Keurentjes et al., 2007; West et al., 2007; Lisec et al., 2008; Potokina et al., 2008; Drost et al., 2010). These studies show that there are multiple benefits in using “omic” technologies for QTL analyses, even when the goal is to characterize physiological phenotypic diversity. First, molecular phenotypes are the initial step toward the production of physiological phenotypes and its regulation underlies much of phenotypic diversity (Hoekstra and Coyne, 2007; Stern and Orgogozo, 2008). Second, the availability of genome-wide information significantly increases the ability to identify candidate genes for QTLs (Jimenez-Gomez et al., 2010). Third, molecular traits measured at system scale allow estimation of the effect of QTLs in the genetic pathways of interest, or identification of additional gene networks altered by the loci responsible for the variation (Kliebenstein et al., 2006). Finally, molecular traits offer researchers a better understanding of how mutation drives physiological variation and what are the evolutionary forces acting at primary levels.

High-throughput sequencing, or HTS, allows the rapid and cost–effective generation of massive amounts of short sequences or reads (Metzker, 2010). The potential of this technology for mapping loci responsible for phenotypic differences in plants has already been demonstrated by identifying genes containing EMS-induced mutations in samples of pooled F2 individuals (Schneeberger et al., 2009; Austin et al., 2011). HTS technologies have been in the market for a few years, and new methods are being developed that will be cheaper, require less sample processing, and will produce more and longer reads (Munroe and Harris, 2010; Glenn, 2011; Niedringhaus et al., 2011). It is therefore clear that very soon HTS will be the tool of choice for QTL analyses. One important limiting factor remains to be eliminated: Data analysis. It requires long and computationally intensive pipelines that need to be customized for each particular experimental set up. An increasing number of new algorithms are constantly released to the community, and the debate on which pipelines return the most accurate results is still ongoing. Comparing, combining, and customizing these pipelines requires simple Unix or Linux commands and greatly benefits from knowledge in powerful statistical software such as R, and in scripting languages, such as Perl or Python (R Development Core Team, 2009). For non-bioinformaticians, integrated solutions with convenient interfaces are becoming popular both from collaborative open projects and companies (Blankenberg et al., 2010; Goecks et al., 2010). A popular website that keeps an actualized list of the available software tools is www.seqanswers.com, where users and developers also discuss new technological advances and pipelines. In terms of the computational equipment required for HTS data analysis, the majority of tools are developed for Linux or Unix based systems. Although parts of the analysis can be performed in any modern computer, machines with dozens of gigabytes of RAM are recommended in cases where reference sequences form the species considered are available, or with hundreds if no reference exists. An alternative option that is likely to become popular is to rent storage and computing power in specialized centers, or “the cloud” (Stein, 2010).

Due to the fast improvement of HTS, this review intends only to capture a snapshot in time of the possibilities that it offers for molecular marker discovery, genotyping, and molecular phenotyping in segregating populations of plants. This review has the purpose of helping researchers who have not incorporated this technology to their work to think about the requirements and possibilities of HTS. By no means this review refers to all available experimental designs or analysis tools, and the solutions proposed here are mere suggestions that will certainly soon be substituted by new and better ones. A guide map of the methods proposed in this review is depicted in Figure 1.

Figure 1

Library Preparation

Sample preparation protocols are continuously improved to use fewer amounts of biological material, be completed faster, and reduce the bias in their output. As an example, most current protocols allow multiplexing samples by adding a short sequence tag to all reads in a library, a convenient feature given the increasing numbers of reads produced per HTS run. The same companies that developed the HTS sequencers commercialize library preparation protocols optimized for the most common experimental designs. There are also kits from other companies that give comparable results and may be more cost efficient. Finally, many researchers are developing custom protocols to obtain specific information such as the transcribed strand in RNA-seq experiments, the rate of RNA degradation, or the positions occupied by RNA polymerases, just to name a few (Addo-Quaye et al., 2008; Core et al., 2008; German et al., 2008; Parkhomchuk et al., 2009).

Quality Control and Pre-Processing

Assessing the quality of HTS reads includes detection of biases on base composition, base quality, and sample complexity. The quality of the sequences has an impact on the reliability of the biological interpretations resulting from the analysis (Dohm et al., 2008). Part of these biases are introduced by the sample preparation protocols (Schwartz et al., 2011), particularly during cDNA synthesis in RNA-seq experiments (Hansen et al., 2010; Li et al., 2010b) and PCR amplification (Aird et al., 2011). Additional biases are particular to each HTS technology (Smith et al., 2008; Quince et al., 2011) or specific to each run of the sequencers (Auer and Doerge, 2010).

After quality control it is usually necessary to pre-process the reads by trimming low quality nucleotides and adapter sequences. At this stage, foreign sequences such as vectors or DNA from organisms contaminating the samples can also be removed. Depending of the type of libraries sequenced further pre-processing may be needed, such as trimming poly A or poly T tails and terminal transferase tails in RNA-seq libraries. In cases where several libraries have been multiplexed, reads should be separated by their barcode.

Both quality control and pre-processing can be easily performed with basic scripts written in Perl (Bioperl), R (Bioconductor), or Python (Biopython; Stajich et al., 2002; Gentleman et al., 2004; Cock et al., 2009; R Development Core Team, 2009). For non-programmers, there are some convenient tools that can carry out all or some of these tasks (FastQC, 2008; FASTX-Toolkit, 2009; Blankenberg et al., 2010; Falgueras et al., 2010; Goecks et al., 2010; Cutadapt, 2010; Schmieder et al., 2010; Schmieder and Edwards, 2011).

Molecular Marker Discovery

Depending on the availability of a reference sequence short reads will be aligned or de novo assembled using one of the multiple tools available. There are a number of recent articles that compare the most popular algorithms and software available for these purposes (Bao et al., 2011; Lin et al., 2011; Ruffalo et al., 2011). Please note that the methods proposed below are directed to developing molecular markers for QTL analysis and not to identify the mutation underlying the QTL, which requires much deeper sequencing.

With a reference sequence

A cost efficient solution to obtain molecular markers is to sequence DNA or RNA from the parental genotypes and mine polymorphisms from the resulting reads. These polymorphisms can be used later to design PCR markers or a high-throughput genotyping assay for the full population. This approach works remarkably well in diploid and polyploidy species using as low an amount of sequence as 5× coverage, meaning five times the size of the genome under study (Ossowski et al., 2008; Gore et al., 2009; Trick et al., 2009; Lai et al., 2010; Lam et al., 2010; Arai-Kichise et al., 2011; Geraldes et al., 2011). A recent article reviews the methods and tools available for single nucleotide polymorphism (SNP) identification and genotyping (Nielsen et al., 2011). To align the reads to the reference, mapping softwares based in “seed methods” are preferred despite their slower nature because their robustness to polymorphisms. Before SNP calling users may consider removal of the reads that map to multiple locations in the reference, and of duplicated reads that may have been generated from PCR artifacts. A recent pipeline also recalibrates the quality of the nucleotides in the reads to correct for the high error rates in HTS, and realigns reads in complex genomic positions where the fast processing alignment algorithms may have failed (Depristo et al., 2011). Commonly used indicators of the veracity of polymorphisms are based in the amount and quality of reads showing the polymorphism, frequency of the observed alleles, quality of the alignment, and/or proximity to other polymorphisms. There are some basic and popular options for calling polymorphisms from aligned reads (Li et al., 2009a,b; Depristo et al., 2011), tools specialized in the analysis of reads from particular sequencing platforms (Souaiaia et al., 2011), that have the ability to detect structural variation (Chen et al., 2009; Hormozdiari et al., 2009, 2010), or that have into account the quality of the reference in addition to the quality of the reads (Frohler and Dieterich, 2010). An essential method to control for the quality of the data analysis process is visual inspection through genome viewers specialized in HTS datasets (Huang and Marth, 2008; Bao et al., 2009; Milne et al., 2010; Robinson et al., 2011).

Without a reference sequence

High-throughput sequencing sequences can serve to construct the necessary reference to identify molecular markers if it is not already available. Although assembling de novo a complete genome sequence is possible with HTS, it requires very deep sequencing and extensive bioinformatic analysis, even more given the relatively large size of most plant genomes. A more efficient option is sequencing mRNA, which greatly reduces sample complexity in comparison with genome sequencing and has the advantage of offering functional information such as coding polymorphisms or expression levels (Graham et al., 2010; Mizrachi et al., 2010; Bancroft et al., 2011; Everett et al., 2011; Garg et al., 2011; Guo et al., 2011; Ibarra-Laclette et al., 2011; Ness et al., 2011; Su et al., 2011; Wei et al., 2011). A comprehensive compilation of the methods and tools available for transcriptome assembly has been recently published (Martin and Wang, 2011). De novo assembly algorithms greatly benefit from long and paired-end reads, but are extremely sensitive to errors and polymorphisms and will not perform well during assembly of datasets from mixed genotypes or highly heterozygous individuals. The amount of new genomic positions detected in RNA-seq experiments decrease exponentially as the number of reads increases (Figure 2). The majority of medium and highly expressed transcripts in a sample are detected at low coverage, and increasing coverage will mainly add non-coding RNAs and low expressed transcripts at a very high cost (Tarazona et al., 2011). If the objective is to assemble complete transcriptomes, obtaining samples from diverse tissues, time points, and conditions is preferred to depth of sequencing. Even in the best possible conditions assemblies from RNA-seq reads will return only a subset of the existing transcripts, many of which will be fragmented. This is expected due to low expression of particular transcripts, the non-uniform read coverage, and the presence of different isoforms per gene. To help assembly of low expressed transcripts researchers can use normalization protocols that deplete the most abundant transcripts from the samples (Christodoulou et al., 2011). In any case, contigs resulting from de novo assembly can be effectively used as a reference for molecular marker detection and characterization of transcripts in un-sequenced genomes (Parchman et al., 2010; Wang et al., 2010e; Angeloni et al., 2011; Hiremath et al., 2011; Kaur et al., 2011).

Figure 2

When highly similar genotypes are compared, RNA-seq may not be the best option since it mostly targets coding regions, which are less diverse than non-coding regions. In these cases researchers can construct reduced representation libraries by shearing DNA using restriction endonucleases and size-selecting the fragments that will be sequenced. Reads from these libraries can be clustered by similarity and mined for polymorphisms close to the restriction sites; or used to detect the presence–absence of particular tags, indicating a polymorphism in the restriction site itself (Kerstens et al., 2009; Sanchez et al., 2009; Etter et al., 2011). Obtaining polymorphisms from reduced representation libraries is more efficient when a reference sequence is available (Van Tassell et al., 2008; Wu et al., 2010). However, researchers have already developed tools to genotype samples from these tags using a low number of reads from organisms without a reference (Ratan et al., 2010), or to reconstruct part of the targeted genome using paired-end sequencing (Willing et al., 2011). Additional protocols to obtain markers from reduced representation libraries exist in which different combination of restriction enzymes are used for each of the genotypes involved (Hyten et al., 2010), or that do not shear the DNA but filter the reads for single copy sequences (You et al., 2011). The amount of reads necessary to perform this type of analysis depends on the size of the genome, the restriction enzymes used, and the availability of a reference.

Genotyping Populations

With the price drop of the HTS technologies and the possibility of multiplexing samples, genotyping an entire population has become realistic (Schneeberger and Weigel, 2011). In the case of a sequenced system such as rice, generating reads from the individuals of a population at 0.02–0.055× coverage allowed high-density genotyping by comparisons with the parental genotypes (Huang et al., 2009), or by inferring the parental genotypes from the polymorphisms found in the population (Xie et al., 2010). Since erroneous polymorphism calls are expected at low coverage, more or less complex algorithms need to be defined to correctly genotype each polymorphism in each individual (Huang et al., 2009; Xie et al., 2010; Li et al., 2011). In addition, a reference sequence can serve researchers to design enrichment essays that will target their preferred genomic locations, although at high cost (Blow, 2009; Mamanova et al., 2010; Nijman et al., 2010; Kenny et al., 2011). For species where a genome sequence is not available, a very practical approach is to sequence reduced representation libraries as mentioned above (Baird et al., 2008; Emerson et al., 2010b; Hohenlohe et al., 2010, 2011).

Molecular Phenotyping

The list of molecular phenotypes that can be quantified with HTS is extensive and is rapidly increasing (Hawkins et al., 2010). Examples of these phenotypes are protein–RNA interactions (Licatalosi et al., 2008; Hafner et al., 2010), translation rates (Ingolia et al., 2009; Ingolia, 2010), transcription rates (Core et al., 2008; Churchman and Weissman, 2011), protein–DNA interactions (Albert et al., 2007; Barski et al., 2007; Johnson et al., 2007; Mikkelsen et al., 2007; Robertson et al., 2007; Chen et al., 2008; Hesselberth et al., 2009), RNA degradation rates (Addo-Quaye et al., 2008; German et al., 2008), RNA secondary structure (Kertesz et al., 2010; Underwood et al., 2010), transcription start positions (Plessy et al., 2010), chromatin accessibility (Boyle et al., 2008), methylation states (Cokus et al., 2008; Down et al., 2008; Lister et al., 2008; Meissner et al., 2008), natural antisense transcription (Cloonan et al., 2008; Core et al., 2008; He et al., 2008; Armour et al., 2009; Parkhomchuk et al., 2009) or small RNA profiles (Lu et al., 2005). QTL analysis using these phenotypes as traits is an exciting field that remains un-explored. Therefore, the computational frameworks to quantitatively compare these phenotypes between individuals will need to be established.

Expression profiling with HTS

Although many cases of phenotypic variation caused by coding polymorphisms have been documented, variation in gene expression has been shown to underlie much of phenotypic diversity (Reviewed in Hoekstra and Coyne, 2007; Wray, 2007; Stern and Orgogozo, 2008). One method to detect differences in expression between individuals using HTS is to sequence 26–27 nucleotide-long tags from expressed transcripts (Matsumura et al., 2010; Hong et al., 2011). A recent study shows that this method reaches saturation in mice with 6–8 million reads per sample (Hong et al., 2011). Its advantages over sequencing full transcripts are the lower cost, higher sensitivity, reduced bias during amplification due to the fixed fragment lengths, and use of simplified statistical models to calculate differential expression. On the other hand, methods based in tags will not detect the majority of coding polymorphisms and isoforms, and require a close enough reference sequence to extract biologically meaningful results.

RNA-seq is rapidly becoming a standard in expression profiling because of its simple protocol of preparation, digital nature, large dynamic range, and high sensitivity in comparison with previous technologies (Marioni et al., 2008; Bradford et al., 2010; Liu et al., 2010). In addition, it can serve to genotype individuals, identify novel transcripts, characterize alternative splicing, and quantify allele specific expression (Reviewed in Wang et al., 2009; Costa et al., 2010; Marguerat and Bahler, 2010). Due to the novelty of the technique there is no consensus on which sample preparation protocols present fewer biases (Raz et al., 2011). However, strand-specific methods could become a standard because of their increased precision due to their ability to distinguish between sense and antisense transcripts (He et al., 2008; Levin et al., 2010). In terms of experimental designs, it is necessary to randomize and replicate biological samples, as with any other type of genome-wide analysis (Auer and Doerge, 2010; Fang and Cui, 2011; Hansen et al., 2011). There is little consensus about the depth of sequence needed for expression profiling with RNA-seq. Recent estimates range between 30 million reads to compare the expression profiles of two samples, to 100 million reads to detect most transcribed genes and quantify isoforms, to 500 million to obtain accurate profiles, including low expressed transcripts (Zhang et al., 2010; ENCODE, 2011; Toung et al., 2011). In any case, it is advisable to balance the number of reads between samples in the same experiment in order to perform accurate expression comparisons (Tarazona et al., 2011).

Expression profiling from HTS datasets is necessarily based on counting the reads mapped to each transcript in a reference sequence. When a reference genome or transcriptome is not available, it can be reconstructed using de novo assembly of the reads for at least one of the genotypes as described above. The simpler and less computational intensive protocol for expression profiling is to map the RNA-seq reads to known (or de novo assembled) transcripts and a set of possible exon–exon junctions (when available) to detect alternative splicing. However, in organisms with sequenced genomes this protocol will not allow detection of novel exons, transcripts, and isoforms. The preferred pipeline involves aligning the reads to the genomic reference using an alignment tool that splices the reads to detect intron–exon junctions (For example Trapnell et al., 2009; Ameur et al., 2010; Au et al., 2010; Guttman et al., 2010; Wang et al., 2010b; Lou et al., 2011).

A challenge for expression analyses in samples from two unrelated individuals is the need to perform robust quantification of reads generated from two or more alleles. This implies that reads with the closer genotype to the reference will align better than reads from a more distant genotype, in which more polymorphisms may interfere with their ability to map (Fontanillas et al., 2010). In these cases, aligners based in seed methods will perform better than those based in the Burrows–Wheeler Transform algorithm (For a review see Garber et al., 2011). Although most studies ignore this problem, there are solutions that go from identifying and removing the polymorphisms that cause these biases (Degner et al., 2009), aligning the reads to all references from the genotypes involved (Bullard et al., 2010a) or including the polymorphisms found in the references (Gan et al., 2011). When two references are used, a potential problem may arise from motifs that are more abundant in one reference with respect to the other if only uniquely mapped reads are counted. The use of longer reads and/or paired-end reads greatly decreases the number of ambiguously mapped reads. In addition, there are robust methods to assign these multi-mapped reads to a single location (Faulkner et al., 2008; Mortazavi et al., 2008; Hashimoto et al., 2009; Li et al., 2010a; Wang et al., 2010a; Ji et al., 2011).

There are a number of tools to count the number of reads aligned to each transcriptional unit to calculate expression, most of which require knowledge of Perl, Phyton, Linux/Unix, or R (Carlson et al., 2009; Bio::DB::Sam, 2009; Anders, 2010; Morgan and Pagès, 2010; Quinlan and Hall, 2010). Some alignment tools can directly calculate the number of reads per transcript and/or a measure of expression based in the reads (or fragments) per gene size in kilobases per million reads mapped, called RPKM (or FPKM; Mortazavi et al., 2008; Trapnell et al., 2010). However, these expression units show biases depending on the length, number, abundance of the transcripts present in the samples, or because of technical replication (Oshlack and Wakefield, 2009; Bullard et al., 2010b; Mcintyre et al., 2011). For this reason researchers have developed dedicated R/Bioconductor packages to calculate differential expression between samples based on raw read counts per transcript (Anders and Huber, 2010; Bullard et al., 2010b; Hardcastle and Kelly, 2010; Robinson et al., 2010; Wang et al., 2010c). In addition, there are software packages that take into consideration the biases inherent to RNA-seq when calculating expression or performing downstream analyses such as gene ontology over-representation studies (Young et al., 2010; Zheng et al., 2011).

High-throughput sequencing datasets allow quantification of expression for each isoform separately, resulting in significantly more accurate estimates than calculating expression at the gene level (Wang et al., 2010d). For this, users must first identify splicing events from the reads that align to exon–exon junctions. Quantifying isoform expression is complicated since most reads in an alternatively spliced transcript cannot be assigned to a single isoform. The most promising methods to address this complex problem take advantage from the information offered by paired-end and/or unambiguously mapped reads (Guttman et al., 2010; Katz et al., 2010; Li et al., 2010a; Trapnell et al., 2010; Nicolae et al., 2011). One advantage of going through the intricate process of identification of alternative splicing is that it can also be used as a trait for QTL analysis (Li et al., 2010c; Montgomery et al., 2010; Pickrell et al., 2010; Lalonde et al., 2011).

Allele specific expression in hybrids

An alternative to sequencing a full segregating population to perform eQTL analyses is to sequence F1 hybrid individuals, where allele specific expression can be calculated for loci with coding polymorphisms (Babak et al., 2008, 2010; Bullard et al., 2010a; Emerson et al., 2010a; Mcmanus et al., 2010; Pickrell et al., 2010). For any gene, both alleles in the hybrid share the same cellular environment and, as a result, changes in expression between alleles must necessarily be due to cis-acting regulators (Cowles et al., 2002). Trans-acting eQTLs can be inferred by performing RNA-seq in the parentals and comparing the differences in expression levels between alleles in the hybrid with the differences between the parentals (Wittkopp et al., 2004). Despite the considerable reduction in price and simplicity of experimental design, this method has several drawbacks. Allele specific expression can only be calculated in transcripts with coding polymorphisms that are highly covered, and it is very dependent on read and transcript length (Degner et al., 2009; Fontanillas et al., 2010). New statistical approaches are being developed that will overcome these limitations, starting by being able to estimate false discovery rates and allele specific alternative splicing (Skelly et al., 2011).

In summary, HTS is changing the way we perform QTL analysis by allowing high-throughput genotyping of populations and phenotyping of traits with a precision not achievable before. It is clear that HTS has not reached its peak of development, and that tools and algorithms will have to be modified according to the new technological improvements. Nevertheless, the first experiments using this technology have already identified exciting possibilities for the characterization of natural variation in plants.

Statements

Conflict of interest

The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

1
Addo-QuayeC.EshooT. W.BartelD. P.AxtellM. J. (2008). Endogenous siRNA and miRNA targets identified by sequencing of the Arabidopsis degradome. Curr. Biol.18, 758–762.10.1016/j.cub.2008.04.042
2
AirdD.RossM. G.ChenW. S.DanielssonM.FennellT.RussC.JaffeD. B.NusbaumC.GnirkeA. (2011). Analyzing and minimizing PCR amplification bias in Illumina sequencing libraries. Genome Biol.12, R18.10.1186/1465-6906-12-S1-I18
3
AlbertI.MavrichT. N.TomshoL. P.QiJ.ZantonS. J.SchusterS. C.PughB. F. (2007). Translational and rotational settings of H2A.Z nucleosomes across the Saccharomyces cerevisiae genome. Nature446, 572–576.10.1038/nature05632
4
AmeurA.WetterbomA.FeukL.GyllenstenU. (2010). Global and unbiased detection of splice junctions from RNA-seq data. Genome Biol.11, R34.10.1186/gb-2010-11-3-r34
5
AndersS. (2010). HTSeq: Analysing High-Throughput Sequencing Data With Python. Available at: http://www-huber.embl.de/users/anders/HTSeq/doc/overview.html#author
- Google Scholar
6
AndersS.HuberW. (2010). Differential expression analysis for sequence count data. Genome Biol.11, R106.10.1186/gb-2010-11-10-r106
7
AngeloniF.WagemakerC. A.JettenM. S.Op Den CampH. J.Janssen-MegensE. M.FrancoijsK. J.StunnenbergH. G.OuborgN. J. (2011). De novo transcriptome characterization and development of genomic tools for Scabiosa columbaria L. using next-generation sequencing techniques. Mol. Ecol. Resour.11, 662–674.10.1111/j.1755-0998.2011.02990.x
8
Arai-KichiseY.ShiwaY.NagasakiH.EbanaK.YoshikawaH.YanoM.WakasaK. (2011). Discovery of genome-wide DNA polymorphisms in a landrace cultivar of Japonica rice by whole-genome sequencing. Plant Cell Physiol.52, 274–282.10.1093/pcp/pcr003
9
ArmourC. D.CastleJ. C.ChenR.BabakT.LoerchP.JacksonS.ShahJ. K.DeyJ.RohlC. A.JohnsonJ. M.RaymondC. K. (2009). Digital transcriptome profiling using selective hexamer priming for cDNA synthesis. Nat. Methods6, 647–649.10.1038/nmeth.1360
10
AuK. F.JiangH.LinL.XingY.WongW. H. (2010). Detection of splice junctions from paired-end RNA-seq data by SpliceMap. Nucleic Acids Res.38, 4570–4578.10.1093/nar/gkq211
11
AuerP. L.DoergeR. W. (2010). Statistical design and analysis of RNA sequencing data. Genetics185, 405–416.10.1534/genetics.110.114983
12
AustinR. S.VidaurreD.StamatiouG.BreitR.ProvartN. J.BonettaD.ZhangJ.FungP.GongY.WangP. W.MccourtP.GuttmanD. S. (2011). Next-generation mapping of Arabidopsis genes. Plant J.67, 715–725.10.1111/j.1365-313X.2011.04619.x
13
BabakT.DevealeB.ArmourC.RaymondC.ClearyM. A.Van Der KooyD.JohnsonJ. M.LimL. P. (2008). Global survey of genomic imprinting by transcriptome sequencing. Curr. Biol.18, 1735–1741.10.1016/j.cub.2008.09.044
14
BabakT.Garrett-EngeleP.ArmourC. D.RaymondC. K.KellerM. P.ChenR.RohlC. A.JohnsonJ. M.AttieA. D.FraserH. B.SchadtE. E. (2010). Genetic validation of whole-transcriptome sequencing for mapping expression affected by cis-regulatory variation. BMC Genomics11, 473.10.1186/1471-2164-11-473
15
BairdN. A.EtterP. D.AtwoodT. S.CurreyM. C.ShiverA. L.LewisZ. A.SelkerE. U.CreskoW. A.JohnsonE. A. (2008). Rapid SNP discovery and genetic mapping using sequenced RAD markers. PLoS ONE3, e3376.10.1371/journal.pone.0003376
16
BancroftI.MorganC.FraserF.HigginsJ.WellsR.ClissoldL.BakerD.LongY.MengJ.WangX.LiuS.TrickM. (2011). Dissecting the genome of the polyploid crop oilseed rape by transcriptome sequencing. Nat. Biotechnol.29, 762–766.10.1038/nbt.1926
17
BaoH.GuoH.WangJ.ZhouR.LuX.ShiS. (2009). MapView: visualization of short reads alignment on a desktop computer. Bioinformatics25, 1554–1555.10.1093/bioinformatics/btp255
18
BaoS.JiangR.KwanW.WangB.MaX.SongY. Q. (2011). Evaluation of next-generation sequencing software in mapping and assembly. J. Hum. Genet.56, 406–414.10.1038/jhg.2011.43
19
BarskiA.CuddapahS.CuiK.RohT. Y.SchonesD. E.WangZ.WeiG.ChepelevI.ZhaoK. (2007). High-resolution profiling of histone methylations in the human genome. Cell129, 823–837.10.1016/j.cell.2007.05.009
20
Bio::DB::Sam. (2009). Available at: http://search.cpan.org/∼lds/Bio-SamTools/lib/Bio/DB/Bam/Alignment.pm
- Google Scholar
21
BlankenbergD.GordonA.Von KusterG.CoraorN.TaylorJ.NekrutenkoA. (2010). Manipulation of FASTQ data with galaxy. Bioinformatics26, 1783–1785.10.1093/bioinformatics/btq281
22
BlowN. (2009). Genomics: catch me if you can. Nat. Methods6, 539–544.10.1038/nmeth0609-465
- CrossRef
- Google Scholar
23
BoyleA. P.DavisS.ShulhaH. P.MeltzerP.MarguliesE. H.WengZ.FureyT. S.CrawfordG. E. (2008). High-resolution mapping and characterization of open chromatin across the genome. Cell132, 311–322.10.1016/j.cell.2007.12.014
24
BradfordJ. R.HeyY.YatesT.LiY.PepperS. D.MillerC. J. (2010). A comparison of massively parallel nucleotide sequencing with oligonucleotide microarrays for global transcription profiling. BMC Genomics11, 282.10.1186/1471-2164-11-282
25
BullardJ. H.MostovoyY.DudoitS.BremR. B. (2010a). Polygenic and directional regulatory evolution across pathways in Saccharomyces. Proc. Natl. Acad. Sci. U.S.A.107, 5058–5063.10.1073/pnas.0912959107
- CrossRef
- Google Scholar
26
BullardJ. H.PurdomE.HansenK. D.DudoitS. (2010b). Evaluation of statistical methods for normalization and differential expression in mRNA-seq experiments. BMC Bioinformatics11, 94.10.1186/1471-2105-11-94
- CrossRef
- Google Scholar
27
CarlsonM.PagesH.AboyounP.FalconS.MorganM.SarkarD.LawrenceM. (2009). Genomic Features: Tools for Making and Manipulating Transcript Centric Annotations. Available at: http://www.bioconductor.org/packages/2.6/bioc/html/GenomicFeatures.html
- Google Scholar
28
ChenK.WallisJ. W.MclellanM. D.LarsonD. E.KalickiJ. M.PohlC. S.McgrathS. D.WendlM. C.ZhangQ.LockeD. P.ShiX.FultonR. S.LeyT. J.WilsonR. K.DingL.MardisE. R. (2009). BreakDancer: an algorithm for high-resolution mapping of genomic structural variation. Nat. Methods6, 677–681.10.1038/nmeth.1363
29
ChenX.XuH.YuanP.FangF.HussM.VegaV. B.WongE.OrlovY. L.ZhangW.JiangJ.LohY. H.YeoH. C.YeoZ. X.NarangV.GovindarajanK. R.LeongB.ShahabA.RuanY.BourqueG.SungW. K.ClarkeN. D.WeiC. L.NgH. H. (2008). Integration of external signaling pathways with the core transcriptional network in embryonic stem cells. Cell133, 1106–1117.10.1016/j.cell.2008.04.043
30
ChristodoulouD. C.GorhamJ. M.HermanD. S.SeidmanJ. (2011). Construction of normalized RNA-seq libraries for next-generation sequencing using the crab duplex-specific nuclease. Curr. Protoc. Mol. Biol.94, 4.12.1–4.12.11.
- Google Scholar
31
ChurchmanL. S.WeissmanJ. S. (2011). Nascent transcript sequencing visualizes transcription at nucleotide resolution. Nature469, 368–373.10.1038/nature09652
32
CloonanN.ForrestA. R.KolleG.GardinerB. B.FaulknerG. J.BrownM. K.TaylorD. F.SteptoeA. L.WaniS.BethelG.RobertsonA. J.PerkinsA. C.BruceS. J.LeeC. C.RanadeS. S.PeckhamH. E.ManningJ. M.MckernanK. J.GrimmondS. M. (2008). Stem cell transcriptome profiling via massive-scale mRNA sequencing. Nat. Methods5, 613–619.10.1038/nmeth.1223
33
CockP. J.AntaoT.ChangJ. T.ChapmanB. A.CoxC. J.DalkeA.FriedbergI.HamelryckT.KauffF.WilczynskiB.De HoonM. J. (2009). Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics25, 1422–1423.10.1093/bioinformatics/btp163
34
CokusS. J.FengS.ZhangX.ChenZ.MerrimanB.HaudenschildC. D.PradhanS.NelsonS. F.PellegriniM.JacobsenS. E. (2008). Shotgun bisulphite sequencing of the Arabidopsis genome reveals DNA methylation patterning. Nature452, 215–219.10.1038/nature06745
35
CoreL. J.WaterfallJ. J.LisJ. T. (2008). Nascent RNA sequencing reveals widespread pausing and divergent initiation at human promoters. Science322, 1845–1848.10.1126/science.1162228
36
CostaV.AngeliniC.De FeisI.CiccodicolaA. (2010). Uncovering the complexity of transcriptomes with RNA-seq. J. Biomed. Biotechnol.2010, 853916.10.1155/2010/853916
37
CowlesC. R.HirschhornJ. N.AltshulerD.LanderE. S. (2002). Detection of regulatory variation in mouse genes. Nat. Genet.32, 432–437.10.1038/ng992
38
Cutadapt. (2010). A Tool That Removes Adapter Sequences From DNA Sequencing Reads. Available at: http://code.google.com/p/cutadapt/
- Google Scholar
39
DecookR.LallS.NettletonD.HowellS. H. (2006). Genetic regulation of gene expression during shoot development in Arabidopsis. Genetics172, 1155–1164.10.1534/genetics.105.042275
40
DegnerJ. F.MarioniJ. C.PaiA. A.PickrellJ. K.NkadoriE.GiladY.PritchardJ. K. (2009). Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data. Bioinformatics25, 3207–3212.10.1093/bioinformatics/btp579
41
DepristoM. A.BanksE.PoplinR.GarimellaK. V.MaguireJ. R.HartlC.PhilippakisA. A.Del AngelG.RivasM. A.HannaM.MckennaA.FennellT. J.KernytskyA. M.SivachenkoA. Y.CibulskisK.GabrielS. B.AltshulerD.DalyM. J. (2011). A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet.43, 491–498.10.1038/ng.806
42
DohmJ. C.LottazC.BorodinaT.HimmelbauerH. (2008). Substantial biases in ultra-short read data sets from high-throughput DNA sequencing. Nucleic Acids Res.36, e105.10.1093/nar/gkn425
43
DownT. A.RakyanV. K.TurnerD. J.FlicekP.LiH.KuleshaE.GrafS.JohnsonN.HerreroJ.TomazouE. M.ThorneN. P.BackdahlL.HerberthM.HoweK. L.JacksonD. K.MirettiM. M.MarioniJ. C.BirneyE.HubbardT. J.DurbinR.TavareS.BeckS. (2008). A Bayesian deconvolution strategy for immunoprecipitation-based DNA methylome analysis. Nat. Biotechnol.26, 779–785.10.1038/nbt1414
44
DrostD. R.BenedictC. I.BergA.NovaesE.NovaesC. R.YuQ.DervinisC.MaiaJ. M.YapJ.MilesB.KirstM. (2010). Diversification in the genetic architecture of gene expression and transcriptional networks in organ differentiation of Populus. Proc. Natl. Acad. Sci. U.S.A.107, 8492–8497.10.1073/pnas.0914709107
45
EmersonJ. J.HsiehL. C.SungH. M.WangT. Y.HuangC. J.LuH. H.LuM. Y.WuS. H.LiW. H. (2010a). Natural selection on cis and trans regulation in yeasts. Genome Res.20, 826–836.10.1101/gr.101576.109
- CrossRef
- Google Scholar
46
EmersonK. J.MerzC. R.CatchenJ. M.HohenloheP. A.CreskoW. A.BradshawW. E.HolzapfelC. M. (2010b). Resolving postglacial phylogeography using high-throughput sequencing. Proc. Natl. Acad. Sci. U.S.A.107, 16196–16200.10.1073/pnas.1006538107
- CrossRef
- Google Scholar
47
ENCODE. (2011). Standards, guidelines and best practices for RNA-seq. V1.0.
- Google Scholar
48
EtterP. D.PrestonJ. L.BasshamS.CreskoW. A.JohnsonE. A. (2011). Local de novo assembly of RAD paired-end contigs using short sequencing reads. PLoS ONE6, e18561.10.1371/journal.pone.0018561
49
EverettM. V.GrauE. D.SeebJ. E. (2011). Short reads and nonmodel species: exploring the complexities of next-generation sequence assembly and SNP discovery in the absence of a reference genome. Mol. Ecol. Resour.11(Suppl. 1), 93–108.10.1111/j.1755-0998.2010.02969.x
50
FalguerasJ.LaraA. J.Fernandez-PozoN.CantonF. R.Perez-TrabadoG.ClarosM. G. (2010). SeqTrim: a high-throughput pipeline for pre-processing any type of sequence read. BMC Bioinformatics11, 38.10.1186/1471-2105-11-38
51
FangZ.CuiX. (2011). Design and validation issues in RNA-seq experiments. Brief. Bioinform.12, 280–287.10.1093/bib/bbr004
52
FastQC. (2008). Available at: http://www.bioinformatics.bbsrc.ac.uk/projects/fastqc/
- Google Scholar
53
FASTX-Toolkit. (2009). Available at: http://hannonlab.cshl.edu/fastx_toolkit/index.html
- Google Scholar
54
FaulknerG. J.ForrestA. R.ChalkA. M.SchroderK.HayashizakiY.CarninciP.HumeD. A.GrimmondS. M. (2008). A rescue strategy for multi mapping short sequence tags refines surveys of transcriptional activity by CAGE. Genomics91, 281–288.10.1016/j.ygeno.2007.11.003
55
FisherR. A. (1918). The correlation between relatives on the supposition of Mendelian inheritance. Philos. Trans. R. Soc. Edinb.52, 399–433.
- Google Scholar
56
FontanillasP.LandryC. R.WittkoppP. J.RussC.GruberJ. D.NusbaumC.HartlD. L. (2010). Key considerations for measuring allelic expression on a genomic scale using high-throughput sequencing. Mol. Ecol.19(Suppl. 1), 212–227.10.1111/j.1365-294X.2010.04472.x
57
FrohlerS.DieterichC. (2010). ACCUSA – accurate SNP calling on draft genomes. Bioinformatics26, 1364–1365.10.1093/bioinformatics/btq138
58
GanX.StegleO.BehrJ.SteffenJ. G.DreweP.HildebrandK. L.LyngsoeR.SchultheissS. J.OsborneE. J.SreedharanV. T.KahlesA.BohnertR.JeanG.DerwentP.KerseyP.BelfieldE. J.HarberdN. P.KemenE.ToomajianC.KoverP. X.ClarkR. M.RatschG.MottR. (2011). Multiple reference genomes and transcriptomes for Arabidopsis thaliana. Nature477, 419–423.10.1038/nature10414
59
GarberM.GrabherrM. G.GuttmanM.TrapnellC. (2011). Computational methods for transcriptome annotation and quantification using RNA-seq. Nat. Methods8, 469–477.10.1038/nmeth.1613
60
GargR.PatelR. K.JhanwarS.PriyaP.BhattacharjeeA.YadavG.BhatiaS.ChattopadhyayD.TyagiA. K.JainM. (2011). Gene discovery and tissue-specific transcriptome analysis in chickpea with massively parallel pyrosequencing and web resource development. Plant Physiol.156, 1661–1678.10.1104/pp.111.178616
61
GentlemanR. C.CareyV. J.BatesD. M.BolstadB.DettlingM.DudoitS.EllisB.GautierL.GeY.GentryJ.HornikK.HothornT.HuberW.IacusS.IrizarryR.LeischF.LiC.MaechlerM.RossiniA. J.SawitzkiG.SmithC.SmythG.TierneyL.YangJ. Y.ZhangJ. (2004). Bioconductor: open software development for computational biology and bioinformatics. Genome Biol.5, R80.10.1186/gb-2004-5-10-r80
62
GeraldesA.PangJ.ThiessenN.CezardT.MooreR.ZhaoY.TamA.WangS.FriedmannM.BirolI.JonesS. J.CronkQ. C.DouglasC. J. (2011). SNP discovery in black cottonwood (Populus trichocarpa) by population transcriptome resequencing. Mol. Ecol. Resour.11(Suppl. 1), 81–92.10.1111/j.1755-0998.2010.02960.x
63
GermanM. A.PillayM.JeongD. H.HetawalA.LuoS.JanardhananP.KannanV.RymarquisL. A.NobutaK.GermanR.De PaoliE.LuC.SchrothG.MeyersB. C.GreenP. J. (2008). Global identification of microRNA-target RNA pairs by parallel analysis of RNA ends. Nat. Biotechnol.26, 941–946.10.1038/nbt1417
64
GlennT. C. (2011). Field guide to next-generation DNA sequencers. Mol. Ecol. Resour.11, 759–769.10.1111/j.1755-0998.2011.03024.x
65
GoecksJ.NekrutenkoA.TaylorJ. (2010). Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol.11, R86.10.1186/gb-2010-11-8-r86
66
GoreM. A.ChiaJ. M.ElshireR. J.SunQ.ErsozE. S.HurwitzB. L.PeifferJ. A.McmullenM. D.GrillsG. S.Ross-IbarraJ.WareD. H.BucklerE. S. (2009). A first-generation haplotype map of maize. Science326, 1115–1117.10.1126/science.1177837
67
GrahamI. A.BesserK.BlumerS.BraniganC. A.CzechowskiT.EliasL.GutermanI.HarveyD.IsaacP. G.KhanA. M.LarsonT. R.LiY.PawsonT.PenfieldT.RaeA. M.RathboneD. A.ReidS.RossJ.SmallwoodM. F.SeguraV.TownsendT.VyasD.WinzerT.BowlesD. (2010). The genetic map of Artemisia annua L. identifies loci affecting yield of the antimalarial drug artemisinin. Science327, 328–331.10.1126/science.1182612
68
GuoS.LiuJ.ZhengY.HuangM.ZhangH.GongG.HeH.RenY.ZhongS.FeiZ.XuY. (2011). Characterization of transcriptome dynamics during watermelon fruit development: sequencing, assembly, annotation and gene expression profiles. BMC Genomics12, 454.10.1186/1471-2164-12-71
69
GuttmanM.GarberM.LevinJ. Z.DonagheyJ.RobinsonJ.AdiconisX.FanL.KoziolM. J.GnirkeA.NusbaumC.RinnJ. L.LanderE. S.RegevA. (2010). Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs. Nat. Biotechnol.28, 503–510.10.1038/nbt0710-756b
70
HafnerM.LandthalerM.BurgerL.KhorshidM.HausserJ.BerningerP.RothballerA.AscanoM.Jr.JungkampA. C.MunschauerM.UlrichA.WardleG. S.DewellS.ZavolanM.TuschlT. (2010). Transcriptome-wide identification of RNA-binding protein and microRNA target sites by PAR-CLIP. Cell141, 129–141.10.1016/j.cell.2010.03.009
71
HansenK. D.BrennerS. E.DudoitS. (2010). Biases in Illumina transcriptome sequencing caused by random hexamer priming. Nucleic Acids Res.38, e131.10.1093/nar/gkp1195
72
HansenK. D.ZhijinW.IrizarryR. A.LeekJ. T. (2011). Sequencing technology does not eliminate biological variability. Nat. Biotechnol.29, 575–573.10.1038/nbt.1910
- CrossRef
- Google Scholar
73
HardcastleT. J.KellyK. A. (2010). baySeq: empirical Bayesian methods for identifying differential expression in sequence count data. BMC Bioinformatics11, 422.10.1186/1471-2105-11-422
74
HashimotoT.De HoonM. J.GrimmondS. M.DaubC. O.HayashizakiY.FaulknerG. J. (2009). Probabilistic resolution of multi-mapping reads in massively parallel sequencing data using MuMRescueLite. Bioinformatics25, 2613–2614.10.1093/bioinformatics/btp438
75
HawkinsR. D.HonG. C.RenB. (2010). Next-generation genomics: an integrative approach. Nat. Rev. Genet.11, 476–486.
- Pubmed Abstract
- Google Scholar
76
HeY.VogelsteinB.VelculescuV. E.PapadopoulosN.KinzlerK. W. (2008). The antisense transcriptomes of human cells. Science322, 1855–1857.10.1126/science.1163853
77
HesselberthJ. R.ChenX.ZhangZ.SaboP. J.SandstromR.ReynoldsA. P.ThurmanR. E.NephS.KuehnM. S.NobleW. S.FieldsS.StamatoyannopoulosJ. A. (2009). Global mapping of protein-DNA interactions in vivo by digital genomic footprinting. Nat. Methods6, 283–289.10.1038/nmeth.1313
78
HiremathP. J.FarmerA.CannonS. B.WoodwardJ.KudapaH.TutejaR.KumarA.BhanuprakashA.MulaosmanovicB.GujariaN.KrishnamurthyL.GaurP. M.KavikishorP. B.ShahT.SrinivasanR.LohseM.XiaoY.TownC. D.CookD. R.MayG. D.VarshneyR. K. (2011). Large-scale transcriptome analysis in chickpea (Cicer arietinum L.), an orphan legume crop of the semi-arid tropics of Asia and Africa. Plant Biotechnol. J.9, 922–931.10.1111/j.1467-7652.2011.00625.x
79
HoekstraH. E.CoyneJ. A. (2007). The locus of evolution: evo devo and the genetics of adaptation. Evolution61, 995–1016.10.1111/j.1558-5646.2007.00105.x
80
HohenloheP. A.AmishS. J.CatchenJ. M.AllendorfF. W.LuikartG. (2011). Next-generation RAD sequencing identifies thousands of SNPs for assessing hybridization between rainbow and west slope cutthroat trout. Mol. Ecol. Resour.11(Suppl. 1), 117–122.10.1111/j.1755-0998.2010.02967.x
81
HohenloheP. A.BasshamS.EtterP. D.StifflerN.JohnsonE. A.CreskoW. A. (2010). Population genomics of parallel adaptation in threespine stickleback using sequenced RAD tags. PLoS Genet.6, e1000862.10.1371/journal.pgen.1000862
82
HongL. Z.LiJ.Schmidt-KuntzelA.WarrenW. C.BarshG. S. (2011). Digital gene expression for non-model organisms. Genome Res. [Epub ahead of print].10.1101/gr.122135.111
83
HormozdiariF.AlkanC.EichlerE. E.SahinalpS. C. (2009). Combinatorial algorithms for structural variation detection in high-throughput sequenced genomes. Genome Res.19, 1270–1278.10.1101/gr.088633.108
84
HormozdiariF.HajirasoulihaI.DaoP.HachF.YorukogluD.AlkanC.EichlerE. E.SahinalpS. C. (2010). Next-generation VariationHunter: combinatorial algorithms for transposon insertion discovery. Bioinformatics26, i350–i357.10.1093/bioinformatics/btq216
85
HuangW.MarthG. (2008). EagleView: a genome assembly viewer for next-generation sequencing technologies. Genome Res.18, 1538–1543.10.1101/gr.069674.107
86
HuangX.FengQ.QianQ.ZhaoQ.WangL.WangA.GuanJ.FanD.WengQ.HuangT.DongG.SangT.HanB. (2009). High-throughput genotyping by whole-genome resequencing. Genome Res.19, 1068–1076.10.1101/gr.089516.108
87
HytenD. L.SongQ.FickusE. W.QuigleyC. V.LimJ. S.ChoiI. Y.HwangE. Y.Pastor-CorralesM.CreganP. B. (2010). High-throughput SNP discovery and assay development in common bean. BMC Genomics11, 475.10.1186/1471-2164-11-475
88
Ibarra-LacletteE.AlbertV. A.Perez-TorresC. A.Zamudio-HernandezF.Ortega-Estrada MdeJ.Herrera-EstrellaA.Herrera-EstrellaL. (2011). Transcriptomics and molecular evolutionary rate analysis of the bladderwort (Utricularia), a carnivorous plant with a minimal genome. BMC Plant Biol.11, 101.10.1186/1471-2229-11-101
89
IngoliaN. T. (2010). Genome-wide translational profiling by ribosome footprinting. Meth. Enzymol.470, 119–142.10.1016/S0076-6879(10)70006-9
90
IngoliaN. T.GhaemmaghamiS.NewmanJ. R.WeissmanJ. S. (2009). Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science324, 218–223.10.1126/science.1168978
91
JiY.XuY.ZhangQ.TsuiK.-W.YuanY.NorrisC.Jr.LiangS.LiangH. (2011). BM-map: Bayesian mapping of multireads for next-generation sequencing data. Biometrics. [Epub ahead of print].10.1111/j.1541-0420.2011.01605.x
- CrossRef
- Google Scholar
92
Jimenez-GomezJ. M.WallaceA. D.MaloofJ. N. (2010). Network analysis identifies ELF3 as a QTL for the shade avoidance response in Arabidopsis. PLoS Genet.6, e1001100.10.1371/journal.pgen.1001100
- CrossRef
- Google Scholar
93
JohnsonD. S.MortazaviA.MyersR. M.WoldB. (2007). Genome-wide mapping of in vivo protein-DNA interactions. Science316, 1497–1502.10.1126/science.1141319
94
KatzY.WangE. T.AiroldiE. M.BurgeC. B. (2010). Analysis and design of RNA sequencing experiments for identifying isoform regulation. Nat. Methods7, 1009–1015.10.1038/nmeth.1528
95
KaurS.CoganN. O.PembletonL. W.ShinozukaM.SavinK. W.MaterneM.ForsterJ. W. (2011). Transcriptome sequencing of lentil based on second-generation technology permits large-scale unigene assembly and SSR marker discovery. BMC Genomics12, 265.10.1186/1471-2164-12-265
96
KennyE. M.CormicanP.GilksW. P.GatesA. S.O’dushlaineC. T.PintoC.CorvinA. P.GillM.MorrisD. W. (2011). Multiplex target enrichment using DNA indexing for ultra-high throughput SNP detection. DNA Res.18, 31–38.10.1093/dnares/dsq029
97
KerstensH.CrooijmansR.VeenendaalA.DibbitsB.Chin-a-WoengT.Den DunnenJ.GroenenM. (2009). Large scale single nucleotide polymorphism discovery in unsequenced genomes using second generation high throughput sequencing technology: applied to turkey. BMC Genomics10, 479.10.1186/1471-2164-10-479
98
KerteszM.WanY.MazorE.RinnJ. L.NutterR. C.ChangH. Y.SegalE. (2010). Genome-wide measurement of RNA secondary structure in yeast. Nature467, 103–107.10.1038/nature09322
99
KeurentjesJ. J. B.FuJ.TerpstraI. R.GarciaJ. M.Van Den AckervekenG.SnoekL. B.PeetersA. J. M.VreugdenhilD.KoornneefM.JansenR. C. (2007). Regulatory network construction in Arabidopsis by using genome-wide gene expression quantitative trait loci. Proc. Natl. Acad. Sci. U.S.A.104, 1708–1713.10.1073/pnas.0610429104
100
KirstM.BastenC. J.MyburgA. A.ZengZ. B.SederoffR. R. (2005). Genetic architecture of transcript-level variation in differentiating xylem of a eucalyptus hybrid. Genetics169, 2295–2303.10.1534/genetics.104.039198
101
KliebensteinD. J.WestM. A.Van LeeuwenH.LoudetO.DoergeR. W.St ClairD. A. (2006). Identification of QTLs controlling gene expression networks defined a priori. BMC Bioinformatics7, 308.10.1186/1471-2105-7-308
102
LaiJ.LiR.XuX.JinW.XuM.ZhaoH.XiangZ.SongW.YingK.ZhangM.JiaoY.NiP.ZhangJ.LiD.GuoX.YeK.JianM.WangB.ZhengH.LiangH.ZhangX.WangS.ChenS.LiJ.FuY.SpringerN. M.YangH.WangJ.DaiJ.SchnableP. S. (2010). Genome-wide patterns of genetic variation among elite maize inbred lines. Nat. Genet.42, 1027–1030.10.1038/ng.684
103
LalondeE.HaK. C.WangZ.BemmoA.KleinmanC. L.KwanT.PastinenT.MajewskiJ. (2011). RNA sequencing reveals the role of splicing polymorphisms in regulating human gene expression. Genome Res.21, 545–554.10.1101/gr.111211.110
104
LamH. M.XuX.LiuX.ChenW.YangG.WongF. L.LiM. W.HeW.QinN.WangB.LiJ.JianM.WangJ.ShaoG.SunS. S.ZhangG. (2010). Resequencing of 31 wild and cultivated soybean genomes identifies patterns of genetic diversity and selection. Nat. Genet.42, 1053–1059.10.1038/ng.715
105
LevinJ. Z.YassourM.AdiconisX.NusbaumC.ThompsonD. A.FriedmanN.GnirkeA.RegevA. (2010). Comprehensive comparative analysis of strand-specific RNA sequencing methods. Nat. Methods7, 709–715.10.1038/nmeth.1491
106
LiB.RuottiV.StewartR. M.ThomsonJ. A.DeweyC. N. (2010a). RNA-seq gene expression estimation with read mapping uncertainty. Bioinformatics26, 493–500.10.1093/bioinformatics/btq222
- CrossRef
- Google Scholar
107
LiJ.JiangH.WongW. H. (2010b). Modeling non-uniformity in short-read rates in RNA-seq data. Genome Biol.11, R50.10.1186/gb-2010-11-2-r22
- CrossRef
- Google Scholar
108
LiY.BreitlingR.SnoekL. B.Van Der VeldeK. J.SwertzM. A.RiksenJ.JansenR. C.KammengaJ. E. (2010c). Global genetic robustness of the alternative splicing machinery in Caenorhabditis elegans. Genetics186, 405–410.10.1534/genetics.110.119677
- CrossRef
- Google Scholar
109
LiH.HandsakerB.WysokerA.FennellT.RuanJ.HomerN.MarthG.AbecasisG.DurbinR. (2009a). The sequence alignment/map format and SAM tools. Bioinformatics25, 2078–2079.10.1093/bioinformatics/btp100
- CrossRef
- Google Scholar
110
LiR.LiY.FangX.YangH.WangJ.KristiansenK. (2009b). SNP detection for massively parallel whole-genome resequencing. Genome Res.19, 1124–1132.10.1101/gr.092213.109
- CrossRef
- Google Scholar
111
LiY.SidoreC.KangH. M.BoehnkeM.AbecasisG. R. (2011). Low-coverage sequencing: implications for design of complex trait association studies. Genome Res.21, 940–951.10.1101/gr.117259.110
112
LicatalosiD. D.MeleA.FakJ. J.UleJ.KayikciM.ChiS. W.ClarkT. A.SchweitzerA. C.BlumeJ. E.WangX.DarnellJ. C.DarnellR. B. (2008). HITS-CLIP yields genome-wide insights into brain alternative RNA processing. Nature456, 464–469.10.1038/nature07488
113
LinY.LiJ.ShenH.ZhangL.PapasianC. J.DengH. W. (2011). Comparative studies of de novo assembly tools for next-generation sequencing technologies. Bioinformatics27, 2031–2037.10.1093/bioinformatics/btr319
114
LisecJ.MeyerR. C.SteinfathM.RedestigH.BecherM.Witucka-WallH.FiehnO.TorjekO.SelbigJ.AltmannT.WillmitzerL. (2008). Identification of metabolic and biomass QTL in Arabidopsis thaliana in a parallel analysis of RIL and IL populations. Plant J.53, 960–972.10.1111/j.1365-313X.2007.03383.x
115
ListerR.O’malleyR. C.Tonti-FilippiniJ.GregoryB. D.BerryC. C.MillarA. H.EckerJ. R. (2008). Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell133, 523–536.10.1016/j.cell.2008.03.029
116
LiuS.LinL.JiangP.WangD.XingY. (2010). A comparison of RNA-seq and high-density exon array for detecting differential gene expression between closely related species. Nucleic Acids Res.39, 578–588.10.1093/nar/gkq817
117
LouS. K.NiB.LoL. Y.TsuiS. K.ChanT. F.LeungK. S. (2011). ABMapper: a suffix array-based tool for multi-location searching and splice-junction mapping. Bioinformatics27, 421–422.10.1093/bioinformatics/btq656
118
LuC.TejS. S.LuoS.HaudenschildC. D.MeyersB. C.GreenP. J. (2005). Elucidation of the small RNA component of the transcriptome. Science309, 1567–1569.10.1126/science.1113435
119
MamanovaL.CoffeyA. J.ScottC. E.KozarewaI.TurnerE. H.KumarA.HowardE.ShendureJ.TurnerD. J. (2010). Target-enrichment strategies for next-generation sequencing. Nat. Methods7, 111–118.10.1038/nmeth0610-479c
120
MargueratS.BahlerJ. (2010). RNA-seq: from technology to biology. Cell. Mol. Life Sci.67, 569–579.10.1007/s00018-009-0180-6
121
MarioniJ. C.MasonC. E.ManeS. M.StephensM.GiladY. (2008). RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res.18, 1509–1517.10.1101/gr.079558.108
122
MartinJ. A.WangZ. (2011). Next-generation transcriptome assembly. Nat. Rev. Genet.12, 671–682.10.1038/nrg3068
123
MatsumuraH.YoshidaK.LuoS.KimuraE.FujibeT.AlbertynZ.BarreroR. A.KrugerD. H.KahlG.SchrothG. P.TerauchiR. (2010). High-throughput SuperSAGE for digital gene expression analysis of multiple samples using next generation sequencing. PLoS ONE5, e12010.10.1371/journal.pone.0012010
124
McintyreL. M.LopianoK. K.MorseA. M.AminV.ObergA. L.YoungL. J.NuzhdinS. V. (2011). RNA-seq: technical variability and sampling. BMC Genomics12, 293.10.1186/1471-2164-12-293
125
McmanusC. J.CoolonJ. D.DuffM. O.Eipper-MainsJ.GraveleyB. R.WittkoppP. J. (2010). Regulatory divergence in Drosophila revealed by mRNA-seq. Genome Res.20, 816–825.
- Google Scholar
126
MeissnerA.MikkelsenT. S.GuH.WernigM.HannaJ.SivachenkoA.ZhangX.BernsteinB. E.NusbaumC.JaffeD. B.GnirkeA.JaenischR.LanderE. S. (2008). Genome-scale DNA methylation maps of pluripotent and differentiated cells. Nature454, 766–770.
- Pubmed Abstract
- Google Scholar
127
MetzkerM. L. (2010). Sequencing technologies – the next generation. Nat. Rev. Genet.11, 31–46.10.1038/nrg2626
128
MikkelsenT. S.KuM.JaffeD. B.IssacB.LiebermanE.GiannoukosG.AlvarezP.BrockmanW.KimT. K.KocheR. P.LeeW.MendenhallE.O’donovanA.PresserA.RussC.XieX.MeissnerA.WernigM.JaenischR.NusbaumC.LanderE. S.BernsteinB. E. (2007). Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature448, 553–560.10.1038/nature06008
129
MilneI.BayerM.CardleL.ShawP.StephenG.WrightF.MarshallD. (2010). Tablet – next generation sequence assembly visualization. Bioinformatics26, 401–402.10.1093/bioinformatics/btp666
130
MizrachiE.HeferC. A.RanikM.JoubertF.MyburgA. A. (2010). De novo assembled expressed gene catalog of a fast-growing Eucalyptus tree produced by Illumina mRNA-seq. BMC Genomics11, 681.10.1186/1471-2164-11-681
131
MontesJ. M.MelchingerA. E.ReifJ. C. (2007). Novel throughput phenotyping platforms in plant genetic studies. Trends Plant Sci.12, 433–436.10.1016/j.tplants.2007.08.006
132
MontgomeryS. B.SammethM.Gutierrez-ArcelusM.LachR. P.IngleC.NisbettJ.GuigoR.DermitzakisE. T. (2010). Transcriptome genetics using second generation sequencing in a Caucasian population. Nature464, 773–777.10.1038/nature08903
133
MorganM.PagèsH. (2010). Rsamtools: Import Aligned BAM File Format Sequences Into R/Bioconductor. Available at: http://bioconductor.org/packages/release/bioc/html/Rsamtools.html
- Google Scholar
134
MortazaviA.WilliamsB. A.MccueK.SchaefferL.WoldB. (2008). Mapping and quantifying mammalian transcriptomes by RNA-seq. Nat. Methods5, 621–628.10.1038/nmeth.1226
135
MunroeD. J.HarrisT. J. (2010). Third-generation sequencing fireworks at Marco Island. Nat. Biotechnol.28, 426–428.10.1038/nbt0510-426
136
NessR. W.SiolM.BarrettS. C. (2011). De novo sequence assembly and characterization of the floral transcriptome in cross- and self-fertilizing plants. BMC Genomics12, 298.10.1186/1471-2164-12-298
137
NicolaeM.MangulS.MandoiuI. I.ZelikovskyA. (2011). Estimation of alternative splicing isoform frequencies from RNA-seq data. Algorithms Mol. Biol.6, 9.10.1186/1748-7188-6-9
138
NiedringhausT. P.MilanovaD.KerbyM. B.SnyderM. P.BarronA. E. (2011). Landscape of next-generation sequencing technologies. Anal. Chem.83, 4327–4341.10.1021/ac2010857
139
NielsenR.PaulJ. S.AlbrechtsenA.SongY. S. (2011). Genotype and SNP calling from next-generation sequencing data. Nat. Rev. Genet.12, 443–451.10.1038/nrg2986
140
NijmanI. J.MokryM.Van BoxtelR.ToonenP.De BruijnE.CuppenE. (2010). Mutation discovery by targeted genomic enrichment of multiplexed barcoded samples. Nat. Methods7, 913–915.10.1038/nmeth.1516
141
OshlackA.WakefieldM. J. (2009). Transcript length bias in RNA-seq data confounds systems biology. Biol. Direct4, 14.10.1186/1745-6150-4-14
142
OssowskiS.SchneebergerK.ClarkR. M.LanzC.WarthmannN.WeigelD. (2008). Sequencing of natural strains of Arabidopsis thaliana with short reads. Genome Res.18, 2024–2033.10.1101/gr.080200.108
143
ParchmanT. L.GeistK. S.GrahnenJ. A.BenkmanC. W.BuerkleC. A. (2010). Transcriptome sequencing in an ecologically important tree species: assembly, annotation, and marker discovery. BMC Genomics11, 180.10.1186/1471-2164-11-180
144
ParkhomchukD.BorodinaT.AmstislavskiyV.BanaruM.HallenL.KrobitschS.LehrachH.SoldatovA. (2009). Transcriptome analysis by strand-specific sequencing of complementary DNA. Nucleic Acids Res.37, e123.10.1093/nar/gkp596
145
PickrellJ. K.MarioniJ. C.PaiA. A.DegnerJ. F.EngelhardtB. E.NkadoriE.VeyrierasJ. B.StephensM.GiladY.PritchardJ. K. (2010). Understanding mechanisms underlying human gene expression variation with RNA sequencing. Nature464, 768–772.10.1038/nature08872
146
PlessyC.BertinN.TakahashiH.SimoneR.SalimullahM.LassmannT.VitezicM.SeverinJ.OlivariusS.LazarevicD.HornigN.OrlandoV.BellI.GaoH.DumaisJ.KapranovP.WangH.DavisC. A.GingerasT. R.KawaiJ.DaubC. O.HayashizakiY.GustincichS.CarninciP. (2010). Linking promoters to functional transcripts in small samples with nanoCAGE and CAGEscan. Nat. Methods7, 528–534.10.1038/nmeth.1470
147
PotokinaE.DrukaA.LuoZ.WiseR.WaughR.KearseyM. (2008). Gene expression quantitative trait locus analysis of 16 000 barley genes reveals a complex pattern of genome-wide transcriptional regulation. Plant J.53, 90–101.10.1111/j.1365-313X.2007.03315.x
148
QuinceC.LanzenA.DavenportR. J.TurnbaughP. J. (2011). Removing noise from pyrosequenced amplicons. BMC Bioinformatics12, 38.10.1186/1471-2105-12-38
149
QuinlanA. R.HallI. M. (2010). BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics26, 841–842.10.1093/bioinformatics/btq033
150
R Development Core Team. (2009). R: A Language and Environment for Statistical Computing. Vienna, Austria.
- Google Scholar
151
RatanA.ZhangY.HayesV.SchusterS.MillerW. (2010). Calling SNPs without a reference sequence. BMC Bioinformatics11, 130.10.1186/1471-2105-11-130
152
RazT.KapranovP.LipsonD.LetovskyS.MilosP. M.ThompsonJ. F. (2011). Protocol dependence of sequencing-based gene expression measurements. PLoS ONE6, e19287.10.1371/journal.pone.0019287
153
RobertsonG.HirstM.BainbridgeM.BilenkyM.ZhaoY.ZengT.EuskirchenG.BernierB.VarholR.DelaneyA.ThiessenN.GriffithO. L.HeA.MarraM.SnyderM.JonesS. (2007). Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing. Nat. Methods4, 651–657.10.1038/nmeth1068
154
RobinsonJ. T.ThorvaldsdottirH.WincklerW.GuttmanM.LanderE. S.GetzG.MesirovJ. P. (2011). Integrative genomics viewer. Nat. Biotechnol.29, 24–26.10.1038/nbt.1888
155
RobinsonM. D.MccarthyD. J.SmythG. K. (2010). edgeR: a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics26, 139–140.10.1093/bioinformatics/btp616
156
RuffaloM.LaframboiseT.KoyuturkM. (2011). Comparative analysis of algorithms for next-generation sequencing read alignment. Bioinformatics27, 2790–2796.10.1093/bioinformatics/btr477
157
SanchezC. C.SmithT. P.WiedmannR. T.VallejoR. L.SalemM.YaoJ.RexroadC. E.III. (2009). Single nucleotide polymorphism discovery in rainbow trout by deep sequencing of a reduced representation library. BMC Genomics10, 559.10.1186/1471-2164-10-559
158
SchlottererC. (2004). The evolution of molecular markers – just a matter of fashion?Nat. Rev. Genet.5, 63–69.10.1038/nrg1249
159
SchmiederR.EdwardsR. (2011). Quality control and preprocessing of metagenomic datasets. Bioinformatics27, 863–864.10.1093/bioinformatics/btr026
160
SchmiederR.LimY. W.RohwerF.EdwardsR. (2010). TagCleaner: identification and removal of tag sequences from genomic and metagenomic datasets. BMC Bioinformatics11, 341.10.1186/1471-2105-11-341
161
SchneebergerK.OssowskiS.LanzC.JuulT.PetersenA. H.NielsenK. L.JorgensenJ. E.WeigelD.AndersenS. U. (2009). SHOREmap: simultaneous mapping and mutation identification by deep sequencing. Nat. Methods6, 550–551.10.1038/nmeth0809-550
162
SchneebergerK.WeigelD. (2011). Fast-forward genetics enabled by new sequencing technologies. Trends Plant Sci.16, 282–288.10.1016/j.tplants.2011.02.006
163
SchwartzS.OrenR.AstG. (2011). Detection and removal of biases in the analysis of next-generation sequencing reads. PLoS ONE6, e16685.10.1371/journal.pone.0016685
164
SkellyD. A.JohanssonM.MadeoyJ.WakefieldJ.AkeyJ. M. (2011). A powerful and flexible statistical framework for testing hypotheses of allele-specific gene expression from RNA-seq data. Genome Res. [Epub ahead of print].10.1101/gr.119784.110
165
SmithD. R.QuinlanA. R.PeckhamH. E.MakowskyK.TaoW.WoolfB.ShenL.DonahueW. F.TusneemN.StrombergM. P.StewartD. A.ZhangL.RanadeS. S.WarnerJ. B.LeeC. C.ColemanB. E.ZhangZ.MclaughlinS. F.MalekJ. A.SorensonJ. M.BlanchardA. P.ChapmanJ.HillmanD.ChenF.RokhsarD. S.MckernanK. J.JeffriesT. W.MarthG. T.RichardsonP. M. (2008). Rapid whole-genome mutational profiling using next-generation sequencing technologies. Genome Res.18, 1638–1642.10.1101/gr.077776.108
166
SouaiaiaT.FrazierZ.ChenT. (2011). ComB: SNP calling and mapping analysis for color and nucleotide space platforms. J. Comput. Biol.18, 795–807.10.1089/cmb.2011.0027
167
StajichJ. E.BlockD.BoulezK.BrennerS. E.ChervitzS. A.DagdigianC.FuellenG.GilbertJ. G.KorfI.LappH.LehvaslaihoH.MatsallaC.MungallC. J.OsborneB. I.PocockM. R.SchattnerP.SengerM.SteinL. D.StupkaE.WilkinsonM. D.BirneyE. (2002). The Bioperl toolkit: Perl modules for the life sciences. Genome Res.12, 1611–1618.10.1101/gr.361602
168
SteinL. D. (2010). The case for cloud computing in genome informatics. Genome Biol.11, 207.10.1186/gb-2010-11-5-207
169
SternD. L.OrgogozoV. (2008). The loci of evolution: how predictable is genetic evolution?Evolution62, 2155–2177.10.1111/j.1558-5646.2008.00450.x
170
SuC. L.ChaoY. T.Alex ChangY. C.ChenW. C.ChenC. Y.LeeA. Y.HwaK. T.ShihM. C. (2011). De novo assembly of expressed transcripts and global analysis of the Phalaenopsis aphrodite transcriptome. Plant Cell Physiol.52, 1501–1514.10.1093/pcp/pcr097
171
TarazonaS.Garcia-AlcaldeF.DopazoJ.FerrerA.ConesaA. (2011). Differential expression in RNA-seq: a matter of depth. Genome Res. [Epub ahead of print].10.1101/gr.124321.111
172
ToungJ. M.MorleyM.LiM.CheungV. G. (2011). RNA-sequence analysis of human B-cells. Genome Res.21, 991–998.10.1101/gr.116335.110
173
TrapnellC.PachterL.SalzbergS. L. (2009). TopHat: discovering splice junctions with RNA-seq. Bioinformatics25, 1105–1111.10.1093/bioinformatics/btp120
174
TrapnellC.WilliamsB. A.PerteaG.MortazaviA.KwanG.Van BarenM. J.SalzbergS. L.WoldB. J.PachterL. (2010). Transcript assembly and quantification by RNA-seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat. Biotechnol.28, 511–515.10.1038/nbt.1621
175
TrickM.LongY.MengJ.BancroftI. (2009). Single nucleotide polymorphism (SNP) discovery in the polyploid Brassica napus using Solexa transcriptome sequencing. Plant Biotechnol. J.7, 334–346.10.1111/j.1467-7652.2008.00396.x
176
UnderwoodJ. G.UzilovA. V.KatzmanS.OnoderaC. S.MainzerJ. E.MathewsD. H.LoweT. M.SalamaS. R.HausslerD. (2010). FragSeq: transcriptome-wide RNA structure probing using high-throughput sequencing. Nat. Methods7, 995–1001.10.1038/nmeth.1529
177
Van TassellC. P.SmithT. P.MatukumalliL. K.TaylorJ. F.SchnabelR. D.LawleyC. T.HaudenschildC. D.MooreS. S.WarrenW. C.SonstegardT. S. (2008). SNP discovery and allele frequency estimation by deep sequencing of reduced representation libraries. Nat. Methods5, 247–252.10.1038/nmeth.1185
178
VuylstekeM.DaeleH.VercauterenA.ZabeauM.KuiperM. (2006). Genetic dissection of transcriptional regulation by cDNA-AFLP. Plant J.45, 439–446.10.1111/j.1365-313X.2005.02630.x
179
VuylstekeM.Van EeuwijkF.Van HummelenP.KuiperM.ZabeauM. (2005). Genetic analysis of variation in gene expression in Arabidopsis thaliana. Genetics171, 1267–1275.10.1534/genetics.105.041509
180
WangJ.HudaA.LunyakV. V.JordanI. K. (2010a). A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags. Bioinformatics26, 2501–2508.10.1093/bioinformatics/btq241
- CrossRef
- Google Scholar
181
WangK.SinghD.ZengZ.ColemanS. J.HuangY.SavichG. L.HeX.MieczkowskiP.GrimmS. A.PerouC. M.MacleodJ. N.ChiangD. Y.PrinsJ. F.LiuJ. (2010b). MapSplice: accurate mapping of RNA-seq reads for splice junction discovery. Nucleic Acids Res.38, e178.10.1093/nar/gkq069
- CrossRef
- Google Scholar
182
WangL.FengZ.WangX.ZhangX. (2010c). DEGseq: an R package for identifying differentially expressed genes from RNA-seq data. Bioinformatics26, 136–138.10.1093/bioinformatics/btq241
- CrossRef
- Google Scholar
183
WangX.WuZ.ZhangX. (2010d). Isoform abundance inference provides a more accurate estimation of gene expression levels in RNA-seq. J. Bioinform. Comput. Biol.8(Suppl. 1), 177–192.10.1142/S0219720010004999
184
WangZ.FangB.ChenJ.ZhangX.LuoZ.HuangL.ChenX.LiY. (2010e). De novo assembly and characterization of root transcriptome using Illumina paired-end sequencing and development of cSSR markers in sweet potato (Ipomoea batatas). BMC Genomics11, 726.10.1186/1471-2164-11-726
- CrossRef
- Google Scholar
185
WangZ.GersteinM.SnyderM. (2009). RNA-seq: a revolutionary tool for transcriptomics. Nat. Rev. Genet.10, 57–63.10.1038/nrm2594
186
WeiW.QiX.WangL.ZhangY.HuaW.LiD.LvH.ZhangX. (2011). Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers. BMC Genomics12, 451.10.1186/1471-2164-12-451
187
WestM. A. L.KimK.KliebensteinD. J.Van LeeuwenH.MichelmoreR. W.DoergeR. W.St ClairD. A. (2007). Global eQTL mapping reveals the complex genetic architecture of transcript-level variation in Arabidopsis. Genetics175, 1441–1450.10.1534/genetics.106.064972
188
WillingE. M.HoffmannM.KleinJ. D.WeigelD.DreyerC. (2011). Paired-end RAD-seq for de novo assembly and marker design without available reference. Bioinformatics27, 2187–2193.10.1093/bioinformatics/btr346
189
WittkoppP. J.HaerumB. K.ClarkA. G. (2004). Evolutionary changes in cis and trans gene regulation. Nature430, 85–88.10.1038/nature02698
190
WrayG. A. (2007). The evolutionary significance of cis-regulatory mutations. Nat. Rev. Genet.8, 206–216.10.1038/nrg2063
191
WuX.RenC.JoshiT.VuongT.XuD.NguyenH. T. (2010). SNP discovery by high-throughput sequencing in soybean. BMC Genomics11, 469.10.1186/1471-2164-11-469
192
XieW.FengQ.YuH.HuangX.ZhaoQ.XingY.YuS.HanB.ZhangQ. (2010). Parent-independent genotyping for constructing an ultrahigh-density linkage map based on population sequencing. Proc. Natl. Acad. Sci. U.S.A.107, 10578–10583.10.1073/pnas.0912315107
193
YouF. M.HuoN.DealK. R.GuY. Q.LuoM. C.McguireP. E.DvorakJ.AndersonO. D. (2011). Annotation-based genome-wide SNP discovery in the large and complex Aegilops tauschii genome using next-generation sequencing without a reference genome sequence. BMC Genomics12, 59.10.1186/1471-2164-12-59
194
YoungM. D.WakefieldM. J.SmythG. K.OshlackA. (2010). Gene ontology analysis for RNA-seq: accounting for selection bias. Genome Biol.11, R14.10.1186/gb-2010-11-s1-i14
195
ZhangG.GuoG.HuX.ZhangY.LiQ.LiR.ZhuangR.LuZ.HeZ.FangX.ChenL.TianW.TaoY.KristiansenK.ZhangX.LiS.YangH.WangJ. (2010). Deep RNA sequencing at single base-pair resolution reveals high complexity of the rice transcriptome. Genome Res.20, 646–654.10.1101/gr.107334.110
196
ZhengW.ChungL. M.ZhaoH. (2011). Bias detection and correction in RNA-sequencing data. BMC Bioinformatics12, 290.10.1186/1471-2105-12-290

Summary

Keywords

QTL analysis, plant genetics, next generation sequencing, genomics, eQTL analysis, RNA-seq

Citation

Jiménez-Gómez JM (2011) Next Generation Quantitative Genetics in Plants. Front. Plant Sci. 2:77. doi: 10.3389/fpls.2011.00077

Received

29 April 2011

Accepted

23 October 2011

Published

15 November 2011

Volume

2 - 2011

Edited by

Alisdair Fernie, Max Planck Institute for Plant Physiology, Germany

Reviewed by

Alisdair Fernie, Max Planck Institute for Plant Physiology, Germany; Mathilde Causse, National Institute of Agricultural Research, France

This is an open-access article subject to a non-exclusive license between the authors and Frontiers Media SA, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and other Frontiers conditions are complied with.

*Correspondence: José M. Jiménez-Gómez, Department of Plant Breeding and Genetics, Max Planck Institute for Plant Breeding Research, Carl-von-Linné-Weg 10, 50829 Köln, Germany. e-mail: jmjimenez@mpipz.mpg.de

This article was submitted to Frontiers in Plant Physiology, a specialty of Frontiers in Plant Science.

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Plant Physiology

REVIEW article

Next Generation Quantitative Genetics in Plants

Abstract

Introduction

Library Preparation

Quality Control and Pre-Processing