REVIEW article

Front. Plant Sci., 15 November 2011

Sec. Plant Physiology

Volume 2 - 2011 | https://doi.org/10.3389/fpls.2011.00077

Next Generation Quantitative Genetics in Plants

  • JM

    José M. Jiménez-Gómez *

  • Department of Plant Breeding and Genetics, Max Planck Institute for Plant Breeding Research Köln, Germany

Abstract

Most characteristics in living organisms show continuous variation, which suggests that they are controlled by multiple genes. Quantitative trait loci (QTL) analysis can identify the genes underlying continuous traits by establishing associations between genetic markers and observed phenotypic variation in a segregating population. The new high-throughput sequencing (HTS) technologies greatly facilitate QTL analysis by providing genetic markers at genome-wide resolution in any species without previous knowledge of its genome. In addition HTS serves to quantify molecular phenotypes, which aids to identify the loci responsible for QTLs and to understand the mechanisms underlying diversity. The constant improvements in price, experimental protocols, computational pipelines, and statistical frameworks are making feasible the use of HTS for any research group interested in quantitative genetics. In this review I discuss the application of HTS for molecular marker discovery, population genotyping, and expression profiling in QTL analysis.

Introduction

For almost one century scientists have dissected the genetic architecture of quantitative traits in plants using Quantitative trait loci (QTL) analysis (Fisher, 1918). These analyses establish associations between genetic markers and the phenotypic variation of a quantitative trait in a segregating population. The techniques used to obtain markers and physiological phenotypes have been constantly improved through history (Schlotterer, 2004; Montes et al., 2007). Recently, the price drop of high-throughput technologies have allowed plant researchers to quantify the general abundance of transcripts, proteins, or metabolites in segregating populations (Kirst et al., 2005; Vuylsteke et al., 2005, 2006; Decook et al., 2006; Keurentjes et al., 2007; West et al., 2007; Lisec et al., 2008; Potokina et al., 2008; Drost et al., 2010). These studies show that there are multiple benefits in using “omic” technologies for QTL analyses, even when the goal is to characterize physiological phenotypic diversity. First, molecular phenotypes are the initial step toward the production of physiological phenotypes and its regulation underlies much of phenotypic diversity (Hoekstra and Coyne, 2007; Stern and Orgogozo, 2008). Second, the availability of genome-wide information significantly increases the ability to identify candidate genes for QTLs (Jimenez-Gomez et al., 2010). Third, molecular traits measured at system scale allow estimation of the effect of QTLs in the genetic pathways of interest, or identification of additional gene networks altered by the loci responsible for the variation (Kliebenstein et al., 2006). Finally, molecular traits offer researchers a better understanding of how mutation drives physiological variation and what are the evolutionary forces acting at primary levels.

High-throughput sequencing, or HTS, allows the rapid and cost–effective generation of massive amounts of short sequences or reads (Metzker, 2010). The potential of this technology for mapping loci responsible for phenotypic differences in plants has already been demonstrated by identifying genes containing EMS-induced mutations in samples of pooled F2 individuals (Schneeberger et al., 2009; Austin et al., 2011). HTS technologies have been in the market for a few years, and new methods are being developed that will be cheaper, require less sample processing, and will produce more and longer reads (Munroe and Harris, 2010; Glenn, 2011; Niedringhaus et al., 2011). It is therefore clear that very soon HTS will be the tool of choice for QTL analyses. One important limiting factor remains to be eliminated: Data analysis. It requires long and computationally intensive pipelines that need to be customized for each particular experimental set up. An increasing number of new algorithms are constantly released to the community, and the debate on which pipelines return the most accurate results is still ongoing. Comparing, combining, and customizing these pipelines requires simple Unix or Linux commands and greatly benefits from knowledge in powerful statistical software such as R, and in scripting languages, such as Perl or Python (R Development Core Team, 2009). For non-bioinformaticians, integrated solutions with convenient interfaces are becoming popular both from collaborative open projects and companies (Blankenberg et al., 2010; Goecks et al., 2010). A popular website that keeps an actualized list of the available software tools is www.seqanswers.com, where users and developers also discuss new technological advances and pipelines. In terms of the computational equipment required for HTS data analysis, the majority of tools are developed for Linux or Unix based systems. Although parts of the analysis can be performed in any modern computer, machines with dozens of gigabytes of RAM are recommended in cases where reference sequences form the species considered are available, or with hundreds if no reference exists. An alternative option that is likely to become popular is to rent storage and computing power in specialized centers, or “the cloud” (Stein, 2010).

Due to the fast improvement of HTS, this review intends only to capture a snapshot in time of the possibilities that it offers for molecular marker discovery, genotyping, and molecular phenotyping in segregating populations of plants. This review has the purpose of helping researchers who have not incorporated this technology to their work to think about the requirements and possibilities of HTS. By no means this review refers to all available experimental designs or analysis tools, and the solutions proposed here are mere suggestions that will certainly soon be substituted by new and better ones. A guide map of the methods proposed in this review is depicted in Figure 1.

Figure 1

Library Preparation

Sample preparation protocols are continuously improved to use fewer amounts of biological material, be completed faster, and reduce the bias in their output. As an example, most current protocols allow multiplexing samples by adding a short sequence tag to all reads in a library, a convenient feature given the increasing numbers of reads produced per HTS run. The same companies that developed the HTS sequencers commercialize library preparation protocols optimized for the most common experimental designs. There are also kits from other companies that give comparable results and may be more cost efficient. Finally, many researchers are developing custom protocols to obtain specific information such as the transcribed strand in RNA-seq experiments, the rate of RNA degradation, or the positions occupied by RNA polymerases, just to name a few (Addo-Quaye et al., 2008; Core et al., 2008; German et al., 2008; Parkhomchuk et al., 2009).

Quality Control and Pre-Processing

Assessing the quality of HTS reads includes detection of biases on base composition, base quality, and sample complexity. The quality of the sequences has an impact on the reliability of the biological interpretations resulting from the analysis (Dohm et al., 2008). Part of these biases are introduced by the sample preparation protocols (Schwartz et al., 2011), particularly during cDNA synthesis in RNA-seq experiments (Hansen et al., 2010; Li et al., 2010b) and PCR amplification (Aird et al., 2011). Additional biases are particular to each HTS technology (Smith et al., 2008; Quince et al., 2011) or specific to each run of the sequencers (Auer and Doerge, 2010).

After quality control it is usually necessary to pre-process the reads by trimming low quality nucleotides and adapter sequences. At this stage, foreign sequences such as vectors or DNA from organisms contaminating the samples can also be removed. Depending of the type of libraries sequenced further pre-processing may be needed, such as trimming poly A or poly T tails and terminal transferase tails in RNA-seq libraries. In cases where several libraries have been multiplexed, reads should be separated by their barcode.

Both quality control and pre-processing can be easily performed with basic scripts written in Perl (Bioperl), R (Bioconductor), or Python (Biopython; Stajich et al., 2002; Gentleman et al., 2004; Cock et al., 2009; R Development Core Team, 2009). For non-programmers, there are some convenient tools that can carry out all or some of these tasks (FastQC, 2008; FASTX-Toolkit, 2009; Blankenberg et al., 2010; Falgueras et al., 2010; Goecks et al., 2010; Cutadapt, 2010; Schmieder et al., 2010; Schmieder and Edwards, 2011).

Molecular Marker Discovery

Depending on the availability of a reference sequence short reads will be aligned or de novo assembled using one of the multiple tools available. There are a number of recent articles that compare the most popular algorithms and software available for these purposes (Bao et al., 2011; Lin et al., 2011; Ruffalo et al., 2011). Please note that the methods proposed below are directed to developing molecular markers for QTL analysis and not to identify the mutation underlying the QTL, which requires much deeper sequencing.

With a reference sequence

A cost efficient solution to obtain molecular markers is to sequence DNA or RNA from the parental genotypes and mine polymorphisms from the resulting reads. These polymorphisms can be used later to design PCR markers or a high-throughput genotyping assay for the full population. This approach works remarkably well in diploid and polyploidy species using as low an amount of sequence as 5× coverage, meaning five times the size of the genome under study (Ossowski et al., 2008; Gore et al., 2009; Trick et al., 2009; Lai et al., 2010; Lam et al., 2010; Arai-Kichise et al., 2011; Geraldes et al., 2011). A recent article reviews the methods and tools available for single nucleotide polymorphism (SNP) identification and genotyping (Nielsen et al., 2011). To align the reads to the reference, mapping softwares based in “seed methods” are preferred despite their slower nature because their robustness to polymorphisms. Before SNP calling users may consider removal of the reads that map to multiple locations in the reference, and of duplicated reads that may have been generated from PCR artifacts. A recent pipeline also recalibrates the quality of the nucleotides in the reads to correct for the high error rates in HTS, and realigns reads in complex genomic positions where the fast processing alignment algorithms may have failed (Depristo et al., 2011). Commonly used indicators of the veracity of polymorphisms are based in the amount and quality of reads showing the polymorphism, frequency of the observed alleles, quality of the alignment, and/or proximity to other polymorphisms. There are some basic and popular options for calling polymorphisms from aligned reads (Li et al., 2009a,b; Depristo et al., 2011), tools specialized in the analysis of reads from particular sequencing platforms (Souaiaia et al., 2011), that have the ability to detect structural variation (Chen et al., 2009; Hormozdiari et al., 2009, 2010), or that have into account the quality of the reference in addition to the quality of the reads (Frohler and Dieterich, 2010). An essential method to control for the quality of the data analysis process is visual inspection through genome viewers specialized in HTS datasets (Huang and Marth, 2008; Bao et al., 2009; Milne et al., 2010; Robinson et al., 2011).

Without a reference sequence

High-throughput sequencing sequences can serve to construct the necessary reference to identify molecular markers if it is not already available. Although assembling de novo a complete genome sequence is possible with HTS, it requires very deep sequencing and extensive bioinformatic analysis, even more given the relatively large size of most plant genomes. A more efficient option is sequencing mRNA, which greatly reduces sample complexity in comparison with genome sequencing and has the advantage of offering functional information such as coding polymorphisms or expression levels (Graham et al., 2010; Mizrachi et al., 2010; Bancroft et al., 2011; Everett et al., 2011; Garg et al., 2011; Guo et al., 2011; Ibarra-Laclette et al., 2011; Ness et al., 2011; Su et al., 2011; Wei et al., 2011). A comprehensive compilation of the methods and tools available for transcriptome assembly has been recently published (Martin and Wang, 2011). De novo assembly algorithms greatly benefit from long and paired-end reads, but are extremely sensitive to errors and polymorphisms and will not perform well during assembly of datasets from mixed genotypes or highly heterozygous individuals. The amount of new genomic positions detected in RNA-seq experiments decrease exponentially as the number of reads increases (Figure 2). The majority of medium and highly expressed transcripts in a sample are detected at low coverage, and increasing coverage will mainly add non-coding RNAs and low expressed transcripts at a very high cost (Tarazona et al., 2011). If the objective is to assemble complete transcriptomes, obtaining samples from diverse tissues, time points, and conditions is preferred to depth of sequencing. Even in the best possible conditions assemblies from RNA-seq reads will return only a subset of the existing transcripts, many of which will be fragmented. This is expected due to low expression of particular transcripts, the non-uniform read coverage, and the presence of different isoforms per gene. To help assembly of low expressed transcripts researchers can use normalization protocols that deplete the most abundant transcripts from the samples (Christodoulou et al., 2011). In any case, contigs resulting from de novo assembly can be effectively used as a reference for molecular marker detection and characterization of transcripts in un-sequenced genomes (Parchman et al., 2010; Wang et al., 2010e; Angeloni et al., 2011; Hiremath et al., 2011; Kaur et al., 2011).

Figure 2

When highly similar genotypes are compared, RNA-seq may not be the best option since it mostly targets coding regions, which are less diverse than non-coding regions. In these cases researchers can construct reduced representation libraries by shearing DNA using restriction endonucleases and size-selecting the fragments that will be sequenced. Reads from these libraries can be clustered by similarity and mined for polymorphisms close to the restriction sites; or used to detect the presence–absence of particular tags, indicating a polymorphism in the restriction site itself (Kerstens et al., 2009; Sanchez et al., 2009; Etter et al., 2011). Obtaining polymorphisms from reduced representation libraries is more efficient when a reference sequence is available (Van Tassell et al., 2008; Wu et al., 2010). However, researchers have already developed tools to genotype samples from these tags using a low number of reads from organisms without a reference (Ratan et al., 2010), or to reconstruct part of the targeted genome using paired-end sequencing (Willing et al., 2011). Additional protocols to obtain markers from reduced representation libraries exist in which different combination of restriction enzymes are used for each of the genotypes involved (Hyten et al., 2010), or that do not shear the DNA but filter the reads for single copy sequences (You et al., 2011). The amount of reads necessary to perform this type of analysis depends on the size of the genome, the restriction enzymes used, and the availability of a reference.

Genotyping Populations

With the price drop of the HTS technologies and the possibility of multiplexing samples, genotyping an entire population has become realistic (Schneeberger and Weigel, 2011). In the case of a sequenced system such as rice, generating reads from the individuals of a population at 0.02–0.055× coverage allowed high-density genotyping by comparisons with the parental genotypes (Huang et al., 2009), or by inferring the parental genotypes from the polymorphisms found in the population (Xie et al., 2010). Since erroneous polymorphism calls are expected at low coverage, more or less complex algorithms need to be defined to correctly genotype each polymorphism in each individual (Huang et al., 2009; Xie et al., 2010; Li et al., 2011). In addition, a reference sequence can serve researchers to design enrichment essays that will target their preferred genomic locations, although at high cost (Blow, 2009; Mamanova et al., 2010; Nijman et al., 2010; Kenny et al., 2011). For species where a genome sequence is not available, a very practical approach is to sequence reduced representation libraries as mentioned above (Baird et al., 2008; Emerson et al., 2010b; Hohenlohe et al., 2010, 2011).

Molecular Phenotyping

The list of molecular phenotypes that can be quantified with HTS is extensive and is rapidly increasing (Hawkins et al., 2010). Examples of these phenotypes are protein–RNA interactions (Licatalosi et al., 2008; Hafner et al., 2010), translation rates (Ingolia et al., 2009; Ingolia, 2010), transcription rates (Core et al., 2008; Churchman and Weissman, 2011), protein–DNA interactions (Albert et al., 2007; Barski et al., 2007; Johnson et al., 2007; Mikkelsen et al., 2007; Robertson et al., 2007; Chen et al., 2008; Hesselberth et al., 2009), RNA degradation rates (Addo-Quaye et al., 2008; German et al., 2008), RNA secondary structure (Kertesz et al., 2010; Underwood et al., 2010), transcription start positions (Plessy et al., 2010), chromatin accessibility (Boyle et al., 2008), methylation states (Cokus et al., 2008; Down et al., 2008; Lister et al., 2008; Meissner et al., 2008), natural antisense transcription (Cloonan et al., 2008; Core et al., 2008; He et al., 2008; Armour et al., 2009; Parkhomchuk et al., 2009) or small RNA profiles (Lu et al., 2005). QTL analysis using these phenotypes as traits is an exciting field that remains un-explored. Therefore, the computational frameworks to quantitatively compare these phenotypes between individuals will need to be established.

Expression profiling with HTS

Although many cases of phenotypic variation caused by coding polymorphisms have been documented, variation in gene expression has been shown to underlie much of phenotypic diversity (Reviewed in Hoekstra and Coyne, 2007; Wray, 2007; Stern and Orgogozo, 2008). One method to detect differences in expression between individuals using HTS is to sequence 26–27 nucleotide-long tags from expressed transcripts (Matsumura et al., 2010; Hong et al., 2011). A recent study shows that this method reaches saturation in mice with 6–8 million reads per sample (Hong et al., 2011). Its advantages over sequencing full transcripts are the lower cost, higher sensitivity, reduced bias during amplification due to the fixed fragment lengths, and use of simplified statistical models to calculate differential expression. On the other hand, methods based in tags will not detect the majority of coding polymorphisms and isoforms, and require a close enough reference sequence to extract biologically meaningful results.

RNA-seq is rapidly becoming a standard in expression profiling because of its simple protocol of preparation, digital nature, large dynamic range, and high sensitivity in comparison with previous technologies (Marioni et al., 2008; Bradford et al., 2010; Liu et al., 2010). In addition, it can serve to genotype individuals, identify novel transcripts, characterize alternative splicing, and quantify allele specific expression (Reviewed in Wang et al., 2009; Costa et al., 2010; Marguerat and Bahler, 2010). Due to the novelty of the technique there is no consensus on which sample preparation protocols present fewer biases (Raz et al., 2011). However, strand-specific methods could become a standard because of their increased precision due to their ability to distinguish between sense and antisense transcripts (He et al., 2008; Levin et al., 2010). In terms of experimental designs, it is necessary to randomize and replicate biological samples, as with any other type of genome-wide analysis (Auer and Doerge, 2010; Fang and Cui, 2011; Hansen et al., 2011). There is little consensus about the depth of sequence needed for expression profiling with RNA-seq. Recent estimates range between 30 million reads to compare the expression profiles of two samples, to 100 million reads to detect most transcribed genes and quantify isoforms, to 500 million to obtain accurate profiles, including low expressed transcripts (Zhang et al., 2010; ENCODE, 2011; Toung et al., 2011). In any case, it is advisable to balance the number of reads between samples in the same experiment in order to perform accurate expression comparisons (Tarazona et al., 2011).

Expression profiling from HTS datasets is necessarily based on counting the reads mapped to each transcript in a reference sequence. When a reference genome or transcriptome is not available, it can be reconstructed using de novo assembly of the reads for at least one of the genotypes as described above. The simpler and less computational intensive protocol for expression profiling is to map the RNA-seq reads to known (or de novo assembled) transcripts and a set of possible exon–exon junctions (when available) to detect alternative splicing. However, in organisms with sequenced genomes this protocol will not allow detection of novel exons, transcripts, and isoforms. The preferred pipeline involves aligning the reads to the genomic reference using an alignment tool that splices the reads to detect intron–exon junctions (For example Trapnell et al., 2009; Ameur et al., 2010; Au et al., 2010; Guttman et al., 2010; Wang et al., 2010b; Lou et al., 2011).

A challenge for expression analyses in samples from two unrelated individuals is the need to perform robust quantification of reads generated from two or more alleles. This implies that reads with the closer genotype to the reference will align better than reads from a more distant genotype, in which more polymorphisms may interfere with their ability to map (Fontanillas et al., 2010). In these cases, aligners based in seed methods will perform better than those based in the Burrows–Wheeler Transform algorithm (For a review see Garber et al., 2011). Although most studies ignore this problem, there are solutions that go from identifying and removing the polymorphisms that cause these biases (Degner et al., 2009), aligning the reads to all references from the genotypes involved (Bullard et al., 2010a) or including the polymorphisms found in the references (Gan et al., 2011). When two references are used, a potential problem may arise from motifs that are more abundant in one reference with respect to the other if only uniquely mapped reads are counted. The use of longer reads and/or paired-end reads greatly decreases the number of ambiguously mapped reads. In addition, there are robust methods to assign these multi-mapped reads to a single location (Faulkner et al., 2008; Mortazavi et al., 2008; Hashimoto et al., 2009; Li et al., 2010a; Wang et al., 2010a; Ji et al., 2011).

There are a number of tools to count the number of reads aligned to each transcriptional unit to calculate expression, most of which require knowledge of Perl, Phyton, Linux/Unix, or R (Carlson et al., 2009; Bio::DB::Sam, 2009; Anders, 2010; Morgan and Pagès, 2010; Quinlan and Hall, 2010). Some alignment tools can directly calculate the number of reads per transcript and/or a measure of expression based in the reads (or fragments) per gene size in kilobases per million reads mapped, called RPKM (or FPKM; Mortazavi et al., 2008; Trapnell et al., 2010). However, these expression units show biases depending on the length, number, abundance of the transcripts present in the samples, or because of technical replication (Oshlack and Wakefield, 2009; Bullard et al., 2010b; Mcintyre et al., 2011). For this reason researchers have developed dedicated R/Bioconductor packages to calculate differential expression between samples based on raw read counts per transcript (Anders and Huber, 2010; Bullard et al., 2010b; Hardcastle and Kelly, 2010; Robinson et al., 2010; Wang et al., 2010c). In addition, there are software packages that take into consideration the biases inherent to RNA-seq when calculating expression or performing downstream analyses such as gene ontology over-representation studies (Young et al., 2010; Zheng et al., 2011).

High-throughput sequencing datasets allow quantification of expression for each isoform separately, resulting in significantly more accurate estimates than calculating expression at the gene level (Wang et al., 2010d). For this, users must first identify splicing events from the reads that align to exon–exon junctions. Quantifying isoform expression is complicated since most reads in an alternatively spliced transcript cannot be assigned to a single isoform. The most promising methods to address this complex problem take advantage from the information offered by paired-end and/or unambiguously mapped reads (Guttman et al., 2010; Katz et al., 2010; Li et al., 2010a; Trapnell et al., 2010; Nicolae et al., 2011). One advantage of going through the intricate process of identification of alternative splicing is that it can also be used as a trait for QTL analysis (Li et al., 2010c; Montgomery et al., 2010; Pickrell et al., 2010; Lalonde et al., 2011).

Allele specific expression in hybrids

An alternative to sequencing a full segregating population to perform eQTL analyses is to sequence F1 hybrid individuals, where allele specific expression can be calculated for loci with coding polymorphisms (Babak et al., 2008, 2010; Bullard et al., 2010a; Emerson et al., 2010a; Mcmanus et al., 2010; Pickrell et al., 2010). For any gene, both alleles in the hybrid share the same cellular environment and, as a result, changes in expression between alleles must necessarily be due to cis-acting regulators (Cowles et al., 2002). Trans-acting eQTLs can be inferred by performing RNA-seq in the parentals and comparing the differences in expression levels between alleles in the hybrid with the differences between the parentals (Wittkopp et al., 2004). Despite the considerable reduction in price and simplicity of experimental design, this method has several drawbacks. Allele specific expression can only be calculated in transcripts with coding polymorphisms that are highly covered, and it is very dependent on read and transcript length (Degner et al., 2009; Fontanillas et al., 2010). New statistical approaches are being developed that will overcome these limitations, starting by being able to estimate false discovery rates and allele specific alternative splicing (Skelly et al., 2011).

In summary, HTS is changing the way we perform QTL analysis by allowing high-throughput genotyping of populations and phenotyping of traits with a precision not achievable before. It is clear that HTS has not reached its peak of development, and that tools and algorithms will have to be modified according to the new technological improvements. Nevertheless, the first experiments using this technology have already identified exciting possibilities for the characterization of natural variation in plants.

Statements

Conflict of interest

The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  • 1

    Addo-QuayeC.EshooT. W.BartelD. P.AxtellM. J. (2008). Endogenous siRNA and miRNA targets identified by sequencing of the Arabidopsis degradome. Curr. Biol.18, 758762.10.1016/j.cub.2008.04.042

  • 2

    AirdD.RossM. G.ChenW. S.DanielssonM.FennellT.RussC.JaffeD. B.NusbaumC.GnirkeA. (2011). Analyzing and minimizing PCR amplification bias in Illumina sequencing libraries. Genome Biol.12, R18.10.1186/1465-6906-12-S1-I18

  • 3

    AlbertI.MavrichT. N.TomshoL. P.QiJ.ZantonS. J.SchusterS. C.PughB. F. (2007). Translational and rotational settings of H2A.Z nucleosomes across the Saccharomyces cerevisiae genome. Nature446, 572576.10.1038/nature05632

  • 4

    AmeurA.WetterbomA.FeukL.GyllenstenU. (2010). Global and unbiased detection of splice junctions from RNA-seq data. Genome Biol.11, R34.10.1186/gb-2010-11-3-r34

  • 5

    AndersS. (2010). HTSeq: Analysing High-Throughput Sequencing Data With Python. Available at: http://www-huber.embl.de/users/anders/HTSeq/doc/overview.html#author

  • 6

    AndersS.HuberW. (2010). Differential expression analysis for sequence count data. Genome Biol.11, R106.10.1186/gb-2010-11-10-r106

  • 7

    AngeloniF.WagemakerC. A.JettenM. S.Op Den CampH. J.Janssen-MegensE. M.FrancoijsK. J.StunnenbergH. G.OuborgN. J. (2011). De novo transcriptome characterization and development of genomic tools for Scabiosa columbaria L. using next-generation sequencing techniques. Mol. Ecol. Resour.11, 662674.10.1111/j.1755-0998.2011.02990.x

  • 8

    Arai-KichiseY.ShiwaY.NagasakiH.EbanaK.YoshikawaH.YanoM.WakasaK. (2011). Discovery of genome-wide DNA polymorphisms in a landrace cultivar of Japonica rice by whole-genome sequencing. Plant Cell Physiol.52, 274282.10.1093/pcp/pcr003

  • 9

    ArmourC. D.CastleJ. C.ChenR.BabakT.LoerchP.JacksonS.ShahJ. K.DeyJ.RohlC. A.JohnsonJ. M.RaymondC. K. (2009). Digital transcriptome profiling using selective hexamer priming for cDNA synthesis. Nat. Methods6, 647649.10.1038/nmeth.1360

  • 10

    AuK. F.JiangH.LinL.XingY.WongW. H. (2010). Detection of splice junctions from paired-end RNA-seq data by SpliceMap. Nucleic Acids Res.38, 45704578.10.1093/nar/gkq211

  • 11

    AuerP. L.DoergeR. W. (2010). Statistical design and analysis of RNA sequencing data. Genetics185, 405416.10.1534/genetics.110.114983

  • 12

    AustinR. S.VidaurreD.StamatiouG.BreitR.ProvartN. J.BonettaD.ZhangJ.FungP.GongY.WangP. W.MccourtP.GuttmanD. S. (2011). Next-generation mapping of Arabidopsis genes. Plant J.67, 715725.10.1111/j.1365-313X.2011.04619.x

  • 13

    BabakT.DevealeB.ArmourC.RaymondC.ClearyM. A.Van Der KooyD.JohnsonJ. M.LimL. P. (2008). Global survey of genomic imprinting by transcriptome sequencing. Curr. Biol.18, 17351741.10.1016/j.cub.2008.09.044

  • 14

    BabakT.Garrett-EngeleP.ArmourC. D.RaymondC. K.KellerM. P.ChenR.RohlC. A.JohnsonJ. M.AttieA. D.FraserH. B.SchadtE. E. (2010). Genetic validation of whole-transcriptome sequencing for mapping expression affected by cis-regulatory variation. BMC Genomics11, 473.10.1186/1471-2164-11-473

  • 15

    BairdN. A.EtterP. D.AtwoodT. S.CurreyM. C.ShiverA. L.LewisZ. A.SelkerE. U.CreskoW. A.JohnsonE. A. (2008). Rapid SNP discovery and genetic mapping using sequenced RAD markers. PLoS ONE3, e3376.10.1371/journal.pone.0003376

  • 16

    BancroftI.MorganC.FraserF.HigginsJ.WellsR.ClissoldL.BakerD.LongY.MengJ.WangX.LiuS.TrickM. (2011). Dissecting the genome of the polyploid crop oilseed rape by transcriptome sequencing. Nat. Biotechnol.29, 762766.10.1038/nbt.1926

  • 17

    BaoH.GuoH.WangJ.ZhouR.LuX.ShiS. (2009). MapView: visualization of short reads alignment on a desktop computer. Bioinformatics25, 15541555.10.1093/bioinformatics/btp255

  • 18

    BaoS.JiangR.KwanW.WangB.MaX.SongY. Q. (2011). Evaluation of next-generation sequencing software in mapping and assembly. J. Hum. Genet.56, 406414.10.1038/jhg.2011.43

  • 19

    BarskiA.CuddapahS.CuiK.RohT. Y.SchonesD. E.WangZ.WeiG.ChepelevI.ZhaoK. (2007). High-resolution profiling of histone methylations in the human genome. Cell129, 823837.10.1016/j.cell.2007.05.009

  • 20

  • 21

    BlankenbergD.GordonA.Von KusterG.CoraorN.TaylorJ.NekrutenkoA. (2010). Manipulation of FASTQ data with galaxy. Bioinformatics26, 17831785.10.1093/bioinformatics/btq281

  • 22

    BlowN. (2009). Genomics: catch me if you can. Nat. Methods6, 539544.10.1038/nmeth0609-465

  • 23

    BoyleA. P.DavisS.ShulhaH. P.MeltzerP.MarguliesE. H.WengZ.FureyT. S.CrawfordG. E. (2008). High-resolution mapping and characterization of open chromatin across the genome. Cell132, 311322.10.1016/j.cell.2007.12.014

  • 24

    BradfordJ. R.HeyY.YatesT.LiY.PepperS. D.MillerC. J. (2010). A comparison of massively parallel nucleotide sequencing with oligonucleotide microarrays for global transcription profiling. BMC Genomics11, 282.10.1186/1471-2164-11-282

  • 25

    BullardJ. H.MostovoyY.DudoitS.BremR. B. (2010a). Polygenic and directional regulatory evolution across pathways in Saccharomyces. Proc. Natl. Acad. Sci. U.S.A.107, 50585063.10.1073/pnas.0912959107

  • 26

    BullardJ. H.PurdomE.HansenK. D.DudoitS. (2010b). Evaluation of statistical methods for normalization and differential expression in mRNA-seq experiments. BMC Bioinformatics11, 94.10.1186/1471-2105-11-94

  • 27

    CarlsonM.PagesH.AboyounP.FalconS.MorganM.SarkarD.LawrenceM. (2009). Genomic Features: Tools for Making and Manipulating Transcript Centric Annotations. Available at: http://www.bioconductor.org/packages/2.6/bioc/html/GenomicFeatures.html

  • 28

    ChenK.WallisJ. W.MclellanM. D.LarsonD. E.KalickiJ. M.PohlC. S.McgrathS. D.WendlM. C.ZhangQ.LockeD. P.ShiX.FultonR. S.LeyT. J.WilsonR. K.DingL.MardisE. R. (2009). BreakDancer: an algorithm for high-resolution mapping of genomic structural variation. Nat. Methods6, 677681.10.1038/nmeth.1363

  • 29

    ChenX.XuH.YuanP.FangF.HussM.VegaV. B.WongE.OrlovY. L.ZhangW.JiangJ.LohY. H.YeoH. C.YeoZ. X.NarangV.GovindarajanK. R.LeongB.ShahabA.RuanY.BourqueG.SungW. K.ClarkeN. D.WeiC. L.NgH. H. (2008). Integration of external signaling pathways with the core transcriptional network in embryonic stem cells. Cell133, 11061117.10.1016/j.cell.2008.04.043

  • 30

    ChristodoulouD. C.GorhamJ. M.HermanD. S.SeidmanJ. (2011). Construction of normalized RNA-seq libraries for next-generation sequencing using the crab duplex-specific nuclease. Curr. Protoc. Mol. Biol.94, 4.12.14.12.11.

  • 31

    ChurchmanL. S.WeissmanJ. S. (2011). Nascent transcript sequencing visualizes transcription at nucleotide resolution. Nature469, 368373.10.1038/nature09652

  • 32

    CloonanN.ForrestA. R.KolleG.GardinerB. B.FaulknerG. J.BrownM. K.TaylorD. F.SteptoeA. L.WaniS.BethelG.RobertsonA. J.PerkinsA. C.BruceS. J.LeeC. C.RanadeS. S.PeckhamH. E.ManningJ. M.MckernanK. J.GrimmondS. M. (2008). Stem cell transcriptome profiling via massive-scale mRNA sequencing. Nat. Methods5, 613619.10.1038/nmeth.1223

  • 33

    CockP. J.AntaoT.ChangJ. T.ChapmanB. A.CoxC. J.DalkeA.FriedbergI.HamelryckT.KauffF.WilczynskiB.De HoonM. J. (2009). Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics25, 14221423.10.1093/bioinformatics/btp163

  • 34

    CokusS. J.FengS.ZhangX.ChenZ.MerrimanB.HaudenschildC. D.PradhanS.NelsonS. F.PellegriniM.JacobsenS. E. (2008). Shotgun bisulphite sequencing of the Arabidopsis genome reveals DNA methylation patterning. Nature452, 215219.10.1038/nature06745

  • 35

    CoreL. J.WaterfallJ. J.LisJ. T. (2008). Nascent RNA sequencing reveals widespread pausing and divergent initiation at human promoters. Science322, 18451848.10.1126/science.1162228

  • 36

    CostaV.AngeliniC.De FeisI.CiccodicolaA. (2010). Uncovering the complexity of transcriptomes with RNA-seq. J. Biomed. Biotechnol.2010, 853916.10.1155/2010/853916

  • 37

    CowlesC. R.HirschhornJ. N.AltshulerD.LanderE. S. (2002). Detection of regulatory variation in mouse genes. Nat. Genet.32, 432437.10.1038/ng992

  • 38

    Cutadapt. (2010). A Tool That Removes Adapter Sequences From DNA Sequencing Reads. Available at: http://code.google.com/p/cutadapt/

  • 39

    DecookR.LallS.NettletonD.HowellS. H. (2006). Genetic regulation of gene expression during shoot development in Arabidopsis. Genetics172, 11551164.10.1534/genetics.105.042275

  • 40

    DegnerJ. F.MarioniJ. C.PaiA. A.PickrellJ. K.NkadoriE.GiladY.PritchardJ. K. (2009). Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data. Bioinformatics25, 32073212.10.1093/bioinformatics/btp579

  • 41

    DepristoM. A.BanksE.PoplinR.GarimellaK. V.MaguireJ. R.HartlC.PhilippakisA. A.Del AngelG.RivasM. A.HannaM.MckennaA.FennellT. J.KernytskyA. M.SivachenkoA. Y.CibulskisK.GabrielS. B.AltshulerD.DalyM. J. (2011). A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet.43, 491498.10.1038/ng.806

  • 42

    DohmJ. C.LottazC.BorodinaT.HimmelbauerH. (2008). Substantial biases in ultra-short read data sets from high-throughput DNA sequencing. Nucleic Acids Res.36, e105.10.1093/nar/gkn425

  • 43

    DownT. A.RakyanV. K.TurnerD. J.FlicekP.LiH.KuleshaE.GrafS.JohnsonN.HerreroJ.TomazouE. M.ThorneN. P.BackdahlL.HerberthM.HoweK. L.JacksonD. K.MirettiM. M.MarioniJ. C.BirneyE.HubbardT. J.DurbinR.TavareS.BeckS. (2008). A Bayesian deconvolution strategy for immunoprecipitation-based DNA methylome analysis. Nat. Biotechnol.26, 779785.10.1038/nbt1414

  • 44

    DrostD. R.BenedictC. I.BergA.NovaesE.NovaesC. R.YuQ.DervinisC.MaiaJ. M.YapJ.MilesB.KirstM. (2010). Diversification in the genetic architecture of gene expression and transcriptional networks in organ differentiation of Populus. Proc. Natl. Acad. Sci. U.S.A.107, 84928497.10.1073/pnas.0914709107

  • 45

    EmersonJ. J.HsiehL. C.SungH. M.WangT. Y.HuangC. J.LuH. H.LuM. Y.WuS. H.LiW. H. (2010a). Natural selection on cis and trans regulation in yeasts. Genome Res.20, 826836.10.1101/gr.101576.109

  • 46

    EmersonK. J.MerzC. R.CatchenJ. M.HohenloheP. A.CreskoW. A.BradshawW. E.HolzapfelC. M. (2010b). Resolving postglacial phylogeography using high-throughput sequencing. Proc. Natl. Acad. Sci. U.S.A.107, 1619616200.10.1073/pnas.1006538107

  • 47

    ENCODE. (2011). Standards, guidelines and best practices for RNA-seq. V1.0.

  • 48

    EtterP. D.PrestonJ. L.BasshamS.CreskoW. A.JohnsonE. A. (2011). Local de novo assembly of RAD paired-end contigs using short sequencing reads. PLoS ONE6, e18561.10.1371/journal.pone.0018561

  • 49

    EverettM. V.GrauE. D.SeebJ. E. (2011). Short reads and nonmodel species: exploring the complexities of next-generation sequence assembly and SNP discovery in the absence of a reference genome. Mol. Ecol. Resour.11(Suppl. 1), 93108.10.1111/j.1755-0998.2010.02969.x

  • 50

    FalguerasJ.LaraA. J.Fernandez-PozoN.CantonF. R.Perez-TrabadoG.ClarosM. G. (2010). SeqTrim: a high-throughput pipeline for pre-processing any type of sequence read. BMC Bioinformatics11, 38.10.1186/1471-2105-11-38

  • 51

    FangZ.CuiX. (2011). Design and validation issues in RNA-seq experiments. Brief. Bioinform.12, 280287.10.1093/bib/bbr004

  • 52

  • 53

  • 54

    FaulknerG. J.ForrestA. R.ChalkA. M.SchroderK.HayashizakiY.CarninciP.HumeD. A.GrimmondS. M. (2008). A rescue strategy for multi mapping short sequence tags refines surveys of transcriptional activity by CAGE. Genomics91, 281288.10.1016/j.ygeno.2007.11.003

  • 55

    FisherR. A. (1918). The correlation between relatives on the supposition of Mendelian inheritance. Philos. Trans. R. Soc. Edinb.52, 399433.

  • 56

    FontanillasP.LandryC. R.WittkoppP. J.RussC.GruberJ. D.NusbaumC.HartlD. L. (2010). Key considerations for measuring allelic expression on a genomic scale using high-throughput sequencing. Mol. Ecol.19(Suppl. 1), 212227.10.1111/j.1365-294X.2010.04472.x

  • 57

    FrohlerS.DieterichC. (2010). ACCUSA – accurate SNP calling on draft genomes. Bioinformatics26, 13641365.10.1093/bioinformatics/btq138

  • 58

    GanX.StegleO.BehrJ.SteffenJ. G.DreweP.HildebrandK. L.LyngsoeR.SchultheissS. J.OsborneE. J.SreedharanV. T.KahlesA.BohnertR.JeanG.DerwentP.KerseyP.BelfieldE. J.HarberdN. P.KemenE.ToomajianC.KoverP. X.ClarkR. M.RatschG.MottR. (2011). Multiple reference genomes and transcriptomes for Arabidopsis thaliana. Nature477, 419423.10.1038/nature10414

  • 59

    GarberM.GrabherrM. G.GuttmanM.TrapnellC. (2011). Computational methods for transcriptome annotation and quantification using RNA-seq. Nat. Methods8, 469477.10.1038/nmeth.1613

  • 60

    GargR.PatelR. K.JhanwarS.PriyaP.BhattacharjeeA.YadavG.BhatiaS.ChattopadhyayD.TyagiA. K.JainM. (2011). Gene discovery and tissue-specific transcriptome analysis in chickpea with massively parallel pyrosequencing and web resource development. Plant Physiol.156, 16611678.10.1104/pp.111.178616

  • 61

    GentlemanR. C.CareyV. J.BatesD. M.BolstadB.DettlingM.DudoitS.EllisB.GautierL.GeY.GentryJ.HornikK.HothornT.HuberW.IacusS.IrizarryR.LeischF.LiC.MaechlerM.RossiniA. J.SawitzkiG.SmithC.SmythG.TierneyL.YangJ. Y.ZhangJ. (2004). Bioconductor: open software development for computational biology and bioinformatics. Genome Biol.5, R80.10.1186/gb-2004-5-10-r80

  • 62

    GeraldesA.PangJ.ThiessenN.CezardT.MooreR.ZhaoY.TamA.WangS.FriedmannM.BirolI.JonesS. J.CronkQ. C.DouglasC. J. (2011). SNP discovery in black cottonwood (Populus trichocarpa) by population transcriptome resequencing. Mol. Ecol. Resour.11(Suppl. 1), 8192.10.1111/j.1755-0998.2010.02960.x

  • 63

    GermanM. A.PillayM.JeongD. H.HetawalA.LuoS.JanardhananP.KannanV.RymarquisL. A.NobutaK.GermanR.De PaoliE.LuC.SchrothG.MeyersB. C.GreenP. J. (2008). Global identification of microRNA-target RNA pairs by parallel analysis of RNA ends. Nat. Biotechnol.26, 941946.10.1038/nbt1417

  • 64

    GlennT. C. (2011). Field guide to next-generation DNA sequencers. Mol. Ecol. Resour.11, 759769.10.1111/j.1755-0998.2011.03024.x

  • 65

    GoecksJ.NekrutenkoA.TaylorJ. (2010). Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol.11, R86.10.1186/gb-2010-11-8-r86

  • 66

    GoreM. A.ChiaJ. M.ElshireR. J.SunQ.ErsozE. S.HurwitzB. L.PeifferJ. A.McmullenM. D.GrillsG. S.Ross-IbarraJ.WareD. H.BucklerE. S. (2009). A first-generation haplotype map of maize. Science326, 11151117.10.1126/science.1177837

  • 67

    GrahamI. A.BesserK.BlumerS.BraniganC. A.CzechowskiT.EliasL.GutermanI.HarveyD.IsaacP. G.KhanA. M.LarsonT. R.LiY.PawsonT.PenfieldT.RaeA. M.RathboneD. A.ReidS.RossJ.SmallwoodM. F.SeguraV.TownsendT.VyasD.WinzerT.BowlesD. (2010). The genetic map of Artemisia annua L. identifies loci affecting yield of the antimalarial drug artemisinin. Science327, 328331.10.1126/science.1182612

  • 68

    GuoS.LiuJ.ZhengY.HuangM.ZhangH.GongG.HeH.RenY.ZhongS.FeiZ.XuY. (2011). Characterization of transcriptome dynamics during watermelon fruit development: sequencing, assembly, annotation and gene expression profiles. BMC Genomics12, 454.10.1186/1471-2164-12-71

  • 69

    GuttmanM.GarberM.LevinJ. Z.DonagheyJ.RobinsonJ.AdiconisX.FanL.KoziolM. J.GnirkeA.NusbaumC.RinnJ. L.LanderE. S.RegevA. (2010). Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs. Nat. Biotechnol.28, 503510.10.1038/nbt0710-756b

  • 70

    HafnerM.LandthalerM.BurgerL.KhorshidM.HausserJ.BerningerP.RothballerA.AscanoM.Jr.JungkampA. C.MunschauerM.UlrichA.WardleG. S.DewellS.ZavolanM.TuschlT. (2010). Transcriptome-wide identification of RNA-binding protein and microRNA target sites by PAR-CLIP. Cell141, 129141.10.1016/j.cell.2010.03.009

  • 71

    HansenK. D.BrennerS. E.DudoitS. (2010). Biases in Illumina transcriptome sequencing caused by random hexamer priming. Nucleic Acids Res.38, e131.10.1093/nar/gkp1195

  • 72

    HansenK. D.ZhijinW.IrizarryR. A.LeekJ. T. (2011). Sequencing technology does not eliminate biological variability. Nat. Biotechnol.29, 575573.10.1038/nbt.1910

  • 73

    HardcastleT. J.KellyK. A. (2010). baySeq: empirical Bayesian methods for identifying differential expression in sequence count data. BMC Bioinformatics11, 422.10.1186/1471-2105-11-422

  • 74

    HashimotoT.De HoonM. J.GrimmondS. M.DaubC. O.HayashizakiY.FaulknerG. J. (2009). Probabilistic resolution of multi-mapping reads in massively parallel sequencing data using MuMRescueLite. Bioinformatics25, 26132614.10.1093/bioinformatics/btp438

  • 75

    HawkinsR. D.HonG. C.RenB. (2010). Next-generation genomics: an integrative approach. Nat. Rev. Genet.11, 476486.

  • 76

    HeY.VogelsteinB.VelculescuV. E.PapadopoulosN.KinzlerK. W. (2008). The antisense transcriptomes of human cells. Science322, 18551857.10.1126/science.1163853

  • 77

    HesselberthJ. R.ChenX.ZhangZ.SaboP. J.SandstromR.ReynoldsA. P.ThurmanR. E.NephS.KuehnM. S.NobleW. S.FieldsS.StamatoyannopoulosJ. A. (2009). Global mapping of protein-DNA interactions in vivo by digital genomic footprinting. Nat. Methods6, 283289.10.1038/nmeth.1313

  • 78

    HiremathP. J.FarmerA.CannonS. B.WoodwardJ.KudapaH.TutejaR.KumarA.BhanuprakashA.MulaosmanovicB.GujariaN.KrishnamurthyL.GaurP. M.KavikishorP. B.ShahT.SrinivasanR.LohseM.XiaoY.TownC. D.CookD. R.MayG. D.VarshneyR. K. (2011). Large-scale transcriptome analysis in chickpea (Cicer arietinum L.), an orphan legume crop of the semi-arid tropics of Asia and Africa. Plant Biotechnol. J.9, 922931.10.1111/j.1467-7652.2011.00625.x

  • 79

    HoekstraH. E.CoyneJ. A. (2007). The locus of evolution: evo devo and the genetics of adaptation. Evolution61, 9951016.10.1111/j.1558-5646.2007.00105.x

  • 80

    HohenloheP. A.AmishS. J.CatchenJ. M.AllendorfF. W.LuikartG. (2011). Next-generation RAD sequencing identifies thousands of SNPs for assessing hybridization between rainbow and west slope cutthroat trout. Mol. Ecol. Resour.11(Suppl. 1), 117122.10.1111/j.1755-0998.2010.02967.x

  • 81

    HohenloheP. A.BasshamS.EtterP. D.StifflerN.JohnsonE. A.CreskoW. A. (2010). Population genomics of parallel adaptation in threespine stickleback using sequenced RAD tags. PLoS Genet.6, e1000862.10.1371/journal.pgen.1000862

  • 82

    HongL. Z.LiJ.Schmidt-KuntzelA.WarrenW. C.BarshG. S. (2011). Digital gene expression for non-model organisms. Genome Res. [Epub ahead of print].10.1101/gr.122135.111

  • 83

    HormozdiariF.AlkanC.EichlerE. E.SahinalpS. C. (2009). Combinatorial algorithms for structural variation detection in high-throughput sequenced genomes. Genome Res.19, 12701278.10.1101/gr.088633.108

  • 84

    HormozdiariF.HajirasoulihaI.DaoP.HachF.YorukogluD.AlkanC.EichlerE. E.SahinalpS. C. (2010). Next-generation VariationHunter: combinatorial algorithms for transposon insertion discovery. Bioinformatics26, i350i357.10.1093/bioinformatics/btq216

  • 85

    HuangW.MarthG. (2008). EagleView: a genome assembly viewer for next-generation sequencing technologies. Genome Res.18, 15381543.10.1101/gr.069674.107

  • 86

    HuangX.FengQ.QianQ.ZhaoQ.WangL.WangA.GuanJ.FanD.WengQ.HuangT.DongG.SangT.HanB. (2009). High-throughput genotyping by whole-genome resequencing. Genome Res.19, 10681076.10.1101/gr.089516.108

  • 87

    HytenD. L.SongQ.FickusE. W.QuigleyC. V.LimJ. S.ChoiI. Y.HwangE. Y.Pastor-CorralesM.CreganP. B. (2010). High-throughput SNP discovery and assay development in common bean. BMC Genomics11, 475.10.1186/1471-2164-11-475

  • 88

    Ibarra-LacletteE.AlbertV. A.Perez-TorresC. A.Zamudio-HernandezF.Ortega-Estrada MdeJ.Herrera-EstrellaA.Herrera-EstrellaL. (2011). Transcriptomics and molecular evolutionary rate analysis of the bladderwort (Utricularia), a carnivorous plant with a minimal genome. BMC Plant Biol.11, 101.10.1186/1471-2229-11-101

  • 89

    IngoliaN. T. (2010). Genome-wide translational profiling by ribosome footprinting. Meth. Enzymol.470, 119142.10.1016/S0076-6879(10)70006-9

  • 90

    IngoliaN. T.GhaemmaghamiS.NewmanJ. R.WeissmanJ. S. (2009). Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science324, 218223.10.1126/science.1168978

  • 91

    JiY.XuY.ZhangQ.TsuiK.-W.YuanY.NorrisC.Jr.LiangS.LiangH. (2011). BM-map: Bayesian mapping of multireads for next-generation sequencing data. Biometrics. [Epub ahead of print].10.1111/j.1541-0420.2011.01605.x

  • 92

    Jimenez-GomezJ. M.WallaceA. D.MaloofJ. N. (2010). Network analysis identifies ELF3 as a QTL for the shade avoidance response in Arabidopsis. PLoS Genet.6, e1001100.10.1371/journal.pgen.1001100

  • 93

    JohnsonD. S.MortazaviA.MyersR. M.WoldB. (2007). Genome-wide mapping of in vivo protein-DNA interactions. Science316, 14971502.10.1126/science.1141319

  • 94

    KatzY.WangE. T.AiroldiE. M.BurgeC. B. (2010). Analysis and design of RNA sequencing experiments for identifying isoform regulation. Nat. Methods7, 10091015.10.1038/nmeth.1528

  • 95

    KaurS.CoganN. O.PembletonL. W.ShinozukaM.SavinK. W.MaterneM.ForsterJ. W. (2011). Transcriptome sequencing of lentil based on second-generation technology permits large-scale unigene assembly and SSR marker discovery. BMC Genomics12, 265.10.1186/1471-2164-12-265

  • 96

    KennyE. M.CormicanP.GilksW. P.GatesA. S.O’dushlaineC. T.PintoC.CorvinA. P.GillM.MorrisD. W. (2011). Multiplex target enrichment using DNA indexing for ultra-high throughput SNP detection. DNA Res.18, 3138.10.1093/dnares/dsq029

  • 97

    KerstensH.CrooijmansR.VeenendaalA.DibbitsB.Chin-a-WoengT.Den DunnenJ.GroenenM. (2009). Large scale single nucleotide polymorphism discovery in unsequenced genomes using second generation high throughput sequencing technology: applied to turkey. BMC Genomics10, 479.10.1186/1471-2164-10-479

  • 98

    KerteszM.WanY.MazorE.RinnJ. L.NutterR. C.ChangH. Y.SegalE. (2010). Genome-wide measurement of RNA secondary structure in yeast. Nature467, 103107.10.1038/nature09322

  • 99

    KeurentjesJ. J. B.FuJ.TerpstraI. R.GarciaJ. M.Van Den AckervekenG.SnoekL. B.PeetersA. J. M.VreugdenhilD.KoornneefM.JansenR. C. (2007). Regulatory network construction in Arabidopsis by using genome-wide gene expression quantitative trait loci. Proc. Natl. Acad. Sci. U.S.A.104, 17081713.10.1073/pnas.0610429104

  • 100

    KirstM.BastenC. J.MyburgA. A.ZengZ. B.SederoffR. R. (2005). Genetic architecture of transcript-level variation in differentiating xylem of a eucalyptus hybrid. Genetics169, 22952303.10.1534/genetics.104.039198

  • 101

    KliebensteinD. J.WestM. A.Van LeeuwenH.LoudetO.DoergeR. W.St ClairD. A. (2006). Identification of QTLs controlling gene expression networks defined a priori. BMC Bioinformatics7, 308.10.1186/1471-2105-7-308

  • 102

    LaiJ.LiR.XuX.JinW.XuM.ZhaoH.XiangZ.SongW.YingK.ZhangM.JiaoY.NiP.ZhangJ.LiD.GuoX.YeK.JianM.WangB.ZhengH.LiangH.ZhangX.WangS.ChenS.LiJ.FuY.SpringerN. M.YangH.WangJ.DaiJ.SchnableP. S. (2010). Genome-wide patterns of genetic variation among elite maize inbred lines. Nat. Genet.42, 10271030.10.1038/ng.684

  • 103

    LalondeE.HaK. C.WangZ.BemmoA.KleinmanC. L.KwanT.PastinenT.MajewskiJ. (2011). RNA sequencing reveals the role of splicing polymorphisms in regulating human gene expression. Genome Res.21, 545554.10.1101/gr.111211.110

  • 104

    LamH. M.XuX.LiuX.ChenW.YangG.WongF. L.LiM. W.HeW.QinN.WangB.LiJ.JianM.WangJ.ShaoG.SunS. S.ZhangG. (2010). Resequencing of 31 wild and cultivated soybean genomes identifies patterns of genetic diversity and selection. Nat. Genet.42, 10531059.10.1038/ng.715

  • 105

    LevinJ. Z.YassourM.AdiconisX.NusbaumC.ThompsonD. A.FriedmanN.GnirkeA.RegevA. (2010). Comprehensive comparative analysis of strand-specific RNA sequencing methods. Nat. Methods7, 709715.10.1038/nmeth.1491

  • 106

    LiB.RuottiV.StewartR. M.ThomsonJ. A.DeweyC. N. (2010a). RNA-seq gene expression estimation with read mapping uncertainty. Bioinformatics26, 493500.10.1093/bioinformatics/btq222

  • 107

    LiJ.JiangH.WongW. H. (2010b). Modeling non-uniformity in short-read rates in RNA-seq data. Genome Biol.11, R50.10.1186/gb-2010-11-2-r22

  • 108

    LiY.BreitlingR.SnoekL. B.Van Der VeldeK. J.SwertzM. A.RiksenJ.JansenR. C.KammengaJ. E. (2010c). Global genetic robustness of the alternative splicing machinery in Caenorhabditis elegans. Genetics186, 405410.10.1534/genetics.110.119677

  • 109

    LiH.HandsakerB.WysokerA.FennellT.RuanJ.HomerN.MarthG.AbecasisG.DurbinR. (2009a). The sequence alignment/map format and SAM tools. Bioinformatics25, 20782079.10.1093/bioinformatics/btp100

  • 110

    LiR.LiY.FangX.YangH.WangJ.KristiansenK. (2009b). SNP detection for massively parallel whole-genome resequencing. Genome Res.19, 11241132.10.1101/gr.092213.109

  • 111

    LiY.SidoreC.KangH. M.BoehnkeM.AbecasisG. R. (2011). Low-coverage sequencing: implications for design of complex trait association studies. Genome Res.21, 940951.10.1101/gr.117259.110

  • 112

    LicatalosiD. D.MeleA.FakJ. J.UleJ.KayikciM.ChiS. W.ClarkT. A.SchweitzerA. C.BlumeJ. E.WangX.DarnellJ. C.DarnellR. B. (2008). HITS-CLIP yields genome-wide insights into brain alternative RNA processing. Nature456, 464469.10.1038/nature07488

  • 113

    LinY.LiJ.ShenH.ZhangL.PapasianC. J.DengH. W. (2011). Comparative studies of de novo assembly tools for next-generation sequencing technologies. Bioinformatics27, 20312037.10.1093/bioinformatics/btr319

  • 114

    LisecJ.MeyerR. C.SteinfathM.RedestigH.BecherM.Witucka-WallH.FiehnO.TorjekO.SelbigJ.AltmannT.WillmitzerL. (2008). Identification of metabolic and biomass QTL in Arabidopsis thaliana in a parallel analysis of RIL and IL populations. Plant J.53, 960972.10.1111/j.1365-313X.2007.03383.x

  • 115

    ListerR.O’malleyR. C.Tonti-FilippiniJ.GregoryB. D.BerryC. C.MillarA. H.EckerJ. R. (2008). Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell133, 523536.10.1016/j.cell.2008.03.029

  • 116

    LiuS.LinL.JiangP.WangD.XingY. (2010). A comparison of RNA-seq and high-density exon array for detecting differential gene expression between closely related species. Nucleic Acids Res.39, 578588.10.1093/nar/gkq817

  • 117

    LouS. K.NiB.LoL. Y.TsuiS. K.ChanT. F.LeungK. S. (2011). ABMapper: a suffix array-based tool for multi-location searching and splice-junction mapping. Bioinformatics27, 421422.10.1093/bioinformatics/btq656

  • 118

    LuC.TejS. S.LuoS.HaudenschildC. D.MeyersB. C.GreenP. J. (2005). Elucidation of the small RNA component of the transcriptome. Science309, 15671569.10.1126/science.1113435

  • 119

    MamanovaL.CoffeyA. J.ScottC. E.KozarewaI.TurnerE. H.KumarA.HowardE.ShendureJ.TurnerD. J. (2010). Target-enrichment strategies for next-generation sequencing. Nat. Methods7, 111118.10.1038/nmeth0610-479c

  • 120

    MargueratS.BahlerJ. (2010). RNA-seq: from technology to biology. Cell. Mol. Life Sci.67, 569579.10.1007/s00018-009-0180-6

  • 121

    MarioniJ. C.MasonC. E.ManeS. M.StephensM.GiladY. (2008). RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res.18, 15091517.10.1101/gr.079558.108

  • 122

    MartinJ. A.WangZ. (2011). Next-generation transcriptome assembly. Nat. Rev. Genet.12, 671682.10.1038/nrg3068

  • 123

    MatsumuraH.YoshidaK.LuoS.KimuraE.FujibeT.AlbertynZ.BarreroR. A.KrugerD. H.KahlG.SchrothG. P.TerauchiR. (2010). High-throughput SuperSAGE for digital gene expression analysis of multiple samples using next generation sequencing. PLoS ONE5, e12010.10.1371/journal.pone.0012010

  • 124

    McintyreL. M.LopianoK. K.MorseA. M.AminV.ObergA. L.YoungL. J.NuzhdinS. V. (2011). RNA-seq: technical variability and sampling. BMC Genomics12, 293.10.1186/1471-2164-12-293

  • 125

    McmanusC. J.CoolonJ. D.DuffM. O.Eipper-MainsJ.GraveleyB. R.WittkoppP. J. (2010). Regulatory divergence in Drosophila revealed by mRNA-seq. Genome Res.20, 816825.

  • 126

    MeissnerA.MikkelsenT. S.GuH.WernigM.HannaJ.SivachenkoA.ZhangX.BernsteinB. E.NusbaumC.JaffeD. B.GnirkeA.JaenischR.LanderE. S. (2008). Genome-scale DNA methylation maps of pluripotent and differentiated cells. Nature454, 766770.

  • 127

    MetzkerM. L. (2010). Sequencing technologies – the next generation. Nat. Rev. Genet.11, 3146.10.1038/nrg2626

  • 128

    MikkelsenT. S.KuM.JaffeD. B.IssacB.LiebermanE.GiannoukosG.AlvarezP.BrockmanW.KimT. K.KocheR. P.LeeW.MendenhallE.O’donovanA.PresserA.RussC.XieX.MeissnerA.WernigM.JaenischR.NusbaumC.LanderE. S.BernsteinB. E. (2007). Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature448, 553560.10.1038/nature06008

  • 129

    MilneI.BayerM.CardleL.ShawP.StephenG.WrightF.MarshallD. (2010). Tablet – next generation sequence assembly visualization. Bioinformatics26, 401402.10.1093/bioinformatics/btp666

  • 130

    MizrachiE.HeferC. A.RanikM.JoubertF.MyburgA. A. (2010). De novo assembled expressed gene catalog of a fast-growing Eucalyptus tree produced by Illumina mRNA-seq. BMC Genomics11, 681.10.1186/1471-2164-11-681

  • 131

    MontesJ. M.MelchingerA. E.ReifJ. C. (2007). Novel throughput phenotyping platforms in plant genetic studies. Trends Plant Sci.12, 433436.10.1016/j.tplants.2007.08.006

  • 132

    MontgomeryS. B.SammethM.Gutierrez-ArcelusM.LachR. P.IngleC.NisbettJ.GuigoR.DermitzakisE. T. (2010). Transcriptome genetics using second generation sequencing in a Caucasian population. Nature464, 773777.10.1038/nature08903

  • 133

    MorganM.PagèsH. (2010). Rsamtools: Import Aligned BAM File Format Sequences Into R/Bioconductor. Available at: http://bioconductor.org/packages/release/bioc/html/Rsamtools.html

  • 134

    MortazaviA.WilliamsB. A.MccueK.SchaefferL.WoldB. (2008). Mapping and quantifying mammalian transcriptomes by RNA-seq. Nat. Methods5, 621628.10.1038/nmeth.1226

  • 135

    MunroeD. J.HarrisT. J. (2010). Third-generation sequencing fireworks at Marco Island. Nat. Biotechnol.28, 426428.10.1038/nbt0510-426

  • 136

    NessR. W.SiolM.BarrettS. C. (2011). De novo sequence assembly and characterization of the floral transcriptome in cross- and self-fertilizing plants. BMC Genomics12, 298.10.1186/1471-2164-12-298

  • 137

    NicolaeM.MangulS.MandoiuI. I.ZelikovskyA. (2011). Estimation of alternative splicing isoform frequencies from RNA-seq data. Algorithms Mol. Biol.6, 9.10.1186/1748-7188-6-9

  • 138

    NiedringhausT. P.MilanovaD.KerbyM. B.SnyderM. P.BarronA. E. (2011). Landscape of next-generation sequencing technologies. Anal. Chem.83, 43274341.10.1021/ac2010857

  • 139

    NielsenR.PaulJ. S.AlbrechtsenA.SongY. S. (2011). Genotype and SNP calling from next-generation sequencing data. Nat. Rev. Genet.12, 443451.10.1038/nrg2986

  • 140

    NijmanI. J.MokryM.Van BoxtelR.ToonenP.De BruijnE.CuppenE. (2010). Mutation discovery by targeted genomic enrichment of multiplexed barcoded samples. Nat. Methods7, 913915.10.1038/nmeth.1516

  • 141

    OshlackA.WakefieldM. J. (2009). Transcript length bias in RNA-seq data confounds systems biology. Biol. Direct4, 14.10.1186/1745-6150-4-14

  • 142

    OssowskiS.SchneebergerK.ClarkR. M.LanzC.WarthmannN.WeigelD. (2008). Sequencing of natural strains of Arabidopsis thaliana with short reads. Genome Res.18, 20242033.10.1101/gr.080200.108

  • 143

    ParchmanT. L.GeistK. S.GrahnenJ. A.BenkmanC. W.BuerkleC. A. (2010). Transcriptome sequencing in an ecologically important tree species: assembly, annotation, and marker discovery. BMC Genomics11, 180.10.1186/1471-2164-11-180

  • 144

    ParkhomchukD.BorodinaT.AmstislavskiyV.BanaruM.HallenL.KrobitschS.LehrachH.SoldatovA. (2009). Transcriptome analysis by strand-specific sequencing of complementary DNA. Nucleic Acids Res.37, e123.10.1093/nar/gkp596

  • 145

    PickrellJ. K.MarioniJ. C.PaiA. A.DegnerJ. F.EngelhardtB. E.NkadoriE.VeyrierasJ. B.StephensM.GiladY.PritchardJ. K. (2010). Understanding mechanisms underlying human gene expression variation with RNA sequencing. Nature464, 768772.10.1038/nature08872

  • 146

    PlessyC.BertinN.TakahashiH.SimoneR.SalimullahM.LassmannT.VitezicM.SeverinJ.OlivariusS.LazarevicD.HornigN.OrlandoV.BellI.GaoH.DumaisJ.KapranovP.WangH.DavisC. A.GingerasT. R.KawaiJ.DaubC. O.HayashizakiY.GustincichS.CarninciP. (2010). Linking promoters to functional transcripts in small samples with nanoCAGE and CAGEscan. Nat. Methods7, 528534.10.1038/nmeth.1470

  • 147

    PotokinaE.DrukaA.LuoZ.WiseR.WaughR.KearseyM. (2008). Gene expression quantitative trait locus analysis of 16 000 barley genes reveals a complex pattern of genome-wide transcriptional regulation. Plant J.53, 90101.10.1111/j.1365-313X.2007.03315.x

  • 148

    QuinceC.LanzenA.DavenportR. J.TurnbaughP. J. (2011). Removing noise from pyrosequenced amplicons. BMC Bioinformatics12, 38.10.1186/1471-2105-12-38

  • 149

    QuinlanA. R.HallI. M. (2010). BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics26, 841842.10.1093/bioinformatics/btq033

  • 150

    R Development Core Team. (2009). R: A Language and Environment for Statistical Computing. Vienna, Austria.

  • 151

    RatanA.ZhangY.HayesV.SchusterS.MillerW. (2010). Calling SNPs without a reference sequence. BMC Bioinformatics11, 130.10.1186/1471-2105-11-130

  • 152

    RazT.KapranovP.LipsonD.LetovskyS.MilosP. M.ThompsonJ. F. (2011). Protocol dependence of sequencing-based gene expression measurements. PLoS ONE6, e19287.10.1371/journal.pone.0019287

  • 153

    RobertsonG.HirstM.BainbridgeM.BilenkyM.ZhaoY.ZengT.EuskirchenG.BernierB.VarholR.DelaneyA.ThiessenN.GriffithO. L.HeA.MarraM.SnyderM.JonesS. (2007). Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing. Nat. Methods4, 651657.10.1038/nmeth1068

  • 154

    RobinsonJ. T.ThorvaldsdottirH.WincklerW.GuttmanM.LanderE. S.GetzG.MesirovJ. P. (2011). Integrative genomics viewer. Nat. Biotechnol.29, 2426.10.1038/nbt.1888

  • 155

    RobinsonM. D.MccarthyD. J.SmythG. K. (2010). edgeR: a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics26, 139140.10.1093/bioinformatics/btp616

  • 156

    RuffaloM.LaframboiseT.KoyuturkM. (2011). Comparative analysis of algorithms for next-generation sequencing read alignment. Bioinformatics27, 27902796.10.1093/bioinformatics/btr477

  • 157

    SanchezC. C.SmithT. P.WiedmannR. T.VallejoR. L.SalemM.YaoJ.RexroadC. E.III. (2009). Single nucleotide polymorphism discovery in rainbow trout by deep sequencing of a reduced representation library. BMC Genomics10, 559.10.1186/1471-2164-10-559

  • 158

    SchlottererC. (2004). The evolution of molecular markers – just a matter of fashion?Nat. Rev. Genet.5, 6369.10.1038/nrg1249

  • 159

    SchmiederR.EdwardsR. (2011). Quality control and preprocessing of metagenomic datasets. Bioinformatics27, 863864.10.1093/bioinformatics/btr026

  • 160

    SchmiederR.LimY. W.RohwerF.EdwardsR. (2010). TagCleaner: identification and removal of tag sequences from genomic and metagenomic datasets. BMC Bioinformatics11, 341.10.1186/1471-2105-11-341

  • 161

    SchneebergerK.OssowskiS.LanzC.JuulT.PetersenA. H.NielsenK. L.JorgensenJ. E.WeigelD.AndersenS. U. (2009). SHOREmap: simultaneous mapping and mutation identification by deep sequencing. Nat. Methods6, 550551.10.1038/nmeth0809-550

  • 162

    SchneebergerK.WeigelD. (2011). Fast-forward genetics enabled by new sequencing technologies. Trends Plant Sci.16, 282288.10.1016/j.tplants.2011.02.006

  • 163

    SchwartzS.OrenR.AstG. (2011). Detection and removal of biases in the analysis of next-generation sequencing reads. PLoS ONE6, e16685.10.1371/journal.pone.0016685

  • 164

    SkellyD. A.JohanssonM.MadeoyJ.WakefieldJ.AkeyJ. M. (2011). A powerful and flexible statistical framework for testing hypotheses of allele-specific gene expression from RNA-seq data. Genome Res. [Epub ahead of print].10.1101/gr.119784.110

  • 165

    SmithD. R.QuinlanA. R.PeckhamH. E.MakowskyK.TaoW.WoolfB.ShenL.DonahueW. F.TusneemN.StrombergM. P.StewartD. A.ZhangL.RanadeS. S.WarnerJ. B.LeeC. C.ColemanB. E.ZhangZ.MclaughlinS. F.MalekJ. A.SorensonJ. M.BlanchardA. P.ChapmanJ.HillmanD.ChenF.RokhsarD. S.MckernanK. J.JeffriesT. W.MarthG. T.RichardsonP. M. (2008). Rapid whole-genome mutational profiling using next-generation sequencing technologies. Genome Res.18, 16381642.10.1101/gr.077776.108

  • 166

    SouaiaiaT.FrazierZ.ChenT. (2011). ComB: SNP calling and mapping analysis for color and nucleotide space platforms. J. Comput. Biol.18, 795807.10.1089/cmb.2011.0027

  • 167

    StajichJ. E.BlockD.BoulezK.BrennerS. E.ChervitzS. A.DagdigianC.FuellenG.GilbertJ. G.KorfI.LappH.LehvaslaihoH.MatsallaC.MungallC. J.OsborneB. I.PocockM. R.SchattnerP.SengerM.SteinL. D.StupkaE.WilkinsonM. D.BirneyE. (2002). The Bioperl toolkit: Perl modules for the life sciences. Genome Res.12, 16111618.10.1101/gr.361602

  • 168

    SteinL. D. (2010). The case for cloud computing in genome informatics. Genome Biol.11, 207.10.1186/gb-2010-11-5-207

  • 169

    SternD. L.OrgogozoV. (2008). The loci of evolution: how predictable is genetic evolution?Evolution62, 21552177.10.1111/j.1558-5646.2008.00450.x

  • 170

    SuC. L.ChaoY. T.Alex ChangY. C.ChenW. C.ChenC. Y.LeeA. Y.HwaK. T.ShihM. C. (2011). De novo assembly of expressed transcripts and global analysis of the Phalaenopsis aphrodite transcriptome. Plant Cell Physiol.52, 15011514.10.1093/pcp/pcr097

  • 171

    TarazonaS.Garcia-AlcaldeF.DopazoJ.FerrerA.ConesaA. (2011). Differential expression in RNA-seq: a matter of depth. Genome Res. [Epub ahead of print].10.1101/gr.124321.111

  • 172

    ToungJ. M.MorleyM.LiM.CheungV. G. (2011). RNA-sequence analysis of human B-cells. Genome Res.21, 991998.10.1101/gr.116335.110

  • 173

    TrapnellC.PachterL.SalzbergS. L. (2009). TopHat: discovering splice junctions with RNA-seq. Bioinformatics25, 11051111.10.1093/bioinformatics/btp120

  • 174

    TrapnellC.WilliamsB. A.PerteaG.MortazaviA.KwanG.Van BarenM. J.SalzbergS. L.WoldB. J.PachterL. (2010). Transcript assembly and quantification by RNA-seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat. Biotechnol.28, 511515.10.1038/nbt.1621

  • 175

    TrickM.LongY.MengJ.BancroftI. (2009). Single nucleotide polymorphism (SNP) discovery in the polyploid Brassica napus using Solexa transcriptome sequencing. Plant Biotechnol. J.7, 334346.10.1111/j.1467-7652.2008.00396.x

  • 176

    UnderwoodJ. G.UzilovA. V.KatzmanS.OnoderaC. S.MainzerJ. E.MathewsD. H.LoweT. M.SalamaS. R.HausslerD. (2010). FragSeq: transcriptome-wide RNA structure probing using high-throughput sequencing. Nat. Methods7, 9951001.10.1038/nmeth.1529

  • 177

    Van TassellC. P.SmithT. P.MatukumalliL. K.TaylorJ. F.SchnabelR. D.LawleyC. T.HaudenschildC. D.MooreS. S.WarrenW. C.SonstegardT. S. (2008). SNP discovery and allele frequency estimation by deep sequencing of reduced representation libraries. Nat. Methods5, 247252.10.1038/nmeth.1185

  • 178

    VuylstekeM.DaeleH.VercauterenA.ZabeauM.KuiperM. (2006). Genetic dissection of transcriptional regulation by cDNA-AFLP. Plant J.45, 439446.10.1111/j.1365-313X.2005.02630.x

  • 179

    VuylstekeM.Van EeuwijkF.Van HummelenP.KuiperM.ZabeauM. (2005). Genetic analysis of variation in gene expression in Arabidopsis thaliana. Genetics171, 12671275.10.1534/genetics.105.041509

  • 180

    WangJ.HudaA.LunyakV. V.JordanI. K. (2010a). A Gibbs sampling strategy applied to the mapping of ambiguous short-sequence tags. Bioinformatics26, 25012508.10.1093/bioinformatics/btq241

  • 181

    WangK.SinghD.ZengZ.ColemanS. J.HuangY.SavichG. L.HeX.MieczkowskiP.GrimmS. A.PerouC. M.MacleodJ. N.ChiangD. Y.PrinsJ. F.LiuJ. (2010b). MapSplice: accurate mapping of RNA-seq reads for splice junction discovery. Nucleic Acids Res.38, e178.10.1093/nar/gkq069

  • 182

    WangL.FengZ.WangX.ZhangX. (2010c). DEGseq: an R package for identifying differentially expressed genes from RNA-seq data. Bioinformatics26, 136138.10.1093/bioinformatics/btq241

  • 183

    WangX.WuZ.ZhangX. (2010d). Isoform abundance inference provides a more accurate estimation of gene expression levels in RNA-seq. J. Bioinform. Comput. Biol.8(Suppl. 1), 177192.10.1142/S0219720010004999

  • 184

    WangZ.FangB.ChenJ.ZhangX.LuoZ.HuangL.ChenX.LiY. (2010e). De novo assembly and characterization of root transcriptome using Illumina paired-end sequencing and development of cSSR markers in sweet potato (Ipomoea batatas). BMC Genomics11, 726.10.1186/1471-2164-11-726

  • 185

    WangZ.GersteinM.SnyderM. (2009). RNA-seq: a revolutionary tool for transcriptomics. Nat. Rev. Genet.10, 5763.10.1038/nrm2594

  • 186

    WeiW.QiX.WangL.ZhangY.HuaW.LiD.LvH.ZhangX. (2011). Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers. BMC Genomics12, 451.10.1186/1471-2164-12-451

  • 187

    WestM. A. L.KimK.KliebensteinD. J.Van LeeuwenH.MichelmoreR. W.DoergeR. W.St ClairD. A. (2007). Global eQTL mapping reveals the complex genetic architecture of transcript-level variation in Arabidopsis. Genetics175, 14411450.10.1534/genetics.106.064972

  • 188

    WillingE. M.HoffmannM.KleinJ. D.WeigelD.DreyerC. (2011). Paired-end RAD-seq for de novo assembly and marker design without available reference. Bioinformatics27, 21872193.10.1093/bioinformatics/btr346

  • 189

    WittkoppP. J.HaerumB. K.ClarkA. G. (2004). Evolutionary changes in cis and trans gene regulation. Nature430, 8588.10.1038/nature02698

  • 190

    WrayG. A. (2007). The evolutionary significance of cis-regulatory mutations. Nat. Rev. Genet.8, 206216.10.1038/nrg2063

  • 191

    WuX.RenC.JoshiT.VuongT.XuD.NguyenH. T. (2010). SNP discovery by high-throughput sequencing in soybean. BMC Genomics11, 469.10.1186/1471-2164-11-469

  • 192

    XieW.FengQ.YuH.HuangX.ZhaoQ.XingY.YuS.HanB.ZhangQ. (2010). Parent-independent genotyping for constructing an ultrahigh-density linkage map based on population sequencing. Proc. Natl. Acad. Sci. U.S.A.107, 1057810583.10.1073/pnas.0912315107

  • 193

    YouF. M.HuoN.DealK. R.GuY. Q.LuoM. C.McguireP. E.DvorakJ.AndersonO. D. (2011). Annotation-based genome-wide SNP discovery in the large and complex Aegilops tauschii genome using next-generation sequencing without a reference genome sequence. BMC Genomics12, 59.10.1186/1471-2164-12-59

  • 194

    YoungM. D.WakefieldM. J.SmythG. K.OshlackA. (2010). Gene ontology analysis for RNA-seq: accounting for selection bias. Genome Biol.11, R14.10.1186/gb-2010-11-s1-i14

  • 195

    ZhangG.GuoG.HuX.ZhangY.LiQ.LiR.ZhuangR.LuZ.HeZ.FangX.ChenL.TianW.TaoY.KristiansenK.ZhangX.LiS.YangH.WangJ. (2010). Deep RNA sequencing at single base-pair resolution reveals high complexity of the rice transcriptome. Genome Res.20, 646654.10.1101/gr.107334.110

  • 196

    ZhengW.ChungL. M.ZhaoH. (2011). Bias detection and correction in RNA-sequencing data. BMC Bioinformatics12, 290.10.1186/1471-2105-12-290

Summary

Keywords

QTL analysis, plant genetics, next generation sequencing, genomics, eQTL analysis, RNA-seq

Citation

Jiménez-Gómez JM (2011) Next Generation Quantitative Genetics in Plants. Front. Plant Sci. 2:77. doi: 10.3389/fpls.2011.00077

Received

29 April 2011

Accepted

23 October 2011

Published

15 November 2011

Volume

2 - 2011

Edited by

Alisdair Fernie, Max Planck Institute for Plant Physiology, Germany

Reviewed by

Alisdair Fernie, Max Planck Institute for Plant Physiology, Germany; Mathilde Causse, National Institute of Agricultural Research, France

Copyright

*Correspondence: José M. Jiménez-Gómez, Department of Plant Breeding and Genetics, Max Planck Institute for Plant Breeding Research, Carl-von-Linné-Weg 10, 50829 Köln, Germany. e-mail:

This article was submitted to Frontiers in Plant Physiology, a specialty of Frontiers in Plant Science.

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics