<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Plant Sci.</journal-id>
<journal-title>Frontiers in Plant Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Plant Sci.</abbrev-journal-title>
<issn pub-type="epub">1664-462X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpls.2014.00340</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Plant Science</subject>
<subj-group>
<subject>Original Research Article</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Sequencing and <italic>de novo</italic> assembly of a <italic>Dahlia</italic> hybrid cultivar transcriptome</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Lehnert</surname> <given-names>Erik M.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="author-notes" rid="fn001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/162161"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Walbot</surname> <given-names>Virginia</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/12391"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Genetics, Stanford University School of Medicine</institution> <country>Stanford, CA, USA</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Medical Microbiology and Immunology, University of Wisconsin School of Medicine</institution> <country>Madison, WI, USA</country></aff>
<aff id="aff3"><sup>3</sup><institution>Department of Biology, Stanford University</institution> <country>Stanford, CA, USA</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Gane Ka-Shu Wong, University of Alberta, Canada</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Xun Xu, BGI-Shenzhen, China; Jingfa Xiao, Chinese Academy of Sciences, China</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Erik M. Lehnert, Department of Medical Microbiology and Immunology, University of Wisconsin School of Medicine, 1550 Linden Dr., Madison, WI 53706, USA e-mail: <email>lehnert&#x00040;wisc.edu</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Plant Genetics and Genomics, a section of the journal Frontiers in Plant Science.</p></fn>
</author-notes>
<pub-date pub-type="epreprint">
<day>06</day>
<month>06</month>
<year>2014</year>
</pub-date>
<pub-date pub-type="epub">
<day>17</day>
<month>07</month>
<year>2014</year>
</pub-date>
<pub-date pub-type="collection">
<year>2014</year>
</pub-date>
<volume>5</volume>
<elocation-id>340</elocation-id>
<history>
<date date-type="received">
<day>22</day>
<month>05</month>
<year>2014</year>
</date>
<date date-type="accepted">
<day>25</day>
<month>06</month>
<year>2014</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2014 Lehnert and Walbot.</copyright-statement>
<copyright-year>2014</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract><p><italic>Dahlia variabilis</italic>, with an exceptionally high diversity of floral forms and colors, is a popular flower amongst both commercial growers and hobbyists. Recently, some genetic controls of pigment patterns have been elucidated. These studies have been limited, however, by the lack of comprehensive transcriptomic resources for this species. Here we report the sequencing, assembly, and annotation of the transcriptome of the developing leaves, stems, and floral buds of <italic>D. variabilis</italic>. This resulted in 35,638 contigs, most of which seem to contain the complete coding sequence, and of which 20,881 could be successfully annotated by similarity to UniProt. Furthermore, we conducted a preliminary investigation to identify contigs with expression patterns consistent with tissue-specificity. These results will accelerate research into the genetic controls of pigmentation and floral form of <italic>D. variabilis</italic>.</p></abstract>
<kwd-group>
<kwd><italic>Dahlia</italic></kwd>
<kwd>next-generation sequencing</kwd>
<kwd><italic>de novo</italic> transcriptome assembly</kwd>
<kwd>anthocyanin biosynthesis</kwd>
<kwd>floral homeotic factors</kwd>
</kwd-group>
<counts>
<fig-count count="0"/>
<table-count count="6"/>
<equation-count count="0"/>
<ref-count count="25"/>
<page-count count="5"/>
<word-count count="3770"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="introduction" id="s1">
<title>Introduction</title>
<p>Horticultural dahlias (<italic>Dahlia variabilis</italic>) are among the most diverse in floral form and colors of all popular garden flowers. Dahlias are also an important contributor to the more than US &#x00024;100 Billion worldwide flower and potted plant market. Taxonomically dahlias are Compositae, and the center of species diversity is in Mexico and Central America. Likely through cross-breeding of two wild species followed by human selection since 1800 (Gatt et al., <xref ref-type="bibr" rid="B7">1998</xref>), thousands of varieties have been generated by European horticulturists who prized novelty in overall flower size, color, petal number, and petal form.</p>
<p>Garden dahlias are proposed to be recent octoploid derivatives (2N &#x0003D; 64) (Gatt et al., <xref ref-type="bibr" rid="B7">1998</xref>) from crosses between two natural species. Although this high ploidy may have been expected to result in redundancy of genetic factors, loci implicated in flower color have been elucidated and generally behave as diploid factors (Bate-Smith et al., <xref ref-type="bibr" rid="B4">1955</xref>). More recently, genes that are highly expressed in flowers and encoding anthocyanin pathway enzymes have been cloned (Suzuki et al., <xref ref-type="bibr" rid="B21">2002</xref>; Ohno et al., <xref ref-type="bibr" rid="B18">2011b</xref>, <xref ref-type="bibr" rid="B16">2013</xref>) and some enzymes characterized (Yamaguchi et al., <xref ref-type="bibr" rid="B24">1999</xref>; Ogata et al., <xref ref-type="bibr" rid="B15">2001</xref>). Furthermore, a partial reference transcriptome assembly was performed for a comparative genomics study of Compositae, but the majority of contigs did not contain the complete coding sequence (Hodgins et al., <xref ref-type="bibr" rid="B8">2014</xref>).</p>
<p>Given the relatively low cost of deep transcriptome analysis using next-generation DNA sequencing, we wished to determine if reasonable descriptions of messenger RNA diversity could be obtained for leaves, stems, and floral buds of dahlia. The contigs assembled from the transcriptome data are sufficient to propose more than 20,000 likely protein coding genes in this species and can thus serve as a standard for allele comparison among dahlia varieties and for future studies of transposon-mediated variegation in anthocyanin pigmentation and the control of floral form. A cultivar containing anthocyanin in the stems and leaves and flavonoid precursors and anthocyanin in the petals was selected to assess recovery of known pigment factor transcripts as a test of the transcriptome completeness.</p>
</sec>
<sec sec-type="materials and methods" id="s2">
<title>Materials and methods</title>
<sec>
<title>Plant growth and RNA isolation</title>
<p>The cultivar &#x0201C;Rio Riata,&#x0201D; originally obtained from Corralitos Gardens (<ext-link ext-link-type="uri" xlink:href="http://www.cgdahlias.com/">http://www.cgdahlias.com/</ext-link>), was grown outdoors at Stanford University; this variety has 8 orange-red petals with yellow-tips, purple (anthocyanin pigment) stems, and green leaves tinged with purple (<ext-link ext-link-type="uri" xlink:href="http://www.stanford.edu/group/dahlia_genetics/cultivars/rio_riata/rio_riata.htm">http://www.stanford.edu/group/dahlia_genetics/cultivars/rio_riata/rio_riata.htm</ext-link>). Total RNA was extracted from newly emerged stems, leaves, and buds (&#x0007E;0.2&#x02013;1 cm in diameter) using the RNAqueous-4PCR Kit (Ambion AM1914, Grand Island NY) following the manufacturer&#x00027;s instructions. The RNA-integrity number (RIN) of each sample was determined using an Agilent 2100 Bioanalyzer, Santa Clara CA), and only samples with a RIN &#x02265; 8 were used. Approximately 1 &#x003BC;g of total RNA was processed (including a poly-A<sup>&#x0002B;</sup>-selection step) from the leaf, stem, and bud extracts using the TruSeq RNA Sample Prep Kit (Illumina FC-122-1001, Hayward CA) following the manufacturer&#x00027;s instructions to produce indexed libraries. The resulting libraries were pooled based on their indices (as described in the kit instructions). Clustering and sequencing were performed by the Stanford Center for Genomics and Personalized Medicine using an Illumina HiSeq 2000 sequencer to generate 101-bp paired-end reads. Accession numbers for reads from bud, leaf, and stem are: SRR1222985, SRR1226613, SRR1226614, respectively.</p>
</sec>
<sec>
<title>Read filtering, transcriptome assembly, and annotation</title>
<p>Transcriptome assembly was performed combining reads from all three tissues. Prior to assembly, the reads were processed as previously described (Lehnert et al., <xref ref-type="bibr" rid="B9">2014</xref>) and outlined here: (1) the first 6 bp from the 5&#x02032; end of each read was discarded; (2) reads of &#x0003C;60 bp or containing &#x02265;1 N were discarded; (3) low quality reads were discarded (if &#x0003E;10 of the first 35 bases had quality scores &#x0003C;30); and (4) reads were trimmed to the first position for which a sliding 4-bp window had an average quality-score of &#x0003C;20. The remaining read-pairs were then processed using FLASH (Magoc and Salzberg, <xref ref-type="bibr" rid="B12">2011</xref>) to join reads whose ends overlapped by &#x02265;10 bp with no mismatches. Finally, adapter sequences were removed using cutadapt (Martin, <xref ref-type="bibr" rid="B13">2011</xref>) with default settings.</p>
<p>The processed reads were assembled using a 43-bp <italic>k</italic>-mer with the Velvet/Oases assembler (Velvet version 1.2.09 and Oases version 0.2.08) (Zerbino and Birney, <xref ref-type="bibr" rid="B26">2008</xref>; Schulz et al., <xref ref-type="bibr" rid="B20">2012</xref>) with the default settings. To choose a single representative contig from each &#x0201C;locus&#x0201D; (the Oases term for a connected component in the de Bruijn graph, which presumably consists of alternative transcripts, alleles, and extremely similar paralogs), the script &#x0201C;process_oases_transcripts.py&#x0201D; was used (Yang and Smith, <xref ref-type="bibr" rid="B25">2013</xref>). This script designates as representative the contig with the highest geometric mean <italic>k</italic>-mer coverage that is also at least 30% as long as the longest contig within the locus. Only contigs longer than 300 bp were retained in the resulting set of representative contigs. Contigs were post-processed to remove terminal Ns, and such that internal runs of Ns were fewer than 14 bases in length.</p>
<p>To assign putative functional roles to the representative contigs, they were aligned to the SwissProt protein database and the NCBI Non-Redundant Protein Database (nr) using the blastx program from the standalone BLAST 2.2.25&#x0002B;software suite (Camacho et al., <xref ref-type="bibr" rid="B5">2009</xref>) with an <italic>E</italic>-value cutoff of 1e-5. The Blast2GO software package (Conesa et al., <xref ref-type="bibr" rid="B6">2005</xref>) was used with default settings to assign Gene Ontology (GO) terms and Enzyme codes to the predicted proteins based on their alignments to SwissProt. Blast2GO was also used to identify protein domains by InterProScan (Quevillon et al., <xref ref-type="bibr" rid="B19">2005</xref>), whose associated GO terms were merged with those identified by alignment to SwissProt.</p>
</sec>
<sec>
<title>Expression analysis</title>
<p>The trimmed forward reads were aligned to the representative contigs using bwa (Li and Durbin, <xref ref-type="bibr" rid="B10">2009</xref>). Aligned reads were counted using the samtools (Li et al., <xref ref-type="bibr" rid="B11">2009</xref>; minimum mapping quality 30). The R package DESeq2 (Anders and Huber, <xref ref-type="bibr" rid="B2">2010</xref>) was used to normalize read counts by library size.</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec>
<title>Sequencing and <italic>de novo</italic> transcriptome assembly</title>
<p>A set of 149,304,876 reads of 101-bp was generated in the initial sequencing run. After trimming and discarding low-quality reads, &#x0007E;73 million reads from buds, leaves, and stems (see Table <xref ref-type="table" rid="T1">1</xref>) were assembled into 53,037 loci containing 122,053 contigs. After removing contigs that fell below the 300-bp length cut-off, choosing representative contigs for each locus, and removing unnecessary Ns (see Materials and Methods), 35,638 representative contigs remained. These ranged in size from 269 bp to 13,886 (see Table <xref ref-type="table" rid="T2">2</xref>). These contigs have been deposited in the NCBI TSA (accession &#x00023; GBDN01000001-GBDN01035638) and can also be found at (<ext-link ext-link-type="uri" xlink:href="http://www.stanford.edu/group/dahlia_genetics/">http://www.stanford.edu/group/dahlia_genetics/</ext-link>). 20,881 (58.5%) of contigs could be annotated by BLAST alignment to the UniProt protein database. Using these alignments and the results of InterProScan protein motif searches, 122,654 GO terms were assigned to 21,576 (60.5%) of the contigs (see Table <xref ref-type="table" rid="T3">3</xref>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>Summary of read metrics by tissue of origin</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th/>
<th valign="top" align="center"><bold>Total number of reads</bold></th>
<th valign="top" align="center"><bold>Total nucleotides (Mb)</bold></th>
<th valign="top" align="center"><bold>Average Length (bp)</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Floral (paired)</td>
<td align="center">12,792,456</td>
<td align="center">1118</td>
<td align="center">87</td>
</tr>
<tr>
<td align="left">Floral (flash extended<xref ref-type="table-fn" rid="TN1"><sup>a</sup></xref>)</td>
<td align="center">9,456,389</td>
<td align="center">1264</td>
<td align="center">133</td>
</tr>
<tr>
<td align="left">Leaf (paired)</td>
<td align="center">14,541,486</td>
<td align="center">1258</td>
<td/>
</tr>
<tr>
<td align="left">Leaf (flash extended)</td>
<td align="center">13,742,631</td>
<td align="center">1840</td>
<td align="center">133</td>
</tr>
<tr>
<td align="left">Stem (paired)</td>
<td align="center">11,051,702</td>
<td align="center">958</td>
<td align="center">86</td>
</tr>
<tr>
<td align="left">Stem (flash extended)</td>
<td align="center">11,455,453</td>
<td align="center">1499</td>
<td align="center">130</td>
</tr>
<tr>
<td align="left">Total</td>
<td align="center">73,040,117</td>
<td align="center">7938</td>
<td align="center">&#x02013;</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="TN1">
<label>a</label>
<p><italic>Flash extended reads are paired reads that had 10-bp or more of perfect overlap at the ends of their sequence. These were joined to make longer reads, and are not included in the counts of paired reads (see Materials and Methods)</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p><bold>Size distribution of the representative contigs</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th/>
<th valign="top" align="center"><bold>Representative contigs</bold></th>
<th valign="top" align="center"><bold>All contigs</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Number of loci</td>
<td align="center">35,638</td>
<td align="center">53,037</td>
</tr>
<tr>
<td align="left">Number of contigs</td>
<td align="center">35,638</td>
<td align="center">122,053</td>
</tr>
<tr>
<td align="left">Median contig size (bp)</td>
<td align="center">906</td>
<td align="center">943</td>
</tr>
<tr>
<td align="left">Mean contig size (bp)</td>
<td align="center">1166</td>
<td align="center">1209</td>
</tr>
<tr>
<td align="left">Minimum contig size (bp)</td>
<td align="center">269</td>
<td align="center">100</td>
</tr>
<tr>
<td align="left">Maximum contig size (bp)</td>
<td align="center">13,886</td>
<td align="center">17,090</td>
</tr>
<tr>
<td align="left">Total bases in assembly (Mb)</td>
<td align="center">41.6</td>
<td align="center">148</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p><bold>Summary of alignments to SwissProt and nr</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Number of contigs</bold></th>
<th valign="top" align="center"><bold>Number (%) of contigs aligned to SwissProt<xref ref-type="table-fn" rid="TN2"><sup>a</sup></xref></bold></th>
<th valign="top" align="center"><bold>Number (%<xref ref-type="table-fn" rid="TN3"><sup>b</sup></xref>) of unique accessions</bold></th>
<th valign="top" align="center"><bold>Number (%) of contigs aligned to nr<xref ref-type="table-fn" rid="TN2"><sup>a</sup></xref></bold></th>
<th valign="top" align="center"><bold>Number (%<xref ref-type="table-fn" rid="TN3"><sup>b</sup></xref>) of unique accessions</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">35,638</td>
<td align="center">20,880 (58.6)</td>
<td align="center">10,603 (50.8)</td>
<td align="center">26,940 (75.6)</td>
<td align="center">21,376 (79.3)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="TN2">
<label>a</label>
<p><italic>Alignments with E-value &#x02264; 10<sup>&#x02212;5</sup></italic>.</p></fn>
<fn id="TN3">
<label>b</label>
<p><italic>As % of all alignments</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec>
<title>Comparison to known components of anthocyanin biosynthesis in dahlias and identification of additional gene copies</title>
<p>Previous research has identified and sequenced several genes of the dahlia anthocyanin biosynthetic pathway (Suzuki et al., <xref ref-type="bibr" rid="B21">2002</xref>; Ohno et al., <xref ref-type="bibr" rid="B17">2011a</xref>). To investigate assembly quality, we compared these previously sequenced genes to the contigs in our assembly. As shown in Table <xref ref-type="table" rid="T4">4</xref>, we identified all the genes previously known to be involved in the synthesis of the anthocyanidins beginning with chalcone synthase. Furthermore, we identified additional copies for several pathway genes by a best-reciprocal-blast approach, thereby expanding the dahlia gene list for the phenylpropanoid pathways. For example, we identified a total of six putative chalcone synthase genes, four more than were previously known. All contigs identified contained the complete coding sequence of the enzyme (as determined by alignment of sequence to the stop and start codon of reference protein), except for one chalcone synthase and for DvIVS, a basic helix-loop-helix transcription factor required for anthocyanin synthesis, in which the cDNA was split between two contigs.</p>
<table-wrap position="float" id="T4">
<label>Table 4</label>
<caption><p><bold>Genes putatively required for anthocyanidin synthesis</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Putative enzyme</bold></th>
<th valign="top" align="center"><bold>Locus &#x00023; /transcript &#x00023;</bold></th>
<th valign="top" align="center"><bold>Tissue-specificity<xref ref-type="table-fn" rid="TN4"><sup>a</sup></xref></bold></th>
<th valign="top" align="center"><bold>Previously identified?</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">CHS</td>
<td align="center">27/7</td>
<td align="center">All</td>
<td align="center">CHS2<xref ref-type="table-fn" rid="TN5"><sup>b</sup></xref></td>
</tr>
<tr>
<td align="left">CHS</td>
<td align="center">49578/1</td>
<td align="center">Elevated in stem</td>
<td align="center">CHS1<xref ref-type="table-fn" rid="TN5"><sup>b</sup></xref></td>
</tr>
<tr>
<td align="left">CHS</td>
<td align="center">7505/1</td>
<td align="center">Decreased in leaf</td>
<td/>
</tr>
<tr>
<td align="left">CHS</td>
<td align="center">3129/1</td>
<td align="center">All</td>
<td/>
</tr>
<tr>
<td align="left">CHS</td>
<td align="center">18623/1</td>
<td align="center">Elevated in bud</td>
<td/>
</tr>
<tr>
<td align="left">CHS</td>
<td align="center">9318/2<xref ref-type="table-fn" rid="TN6"><sup>c</sup></xref></td>
<td align="center">Elevated in bud</td>
<td/>
</tr>
<tr>
<td align="left">CHI</td>
<td align="center">977/3</td>
<td align="center">All</td>
<td align="center">CHI<xref ref-type="table-fn" rid="TN5"><sup>b</sup></xref></td>
</tr>
<tr>
<td align="left">CHI</td>
<td align="center">2352/8</td>
<td align="center">All</td>
<td/>
</tr>
<tr>
<td align="left">CHI</td>
<td align="center">13838/4</td>
<td align="center">All</td>
<td/>
</tr>
<tr>
<td align="left">DvIVS</td>
<td align="center">7738/1; 36303/2<xref ref-type="table-fn" rid="TN7"><sup>d</sup></xref></td>
<td align="center">All</td>
<td align="center">DvIVS<xref ref-type="table-fn" rid="TN6"><sup>c</sup></xref></td>
</tr>
<tr>
<td align="left">FLS</td>
<td align="center">4250/4</td>
<td align="center">Elevated in stem</td>
<td/>
</tr>
<tr>
<td align="left">F3&#x00027;H</td>
<td align="center">824/8</td>
<td align="center">All</td>
<td/>
</tr>
<tr>
<td align="left">F3&#x00027;H</td>
<td align="center">16120/4</td>
<td align="center">All</td>
<td/>
</tr>
<tr>
<td align="left">F3&#x00027;H</td>
<td align="center">18455/5</td>
<td align="center">All</td>
<td/>
</tr>
<tr>
<td align="left">DFR</td>
<td align="center">44537/3</td>
<td align="center">All</td>
<td align="center">DFR<xref ref-type="table-fn" rid="TN5"><sup>b</sup></xref></td>
</tr>
<tr>
<td align="left">DFR</td>
<td align="center">23328/5</td>
<td align="center">Decreased in bud</td>
<td/>
</tr>
<tr>
<td align="left">DFR</td>
<td align="center">11673/3</td>
<td align="center">All</td>
<td/>
</tr>
<tr>
<td align="left">DFR</td>
<td align="center">27896/1</td>
<td align="center">Decreased in bud</td>
<td/>
</tr>
<tr>
<td align="left">ANS</td>
<td align="center">45613/2</td>
<td align="center">All</td>
<td align="center">ANS<xref ref-type="table-fn" rid="TN5"><sup>b</sup></xref></td>
</tr>
<tr>
<td align="left">ANS</td>
<td align="center">27069/1</td>
<td align="center">All</td>
<td/>
</tr>
<tr>
<td align="left">F3H</td>
<td align="center">22534/2</td>
<td align="center">All</td>
<td align="center">F3H<xref ref-type="table-fn" rid="TN5"><sup>b</sup></xref></td>
</tr>
<tr>
<td align="left">F3H</td>
<td align="center">33298/1</td>
<td align="center">All</td>
<td/>
</tr>
<tr>
<td align="left">F3H</td>
<td align="center">39959/2</td>
<td align="center">Decreased in bud</td>
<td/>
</tr>
<tr>
<td align="left">F3H</td>
<td align="center">43247/1</td>
<td align="center">All</td>
<td/>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="TN4">
<label>a</label>
<p><italic>Expressed in all three samples at levels that did not differ enough to be designated as consistent with tissue-specific expression, designated as All</italic>.</p></fn>
<fn id="TN5">
<label>b</label>
<p><italic>Ohno et al., <xref ref-type="bibr" rid="B18">2011b</xref></italic>.</p></fn>
<fn id="TN6">
<label>c</label>
<p><italic>Ohno et al., <xref ref-type="bibr" rid="B17">2011a</xref></italic>.</p></fn>
<fn id="TN7">
<label>d</label>
<p><italic>These loci lacked either conserved start or stop codons</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec>
<title>Investigating tissue-specific expression</title>
<p>To achieve statistical power in assigning contigs to classes differentially expressed between tissue types, biological replicates would be required. Nonetheless, as a preliminary investigation that might guide future experimental design, we identified contigs with expression patterns consistent with tissue specificity. To do this, we used DESeq to perform a regularized log2 transformation on the read counts; this analysis generates normalized expression values after accounting for the differences in sequencing depth. We assigned a tentative tissue source to a contig by the following procedures: (1) We calculated the standard deviation of expression values across all tissues; (2) we calculated the standard deviation of expression values for each pair of tissues; (3) a contig&#x00027;s expression was classified as elevated or decreased in a specific tissue if the standard deviation of expression values of the pair of other tissues was 3-fold less than the standard deviation of the expression values of all three tissues and the standard deviation of the three tissues expression value was great than 10% the mean expression value. This selected for contigs that showed stable expression in two of three tissues, with a large difference in read counts in the third. Table <xref ref-type="table" rid="T4">4</xref> presents this analysis for the anthocyanin biosynthetic pathway, and Table <xref ref-type="table" rid="T5">5</xref> considers all classes of representative contigs.</p>
<table-wrap position="float" id="T5">
<label>Table 5</label>
<caption><p><bold>Tentative tissue specificity of contigs</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Expression pattern</bold></th>
<th valign="top" align="center"><bold>Number in this class</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Decreased in buds</td>
<td align="center">1407</td>
</tr>
<tr>
<td align="left">Elevated in buds</td>
<td align="center">390</td>
</tr>
<tr>
<td align="left">Decreased in leaves</td>
<td align="center">2059</td>
</tr>
<tr>
<td align="left">Elevated in leaves</td>
<td align="center">1409</td>
</tr>
<tr>
<td align="left">Decreased in stems</td>
<td align="center">2690</td>
</tr>
<tr>
<td align="left">Elevated in stems</td>
<td align="center">868</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>This analysis resulted in tentative tissue-specific expression patterns for 8823 contigs (see Table <xref ref-type="table" rid="T5">5</xref>). Given the distribution of pigment in Rio Riata, genes essential for anthocyanin synthesis would be expected in the transcriptomes of all three organs, and they were found. Interestingly, while genes for some of the enzymes exhibited similar expression in all three tissues, and all tissues had all the essential components of the pathway for pigment synthesis, some copies (alleles) exhibited patterns consistent with tissue-specific expression (see Table <xref ref-type="table" rid="T4">4</xref>). We verified that these results were not affected by reads mapping to multiple contigs, as using only uniquely mapping reads gave nearly identical results. This implies that the apparent diploid nature of dahlia floral pigmentation genes may reflect not only the loss of function of redundant gene-copies, but also evolved tissue-specific expression patterns for some of the redundant genes, a process termed subfunctionalization. As a second test of the classification approach, we investigated the tissue-specificity of contigs annotated as floral homeotic proteins. Of the 22 contigs annotated as similar to floral homeotic proteins, 12 were elevated in buds as expected, while the remainder were classified as lacking tissue specificity (see Table <xref ref-type="table" rid="T6">6</xref>).</p>
<table-wrap position="float" id="T6">
<label>Table 6</label>
<caption><p><bold>Putative floral homeotic genes in Dahlia</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Locus&#x00023; /transcript&#x00023;</bold></th>
<th valign="top" align="center"><bold>Match to floral homeotic gene</bold></th>
<th valign="top" align="center"><bold>Best blast hit accession number</bold></th>
<th valign="top" align="center"><bold><italic>e</italic>-Value</bold></th>
<th valign="top" align="center"><bold>Read counts in bud</bold></th>
<th valign="top" align="center"><bold>Read counts in leaf</bold></th>
<th valign="top" align="center"><bold>Read counts in stem</bold></th>
<th valign="top" align="left"><bold>Tissue-specificit<xref ref-type="table-fn" rid="TN8"><sup>a</sup></xref></bold></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">312/8</td>
<td align="center">PMADS2</td>
<td align="center">Q07474</td>
<td align="center">4.47e-71</td>
<td align="center">5697</td>
<td align="center">1</td>
<td align="center">69</td>
<td align="left">Elevated in bud</td>
</tr>
<tr>
<td align="left">3477/2</td>
<td align="center">APETALA1</td>
<td align="center">Q41276</td>
<td align="center">9.28e-92</td>
<td align="center">2034</td>
<td align="center">0</td>
<td align="center">22</td>
<td align="left">Elevated in bud</td>
</tr>
<tr>
<td align="left">16671/2</td>
<td align="center">AGAMOUS</td>
<td align="center">Q03489</td>
<td align="center">8.38e-119</td>
<td align="center">1192</td>
<td align="center">3</td>
<td align="center">17</td>
<td align="left">Elevated in bud</td>
</tr>
<tr>
<td align="left">324/3</td>
<td align="center">AGAMOUS</td>
<td align="center">Q40872</td>
<td align="center">3.30e-128</td>
<td align="center">1163</td>
<td align="center">7</td>
<td align="center">14</td>
<td align="left">Elevated in bud</td>
</tr>
<tr>
<td align="left">5269/3</td>
<td align="center">DEFICIENS</td>
<td align="center">P23706</td>
<td align="center">3.68e-96</td>
<td align="center">895</td>
<td align="center">0</td>
<td align="center">18</td>
<td align="left">Elevated in bud</td>
</tr>
<tr>
<td align="left">5867/6</td>
<td align="center">DEFICIENS</td>
<td align="center">P23706</td>
<td align="center">1.82e-91</td>
<td align="center">729</td>
<td align="center">2</td>
<td align="center">18</td>
<td align="left">Elevated in bud</td>
</tr>
<tr>
<td align="left">3943/2</td>
<td align="center">AGAMOUS</td>
<td align="center">Q03489</td>
<td align="center">9.65e-133</td>
<td align="center">645</td>
<td align="center">41</td>
<td align="center">11</td>
<td align="left">Elevated in bud</td>
</tr>
<tr>
<td align="left">643/1</td>
<td align="center">AGAMOUS</td>
<td align="center">Q40872</td>
<td align="center">5.90e-127</td>
<td align="center">637</td>
<td align="center">3</td>
<td align="center">13</td>
<td align="left">Elevated in bud</td>
</tr>
<tr>
<td align="left">8582/1</td>
<td align="center">AGAMOUS</td>
<td align="center">Q40872</td>
<td align="center">4.94e-122</td>
<td align="center">245</td>
<td align="center">0</td>
<td align="center">5</td>
<td align="left">Elevated in bud</td>
</tr>
<tr>
<td align="left">4783/1</td>
<td align="center">AGAMOUS</td>
<td align="center">Q01540</td>
<td align="center">1.72e-06</td>
<td align="center">157</td>
<td align="center">0</td>
<td align="center">0</td>
<td align="left">Elevated in bud</td>
</tr>
<tr>
<td align="left">9368/1</td>
<td align="center">PMADS1</td>
<td align="center">Q07472</td>
<td align="center">1.28e-69</td>
<td align="center">67</td>
<td align="center">6</td>
<td align="center">1</td>
<td align="left">Elevated in bud</td>
</tr>
<tr>
<td align="left">9276/1</td>
<td align="center">APETALA2</td>
<td align="center">P47927</td>
<td align="center">1.35e-36</td>
<td align="center">33</td>
<td align="center">0</td>
<td align="center">3</td>
<td align="left">Elevated in bud</td>
</tr>
<tr>
<td align="left">7009/1</td>
<td align="center">DEFICIENS</td>
<td align="center">P23706</td>
<td align="center">2.93e-94</td>
<td align="center">1675</td>
<td align="center">27</td>
<td align="center">227</td>
<td align="left">All</td>
</tr>
<tr>
<td align="left">1095/3</td>
<td align="center">AGAMOUS</td>
<td align="center">Q40872</td>
<td align="center">4.44e-137</td>
<td align="center">1118</td>
<td align="center">0</td>
<td align="center">54</td>
<td align="left">All</td>
</tr>
<tr>
<td align="left">15403/1</td>
<td align="center">APETALA2</td>
<td align="center">P47927</td>
<td align="center">8.53e-111</td>
<td align="center">693</td>
<td align="center">1197</td>
<td align="center">616</td>
<td align="left">All</td>
</tr>
<tr>
<td align="left">22399/7</td>
<td align="center">APETALA2</td>
<td align="center">P47927</td>
<td align="center">5.50e-90</td>
<td align="center">350</td>
<td align="center">557</td>
<td align="center">323</td>
<td align="left">All</td>
</tr>
<tr>
<td align="left">17401/4</td>
<td align="center">APETALA2</td>
<td align="center">P47927</td>
<td align="center">1.45e-85</td>
<td align="center">214</td>
<td align="center">240</td>
<td align="center">502</td>
<td align="left">All</td>
</tr>
<tr>
<td align="left">23847/1</td>
<td align="center">APETALA2</td>
<td align="center">P47927</td>
<td align="center">3.84e-88</td>
<td align="center">140</td>
<td align="center">8</td>
<td align="center">61</td>
<td align="left">All</td>
</tr>
<tr>
<td align="left">40009/3</td>
<td align="center">APETALA2</td>
<td align="center">P47927</td>
<td align="center">7.44e-101</td>
<td align="center">135</td>
<td align="center">105</td>
<td align="center">347</td>
<td align="left">All</td>
</tr>
<tr>
<td align="left">15550/2</td>
<td align="center">APETALA2</td>
<td align="center">P47927</td>
<td align="center">2.8e-76</td>
<td align="center">98</td>
<td align="center">19</td>
<td align="center">40</td>
<td align="left">All</td>
</tr>
<tr>
<td align="left">31911/1</td>
<td align="center">APETALA1</td>
<td align="center">D7KWY6</td>
<td align="center">1.06e-09</td>
<td align="center">30</td>
<td align="center">14</td>
<td align="center">6</td>
<td align="left">All</td>
</tr>
<tr>
<td align="left">38896/1</td>
<td align="center">APETALA2</td>
<td align="center">P47927</td>
<td align="center">3.57e-106</td>
<td align="center">16</td>
<td align="center">3</td>
<td align="center">183</td>
<td align="left">All</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="TN8">
<label>a</label>
<p><italic>Expressed in all three samples at levels that did not differ enough to be designated as consistent with tissue-specific expression, designated as All</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>Despite the large genome size of <italic>D. variabilis</italic> (9.62 pg; Temsch et al., <xref ref-type="bibr" rid="B23">2008</xref>) and the expectation of two to four paralogs per locus type in an octoploid (Gatt et al., <xref ref-type="bibr" rid="B7">1998</xref>), a paired-end read strategy using next generation sequencing yielded sufficient data to assemble more than 20,000 contigs that could be annotated with GO terms. As flowering plants sequenced to date contain &#x0007E;25,000&#x02013;40,000 genes (e.g., Arabidopsis Genome Initiative, <xref ref-type="bibr" rid="B3">2000</xref>; Matsumoto et al., <xref ref-type="bibr" rid="B14">2005</xref>; Amborella Genome Project, <xref ref-type="bibr" rid="B1">2013</xref>) it is likely that the transcriptome data define approximately half of dahlia loci. Genes expressed in seeds, roots, and non-abundant specialized cell types were likely missed as well as genes expressed during biotic or abiotic stress. Given the scanty knowledge about <italic>Dahlia</italic> genes, this new transcriptome resource can serve as a guide in gene identification and cloning, as a standard for comparison among varieties and species, and as a database for identifying genes unique to <italic>Dahlia</italic> and paralogs and orthologs shared with other composites, flowering plants, and all plants. Furthermore, we identified contigs whose expression patterns were consistent with tissue-specificity; future researchers may test our assignments by qPCR or further sequencing. Our method was limited by a lack of biological replicates, which prevents any estimate of the variability of expression of genes within a tissue-type. For this reason, we attempted to be conservative in our calling of specificity and have likely both failed to identify some contigs whose expression is highly enriched in certain tissues, as well as potentially misassigned contigs whose expression is not tissue-specific but is highly variable. However, we believe this resource will be of use for researchers attempting to clone cDNAs or perform <italic>in situ</italic> hybridization to localize specific transcripts. This report indicates that reference transcriptomes for plant species with complex genomes are a feasible and relatively inexpensive method for generating a toolkit for molecular and genetic analysis.</p>
</sec>
<sec>
<title>Author contributions</title>
<p>Erik M. Lehnert performed all of the experiments; Virginia Walbot maintained the dahlias; both authors contributed to design of data collection and analysis and manuscript writing and editing.</p>
<sec>
<title>Conflict of interest statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p></sec>
</sec>
</body>
<back>
<ack>
<p>We thank Tim Culbertson and Blaine Marchant for constructing the Stanford Dahlia Project website (<ext-link ext-link-type="uri" xlink:href="http://web.stanford.edu/group/dahlia_genetics/dahlia_reference_transcriptome.htm">http://web.stanford.edu/group/dahlia_genetics/dahlia_reference_transcriptome.htm</ext-link>) and for starting the dahlia collection. John Fernandes incorporated the transcriptome data into the website. This effort was supported by the Savitsky Fund and by the outreach component of a National Science Foundation grant (PGRP 07-01880) to Virginia Walbot. Erik M. Lehnert was supported by the Gordon and Betty Moore Foundation (grant &#x00023;2629).</p>
</ack>
<sec sec-type="supplementary-material" id="s5">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="http://www.frontiersin.org/journal/10.3389/fpls.2014.00340/abstract">http://www.frontiersin.org/journal/10.3389/fpls.2014.00340/abstract</ext-link></p>
<supplementary-material xlink:href="DataSheet1.XLSX" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><collab>Amborella Genome Project.</collab></person-group> (<year>2013</year>). <article-title>The <italic>Amborella</italic> genome and the evolution of flowering plants</article-title>. <source>Science</source> <volume>342</volume>, <fpage>1241089</fpage>. <pub-id pub-id-type="doi">10.1126/science.1241089</pub-id><pub-id pub-id-type="pmid">24357323</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Anders</surname> <given-names>S.</given-names></name> <name><surname>Huber</surname> <given-names>W.</given-names></name></person-group> (<year>2010</year>). <article-title>Differential expression analysis for sequence count data</article-title>. <source>Genome Biol</source>. <volume>11</volume>, <fpage>R106</fpage>. <pub-id pub-id-type="doi">10.1186/gb-2010-11-10-r106</pub-id><pub-id pub-id-type="pmid">20979621</pub-id></citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><collab>Arabidopsis Genome Initiative.</collab></person-group> (<year>2000</year>). <article-title>Analysis of the genome sequence of the flowering plant <italic>Arabidopsis thaliana</italic></article-title>. <source>Nature</source> <volume>408</volume>, <fpage>796</fpage>&#x02013;<lpage>815</lpage>. <pub-id pub-id-type="doi">10.1038/35048692</pub-id><pub-id pub-id-type="pmid">11130711</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bate-Smith</surname> <given-names>E. C.</given-names></name> <name><surname>Swain</surname> <given-names>T.</given-names></name> <name><surname>N&#x000F6;rdstrom</surname> <given-names>C. G.</given-names></name></person-group> (<year>1955</year>). <article-title>Chemistry and inheritance of flower colour in the Dahlia</article-title>. <source>Nature</source> <volume>176</volume>, <fpage>1016</fpage>&#x02013;<lpage>1018</lpage>. <pub-id pub-id-type="doi">10.1038/1761016a0</pub-id></citation>
</ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Camacho</surname> <given-names>C.</given-names></name> <name><surname>Coulouris</surname> <given-names>G.</given-names></name> <name><surname>Avagyan</surname> <given-names>V.</given-names></name> <name><surname>Ma</surname> <given-names>N.</given-names></name> <name><surname>Papadopoulos</surname> <given-names>J.</given-names></name> <name><surname>Bealer</surname> <given-names>K.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>BLAST&#x0002B;: architecture and applications</article-title>. <source>BMC Bioinformatics</source> <volume>10</volume>:<fpage>421</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-10-421</pub-id><pub-id pub-id-type="pmid">20003500</pub-id></citation>
</ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Conesa</surname> <given-names>A.</given-names></name> <name><surname>G&#x000F6;tz</surname> <given-names>S.</given-names></name> <name><surname>Garc&#x000ED;a-G&#x000F3;mez</surname> <given-names>J.</given-names></name> <name><surname>Terol</surname> <given-names>J.</given-names></name> <name><surname>Tal&#x000F3;n</surname> <given-names>M.</given-names></name> <name><surname>Robles</surname> <given-names>M.</given-names></name></person-group> (<year>2005</year>). <article-title>Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research</article-title>. <source>Bioinformatics</source> <volume>21</volume>, <fpage>3674</fpage>&#x02013;<lpage>3676</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bti610</pub-id><pub-id pub-id-type="pmid">16081474</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gatt</surname> <given-names>M.</given-names></name> <name><surname>Ding</surname> <given-names>H.</given-names></name> <name><surname>Hammet</surname> <given-names>K.</given-names></name> <name><surname>Murray</surname> <given-names>B.</given-names></name></person-group> (<year>1998</year>). <article-title>Polyploidy and evolution in wild and cultivated <italic>Dahlia</italic> species</article-title>. <source>Ann. Bot</source>. <volume>81</volume>, <fpage>647</fpage>&#x02013;<lpage>656</lpage>. <pub-id pub-id-type="doi">10.1006/anbo.1998.0614</pub-id></citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hodgins</surname> <given-names>K. A.</given-names></name> <name><surname>Lai</surname> <given-names>Z.</given-names></name> <name><surname>Oliveira</surname> <given-names>L. O.</given-names></name> <name><surname>Still</surname> <given-names>D. W.</given-names></name> <name><surname>Scascitelli</surname> <given-names>M.</given-names></name> <name><surname>Barker</surname> <given-names>M. S.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Genomics of Compositae crops: reference transcriptome assemblies and evidence of hybridization with wild relatives</article-title>. <source>Mol. Ecol. Resour</source>. <volume>14</volume>, <fpage>166</fpage>&#x02013;<lpage>1177</lpage>. <pub-id pub-id-type="doi">10.1111/1755-0998.12163</pub-id><pub-id pub-id-type="pmid">24103297</pub-id></citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lehnert</surname> <given-names>E. M.</given-names></name> <name><surname>Mouchka</surname> <given-names>M. E.</given-names></name> <name><surname>Burriesci</surname> <given-names>M. S.</given-names></name> <name><surname>Gallo</surname> <given-names>N. D.</given-names></name> <name><surname>Schwarz</surname> <given-names>J. A.</given-names></name> <name><surname>Pringle</surname> <given-names>J. R.</given-names></name></person-group> (<year>2014</year>). <article-title>Extensive differences in gene expression between symbiotic and aposymbiotic cnidarians</article-title>. <source>G3</source> <volume>4</volume>, <fpage>277</fpage>&#x02013;<lpage>295</lpage>. <pub-id pub-id-type="doi">10.1534/g3.113.009084</pub-id><pub-id pub-id-type="pmid">24368779</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Durbin</surname> <given-names>R.</given-names></name></person-group> (<year>2009</year>). <article-title>Fast and accurate short read alignment with Burrows-Wheeler transform</article-title>. <source>Bioinformatics</source> <volume>25</volume>, <fpage>1754</fpage>&#x02013;<lpage>1760</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp324</pub-id><pub-id pub-id-type="pmid">19451168</pub-id></citation>
</ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Handsaker</surname> <given-names>B.</given-names></name> <name><surname>Wysoker</surname> <given-names>A.</given-names></name> <name><surname>Fennell</surname> <given-names>T.</given-names></name> <name><surname>Ruan</surname> <given-names>J.</given-names></name> <name><surname>Homer</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>The sequence alignment/map format and SAMtools</article-title>. <source>Bioinformatics</source> <volume>25</volume>, <fpage>2078</fpage>&#x02013;<lpage>2079</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp352</pub-id><pub-id pub-id-type="pmid">19505943</pub-id></citation>
</ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Magoc</surname> <given-names>T.</given-names></name> <name><surname>Salzberg</surname> <given-names>S. L.</given-names></name></person-group> (<year>2011</year>). <article-title>FLASH: fast length adjustment of short reads to improve genome assemblies</article-title>. <source>Bioinformatics</source> <volume>27</volume>, <fpage>1</fpage>&#x02013;<lpage>8</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btr507</pub-id><pub-id pub-id-type="pmid">21903629</pub-id></citation>
</ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Martin</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>Cutadapt removes adapter sequences from high-throughput sequencing reads</article-title>. <source>EMBnet.J</source>. <volume>17</volume>, <fpage>10</fpage>&#x02013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.14806/ej.17.1.200</pub-id></citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Matsumoto</surname> <given-names>T.</given-names></name> <name><surname>Wu</surname> <given-names>J. Z.</given-names></name> <name><surname>Kanamori</surname> <given-names>H.</given-names></name> <name><surname>Katayose</surname> <given-names>Y.</given-names></name> <name><surname>Fujisawa</surname> <given-names>M.</given-names></name> <name><surname>Namiki</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>2005</year>). <article-title>The map-based sequence of the rice genome</article-title>. <source>Nature</source> <volume>436</volume>, <fpage>793</fpage>&#x02013;<lpage>800</lpage>. <pub-id pub-id-type="doi">10.1038/Nature03895</pub-id><pub-id pub-id-type="pmid">16100779</pub-id></citation>
</ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ogata</surname> <given-names>J.</given-names></name> <name><surname>Sakamoto</surname> <given-names>T.</given-names></name> <name><surname>Yamaguchi</surname> <given-names>M.</given-names></name> <name><surname>Kawanobu</surname> <given-names>S.</given-names></name> <name><surname>Yoshitama</surname> <given-names>K.</given-names></name></person-group> (<year>2001</year>). <article-title>Isolation and characterization of anthocyanin 5-O-glucosyltransferase from flowers of <italic>Dahlia variabilis</italic></article-title>. <source>J. Plant Physiol</source>. <volume>158</volume>, <fpage>709</fpage>&#x02013;<lpage>714</lpage>. <pub-id pub-id-type="doi">10.1078/0176-1617-00370</pub-id></citation>
</ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ohno</surname> <given-names>S.</given-names></name> <name><surname>Deguchi</surname> <given-names>A.</given-names></name> <name><surname>Hosokawa</surname> <given-names>M.</given-names></name> <name><surname>Tatsuzawa</surname> <given-names>F.</given-names></name> <name><surname>Doi</surname> <given-names>M.</given-names></name></person-group> (<year>2013</year>). <article-title>A basic helix-loop-helix transcription factor DvIVS determines flower color intensity in cyanic dahlia cultivars</article-title>. <source>Planta</source> <volume>238</volume>, <fpage>331</fpage>&#x02013;<lpage>343</lpage>. <pub-id pub-id-type="doi">10.1007/s00425-013-1897-x</pub-id><pub-id pub-id-type="pmid">23689377</pub-id></citation>
</ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ohno</surname> <given-names>S.</given-names></name> <name><surname>Hosokawa</surname> <given-names>M.</given-names></name> <name><surname>Hoshino</surname> <given-names>A.</given-names></name> <name><surname>Kitamura</surname> <given-names>Y.</given-names></name> <name><surname>Morita</surname> <given-names>Y.</given-names></name> <name><surname>Park</surname> <given-names>K.-I.</given-names></name> <etal/></person-group>. (<year>2011a</year>). <article-title>A bHLH transcription factor, DvIVS, is involved in regulation of anthocyanin synthesis in dahlia (<italic>Dahlia variabilis</italic>)</article-title>. <source>J. Exp. Bot</source>. <volume>62</volume>, <fpage>5105</fpage>&#x02013;<lpage>5116</lpage>. <pub-id pub-id-type="doi">10.1093/jxb/err216</pub-id><pub-id pub-id-type="pmid">21765172</pub-id></citation>
</ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ohno</surname> <given-names>S.</given-names></name> <name><surname>Hosokawa</surname> <given-names>M.</given-names></name> <name><surname>Kojima</surname> <given-names>M.</given-names></name> <name><surname>Kitamura</surname> <given-names>Y.</given-names></name> <name><surname>Hoshino</surname> <given-names>A.</given-names></name> <name><surname>Tatsuzawa</surname> <given-names>F.</given-names></name> <etal/></person-group>. (<year>2011b</year>). <article-title>Simultaneous post-transcriptional gene silencing of two different chalcone synthase genes resulting in pure white flowers in the octoploid dahlia</article-title>. <source>Planta</source> <volume>234</volume>, <fpage>945</fpage>&#x02013;<lpage>958</lpage>. <pub-id pub-id-type="doi">10.1007/s00425-011-1456-2</pub-id><pub-id pub-id-type="pmid">21688014</pub-id></citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quevillon</surname> <given-names>E.</given-names></name> <name><surname>Silventoinen</surname> <given-names>V.</given-names></name> <name><surname>Pillai</surname> <given-names>S.</given-names></name> <name><surname>Harte</surname> <given-names>N.</given-names></name> <name><surname>Mulder</surname> <given-names>N.</given-names></name> <name><surname>Apweiler</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2005</year>). <article-title>InterProScan: protein domains identifier</article-title>. <source>Nucl. Acids Res</source>. <volume>33</volume>, <fpage>W116</fpage>&#x02013;<lpage>W120</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gki442</pub-id><pub-id pub-id-type="pmid">15980438</pub-id></citation>
</ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schulz</surname> <given-names>M. H.</given-names></name> <name><surname>Zerbino</surname> <given-names>D. R.</given-names></name> <name><surname>Vingron</surname> <given-names>M.</given-names></name> <name><surname>Birney</surname> <given-names>E.</given-names></name></person-group> (<year>2012</year>). <article-title>Oases: robust <italic>de novo</italic> RNA-seq assembly across the dynamic range of expression levels</article-title>. <source>Bioinformatics</source> <volume>28</volume>, <fpage>1086</fpage>&#x02013;<lpage>1092</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bts094</pub-id><pub-id pub-id-type="pmid">22368243</pub-id></citation>
</ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Suzuki</surname> <given-names>H.</given-names></name> <name><surname>Nakayama</surname> <given-names>T.</given-names></name> <name><surname>Yonekura-Sakakibara</surname> <given-names>K.</given-names></name> <name><surname>Fukui</surname> <given-names>Y.</given-names></name> <name><surname>Nakamura</surname> <given-names>N.</given-names></name> <name><surname>Yamaguchi</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2002</year>). <article-title>cDNA cloning, heterologous expressions, and functional characterization of malonyl-coenzyme a: anthocyanidin 3-o-glucoside-6&#x00022;-o-malonyltransferase from dahlia flowers</article-title>. <source>Plant Physiol</source>. <volume>130</volume>, <fpage>2142</fpage>&#x02013;<lpage>2151</lpage>. <pub-id pub-id-type="doi">10.1104/pp.010447</pub-id><pub-id pub-id-type="pmid">12481098</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Temsch</surname> <given-names>E. M.</given-names></name> <name><surname>Greilhuber</surname> <given-names>J.</given-names></name> <name><surname>Hammett</surname> <given-names>K. R. W.</given-names></name></person-group> (<year>2008</year>). <article-title>Genome size in Dahlia Cav. <italic>(Asteraceae&#x02013;Coreopsideae)</italic>.</article-title> <source>Plant Syst. Evol</source>. <volume>276</volume>, <fpage>157</fpage>&#x02013;<lpage>166</lpage>. <pub-id pub-id-type="doi">10.1007/s00606-008-0077-0</pub-id></citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yamaguchi</surname> <given-names>M. A.</given-names></name> <name><surname>Oshida</surname> <given-names>N.</given-names></name> <name><surname>Nakayama</surname> <given-names>M.</given-names></name> <name><surname>Koshioka</surname> <given-names>M.</given-names></name> <name><surname>Yamaguchi</surname> <given-names>Y.</given-names></name> <name><surname>Ino</surname> <given-names>I.</given-names></name></person-group> (<year>1999</year>). <article-title>Anthocyanidin 3-glucoside malonyltransferase from <italic>Dahlia variabilis</italic></article-title>. <source>Phytochemistry</source> <volume>52</volume>, <fpage>15</fpage>&#x02013;<lpage>18</lpage>. <pub-id pub-id-type="doi">10.1016/S0031-9422(99)00099-0</pub-id><pub-id pub-id-type="pmid">12481098</pub-id></citation>
</ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>Y.</given-names></name> <name><surname>Smith</surname> <given-names>S. A.</given-names></name></person-group> (<year>2013</year>). <article-title>Optimizing <italic>de novo</italic> assembly of short-read RNA-seq data for phylogenomics</article-title>. <source>BMC Genomics</source> <volume>14</volume>:<fpage>328</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-14-328</pub-id><pub-id pub-id-type="pmid">23672450</pub-id></citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zerbino</surname> <given-names>D. R.</given-names></name> <name><surname>Birney</surname> <given-names>E.</given-names></name></person-group> (<year>2008</year>). <article-title>Velvet: algorithms for de novo short read assembly using de Bruijn graphs</article-title>. <source>Genome Res</source>. <volume>18</volume>, <fpage>821</fpage>&#x02013;<lpage>829</lpage>. <pub-id pub-id-type="doi">10.1101/gr.074492.107</pub-id><pub-id pub-id-type="pmid">18349386</pub-id></citation>
</ref>
</ref-list>
</back>
</article>
