ORIGINAL RESEARCH article

Front. Plant Sci., 23 December 2016

Sec. Plant Development and EvoDevo

Volume 7 - 2016 | https://doi.org/10.3389/fpls.2016.01934

De novo Sequencing and Comparative Transcriptomics of Floral Development of the Distylous Species Lithospermum multiflorum

  • Kettering University Flint, MI, USA

Abstract

Genes controlling the morphological, micromorphological, and physiological components of the breeding system distyly have been hypothesized, but many of the genes have not been investigated throughout development of the two floral morphs. To this end, the present study is an examination of comparative transcriptomes from three stages of development for the floral organs of the morphs of Lithospermum multiflorum. Transcriptomes of flowers of the two morphs, from various stages of development, were sequenced using an Illumina HiSeq 2000. The floral transcriptome of L. multiflorum was assembled, and differential gene expression (DE) was identified between morphs, throughout development. Additionally, Gene Ontology (GO) terms for DE genes were determined. Fewer genes were DE early in development compared to later in development, with more genes highly expressed in the gynoecium of the SS morph and the corolla and androecium of the LS morph. A reciprocal pattern was observed later in development, and many more genes were DE during this latter stage. During early development, DE genes appear to be involved in growth and floral development, and during later development, DE genes seem to affect physiological functions. Interestingly, many genes involved in response to stress were identified as DE between morphs.

Introduction

Heterostyly is a complex and elegant breeding system characterized by the presence, in a species, of two or three floral morphs with sexual organs at distinct, yet fixed, heights (Barrett et al., 2000; Barrett and Shore, 2008). In heterostyly's simplest form, distyly, two floral morphs exist. The long-style (LS) morph has flowers with the stigma elevated above the anthers, and the short-style (SS) morph produces flowers in which the anthers are positioned above the stigma. The stigma in one morph is at the same height as the anthers in the other morph, a condition known as reciprocal herkogamy (Figure 1). In heterostyly, reciprocal herkogamy is usually accompanied by a self- and intramorph incompatibility (SI) mechanism that results in only sexual organs at the same height being compatible and able to produce fertile offspring. Researchers have demonstrated that together reciprocal herkogamy and SI allow for efficient pollen transfer among individuals of different morphs throughout populations, and this results in increased outbreeding and reduced inbreeding (Barrett, 1990; Ferrero et al., 2011; Zhou et al., 2015) [although see Björkman (1995) for an example of lack of efficient pollen transfer]. These functional and ecological aspects of heterostyly have been the subject of much research, dating back to Darwin's pivotal work on these components of the breeding system (1877), but the genetics of heterostyly, while investigated using numerous crossing studies (e.g., Ernst, 1928, 1933), still remain elusive (Fornoni and Domínguez, 2015; Gilmartin, 2015). One reason this is the case is that heterostyly is known from members of at least 28 families from across the angiosperms (Barrett and Shore, 2008; Cohen, 2010), but detailed data on the genetics of the breeding system is primarily limited to a small number of species (Gilmartin, 2015; Nowak et al., 2015). Indeed, much of the information on the genetics of heterostyly has been informed by studies on Primula L., and in this taxon, evidence suggests that the heterostylous condition is controlled by a supergene (S-locus), multiple genes that are tightly linked together, involved in the control of the breeding system (Barrett and Shore, 2008; Charlesworth, 2016). Outside of Primula, evidence for the presence of a supergene is weaker, primarily due to lack of data (Barrett and Shore, 2008; Cohen, 2010; Thompson and Jiggins, 2014). The present study employs de novo sequencing and comparative transcriptomics of three developmental stages of the flowers of the two morphs of the distylous species Lithospermum multiflorum Torr. ex A. Gray (Boraginaceae) in order to identify genes involved in the control of the heterostylous condition.

Figure 1

The genetics of heterostyly was initially characterized through crossing studies conducted during the late 1800s and early 1900s (Darwin, 1877; Ernst, 1928, 1933), and the outcomes of these studies resulted in the hypothesis of the three-gene S-locus (Lewis, 1954; Dowrick, 1956; Richards, 1997; Nowak et al., 2015). Recent studies have focused on uncovering the underlying genetics of a small number of heterostylous species, including Linum L., Turnera L., and Primula (McCubbin et al., 2006; Somers et al., 2008; Labonne et al., 2009; Ushijima et al., 2012, 2015). Many of the studies have recognized genes or proteins either linked to the S-locus or with morph-specific patterns of expression.

In Linum, Ushijima et al. (2012, 2015) identified a gene, TSS1, that was tightly linked to the S-locus and only segregated with SS individuals, while other genes unlinked to the S-locus, such as LgSKS1, which are also highly expressed in the SS morph, have post-transcriptional regulation. This latter finding suggests that the development of the two floral morphs is controlled by more than just the components of the S-locus. In Turnera, Somers et al. (2008) and Labonne et al. (2009) constructed a genetic map of the S-locus and identified three genes, TkNACA, TkST1, and TsRETRO, linked to it. Further studies on Turnera, using X-ray-generated mutants, allowed gene order to be postulated for the S-locus (Labonne et al., 2010).

Research on Primula has been more extensive (Gilmartin, 2015; Nowak et al., 2015; Huu et al., 2016), and recent work has resulted in the identification of multiple genes associated with the S-locus, such as PVeGLO2, a GLOBOSA homolog, and CYP734A50, both present in Primula veris L. PVeGLO2, which was identified via a draft genome of the species, is not expressed in the LS morph (Nowak et al., 2015). Similarly, CYP734A50 is only expressed in the styles of the SS morph, and the loss of the expression of this gene results in the elongated styles of the LS morph (Huu et al., 2016). Using BAC contigs, Li et al. (2015) identified a one megabase (Mb) region that contains 82 genes linked to the S-locus, and this region appears to be located close to the centromere. That location may be one of the reasons that recombination among the genes in the S-locus occurs infrequently (Talbert and Henikoff, 2010). In a related species of Primula, P. vulgaris Huds., McCubbin et al. (2006) utilized subtractive cDNA libraries of morph-specific floral tissue to identify differentially expressed (DE) genes between the flowers of the two morphs. These researchers identified 11 DE genes involved in a variety of different processes, including stress response, cell wall development, and RNA-binding; however, none of these DE genes were linked to the S-locus. This provides evidence that multiple genes are involved in downstream processes of morph-specific floral development in heterostylous species. Additionally, genes and proteins that differ between morphs have been studied in other heterostylous taxa, such as Fagopyrum Mill. (Fesenko and Fesenko, 2006) and Averrhoa L. (Wong et al., 1994; Fesenko and Fesenko, 2006), but to a lesser extent.

While identification of the S-locus is important in understanding heterostyly, at least in some species, the breeding system provides an excellent opportunity to investigate genes involved in floral organ height and length. In most species, identification of these genes would require large quantitative trait loci (QTL) studies involving examination of the range of morphological variation and associated molecular markers [e.g., over 6500 plants observed in a QTL study on flower size of Erythranthe Spach (formerly Mimulus L.) (Kelly and Monica, 2011)]. Given the distinct differences in floral organ heights and lengths in heterostylous species, these species do not succumb to these issues. Consequently, examining floral transcriptomes of heterostylous species, at various stages of development, allows for the identification of genes involved in controlling the sizes of flowers and flower organs. It is possible to compare patterns of gene expression of flowers of the morphs of heterostylous species because the flowers of the morphs develop following the same genetic program, with the only exceptions hypothesized to be those resulting in the morph-specific differences (Cohen, 2010). Genes that are differentially expressed between the morphs should be those that are involved in heterostyly. While the S-locus, at least in Primula, may be the genomic region ultimately controlling the heterostylous condition, other genes from throughout the genome may be involved in development of the breeding system (McCubbin et al., 2006), with identification of the S-locus only providing limited data.

L. multiflorum is a distylous species (Figure 1) common in the intermountain west in North America (Cronquist et al., 1984). The species is a member of Boraginaceae, the family that has the largest number of independent origins of heterostyly, and this type of breeding system has evolved multiple times in Lithospermum L. (Cohen, 2011). Cohen et al. (2012) demonstrated that there are differences in the development of the flowers within the genus, and data concerning SI and floral morphology and development based on studies by Philipp and Schou (1981), Casper (1985), Li and Johnston (2010), Ferrero et al. (2012), and Cohen (2014) suggest that heterostyly evolved following distinct patterns in different tribes of the family. Heterostyly arose independently in L. mulitflorum, and the species is sister to Lithospermum macromeria. Cohen (2011), a species with floral morphology quite different from L. multiflorum (i.e., long, green to white corollas and exserted anthers and stigmas). Consequently, L. multiflorum is located at an intriguing phylogenetic position for an investigation of patterns of morph-specific flower gene expression. To this end, the present study is an examination of the comparative transcriptomics of (1) the corolla and androecium and (2) the gynoecium of the two morphs of L. multiflorum at three stages of development in order to recognize genes that are differentially expressed between the morphs, and, therefore, regulating floral morphology in this species.

Materials and methods

Plant material, RNA extraction, and transcriptome sequencing

Flowers of the two morphs of L. multiflorum were collected from three populations in and around Flagstaff, Arizona, USA during June 2011 and 2012. In order to capture appropriate developmental stages, multiple flowers of different sizes were taken from plants, and after quickly removing some sepals, flowers were immediately placed in RNAlater (Applied Biosystems, Foster City, CA, USA), and subsequently stored either at 4°C overnight or placed on dry ice. All specimens were then stored indefinitely at −20°C. Voucher specimens for the populations were made (Cohen 353, 354, 356, and 398), and specimens were deposited at MICH.

Flowers were divided into three stages—early, middle, and late—based on the developmental patterns identified by Cohen et al. (2012) (Figure 2). Flowers in the early stage are up to 4 mm in length, which is before morph-specific differences are apparent in the flowers. Flowers in the middle stage are 4–9 mm in length, which is after morph-specific floral differences are observed in heights of anthers and stigmas. Flowers in the late stage are greater than 9 mm in length, and these flowers range from near anthesis to anthesis, with morph-specific differences well-defined. For each morph at each stage, three extractions were made for the organ group of the corolla, androecium, and gynoecium, and two extractions were conducted for the organ group of the corolla and androecium, resulting in a total of 30 extractions (Table S1). Flowers for RNA extraction were arbitrarily selected from either 2011 or 2012, and given the similar environmental conditions around Flagstaff, AZ during June of the 2 years (https://www.wunderground.com/), it would appear that different collection times would have minimal impact on variation in patterns of gene expression.

Figure 2

RNA was extracted from one or two RNAlater-preserved flowers using a modification of the Li-Cl method of Jaakola et al. (2001). Right before RNA isolation, the remaining sepals and pedicel were removed. Before the extraction buffer was added, flowers were homogenized using the method outlined in Alexander et al. (2007). The primary modification from Jaakola et al. (2001) involved the extraction buffer, which in the present study included 2% CTAB, 2% PVP (Mol WT 40,000), 0.5% SDS, 100 mM Tris- HCl (pH 8.0), 25 mM EDTA, 2.0 M NaCl, and 0.5 g/L spermidine, with 2% β-mercaptoethanol and 1.5 mg/mL Proteinase K added just before use. After RNA was isolated, the quantity, 260/280, and 260/230 ratios were determined using a Nanodrop ND-1000 spectrophotometer (Thermofisher Scientific, Waltham, MA, USA). RNA that was at least a total of 1 μg and with 260/280 and 260/230 > 1.8 was treated with the TURBO DNA-free™ Kit (Applied Biosystems) following the manufacturer's protocol. Subsequently, isolations were sent to the Duke University Genome Sequencing & Analysis Core Resource, where the quality of the RNA was determined via an Agilent 2200 TapeStation (Agilent Technologies, Santa Clara, CA, USA), and then prepared for transcriptome sequencing, on an Illumina HiSeq 2000, using the TruSeq RNA Kit. One extraction from each stage from each group of organs (total of 12) was sequenced using 100 base pair (bp) paired-end (PE) reads across three lanes, and the remaining extractions (18) were sequenced using 100 bp single-reads (SR) across five lanes. Afterwards, adapters were trimmed and low quality and short reads were removed at the Duke University Genome Sequencing & Analysis Core Resource. Sequence data are deposited at the Sequence Read Archive (SRP093214, BioProject PRJNA353131).

Transcriptome assembly and mapping

The transcriptome of L. multiflorum was assembled, at Duke University, based on the 12 PE transcriptomes from the three developmental stages of the two morphs. Using the ~389 million high-quality pairs of reads sequenced from the 12 RNA isolations, the floral transcriptome of the species was assembled using Trinity v 1 (Grabherr et al., 2011). The transcriptome was annotated and gene ontology terms identified (GO terms) using the Integrative Services for Genomic Analysis (http://isga.cgb.indiana.edu). In order to test for DE genes between the morphs and organs at various stages of development, the protocol of Van Verk et al. (2013) was followed. FASTQ files were uploaded to the Galaxy server (usegalaxy.org), and five base pairs were trimmed from the ends of each read, followed by filtering the reads to remove those with low quality scores (reads <90% of the bp with a Phred quality score of ≥20). After filtering, reads from each RNA isolation were mapped to the isoforms of the Trinity-assembled transcriptome. Using Bowtie (Galaxy tool version 1.1.2) (Langmead et al., 2009), reads were mapped, with the appropriate settings for SR and PE employed, using the default (lenient) settings as well as with a stringent criterion (i.e., -a,-best, -strata, -m 1, -n 3, -l 30, -e 288, -y off, -k 1, -maxbts 800). The SAM files from the Bowtie runs were analyzed using eXpress (Roberts and Pachter, 2013), with the Galaxy server, to determine the number of reads that mapped to each isoform. This was undertaken using three different methods: Default settings and two more thorough analyses, involving two additional rounds of each of the batch and online expectation-maximization (EM) algorithm, a mean fragment length either of 90 bp for all reads or 90 bp for SR and 190 bp for PE, and a standard deviation of one. Total counts and effective counts, which are counts adjusted for fragment and target length biases, were recorded (Roberts and Pachter, 2013).

Differential gene expression and gene ontology

The total number of reads mapped to each isoform for each putative gene were compiled in order to determine the number of reads mapped to each putative gene. To test for similarity between transcriptomes from the same developmental stage and morph, Pearson correlations were calculated in Microsoft Excel. For each putative gene and isoform, both the total and effective counts from eXpress from the two Bowtie mapping strategies were analyzed in edgeR v.3.0.0 (Robinson et al., 2010) to test for differential expression (DE) of transcripts among floral organs at early, middle, and late stages of floral development for the corolla and androecium and for the gynoecium. Putative DE genes were identified based on a P-value ≤ 0.05 and a false discovery rate (FDR), using the Benjamini-Hochberg adjustment, of ≤0.1 (Anders et al., 2013). To investigate overlap of DE genes among morphs, organs, and stages of development, venn diagrams were constructed with VennDiagram (Chen and Boutros, 2011). Hierarchical clustering of DE genes was conducted across the three developmental stages, for the two organ groups, using NMF (Gaujoux and Seoighe, 2010).

After the DE putative genes and isoforms were identified for each organ or group of organs at the three developmental stages, genes were annotated using Blast2GO (Conesa et al., 2005) running blastx on the nr database at NCBI, and GO terms were determined, using Blast2GO, with InterProScan (Zdobnov and Apweiler, 2001) and by mapping genes with the Gene Ontology Database (GO Consortium, 2004). With GOstats 2.34.0 (Falcon and Gentleman, 2007), GO terms for the DE genes for each organ or group of organs for each developmental stage were compared to those for all genes in the floral transcriptome in which GO terms were identified. GO terms for Biological Processes, Cellular Components, and Molecular Function were identified that were over- and underrepresented, based on a P-value ≤ 0.5. The hyperGTest in GOstats was run with and without the conditional algorithm. For these analyses, because of the small number of DE genes and the similarity to those during early development, the mid-stage DE genes were included with those from the early stage of development.

Results

Transcriptome sequencing, assembly, and mapping

A total of 30 transcriptomes were sequenced for flowers at the three stages of development for the two morphs of L. multiflorum. For the paired-end (PE) reads, the total number of reads for a transcriptome ranged from 58,474,582 to 134,651,292 100-bp reads. For the single reads (SR), the total number of reads for a transcriptome ranged from 38,244,776 to 78,986,321 100-bp reads. The assembled floral transcriptome of L. multiflorum included 97,603 components (putative genes) and 265,144 isoforms (putative gene variants). The minimum length of a putative gene was 201 bp, and the maximum assembled length was 16,847 bp. For the genes, the average length was 690 bp, with an N50 of 1201 bp, and for the isoforms, the average length was 1144 bp, with an N50 of 2029 bp. Statistics for the assembled transciptome are provided in Table 1. Approximately 2–8% of the total reads were removed via the filtering and trimming steps. Using the default (lenient) and more stringent Bowtie settings, ca. 70–80 and 40% of the reads mapped to the assembled transcriptome, respectively. Data on the number of reads are presented in Table S1.

Table 1

StatisticsGenesIsoforms
Minimum length201201
Maximum length16,84716,847
Mean length6911144
Standard deviation8441142
Median length361661
N50 length12012029
Number of assembled reads >1 kb16,724103,530
Number of isotigs in N5014,10948,516
Number of bases in all isotigs67,417,306303,313,357
Number of bases in isotigs ≥1 kb36,575,211232,659,690
GC content of isotigs38.53%38.55%

Statistics on assembled transcriptome of Lithospermum multiflorum.

Differential gene expression and gene ontology

The Pearson correlations for the components and isoforms were greater than 0.85 for the corolla and androecium and for corolla, androecium, and gynoecium at early and late stages of development, suggesting high correlation among the transcriptomes. For the middle stages of development, some of the transcriptomes had high Pearson correlations (>0.85), particularly for the transcriptomes of the LS corolla, androecium, and gynoecium; however, most of the others had weaker associations, with values ranging from 0.4 to 0.57 (Table S2).

The two different mapping criteria in Bowtie (lenient and stringent), in conjunction with the six different methods for determining the number of reads mapped, in eXpress [total and effective counts for each of (1) no specified fragment length, (2) a mean fragment length of 90 bp for all reads, and (3) a mean fragment length of 90 bp for SR and 190 bp for PE], resulted in 12 manners in which the number of differentially expressed (DE) genes were calculated (Table 2). Of the 12 different manners in which DE genes were calculated, the lenient Bowtie analyses tended to result in more DE genes identified, with edgeR, compared to the more stringent Bowtie analyses. Additionally, the use of effective counts, rather than total counts, generally resulted in fewer DE genes. Specifying the length of the mean fragments at either 90 bp for each read or 190 bp for PE reads and 90 bp for SR, did not seem to influence the number of DE genes as much as the mapping criteria and the use of total or effective counts, with the former resulting in fewer DE genes compared to the latter (Table 2).

Table 2

Developmental stageBowtie lenient, 90 + 190 bp length, effective countsBowtie lenient, 90 + 190 bp length, total countsBowtie stringent, 90 + 190 bp length, effective countsBowtie stringent, 90 + 190 bp length, total countsBowtie lenient, no length, effective countsBowtie lenient, 90 bp length, effective countsBowtie lenient, 90 bp length, total countsBowtie lenient, no length, total countsBowtie stringent, no length, effective countsBowtie stringent, no length, total countss Bowtie stringent, 90 bp length, effective countsBowtie stringent, 90 bp length, total countsTotal unique genes
Early corolla5359004759535913130068
Mid corolla1000001101001
Late corolla26013440023913442621344172108902561444
Early gynoecium133311113123376220111111
Mid gynoecium71714717217131422
Late gynoecium11271593309812102311251593131090615233118123135
Total1408298731182712822498189127481081263631310834713

Number of differentially expressed genes from analyses of developmental stages and organs, see text for explanation of different mapping and differential expression (DE) methods.

In the corolla and androecium, more DE genes early in development were more highly expressed in the short-style (SS) morph (although this was a small number), while a greater number of DE genes were more highly expressed in the long-style (LS) morph during late development (Figures 3, 4). The opposite pattern was observed in the gynoecium, with more highly expressed DE genes in the LS morph during early development (again, a small number), and many more DE genes more highly expressed in the SS morph during late development of the gynoecium (Figures 3, 4). Indeed, the greatest number of DE genes between morphs was observed in the gynoecium during the late stage of growth. Only a small number of DE genes was identified during the mid-stage of development both for the corolla and androecium and for the gynoecium, with the smallest number of DE genes across stages and organs identified in the corolla and androecium of the mid-stage of development (Table 2). The DE genes that were identified in the greatest number of analyses, for each stage and organ, are presented in Table 3. This number ranged from 4 to 12 depending on the stage and organ combination. The majority of these genes were successfully annotated using Blast2GO, and most exhibit greater than six log-fold changes, in expression, between morphs.

Figure 3

Figure 4

Table 3

Early corolla DE genesPutative geneNumber of analyses gene DELog-fold changeLog counts-per-millionLikelihood ratioP-valueFalse discovery rate
comp113449Myosin family protein with dil domain8–7.24645.381624.1378**0.0219
comp94365Glutamine dumper8–10.59773.467724.8753**0.0235
comp107848Continuous vascular ring family protein8–11.94744.656223.0457**0.0262
comp112590Hypothetical protein MIMGU_mgv11b0241222mg, partial7–12.54206.144023.0341**0.0262
comp111181Pleckstrin homology domain-containing family A member 88–5.72775.156421.7499**0.0312
comp117098Lysine histidine transporter 18–8.37506.761621.8987**0.0312
comp102592lactoylglutathione lyase family protein8–7.53944.524020.8535**0.0329
comp106343n/a8–8.87662.566621.0607**0.0329
comp98284Hypothetical protein MIMGU_mgv1a016237mg8–9.00305.442121.0653**0.0329
comp115443Hypothetical protein CICLE_v10002660mg7–6.48994.001620.6916**0.0329
comp89953PREDICTED: uncharacterized protein LOC1012591107–9.92414.462119.5456**0.0458
comp104913n/a8–9.25700.243318.5503**0.0589
comp75402Heavy metal-associated isoprenylated plant protein 26-like7–9.53823.436716.9332**0.0844
MID COROLLA DE GENES
comp90112Carotenoid cleavage dioxygenase chloroplastic-like4–8.15031.331525.4450**0.0444
LATE COROLLA DE GENES
comp1117673n/a1014.63362.091216.8029**0.0143
comp4162239Sugar transport protein 10-like1012.92090.131516.8089**0.0143
comp43701Beta-galactosidase 13-like1013.05430.289016.7540**0.0144
comp44582n/a1012.6192–0.236716.7332**0.0144
comp79612Cysteine proteinase rd21a-like1012.89430.097316.7229**0.0144
comp2645921Endoglucanase 16-like1013.34910.636116.6791**0.0145
comp88548Pathogenesis-related maize seed protein1016.39963.949016.6826**0.0145
comp1603779Ribosome associated membrane protein ramp41012.7303–0.099216.5179**0.0149
comp2540996n/a1012.4408–0.450316.5224**0.0149
comp2683611Pectinesterase pectinesterase inhibitor 28-like1013.27770.551016.5344**0.0149
comp65255Pollen allergen ole e 6-like1014.85402.330116.5087**0.0149
comp95123L-ascorbate oxidase homolog1016.34233.891516.44750.00010.0154
comp1701545n/a1014.20741.619516.37110.00010.0158
comp3100697n/a1013.00100.221716.33700.00010.0159
comp2190644Olee1-like protein1013.78951.146415.93040.00010.0173
comp1767292Pectinesterase pectinesterase inhibitor 28-like1013.33560.623015.88390.00010.0175
comp1986942n/a1013.08740.327615.64880.00010.0187
comp2830467Pollen-specific protein sf3-like1014.12371.527115.33550.00010.0204
comp107951Wat1-related protein at3g28050-like109.8738–1.253915.08760.00010.0215
comp2384086Arabinogalactan protein 141013.25550.527314.81950.00010.0233
EARLY GYNOECIUM DE GENES
comp127830241k polyprotein7–6.450013.599429.1451**0.0022
comp88015Cp protein7–16.3710–1.441228.0797**0.0028
comp106811Beta-galactosidase 13-like53.806614.132827.7009**0.0028
comp108439Probable n-acetyltransferase hls1-like511.05832.632726.2899**0.0036
comp118914Probable ccr4-associated factor 1 homolog 7-like50.977114.970124.4516**0.0062
comp47121n/a612.7101–1.892824.6451**0.0096
comp102300Aldose 1-epimerase-like60.786314.706723.9491**0.0121
comp94365Glutamine dumper912.40724.735125.9591**0.0170
comp119774Phospholipase a1- chloroplastic-like5–4.28632.868021.5807**0.0184
comp100745Rapid alkalinization factor 23-like53.705613.954221.1962**0.0405
comp1174059n/a611.4198–0.875820.9731**0.0413
comp123276Subtilase family protein55.327614.584819.2794**0.0501
comp107848Continuous vascular ring family protein511.41454.656219.0432**0.0542
comp14260n/a66.8712–2.575619.2161**0.0760
comp112590Hypothetical protein MIMGU_mgv11b0241222mg, partial612.02407.266318.3478**0.0945
comp50910PREDICTED: uncharacterized protein LOC101253901615.90104.279218.4226**0.0945
comp75402Heavy metal-associated isoprenylated plant protein 26-like511.37393.436717.2721**0.0958
MID GYNOECIUM DE GENES
comp127830241k polyprotein8–4.298212.452939.6674****
comp118914Probable ccr4-associated factor 1 homolog 7-like6–2.735714.970125.1782**0.0085
comp106811Beta-galactosidase 13-like5–2.362913.317723.8908**0.0142
comp129234n/a515.04683.417223.1366**0.0184
comp118394n/a56.16554.075221.9946**0.0243
LATE GYNOECIUM DE GENES
comp115916Polygalacturonase at1g48100-like12–10.90824.692978.1199****
comp127830241k polyprotein12–1.952212.452940.1043****
comp102478Non-specific lipid-transfer protein at5g64080-like129.25533.361233.8880****
comp110356Cysteine-rich repeat secretory protein 60-like12–10.80932.029630.3242**0.0001
comp104697Aspartic proteinase-like protein 1-like12–13.87272.180230.1851**0.0001
comp102300Aldose 1-epimerase-like122.844913.882227.0702**0.0003
comp116177Serine threonine-protein kinase pbs1-like127.00651.566026.4948**0.0004
comp107662n/a12–13.16290.462421.7722**0.0023
comp101506Laccase diphenol oxidase family protein12–13.2946–1.583719.4333**0.0049
comp124850Glutaredoxin family protein12–4.65472.736217.0746**0.0107
comp16116Agc kinase12–14.2467–2.101315.61940.00010.0168
comp103048Geraniol 8-hydroxylase-like12–9.7069–1.228315.24360.00010.0191
comp126173Protein-tyrosine kinase 2-beta-like12–5.69883.780613.11160.00030.0389
comp59609Cytochrome p450 716b2-like1211.2034–1.136912.95910.00030.0407
comp84994n/a1211.2034–1.136912.95910.00030.0407
comp93273Early nodulin 55-21213.0025–1.950612.95430.00030.0407
comp95648Sulfate transporter-like12–4.72642.243611.73730.00060.0623
comp110461Nicotianamine synthase-like12–5.50011.102811.65280.00060.0637
comp121779Mitochondrial glycoprotein family protein12–5.96711.400311.46040.00070.0678
comp97467n/a12–9.87690.912010.57220.00110.0918
comp1218480n/a12–8.5381–3.076710.42820.00120.0962

Statistics and putative identity of differentially expressed (DE) genes of developmental stages and organs most frequently identified from across analytical methods, positive and negative values in log-fold change are greater expression in long-style (LS) and short-style (SS) morphs, respectively.

**

Denotes P < 0.0001.

No genes expressed in the corolla and androecium overlap in patterns of expression among the developmental stages. In contrast, 12 genes are DE in the gynoecium across all three developmental stages, with smaller numbers overlapping between the early and mid-stage of development (two) and the mid- and late stage (four) (Figure 5). Interestingly, 11 genes are DE in both the early and late stage of the gynoecium, but not in the mid-stage of development (Figure 5). The vast majority of genes DE during one stage are not DE at other stages (Figures 4, 5). However, 892 DE genes are common at the late stage of development for the corolla and androecium and the gynoecium, although the patterns of expression may differ among these genes (Figures 4, 5). Analyses of hierarchical clustering also illustrate that a large number of genes that are DE during the late stage are not DE during the earlier stages (Figure 4).

Figure 5

The pluralities of the Gene Ontology (GO) terms were identified from Glycine max (L.) Merr., Vitis vinifera L., Populus trichocarpa Torr. & A. Gray ex Hook., and Citrus sinensis Osbeck, although GO terms identified from species throughout the angiosperms were represented in the annotation. The top-hit species were primarily members of Lamiidae, including Coffea canephora Pierre ex A. Froehner, Erythranthe guttata (DC.) G. L. Nesom, Solanum tuberosum L., and Solanum lycopersicum L., as well as V. vinifera. A total of 65,535 GO terms were identified for the floral transcriptome of L. multiflorum. GO terms were overrepresented during all stages of development, and only in the early and mid-stage of the development of the corolla and androecium were no GO terms underrepresented (Table 4). During early and mid-development, most of the GO terms that differed in frequency between DE genes and the transcriptome as a whole were overrepresented in the DE gene set, whereas during late development, a greater number were underrepresented in the DE gene set (Table 4).

Table 4

GO term typeEarly corolla and androeciumEarly gynoeciumLate corolla and andreociumLate gynoeciumTotal
Biological processes overrepresented165383041210
Biological processes under-represented023199281223
Cellular components overrepresented21581136
Cellular components under-represented01454957
Molecular function overrepresented451081561
Molecular function under-represented012107150159

Number of Gene Ontology (GO) terms over- and underrepresented during developmental stages of the floral organs.

A summary of the most over- and underrepresented GO terms is presented in Table 5. Many more genes are DE during late floral development compared to during early and mid-floral development (Figures 4, 5), regardless of the type of analysis (Table 2). Early in development, gene ontology (GO) terms that are overrepresented, for Biological Processes (BP), Cellular Components (CC), and Molecular Function (MF), appear to be involved in growth. For example, GO terms associated with membrane fusion, actin cytoskeleton organization, cell tip, cell pole, site of polarized growth, and actin binding are overrepresented early in development in the corolla and androecium of the SS morph (Tables 5, 6). During late development, more genes are DE compared to early development, and the genes that are DE differ between the two stages. Later in development, many DE genes appear to be involved in physiological functions, which can include self- and intramorph-incompatiblity and response to stress. Indeed, it is worth noting that response to stress is the only GO term overrepresented throughout the developmental stages in all investigated organs when the conditional algorithm of GOstats was not employed, and this GO term was one of only three BP GO terms overrepresented when the condition algorithm was used (Table 6). As with the patterns of DE genes in general, more of those involved in response to stress are expressed later in development than earlier in development. While specific GO terms related to distinct responses to stress, such as those related to the regulation of particular hormones, were not found to be over- or underrepresented, some genes that respond to hormones known to be involved in response to stress (e.g., ethylene and abscisic acid) were DE during various stages of development (Table 7). During late development, the BP and MF GO terms overrepresented, in both the corolla and androecium and the gynoecium, may be involved in SI, including cell wall organization, cytoskeletal protein, and others (Tables 5, 6). Additionally, overrepresented CC GO terms tend to be either extracellular or toward the periphery of the cell, which is not the case for overrepresented CC GO terms early in development, which are more internal to the cell or involved in growth, such as an overrepresentation of plant-type vacuole membrane and intracellular organelle during early development but not latter development (Tables 5, 6). Genes putatively involved in these functions and locations are overexpressed in the corolla and androecium of the LS morph and in the gynoecium of the SS morph.

Table 5

Biological processes overrepresentedEarly corolla and androeciumP-valueEarly gynoeciumP-valueLate corolla and androeciumP-valueLate gynoeciumP-value
Lipid storage**Response to biotic stimulus0.0017Cell wall organization or biogenesis**Cell wall organization or biogenesis**
Membrane fusion**Stem vascular tissue pattern formation0.0036Growth**Membrane organization**
Ammonium transport0.0001N-terminal protein myristoylation0.0036Cell differentiation**Lipid metabolic process**
Actin cytoskeleton organization0.0003Amino acid import0.0036Membrane organization**Growth**
Sulfur amino acid metabolic process0.0003Basic amino acid transport0.0036Carbohydrate metabolic process**Reproduction**
BIOLOGICAL PROCESSES UNDER-REPRESENTED
N/ACellular biosynthetic process0.0003Oxidation-reduction process**Organonitrogen compound biosynthetic process**
Single-organism cellular process0.0025Response to inorganic substance**Oxidation-reduction process**
Single-organism metabolic process0.0031Metal ion transport**Response to inorganic substance**
Organonitrogen compound biosynthetic process0.0084Nucleobase-containing compound biosynthetic process**Small molecule biosynthetic process**
Phosphorus metabolic process0.0085Cellular macromolecule biosynthetic process**Regulation of macromolecule metabolic process**
CELLULAR COMPONENTS OVERREPRESENTED
Pollen tube tip0.0031Mitochondrion0.0002Extracellular region**Extracellular region**
Site of polarized growth0.0031Intracellular organelle0.0138Cell periphery**Cell periphery**
Cell tip0.0031Nuclear lumen0.0419Cell wall**Cell wall**
Central vacuole0.0031Membrane-bounded organelle0.0423Plasma membrane**Thylakoid**
6-phosphofructokinase complex0.0031Plant-type vacuole membrane0.0476Golgi apparatus**Golgi apparatus**
CELLULAR COMPONENTS UNDER-REPRESENTED
N/AProtein complex0.0426Intracellular organelle part**Integral component of membrane**
Integral component of membrane**Intracellular organelle part**
Membrane part**Chloroplast**
Chromosome**Membrane part**
Chloroplast**Integral component of endoplasmic reticulum membrane**
MOLECULAR FUNCTION OVERREPRESENTED
Actin binding0.0003Glutathione dehydrogenase (ascorbate) activity0.0036Glutathione dehydrogenase (ascorbate) activity0.0036Enzyme regulator activity**
Transferase activity, transferring acyl groups0.0021Carbohydrate binding0.0054Carbohydrate binding0.0054Peptidase activity**
Carbohydrate kinase activity0.0026Enzyme regulator activity0.0057Enzyme regulator activity0.0057Nucleic acid binding transcription factor activity**
6-phosphofructokinase activity0.0027Acidic amino acid transmembrane transporter activity0.0071Acidic amino acid transmembrane transporter activity0.0071Cytoskeletal protein binding**
UDP-glucuronate decarboxylase activity0.0027Neutral amino acid transmembrane transporter activity0.0071Neutral amino acid transmembrane transporter activity0.0071Oxidoreductase activity**
MOLECULAR FUNCTION UNDER-REPRESENTED
N/AAnion binding**Organic cyclic compound binding**Organic cyclic compound binding**
carbohydrate derivative binding0.0002heterocyclic compound binding**heterocyclic compound binding**
Purine ribonucleotide binding0.0002Anion binding**Anion binding**
Ribonucleoside binding0.0002Nucleotide binding**Nucleotide binding**
Purine nucleoside binding0.0002Carbohydrate derivative binding**Carbohydrate derivative binding**

Five most significant Gene Ontology (GO) terms over- and underrepresented during developmental stages of the floral organs.

**

Denotes P < 0.0001.

Table 6

Gene ontology termsEarly corolla and androecium P-valueEarly gynoecium P-valueLate corolla and androecium P-valueLate gynoecium P-value
BIOLOGICAL PROCESSES OVERREPRESENTED
Intracellular transport0.00460.0101**
Response to stress0.02620.00150.0001
Symbiosis, encompassing mutualism through parasitism0.02140.04230.0031
BIOLOGICAL PROCESSES UNDERREPRESENTED
Regulation of transcription, DNA-templated0.0446****
Cellular amino acid metabolic process0.03370.0001**
Regulation of biosynthetic process0.0403****
Phosphorylation0.0392****
Regulation of nucleobase-containing compound metabolic process0.0419****
Nucleobase-containing compound biosynthetic process0.0223****
Oxoacid metabolic process0.0153****
Cellular biosynthetic process0.0003****
Small molecule biosynthetic process0.0330****
Single organism reproductive process0.0326****
Carboxylic acid biosynthetic process0.0405****
Nucleic acid-templated transcription0.0383****
Organonitrogen compound biosynthetic process0.0084****
Regulation of RNA biosynthetic process0.0446****
CELLULAR COMPONENTS OVERREPRESENTED
Extracellular region****
Cell wall****
Peroxisome0.00130.0043
Golgi apparatus****
Microtubule organizing center0.01970.0070
Plasma membrane**0.0001
Plant-type vacuole membrane0.04310.0476
Intracellular organelle0.03630.0138
Cell periphery****
CELLULAR COMPONENTS UNDERREPRESENTED
Ubiquitin ligase complex******
Chromatin******
Nucleosome******
Cell0.0003**0.0003
Nucleus0.02150.00050.0220
Chromosome******
Cytoplasm******
Mitochondrial envelope0.0001**0.0001
Mitochondrial inner membrane0.00020.00010.0002
Vacuolar membrane******
Endoplasmic reticulum membrane******
Spindle0.0004**0.0004
Cytosol0.0025**0.0025
Microtubule******
Glycerol-3-phosphate dehydrogenase complex******
MOLECULAR FUNCTION OVERREPRESENTED
Enzyme regulator activity0.00570.0057**
MOLECULAR FUNCTION UNDER-REPRESENTED
Purine nucleoside binding0.0002****0.0002
ATP binding0.0005****0.0005
Phosphotransferase activity, alcohol group as acceptor0.0401**0.03420.0744
Active transmembrane transporter activity0.04340.0028**0.0462
Adenyl nucleotide binding0.0004****0.0004
Ribonucleoside binding0.0002****0.0002
Purine ribonucleotide binding0.0002****0.0002
Anion binding********
Carbohydrate derivative binding0.0002****0.0002

Most common Gene Ontology (GO) terms over- and underrepresented during floral development.

Lack of numbers in a cell represents lack of significant over- or underrepresentation.

**

Denotes P < 0.0001.

Table 7

Organ and componentPutative geneLog-fold changeP-valueFalse discovery rate
EARLY COROLLA DE GENES
comp109661ca+2-binding ef hand family protein−7.8602**0.0875
EARLY GYNOECIUM DE GENES
comp111457Ethylene-responsive transcription factor 1b4.76160.00010.0830
MID COROLLA DE GENES
comp90112Carotenoid cleavage dioxygenase chloroplastic-like−8.1503**0.0444
LATE COROLLA DE GENES
comp113274Ethylene-responsive transcription factor erf034-like−3.64150.00010.0229
comp115033Ethylene-responsive transcription factor erf034-like−9.67930.00020.0284
comp125113Abscisic acid receptor pyr1-like−2.85890.41330.9142
LATE GYNOECIUM DE GENES
comp109390Nodulin-related protein5.39460.00010.0145
comp97404Ethylene-responsive transcription factor erf012-like−13.3995**0.0128
comp108406Ethylene-responsive transcription factor erf017-like12.0045****
comp110611Ethylene-responsive transcription factor win1-like−3.67100.00120.0923
comp111802Ethylene receptor−3.67100.00120.0923
comp113114Ap2-like ethylene-responsive transcription factor bbm2-like−10.61530.00050.0512
comp114072Ethylene-responsive transcription factor rap2-11-like4.84310.00010.0254
comp116180Ethylene-responsive transcription factor rap2-4−4.29440.00050.0531
comp122880Ethylene-responsive transcription factor rap2-7-like5.0128**0.0026

Statistics and putative identity of differentially expressed (DE) genes identified as involved in response to hormones ethylene or abscisic acid; positive and negative values in log-fold change are greater expression in long-style (LS) and short-style (SS) morphs, respectively.

**

Denotes P < 0.0001.

Discussion

Floral transcriptome of Lithospermum multiforum

The present study demonstrates the utility of transcriptomics to identify differentially expressed genes in flowers, throughout development, of a heterostylous species. The transcriptome of L. multiforum includes 97,603 putative genes and 265,144 isoforms of these genes. In comparison to other floral transcriptomes of heterostylous species, this is a much larger number of genes. For example, Fagopyrum, Primula, and each of two species of Eichhornia Kunth express ca. 25,400, 55,000, and 20,000—24,000 genes, respectively (Logacheva et al., 2011; Ness et al., 2011; Zhang et al., 2013). Additionally, the transcriptome of L. multiflorum is greater than that of the only other species of Boraginaceae, Echium wildpretii H. Pearson ex. Hook.f., with an assembled transcriptome, which includes ca. 58,500 genes and 69,500 transcripts (White et al., 2016). The number of putative genes and transcripts of E. wildpretii is greater than that of the other aforementioned species as well, suggesting the possibility that the species of Boraginaceae may have large transcriptomes.

The large number of putative genes of the L. multiflorum transcriptome could be due to a variety of different factors, including the complexity of the transcriptome of the species, the inclusion of multiple stages of development in the construction of the transcriptome, paralogs due to an ancient polyploidy event in the genus (N = 14, Ward and Spellenberg, 1988), or technical issues, such as the assembly of incomplete contigs due to solely using transcriptome, not genome, data. Additionally, the flowers used in the present study were collected at different stages of development from wild populations, which could result in the expression of genes that may not have been expressed under more controlled conditions. Therefore, particular genes, such as those involved in stress, may not be expressed if conditions are more stable for the plants. However, it should be noted that all of the plants used in the present study were growing under similar conditions (including between years), even if these were in the native environment of the species, which should result in a negligible impact on DE solely due to environmental factors.

Bowtie and eXpress criteria

The present study employed different mapping criteria and parameters for quantifying reads mapped. Unsurprisingly, the more stringent mapping criteria employed in Bowtie (Langmead et al., 2009) resulted in fewer reads mapped, and subsequently fewer DE genes identified using edgeR (Robinson et al., 2010). In general, the use of effective counts, in eXpress (Roberts and Pachter, 2013), resulted in the identification of fewer DE genes, as did the addition of length information. It is notable that using multiple criteria can result in different numbers of DE genes identified. Indeed, the most conservative methods employed found ca. 10% of the DE genes identified with the more lenient approaches. By utilizing different criteria, it was possible to identify a suite of genes DE across multiple analyses, which may be a useful approach for determining genes to target for future research, such as real-time PCR and in-situ hybridizations, to better identify patterns of expression among related homostylous and hetersotylous species of Lithospermum and Boraginaceae.

Differential expression of genes throughout development

During early development, genes overexpressed in the corolla and androecium of the SS morph have been demonstrated to be involved in cell division and actin dynamism, and this includes particular genes that are DE, such as myosin family protein with dil domain and pleckstrin homology domain-containing family A member 8 (Lehman et al., 1996; Berg et al., 2000). In contrast, genes overexpressed in the developing gynoecium of the LS morph, such as n-acyltransferase hls-1 like, have been shown to control cell growth via auxin regulation (Table 3) (Lehman et al., 1996; Berg et al., 2000; Peremyslov et al., 2010). Together, this pattern of expression suggests that different genes are involved in the elongation and growth of various organs in the developing flower bud. Later in development, DE genes and GO terms have been demonstrated to be involved in pollen-specific responses, reproduction, and cell signaling, all of which may play a role in SI. During this stage of development, DE genes include pollen allergen ole e 6-like protein, ole e 1-like protein, pollen-specific protein sf3-like, and protein-tyrosine kinase 2-beta-like (Table 3), while overrepresented CC GO terms comprise extracellular region, cell wall, plasma membrane (Table 6) (Baltz et al., 1992a,b; Hubbard and Till, 2000; Staiger and Franklin-Tong, 2003; Jiménez-López et al., 2011; Lopez-Casado et al., 2012). This pattern of expression demonstrates a shift in the function of DE genes between early and late development.

The small number of DE genes involved in growth expressed early in development, along with the larger number involved in physiological processes expressed later in development, provide evidence that the morphological components of heterostyly (i.e., herkogamy) are controlled by a smaller number of genes than those involved in the physiological aspects of the breeding system (i.e., SI). This suggests that fewer changes in patterns of expression would need to develop in order to result in shifts in herkogamy compared to changes in SI, which is consistent with the model for the evolutionary development proposed by Lloyd and Webb (1992) and supported by phylogenetic and developmental data in Lithospermum (Cohen, 2011; Cohen et al., 2012), hypothesizing that reciprocal herkogamy developed prior to SI in the genus. Furthermore, some heterostylous species in Boraginaceae (e.g., Oreocarya flava A. Nelson) have SI that is not fully established (Casper, 1985), so the morphological components would have arisen without the origin of SI.

Along with the disparity in the number of DE genes at different stages of development, there is an asymmetry present during late development in the pattern of expression of DE genes in floral organs. In the corolla and androecium, the majority of DE genes are overexpressed in the long-style (LS) morph, while in the gynoecium, the situation is reversed, with the majority of the DE genes overexpressed in the short-style (SS) morph (Figures 35). While this pattern is reversed during early development, it is certainly not as pronounced owing to the much smaller number of DE genes during earlier stages of development (Figures 35). This late-development asymmetry is notable because it has been demonstrated that SI does not necessarily act in the same manner in each morph in heterostylous species (Schou and Philipp, 1984). This has not been studied in L. multiflorum, but given the different patterns of expression in the floral organs of the two morphs, this may be the case, with the pollen from the LS morph interacting with the gynoecium of SS morph in a different way than that of the pollen of SS morph with the gynoecium of LS morph.

The mid-stage of development differs from the early and late stages of development in two manners—a much smaller number of DE genes identified and low intramorph Pearson correlations. Fewer than 25 DE genes were identified for the mid-stage of development, with fewer than 20 for any of the individual analyses (Figure 4). This number is less than one third of the DE genes during early development and far smaller than the DE genes during late development (Table 3). This could be due to the mid-stage not being reflective of a distinct biological stage of development. There may be an early stage that establishes morphological differences, followed by a later stage in which SI develops. Therefore, the use of a mid-stage of development, in the present study, may be artificial and include flowers at both the early and late stages of development, which could result in the lower Pearson correlations, as opposed to the high Pearson correlations for floral transcriptomes within the respective early and late stages of development. This variation may also result in fewer DE genes due to differences among the biological replicates of the flowers at the mid-stage.

A limited number of studies have used comparative transcriptomics to identify genes involved in herkogamy and heterostyly, and it is notable that some of the genes identified as DE in other taxa are also DE in L. multiflorum. Ness et al. (2011) compared the floral transcriptomes of two species of Eichhornia that include selfing and outcrossing representatives. One hundred and forty-seven genes were upregulated in the flowers of the selfing individuals studied, with 10 genes identified as being involved in pollination or flower development. Ness et al. (2011) extracted RNA from flowers at various stages of development, but the RNA was pooled for sequencing. Of these 10 DE genes, two auxin response factors were identified, and this was also the case for early development in the flowers of L. multiflorum. During late development, six genes similar to those recognized as DE in Eichhornia were also DE in L. multiflorum, including a gibberellin-regulated protein, oligopeptide transporter 7-like, exocyst complex component, cellulose synthase-like protein d4-like, auxin-responsive protein, and a myb family transcription factor family protein. This suggests that there may be similar genes involved in floral organ height and length across the angiosperms because the taxa are quite distantly related, with Eichhornia being a monocot and L. multiflorum being a dicot. Landis et al. (2015) studied two species of Saltugilia (V. E. Grant) L. A. Johnson (Polemoniaceae), one with small corollas and one with large corollas. These authors recognized over 700 DE genes comparing mature flowers of the two species, with many DE genes involved in cellular functions.

In a recent study of the transcriptomes of flowers of heterostylous morphs, Nowak et al. (2015) compared two species of Primula, P. veris and P. vulgaris. These authors identified ca. 650 genes as DE in each of the two species throughout development, although their FDR-adjusted P-value threshold was 0.05, not 0.1, which was employed in the present study. Additionally, these authors utilized floral tissue from only one stage of development, immature floral buds 3 to 5 days prior to anthesis. Comparing the two species, Nowak et al. (2015) found 113 genes that were DE in both, with more overexpressed in the SS morph than in the LS morph. This pattern of DE is similar to the situation in the gynoecium of L. multiflorum. The study of Primula focused, in part, on the identification of the S-locus, the supergene controlling the heterostylous condition. The genes identified as being linked to the S-locus in P. veris, including a GLOBOSA homolog expressed only in the SS morph, were not DE in the present study, and it should be noted that while the S-locus appears to strongly influence heterostyly in Primula (Fornoni and Domínguez, 2015; Nowak et al., 2015) its presence has yet to be identified in Lithospermum and other species of Boraginaceae. Therefore, the S-locus may control the development of the breeding system in Primula, but other genes and molecular interactions may govern the development of heterostyly in Lithospermum and its relatives. As further genomic information becomes available, it will be possible to more carefully and critically examine differences among heterostylous species.

Some of the genes that have previously been identified as involved in flower size [reviewed in Krizek and Anderson (2013)] were also DE between morphs of L. multiflorum. During early development of the gynoecium, cytochrome p450 93a1-like, which encourages increase in flower size (Anastasiou et al., 2007), was upregulated in the LS morph, and this was also the case for two auxin-response proteins and one auxin-induced protein. These types of genes have been demonstrated to increase floral organ size in other plants (Hu et al., 2003; Varaud et al., 2011). During late development, growth-regulating factors were upregulated in the corolla and androecium of the LS morph, and mutations in these genes have resulted in smaller petals (Kim et al., 2003). This is notable because the flowers of the LS morph of most heterostylous species, including L. multiflorum (Cohen et al., 2012), are smaller than those of the SS morph (Cohen, 2010).

In the corolla and androecium of the SS morph and in the gynoecium of the LS morph, the transcription factor tcp14-like is upregulated (Table 3). TCP transcription factors are known to regulate growth either through promotion or repression of growth (Martín-Trillo and Cubas, 2010). Given the pattern of expression of this TCP transcription factor, it is hypothesized that in L. multiflorum the putative transcription factor tcp14-like is involved in promoting growth, which would result in the larger corollas of the SS morph and the longer styles of the LS morph. Additionally, various genes involved in responding to auxin are DE, with most of these genes upregulated in the larger organs of the corolla and androecium of the SS morph and of the gynoecium of the LS morph. In P. veris, Huu et al. (2016) identified a gene, CYP734A50, that is a putative brassinosteroid-degrading enzyme that regulates style length. In L. multflorum, six genes, including brassinosteroid-regulated protein bru1, are involved in the brassinosteroid biosynthetic pathway. These genes are more highly expressed in the corolla and androecium of the LS morph and the gynoecium of the SS morph. This pattern is the opposite of that for auxin suggesting that this hormone may be involved in the development of smaller flowers and floral organs.

Given the distinct differences in herkogamy between floral morphs of heterostylous species and the importance of herkogamy in influencing pollination biology, genes involved in controlling floral organ height and length should be considered speciation genes (Rieseberg and Blackman, 2010). Mutations in these genes, such as auxin response factors (Hu et al., 2003; Varaud et al., 2011), may result in an allogamous species becoming autogamous or requiring a different pollinator to effectively transfer pollen between the anthers and stigmas of flowers that bear organs at different heights. As the function of the DE genes in the present study is further elucidated, particularly for genes also DE in flowers of other species, the importance of these genes on speciation will become evident, which will be aided by additional floral transcriptomes of species exhibiting variation in herkogamy.

Identification of DE genes involved in response to stress

While the number of DE genes, between morphs, involved in response to stress could be due to the collection of plants from wild populations, it seems likely that given the biological replicates and the consistency among them as well as similar climate and weather patterns among collections and between years, the variation in stress response between morphs has a biological basis. The two morphs may have different genes expressed in response to stress or one morph may have a stronger response to stress than the other. Some of the DE genes involved in stress response are influenced by hormones known to be involved in this type of process, including ethylene and abscisic acid, and these genes include various ethylene-responsive transcription factors, abscisic acid receptor pyr1-like, and others (Thirugnanasambantham et al., 2015; Wang et al., 2015) (Table 7). Future research can investigate variation in hormones or hormone response between morphs, which may influence the observed morph-specific responses to stress.

Ralston (1993) provides evidence that the two morphs of L. multiflorum can respond differently to herbivory. She found that following herbivory individuals with longer styles increased seed production. Additionally, Ralston determined that, compared to the LS morph, the SS morph had significantly greater herbivory and seed production following herbivory. In a study on the heterostylous species Gelsemium sempervirens (L.) J. St.-Hil. (Gelsemiaceae), Leege and Wolfe (2002) provide evidence that floral herbivory is morph-specific. In G. sempervirens, the floral organs toward the apex of the floral tube receive a greater amount of herbivory, resulting in the styles of the LS morph and the stamens of the SS morph being more damaged than these same organs in the flowers of the other morph. These differences in herbivory may result in distinct responses to stress, which were observed in the present study. The different stress responses may be similar to contrasting SI responses between morphs.

Future directions

Comparative transcriptomics provides a powerful tool for the identification of genes involved in heterostyly and the heights and lengths of floral organs, and this methodology allows for a variety of additional studies to be undertaken in L. multiflorum and other taxa. For example, in L. multiflorum it is now possible to examine patterns of variants of genes, including alternative splicing, not only genes that are DE between morphs. Additionally, investigating the patterns of DE genes throughout development of the flowers of the same morph can provide further insight into the molecular underpinnings of heterostyly within this species. An exciting area of research involves comparisons with other heterostylous species within the genus and family and across angiosperms as data collected can help identify whether a suite of genes involved in heterostyly is present in all heterostylous species [or all heterostylous species from a particular clade (e.g., Lamiidae)] or if new genes are recruited during each origin of heterostyly.

Statements

Author contributions

JC planned, designed, and implemented the experiment. He was involved with collecting plants, laboratory research, data analysis and interpretation, and writing and editing the manuscript.

Acknowledgments

The author would like to thank Jeremy Coate and Caroline Kellogg for their invaluable help throughout the project. Olivier Fedrigo and Jean Qin-Qin at the Genome Sequencing & Analysis Core at Duke University were instrumental in the sequencing and assembly efforts. Veronica Moorman, Shannon Straub, two reviewers, and Verónica S. Di Stilio provided helpful comments on the manuscript. The project was supported by start-up funds from Texas A&M International University and Kettering University.

Conflict of interest

The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: http://journal.frontiersin.org/article/10.3389/fpls.2016.01934/full#supplementary-material

Table S1

Morph, developmental stage, floral organs, and sequencing and mapping statistics on floral transcriptomes, C, corolla; A, androecium; G, gynoecium; PE, paired-end reds; SR, single reads.

Table S2

Results of Pearson correlation test for all analyzed floral transcriptomes of Lithospermum multiflorum. Numbers above diagonal are for isoforms, and numbers below diagonal are for components (putative genes).

References

  • 1

    AlexanderP. J.RajanikanthG.BaconC. D.BaileyC. D. (2007). Recovery of plant DNA using a reciprocating saw and silica-based columns. Mol. Ecol. Notes7, 59. 10.1111/j.1471-8286.2006.01549.x

  • 2

    AnastasiouE.KenzS.GerstungM.MacLeanD.TimmerJ.FleckC.et al. (2007). Control of plant organ size by KLUH/CYP78A5-dependent intercellular signaling. Dev. Cell13, 843856. 10.1016/j.devcel.2007.10.001

  • 3

    AndersS.McCarthyD. J.ChenY.OkoniewskiM.SmythG. K.HuberW.et al. (2013). Count-based differential expression analysis of RNA sequencing data using R and Bioconductor. Nat. Protoc.8, 17651786. 10.1038/nprot.2013.099

  • 4

    BaltzR.DomonC.PillayD. T.SteinmetzA. (1992a). Characterization of a pollen-specific cDNA from sunflower encoding a zinc finger protein. Plant J.2, 713721.

  • 5

    BaltzR.EvrardJ.-L.DomonC.SteinmetzA. (1992b). A LIM motif is present in a pollen-specific protein. Plant Cell4, 1465.

  • 6

    BarrettS. C. (1990). The evolution and adaptive significance of heterostyly. Trends Ecol. Evol.5, 144148. 10.1016/0169-5347(90)90220-8

  • 7

    BarrettS. C.JessonL. K.BakerA. M. (2000). The evolution and function of stylar polymorphisms in flowering plants. Ann. Bot.85(Suppl. 1), 253265. 10.1006/anbo.1999.1067

  • 8

    BarrettS.ShoreJ. (2008). New insights on heterostyly: comparative biology, ecology and genetics, in Self-Incompatibility in Flowering Plants, ed Franklin-TongV. E. (Berlin: Springer), 332.

  • 9

    BergJ. S.DerflerB. H.PennisiC. M.CoreyD. P.CheneyR. E (2000). Myosin-X, a novel myosin with pleckstrin homology domains, associates with regions of dynamic actin. J. Cell Sci.113 (Pt 19), 34393451. Available online at: http://jcs.biologists.org/content/113/19/3439

  • 10

    BjörkmanT. (1995). The effectiveness of heterostyly in preventing illegitimate pollination in dish-shaped flowers. Sex. Plant Reprod.8, 143146. 10.1007/BF00242258

  • 11

    CasperB. B. (1985). Self-compatibility in distylous Cryptantha flava (Boraginaceae). New Phytol.99, 149154. 10.1111/j.1469-8137.1985.tb03644.x

  • 12

    CharlesworthD. (2016). The status of supergenes in the 21st century: recombination suppression in Batesian mimicry and sex chromosomes and other complex adaptations. Evol. Appl.9, 7490. 10.1111/eva.12291

  • 13

    ChenH.BoutrosP. C. (2011). VennDiagram: a package for the generation of highly-customizable Venn and Euler diagrams in R. BMC Bioinformatics12:35. 10.1186/1471-2105-12-35

  • 14

    CohenJ. I. (2010). “A case to which no parallel exists”: the influence of darwin's different forms of flowers. Am. J. Bot.97, 701716. 10.3732/ajb.0900395

  • 15

    CohenJ. I. (2011). A phylogenetic analysis of morphological and molecular characters of Lithospermum, L.(Boraginaceae) and related taxa: evolutionary relationships and character evolution. Cladistics27, 559580. 10.1111/j.1096-0031.2011.00352.x

  • 16

    CohenJ. I. (2014). A phylogenetic analysis of morphological and molecular characters of Boraginaceae: evolutionary relationships, taxonomy, and patterns of character evolution. Cladistics30, 139169. 10.1111/cla.12036

  • 17

    CohenJ. I.LittA.DavisJ. I. (2012). Comparative floral development in Lithospermum (Boraginaceae) and implications for the evolution and development of heterostyly. Am. J. Bot.99, 797805. 10.3732/ajb.1100329

  • 18

    ConesaA.GötzS.García-GómezJ. M.TerolJ.TalónM.RoblesM. (2005). Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics21, 36743676. 10.1093/bioinformatics/bti610

  • 19

    ConsortiumG. O. (2004). The Gene Ontology (GO) database and informatics resource. Nucleic acids Res.32(Suppl. 1), D258D261. 10.1093/nar/gkh036

  • 20

    CronquistA.HolmgrenA. H.HolmgrenN. H.RevealJ. L.HolmgrenP. K. (1984). Intermountain Flora: Vascular Plants of the Intermountain West, USA: Vol. 4. Subclass Asteridae (Except Asteraceae). New York, NY: New York Botanical Garden.

  • 21

    DarwinC. (1877). The Different Forms of Flowers on Plants of the Same Species. London: John Murray.

  • 22

    DowrickV. (1956). Heterostyly and homostyly in Primula obconika. Heredity10, 219236. 10.1038/hdy.1956.19

  • 23

    ErnstA. (1928). Zur vererbung der morphologischen heterostyliemerkmale. Ber. Dtsch. Bot. Ges.46, 573588.

  • 24

    ErnstA. (1933). Weitere untersuchungen zur phänanalyse, zum fertilitäts problem und zur genetik heterostyler Primeln. I. Primula viscosa. Arch. der Julius-Klaus-Stiftung für Vererbungsforschung, Sozialanthropologie und Rassenhygiene8, 1215.

  • 25

    FalconS.GentlemanR. (2007). Using GOstats to test gene lists for GO term association. Bioinformatics23, 257258. 10.1093/bioinformatics/btl567

  • 26

    FesenkoN.N.FesenkoI. (2006). Homostyly of two morphologically different lineages of Fagopyrum homotropicum Ohnishi is determined by locus S4, which is an S-locus related gene in the linkage group# 4. Fagopyrum23, 1115.

  • 27

    FerreroV.ArroyoJ.CastroS.NavarroL. (2012). Unusual heterostyly: style dimorphism and self-incompatibility are not tightly associated in Lithodora and Glandora (Boraginaceae). Ann. Bot.109, 655665. 10.1093/aob/mcr222

  • 28

    FerreroV.CastroS.SánchezJ. M.NavarroL. (2011). Stigma–anther reciprocity, pollinators, and pollen transfer efficiency in populations of heterostylous species of Lithodora and Glandora (Boraginaceae). Plant Syst. Evol.291, 267276. 10.1007/s00606-010-0387-x

  • 29

    FornoniJ.DomínguezC. A. (2015). Beyond the heterostylous syndrome. New Phytol.206, 11911192. 10.1111/nph.13415

  • 30

    GaujouxR.SeoigheC. (2010). A flexible R package for nonnegative matrix factorization. BMC Bioinformatics11:367. 10.1186/1471-2105-11-367

  • 31

    GilmartinP. M. (2015). On the origins of observations of heterostyly in Primula. New Phytol.208, 3951. 10.1111/nph.13558

  • 32

    GrabherrM. G.HaasB. J.YassourM.LevinJ. Z.ThompsonD. A.AmitI.et al. (2011). Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol.29, 644652. 10.1038/nbt.1883

  • 33

    HuY.XieQ.ChuaN.-H. (2003). The Arabidopsis auxin-inducible gene ARGOS controls lateral organ size. Plant Cell15, 19511961. 10.1105/tpc.013557

  • 34

    HubbardS. R.TillJ. H. (2000). Protein tyrosine kinase structure and function. Annu. Rev. Biochem.69, 373398. 10.1146/annurev.biochem.69.1.373

  • 35

    HuuC. N.KappelC.KellerB.SicardA.TakebayashiY.BreuningerH.et al. (2016). Presence versus absence of CYP734A50 underlies the style-length dimorphism in primroses. Elife5:e17956. 10.7554/eLife.17956

  • 36

    JaakolaL.PirttiläA. M.HalonenM.HohtolaA. (2001). Isolation of high quality RNA from bilberry (Vaccinium myrtillus L.) fruit. Mol. Biotechnol.19, 201203. 10.1385/MB:19:2:201

  • 37

    Jiménez-LópezJ. C.de Dios AlcheJ.Rodríguez-GarcíaM. I. (2011). Systematic and Phylogenetic Analysis of the Ole e 1 Pollen Protein Family Members in Plants. INTECH Open Access Publisher.

  • 38

    KellyJ. K.MonicaJ. P. (2011). Interactions among flower size QTLs of Mimulus guttatus are abundant but highly variable in nature. Genetics189, 14611471. 10.1534/genetics.111.132423

  • 39

    KimJ. H.ChoiD.KendeH. (2003). The AtGRF family of putative transcription factors is involved in leaf and cotyledon growth in Arabidopsis. Plant J.36, 94104. 10.1046/j.1365-313X.2003.01862.x

  • 40

    KrizekB. A.AndersonJ. T. (2013). Control of flower size. J. Exp. Bot.64, 14271437. 10.1093/jxb/ert025

  • 41

    LabonneJ. J.GoultiaevaA.ShoreJ. S. (2009). High-resolution mapping of the S-locus in Turnera leads to the discovery of three genes tightly associated with the S-alleles. Mol. Genet. Genomics281, 673685. 10.1007/s00438-009-0439-5

  • 42

    LabonneJ.TamariF.ShoreJ. (2010). Characterization of X-ray-generated floral mutants carrying deletions at the S-locus of distylous Turnera subulata. Heredity105, 235243. 10.1038/hdy.2010.39

  • 43

    LandisJ. B.O'TooleR. D.VenturaK. L.GitzendannerM. A.OppenheimerD. G.SoltisD. E.et al. (2015). The phenotypic and genetic underpinnings of flower size in Polemoniaceae. Front. Plant Sci.6:1144. 10.3389/fpls.2015.01144

  • 44

    LangmeadB.TrapnellC.PopM.SalzbergS. L. (2009). Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol.10:R25. 10.1186/gb-2009-10-3-r25

  • 45

    LeegeL. M.WolfeL. M. (2002). Do floral herbivores respond to variation in flower characteristics in Gelsemium sempervirens (Loganiaceae), a distylous vine?Am. J. Bot.89, 12701274. 10.3732/ajb.89.8.1270

  • 46

    LehmanA.BlackR.EckerJ. R. (1996). HOOKLESS1, an ethylene response gene, is required for differential cell elongation in the Arabidopsis hypocotyl. Cell85, 183194. 10.1016/S0092-8674(00)81095-8

  • 47

    LewisD. (1954). Comparative incompatibility in angiosperms and fungi. Adv. Genet.6, 235283. 10.1016/s0065-2660(08)60131-5

  • 48

    LiJ.WebsterM. A.WrightJ.CockerJ. M.SmithM. C.BadakshiF.et al. (2015). Integration of genetic and physical maps of the Primula vulgaris S locus and localization by chromosome in situ hybridization. New Phytol.208, 137148. 10.1111/nph.13373

  • 49

    LiP.JohnstonM. O. (2010). Flower development and the evolution of self-fertilization in Amsinckia: the role of heterochrony. Evol. Biol.37, 143168. 10.1007/s11692-010-9091-6

  • 50

    LloydD.WebbC. (1992). The evolution of heterostyly, in Evolution and Function of Heterostyly, BarrettS. C. H. (Berlin: Springer), 151178.

  • 51

    LogachevaM. D.KasianovA. S.VinogradovD. V.SamigullinT. H.GelfandM. S.MakeevV. J.et al. (2011). De novo sequencing and characterization of floral transcriptome in two species of buckwheat (Fagopyrum). BMC Genomics12:30. 10.1186/1471-2164-12-30

  • 52

    Lopez-CasadoG.CoveyP. A.BedingerP. A.MuellerL. A.ThannhauserT. W.ZhangS.et al. (2012). Enabling proteomic studies with RNA-Seq: the proteome of tomato pollen as a test case. Proteomics12, 761774. 10.1002/pmic.201100164

  • 53

    Martín-TrilloM.CubasP. (2010). TCP genes: a family snapshot ten years later. Trends Plant Sci.15, 3139. 10.1016/j.tplants.2009.11.003

  • 54

    McCubbinA. G.LeeC.HetrickA. (2006). Identification of genes showing differential expression between morphs in developing flowers of Primula vulgaris. Sex. Plant Reprod.19, 6372. 10.1007/s00497-006-0022-8

  • 55

    NessR. W.SiolM.BarrettS. C. (2011). De novo sequence assembly and characterization of the floral transcriptome in cross-and self-fertilizing plants. BMC Genomics12:298. 10.1186/1471-2164-12-298

  • 56

    NowakM. D.RussoG.SchlapbachR.HuuC. N.LenhardM.ContiE. (2015). The draft genome of Primula veris yields insights into the molecular basis of heterostyly. Genome Biol.16, 117. 10.1186/s13059-014-0567-z

  • 57

    PeremyslovV. V.ProkhnevskyA. I.DoljaV. V. (2010). Class XI myosins are required for development, cell expansion, and F-Actin organization in Arabidopsis. Plant Cell22, 18831897. 10.1105/tpc.110.076315

  • 58

    PhilippM.SchouO. (1981). An unusual heteromorphic incompatibility system. New Phytol.89, 693703. 10.1111/j.1469-8137.1981.tb02348.x

  • 59

    RalstonB. E. (1993). Phylogenetic Systematics and the Evolution of Mating Systems in Lithospermum (Boraginaceae). Flagstaff, AZ: Northern Arizona University.

  • 60

    RichardsA. J. (1997). Plant Breeding Systems. London: Chapman & Hall.

  • 61

    RiesebergL. H.BlackmanB. K. (2010). Speciation genes in plants. Ann. Bot.106, 439455. 10.1093/aob/mcq126

  • 62

    RobertsA.PachterL. (2013). Streaming fragment assignment for real-time analysis of sequencing experiments. Nat. Methods10, 7173. 10.1038/nmeth.2251

  • 63

    RobinsonM. D.McCarthyD. J.SmythG. K. (2010). edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics26, 139140. 10.1093/bioinformatics/btp616

  • 64

    SchouO.PhilippM. (1984). An unusual heteromorphic incompatibility system. Theor. Appl. Genet.68, 139144. 10.1007/bf00252330

  • 65

    SomersD.LabonneJ.VaismanA.ShoreJ. (2008). Construction of a first genetic map of distylous Turnera and a fine-scale map of the S-locus region. Genome51, 471478. 10.1139/G08-031

  • 66

    StaigerC.Franklin-TongV. (2003). The actin cytoskeleton is a target of the self-incompatibility response in Papaver rhoeas. J. Exp. Bot.54, 103113. 10.1093/jxb/erg003

  • 67

    TalbertP. B.HenikoffS. (2010). Centromeres convert but don't cross. PLoS Biol.8:e1000326. 10.1371/journal.pbio.1000326

  • 68

    ThirugnanasambanthamK.DurairajS.SaravananS.KarikalanK.MuralidaranS.IslamV. I. H. (2015). Role of Ethylene Response Transcription Factor (ERF) and its regulation in response to stress encountered by plants. Plant Mol. Biol. Rep.33, 347357. 10.1007/s11105-014-0799-9

  • 69

    ThompsonM.JigginsC. (2014). Supergenes and their role in evolution. Heredity113, 18. 10.1038/hdy.2014.20

  • 70

    UshijimaK.IkedaK.NakanoR.MatsubaraM.TsudaY.KuboY. (2015). Genetic control of floral morph and petal pigmentation in Linum grandiflorum Desf., a heterostylous flax. Hortic. J.84, 261268. 10.2503/hortj.MI-045

  • 71

    UshijimaK.NakanoR.BandoM.ShigezaneY.IkedaK.NambaY.et al. (2012). Isolation of the floral morph-related genes in heterostylous flax (Linum grandiflorum): the genetic polymorphism and the transcriptional and post-transcriptional regulations of the S locus. Plant J.69, 317331. 10.1111/j.1365-313X.2011.04792.x

  • 72

    Van VerkM. C.HickmanR.PieterseC. M.Van WeesS. C. (2013). RNA-Seq: revelation of the messengers. Trends Plant Sci.18, 175179. 10.1016/j.tplants.2013.02.001

  • 73

    VaraudE.BrioudesF.SzécsiJ.LerouxJ.BrownS.Perrot-RechenmannC.et al. (2011). AUXIN RESPONSE FACTOR8 regulates Arabidopsis petal growth by interacting with the bHLH transcription factor BIGPETALp. Plant Cell23, 973983. 10.1105/tpc.110.081653

  • 74

    WangZ.-Y.GehringC.ZhuJ.LiF.-M.ZhuJ.-K.XiongL. (2015). The Arabidopsis vacuolar sorting receptor1 is required for osmotic stress-induced abscisic acid biosynthesis. Plant Physiol.167, 137152. 10.1104/pp.114.249268

  • 75

    WardD.SpellenbergR. (1988). Chromosome counts of angiosperms from New Mexico and adjacent areas. Phytologia64, 390398.

  • 76

    WhiteO. W.DooB.CarineM. A.ChapmanM. A. (2016). Transcriptome sequencing and simple sequence repeat marker development for three Macaronesian endemic plant species. Appl. Plant Sci.4:1600050. 10.3732/apps.1600050

  • 77

    WongK.WatanabeM.HinataK. (1994). Protein profiles in pin and thrum floral organs of distylous Averrhoa carambola L. Sex. Plant Reprod.7, 107115. 10.1007/BF00230579

  • 78

    ZdobnovE. M.ApweilerR. (2001). InterProScan–an integration platform for the signature-recognition methods in InterPro. Bioinformatics17, 847848. 10.1093/bioinformatics/17.9.847

  • 79

    ZhangL.YanH.-F.WuW.YuH.GeX.-J. (2013). Comparative transcriptome analysis and marker development of two closely related Primrose species (Primula poissonii and Primula wilsonii). BMC Genomics14:329. 10.1186/1471-2164-14-329

  • 80

    ZhouW.BarrettS. C.WangH.LiD. Z. (2015). Reciprocal herkogamy promotes disassortative mating in a distylous species with intramorph compatibility. New Phytol.206, 15031512. 10.1111/nph.13326

Summary

Keywords

Boraginaceae, breeding system, distyly, herkogamy, heterostyly, Lithospermum, plant stress, transcriptome

Citation

Cohen JI (2016) De novo Sequencing and Comparative Transcriptomics of Floral Development of the Distylous Species Lithospermum multiflorum. Front. Plant Sci. 7:1934. doi: 10.3389/fpls.2016.01934

Received

22 September 2016

Accepted

06 December 2016

Published

23 December 2016

Volume

7 - 2016

Edited by

Verónica S. Di Stilio, University of Washington, USA

Reviewed by

Timothy John Tranbarger, Institute of Research for Development, France; Jeanne Marie Harris, University of Vermont, USA

Updates

Copyright

*Correspondence: James I. Cohen

This article was submitted to Plant Evolution and Development, a section of the journal Frontiers in Plant Science

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics