<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Plant Sci.</journal-id>
<journal-title>Frontiers in Plant Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Plant Sci.</abbrev-journal-title>
<issn pub-type="epub">1664-462X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpls.2014.00230</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Plant Science</subject>
<subj-group>
<subject>Original Research Article</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Distinct evolutionary strategies in the GGPPS family from plants</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Coman</surname> <given-names>Diana</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/143250"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Altenhoff</surname> <given-names>Adrian</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/155709"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Zoller</surname> <given-names>Stefan</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/161596"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Gruissem</surname> <given-names>Wilhelm</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/25245"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Vranov&#x000E1;</surname> <given-names>Eva</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
<xref ref-type="author-notes" rid="fn001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/152562"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Biology, ETH Zurich</institution> <country>Zurich, Switzerland</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Computer Science, ETH Zurich</institution> <country>Zurich, Switzerland</country></aff>
<aff id="aff3"><sup>3</sup><institution>Swiss Institute of Bioinformatics</institution> <country>Zurich, Switzerland</country></aff>
<aff id="aff4"><sup>4</sup><institution>Institute of Biology and Ecology, Pavol Jozef &#x00160;af&#x000E1;rik University</institution> <country>Ko&#x00161;ice, Slovakia</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Catherine Anne Kidner, University of Edinburgh, UK</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Jinling Huang, East Carolina University, USA; Ben Holt, University of Oklahoma, USA</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Eva Vranov&#x000E1;, Faculty of Science, Institute of Biology and Ecology, Pavol Jozef &#x00160;af&#x000E1;rik University in Ko&#x00161;ice, M&#x000E1;nesova 23, Ko&#x00161;ice, 04154, Slovakia e-mail: <email>eva.vranova&#x00040;upjs.sk</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Plant Evolution and Development, a section of the journal Frontiers in Plant Science.</p></fn>
</author-notes>
<pub-date pub-type="epreprint">
<day>06</day>
<month>04</month>
<year>2014</year>
</pub-date>
<pub-date pub-type="epub">
<day>27</day>
<month>05</month>
<year>2014</year>
</pub-date>
<pub-date pub-type="collection">
<year>2014</year>
</pub-date>
<volume>5</volume>
<elocation-id>230</elocation-id>
<history>
<date date-type="received">
<day>14</day>
<month>03</month>
<year>2014</year>
</date>
<date date-type="accepted">
<day>09</day>
<month>05</month>
<year>2014</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2014 Coman, Altenhoff, Zoller, Gruissem and Vranov&#x000E1;.</copyright-statement>
<copyright-year>2014</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract><p>Multiple geranylgeranyl diphosphate synthases (GGPPS) for biosynthesis of geranylgeranyl diphosphate (GGPP) exist in plants. GGPP is produced in the isoprenoid pathway and is a central precursor for various primary and specialized plant metabolites. Therefore, its biosynthesis is an essential regulatory point in the isoprenoid pathway. We selected 119 GGPPSs from 48 species representing all major plant lineages, based on stringent homology criteria. After the diversification of land plants, the number of <italic>GGPPS</italic> paralogs per species increases. Already in the moss <italic>Physcomitrella patens</italic>, GGPPS appears to be encoded by multiple paralogous genes. In gymnosperms, neofunctionalization of GGPPS may have enabled optimized biosynthesis of primary and specialized metabolites. Notably, lineage-specific expansion of GGPPS occurred in land plants. As a representative species we focused here on <italic>Arabidopsis thaliana</italic>, which retained the highest number of GGPPS paralogs (twelve) among the 48 species we considered in this study. Our results show that the <italic>A. thaliana GGPPS</italic> gene family is an example of evolution involving neo- and subfunctionalization as well as pseudogenization. We propose subfunctionalization as one of the main mechanisms allowing the maintenance of multiple <italic>GGPPS</italic> paralogs in <italic>A. thaliana</italic> genome. Accordingly, the changes in the expression patterns of the <italic>GGPPS</italic> paralogs occurring after gene duplication led to developmental and/or condition specific functional evolution.</p></abstract>
<kwd-group>
<kwd>GGPPS</kwd>
<kwd>isoprenoids</kwd>
<kwd>paralogs</kwd>
<kwd>specialized metabolism</kwd>
<kwd>subfunctionalization</kwd>
</kwd-group>
<counts>
<fig-count count="4"/>
<table-count count="0"/>
<equation-count count="0"/>
<ref-count count="80"/>
<page-count count="12"/>
<word-count count="9619"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="introduction" id="s1">
<title>Introduction</title>
<p>Isoprenoids represent the largest group of biologically active specialized metabolites in plants. Many have roles in protecting the plants against pathogens and herbivores or conversely they attract pollinators and seed-dispersing animals. (Bouvier et al., <xref ref-type="bibr" rid="B10">2005</xref>). Other isoprenoids have important roles in photosynthesis and respiration or as hormones (abscisic acid, brassinosteroids, cytokinins, gibberellic acid, strigolactones) in development and growth regulation (Bouvier et al., <xref ref-type="bibr" rid="B10">2005</xref>; Liang, <xref ref-type="bibr" rid="B47">2009</xref>; Vranov&#x000E1; et al., <xref ref-type="bibr" rid="B75">2012</xref>).</p>
<p>In spite of their broad diversity of functions and structures, the biosynthesis of all isoprenoids in plants invariably requires two five-carbon (C5) building units: the isopentenyl diphosphate (IPP) and its isomer dimethylallyl diphosphate (DMAPP) (Liang et al., <xref ref-type="bibr" rid="B48">2002</xref>; Hsieh et al., <xref ref-type="bibr" rid="B31">2011</xref>; Vranov&#x000E1; et al., <xref ref-type="bibr" rid="B76">2013</xref>). In plants, the mevalonic acid pathway (MVA) produces cytosolic IPP, and the methylerythritol pathway (MEP) produces IPP and DMAPP in plastids (Goldstein and Brown, <xref ref-type="bibr" rid="B25">1990</xref>; Rohmer, <xref ref-type="bibr" rid="B62">1999</xref>; Rodr&#x000ED;guez-Concepci&#x000F3;n and Boronat, <xref ref-type="bibr" rid="B61">2002</xref>). The MVA and MEP pathways are linear step enzymatic reactions until the synthesis of the allylic prenyl diphosphates. Then, prenyl diphosphate synthases catalyze chain elongation reactions by coupling IPP to DMAPP producing allylic prenyl diphosphates of different length (Vranov&#x000E1; et al., <xref ref-type="bibr" rid="B76">2013</xref>). Most of the essential plant isoprenoids are derived from the C15 and C20 allylic prenyl diphosphates farnesyl-PP (FPP) and geranylgeranyl-PP (GGPP), whose pools represent nodes of the major metabolic branch points in the isoprenoid synthesis (Vranov&#x000E1; et al., <xref ref-type="bibr" rid="B77">2011</xref>).</p>
<p>In plants, the enzymes catalyzing the steps upstream of GGPP biosynthesis are encoded either by single copy genes or by pairs of genes (Goldstein and Brown, <xref ref-type="bibr" rid="B25">1990</xref>; Rodr&#x000ED;guez-Concepci&#x000F3;n and Boronat, <xref ref-type="bibr" rid="B61">2002</xref>; Closa et al., <xref ref-type="bibr" rid="B18">2010</xref>; Vranov&#x000E1; et al., <xref ref-type="bibr" rid="B76">2013</xref>). Intriguingly, at the GGPP branch point, a high number of genes encoding GGPP synthase is predicted for plant genomes, reaching up to 12 members per species (PLAZA, <ext-link ext-link-type="uri" xlink:href="http://bioinformatics.psb.ugent.be/plaza/">http://bioinformatics.psb.ugent.be/plaza/</ext-link>).</p>
<p>Multiple gene copies result from duplication events, which can involve individual genes, chromosomal segments, or entire genomes (whole-genome duplication, WGD). Such genes descend from a common ancestor and are homologous (Innan and Kondrashov, <xref ref-type="bibr" rid="B33">2010</xref>). Homologous genes are further classified into paralogs, which are related by duplication events and orthologs, which are genes in different species that evolved from a common ancestor through speciation events (Fitch, <xref ref-type="bibr" rid="B23">1970</xref>). Whereas orthologs tend to share similar functions, paralogs tend to have different roles (Studer and Robinson-Rechavi, <xref ref-type="bibr" rid="B67">2009</xref>). Following duplication, one of the outcomes for a paralog is to accumulate inactivating mutation and become a pseudogene (Innan and Kondrashov, <xref ref-type="bibr" rid="B33">2010</xref>). Alternatively, paralogs are preserved in the genome, particularly if they confer selective advantages. For example, one gene may retain the ancestral function whereas the other undergoes accelerated evolution to acquire a new function (&#x0201C;neofunctionalization&#x0201D;) (Innan and Kondrashov, <xref ref-type="bibr" rid="B33">2010</xref>). Or both paralogous copies might specialize and retain only distinct subsets of the ancestral gene function (&#x0201C;subfunctionalization&#x0201D;), which may increase the fitness of the organism (Lynch and Conery, <xref ref-type="bibr" rid="B49">2000</xref>; Lynch and Force, <xref ref-type="bibr" rid="B50">2000</xref>).</p>
<p>Although biosynthesis of GGPP is an essential step in the isoprenoid pathway providing the common precursor for key metabolic pathways involved in both primary and specialized metabolism, to date, our understanding of specific function of individual geranylgeranyl diphosphate synthases (GGPPS) paralogs is limited (Ament et al., <xref ref-type="bibr" rid="B1">2006</xref>; Jassbi et al., <xref ref-type="bibr" rid="B35">2008</xref>; Schmidt et al., <xref ref-type="bibr" rid="B65">2010</xref>). Reports on basic characterization of individual GGPPS isozymes from <italic>A. thaliana</italic> date back more than a decade ago (Zhu et al., <xref ref-type="bibr" rid="B83">1997a</xref>,<xref ref-type="bibr" rid="B84">b</xref>; Okada et al., <xref ref-type="bibr" rid="B57">2000</xref>), being completed only in the recent years (Wang and Dixon, <xref ref-type="bibr" rid="B80">2009</xref>; Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>). This emphasizes the difficulties of studying multiple paralog gene families <italic>in vivo</italic>.</p>
<p>According to our current knowledge, 10 GGPPS (GGPPS1-GGPPS4 and GGPPS6-GGPPS11) out of 12 predicted paralogs from <italic>A. thaliana</italic> are functional, i.e., GGPP is the major product they synthesize <italic>in vitro</italic> and/or they complement <italic>E. coli</italic> strains engineered to synthesize lycopene but lacking GGPPS activity (Zhu et al., <xref ref-type="bibr" rid="B83">1997a</xref>,<xref ref-type="bibr" rid="B84">b</xref>; Okada et al., <xref ref-type="bibr" rid="B57">2000</xref>; Wang and Dixon, <xref ref-type="bibr" rid="B80">2009</xref>; Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>).</p>
<p>Furthermore, the GGPPSs from <italic>A. thaliana</italic> reside in distinct subcellular compartments and have distinct expression patterns during plant development. GGPPS1 is targeted to mitochondria, GGPPS3 and GGPPS4 to the ER, GGPPS2 and GGPPS6-GGPPS11 to plastids (Zhu et al., <xref ref-type="bibr" rid="B83">1997a</xref>,<xref ref-type="bibr" rid="B84">b</xref>; Okada et al., <xref ref-type="bibr" rid="B57">2000</xref>; Wang and Dixon, <xref ref-type="bibr" rid="B80">2009</xref>; Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>). <italic>GGPPS11</italic> is ubiquitously and abundantly expressed, mainly in photosynthetically active tissues (Okada et al., <xref ref-type="bibr" rid="B57">2000</xref>; Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>), likely providing the GGPP substrate for biosynthesis of essential photosynthesis-related isoprenoid compounds such as chlorophylls, carotenoids, phylloquinones or plastoquinones. <italic>GGPPS1</italic>-<italic>GGPPS10</italic> expression is different during plant development. These paralogs are expressed predominantly in specific root or seed tissues (Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>). Additionally, <italic>GGPPS5</italic> was proposed to be a pseudogene based on sequence analysis (Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>), whereas GGPPS12, the most distant paralog from all predicted GGPP synthases in <italic>A. thaliana</italic>, does not have GGPP synthase activity (Okada et al., <xref ref-type="bibr" rid="B57">2000</xref>; Wang and Dixon, <xref ref-type="bibr" rid="B80">2009</xref>; Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>). However, GGPPS12 seems to be active as a heterodimer and together with GGPPS11 can synthesize geranyl diphosphate (GPP) (Wang and Dixon, <xref ref-type="bibr" rid="B80">2009</xref>).</p>
<p>The localization in different subcellular compartments as well as the distinct expression pattern suggest specific roles for the GGPPS paralogs during <italic>A. thaliana</italic> development. Yet, the biological significance of a highly expanded GGPP branch point and the relationship between the sequence and function of the GGPPS isozymes is not fully understood.</p>
<p>Here, we investigate the evolutionary relationships and molecular characteristics of the GGPPS homologs in plants using a combination of computational analyses and integration with meta-analysis of existing data sets. We identified the GGPPS homologs from 48 plant species representing major plant lineages (green algae, mosses, gymnosperms, and angiosperms) and inferred their evolutionary relationships. We show that multiple within-species GGPPS paralogs exist in several land plants lineages, particularly in angiosperms. The presence of GGPPS paralogs in the moss <italic>P. patens</italic> suggests that GGPPS duplicated early after the diversification from green algae. In gymnosperms, molecular changes in the GGPPS protein domain may have enabled the transition from biosynthesis of primary GGPP-derived compounds to specialized GPP (geranyl diphosphate) metabolites, which play roles in plant-environment interactions. In land plants, a lineage-specific expansion trend of GGPPS is observed.</p>
<p>We have particularly focused on the model plant <italic>A. thaliana</italic> whose nuclear genome retained 12 GGPPS (Lange and Ghassemian, <xref ref-type="bibr" rid="B43">2003</xref>), the highest number of GGPPS paralogs in plants whose genomes have been sequenced to date. Our results suggest that the expansion of the GGPPS family in <italic>A. thaliana</italic> occurred at distinct time points in evolution and by different duplication mechanisms. <italic>GGPPS12, GGPPS2-4</italic>, and <italic>GGPPS11</italic> diverged first. <italic>GGPPS2-4</italic> and <italic>GGPPS11</italic> arose during the most recent WGD event that occurred in <italic>A. thaliana</italic>. In contrast, the most recently diverged paralogs (<italic>GGPPS6, GGPPS7, GGPPS9</italic>, and <italic>GGPPS10</italic>) arose by tandem and segmental genome duplication. Moreover, we hypothesized that if the GGPPS paralogs from <italic>A. thaliana</italic> are not redundant, their persistence in the genome might be attributed to acquired neo- or subfunctionalization. To test this hypothesis, we have inferred the expression states of individual <italic>GGPPS</italic> during plant development. Subsequently, we have mapped these expression states onto the phylogenetic tree of the GGPPS paralogs from <italic>A. thaliana</italic> and inferred the most parsimonious expression pattern of the ancestral GGPPS gene. A statistically significant correlation of sequence and expression divergence substantiated our hypothesis of subfunctionalization in terms of differential expression pattern.</p>
</sec>
<sec sec-type="materials and methods" id="s2">
<title>Materials and methods</title>
<sec>
<title>Sequence retrieval and phylogenetic analysis</title>
<p>To study the phylogeny of the GGPPS family a rooted maximum-likelihood (ML) tree from 119 homologous protein sequences spanning 48 plant genomes was reconstructed as follows. First, the homologs were selected by searching sequences (i.e., protein sequences including targeting peptides) similar to the 12 predicted GGPPS proteins from <italic>A. thaliana</italic> in the UniProtKB database (The UniProt Consortium, <xref ref-type="bibr" rid="B85">2009</xref>) augmented with the <italic>A. lyrata</italic> genome retrieved from Ensembl Plants v3 (Kersey et al., <xref ref-type="bibr" rid="B39">2010</xref>). The current protein model for GGPPS5 reposited at TAIR v.10 (<ext-link ext-link-type="uri" xlink:href="http://www.arabidopsis.org/tools/bulk/sequences/index.jsp">http://www.arabidopsis.org/tools/bulk/sequences/index.jsp</ext-link>), which proposes that the translation could be initiated at an alternative start codon, resulting in a protein that lacks a plastidial targeting sequence at the N terminus but has a conserved polyprenyl synthase domain was used (Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>).</p>
<p>To qualify as a homolog, sequences had to exceed a Dayhoff alignment score of 130 to all GGPPS from <italic>A. thaliana</italic> proteins using Darwin&#x00027;s Align function (Gonnet et al., <xref ref-type="bibr" rid="B26">2000</xref>). From this set of homologs, a multiple sequence alignment (MSA) was reconstructed (Supplementary Dataset <xref ref-type="supplementary-material" rid="SM6">1</xref>) using the Mafft FFT-NS-2 method (Katoh and Toh, <xref ref-type="bibr" rid="B37">2008</xref>). From the resulting MSA, a maximum likelihood tree was reconstructed using the PhyML 3.0 software (Guindon and Gascuel, <xref ref-type="bibr" rid="B28">2003</xref>; Guindon et al., <xref ref-type="bibr" rid="B27">2009</xref>). The default parameters were kept, i.e., we have used the <italic>LG</italic> amino acid substitution matrices (Le and Gascuel, <xref ref-type="bibr" rid="B45">2008</xref>), without invariant sites and with four discrete rate categories chosen according to an estimated gamma shape parameter. The reconstruction was done 50 times from different starting topologies and the overall highest scoring reconstruction was kept for the subsequent analysis. Branch support values were computed using the approximate likelihood ratio test (aLRT) (Anisimova and Gascuel, <xref ref-type="bibr" rid="B2">2006</xref>). To root the phylogenetic tree, a parsimony-based method was used (Berglund-Sonnhammer et al., <xref ref-type="bibr" rid="B7">2006</xref>). In brief, from all possible rootings the tree which minimized the number of implied duplication events and gene losses was chosen. Finally, to infer internal nodes of the tree as speciation or duplication nodes we used the species overlap method, which does not assume a particular species phylogeny (Van Der Heijden et al., <xref ref-type="bibr" rid="B72">2007</xref>). In brief, at every inner node of the gene tree, the overlap of species that are present in each of the two subtrees were counted. In cases one species appeared on both sides of the gene tree, a duplication was inferred; else a speciation event was inferred.</p>
<p>Relative divergence dates of the <italic>GGPPS</italic> paralogs from the Arabidopsis lineage were estimated using Bayesian phylogeny reconstruction with the BEAST 1.6.1 and the BEAGLE software (Drummond et al., <xref ref-type="bibr" rid="B21">2006</xref>). From the previously computed MSA, taxa outside the relevant Arabidopsis lineage were removed and the syntenic orthologs from <italic>Carica papaya</italic> were included (CP00020G01300 and CP00158G00190; PGDD database, <ext-link ext-link-type="uri" xlink:href="http://chibba.agtec.uga.edu/duplication/">http://chibba.agtec.uga.edu/duplication/</ext-link>). The aligned amino acid sequences were mapped to their corresponding codon sequences. Using the <italic>ECM</italic> &#x0002B; <italic>F</italic> &#x0002B; &#x003C9; &#x0002B; 2<sub>K</sub> codon substitution model (Kosiol et al., <xref ref-type="bibr" rid="B42">2007</xref>) in the BEAST software, proposition trees for the tree sampling process were generated by a Yule speciation process using an uncorrelated relaxed clock model with logNormal distribution (Drummond et al., <xref ref-type="bibr" rid="B21">2006</xref>). To calibrate the evolutionary timescale, the following normal distribution priors from the literature on the age of two evolutionary events were used: the <italic>A. thaliana</italic> and <italic>A. lyrata</italic> split was set to 13 &#x000B1; 3 mya (Beilstein et al., <xref ref-type="bibr" rid="B6">2010</xref>) and the stem lineage subtending the eudicot crown group was set to 130 &#x000B1; 5.5 mya (Davies et al., <xref ref-type="bibr" rid="B19">2004</xref>). The Markov Chain Monte Carlo (MCMC) chain-length was set to 8 &#x000D7; 10<sup>6</sup>. The first 1% of the trees was discarded as burn-in. The TreeAnnotator module from the BEAST software was used to create the consensus trees.</p>
</sec>
<sec>
<title>Expression analysis</title>
<p>The expression profile map of the GGPPS paralogs from <italic>A. thaliana</italic> was assembled based on ATH1 22K Affymetrix GeneChip microarray data generated by the AtGenExpress Consortium (<ext-link ext-link-type="uri" xlink:href="http://www.weigelworld.org/resources/microarray/AtGenExpress">http://www.weigelworld.org/resources/microarray/AtGenExpress</ext-link>). The AtGenExpress normalized datasets &#x0201C;tissue extended plus&#x0201D; was retrieved from the Bio-Array Resource website (BAR, <ext-link ext-link-type="uri" xlink:href="http://www.bar.utoronto.ca">www.bar.utoronto.ca</ext-link>). Only experiments using wild-type plants were considered. The probesets for the majority of the GGPPS paralogs are specific to their corresponding transcript, except for <italic>GGPPS6</italic> and <italic>GGPPS7</italic> whose transcripts are ambiguously recognized by the same probeset (258121_s_at) due to their high nucleotide sequence similarity. The common expression profiles for these two genes will be referred in figures with the notation &#x0201C;<italic>GGPPS6/7.</italic>&#x0201D; Expression values below a threshold of 2.5 (log2 scale) were considered as not detectable on the microarray (Schmid et al., <xref ref-type="bibr" rid="B63">2005</xref>; Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>). Hierarchical agglomerative clustering with a threshold set at a tree height <italic>h</italic> &#x0003D; 0.35 (equivalent to a Pearson correlation coefficient of 0.65) was used to estimate the number of clusters and their composition. The cluster analysis was conducted in R (R Development Core Team, <xref ref-type="bibr" rid="B59">2010</xref>).</p>
</sec>
<sec>
<title>Ancestral state reconstruction and statistical analysis</title>
<p>The ancestral state reconstruction and random permutations were performed with the Mesquite system for phylogenetic computing version 2.75 (Maddison and Maddison, <xref ref-type="bibr" rid="B53">2011</xref>). The character matrix was generated by discretizing the expression clusters, i.e., each expression cluster is assigned to a distinct character state. The ancestral state reconstruction was performed under a parsimony model assuming an unordered model in which all state changes are weighed equally. To evaluate the statistical significance of an observed parsimony score, the data were randomly permuted by reshuffling the discrete states among taxa 1 &#x000D7; 10<sup>4</sup> times and calculating the parsimony score for each repetition. The <italic>p</italic>-value was estimated from the distribution of the random parsimony scores, as the fraction of random scores (including the observed score) less than or equal to the observed score: <italic>p</italic> &#x0003D; (1&#x0002B;<italic>k</italic>)/<italic>n</italic> where <italic>k</italic> is the number of replications with less or as many steps than the actual observed data and <italic>n</italic> is the total number of replications. A significant phylogenetic signal was observed at a <italic>p-</italic>value smaller than 0.05 (Faith and Cranston, <xref ref-type="bibr" rid="B22">1991</xref>; Wahlberg, <xref ref-type="bibr" rid="B78">2001</xref>).</p>
</sec>
</sec>
<sec>
<title>Results and discussion</title>
<sec>
<title>The number of <italic>GGPPS</italic> gene paralogs increases during the evolution of plant functional complexity</title>
<p>We have investigated the phylogenetic relationships among GGPPSs from plants to infer evolutionary mechanisms leading to the formation and maintenance of multiple gene copies particularly within the <italic>A. thaliana</italic> genome, which had retained the highest number of paralogs (twelve).</p>
<p>In total, 119 homologous protein sequences exceeding a Dayhoff alignment score of 130 to all GGPPS from <italic>A. thaliana</italic> (see Materials and Methods) were identified and selected for the phylogenetic tree reconstruction. The selected GGPPS homologs represent 48 plant genomes ranging from green algae and mosses to gymnosperms and angiosperms (Supplementary Table <xref ref-type="supplementary-material" rid="SM4">1</xref>).</p>
<p>The GGPPS phylogenetic tree revealed five main subfamilies, referred here to as sub. I to sub.V (Figure <xref ref-type="fig" rid="F1">1</xref>). Plant-specific <italic>GGPPS</italic> genes might have originated from an ancestral copy that was present in the common ancestor of land plants and green algae. This is in agreement with earlier publications proposing that all trans-isoprenyl diphosphate synthases, an enzyme class including the GGPPSs, are derived from a common ancestral gene whose precise identity as archaeal or bacterial homolog is not fully elucidated to date (Chen et al., <xref ref-type="bibr" rid="B15">1994</xref>; Tachibana et al., <xref ref-type="bibr" rid="B68">2000</xref>). Early after the diversification of land plants, the number of <italic>GGPPS</italic> paralogs per species increases and already in the moss <italic>P. patens</italic> GGPPS appears to be encoded by multiple gene paralogs. Furthermore, the phylogenetic analysis showed lineage-specific expansion and divergence events occurring in land plants (Figure <xref ref-type="fig" rid="F1">1</xref> and Supplementary Figure <xref ref-type="supplementary-material" rid="SM1">1</xref>). The increase in the predicted number of GGPPSs per species mirrors the increase in complexity of the species. From one GGPPS in green algae (sub. I), three in mosses (sub. II and sub. V) and one to four in gymnosperms (sub. III&#x02013;V), the number of GGPPS paralogs per species reaches a maximum of twelve copies within angiosperms in <italic>A. thaliana</italic> (sub. V; Supplementary Table <xref ref-type="supplementary-material" rid="SM4">1</xref>).</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>Maximum likelihood consensus tree of the 119 GGPPS homologs from plants</bold>. Posterior probabilities are shown next to the branches. Branch lengths correspond to evolution distances (see Materials and Methods). Duplication (red dots) and speciation (green dots) events are shown at nodes. The tree is divided into five classes (sub. I&#x02013;V). Branch colors represent the major plant lineages: spring green, green algae; orange, mosses; dark green, gymnosperms; and blue- angiosperms. Branches holding homologs from gymnosperms and angiosperms are collapsed and the number of homologs in each collapsed group is shown. The homologs from the Arabidopsis lineage are shown: in blue-<italic>A. thaliana</italic>, in cyan-<italic>A. lyrata</italic>.</p></caption>
<graphic xlink:href="fpls-05-00230-g0001.tif"/>
</fig>
</sec>
<sec>
<title>The molecular evolution of the polyprenyl synthase domain enables the neofunctionalization of GGPPS</title>
<p>To gain further insights in molecular changes underlying the evolution of the GGPPS homologs in plants, we have analyzed the evolution of the characteristic polyprenyl synthase domain (Liang et al., <xref ref-type="bibr" rid="B48">2002</xref>). The GGPPS polyprenyl synthase domain has a first aspartate rich motif, FARM (DDxxxxD; x is any amino acid) and a second aspartate rich motif, SARM (DDxxD; x is any amino acid), which are involved in IPP and DMAPP substrate binding and are critical for GGPP biosynthesis (Liang et al., <xref ref-type="bibr" rid="B48">2002</xref>).</p>
<p>Whereas GGPPSs are typically active as homodimers (Vandermoten et al., <xref ref-type="bibr" rid="B74">2009</xref>), heterodimeric complexes between functional GGPPS and SSUI and SSUII (heterodimeric GPP synthase small subunit I and II, respectively) synthesizing GPP have been reported (Burke et al., <xref ref-type="bibr" rid="B13">1999</xref>; Tholl et al., <xref ref-type="bibr" rid="B70">2004</xref>; Wang and Dixon, <xref ref-type="bibr" rid="B80">2009</xref>). SSUI lost both aspartate rich motifs but has two conserved CxxxC motifs (where x is any hydrophobic amino acid) (Tholl et al., <xref ref-type="bibr" rid="B70">2004</xref>). SSUII has conserved FARM and two CxxxC motifs (Burke et al., <xref ref-type="bibr" rid="B13">1999</xref>; Wang and Dixon, <xref ref-type="bibr" rid="B80">2009</xref>). In heterodimeric complexes between functional GGPPS and SSUII, the CxxxC motifs were shown to be important for physical interaction between subunits. Furthermore, such complexes were shown to be able to produce, with increased efficiency, GPP (Wang and Dixon, <xref ref-type="bibr" rid="B80">2009</xref>). GPP can be also produced by homodimeric GPS (geranyl diphosphate synthase) (Hsiao et al., <xref ref-type="bibr" rid="B30">2008</xref>; Schmidt and Gershenzon, <xref ref-type="bibr" rid="B64">2008</xref>). Interestingly, a protein from <italic>A. thaliana</italic> initially classified as GPS (At2g34630; (Bouvier et al., <xref ref-type="bibr" rid="B11">2000</xref>; Van Schie et al., <xref ref-type="bibr" rid="B73">2007</xref>)), which lost the CxxxC motifs but has conserved FARM and SARM, was shown to produce medium (C25) to long (C45) chain isoprenoid products, and was therefore renamed as polyprenyl pyrophosphate synthase (AtPPPS; Hsieh et al., <xref ref-type="bibr" rid="B31">2011</xref>).</p>
<p>The GGPPS homologs from sub. I, II and V have highly conserved FARM, SARM and one CxxxC motif (Figure <xref ref-type="fig" rid="F2">2</xref> and Supplementary Figure <xref ref-type="supplementary-material" rid="SM2">2</xref>). Homologs from <italic>A. thaliana</italic> with such protein domain structure were shown to be active as homodimers and produce GGPP (Okada et al., <xref ref-type="bibr" rid="B57">2000</xref>; Wang and Dixon, <xref ref-type="bibr" rid="B80">2009</xref>; Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>).</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p><bold>Molecular evolution of the polyprenyl synthase domain</bold>. The summarized phylogenetic tree of GGPPS from plants is shown. Branches holding more than one homolog are collapsed and the number of homologs is shown. The five classes (sub. I&#x02013;V) of GGPPS homologs in plants are shown. Branch colors represent the major plant lineages: spring green, green algae; orange, mosses; dark green, gymnosperms; and blue, angiosperms. The representative polyprenyl synthase motifs for each of the five classes are shown: the two CxxxC motifs in gray and FARM, SARM in purple. Asterisk (<sup>&#x0002A;</sup>) indicates variable amino acid residues (Supplementary Figure <xref ref-type="supplementary-material" rid="SM1">1</xref> and Supplementary Table <xref ref-type="supplementary-material" rid="SM5">2</xref>). ph-GPS: putative homologs of GPS, ph-GPS/GGPPS: putative homologs of the bifunctional GPS/GGPPS. A prototype of the second CxxxC motif (CxxxS; the serine residue is shown in yellow) appears to have been acquired in a common ancestor of gymnosperms. ph-PPPS: putative homologs of polyprenyl pyrophosphate synthase. ph-SSUI and ph-SSUII: putative homologs of the small subunit (SSU) of heterodimeric GPS. Ph-SSUI proteins have lost the two conserved FARM and SARM motifs. None of the ph-SSUII proteins have a conserved SARM (the variable mutated amino acid residue is shown in yellow) indicating loss of GGPPS capacity.</p></caption>
<graphic xlink:href="fpls-05-00230-g0002.tif"/>
</fig>
<p>Several homologs from sub. V, have lost the CxxxC motif (Figure <xref ref-type="fig" rid="F2">2</xref>). Such proteins, referred here to as ph-PPPS (putative homologs of polyprenyl pyrophosphate synthase) retain solely FARM and SARM motifs and are found at <italic>d</italic> &#x0003D; 7.03 distance from root supporting their rapid divergence (Supplementary Figure <xref ref-type="supplementary-material" rid="SM1">1</xref> and Supplementary Table <xref ref-type="supplementary-material" rid="SM5">2</xref>). The polyprenyl pyrophosphate synthase (AtPPPS, At2g34630) from <italic>A. thaliana</italic>, which can synthesize medium (C25) to long (C45) chain isoprenoid products, has a similar domain structure as the ph-PPPS proteins (Hsieh et al., <xref ref-type="bibr" rid="B31">2011</xref>).</p>
<p>Within sub. III that is found exclusively in gymnosperms, in addition to the conserved FARM and SARM, a prototype of a second CxxxC motif (CxxxS) appears to have been acquired in a common ancestor of Ginkgo, Taxus, Abies and Picea species (Figure <xref ref-type="fig" rid="F2">2</xref>, Supplementary Figure <xref ref-type="supplementary-material" rid="SM1">1</xref> and Supplementary Table <xref ref-type="supplementary-material" rid="SM5">2</xref>). A protein with similar domain structure was recently reported to be bifunctional, producing both GPP and GGPP (Schmidt et al., <xref ref-type="bibr" rid="B65">2010</xref>). GPP is the precursor for biosynthesis of monoterpenoids, a class of specialized metabolites which play roles in pollination, seed dispersal and defense mechanisms (Bohlmann and Croteau, <xref ref-type="bibr" rid="B9">1999</xref>). This suggests that the molecular changes in the protein domains of orthologs found in this class may have enabled the transition from biosynthesis of primary GGPP-derived compounds to specialized GPP-derived metabolites. In Abies and Picea species, mutation of the serine residue to cysteine resulted in a conserved second CxxxC motif (Figure <xref ref-type="fig" rid="F2">2</xref>, Supplementary Figure <xref ref-type="supplementary-material" rid="SM1">1</xref> and Supplementary Table <xref ref-type="supplementary-material" rid="SM5">2</xref>). The homolog B1A9K6 from <italic>Picea abies</italic> (Supplementary Table <xref ref-type="supplementary-material" rid="SM5">2</xref>), which retains two conserved CxxxC concomitant with FARM and SARM, was shown to produce only GPP (Schmidt and Gershenzon, <xref ref-type="bibr" rid="B64">2008</xref>).</p>
<p>The GGPPS homologs from sub. IV appear to have experienced faster sequence divergence compared to sub. III, indicated by the branch length (Figure <xref ref-type="fig" rid="F1">1</xref>). Both FARM and SARM are either missing or SARM is mutated in sub. IV but both CxxxC motifs are present (Figure <xref ref-type="fig" rid="F2">2</xref>). Sub. IV comprises of GGPPS from monocots and dicots and one homolog from gymnosperms, most of them being uncharacterized to date (Figure <xref ref-type="fig" rid="F1">1</xref>). Sub. IV is further comprised of two subclasses referred to here as ph-SSUI and ph-SSUII, i.e., putative homologs of the small subunit (SSU) of heterodimeric GPS (Tholl et al., <xref ref-type="bibr" rid="B70">2004</xref>; Wang and Dixon, <xref ref-type="bibr" rid="B80">2009</xref>). Members of both ph-SSUI and ph-SSUII were shown to be active not as GGPPS but as SSU in heterodimeric GPS complexes, producing the GPP (Tholl et al., <xref ref-type="bibr" rid="B70">2004</xref>; Wang and Dixon, <xref ref-type="bibr" rid="B80">2009</xref>). Interestingly, ph-SSUI members are mainly found in flowering plant species (Figure <xref ref-type="fig" rid="F2">2</xref> and Supplementary Table <xref ref-type="supplementary-material" rid="SM5">2</xref>). They have lost both aspartate rich motifs (Figure <xref ref-type="fig" rid="F2">2</xref>), likely rendering them inactive as homodimeric enzymes. Consistently, the Q6QLU5 homolog from <italic>Clarkia breweri</italic> (Figure <xref ref-type="fig" rid="F1">1</xref>; ph-SSUI) does not produce GGPP (Tholl et al., <xref ref-type="bibr" rid="B70">2004</xref>). A homolog from <italic>Antirrhinum majus</italic>, with similar protein domain structure was shown to form heterodimeric GPS complexes with functional GGPPS and synthesize GPP as main product in reproductive organs (Tholl et al., <xref ref-type="bibr" rid="B70">2004</xref>). In summary, this subclass of proteins with the unique motif organization (lacking both SARM and FARM but retaining both CxxxC motifs) seems to be responsible for monoterpenoids precursor biosynthesis in reproductive plant organs. Members of the ph-SSUII branch from sub. IV have intact FARM but mutated SARM (Figures <xref ref-type="fig" rid="F1">1</xref>, <xref ref-type="fig" rid="F2">2</xref> and Supplementary Table <xref ref-type="supplementary-material" rid="SM5">2</xref>). The GGPPS12 homolog from <italic>A. thaliana</italic> has such a protein domain structure and consequently, is unable to produce GGPP (Okada et al., <xref ref-type="bibr" rid="B57">2000</xref>). Furthermore, similarly to characterized proteins from ph-SSUI (Wang and Dixon, <xref ref-type="bibr" rid="B80">2009</xref>), GGPPS12 forms heterodimeric complexes with GGPPS11 and redirects biosynthesis toward GPP (Okada et al., <xref ref-type="bibr" rid="B57">2000</xref>; Wang and Dixon, <xref ref-type="bibr" rid="B80">2009</xref>). In contrast to ph-SSUI homologs, which are likely to play a role in monoterpenoid biosynthesis mainly in reproductive organs, members of the ph-SSUII were proposed, based on their expression pattern, to constitutively participate in GPP biosynthesis during plant development (Wang and Dixon, <xref ref-type="bibr" rid="B80">2009</xref>).</p>
<p>Taken together, GGPPS homologs with canonical protein domain structure are present in all major plant lineages investigated here. Early after the diversification of land plants, duplication events led to multiple GGPPS genes per species, providing raw material for evolutionary change. Yet, with the divergence of land plants their functional complexity and need for defense strategies also diversified.</p>
<p>By neofunctionalization of GGPPS, novel heterodimeric GPS complex formation capacity, and thereby the GPP biosynthesis was enabled by the acquisition of a second CxxxC motif that likely occurred in the ancestor of gymnosperms. GPP serves as precursor of monoterpenes, which are involved in direct defense mechanisms against herbivores or pathogens, they can indirectly protect plants by attracting predators of attacking herbivores, or they can be emitted from floral tissues to attract pollinators (Pichersky and Gershenzon, <xref ref-type="bibr" rid="B58">2002</xref>; Chen et al., <xref ref-type="bibr" rid="B17">2003</xref>; Keeling and Bohlmann, <xref ref-type="bibr" rid="B38">2006</xref>). Members of the ph-PPPS (sub. V), whose protein domains are similar to the AtPPPS from <italic>A. thaliana</italic> (Bouvier et al., <xref ref-type="bibr" rid="B11">2000</xref>; Hsieh et al., <xref ref-type="bibr" rid="B31">2011</xref>) are likely another example of neofunctionalization. They have lost the two CxxxC motifs and in <italic>A. thaliana</italic>, this enzyme is able to generate multiple products with medium to long chain lengths (C25&#x02013;C45) (Hsieh et al., <xref ref-type="bibr" rid="B31">2011</xref>).</p>
</sec>
<sec>
<title>Lineage-specific expansion of <italic>GGPPS</italic> is most evident in arabidopsis</title>
<p>Duplication events leading to lineage-specific expansion of GGPPS (i.e., no discernible ortholog in closely related species) occurred in land plants (Supplementary Figure <xref ref-type="supplementary-material" rid="SM1">1</xref>). The most prominent example of lineage-specific expansion, with respect to our taxon sampling, is found in the Arabidopsis lineage where, the high GGPPSs sequence similarity determines their clustering in the phylogenetic tree (Figure <xref ref-type="fig" rid="F1">1</xref>). The majority of the GGPPS paralogs in <italic>A. thaliana</italic> and its closest relative <italic>A. lyrata</italic> are found in the same clade and are more similar to each other than to homologs from other species, which is supported by the high branch support values (aLRT &#x02265; 0.8). In particular, <italic>A. thaliana</italic> encodes the largest number of paralogs from the species investigated here, including a unique set of GGPPSs (GGPPS6, GGPPS7, GGPPS9, and GGPPS10) found only in this species (Figure <xref ref-type="fig" rid="F1">1</xref>).</p>
<p>Lineage-specific expansion followed by subfunctionalization is known to be an important mechanism for diversification of gene function (Lespinet et al., <xref ref-type="bibr" rid="B46">2002</xref>; Nowick and Stubbs, <xref ref-type="bibr" rid="B56">2010</xref>). For example, the expression of lineage-specific genes in <italic>A. thaliana</italic> was observed to be confined to fewer tissues, where they are involved particularly in abiotic stress responses (Donoghue et al., <xref ref-type="bibr" rid="B20">2011</xref>).</p>
<p>The expression of the <italic>GGPPS</italic> paralogs specific to <italic>A. thaliana</italic> is under strict developmental control, being expressed in specific tissues and at distinct time during plant development (Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>). For example, <italic>GGPPS6</italic> is expressed only in the meristematic zone of the root tip (columella and lateral root cap), whereas <italic>GGPPS10</italic> expression is distributed over the length of the root but not in the root tip (Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>). Together, these indicate that LSG GGPPS paralogs may have special function only at particular stages during plant development and possibly in response to external environmental signals.</p>
</sec>
<sec>
<title>Subfunctionalization maintains multiple <italic>GGPPS</italic> paralogs in the <italic>a. thaliana</italic> genome</title>
<p>Multiple <italic>GGPPS</italic> paralogs might have been maintained in the genome of <italic>A. thaliana</italic> due to the divergence in their expression patterns. There should be no selective constraints blocking this divergence as long as the initial expression pattern of the ancestral gene is maintained. Thus, we expect that the GGPPS paralogs may have specialized functions in <italic>A. thaliana</italic> according to their expression profiles.</p>
<p>To test this hypothesis we mapped <italic>A. thaliana GGPPSs</italic> expression data onto the phylogenetic tree and reconstructed the ancestral expression states (Figure <xref ref-type="fig" rid="F3">3</xref>). Using a comprehensive dataset for gene expression during <italic>A. thaliana</italic> development (see Materials and Methods) we defined eight expression clusters containing the <italic>GGPPS</italic> paralogs referred to as cI-VIII (Figure <xref ref-type="fig" rid="F3">3A</xref>). Next, we mapped the expression clusters as discrete states onto the phylogenetic tree of the <italic>GGPPS</italic> paralogs in <italic>A. thaliana.</italic> The reconstruction of ancestral expression states was performed using the Mesquite v2.75 system for phylogenetic computing (Maddison and Maddison, <xref ref-type="bibr" rid="B53">2011</xref>), which allows the inference of the most likely hypothetical expression states for the ancestral gene under a maximum parsimony model (Figure <xref ref-type="fig" rid="F3">3B</xref>). The expression states (state 1&#x02013;8) are shown as colored boxes at the terminal branches. A change in color between sister branches indicates a putative divergence in the expression pattern of the paralog.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p><bold>Expression pattern analysis of the <italic>GGPPS</italic> genes from <italic>A. thaliana</italic> and ancestral states reconstruction. (A)</bold> The clustering of microarray expression data is shown as heatmap. The expression clusters (cI-VIII) of the <italic>GGPPS</italic> paralogs identified based on Pearson correlation coefficients with a threshold set to <italic>r</italic> &#x0003D; 0.65 (see Materials and Methods) are shown. The various organ and tissue samples were assigned to three major classes: root (white box), vegetative (green box; includes samples from the seedlings, rosette leaves, stems, and cauline leaves) and reproductive (pink box; includes samples from flowers and seeds). <bold>(B)</bold> The phylogenetic reconstruction of ancestral expression states using parsimony is shown. The colors corresponding to each expression state (state 1&#x02013;8) are shown in the legend. Colored boxes are shown at terminal branches indicating the observed expression pattern cluster. Branches with multiple colors are associated with several possible expression states.</p></caption>
<graphic xlink:href="fpls-05-00230-g0003.tif"/>
</fig>
<p>The ancestral expression pattern, state 2, is represented by an ubiquitous gene expression during plant development (Figure <xref ref-type="fig" rid="F3">3B</xref>). From an evolutionary perspective, ubiquitous expression is characteristic to housekeeping genes, which are generally associated with slower evolutionary rates (Hurst and Smith, <xref ref-type="bibr" rid="B32">1999</xref>; Koonin, <xref ref-type="bibr" rid="B41">2009</xref>). Thus, housekeeping genes are less likely to experience divergence of their expression pattern. As expected, the parsimony reconstruction supports a ubiquitous expression pattern (state 2) of the ancestral <italic>GGPPS</italic> in <italic>A. thaliana</italic> during plant development. <italic>GGPPS11</italic> and <italic>GGPPS12</italic> represent expression state 2, while the expression pattern of the remaining <italic>GGPPS</italic> paralogs appears to be under developmental control. As such, the expression pattern of the <italic>GGPPS</italic> gene family during development diverged during several rounds of duplication. Some of the emerging expression states are clade specific (state 6; Figure <xref ref-type="fig" rid="F3">3B</xref>). However, there is also an example of same or similar expression pattern that appears to have emerged at different positions in the tree. For example, <italic>GGPPS5</italic> and <italic>GGPPS8</italic> are part of the same class V as they have a similar expression pattern (<italic>r</italic> &#x0003D; 0.76) but are found in distinct phylogenetic clades (Figure <xref ref-type="fig" rid="F3">3</xref>). This suggests that these two paralogs may have independently acquired or lost similar cis-regulatory elements responsible for the regulation of expression during development. Furthermore, several paralogs share a similar expression pattern, which likely reflects the short time since their divergence as in the case of <italic>GGPPS9</italic> and <italic>GGPPS10</italic> (Figure <xref ref-type="fig" rid="F3">3B</xref>).</p>
<p>To exclude random events, we evaluated the statistical significance of the correlation between sequence and expression divergence by performing a permutation test in which the expression states were randomly reshuffled. Subsequently, we performed 10,000 ancestral states reconstructions and compared the observed parsimony score against the random distribution from which we calculated the <italic>p</italic>-values. The number of steps required in the random distribution ranged from 7 to 10 in the case of the ancestral states reconstruction of the expression patterns during development. The observed parsimony score of 7 steps indicates non-random distribution that is supported statistically by a permutation <italic>p</italic>-value of 0.008. Therefore, during the evolution of the <italic>GGPPS</italic> gene family in <italic>A. thaliana</italic> the divergence in expression pattern appears to be coupled, at least partially, to sequence divergence.</p>
<p><italic>GGPPS12</italic> and <italic>GGPPS11</italic> genes have an ancestral, ubiquitous expression pattern (Figure <xref ref-type="fig" rid="F3">3</xref>) that may reflect their requirement as housekeeping genes encoding for GGPPS and SSUII, respectively. <italic>GGPPS5</italic> was proposed to encode a pseudogene based on the sequence analysis, which identified a frame shift mutation rendering translation of a truncated GGPPS protein (Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>). Nevertheless, probe based hybridization arrays were able to detect specific expression of <italic>GGPPS5</italic> gene in different organs of <italic>A. thaliana</italic> (Figure <xref ref-type="fig" rid="F3">3</xref>) indicating that <italic>GGPPS5</italic> is an expressed pseudogene also known as ghost pseudogene (Zheng and Gerstein, <xref ref-type="bibr" rid="B82">2007</xref>). As a ghost pseudogene, <italic>GGPPS5</italic> could play a role in regulating the function of closely related paralogs, for example by competing for the cellular RNA degradation machinery (Hirotsune et al., <xref ref-type="bibr" rid="B29">2003</xref>).</p>
<p><italic>GGPPS1</italic> and <italic>GGPPS2</italic> are expressed ubiquitously in all plant organs, but at much lower levels than <italic>GGPPS11</italic> and <italic>GGPPS12</italic> (Figure <xref ref-type="fig" rid="F3">3A</xref>; Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>). <italic>GGPPS3, GGPPS4</italic>, and <italic>GGPPS8</italic> have a mosaic of expression patterns during the plant development. <italic>GGPPS3</italic> and <italic>GGPPS4</italic> are predominantly expressed in reproductive organs and root vasculature, whereas <italic>GGPPS8</italic> is specifically expressed in the outer cell layers above the mitotically active area of the root (Figure <xref ref-type="fig" rid="F3">3A</xref>; Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>). The expression of the <italic>GGPPS</italic> paralogs specific to <italic>A. thaliana</italic> (<italic>GGPPS6, GGPPS7, GGPPS9</italic>, and <italic>GGPPS10</italic>) is confined to particular tissues (Figure <xref ref-type="fig" rid="F3">3A</xref>; Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>), suggesting that they might play a role only at defined developmental stages and/or in fine tuning adaptation to specific conditions.</p>
<p>Collectively, in addition to neofunctionalization of GGPPS, another mechanism allowing the maintenance of multiple duplicated <italic>GGPPS</italic> paralogs in the <italic>A. thaliana</italic> genome appears to be their subfunctionalization in terms of differential expression pattern during plant development.</p>
</sec>
<sec>
<title>The duplication timing reveals a correlation between age and expression pattern of the <italic>GGPPSs</italic> from <italic>a. thaliana</italic></title>
<p><italic>A. thaliana</italic> is an ancient polyploid that through evolutionary history experienced three major whole genome duplication events termed &#x003B3;, &#x003B2;, and &#x003B1; in the order of their occurrence (Bowers et al., <xref ref-type="bibr" rid="B12">2003</xref>). Species such as <italic>Carica papaya</italic> that have not experienced any other whole genome duplication since the &#x003B3;-WGD event, should have a final set of duplicated genes that have been retained after polyploidisation (Langham et al., <xref ref-type="bibr" rid="B44">2004</xref>; Ming et al., <xref ref-type="bibr" rid="B55">2008</xref>).</p>
<p>To identify the <italic>GGPPS</italic> homologs in <italic>A. thaliana</italic> retained in the <italic>C. papaya</italic> genome, we performed a cross-genome syntenic analysis using the Plant Genome Duplication Database (PGDD, <ext-link ext-link-type="uri" xlink:href="http://chibba.agtec.uga.edu/duplication/">http://chibba.agtec.uga.edu/duplication/</ext-link>). We selected 100 kb of genomic regions adjacent to the <italic>A. thaliana GGPPS</italic> paralogs and the <italic>C. papaya</italic> genome as outgroup. <italic>GGPPS11</italic> and <italic>GGPPS12</italic> are the only paralogs from <italic>A. thaliana</italic>, which have orthologs in syntenic regions of the <italic>C. papaya</italic> genome (Supplementary Figure <xref ref-type="supplementary-material" rid="SM3">3A</xref>). Next, we have estimated the relative divergence dates of the <italic>GGPPSs</italic> from <italic>A. thaliana, A. lyrata</italic> and <italic>C. papaya</italic> based on their codon evolution and using an uncorrelated relaxed clock model (see Materials and Methods).</p>
<p>The molecular-dated phylogenetic tree indicates that after the duplication of an ancestral GGPPS within the time range of the oldest &#x003B3;-WGD one copy evolved into the common ancestor of the extant <italic>GGPPS12</italic> from <italic>A. thaliana</italic> and its orthologs from <italic>A. lyrata</italic> and <italic>C. papaya</italic>. The other copy duplicated ca. 97 mya and evolved into a <italic>GGPPS</italic> gene in <italic>C. papaya</italic> and into the common ancestor of the remaining 11 extant paralogs in <italic>A. thaliana</italic> (<italic>GGPPS1-GGPPS11</italic>) and their orthologs from <italic>A. lyrata</italic> (Figure <xref ref-type="fig" rid="F4">4</xref>). The <italic>GGPPS</italic> family from the Arabidopsis lineage continued diversifying and expanding during a time range spanning the subsequent &#x003B2; and &#x003B1;-WGD events (Figure <xref ref-type="fig" rid="F4">4</xref>). As such, during the &#x003B1;-WGD, the extant <italic>GGPPS2</italic> and <italic>GGPPS11</italic> arose (ca. 48 mya) followed by <italic>GGPPS3</italic> and <italic>GGPPS4</italic>, which formed ca. 41 mya (Figure <xref ref-type="fig" rid="F4">4</xref>). The remaining extant paralogs (<italic>GGPPS1, GGPPS5&#x02013;GGPPS10</italic>) became fixed in their actual location within the <italic>A. thaliana</italic> genome only after the most recent &#x003B1;-WGD. <italic>GGPPS1</italic> and <italic>GGPPS8</italic> are estimated to have diverged ca. 30 mya, whereas the most recently evolved paralogs in <italic>A. thaliana</italic> are <italic>GGPPS6, GGPPS7, GGPPS9</italic>, and <italic>GGPPS10</italic>, which arose after sequential duplication of their most recent ancestor between 6 and 9 mya (Figure <xref ref-type="fig" rid="F4">4</xref>).</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p><bold>The calibrated <italic>GGPPS</italic> chronogram</bold>. The maximum clade credibility tree and the estimated divergence dates based on total evidence across 24 homologs from <italic>A.thaliana, A. lyrata</italic> and <italic>C. papaya</italic> are shown. Branch support values are shown in gray. Note the difference in the relative order between the two clades holding GGPPS2, GGPPS11 and GGPPS5-GGPPS7, GGPPS9, GGPPS10 from Figure <xref ref-type="fig" rid="F1">1</xref>. Both topologies in Figures <xref ref-type="fig" rid="F1">1</xref>, <xref ref-type="fig" rid="F4">4</xref> have high support values but are based on different models of evolution that use amino acid and codon sequences, respectively (see Materials and Methods). Mean divergence dates for all nodes are shown in bold black. Gray bars represent the 95% high posterior density credibility interval for node age. Putative intervals for the WGD events are shown. The most ancient event, common to Arabidopsis, Carica, Vitis, and Populus, is the &#x003B3;-WGD, which separated monocots and eudicot lineages ca. 125&#x02013;140 mya (Blanc and Wolfe, <xref ref-type="bibr" rid="B8">2004</xref>; Davies et al., <xref ref-type="bibr" rid="B19">2004</xref>; Jaillon et al., <xref ref-type="bibr" rid="B34">2007</xref>). The following more recent WGDs are assumed to have occurred within the <italic>Brassicales</italic>, with the &#x003B2; event having uncertain position after the point of divergence from <italic>Caricaceae</italic> ca. 72 mya (Ming et al., <xref ref-type="bibr" rid="B55">2008</xref>). The most recent &#x003B1;-WGD that occurred ca. 38&#x02013;70 mya is placed within the <italic>Brassicaceae</italic> (Bowers et al., <xref ref-type="bibr" rid="B12">2003</xref>; Barker et al., <xref ref-type="bibr" rid="B4">2009</xref>) and predates the divergence of <italic>A. thaliana</italic> and <italic>A. lyrata</italic>, which was estimated to have occurred ca. 13 mya (Beilstein et al., <xref ref-type="bibr" rid="B6">2010</xref>). The nodes used as calibration points are indicated by black squares.</p></caption>
<graphic xlink:href="fpls-05-00230-g0004.tif"/>
</fig>
<p>Generally, following WGD events, many genes return to single copy by fractionation (Lyons et al., <xref ref-type="bibr" rid="B51">2008</xref>). However, some duplicate gene pairs such as genes encoding specialized metabolism enzymes or transcription factors are preferentially maintained (Blanc and Wolfe, <xref ref-type="bibr" rid="B8">2004</xref>; Cannon et al., <xref ref-type="bibr" rid="B14">2004</xref>; Freeling, <xref ref-type="bibr" rid="B24">2009</xref>). Based on the synteny of the surrounding genomic regions, four <italic>GGPPS</italic> paralogs (<italic>GGPPS2, GGPPS3, GGPPS4</italic>, and <italic>GGPPS11</italic>) are found within &#x003B1;-WGD blocks (Bowers et al., <xref ref-type="bibr" rid="B12">2003</xref>; Thomas et al., <xref ref-type="bibr" rid="B71">2006</xref>) (Supplementary Figure <xref ref-type="supplementary-material" rid="SM3">3B</xref>). Whereas <italic>GGPPS2</italic> and <italic>GGPPS11</italic> form a pair within one &#x003B1;-WGD block, <italic>GGPPS3</italic> and <italic>GGPPS4</italic> are not retained in pairs with other <italic>GGPPS</italic> paralogs, suggesting that their counterparts were most probably lost due to fractionation processes.</p>
<p>Together, <italic>GGPPS12</italic> appears to be the oldest paralog in <italic>A. thaliana</italic> followed by <italic>GGPPS2-4</italic> and <italic>GGPPS11</italic> (Figure <xref ref-type="fig" rid="F4">4</xref>). Furthermore, <italic>GGPPS2-4</italic> and <italic>GGPPS11</italic> were found in &#x003B1;-WGD blocks and the dated molecular phylogeny confirms their divergences during the time range of the &#x003B1;-WGD, after the ancestor of Arabidopsis split from <italic>C. papaya</italic>. In contrast to the old paralogs in <italic>A. thaliana, GGPPS6, GGPPS7, GGPPS9</italic>, and <italic>GGPPS10</italic> are paralogs specific to <italic>A. thaliana</italic>. After splitting from <italic>A. lyrata</italic>, the genome of <italic>A. thaliana</italic> experienced a 30% reduction in size and at least nine chromosomal rearrangements (Yogeeswaran et al., <xref ref-type="bibr" rid="B81">2005</xref>; Lysak et al., <xref ref-type="bibr" rid="B52">2006</xref>). Thus, it is possible that the <italic>GGPPSs</italic> specific to <italic>A. thaliana</italic> evolved during these genome reshaping events.</p>
<p>The relative age of the <italic>GGPPSs</italic> corresponds to their divergence in their expression pattern. Old paralogs (e.g., <italic>GGPPS11</italic> and <italic>GGPPS12</italic>) are ubiquitously expressed and at high levels whereas young paralogs (e.g., <italic>GGPPS6</italic> and <italic>GGPPS10</italic>) are predominantly expressed in specific tissues and cell types and generally at lower levels (Figure <xref ref-type="fig" rid="F3">3A</xref>; Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>) bringing further indication for subfunctionalization of young paralogs.</p>
</sec>
</sec>
<sec sec-type="conclusions" id="s3">
<title>Conclusions</title>
<p>The <italic>A. thaliana GGPPS</italic> gene family is an interesting example of gene evolution involving gene duplication followed by neo- and subfunctionalization as well as pseudogenization. GGPPS homologs with canonical protein domain structure are present in all major plant lineages investigated in this study. Nevertheless, it is possible that neofunctionalization of GGPPS paralogs enabled optimized biosynthesis of primary and specialized metabolites. Furthermore, it was recently proposed that functionality inference for the polyprenyl transferases, should not solely rely on primary sequence due to promiscuity of this class of enzymes (Wallrapp et al., <xref ref-type="bibr" rid="B79">2013</xref>). In the case of the GGPPS family from <italic>A. thaliana</italic>, 10 out of 12 predicted isozymes were shown, using <italic>in vitro</italic> and/or <italic>E. coli</italic> complementation assays, to produce GGPP as major product (see Introduction; Zhu et al., <xref ref-type="bibr" rid="B83">1997a</xref>,<xref ref-type="bibr" rid="B84">b</xref>; Okada et al., <xref ref-type="bibr" rid="B57">2000</xref>; Wang and Dixon, <xref ref-type="bibr" rid="B80">2009</xref>; Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>). Still, one cannot exclude that some GGPPS will produce longer polyprenyl diphosphates, thereby providing further means of neofunctionalization.</p>
<p>Our functional divergence analysis suggests that changes in the expression patterns of the <italic>GGPPS</italic> paralogs occurring after gene duplication led to developmental and/or condition specific functional evolution. The ancestral states reconstruction showed a highly non-random distribution of developmental expression patterns in the phylogeny, indicating a significant degree of coupling between sequence and developmental expression divergence. This has prompted us to predict that preserving paralogs with different expression may be of importance for the functional divergence of the <italic>GGPPS</italic> paralogs in <italic>A. thaliana.</italic> Moreover, it was recently proposed that the distinct subcellular localization of the GGPPS paralogs may enable a differential allocation of GGPP precursors to downstream isoprenoid pathways, and as such provide an additional mean of their maintenance in the genome (Beck et al., <xref ref-type="bibr" rid="B5">2013</xref>).</p>
<p>The evolutionary pattern of the <italic>GGPPS</italic> gene family in plants, including variation in paralog number mirroring evolution of plant complexity, lineage-specific expansion, neo- and subfunctionalization is consistent with the idea of GGPPSs as flexible enzymes that might have evolved to support adaptation to various specific conditions. This evolutionary pattern can be recognized in many other gene families, in particular those involved in the specialized metabolism: the cytochrome P450-dependent monooxygenases (P450s) (Bak et al., <xref ref-type="bibr" rid="B3">2011</xref>), glucosidases (Kliebenstein et al., <xref ref-type="bibr" rid="B40">2005</xref>) or the terpene synthase family (Tholl, <xref ref-type="bibr" rid="B69">2006</xref>).</p>
<p>It will be interesting to examine by functional analyses of ggpps single and multiple mutants whether the newly evolved <italic>GGPPS</italic> paralogs in <italic>A. thaliana</italic> are functionally redundant or have indeed specific roles in adaptation to various conditions in a distinct spatial-temporal fashion and in response to specific environmental conditions.</p>
<sec>
<title>Conflict of interest statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p></sec>
</sec>
</body>
<back>
<ack>
<p>We would like to thank Dr. Katja B&#x000E4;renfaller for critically reading the manuscript and Dr. Christophe Dessimoz for valuable discussion and suggestions. This work was supported by a grant from ETH Zurich (TH-51 06-1) and the EU FP7 contract 245143 (TiMet).</p>
</ack>
<sec sec-type="supplementary material" id="s4">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="http://www.frontiersin.org/journal/10.3389/fpls.2014.00230/abstract">http://www.frontiersin.org/journal/10.3389/fpls.2014.00230/abstract</ext-link></p>
<supplementary-material xlink:href="DataSheet1.ZIP" id="SM1" mimetype="application/zip" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Supplementary Figure 1</label>
<caption><p><bold>Maximum likelihood consensus tree of the GGPPS homologs from plants</bold>. Posterior probabilities are shown. Branch lengths correspond to evolutionary distances. Branch colors represent the major plant lineages: spring green, green algae; orange, mosses; dark green, gymnosperms; and blue, angiosperms.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="DataSheet1.ZIP" id="SM2" mimetype="application/zip" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Supplementary Figure 2</label>
<caption><p><bold>Amino acid MSA of 119 GGPPS homologs from plants</bold>. The CxxxC motifs are shown in gray. The FARM and SARM motifs are shown in purple.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="DataSheet1.ZIP" id="SM3" mimetype="application/zip" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Supplementary Figure 3</label>
<caption><p><bold>Syntenic relationships of <italic>GGPPS</italic> paralogs from <italic>A. thaliana</italic> using <italic>C. papaya</italic> as outgroup. (A)</bold> Blocks duplicated by WGD and harboring <italic>GGPPS11</italic> and <italic>GGPPS12</italic> are shown. Their orthologs found in syntenic region of <italic>C. papaya</italic> genome are indicated by red connecting lines. <bold>(B)</bold> <italic>GGPPS2, GGPPS3, GGPPS4</italic> and <italic>GGPPS11</italic> paralogs from <italic>A. thaliana</italic> found within &#x003B1;-WGD blocks on chromosome 2 and 4, respectively, are shown. Only <italic>GGPPS2</italic> and <italic>GGPPS11</italic> are retained as a pair (connected by red line), whereas the counterparts of <italic>GGPPS3</italic> and <italic>GGPPS4</italic> appear to have been lost from the corresponding syntenic region. Each genomic region spans 100 kb. The <italic>GGPPS</italic> paralogs and their orthologs from <italic>C. papaya</italic> are shown as red arrows. Blue arrows indicate anchor genes and they are connected by blue lines if retain within a WGD block.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="DataSheet1.ZIP" id="SM4" mimetype="application/zip" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Supplementary Table 1</label>
<caption><p><bold>119 GGPPS protein sequences used for the phylogenetic tree reconstruction</bold>.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="DataSheet1.ZIP" id="SM5" mimetype="application/zip" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Supplementary Table 2</label>
<caption><p><bold>Polyprenyl synthase domain evolution</bold>.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="DataSheet1.ZIP" id="SM6" mimetype="application/zip" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Supplementary Dataset 1</label>
<caption><p><bold>MAFFT MSA in FASTA format of 119 GGPPS homologs from plants</bold>.</p></caption>
</supplementary-material>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ament</surname> <given-names>K.</given-names></name> <name><surname>Van Schie</surname> <given-names>C. C.</given-names></name> <name><surname>Bouwmeester</surname> <given-names>H. J.</given-names></name> <name><surname>Haring</surname> <given-names>M. A.</given-names></name> <name><surname>Schuurink</surname> <given-names>R. C.</given-names></name></person-group> (<year>2006</year>). <article-title>Induction of a leaf specific geranylgeranyl pyrophosphate synthase and emission of (E,E)-4,8,12-trimethyltrideca-1,3,7,11-tetraene in tomato are dependent on both jasmonic acid and salicylic acid signaling pathways</article-title>. <source>Planta</source> <volume>224</volume>, <fpage>1197</fpage>&#x02013;<lpage>1208</lpage>. <pub-id pub-id-type="doi">10.1007/s00425-006-0301-5</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Anisimova</surname> <given-names>M.</given-names></name> <name><surname>Gascuel</surname> <given-names>O.</given-names></name></person-group> (<year>2006</year>). <article-title>Approximate likelihood-ratio test for branches: a fast, accurate, and powerful alternative</article-title>. <source>Syst. Biol</source>. <volume>55</volume>, <fpage>539</fpage>&#x02013;<lpage>552</lpage>. <pub-id pub-id-type="doi">10.1080/10635150600755453</pub-id><pub-id pub-id-type="pmid">16785212</pub-id></citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bak</surname> <given-names>S.</given-names></name> <name><surname>Beisson</surname> <given-names>F.</given-names></name> <name><surname>Bishop</surname> <given-names>G.</given-names></name> <name><surname>Hamberger</surname> <given-names>B.</given-names></name> <name><surname>Hofer</surname> <given-names>R.</given-names></name> <name><surname>Paquette</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Cytochromes p450</article-title>. <source>Arabidopsis Book</source> <volume>9</volume>:<fpage>e0144</fpage>. <pub-id pub-id-type="doi">10.1199/tab.0144</pub-id><pub-id pub-id-type="pmid">22303269</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Barker</surname> <given-names>M. S.</given-names></name> <name><surname>Vogel</surname> <given-names>H.</given-names></name> <name><surname>Schranz</surname> <given-names>M. E.</given-names></name></person-group> (<year>2009</year>). <article-title>Paleopolyploidy in the Brassicales: analyses of the Cleome transcriptome elucidate the history of genome duplications in Arabidopsis and other Brassicales</article-title>. <source>Genome Biol. Evol</source>. <volume>1</volume>, <fpage>391</fpage>&#x02013;<lpage>399</lpage>. <pub-id pub-id-type="doi">10.1093/gbe/evp040</pub-id><pub-id pub-id-type="pmid">20333207</pub-id></citation>
</ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Beck</surname> <given-names>G.</given-names></name> <name><surname>Coman</surname> <given-names>D.</given-names></name> <name><surname>Herren</surname> <given-names>E.</given-names></name> <name><surname>Ruiz-Sola</surname> <given-names>M. &#x000C1;.</given-names></name> <name><surname>Rodr&#x000ED;guez-Concepci&#x000F3;n</surname> <given-names>M.</given-names></name> <name><surname>Gruissem</surname> <given-names>W.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Characterization of the GGPP synthase gene family in <italic>Arabidopsis thaliana</italic></article-title>. <source>Plant Mol. Biol</source>. <volume>82</volume>, <fpage>393</fpage>&#x02013;<lpage>416</lpage>. <pub-id pub-id-type="doi">10.1007/s11103-013-0070-z</pub-id><pub-id pub-id-type="pmid">23729351</pub-id></citation>
</ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Beilstein</surname> <given-names>M. A.</given-names></name> <name><surname>Nagalingum</surname> <given-names>N. S.</given-names></name> <name><surname>Clements</surname> <given-names>M. D.</given-names></name> <name><surname>Manchester</surname> <given-names>S. R.</given-names></name> <name><surname>Mathews</surname> <given-names>S.</given-names></name></person-group> (<year>2010</year>). <article-title>Dated molecular phylogenies indicate a Miocene origin for <italic>Arabidopsis thaliana</italic></article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>107</volume>, <fpage>18724</fpage>&#x02013;<lpage>18728</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0909766107</pub-id><pub-id pub-id-type="pmid">20921408</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Berglund-Sonnhammer</surname> <given-names>A. C.</given-names></name> <name><surname>Steffansson</surname> <given-names>P.</given-names></name> <name><surname>Betts</surname> <given-names>M. J.</given-names></name> <name><surname>Liberles</surname> <given-names>D. A.</given-names></name></person-group> (<year>2006</year>). <article-title>Optimal gene trees from sequences and species trees using a soft interpretation of parsimony</article-title>. <source>J. Mol. Evol</source>. <volume>63</volume>, <fpage>240</fpage>&#x02013;<lpage>250</lpage>. <pub-id pub-id-type="doi">10.1007/s00239-005-0096-1</pub-id><pub-id pub-id-type="pmid">16830091</pub-id></citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Blanc</surname> <given-names>G.</given-names></name> <name><surname>Wolfe</surname> <given-names>K. H.</given-names></name></person-group> (<year>2004</year>). <article-title>Functional divergence of duplicated genes formed by polyploidy during Arabidopsis evolution</article-title>. <source>Plant Cell</source> <volume>16</volume>, <fpage>1679</fpage>&#x02013;<lpage>1691</lpage>. <pub-id pub-id-type="doi">10.1105/tpc.021410</pub-id><pub-id pub-id-type="pmid">15208398</pub-id></citation>
</ref>
<ref id="B9">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Bohlmann</surname> <given-names>J.</given-names></name> <name><surname>Croteau</surname> <given-names>R.</given-names></name></person-group> (<year>1999</year>). <article-title>Diversity and variability of terpenoid defences in conifers: molecular genetics, biochemistry and evolution of the terpene synthase gene family in grand fir (Abies Grandis)</article-title>, in <source>Insect&#x02013;Plant Interactions and Induced Plant Defence</source>, eds <person-group person-group-type="editor"><name><surname>Chadwick</surname> <given-names>D. J.</given-names></name> <name><surname>Goode</surname> <given-names>J. A.</given-names></name></person-group> (<publisher-loc>Chichester</publisher-loc>: <publisher-name>John Wiley and Sons, Ltd</publisher-name>.), <fpage>132</fpage>&#x02013;<lpage>149</lpage>.</citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bouvier</surname> <given-names>F.</given-names></name> <name><surname>Rahier</surname> <given-names>A.</given-names></name> <name><surname>Camara</surname> <given-names>B.</given-names></name></person-group> (<year>2005</year>). <article-title>Biogenesis, molecular regulation and function of plant isoprenoids</article-title>. <source>Prog. Lipid Res</source>. <volume>44</volume>, <fpage>357</fpage>&#x02013;<lpage>429</lpage>. <pub-id pub-id-type="doi">10.1016/j.plipres.2005.09.003</pub-id><pub-id pub-id-type="pmid">16289312</pub-id></citation>
</ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bouvier</surname> <given-names>F.</given-names></name> <name><surname>Suire</surname> <given-names>C.</given-names></name> <name><surname>D&#x00027;harlingue</surname> <given-names>A.</given-names></name> <name><surname>Backhaus</surname> <given-names>R. A.</given-names></name> <name><surname>Camara</surname> <given-names>B.</given-names></name></person-group> (<year>2000</year>). <article-title>Molecular cloning of geranyl diphosphate synthase and compartmentation of monoterpene synthesis in plant cells</article-title>. <source>Plant J</source>. <volume>24</volume>, <fpage>241</fpage>&#x02013;<lpage>252</lpage>. <pub-id pub-id-type="doi">10.1046/j.1365-313x.2000.00875.x</pub-id><pub-id pub-id-type="pmid">11069698</pub-id></citation>
</ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bowers</surname> <given-names>J. E.</given-names></name> <name><surname>Chapman</surname> <given-names>B. A.</given-names></name> <name><surname>Rong</surname> <given-names>J. K.</given-names></name> <name><surname>Paterson</surname> <given-names>A. H.</given-names></name></person-group> (<year>2003</year>). <article-title>Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events</article-title>. <source>Nature</source> <volume>422</volume>, <fpage>433</fpage>&#x02013;<lpage>438</lpage>. <pub-id pub-id-type="doi">10.1038/nature01521</pub-id><pub-id pub-id-type="pmid">12660784</pub-id></citation>
</ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Burke</surname> <given-names>C. C.</given-names></name> <name><surname>Wildung</surname> <given-names>M. R.</given-names></name> <name><surname>Croteau</surname> <given-names>R.</given-names></name></person-group> (<year>1999</year>). <article-title>Geranyl diphosphate synthase: cloning, expression, and characterization of this prenyltransferase as a heterodimer</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>96</volume>, <fpage>13062</fpage>&#x02013;<lpage>13067</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.96.23.13062</pub-id><pub-id pub-id-type="pmid">10557273</pub-id></citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cannon</surname> <given-names>S. B.</given-names></name> <name><surname>Mitra</surname> <given-names>A.</given-names></name> <name><surname>Baumgarten</surname> <given-names>A.</given-names></name> <name><surname>Young</surname> <given-names>N. D.</given-names></name> <name><surname>May</surname> <given-names>G.</given-names></name></person-group> (<year>2004</year>). <article-title>The roles of segmental and tandem gene duplication in the evolution of large gene families in <italic>Arabidopsis thaliana</italic></article-title>. <source>BMC Plant Biol</source>. <volume>4</volume>:<fpage>10</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2229-4-10</pub-id><pub-id pub-id-type="pmid">15171794</pub-id></citation>
</ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>A. J.</given-names></name> <name><surname>Kroon</surname> <given-names>P. A.</given-names></name> <name><surname>Poulter</surname> <given-names>C. D.</given-names></name></person-group> (<year>1994</year>). <article-title>Isoprenyl diphosphate synthases - protein-sequence comparisons, a phylogenetic tree, and predictions of secondary structure</article-title>. <source>Protein Sci</source>. <volume>3</volume>, <fpage>600</fpage>&#x02013;<lpage>607</lpage>. <pub-id pub-id-type="pmid">8003978</pub-id></citation>
</ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>F.</given-names></name> <name><surname>Tholl</surname> <given-names>D.</given-names></name> <name><surname>D&#x00027;auria</surname> <given-names>J. C.</given-names></name> <name><surname>Farooq</surname> <given-names>A.</given-names></name> <name><surname>Pichersky</surname> <given-names>E.</given-names></name> <name><surname>Gershenzon</surname> <given-names>J.</given-names></name></person-group> (<year>2003</year>). <article-title>Biosynthesis and emission of terpenoid volatiles from Arabidopsis flowers</article-title>. <source>Plant Cell</source> <volume>15</volume>, <fpage>481</fpage>&#x02013;<lpage>494</lpage>. <pub-id pub-id-type="doi">10.1105/tpc.007989</pub-id><pub-id pub-id-type="pmid">12566586</pub-id></citation>
</ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Closa</surname> <given-names>M.</given-names></name> <name><surname>Vranov&#x000E1;</surname> <given-names>E.</given-names></name> <name><surname>Bortolotti</surname> <given-names>C.</given-names></name> <name><surname>Bigler</surname> <given-names>L.</given-names></name> <name><surname>Arro</surname> <given-names>M.</given-names></name> <name><surname>Ferrer</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>The <italic>Arabidopsis thaliana</italic> FPP synthase isozymes have overlapping and specific functions in isoprenoid biosynthesis, and complete loss of FPP synthase activity causes early developmental arrest</article-title>. <source>Plant J</source>. <volume>63</volume>, <fpage>512</fpage>&#x02013;<lpage>525</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-313X.2010.04253.x</pub-id><pub-id pub-id-type="pmid">20497375</pub-id></citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Davies</surname> <given-names>T. J.</given-names></name> <name><surname>Barraclough</surname> <given-names>T. G.</given-names></name> <name><surname>Chase</surname> <given-names>M. W.</given-names></name> <name><surname>Soltis</surname> <given-names>P. S.</given-names></name> <name><surname>Soltis</surname> <given-names>D. E.</given-names></name> <name><surname>Savolainen</surname> <given-names>V.</given-names></name></person-group> (<year>2004</year>). <article-title>Darwin&#x00027;s abominable mystery: Insights from a supertree of the angiosperms</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>101</volume>, <fpage>1904</fpage>&#x02013;<lpage>1909</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0308127100</pub-id><pub-id pub-id-type="pmid">14766971</pub-id></citation>
</ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Donoghue</surname> <given-names>M. T. A.</given-names></name> <name><surname>Keshavaiah</surname> <given-names>C.</given-names></name> <name><surname>Swamidatta</surname> <given-names>S. H.</given-names></name> <name><surname>Spillane</surname> <given-names>C.</given-names></name></person-group> (<year>2011</year>). <article-title>Evolutionary origins of Brassicaceae specific genes in <italic>Arabidopsis thaliana</italic></article-title>. <source>BMC Evol. Biol</source>. <volume>11</volume>:<fpage>47</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2148-11-47</pub-id><pub-id pub-id-type="pmid">21332978</pub-id></citation>
</ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Drummond</surname> <given-names>A. J.</given-names></name> <name><surname>Ho</surname> <given-names>S. Y. W.</given-names></name> <name><surname>Phillips</surname> <given-names>M. J.</given-names></name> <name><surname>Rambaut</surname> <given-names>A.</given-names></name></person-group> (<year>2006</year>). <article-title>Relaxed phylogenetics and dating with confidence</article-title>. <source>PLoS Biol</source>. <volume>4</volume>: <fpage>e88</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pbio.0040088</pub-id><pub-id pub-id-type="pmid">16683862</pub-id></citation>
</ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Faith</surname> <given-names>D. P.</given-names></name> <name><surname>Cranston</surname> <given-names>P. S.</given-names></name></person-group> (<year>1991</year>). <article-title>Could a cladogram this short have arisen by chance alone - on permutation tests for cladistic structure</article-title>. <source>Cladistics</source> <volume>7</volume>, <fpage>1</fpage>&#x02013;<lpage>28</lpage>. <pub-id pub-id-type="doi">10.1111/j.1096-0031.1991.tb00020.x</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fitch</surname> <given-names>W. M.</given-names></name></person-group> (<year>1970</year>). <article-title>Distinguishing homologous from analogous proteins</article-title>. <source>Syst. Zool</source>. <volume>19</volume>, <fpage>99</fpage>&#x02013;<lpage>113</lpage>. <pub-id pub-id-type="doi">10.2307/2412448</pub-id><pub-id pub-id-type="pmid">5449325</pub-id></citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Freeling</surname> <given-names>M.</given-names></name></person-group> (<year>2009</year>). <article-title>Bias in plant gene content following different sorts of duplication: tandem, whole-genome, segmental, or by transposition</article-title>. <source>Annu. Rev. Plant Biol</source>. <volume>60</volume>, <fpage>433</fpage>&#x02013;<lpage>453</lpage>. <pub-id pub-id-type="doi">10.1146/annurev.arplant.043008.092122</pub-id><pub-id pub-id-type="pmid">19575588</pub-id></citation>
</ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goldstein</surname> <given-names>J. L.</given-names></name> <name><surname>Brown</surname> <given-names>M. S.</given-names></name></person-group> (<year>1990</year>). <article-title>Regulation of the mevalonate pathway</article-title>. <source>Nature</source> <volume>343</volume>, <fpage>425</fpage>&#x02013;<lpage>430</lpage>. <pub-id pub-id-type="doi">10.1038/343425a0</pub-id><pub-id pub-id-type="pmid">1967820</pub-id></citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gonnet</surname> <given-names>G. H.</given-names></name> <name><surname>Hallett</surname> <given-names>M. T.</given-names></name> <name><surname>Korostensky</surname> <given-names>C.</given-names></name> <name><surname>Bernardin</surname> <given-names>L.</given-names></name></person-group> (<year>2000</year>). <article-title>Darwin v. 2.0: an interpreted computer language for the biosciences</article-title>. <source>Bioinformatics</source> <volume>16</volume>, <fpage>101</fpage>&#x02013;<lpage>103</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/16.2.101</pub-id><pub-id pub-id-type="pmid">10842729</pub-id></citation>
</ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guindon</surname> <given-names>S.</given-names></name> <name><surname>Delsuc</surname> <given-names>F.</given-names></name> <name><surname>Dufayard</surname> <given-names>J. F.</given-names></name> <name><surname>Gascuel</surname> <given-names>O.</given-names></name></person-group> (<year>2009</year>). <article-title>Estimating maximum likelihood phylogenies with PhyML</article-title>. <source>Methods Mol. Biol</source>. <volume>537</volume>, <fpage>113</fpage>&#x02013;<lpage>137</lpage>. <pub-id pub-id-type="doi">10.1007/978-1-59745-251-9_6</pub-id><pub-id pub-id-type="pmid">19378142</pub-id></citation>
</ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guindon</surname> <given-names>S.</given-names></name> <name><surname>Gascuel</surname> <given-names>O.</given-names></name></person-group> (<year>2003</year>). <article-title>A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood</article-title>. <source>Syst. Biol</source>. <volume>52</volume>, <fpage>696</fpage>&#x02013;<lpage>704</lpage>. <pub-id pub-id-type="doi">10.1080/10635150390235520</pub-id><pub-id pub-id-type="pmid">14530136</pub-id></citation>
</ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hirotsune</surname> <given-names>S.</given-names></name> <name><surname>Yoshida</surname> <given-names>N.</given-names></name> <name><surname>Chen</surname> <given-names>A.</given-names></name> <name><surname>Garrett</surname> <given-names>L.</given-names></name> <name><surname>Sugiyama</surname> <given-names>F.</given-names></name> <name><surname>Takahashi</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2003</year>). <article-title>An expressed pseudogene regulates the messenger-RNA stability of its homologous coding gene</article-title>. <source>Nature</source> <volume>423</volume>, <fpage>91</fpage>&#x02013;<lpage>96</lpage>. <pub-id pub-id-type="doi">10.1038/nature01535</pub-id><pub-id pub-id-type="pmid">12721631</pub-id></citation>
</ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hsiao</surname> <given-names>Y. Y.</given-names></name> <name><surname>Jeng</surname> <given-names>M. F.</given-names></name> <name><surname>Tsai</surname> <given-names>W. C.</given-names></name> <name><surname>Chuang</surname> <given-names>Y. C.</given-names></name> <name><surname>Li</surname> <given-names>C. Y.</given-names></name> <name><surname>Wu</surname> <given-names>T. S.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>A novel homodimeric geranyl diphosphate synthase from the orchid <italic>Phalaenopsis bellina</italic> lacking a DD(X)(2-4)D motif</article-title>. <source>Plant J</source>. <volume>55</volume>, <fpage>719</fpage>&#x02013;<lpage>733</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-313X.2008.03547.x</pub-id><pub-id pub-id-type="pmid">18466308</pub-id></citation>
</ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hsieh</surname> <given-names>F. L.</given-names></name> <name><surname>Chang</surname> <given-names>T. H.</given-names></name> <name><surname>Ko</surname> <given-names>T. P.</given-names></name> <name><surname>Wang</surname> <given-names>A. H. J.</given-names></name></person-group> (<year>2011</year>). <article-title>Structure and mechanism of an Arabidopsis medium/long-chain-length prenyl pyrophosphate synthase</article-title>. <source>Plant Physiol</source>. <volume>155</volume>, <fpage>1079</fpage>&#x02013;<lpage>1090</lpage>. <pub-id pub-id-type="doi">10.1104/pp.110.168799</pub-id><pub-id pub-id-type="pmid">21220764</pub-id></citation>
</ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hurst</surname> <given-names>L. D.</given-names></name> <name><surname>Smith</surname> <given-names>N. G. C.</given-names></name></person-group> (<year>1999</year>). <article-title>Do essential genes evolve slowly?</article-title> <source>Curr. Biol</source>. <volume>9</volume>, <fpage>747</fpage>&#x02013;<lpage>750</lpage>. <pub-id pub-id-type="doi">10.1016/S0960-9822(99)80334-0</pub-id><pub-id pub-id-type="pmid">10421576</pub-id></citation>
</ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Innan</surname> <given-names>H.</given-names></name> <name><surname>Kondrashov</surname> <given-names>F.</given-names></name></person-group> (<year>2010</year>). <article-title>The evolution of gene duplications: classifying and distinguishing between models</article-title>. <source>Nat. Rev. Genet</source>. <volume>11</volume>, <fpage>97</fpage>&#x02013;<lpage>108</lpage>. <pub-id pub-id-type="doi">10.1038/nrg2689</pub-id><pub-id pub-id-type="pmid">20051986</pub-id></citation>
</ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jaillon</surname> <given-names>O.</given-names></name> <name><surname>Aury</surname> <given-names>J. M.</given-names></name> <name><surname>Noel</surname> <given-names>B.</given-names></name> <name><surname>Policriti</surname> <given-names>A.</given-names></name> <name><surname>Clepet</surname> <given-names>C.</given-names></name> <name><surname>Casagrande</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2007</year>). <article-title>The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla</article-title>. <source>Nature</source> <volume>449</volume>, <fpage>463</fpage>&#x02013;<lpage>U465</lpage>. <pub-id pub-id-type="doi">10.1038/nature06148</pub-id><pub-id pub-id-type="pmid">17721507</pub-id></citation>
</ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jassbi</surname> <given-names>A. R.</given-names></name> <name><surname>Gase</surname> <given-names>K.</given-names></name> <name><surname>Hettenhausen</surname> <given-names>C.</given-names></name> <name><surname>Schmidt</surname> <given-names>A.</given-names></name> <name><surname>Baldwin</surname> <given-names>I. T.</given-names></name></person-group> (<year>2008</year>). <article-title>Silencing geranylgeranyl diphosphate synthase in <italic>Nicotiana attenuata</italic> dramatically impairs resistance to tobacco hornworm</article-title>. <source>Plant Physiol</source>. <volume>146</volume>, <fpage>974</fpage>&#x02013;<lpage>986</lpage>. <pub-id pub-id-type="doi">10.1104/pp.107.108811</pub-id><pub-id pub-id-type="pmid">17965175</pub-id></citation>
</ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Katoh</surname> <given-names>K.</given-names></name> <name><surname>Toh</surname> <given-names>H.</given-names></name></person-group> (<year>2008</year>). <article-title>Recent developments in the MAFFT multiple sequence alignment program</article-title>. <source>Brief. Bioinformatics</source> <volume>9</volume>, <fpage>286</fpage>&#x02013;<lpage>298</lpage>. <pub-id pub-id-type="doi">10.1093/bib/bbn013</pub-id><pub-id pub-id-type="pmid">18372315</pub-id></citation>
</ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Keeling</surname> <given-names>C. I.</given-names></name> <name><surname>Bohlmann</surname> <given-names>J.</given-names></name></person-group> (<year>2006</year>). <article-title>Genes, enzymes and chemicals of terpenoid diversity in the constitutive and induced defence of conifers against insects and pathogens</article-title>. <source>New Phytol</source>. <volume>170</volume>, <fpage>657</fpage>&#x02013;<lpage>675</lpage>. <pub-id pub-id-type="doi">10.1111/j.1469-8137.2006.01716.x</pub-id><pub-id pub-id-type="pmid">16684230</pub-id></citation>
</ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kersey</surname> <given-names>P. J.</given-names></name> <name><surname>Lawson</surname> <given-names>D.</given-names></name> <name><surname>Birney</surname> <given-names>E.</given-names></name> <name><surname>Derwent</surname> <given-names>P. S.</given-names></name> <name><surname>Haimel</surname> <given-names>M.</given-names></name> <name><surname>Herrero</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Ensembl genomes: extending ensembl across the taxonomic space</article-title>. <source>Nucleic Acids Res</source>. <volume>38</volume>, <fpage>D563</fpage>&#x02013;<lpage>D569</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkp871</pub-id><pub-id pub-id-type="pmid">19884133</pub-id></citation>
</ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kliebenstein</surname> <given-names>D. J.</given-names></name> <name><surname>Kroymann</surname> <given-names>J.</given-names></name> <name><surname>Mitchell-Olds</surname> <given-names>T.</given-names></name></person-group> (<year>2005</year>). <article-title>The glucosinolate-myrosinase system in an ecological and evolutionary context</article-title>. <source>Curr. Opin. Plant Biol</source>. <volume>8</volume>, <fpage>264</fpage>&#x02013;<lpage>271</lpage>. <pub-id pub-id-type="doi">10.1016/j.pbi.2005.03.002</pub-id><pub-id pub-id-type="pmid">15860423</pub-id></citation>
</ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Koonin</surname> <given-names>E. V.</given-names></name></person-group> (<year>2009</year>). <article-title>Darwinian evolution in the light of genomics</article-title>. <source>Nucleic Acids Res</source>. <volume>37</volume>, <fpage>1011</fpage>&#x02013;<lpage>1034</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkp089</pub-id><pub-id pub-id-type="pmid">19213802</pub-id></citation>
</ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kosiol</surname> <given-names>C.</given-names></name> <name><surname>Holmes</surname> <given-names>I.</given-names></name> <name><surname>Goldman</surname> <given-names>N.</given-names></name></person-group> (<year>2007</year>). <article-title>An empirical codon model for protein sequence evolution</article-title>. <source>Mol. Biol. Evol</source>. <volume>24</volume>, <fpage>1464</fpage>&#x02013;<lpage>1479</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msm064</pub-id><pub-id pub-id-type="pmid">17400572</pub-id></citation>
</ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lange</surname> <given-names>B. M.</given-names></name> <name><surname>Ghassemian</surname> <given-names>M.</given-names></name></person-group> (<year>2003</year>). <article-title>Genome organization in <italic>Arabidopsis thaliana</italic>: a survey for genes involved in isoprenoid and chlorophyll metabolism</article-title>. <source>Plant Mol. Biol</source>. <volume>51</volume>, <fpage>925</fpage>&#x02013;<lpage>948</lpage>. <pub-id pub-id-type="doi">10.1023/a:1023005504702</pub-id><pub-id pub-id-type="pmid">12777052</pub-id></citation>
</ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Langham</surname> <given-names>R. J.</given-names></name> <name><surname>Walsh</surname> <given-names>J.</given-names></name> <name><surname>Dunn</surname> <given-names>M.</given-names></name> <name><surname>Ko</surname> <given-names>C.</given-names></name> <name><surname>Goff</surname> <given-names>S. A.</given-names></name> <name><surname>Freeling</surname> <given-names>M.</given-names></name></person-group> (<year>2004</year>). <article-title>Genomic duplication, fractionation and the origin of regulatory novelty</article-title>. <source>Genetics</source> <volume>166</volume>, <fpage>935</fpage>&#x02013;<lpage>945</lpage>. <pub-id pub-id-type="doi">10.1534/genetics.166.2.935</pub-id><pub-id pub-id-type="pmid">15020478</pub-id></citation>
</ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Le</surname> <given-names>S. Q.</given-names></name> <name><surname>Gascuel</surname> <given-names>O.</given-names></name></person-group> (<year>2008</year>). <article-title>An improved general amino acid replacement matrix</article-title>. <source>Mol. Biol. Evol</source>. <volume>25</volume>, <fpage>1307</fpage>&#x02013;<lpage>1320</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msn067</pub-id><pub-id pub-id-type="pmid">18367465</pub-id></citation>
</ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lespinet</surname> <given-names>O.</given-names></name> <name><surname>Wolf</surname> <given-names>Y. I.</given-names></name> <name><surname>Koonin</surname> <given-names>E. V.</given-names></name> <name><surname>Aravind</surname> <given-names>L.</given-names></name></person-group> (<year>2002</year>). <article-title>The role of lineage-specific gene family expansion in the evolution of eukaryotes</article-title>. <source>Genome Res</source>. <volume>12</volume>, <fpage>1048</fpage>&#x02013;<lpage>1059</lpage>. <pub-id pub-id-type="doi">10.1101/gr.174302</pub-id><pub-id pub-id-type="pmid">12097341</pub-id></citation>
</ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liang</surname> <given-names>P. H.</given-names></name></person-group> (<year>2009</year>). <article-title>Reaction kinetics, catalytic mechanisms, conformational changes, and inhibitor design for prenyltransferases</article-title>. <source>Biochemistry</source> <volume>48</volume>, <fpage>6562</fpage>&#x02013;<lpage>6570</lpage>. <pub-id pub-id-type="doi">10.1021/bi900371p</pub-id><pub-id pub-id-type="pmid">19537817</pub-id></citation>
</ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liang</surname> <given-names>P. H.</given-names></name> <name><surname>Ko</surname> <given-names>T. P.</given-names></name> <name><surname>Wang</surname> <given-names>A. H. J.</given-names></name></person-group> (<year>2002</year>). <article-title>Structure, mechanism and function of prenyltransferases</article-title>. <source>Eur. J. Biochem</source>. <volume>269</volume>, <fpage>3339</fpage>&#x02013;<lpage>3354</lpage>. <pub-id pub-id-type="doi">10.1046/j.1432-1033.2002.03014.x</pub-id><pub-id pub-id-type="pmid">12135472</pub-id></citation>
</ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lynch</surname> <given-names>M.</given-names></name> <name><surname>Conery</surname> <given-names>J. S.</given-names></name></person-group> (<year>2000</year>). <article-title>The evolutionary fate and consequences of duplicate genes</article-title>. <source>Science</source> <volume>290</volume>, <fpage>1151</fpage>&#x02013;<lpage>1155</lpage>. <pub-id pub-id-type="doi">10.1126/science.290.5494.1151</pub-id><pub-id pub-id-type="pmid">11073452</pub-id></citation>
</ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lynch</surname> <given-names>M.</given-names></name> <name><surname>Force</surname> <given-names>A.</given-names></name></person-group> (<year>2000</year>). <article-title>The probability of duplicate gene preservation by subfunctionalization</article-title>. <source>Genetics</source> <volume>154</volume>, <fpage>459</fpage>&#x02013;<lpage>473</lpage>. <pub-id pub-id-type="pmid">10629003</pub-id></citation>
</ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lyons</surname> <given-names>E.</given-names></name> <name><surname>Pedersen</surname> <given-names>B.</given-names></name> <name><surname>Kane</surname> <given-names>J.</given-names></name> <name><surname>Alam</surname> <given-names>M.</given-names></name> <name><surname>Ming</surname> <given-names>R.</given-names></name> <name><surname>Tang</surname> <given-names>H. B.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>Finding and comparing syntenic regions among Arabidopsis and the outgroups papaya, poplar, and grape: CoGe with rosids</article-title>. <source>Plant Physiol</source>. <volume>148</volume>, <fpage>1772</fpage>&#x02013;<lpage>1781</lpage>. <pub-id pub-id-type="doi">10.1104/pp.108.124867</pub-id><pub-id pub-id-type="pmid">18952863</pub-id></citation>
</ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lysak</surname> <given-names>M. A.</given-names></name> <name><surname>Berr</surname> <given-names>A.</given-names></name> <name><surname>Pecinka</surname> <given-names>A.</given-names></name> <name><surname>Schmidt</surname> <given-names>R.</given-names></name> <name><surname>Mcbreen</surname> <given-names>K.</given-names></name> <name><surname>Schubert</surname> <given-names>I.</given-names></name></person-group> (<year>2006</year>). <article-title>Mechanisms of chromosome number reduction in <italic>Arabidopsis thaliana</italic> and related Brassicaceae species</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>103</volume>, <fpage>5224</fpage>&#x02013;<lpage>5229</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0510791103</pub-id><pub-id pub-id-type="pmid">16549785</pub-id></citation>
</ref>
<ref id="B53">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Maddison</surname> <given-names>W. P.</given-names></name> <name><surname>Maddison</surname> <given-names>D. R.</given-names></name></person-group> (<year>2011</year>). <source>Mesquite: a Modular System for Evolutionary Analysis. Version 2.75</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://mesquiteproject.org">http://mesquiteproject.org</ext-link></citation>
</ref>
<ref id="B55">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ming</surname> <given-names>R.</given-names></name> <name><surname>Hou</surname> <given-names>S. B.</given-names></name> <name><surname>Feng</surname> <given-names>Y.</given-names></name> <name><surname>Yu</surname> <given-names>Q. Y.</given-names></name> <name><surname>Dionne-Laporte</surname> <given-names>A.</given-names></name> <name><surname>Saw</surname> <given-names>J. H.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus)</article-title>. <source>Nature</source> <volume>452</volume>, <fpage>U991</fpage>&#x02013;<lpage>U997</lpage>. <pub-id pub-id-type="doi">10.1038/nature06856</pub-id><pub-id pub-id-type="pmid">18432245</pub-id></citation>
</ref>
<ref id="B56">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nowick</surname> <given-names>K.</given-names></name> <name><surname>Stubbs</surname> <given-names>L.</given-names></name></person-group> (<year>2010</year>). <article-title>Lineage-specific transcription factors and the evolution of gene regulatory networks</article-title>. <source>Brief. Funct. Genomics</source> <volume>9</volume>, <fpage>65</fpage>&#x02013;<lpage>78</lpage>. <pub-id pub-id-type="doi">10.1093/bfgp/elp056</pub-id><pub-id pub-id-type="pmid">20081217</pub-id></citation>
</ref>
<ref id="B57">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Okada</surname> <given-names>K.</given-names></name> <name><surname>Saito</surname> <given-names>T.</given-names></name> <name><surname>Nakagawa</surname> <given-names>T.</given-names></name> <name><surname>Kawamukai</surname> <given-names>M.</given-names></name> <name><surname>Kamiya</surname> <given-names>Y.</given-names></name></person-group> (<year>2000</year>). <article-title>Five geranylgeranyl diphosphate synthases expressed in different organs are localized into three subcellular compartments in Arabidopsis</article-title>. <source>Plant Physiol</source>. <volume>122</volume>, <fpage>1045</fpage>&#x02013;<lpage>1056</lpage>. <pub-id pub-id-type="doi">10.1104/pp.122.4.1045</pub-id><pub-id pub-id-type="pmid">10759500</pub-id></citation>
</ref>
<ref id="B58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pichersky</surname> <given-names>E.</given-names></name> <name><surname>Gershenzon</surname> <given-names>J.</given-names></name></person-group> (<year>2002</year>). <article-title>The formation and function of plant volatiles: perfumes for pollinator attraction and defense</article-title>. <source>Curr. Opin. Plant Biol</source>. <volume>5</volume>, <fpage>237</fpage>&#x02013;<lpage>243</lpage>. <pub-id pub-id-type="doi">10.1016/S1369-5266(02)00251-0</pub-id><pub-id pub-id-type="pmid">11960742</pub-id></citation>
</ref>
<ref id="B59">
<citation citation-type="book"><person-group person-group-type="author"><collab>R Development Core Team.</collab></person-group> (<year>2010</year>). <source>R: A Language and Environment for Statistical Computing</source>. <publisher-loc>Vienna</publisher-loc>: <publisher-name>R Foundation for Statistical Computing</publisher-name>.</citation>
</ref>
<ref id="B61">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rodr&#x000ED;guez-Concepci&#x000F3;n</surname> <given-names>M.</given-names></name> <name><surname>Boronat</surname> <given-names>A.</given-names></name></person-group> (<year>2002</year>). <article-title>Elucidation of the methylerythritol phosphate pathway for isoprenoid biosynthesis in bacteria and plastids. A metabolic milestone achieved through genomics</article-title>. <source>Plant Physiol</source>. <volume>130</volume>, <fpage>1079</fpage>&#x02013;<lpage>1089</lpage>. <pub-id pub-id-type="doi">10.1104/pp.007138</pub-id><pub-id pub-id-type="pmid">12427975</pub-id></citation>
</ref>
<ref id="B62">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rohmer</surname> <given-names>M.</given-names></name></person-group> (<year>1999</year>). <article-title>The discovery of a mevalonate-independent pathway for isoprenoid biosynthesis in bacteria, algae and higher plants</article-title>. <source>Nat. Prod. Rep</source>. <volume>16</volume>, <fpage>565</fpage>&#x02013;<lpage>574</lpage>. <pub-id pub-id-type="doi">10.1039/A709175c</pub-id><pub-id pub-id-type="pmid">10584331</pub-id></citation>
</ref>
<ref id="B63">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmid</surname> <given-names>M.</given-names></name> <name><surname>Davison</surname> <given-names>T. S.</given-names></name> <name><surname>Henz</surname> <given-names>S. R.</given-names></name> <name><surname>Pape</surname> <given-names>U. J.</given-names></name> <name><surname>Demar</surname> <given-names>M.</given-names></name> <name><surname>Vingron</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2005</year>). <article-title>A gene expression map of <italic>Arabidopsis thaliana</italic> development</article-title>. <source>Nat. Genet</source>. <volume>37</volume>, <fpage>501</fpage>&#x02013;<lpage>506</lpage>. <pub-id pub-id-type="doi">10.1038/ng1543</pub-id><pub-id pub-id-type="pmid">15806101</pub-id></citation>
</ref>
<ref id="B64">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmidt</surname> <given-names>A.</given-names></name> <name><surname>Gershenzon</surname> <given-names>J.</given-names></name></person-group> (<year>2008</year>). <article-title>Cloning and characterization of two different types of geranyl diphosphate synthases from Norway spruce (<italic>Picea abies</italic>)</article-title>. <source>Phytochemistry</source> <volume>69</volume>, <fpage>49</fpage>&#x02013;<lpage>57</lpage>. <pub-id pub-id-type="doi">10.1016/j.phytochem.2007.06.022</pub-id><pub-id pub-id-type="pmid">17673268</pub-id></citation>
</ref>
<ref id="B65">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmidt</surname> <given-names>A.</given-names></name> <name><surname>Wachtler</surname> <given-names>B.</given-names></name> <name><surname>Temp</surname> <given-names>U.</given-names></name> <name><surname>Krekling</surname> <given-names>T.</given-names></name> <name><surname>Seguin</surname> <given-names>A.</given-names></name> <name><surname>Gershenzon</surname> <given-names>J.</given-names></name></person-group> (<year>2010</year>). <article-title>A bifunctional geranyl and geranylgeranyl diphosphate synthase is involved in terpene oleoresin formation in <italic>Picea abies</italic></article-title>. <source>Plant Physiol</source>. <volume>152</volume>, <fpage>639</fpage>&#x02013;<lpage>655</lpage>. <pub-id pub-id-type="doi">10.1104/pp.109.144691</pub-id><pub-id pub-id-type="pmid">19939949</pub-id></citation>
</ref>
<ref id="B67">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Studer</surname> <given-names>R. A.</given-names></name> <name><surname>Robinson-Rechavi</surname> <given-names>M.</given-names></name></person-group> (<year>2009</year>). <article-title>How confident can we be that orthologs are similar, but paralogs differ?</article-title> <source>Trends Genet</source>. <volume>25</volume>, <fpage>210</fpage>&#x02013;<lpage>216</lpage>. <pub-id pub-id-type="doi">10.1016/j.tig.2009.03.004</pub-id><pub-id pub-id-type="pmid">19368988</pub-id></citation>
</ref>
<ref id="B68">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tachibana</surname> <given-names>A.</given-names></name> <name><surname>Yano</surname> <given-names>Y.</given-names></name> <name><surname>Otani</surname> <given-names>S.</given-names></name> <name><surname>Nomura</surname> <given-names>N.</given-names></name> <name><surname>Sako</surname> <given-names>Y.</given-names></name> <name><surname>Taniguchi</surname> <given-names>M.</given-names></name></person-group> (<year>2000</year>). <article-title>Novel prenyltransferase gene encoding farnesylgeranyl diphosphate synthase from a hyperthermophilic archaeon, <italic>Aeropyrum pernix</italic> - Molecular evolution with alteration in product specificity</article-title>. <source>Eur. J. Biochem</source>. <volume>267</volume>, <fpage>321</fpage>&#x02013;<lpage>328</lpage>. <pub-id pub-id-type="doi">10.1046/j.1432-1327.2000.00967.x</pub-id><pub-id pub-id-type="pmid">10632701</pub-id></citation>
</ref>
<ref id="B85">
<citation citation-type="journal"><person-group person-group-type="author"><collab>The UniProt Consortium.</collab></person-group> (<year>2009</year>). <article-title>The Universal Protein Resource (UniProt) 2009</article-title>. <source>Nucl. Acids Res</source>. <volume>37</volume>, <fpage>D169</fpage>&#x02013;<lpage>D174</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkn664</pub-id><pub-id pub-id-type="pmid">18836194</pub-id></citation>
</ref>
<ref id="B69">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tholl</surname> <given-names>D.</given-names></name></person-group> (<year>2006</year>). <article-title>Terpene synthases and the regulation, diversity and biological roles of terpene metabolism</article-title>. <source>Curr. Opin. Plant Biol</source>. <volume>9</volume>, <fpage>297</fpage>&#x02013;<lpage>304</lpage>. <pub-id pub-id-type="doi">10.1016/j.pbi.2006.03.014</pub-id><pub-id pub-id-type="pmid">16600670</pub-id></citation>
</ref>
<ref id="B70">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tholl</surname> <given-names>D.</given-names></name> <name><surname>Kish</surname> <given-names>C. M.</given-names></name> <name><surname>Orlova</surname> <given-names>I.</given-names></name> <name><surname>Sherman</surname> <given-names>D.</given-names></name> <name><surname>Gershenzon</surname> <given-names>J.</given-names></name> <name><surname>Pichersky</surname> <given-names>E.</given-names></name> <etal/></person-group>. (<year>2004</year>). <article-title>Formation of monoterpenes in <italic>Antirrhinum majus</italic> and <italic>Clarkia breweri</italic> flowers involves heterodimeric geranyl diphosphate synthases</article-title>. <source>Plant Cell</source> <volume>16</volume>, <fpage>977</fpage>&#x02013;<lpage>992</lpage>. <pub-id pub-id-type="doi">10.1105/Tpc.020156</pub-id><pub-id pub-id-type="pmid">15031409</pub-id></citation>
</ref>
<ref id="B71">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thomas</surname> <given-names>B. C.</given-names></name> <name><surname>Pedersen</surname> <given-names>B.</given-names></name> <name><surname>Freeling</surname> <given-names>M.</given-names></name></person-group> (<year>2006</year>). <article-title>Following tetraploidy in an Arabidopsis ancestor, genes were removed preferentially from one homeolog leaving clusters enriched in dose-sensitive genes</article-title>. <source>Genome Res</source>. <volume>16</volume>, <fpage>934</fpage>&#x02013;<lpage>946</lpage>. <pub-id pub-id-type="doi">10.1101/gr.4708406</pub-id><pub-id pub-id-type="pmid">16760422</pub-id></citation>
</ref>
<ref id="B72">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van Der Heijden</surname> <given-names>R. T. J. M.</given-names></name> <name><surname>Snel</surname> <given-names>B.</given-names></name> <name><surname>Van Noort</surname> <given-names>V.</given-names></name> <name><surname>Huynen</surname> <given-names>M. A.</given-names></name></person-group> (<year>2007</year>). <article-title>Orthology prediction at scalable resolution by phylogenetic tree analysis</article-title>. <source>BMC Bioinformatics</source> <volume>8</volume>:<fpage>83</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-8-83</pub-id><pub-id pub-id-type="pmid">17346331</pub-id></citation>
</ref>
<ref id="B73">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van Schie</surname> <given-names>C. C. N.</given-names></name> <name><surname>Ament</surname> <given-names>K.</given-names></name> <name><surname>Schmidt</surname> <given-names>A.</given-names></name> <name><surname>Lange</surname> <given-names>T.</given-names></name> <name><surname>Haring</surname> <given-names>M. A.</given-names></name> <name><surname>Schuurink</surname> <given-names>R. C.</given-names></name></person-group> (<year>2007</year>). <article-title>Geranyl diphosphate synthase is required for biosynthesis of gibberellins</article-title>. <source>Plant J</source>. <volume>52</volume>, <fpage>752</fpage>&#x02013;<lpage>762</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-313X.2007.03273.x</pub-id><pub-id pub-id-type="pmid">17877699</pub-id></citation>
</ref>
<ref id="B74">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vandermoten</surname> <given-names>S.</given-names></name> <name><surname>Haubruge</surname> <given-names>E.</given-names></name> <name><surname>Cusson</surname> <given-names>M.</given-names></name></person-group> (<year>2009</year>). <article-title>New insights into short-chain prenyltransferases: structural features, evolutionary history and potential for selective inhibition</article-title>. <source>Cell. Mol. Life Sci</source>. <volume>66</volume>, <fpage>3685</fpage>&#x02013;<lpage>3695</lpage>. <pub-id pub-id-type="doi">10.1007/s00018-009-0100-9</pub-id><pub-id pub-id-type="pmid">19633972</pub-id></citation>
</ref>
<ref id="B75">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vranov&#x000E1;</surname> <given-names>E.</given-names></name> <name><surname>Coman</surname> <given-names>D.</given-names></name> <name><surname>Gruissem</surname> <given-names>W.</given-names></name></person-group> (<year>2012</year>). <article-title>Structure and dynamics of the isoprenoid pathway network</article-title>. <source>Mol. Plant</source> <volume>5</volume>, <fpage>318</fpage>&#x02013;<lpage>333</lpage>. <pub-id pub-id-type="doi">10.1093/mp/sss015</pub-id><pub-id pub-id-type="pmid">22442388</pub-id></citation>
</ref>
<ref id="B76">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vranov&#x000E1;</surname> <given-names>E.</given-names></name> <name><surname>Coman</surname> <given-names>D.</given-names></name> <name><surname>Gruissem</surname> <given-names>W.</given-names></name></person-group> (<year>2013</year>). <article-title>Network analysis of the MVA and MEP pathways for isoprenoid synthesis</article-title>. <source>Annu. Rev. Plant Biol</source>. <volume>64</volume>, <fpage>665</fpage>&#x02013;<lpage>700</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-arplant-050312-120116</pub-id><pub-id pub-id-type="pmid">23451776</pub-id></citation>
</ref>
<ref id="B77">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vranov&#x000E1;</surname> <given-names>E.</given-names></name> <name><surname>Hirsch-Hoffmann</surname> <given-names>M.</given-names></name> <name><surname>Gruissem</surname> <given-names>W.</given-names></name></person-group> (<year>2011</year>). <article-title>AtIPD: a curated database of Arabidopsis isoprenoid pathway models and genes for isoprenoid network analysis</article-title>. <source>Plant Physiol</source>. <volume>156</volume>, <fpage>1655</fpage>&#x02013;<lpage>1660</lpage>. <pub-id pub-id-type="doi">10.1104/pp.111.177758</pub-id><pub-id pub-id-type="pmid">21617028</pub-id></citation>
</ref>
<ref id="B78">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wahlberg</surname> <given-names>N.</given-names></name></person-group> (<year>2001</year>). <article-title>The phylogenetics and biochemistry of host-plant specialization in Melitaeine butterflies (Lepidoptera: Nymphalidae)</article-title>. <source>Evolution</source> <volume>55</volume>, <fpage>522</fpage>&#x02013;<lpage>537</lpage>. <pub-id pub-id-type="doi">10.1554/0014-3820(2001)055[0522:Tpaboh]2.0.Co;2</pub-id><pub-id pub-id-type="pmid">11327160</pub-id></citation>
</ref>
<ref id="B79">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wallrapp</surname> <given-names>F. H.</given-names></name> <name><surname>Pan</surname> <given-names>J. J.</given-names></name> <name><surname>Ramamoorthy</surname> <given-names>G.</given-names></name> <name><surname>Almonacid</surname> <given-names>D. E.</given-names></name> <name><surname>Hillerich</surname> <given-names>B. S.</given-names></name> <name><surname>Seidel</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Prediction of function for the polyprenyl transferase subgroup in the isoprenoid synthase superfamily</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>110</volume>, <fpage>E1196</fpage>&#x02013;<lpage>E1202</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1300632110</pub-id><pub-id pub-id-type="pmid">23493556</pub-id></citation>
</ref>
<ref id="B80">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>G.</given-names></name> <name><surname>Dixon</surname> <given-names>R. A.</given-names></name></person-group> (<year>2009</year>). <article-title>Heterodimeric geranyl(geranyl)diphosphate synthase from hop (<italic>Humulus lupulus</italic>) and the evolution of monoterpene biosynthesis</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A</source>. <volume>106</volume>, <fpage>9914</fpage>&#x02013;<lpage>9919</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0904069106</pub-id><pub-id pub-id-type="pmid">19482937</pub-id></citation>
</ref>
<ref id="B81">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yogeeswaran</surname> <given-names>K.</given-names></name> <name><surname>Frary</surname> <given-names>A.</given-names></name> <name><surname>York</surname> <given-names>T. L.</given-names></name> <name><surname>Amenta</surname> <given-names>A.</given-names></name> <name><surname>Lesser</surname> <given-names>A. H.</given-names></name> <name><surname>Nasrallah</surname> <given-names>J. B.</given-names></name> <etal/></person-group>. (<year>2005</year>). <article-title>Comparative genome analyses of Arabidopsis spp.: inferring chromosomal rearrangement events in the evolutionary history of <italic>A-thaliana</italic></article-title>. <source>Genome Res</source>. <volume>15</volume>, <fpage>505</fpage>&#x02013;<lpage>515</lpage>. <pub-id pub-id-type="doi">10.1101/gr.3436305</pub-id><pub-id pub-id-type="pmid">15805492</pub-id></citation>
</ref>
<ref id="B82">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zheng</surname> <given-names>D. Y.</given-names></name> <name><surname>Gerstein</surname> <given-names>M. B.</given-names></name></person-group> (<year>2007</year>). <article-title>The ambiguous boundary between genes and pseudogenes: the dead rise up, or do they?</article-title> <source>Trends Genet</source>. <volume>23</volume>, <fpage>219</fpage>&#x02013;<lpage>224</lpage>. <pub-id pub-id-type="doi">10.1016/j.tig.2007.03.003</pub-id><pub-id pub-id-type="pmid">17382428</pub-id></citation>
</ref>
<ref id="B83">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhu</surname> <given-names>X. F.</given-names></name> <name><surname>Suzuki</surname> <given-names>K.</given-names></name> <name><surname>Okada</surname> <given-names>K.</given-names></name> <name><surname>Tanaka</surname> <given-names>K.</given-names></name> <name><surname>Nakagawa</surname> <given-names>T.</given-names></name> <name><surname>Kawamukai</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>1997a</year>). <article-title>Cloning and functional expression of a novel geranylgeranyl pyrophosphate synthase gene from <italic>Arabidopsis thaliana</italic> in <italic>Escherichia coli</italic></article-title>. <source>Plant Cell Physiol</source>. <volume>38</volume>, <fpage>357</fpage>&#x02013;<lpage>361</lpage>. <pub-id pub-id-type="pmid">9150607</pub-id></citation>
</ref>
<ref id="B84">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhu</surname> <given-names>X. F.</given-names></name> <name><surname>Suzuki</surname> <given-names>K.</given-names></name> <name><surname>Saito</surname> <given-names>T.</given-names></name> <name><surname>Okada</surname> <given-names>K.</given-names></name> <name><surname>Tanaka</surname> <given-names>K.</given-names></name> <name><surname>Nakagawa</surname> <given-names>T.</given-names></name> <etal/></person-group>. (<year>1997b</year>). <article-title>Geranylgeranyl pyrophosphate synthase encoded by the newly isolated gene GGPS6 from <italic>Arabidopsis thaliana</italic> is localized in mitochondria</article-title>. <source>Plant Mol. Biol</source>. <volume>35</volume>, <fpage>331</fpage>&#x02013;<lpage>341</lpage>. <pub-id pub-id-type="pmid">9349257</pub-id></citation>
</ref>
</ref-list>
</back>
</article>
