Abstract
Proteins change over the course of evolutionary time. New protein-coding genes and gene families emerge and diversify, ultimately affecting an organism’s phenotype and interactions with its environment. Here we survey the range of structural protein change observed in plants and review the role these changes have had in the evolution of plant form and function. Verified examples tying evolutionary change in protein structure to phenotypic change remain scarce. We will review the existing examples, as well as draw from investigations into domestication, and quantitative trait locus (QTL) cloning studies searching for the molecular underpinnings of natural variation. The evolutionary significance of many cloned QTL has not been assessed, but all the examples identified so far have begun to reveal the extent of protein structural diversity tolerated in natural systems. This molecular (and phenotypic) diversity could come to represent part of natural selection’s source material in the adaptive evolution of novel traits. Protein structure and function can change in many distinct ways, but the changes we identified in studies of natural diversity and protein evolution were predicted to fall primarily into one of six categories: altered active and binding sites; altered protein–protein interactions; altered domain content; altered activity as an activator or repressor; altered protein stability; and hypomorphic and hypermorphic alleles. There was also variability in the evolutionary scale at which particular changes were observed. Some changes were detected at both micro- and macroevolutionary timescales, while others were observed primarily at deep or shallow phylogenetic levels. This variation might be used to determine the trajectory of future investigations in structural molecular evolution.
INTRODUCTION
In the study of the molecular changes underlying adaptive evolution, there is debate as to whether regulatory or structural changes are of greater importance. Regulatory changes, especially those affecting where and when a transcriptional regulator is expressed, are thought to predominate. Structural changes are thought to have a higher degree of negative pleiotropy, and are probably not tolerated to the same degree as regulatory changes (, , ; Stern, 2000). Despite this prevailing view, structural changes have been shown to have had a noteworthy role in the evolution of some key adaptive traits (). In the evolution of plant form and function in particular, examples of both regulatory () and structural () changes exist. With time and more data, the argument may be resolved, but the point that every trait is different may be key (Wessinger and Rausher, 2012). In all likelihood, in most cases there is no single quantitative trait nucleotide (QTN), but rather a collection of myriad small changes, both regulatory and structural, that have contributed to the evolution of a novel phenotype (Rockman, 2012).
Regardless of which class of changes predominates, both regulatory and structural mutations have happened through the course of evolution. We have chosen to review those cases where structural mutations have had demonstrably functional consequences. Interpreting a mutation as either regulatory or structural is not always straightforward. We use the definition proposed by , with some modifications. They propose that mutations that occur in the coding sequences of genes are structural, and all other mutations, including those that occur in introns, are regulatory. This definition includes nonsense null mutations, altered miRNA-binding sites, and silent mutations affecting transcription and translation dynamics as “structural” (). We prefer a more narrow definition of “structural mutation,” and include only those examples where amino acid sequence is changed and protein function is not completely lost. This circumscription of structural mutations thus includes mostly missense mutations, but also insertions and deletions, frameshifts, and premature stop codons that produce proteins with altered functions. We have chosen this definition out of expediency. Compelling arguments exist for putting all excluded mutations (e.g., miRNA-binding site mutations) back into the set of structural mutations, and then for taking them right back out again. For the purpose of investigating the evolution of protein function, we feel that our narrow definition best frames the discussion.
Our review focuses on those cases where protein function has been altered, in turn altering some aspect of phenotype. It is important to highlight that many of the phenotypes we discuss may or may not be adaptive, but that is not the focus of this review. It is not trivial to unambiguously determine the molecular cause of a phenotypic adaptation, or even to confirm some phenotype as adaptive (). Moreover, there are few studies that have explicitly investigated the quantitative trait loci (QTL) underlying natural variation in an evolutionary framework, and as a consequence it is hard to determine their adaptive significance. A more widespread genotype might hint at some adaptive value, but for many of the examples we cite the responsible protein change is only found in a single accession where it may be deleterious and/or of short duration. These isolated QTL are not inherently less interesting, however, because they reveal the scope of molecular diversity to be found in natural environments, diversity that selection may ultimately act upon.
Another important preliminary consideration is that protein diversification through deep time can only be discussed in a framework of gene birth. In plants in particular, and in eukaryotes in general, a major source of new genes is gene duplication. Most gene families have expanded considerably through gene duplication and divergence, and often these expansions show lineage-specific patterns (). The new gene duplicates are thought to have one of three fates. Formally, duplicate genes may divide up the functions of the progenitor gene between them (subfunctionalization), one of the duplicates may gain an entirely new function (neofunctionalization), or one of the duplicates may decay into a non-functional pseudogene (; ). These categories are often difficult to assign, but where they are relevant, most of the examples we will discuss are of neofunctionalization.
In our review of the literature, we found that functional protein changes, the result of underlying structural mutations, fell into six broad, non-mutually exclusive categories. We have divided up our discussion according to these categories: (I) altered active or binding sites; (II) altered protein–protein interactions; (III) altered domain content; (IV) altered activity as a transcriptional activator or repressor; (V) altered protein stability; or (VI) hypomorphic and hypermorphic alleles (Figure 1 and Table 1).
FIGURE 1
Table 1
| Organism | Protein | Trait | Amino acid change | Scale* | Reference |
|---|---|---|---|---|---|
| Altered active or binding site | |||||
| Arabidopsis thaliana | Cytochrome P450’s | Arabidopyrone synthesis | Gradual replacement | M | Weng et al. (2012) |
| Boechera stricta | CYP79F1 duplicates | Glucosinolate biosynthesis | G134L, P536K | M | |
| Eleusine indica | α-tubulin | Herbicide resistance | T239I | m | Yamamoto et al. (1998) |
| Ipomoea spp. | Dfr | Flower color | Five amino acid sites | M | Smith et al. (2013) |
| Oryza sativa | SH4 | Seed abscission | K80N, DNA-binding domain | m | |
| Setaria viridis | α-tubulin | Herbicide resistance | T239I | m | |
| Solanum lycopersicum | LIN5 | Sucrose metabolism | E348D, close proximity to catalytic site | m | |
| Asteraceae | HHS | Pyrrolizidine alkaloid synthesis | Gradual replacement | M | |
| Convolvulaceae | HHS | Pyrrolizidine alkaloid synthesis | Gradual replacement | M | |
| Flowering plants | psbA | Herbicide resistance | S264G | m | |
| Flowering plants | AHAS | Herbicide resistance | Seven amino acid sites | m | |
| Land plants | LEAFY | Vegetative to reproductive transition | Gradual replacement | M | |
| Eukaryotes | MADS TF’s | Morphogenesis | Gradual replacement | M | |
| Altered protein–protein interactions | |||||
| Antirrhinum majus | FAR, PLE | Floral organ identity | Q insertion (K-C domain junction) | M | |
| Arabidopsis thaliana | ATMYC1 | Trichome density | P189A | m | Symonds et al. (2011) |
| Arabidopsis thaliana (Cvi) | PHYB | Light response | I143L (PPI domain) | m | |
| Arabidopsis thaliana | AGL6 | Branching | P201L (C-terminal) | m | |
| Helianthus annuus | FT | Flowering time | Frameshift mutation, 17aa insertion | m | |
| Hordeum vulgare | PPD-H1 | Flowering time | G588W in the CCT domain | m | Turner et al. (2005) |
| Medicago | SHP homologs | Fruit morphology | S insertion (C-terminal) | M | |
| Thalictrum thalictroides | C class MADS TF | Floral organ identity | 8 or 13 amino acid deletion, Keratin-like domain | m | |
| Triticum aestivum | Q | Seed free-threshing | I329V | m | Simons et al. (2006) |
| Land plants | DELLA | GA-mediated growth responses | Gradual changes to DELLA, affecting DELLA-GID1 PPI | M | Yasumura et al. (2007) |
| Flowering plants | B class MADS TF | Floral development | Uncharacterized | M | Winter et al. (2002) |
| Altered domain content | |||||
| Arabidopsis thaliana (Cvi) | ANAC089 | Fructose sensitivity | Truncated protein, membrane-anchoring domain lost. | m | |
| Core eudicots | EuAP3 and TM6 | Floral development | Frameshift mutation, novel C-terminal | M | |
| Core eudicots | EuAP1 and EuFUL | Floral development | Frameshift mutation, novel C-terminal | M | ; |
| Grasses (Oryza sativa) | OsMADS5 | Floral morphology | Truncated protein, C-terminal lost | M | |
| Land plants | Terpene synthases | Secondary metabolism | γ-domain lost | M | |
| Green plants | AP2 domain | Development, stress response | HGT of novel domain | M | |
| Green plants | MEKHLA domain | Development | HGT of novel domain | M | |
| Altered activity as an activator or repressor | |||||
| Beta vulgaris | BvFT1, BvFT2 | Flowering, vernalization | Three amino acids in segment B | M | |
| Glycine max | Ln | Leaf morphology | D9H, EAR transcriptional repression motif | m | |
| Zea mays | TGA1 | Inflorescence morphology | K8N | m | Wang et al. (2005) |
| Angiosperms (Arabidopsis thaliana) | TFL, FT | Flowering | Y85H (FT to TFL), H88Y (TFL to FT) | M | |
| Altered protein stability | |||||
| Arabidopsis thaliana (Lm-2) | PHYA | Light response | M548T | m | |
| Arabidopsis thaliana (Cvi) | CRY2 | Light response | V367M | m | |
| Arabidopsis thaliana | ETC2 | Trichome density | K19E | m | |
| Hypomorphic and hypermorphic alleles | |||||
| Arabidopsis thaliana (Cvi) | HMA5 | Cu tolerance | Missense mutations in conserved transmembrane domain | m | |
| Arabidopsis thaliana | HMA3 | Cd accumulation | Missense mutations at ATP-binding site (or nearby) | m | |
| Arabidopsis thaliana (Shahdara) | APR2 | Sulfate accumulation | A399E | m | |
| Arabidopsis thaliana | ACD6 | Late onset necrosis, leaf initiation | Two or three aa replacements in transmembrane domain | m | Todesco et al. (2010) |
| Arabidopsis thaliana (Sy-0) | HUA2 | Plant architecture | K525E | m | Wang et al. (2007) |
| Oryza sativa | SKC1 | Salt tolerance | Four amino acids, transmembrane domain | m | |
| Uncertain function | |||||
| Arabidopsis lyrata | FRI-like | Flowering | 14 aa insertion | m | |
| Oryza sativa | PROG1 | Plant architecture (tillering) | T152S | m |
Structural changes implicated in phenotypic variation and evolution in plants.
M = macroevolutionary scale, m = microevolutionary scale.
ALTERED ACTIVE AND BINDING SITES
Amino acid replacement in the active sites of enzymes, or the DNA-binding sites of transcription factors, is perhaps the most easily understood mechanism of protein evolution. Changes to the core functional domain of a protein, either through gradual replacement of many amino acids over the course of time (Zhao et al., 2008), or through the replacement of a few key residues (; ), has the potential to generate novel protein function. Active and binding site changes also have the greatest potential to be deleterious if they destroy a protein’s primary function (). Despite this potential for negative effects, numerous examples (outlined below) have been uncovered where active and binding site evolution has been tolerated and led to neofunctionalization.
SECONDARY METABOLITES IN DEFENSE
Plants are remarkable for their secondary metabolite chemistry: they possess a diversity of chemical compounds, often involved in defense (). Gene duplication followed by neofunctionalization is a novelty-generating mechanism observed frequently in the evolution of enzymes and secondary metabolites. Gene duplication followed by active site evolution has been described in the synthesis of the arabidopyrones (Arabidopsis-specific compounds; Weng et al., 2012); glucosinolates in the Arabidopsis relative Boechera (); and pyrrolizidine alkaloids in the Convolvulaceae and the Asteraceae (; ). Both in the evolution of novel glucosinolate-producing enzymes in Boechera (), and in the evolution of pyrrolizidine alkaloid production in the Convolvulaceae (), positive selection acting on active site amino acid residues was detected. The positively selected residues were assayed for function, and found to indeed alter enzyme function in predictable ways (; ). This pattern of gene duplication, positive selection, and neofunctionalization has been proposed as a mechanism for glucosinolate biosynthesis evolution in the Brassicaceae (), and appears to be relevant for a broader spectrum of secondary metabolite evolution.
A second theme observed in the evolution of novel enzymes is that of a promiscuous enzyme becoming more specialized through the course of evolution. In both pyrrolizidine alkaloid and arabidopyrone synthesis, the enzyme maintaining ancestral function shows weak activity toward the substrate used by the neofunctionalized enzyme (Weng et al., 2012; ). In these cases, which may be fairly prevalent, the catalytic activity of the progenitor enzyme may be considered a molecular exaptation. An exaptation, as defined by , is a feature coopted for some current function following an origin for a different function, or no function at all. Promiscuous catalytic activity of an enzyme may serve as an exaptation in the evolution of new enzyme functions after gene duplication (; ). This may also still be considered neofunctionalization, depending on the definition of function used. If an evolutionary definition of function is used – an enzyme’s function is the function it was selected for – then the increased specialization is indeed neofunctionalization. If, instead, we choose a purely mechanical definition – a promiscuous enzyme functions to produce a range of products – then exaptation, but not neofunctionalization, would be better applied. In the case of biochemical enzymes, many neofunctionalization events may be exaptations, but not all neofunctionalization events are because of exaptation.
HERBICIDE RESISTANCE
Herbicide resistance, both naturally and experimentally derived, is often the result of structural changes, particularly in the active sites of enzymes. The possible shifts to resistance in an herbicide-sensitive protein is dependent on where a particular herbicide binds. If an herbicide binds within an enzyme’s catalytic site, there are relatively few amino acid changes that can confer resistance, while still maintaining catalytic activity. If an herbicide binds outside of an enzyme’s catalytic site, a larger spectrum of changes can confer resistance while still maintaining enzyme function. Because herbicide treatment represents extremely strong selective pressure, applied in agricultural settings worldwide, and because both sets of tolerated amino acid changes are relatively small, the evolution of herbicide resistance is a story of molecular convergence. For example, a single amino acid change that confers triazine herbicide resistance in a key photosystem II gene, psbA (S264G), has evolved independently at least 68 times worldwide. Similarly, 22 amino acid replacements at seven sites in the enzyme acetohydroxyacid synthase (AHAS) have been identified in herbicide-resistant weeds (reviewed in ). In a final example of molecular convergence, the same herbicide resistance-conferring mutation (T239I) has arisen separately in the α-tubulin genes of the grasses Eleusine indica and Setaria viridis (; Yamamoto et al., 1998).
FLOWER COLOR EVOLUTION
Flower color evolution is another domain where structural changes in enzymes, along with regulatory changes and enzyme inactivations, have been shown to be important (Wessinger and Rausher, 2012). In Iochroma (Solanaceae) a color change from blue (ancestral) to red (derived) occurred because of three changes: inactivation of one enzyme, downregulation of a second by a distinct locus, and altered functional specificity of a third (Dfr; Smith and Rausher, 2011). It remains unclear which changes occurred first, and were ultimately responsible for the color shift, but it is clear that changes in Dfr specificity occurred both before and after the emergence of the red-flowered ancestor. The five amino acids that differ between the red-flowered and blue-flowered ancestral proteins evolved under positive selection. Ancestral sequence estimation, coupled to site-directed mutagenesis and functional assays revealed that each amino acid change, when it occurs in a specific protein sequence background, confers progressively more specificity for the red color precursor. These results suggest that each of the amino acid changes in Dfr may have been adaptive (Smith et al., 2013).
DNA-BINDING SITE EVOLUTION
Regulatory changes are often considered more prevalent in the evolution of transcription factor function, and hence in the evolution of morphology. However, there is evidence that binding (active) site evolution is of some importance in the evolution of the LEAFY (LFY) and MADS box transcription factors. The A. thaliana protein LFY, like its orthologs in other flowering plants, is a floral integrator and a master regulator of floral organ identity (). In the moss Physcomitrella patens, however, the two LFY genes control the first zygotic cell division and numerous aspects of sporophyte development, not the vegetative to reproductive transition in the sporophyte (Tanahashi et al., 2005). In Ceratopteris, a fern, the expression patterns of LFY homologs and other MADS box genes are not overlapping, suggesting that LFY does not induce MADS box gene expression, as it does in the flowering plants (). Changes in the DNA-binding domain appear to have been important in this altered functional specificity of LFY across the evolutionary history of land plants. Heterologous expression studies, domain swaps, and site-directed mutagenesis experiments suggest that gradual amino acid replacement in the DNA-binding domain, through the course of plant evolution, may have been of some importance in the evolution of altered LFY function ().
The MADS box genes are found in almost all eukaryotic genomes, and have expanded considerably in plant genomes in particular. Plant MADS box genes have key roles in many morphogenetic processes, including flowering, floral development, and fruit development. Careful and exhaustive database searches and phylogenetic analyses have revealed that the MADS box genes of eukaryotes may have evolved from a gene encoding a topoisomerase subunit (TopoIIA subunit A). DNA topoisomerases, like TopoIIA, have central roles in DNA replication, transcription, recombination, and chromosome segregation. Gradual changes in the DNA-binding domain may have eventually led to the DNA-binding specificity for CArG boxes observed in MADS box proteins ().
A single amino acid replacement (K80N) in the MYB domain transcription factor SHATTERING4 (SH4) is responsible for the non-shattering phenotype characteristic of cultivated rice (; Zhu et al., 2012). In wild rice species, the seeds abscise from the inflorescence axis (shattering) because of the formation of an abscission zone. In cultivated rice, seeds are retained on the inflorescence axis and the abscission zone is reduced, allowing for easier harvest. K80N is in the DNA-binding domain of sh4 and probably undermines or changes (but not abolishes) protein function, thus interrupting abscission layer formation ().
Structural active site changes may well be tolerated at a higher frequency in biosynthetic enzymes, and lead to novel phenotypes more often than analogous changes in transcription factors, but we see no particular reason to consider the evolution of transcription factors and the evolution of biochemical enzymes as two fundamentally distinct processes. We suspect that one of the recurrent themes identified in enzyme evolution – gene duplication followed by neofunctionalization – may rather become a more general theme in protein evolution. Gradual binding site evolution of transcription factors, as demonstrated in LFY and suggested in the MADS box proteins, may be more widespread. Although DNA-binding domains are often deeply conserved in gene families, it remains conceivable that the DNA-binding profile of a transcription factor may diverge following a gene duplication event. It is fairly laborious to identify transcription factor binding sites, even in model organisms. More sequenced genomes, however, along with new techniques such as chromatin immunoprecipitation coupled to next generation sequencing (ChIP-Seq) may allow us to uncover more examples of structural transcription factor evolution. ChIP-Seq has the potential to reveal altered DNA-binding profiles through time, whether this is because of altered binding sites, altered protein–protein interactions (PPIs), or other mechanisms. This is not to discount the demonstrated importance of changes in transcription factor gene expression in morphological evolution (), but only to highlight the potential importance of structural and regulatory changes occurring together through deep time.
Molecular convergence may also become a more general theme in protein evolution (). As with biosynthetic enzymes, a protein with DNA-binding activity has a finite genotypic space to explore in adopting some new function (binding a new DNA motif, for example) (Wagner, 2011). Consequently, the subset of changes that can occur at key residues is relatively small. Further examples may reveal recurrent changes in homologous protein domains not just in biosynthetic enzymes and herbicide-targeted proteins, but also in transcription factors.
ALTERED PROTEIN–PROTEIN INTERACTIONS
Protein–protein interactions are of prime importance in plant development. There are many examples of particular interactions regulating key developmental and physiological processes (e.g., Riechmann et al., 1996a; ; ). Altered PPIs may be one way to generate functional diversity without negatively affecting core protein function. The DNA-binding domain of a protein may stay intact, but an interaction domain may change to interact with a new partner, perhaps expressed in a discrete domain. In this way novel functions can emerge, while the protein’s original functions are preserved (). Despite this compelling argument for investigating PPI evolution, and despite their integral role in development, few studies have tackled PPIs in an evolutionary framework.
One interaction that has been studied in an evolutionary context occurs between the gibberellin phytohormones (GA), GID1-like proteins (GLP1), and the DELLA transcriptional repressors. In A. thaliana DELLA proteins, as part of GLP1–GA–DELLA complexes, are polyubiquitinated and recruited to the 26S proteasome for destruction, releasing DELLA targets from repression (reviewed in Sun, 2011). The GLP1–GA–DELLA interaction is deeply conserved in angiosperms (Sun, 2011), and appears to have been acquired gradually through the course of land plant evolution (Yasumura et al., 2007). The results of mutant analyses and heterologous transformation experiments suggest that DELLA’s acquired their characteristic growth-repressive function after the divergence of the lycophytes from the rest of the land plants, perhaps through cis-regulatory changes. The GA-stimulated GLP1–DELLA interaction appears to have arisen after the divergence of the bryophytes from the remainder of the land plants, probably through structural alterations to DELLA proteins. Thus DELLA protein changes that facilitate the GLP1–DELLA interaction, together with the evolution of an altered GA response, allowed for the emergence of the GLP1–GA–DELLA module characteristic of flowering plants (Yasumura et al., 2007).
In the study of plant development, the network of interactions between the ABC(E) MADS box proteins has been extensively investigated. The ABC(E) class MADS box genes, and the single non-MADS A class gene AP2 (APETALA2), control floral organ identity in a combinatorial manner. In Arabidopsis and Antirrhinum the A class genes control sepal identity. The A and B class genes together control petal identity, B and C class genes together confer stamen identity, and the C class genes control carpel identity (). The E class genes are needed in all four whorls of the flower for proper organ identity specification (; ). The ABC(E) MADS box proteins are known to dimerize, but probably function as part of tetramers (“floral quartets”). These proteins have four domains: the DNA-binding MADS domain, an Intervening domain (I), a keratin-like coiled coil (K), and a disordered C-terminal domain. The I, K, and C-domains have been implicated in mediating PPIs amongst MADS box proteins (reviewed in ).
There are a few examples where novel mutant phenotypes are probably caused by disrupted PPIs of MADS box transcription factors. The fast neutron induced seirena mutant of the California poppy, Eschscholzia californica (Ranunculaceae), shows a B class mutant phenotype, and may result from compromised interactions between the B class, C class, and E class MADS box proteins. Site-directed mutagenesis experiments revealed that the B–C–E interaction in Eschscholzia may be mediated by the PISTILLATA (PI) motif, missing from sei-1 mutant protein (). The PI motif is conserved, but not universally present, in PI homologs. Although the PI motif may well have a role in MADS box complex formation wherever it is found, distinct interaction motifs may have evolved convergently in lineages where the PI motif is missing or altered, but higher order complexes still form (). The double-flowered mutant phenotype of an ornamental cultivar of Thalictrum thalictroides (Ranunculaceae) may also be the result of disrupted PPIs between C and E class MADS box proteins ().
APETALA3 (AP3)-like and PI-like genes comprise the two main lineages of B class MADS box genes. In all core eudicots investigated thus far, B class proteins bind DNA as obligate heterodimers: AP3-like proteins cannot bind DNA without PI-like proteins and vice-versa (Riechmann et al., 1996a,b). The AP3–PI heterodimer in Arabidopsis goes on to autoregulate late PI and AP3 expression (). This obligate heterodimer relationship is uncommon in the large MADS box gene family (Riechmann et al., 1996a), and obligate heterodimerization coupled with autoregulation is a rare, if not unique regulatory mechanism. All angiosperms investigated thus far have at least one AP3-like and one PI-like gene, and AP3-like and PI-like proteins bind DNA as obligate heterodimers in distantly related angiosperms, including the grass Zea mays (Vandenbussche et al., 2004; Whipple et al., 2004; ; ). The only characterized B class proteins isolated from a gymnosperm thus far (the Gnetalean Gnetum gnemon) bind DNA as homodimers (Winter et al., 2002; Wang et al., 2010). These data, taken together, suggested that the obligate heterodimerization relationship evolved from a homodimerizing ancestor shortly after the duplication event that led to the AP3 and PI gene lineages, and prior to the diversification of the angiosperms. However, PI homologs from Lilium were found to be capable of homodimerizing and heterodimerizing (Winter et al., 2002), but with no other data points, it was unclear whether this was an autapomorphy or indicative of a broader evolutionary trend. The single PI-like protein (J-PI) in Joinvillea, a close grass relative, can homodimerize (Whipple and Schmidt, 2006). PI-like homodimerization has also been observed in Chloranthus (Chloranthaceae; ) and Eschscholzia (). Together with the data from Lilium, these data imply the intriguing convergent evolution of obligate heterodimerization both in the monocots and in the lineage leading to the core eudicots. What remains to be deciphered is why obligate B class heterodimerization evolved at least twice. What, if any, is the functional difference between B class homodimers and heterodimers? One hypothesis suggests that the convergent evolution of obligate AP3–PI interaction is not the result of drift, but rather because the AP3–PI heterodimer confers a selective advantage: a robust switch in floral development (). It must be stated that all investigations into B class homo- vs. heterodimerization have been conducted in vitro. There is no evidence as of yet that PI-like homodimers function in planta.
The C class genes (PLENA and FARINELLI) of Antirrhinum have subfunctionalized, in part because of shifting PPIs. PLENA controls both male and female organ identity (stamens and carpels), while FARINELLI confers only male organ identity, both in A. majus and when overexpressed in A. thaliana flowers (; ; ). When ectopically expressed, PLE, like AG, is capable of specifying both male (stamen) and female (carpel) organ identity, but FAR confers only stamen identity. This functional divergence has been traced to a single glutamine insertion in FAR, the result of an altered splice site. This amino acid insertion affects PPIs with the E class SEPALLATA (SEP) proteins: FAR can only interact with SEP3, while AG can interact with SEP1, 2, and 3. This change in PPIs, overlaid on SEP homolog expression patterns, has resulted in the subfunctionalization of FAR and PLE. Structural and regulatory changes have acted in concert to effect functional differentiation (). In the genus Medicago (Fabaceae), a major difference in fruit morphology is correlated with a similar single amino acid insertion into SHATTERPROOF (SHP)-like MADS box proteins. Rather than disrupting PPIs, however, the amino acid insertion may strengthen the interaction between Medicago SHP and SEP3 homologs ().
Outside of the MADS box genes, there is evidence that PPIs affect natural variation in altered trichome density (Symonds et al., 2011) and light response in A. thaliana (), domestication traits in wheat (Simons et al., 2006), and flowering time in barley (Turner et al., 2005). Trichome density, in particular, changes in response to herbivore pressure, and has a fitness effect (). The bHLH transcription factor ATMYC1 was found to underlie a QTL for trichome density in four separate A. thaliana mapping populations. A single amino acid change (P189A) was sufficient to abolish binding of atmyc1 to TTG (TRANSPARENT TESTA GLABRA) and GL1 (GLABROUS1) in yeast two hybrid assays (Symonds et al., 2011). Both TTG and GL1 are essential for trichome initiation in A. thaliana (reviewed in ). Presumably it is this altered interface with the trichome initiation pathway that results in reduced trichome initiation in plants with the Ler atmyc1 allele. In a cautionary tale for evolutionary biologists, positive selection acting on the ATMYC1 coding sequence was detected, but the region under selection was downstream of the trichome-reducing P189A substitution (Symonds et al., 2011).
COMPETITIVE INHIBITION AND DOMINANT NEGATIVES
Competitive inhibition of transcription factors by similar, but truncated, proteins represents one special PPI that has repeatedly surfaced as a regulatory mechanism (Staudt and Wenkel, 2010; Seo et al., 2011a). For example, the HD-ZIPIII transcription factor REVOLUTA, a key regulator of vegetative development (reviewed in ), is negatively regulated by the LITTLE ZIPPER (ZPR) proteins. HD-ZIPIII transcription factors consist of four domains: a DNA-binding homeodomain, a leucine zipper domain, a START domain predicted to bind small hydrophobic molecules, and a MEKHLA domain (discussed below). All of the HD-ZIPIII proteins bind DNA as dimers. One class of genes that is upregulated by REV in particular is the ZPR genes. In contrast to the HD-ZIPIII proteins, the only recognizable domain in the ZPR proteins is the leucine zipper domain (Wenkel et al., 2007; ). The ZPR proteins bind REV in vitro, and inhibit DNA binding by REV. The ZPR overexpression phenotypes resemble those seen when HD-ZIPIII function is reduced. These data suggest a negative feedback loop, where the HD-ZIPIII proteins upregulate ZPR expression and the ZPR proteins repress HD-ZIPIII genes by sequestering them in inactive heterodimers. ZPR genes have been found in Arabidopsis, maize, and rice, so this form of gene regulation may be relatively ancient in the flowering plants (Wenkel et al., 2007).
The form of competitive inhibition demonstrated in the HD-ZIPIII/ZPR system is evident in a number of other transcription factor families: IDD14 in starch accumulation (Seo et al., 2011b), ZHD5 and MIF in floral and leaf development (), Aux/IAA and ARF proteins in auxin response (Ulmasov et al., 1997;Vernoux et al., 2011), MEINOX and BELL proteins in leaf development (), and the MYB proteins DIVARICATA and RADIALIS in establishing floral symmetry (; ). The smaller, competitive inhibitor proteins have been termed microProteins or short interfering peptides (siPEPs; Staudt and Wenkel, 2010; Seo et al., 2011a). Very few of these systems have been investigated in an evolutionary context, so it remains unclear whether the siPEPs have arisen because of convergent evolution, or whether they share a common ancestor with their competitors and have undergone domain loss. The second scenario, common ancestry and domain loss, seems more likely given the widespread occurrence of domain loss in gene family evolution (). In the case of IDD14, the competitive inhibitor is the result of an alternative splicing event, suggesting that there may be many more examples of competitive inhibition lurking in plant genomes (Staudt and Wenkel, 2010; Seo et al., 2011a).
The above examples of competitive inhibition are reminiscent of the effects of dominant-negative alleles. Often, dominant-negative alleles are thought to “poison” the protein complexes they are part of, ultimately causing a mutant phenotype. Two separate cases of dominant-negative alleles in natural variation have recently been described in A. thaliana and in Helianthus annuus (Asteraceae). In A. thaliana, QTL mapping of natural variation in branching pattern resulted in the identification of a naturally occurring allele of the MADS box protein AGL6 that, in combination with other loci, causes reduced shoot branching. This dominant-negative allele results in single amino acid replacement (P201L) in the C-terminus, a region of the protein thought to mediate higher-order PPIs ().
In H. annuus, the sunflower, three tandem duplicate homologs of the A. thaliana floral inducer FT(FLOWERING LOCUS T) underlie a single large-effect QTL for flowering time. All three paralogs show divergent expression patterns, indicative of subfunctionalization. In addition, there is a frameshift mutation in the domesticated version of one of the paralogs, HaFT1, that causes a 17aa insertion in the encoded protein. In A. thaliana, the frameshift HaFT1 allele abrogates the early flowering phenotype (under long days) conferred by a 35S::HaFT4 transgene. This dominant-negative effect may result from disrupted PPIs between HaFT1 and its floral induction partners. The frameshifted allele is found almost exclusively in domesticated, not wild, sunflower cultivars, and there is evidence for a selective sweep at the genomic region surrounding HaFT1, indicating that this altered gene may have been a target of selection during domestication ().
ALTERED DOMAIN CONTENT
Protein domains have been described that target proteins to particular cellular compartments [e.g., nuclear localization signals ()]; that act as repressor or activator domains [e.g., the EAR repression domain ()]; that function in mediating the assembly of protein complexes [e.g., the PDZ domain ()]; that act as post-translational modification (PTM) sites (); and that target proteins for destruction [e.g., the D box, ()], to name a tiny subset of the existing diversity. The evolutionary origin of many characterized protein domains is often unclear or unexamined, except in a few cases. In a study of the evolution of plant protein domain gain and loss, showed that new, plant-specific domains have emerged throughout plant history, but the highest rate of novel domain emergence was detected on the branch leading to the seed plants. This study also demonstrated that the arrangement of domains in individual proteins varies considerably, particularly at shallower phylogenetic levels. Lineage-specific domain architectures are not uncommon ().
Plant-specific gene lineages may possess domains present in all eukaryotes, but in land-plant-specific combinations (Xing et al., 2013). For example, the F-box and the tubulin DNA-binding domain are both found in all eukaryotes, but they are found adjacent to one another only in plants (). Similarly, HMG-box and AT-rich interaction domains are found in combination only in plants (). To catalog all characterized protein motifs and domains, and their occurrence in plant genomes, is beyond the scope of this paper. Instead, we have chosen to discuss examples where new functional domains in plant proteins have arisen through defined mechanisms, and to discuss examples where domain loss has been shown to have some defined functional consequence.
NOVEL DOMAINS FROM HORIZONTAL GENE TRANSFER
There is evidence for horizontal gene transfer (HGT) between closely allied eukaryotic species (, ; ; Xi et al., 2013), for massive chloroplast–nuclear gene transfer (, ; Stegemann et al., 2003), and for inter-species chloroplast movement under stress (Stegemann and Bock, 2009; Stegemann et al., 2012). Combined, these data support the notion that new genes and new domains may arise in plant genomes through HGT. Two examples in particular highlight the recruitment of domains from HGT (the MEKHLA and the AP2 domains) to key developmental processes in plants.
The AP2 domain is found in 144 Arabidopsis transcription factors with diverse, important roles in plant development and in stress response (). Outside of Arabidopsis, the AP2 domain has been found in all lineages of green plants investigated – from green algae to monocots. In P. patens, four proteins with AP2 domains have been found to be important for specifying cell-type identity (). The AP2 domain was initially considered to be plant-specific (Riechmann and Meyerowitz, 1998), but more sophisticated database-searching methods revealed the existence of AP2 domains in homing endonucleases from a cyanobacterium (Trichodesmium erythraeum), a ciliate (Tetrahymena thermophila), and in two phages. No AP2 domains were detected in any other eukaryotes, apart from plants and T. thermophila.The T. erythraeum AP2 domain aligns best with plant AP2 domains, and is also capable of binding DNA in a sequence-specific manner ().
Multiple lines of evidence support the hypothesis that the AP2 domain arose in plant genomes through HGT from a prokaryote, rather than convergent or divergent evolution:(1) There is homology between the cyanobacterial gene and plant AP2-containing genes that extends beyond the AP2 domain. (2) Very few (15%) AP2/ERF transcription factor genes have introns. (3) The identified non-plant AP2 domains have a very similar predicted secondary structure to that of plant AP2 domains, and share more than 40% sequence identity with plant AP2 domains. (4) The nature of homing endonucleases themselves: homing endonuclease genes duplicate themselves in a process of gene conversion (). In addition, there is evidence that they have moved extensively, through HGT, into all of the biological kingdoms (reviewed in Stoddard, 2011).
The MEKHLA domain of REV is important for proper protein function (), but it is not required for transcriptional activation. Instead, the MEKHLA domain may be acting as a negative regulator of REV (). Phylogenetic analysis suggests that the MEKHLA domain, characteristic of HD-ZIP III transcription factors, found its way into plant genomes through either HGT from plant-associated bacteria, or through mass nuclear transfer from the early chloroplast (; ).
The evolution of the AP2 and MEKHLA domains demonstrates how new domains may arise and adopt important regulatory roles in plant development. Both domains were recruited into plant genomes at deep nodes in their phylogenetic histories: AP2 and MEKHLA domains are found in all plants, including the green alga Chlamydomonas. Given the hypothesized widespread occurrence of HGT in plant genomes (), these examples may not be remarkable. Careful phylogenetic analysis, focused on particular domains rather than genes, may well reveal many more horizontally transferred protein domains.
NOVEL DOMAINS FROM FRAMESHIFT MUTATIONS
The B class MADS box genes AP3 and PI are key for controlling petal and stamen development in many flowering plants (; Vandenbussche et al., 2004; Whipple et al., 2004; ; ). There are two AP3-like genes in most core eudicots, products of a gene duplication event that generated the euAP3 and TM6 gene lineages (). The two gene lineages possess distinct, evolutionarily conserved C-terminal domains (Vandenbussche et al., 2003; ). The derived euAP3 C-terminal domain (including the euAP3 motif) was probably generated through a frameshift mutation that occurred at the base of the core eudicots (). Where they have been investigated, the euAP3 and TM6 gene lineages have distinct but overlapping roles in floral development (Vandenbussche et al., 2004). There is some evidence that this functional distinction in the core eudicots is mediated, at least in part, by the proteins’ divergent C-termini (). Frameshift mutations have arisen and been maintained in other taxa with AP3-like gene duplications, and in other gene lineages, although the functional significance of the novel motifs generated has not been extensively investigated (; Vandenbussche et al., 2003; ; ).
DOMAIN LOSS
Domain loss can be detected by phylogenetic analysis of individual protein families (Zhang and Wang, 2005; ), and a large-scale analysis of protein domain evolution in plants revealed that domain loss occurs fairly frequently in plant lineages, particularly at family and subfamily-specific phylogenetic levels (). Although relatively easy to detect, the functional significance of these novel domain architectures is difficult to assess. Three examples where the function of domain loss has been shown involve the terpene synthase biosynthetic enzymes (); the E class MADS box transcription factors from rice (); and a NAC domain transcription factor from A. thaliana ().
Plant terpene synthases are thought to have evolved from diterpene synthases, essential enzymes in the gibberellin synthesis pathway. Huge chemical diversity exists in plants, partly because of the evolution of the terpene synthases. Terpene synthases have lost the central γ-domain characteristic of diterpene synthases. There is some evidence that γ-domain loss has occurred multiple times in various taxonomic groups, but it remains uncertain whether γ-loss was a single evolutionary event, or the result of several parallel domain losses ().
The E class MADS box genes of rice Leafy hull sterile (LHS) and OsMADS5 (OSM5) are the products of a gene duplication event that occurred early on in the diversification of the grasses (). Lhs1 mutants are characterized by leafy lemmas, paleas, and lodicules, fewer stamens, and occasional extra pistils and/or florets (). osm5 mutants show a very mild floral phenotype: partial fusion between the lodicules (petal homologs) and the lemma and palea (sepal homologs; ). There is a premature stop codon in OSM5, shortly after the DNA-binding MADS domain of the protein. Perhaps because of this truncation, postdating the gene duplication event that produced OSM5, OSM5 has a different spectrum of binding partners to LHS, which may contribute to its divergent function (; ).
The Cvi and Ler accessions of Arabidopsis have differing sensitivities to fructose. A QTL for fructose sensitivity was cloned, and it corresponds to a gain-of-function mutation in a NAC domain transcription factor gene (ANAC089). A premature stop codon in the Cvi allele leads to a truncated protein, missing a predicted membrane-bound domain (). In some NAC transcription factors, the membrane-bound domain serves to retain the protein in the cytoplasm in an inactive form (Seo et al., 2008). Without the membrane-anchoring domain, ANAC089 is constitutively active in the nucleus, probably as a transcriptional activator. Although it does demonstrate some of the molecular diversity that might be tolerated in nature, the Cvi allele of ANAC089 is rare, and possibly deleterious ().
ALTERED ACTIVITY OF TRANSCRIPTIONAL REPRESSORS AND ACTIVATORS
FT (FLOWERING LOCUS T) and TFL (TERMINAL FLOWER) are distantly related paralogous regulators of flowering in Arabidopsis. FT is a floral integrator, and FT expression induces flowering. TFL is a floral repressor and maintains indeterminate growth of the shoot apical meristem. This functional distinction between FT and TFL has been separately traced to a single amino acid difference in the predicted anion-binding pocket (Y85 in FT and H88 in TFL; ) and to differences in an external protein loop termed “segment B” (). There is evidence that FT and TFL exert their respective functions as part of transcriptional activator and repressor complexes (reviewed in Taoka et al., 2013). Y85 in FT and H88 in TFL may be working to recruit transcriptional coactivators or corepressors, either alone or in concert with “segment B” (; Taoka et al., 2013).
Similarly, two FT homologs in Beta vulgaris (sugarbeet) show antagonistic functions in the regulation of flowering. BvFT2 function is conserved with FT and acts as a floral promoter while BvFT1 represses flowering. The antagonistic functions of BvFT1 and BvFT2 have been traced to differences at three amino acid residues in “segment B.” BvFT1 and BvFT2 appear to be the products of a relatively recent gene duplication event: BvFT2 homologs have not been found outside of the genus Beta ().
Some soybean (Glycine max, Fabaceae) cultivars display a narrow leaflet phenotype, long been known to be controlled by a single gene, ln. Ln has been mapped to a genomic region that includes a single gene – Gm-JAG1– a homolog of the A. thaliana zinc-finger gene JAGGED. A single amino acid substitution (D9H) in the transcriptional repressor EAR motif of Gm-JAG1 is likely to be the causal ln mutation, rendering Gm-JAG1 non- or hypofunctional (). In addition to altering leaf morphology, the ln mutation affects the number of seeds per fruit (You et al., 1995; ). This example highlights how pleiotropic protein mutations may be tolerated and maintained in populations, possibly because of some fitness advantage. In this case, a fitness advantage may be conferred by the higher seed set of the Ln/ln heterozygote ().
Teosinte glume architecture1 (tga1), an SBP-domain transcription factor, has been identified as a key locus in the domestication of maize from its wild progenitor, teosinte (Wang et al., 2005). Morphological differences between maize and teosinte ears are probably caused by a single coding change (K6N) in Tga1. This single amino acid change alters the biochemical function of TGA1, but the exact mechanism of this change remains unclear (). Given the degree of morphological change associated with this single amino acid change, it is reasonable to hypothesize that TGA1 is a transcriptional activator, activating the set of genes responsible for the development of teosinte-like glume and inflorescence morphology. The single amino acid change observed in maize was sufficient to abolish, or significantly alter, this role of TGA1 (Wang et al., 2005).
ALTERED PROTEIN STABILITY
Protein degradation is one common mechanism of post-translational gene regulation. In plants, polyubiquitylation of proteins, followed by proteolysis mediated by the 26S proteasome, is a particularly prevalent mechanism of post-translational regulation (Vierstra, 2003). Examples of altered protein stability, possibly because of altered polyubiquitylation and degradation, have been observed in the light-sensing cryptochromes and phytochromes, known to be degraded in a light- and ubiquitin-dependent manner (; ; ).
Light responses, such as flowering time, vary considerably amongst A. thaliana accessions (). Multiple independent inactivations of FRIGIDA and FLOWERING LOCUS C have been identified in the study of natural variation in flowering time (reviewed in ), but structural changes in light-sensing cryptochromes and phytochromes have also been implicated. For example, a novel allele of CRYPTOCHROME-2 (CRY2) underlies a large-effect QTL controlling daylength sensitivity (). A single missense amino acid substitution in CRY2 (V367M) results in a more stable protein as compared to the more common Ler allele (). The same amino acid substitution in CRY2 (V367M) is also associated with shorter fruits, and decreased ovule number (). A single amino acid (M548T) substitution in the phytochrome protein PHYA underlies reduced far-red light sensitivity in the Lm-2 accession of A. thaliana (). The substituted amino acid is able to affect multiple aspects of PHYA function: the photochemical properties of Lm-2 PHYA are affected by the M548T substitution; Lm-2 PHYA levels remained high in the light; and Lm-2 PHYA showed reduced autophosphorylation activity (). It is conceivable that the observed amino acid substitutions in both CRY2 and PHYA are interfering with some aspect of the phosphorylation, polyubiquitination, and 26S-mediated protein degradation pathway.
surveyed naturally occurring A. thaliana accessions for variation in trichome density. A single amino acid change, K19E, in the MYB domain transcription factor gene ENHANCER OF TRY AND CPC 2 (ETC2), underlies one large effect trichome density QTL. K19, although highly conserved in single-repeat R3 MYB proteins, is not in a characterized protein domain, but may represent an ubiquitination site. In the low-density accessions, where this lysine is replaced with a glutamate, ubiquitination of the ETC2 repressor may have been reduced or lost, resulting in higher stability of ETC2 and, ultimately, fewer trichomes (). An interesting point arising from this study is the relationship between trichomes and root hairs. ETC2 is the only characterized single-repeat R3 MYB gene family member that affects trichome density, but not root hair density. The K19E replacement, found at a relatively high frequency in naturally occurring accessions, may be tolerated because it occurs in a gene with low pleiotropy ().
HYPOMORPHIC AND HYPERMORPHIC ALLELES
Mutations that either decrease or increase protein function can be termed hypomorphs or hypermorphs, respectively (). Examples of both hypomorphic and hypermorphic alleles in natural variation in a number of A. thaliana phenotypes have been described.
Hyperaccumulation and salt tolerance have repeatedly been associated with altered functionality of transporters and biosynthetic enzymes. Amino acid substitutions in conserved domains of HMA3 and HMA5 underlie A. thaliana QTL for Cd accumulation () and Cu tolerance (), respectively. The amino acid substitutions in HMA3 result in a hypofunctional translocator and, ultimately, higher Cd accumulation. Similarly, high sulfate accumulation in the Shahdara accession of A. thaliana () and differences in salt tolerance between rice accessions () have been separately associated with hypomorphic alleles.
The late flowering Sy-0 accession of A. thaliana is distinctive in its morphology. The basal rosette is enlarged, aerial rosettes form in the axils of stem leaves, and early floral meristems revert to indeterminate growth (). A single amino acid replacement in the pre-mRNA processing factor, HUA2, is responsible for the majority of the Sy-0 aerial rosette phenotype. HUA2 has been shown to positively regulate the flowering genes AG (floral patterning, floral determinacy) and FLC (flowering time). In the Sy-0 accession, AG function is attenuated, and FLC expression is enhanced. Thus, the single Sy-0 amino acid replacement in HUA2 (K525E) is a partial loss-of-function (hypomorphic) allele with respect to its effects on AG, and a gain-of-function (hypermorphic) allele with respect to FLC expression. Although the morphological phenotype exhibited by the Sy-0 accession is not rare, the nucleotide polymorphism that causes the K525E amino acid replacement is rare. In a survey of 113 A. thaliana accessions, only Sy-0 was found to possess the causative single nucleotide polymorphism (SNP; Wang et al., 2007).
Naturally occurring accessions of A. thaliana exhibit considerable diversity in the rate of leaf production. One accession, Est-1, shows both slower leaf production, as well as extensive necrosis on older leaves. Both slower leaf production and late onset leaf necrosis in Est-1 are due to gain of function (hypermorphic) mutations in a single gene, ACCELERATED CELL DEATH6 (ACD6). ACD6 encodes a transmembrane protein involved in the regulation of salicylic acid accumulation and the defense response. The increased activity of ACD6 observed in Est-1, and 14 other A. thaliana accessions, may confer enhanced pathogen resistance, but with costs. Enhanced pathogen resistance comes at the price of reduced biomass (fewer, smaller leaves), which in turn is associated with fitness costs (; Todesco et al., 2010).
MICRO- vs. MACROEVOLUTIONARY DYNAMICS IN PROTEIN EVOLUTION
We have divided our discussion into six broad categories of protein change, but we could also have divided the examples according to the evolutionary scale at which the change was predicted to occur (Table 1). Evolutionary change can be considered microevolutionary (occurring within a single population or species) or macroevolutionary (transcending species boundaries; ). When protein evolution is considered with these categories in mind, do certain changes occur preferentially on a micro- or macroevolutionary scale? It must be stated that all evolutionary events probably happen at a microevolutionary scale, within a population, but the scale at which we observe these events changes. Some categories of change were detected at both micro- and macroevolutionary scales, including active site evolution of enzymes, altered activity as a transcriptional activator or a repressor, and the evolution of PPIs. The evolution of competitive inhibition appears to occur primarily on macroevolutionary time scales, while dominant negatives were detected exclusively at a microevolutionary scale. Dominant-negative alleles and competitive inhibition are similar in character, and it is conceivable that dominant-negative alleles might represent the first step on one pathway to the evolution of competitive inhibition. Domain loss, observed at both micro- and macroevolutionary scales, may represent another pathway leading to competitive inhibition.
The existing examples of DNA-binding domain evolution occur on very deep, macroevolutionary time scales. Similarly, there were no examples of novel domains at microevolutionary timescales. Are these events so rare, and so often deleterious, that they are seldom uncovered in the study of population-level natural variation? Or, would systematic analysis of DNA-binding or protein domain architecture at a population-level reveal microevolutionary examples?
At the opposite side of the spectrum, but similarly illuminating, lie changes that were detected predominantly on microevolutionary scales. In addition to dominant negatives, hypo- and hypermorphic alleles and altered protein stability were detected almost exclusively on microevolutionary, or intrageneric, time scales. These examples may suggest where to look for innovation on macroevolutionary scales. These changes, sometimes causing drastically altered phenotypes, are tolerated in natural environments. Often it is difficult to distinguish functional and phylogenetic signal from the noise in evolutionary analyses of gene families. Perhaps looking for altered stability of evolutionary variants, for example, might yield insight into the functional consequences of molecular evolution. Altered protein stability, in particular, may represent one way in which a protein’s function might stay intact, but the protein may persist for a shorter or longer period of time. This could conceivably result in a heterochronic shift () in a particular trait.
CONCLUSIONS
One interesting point arising from our survey of the existing literature is that proteins can change in a number of ways that were not uncovered here. One class of changes, in particular, that remained elusive was PTMs. The examples of altered protein stability may have ultimately been because of altered PTMs, but that remains to be determined. The absence of altered PTMs in the study of protein evolution is perhaps because many of the PTMs of individual extant proteins are still incompletely understood, so assessing PTMs in an evolutionary context remains extremely challenging. In the case of QTL cloning, PTM alterations may not be tolerated very often, and will therefore vary only very rarely on microevolutionary scales. Examples do arise in mutant analyses (Soppe et al., 2000; ), so more cases of natural variation in PTMs may be forthcoming. PTMs have clearly arisen and diversified in proteins and the study of their evolution represents an interesting area of future exploration.
Although many of the discussed changes primarily affect transcription factors, the phenotypic outcomes of these changes are often vastly different. Even within one class of change, altered PPIs, one altered interaction affects trichome density in Arabidopsis, another affects floral morphology in Thalictrum. Although similar biochemical changes might have occurred, the ultimate phenotypes on which natural selection might act are distinct and not evolutionarily equivalent.
Genetic analysis (QTL cloning) has deepened our understanding of the molecular underpinnings of phenotypic diversity to a considerable degree. As more QTL are uncovered and cloned, no doubt this understanding will grow ever deeper. But systematically cloning QTLs will not tell us everything there is to know about the evolution of plant form and function. It remains important to combine all of the strategies available to us, including phylogenetic analyses of gene families, structural analyses, and functional analyses of proteins in an evolutionary context, in order to gain a more complete picture of protein evolution. It would also be extremely informative to know how many of the QTL that have been cloned confer adaptive phenotypes, or have the potential to be adaptive under certain conditions. Although challenging, field and laboratory selection tests on some of the more promising accessions would no doubt yield fascinating results.
Statements
Acknowledgments
We would like to thank the three reviewers for helpful suggestions on an earlier version of the manuscript. We gratefully acknowledge support from the National Science Foundation (IOS-1025121 to Clinton J. Whipple). Finally, we would like to apologize to those authors whose work we have inadvertently omitted or could not review at length due to space limitations.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
REFERENCES
1
AbreuM. EMunné-BoschS. (2009). Salicylic acid deficiency in NahG transgenic lines and sid2 mutants increases seed yield in the annual plant Arabidopsis thaliana.J. Exp. Bot.601261–1271. 10.1093/jxb/ern363
2
AgrawalG. K.AbeK.YamazakiM.MiyaoA.HirochikaH. (2005). Conservation of the E-function for floral organ identity in rice revealed by the analysis of tissue culture-induced loss-of-function mutants of the OsMADS1 gene.Plant Mol. Biol.59125–135. 10.1007/s11103-005-2161-y
3
AharoniA.GaidukovL.KhersonskyO.GouldS. M.RoodveldtC.TawfikD. S. (2004). The ‘evolvability’ of promiscuous protein functions.Nat. Genet.3773–76.
4
AhnJ. H.MillerD.WinterV. J.BanfieldM. J.LeeJ. H.YooS. Y.et al (2006). A divergent external loop confers antagonistic activity on floral regulators FT and TFL1.EMBO J.25605–614. 10.1038/sj.emboj.7600950
5
AiroldiC. A.BergonziS.DaviesB. (2010). Single amino acid change alters the ability to specify male or female organ identity.Proc. Natl. Acad. Sci. U.S.A.10718898–18902. 10.1073/pnas.1009050107
6
Alonso-BlancoC.AartsM. G.BentsinkL.KeurentjesJ. J.ReymondM.VreugdenhilD.et al (2009). What has natural variation taught us about plant development, physiology, and adaptation?Plant Cell211877–1896. 10.1105/tpc.109.068114
7
AnkeS.NiemüllerD.MollS.HänschR.OberD. (2004). Polyphyletic origin of pyrrolizidine alkaloids within the Asteraceae. Evidence from differential tissue expression of homospermidine synthase.Plant Physiol.1364037–4047. 10.1104/pp.104.052357
8
AnthonyR. G.WaldinT. R.RayJ. A.BrightS. W. J.HusseyP. J. (1998). Herbicide resistance caused by spontaneous mutation of the cytoskeletal protein tubulin.Nature393260–263. 10.1038/30484
9
AoyamaT.HiwatashiY.ShigyoM.KofujiR.KuboM.ItoM.et al (2012). AP2-type transcription factors determine stem cell identity in the moss Physcomitrella patens.Development1393120–3129. 10.1242/dev.076091
10
ArnaudN.LawrensonT.ÒstergaardL.SablowskiR. (2011). The Same regulatory point mutation changed seed-dispersal structures in evolution and domestication.Curr. Biol.211215–1219. 10.1016/j.cub.2011.06.008
11
BalkundeR.PeschM.HulskampM. (2010). Trichome patterning in Arabidopsis thaliana from genetic to molecular models.Curr. Top. Dev. Biol.91299–321. 10.1016/S0070-2153(10)91010-7
12
BarrettR. D.HoekstraH. E. (2011). Molecular spandrels: tests of adaptation at the genetic level.Nat. Rev. Genet.12767–780. 10.1038/nrg3015
13
BenderothM.TextorS.WindsorA. J.Mitchell-OldsT.GershenzonJ.KroymannJ. (2006). Positive selection driving diversification in plant secondary metabolism.Proc. Natl. Acad. Sci. U.S.A.1039118–9123. 10.1073/pnas.0601738103
14
BergthorssonU.AdamsK. L.ThomasonB.PalmerJ. D. (2003). Widespread horizontal transfer of mitochondrial genes in flowering plants.Nature424197–201. 10.1038/nature01743
15
BergthorssonU.RichardsonA. O.YoungG. J.GoertzenL. R.PalmerJ. D. (2004). Massive horizontal transfer of mitochondrial genes from diverse land plant donors to the basal angiosperm Amborella.Proc. Natl. Acad. Sci. U.S.A.10117747–17752. 10.1073/pnas.0408336102
16
BlackmanB. K.StrasburgJ. L.RaduskiA. R.MichaelsS. D.RiesebergL. H. (2010). The role of recently derived FT paralogs in sunflower domestication.Curr. Biol.20629–635. 10.1016/j.cub.2010.01.059
17
Bornberg-BauerE.HuylmansA.-K.SikosekT. (2010). How do new proteins arise?Curr. Opin. Struct. Biol.20390–396. 10.1016/j.sbi.2010.02.005
18
CarrollS. (2000). Endless forms: the evolution of gene regulation and morphological diversity.Cell101577. 10.1016/S0092-8674(00)80868-5
19
CarrollS. B. (2005). Evolution at two levels: on genes and form.PLoS Biol.3:e245. 10.1371/journal.pbio.0030245
20
CarrollS. B. (2008). Evo-devo and an expanding evolutionary synthesis: a genetic theory of morphological evolution.Cell13425–36. 10.1016/j.cell.2008.06.030
21
CausierB.CastilloR.ZhouJ.IngramR.XueY.Schwarz-SommerZ.et al (2005). Evolution in action: following function in duplicated floral homeotic Genes.Curr. Biol.151508–1512. 10.1016/j.cub.2005.07.063
22
ChaoD. Y.SilvaA.BaxterI.HuangY. S.NordborgM.DankuJ.et al (2012). Genome-wide association studies identify heavy metal ATPase3 as the primary determinant of natural variation in leaf cadmium in Arabidopsis thaliana.PLoS Genetics8:e1002923. 10.1371/journal.pgen.1002923
23
CharoensawanV.WilsonD.TeichmannS. A. (2010). Lineage-specific expansion of DNA-binding transcription factor families.Trends Genet.26388–393. 10.1016/j.tig.2010.06.004
24
ChristensenA.MalcomberS. (2012). Duplication and diversification of the LEAFY HULL STERILE1 and Oryza sativa MADS5 SEPALLATA lineages in graminoid Poales.Evodevo34. 10.1186/2041-9139-3-4
25
CoenE. S.MeyerowitzE. M. (1991). The war of the whorls: genetic interactions controlling flower development.Nature35331–37. 10.1038/353031a0
26
CorleyS. B.CarpenterR.CopseyL.CoenE. (2005). Floral asymmetry involves an interplay between TCP and MYB transcription factors in Antirrhinum.Proc. Natl. Acad. Sci. U.S.A.1025068–5073. 10.1073/pnas.0501340102
27
CuiH.LevesqueM. P.VernouxT.JungJ. W.PaquetteA. J.GallagherK. L.et al (2007). An evolutionarily conserved mechanism delimiting SHR movement defines a single layer of endodermis in plants.Science316421–425. 10.1126/science.1139531
28
CuiR.HanJ.ZhaoS.SuK.WuF.DuX.et al (2010). Functional conservation and diversification of class E floral homeotic genes in rice (Oryza sativa).Plant J.61767–781. 10.1111/j.1365-313X.2009.04101.x
29
DaviesB.MotteP.KeckE.SaedlerH.SommerH.Schwarz-SommerZ. (1999). PLENA and FARINELLI: redundancy and regulatory interactions between two Antirrhinum MADS-box factors controlling flower development.EMBO J.184023–4034. 10.1093/emboj/18.14.4023
30
DinkinsR. D.KeimK. R.FarnoL.EdwardsL. H. (2002). Expression of the narrow leaflet gene for yield and agronomic traits in soybean.J. Hered.93346–351. 10.1093/jhered/93.5.346
31
DixonR. A. (2001). Natural products and plant disease resistance.Nature411843–847. 10.1038/35081178
32
DreaS.HilemanL. C.De MartinoG.IrishV. F. (2007). Functional analyses of genetic pathways controlling petal specification in poppy.Development1344157–4166. 10.1242/dev.013136
33
El-AssalS. E.Alonso-BlancoC.HanhartC. J.KoornneefM. (2004). Pleiotropic effects of the Arabidopsis cryptochrome2 allelic variation underlie fruit trait-related QTL.Plant Biol.6370–374. 10.1055/s-2004-820890
34
El-AssalS. E.Alonso-BlancoC.PeetersA. J.RazV.KoornneefM. (2001). A QTL for flowering time in Arabidopsis reveals a novel allele of CRY2.Nat. Genet.29435–440. 10.1038/ng767
35
FiliaultD. L.WessingerC. A.DinnenyJ. R.LutesJ.BorevitzJ. O.WeigelD.et al (2008). Amino acid polymorphisms in Arabidopsis phytochrome B cause differential responses to light.Proc. Natl. Acad. Sci. U.S.A.1053157–3162. 10.1073/pnas.0712174105
36
FinetC.Berne-DedieuA.ScuttC. P.MarletazF. (2013). Evolution of the ARF gene family in land plants: old domains, new tricks.Mol. Biol. Evol.3045–56. 10.1093/molbev/mss220
37
FlagelL. E.WendelJ. F. (2009). Gene duplication and evolutionary novelty in plants.New Phytol.183557–564. 10.1111/j.1469-8137.2009.02923.x
38
FloydS. K.ZalewskiC. S.BowmanJ. L. (2006). Evolution of class III homeodomain–leucine zipper genes in streptophytes.Genetics173373–388. 10.1534/genetics.105.054239
39
FourquinC.Del CerroC.VictoriaF. C.Vialette-GuiraudA.De OliveiraA. CFerrándizC. (2013). A change in SHATTERPROOF protein lies at the origin of a fruit morphological novelty and a new strategy for seed dispersal in Medicago genus.Plant Physiol.162907–917. 10.1104/pp.113.217570
40
FridmanE.CarrariF.LiuY.-S.FernieA. R.ZamirD. (2004). Zooming in on a quantitative trait for tomato yield using interspecific introgressions.Science3051786–1789. 10.1126/science.1101666
41
GalimbaK. D.TolkinT. R.SullivanA. M.MelzerR.TheissenGDi StilioV. S. (2012). Loss of deeply conserved C-class floral homeotic gene function and C- and E-class protein interaction in a double-flowered ranunculid mutant.Proc. Natl. Acad. Sci. U.S.A.109E2267–E2275. 10.1073/pnas.1203686109
42
GherardiniP. F.WassM. N.Helmer-CitterichMSternbergM. J. E. (2007). Convergent evolution of enzyme active sites is not a rare phenomenon.J. Mol. Biol.372817–845. 10.1016/j.jmb.2007.06.017
43
GouldS. J. (2002). The Structure of Evolutionary Theory.Harvard University Press.
44
GouldS. J.VrbaE. S. (1982). Exaptation – a missing term in the science of form.Paleobiology84–15.
45
GramzowL.RitzM. S.TheissenG. (2010). On the origin of MADS-domain transcription factors.Trends Genet.26149–153. 10.1016/j.tig.2010.01.004
46
GreenhagenB. T.O’mailleP. E.NoelJ. P.ChappellJ. (2006). Identifying and manipulating structural determinates linking catalytic specificities in terpene synthases.Proc. Natl. Acad. Sci. U.S.A.1039826–9831. 10.1073/pnas.0601605103
47
HansenF. T.MadsenC. K.NordlandA. M.GrasserM.MerkleT.GrasserK. D. (2008). A novel family of plant DNA-binding proteins containing both HMG-box and AT-rich interaction domains.Biochemistry4713207–13214. 10.1021/bi801772k
48
HanzawaY.MoneyT.BradleyD. (2005). A single amino acid converts a repressor to an activator of flowering.Proc. Natl. Acad. Sci. U.S.A.1027748–7753. 10.1073/pnas.0500932102
49
HillwigM. L.XuM.ToyomasuT.TiernanM. S.WeiG.CuiG.et al (2011). Domain loss has independently occurred multiple times in plant terpene synthase evolution.Plant J.681051–1060. 10.1111/j.1365-313X.2011.04756.x
50
HilscherJ.SchlûttererC.HauserM.-T. (2009). A single amino acid replacement in ETC2 shapes trichome patterning in natural Arabidopsis populations.Curr. Biol.191747–1751. 10.1016/j.cub.2009.08.057
51
HimiS.SanoR.NishiyamaT.TanahashiT.KatoM.UedaK.et al (2001). Evolution of MADS-box gene induction by FLO/LFY genes.J. Mol. Evol.53387–393. 10.1007/s002390010228
52
HoM.OuC.ChanY.-R.ChienC.-T.PiH. (2008). The utility F-box for protein destruction.Cell. Mol. Life Sci.651977–2000. 10.1007/s00018-008-7592-6
53
HoekstraH. E.CoyneJ. A. (2007). The locus of evolution: evo devo and the genetics of adaptation.Evolution61995–1016. 10.1111/j.1558-5646.2007.00105.x
54
HongS. Y.KimO. K.KimS. G.YangM. S.ParkC. M. (2011). Nuclear import and DNA binding of the ZHD5 transcription factor is modulated by a competitive peptide inhibitor in Arabidopsis.J. Biol. Chem.2861659–1668. 10.1074/jbc.M110.167692
55
HonmaT.GotoK. (2000). The Arabidopsis floral homeotic gene PISTILLATA is regulated by discrete cis-elements responsive to induction and maintenance signals.Development1272021–2030.
56
HonmaT.GotoK. (2001). Complexes of MADS-box proteins are sufficient to convert leaves into floral organs.Nature409525–529. 10.1038/35054083
57
HuangX.EffgenS.MeyerR. C.TheresK.KoornneefM. (2012). Epistatic natural allelic variation reveals a function of AGAMOUS-LIKE6 in axillary bud formation in Arabidopsis.Plant Cell242364–2379. 10.1105/tpc.112.099168
58
ImminkR. G.KaufmannK.AngenentG. C. (2010). The ‘ABC’ of MADS domain protein behaviour and interactions.Semin. Cell Dev. Biol.2187–93. 10.1016/j.semcdb.2009.10.004
59
JeonJ.JangS.LeeS.NamJ.KimC.LeeS.et al (2000). leafy hull sterile1 is a homeotic mutation in a rice MADS box gene affecting rice flower development.Plant Cell12871–884. 10.1105/tpc.12.6.871
60
JeongN.SuhS. J.KimM.-H.LeeS.MoonJ.-K.KimH. S.et al (2012). Ln is a key regulator of leaflet shape and number of seeds per pod in soybean.Plant Cell244807–4818. 10.1105/tpc.112.104968
61
JinJ.HuangW.GaoJ. P.YangJ.ShiM.ZhuM. Z.et al (2008). Genetic control of rice plant architecture under domestication.Nat. Genet.401365–1369. 10.1038/ng.247
62
KalteneggerE.EichE.OberD. (2013). Evolution of homospermidine synthase in the convolvulaceae: a story of gene duplication, gene loss, and periods of various selection pressures.Plant Cell251213–1227. 10.1105/tpc.113.109744
63
KennedyM. B. (1995). Origin of PDZ (DHR, GLGF) domains.Trends Biochem. Sci.20350. 10.1016/S0968-0004(00)89074-X
64
KerstingA. R.Bornberg-BauerE.MooreA. D.GrathS. (2012). Dynamics and adaptive benefits of protein domain emergence and arrangements during plant genome evolution.Genome Biol. Evol.4316–329. 10.1093/gbe/evs004
65
KimH. J.RyuH.HongS. H.WooH. R.LimP. O.LeeI. C.et al (2006). Cytokinin-mediated control of leaf longevity by AHK3 through phosphorylation of ARR2 in Arabidopsis.Proc. Natl. Acad. Sci. U.S.A.103814–819. 10.1073/pnas.0505150103
66
KimJ.HarterK.TheologisA. (1997). Protein–protein interactions among the Aux/IAA proteins.Proc. Natl. Acad. Sci. U.S.A.9411786–11791. 10.1073/pnas.94.22.11786
67
KimY.-S.KimS.-G.LeeM.LeeI.ParkH.-Y.SeoP. J.et al (2008). HD-ZIP III activity is modulated by competitive inhibitors via a feedback loop in Arabidopsis shoot apical meristem development.Plant Cell20920–933. 10.1105/tpc.107.057448
68
KlingenbergC. P. (1998). Heterochrony and allometry: the analysis of evolutionary change in ontogeny.Biol. Rev.7379–123. 10.1017/S000632319800512X
69
KobayashiY.KurodaK.KimuraK.Southron-FrancisJ. L.FuruzawaA.KimuraK.et al (2008). Amino acid polymorphisms in strictly conserved domains of a P-type ATPase HMA5 are involved in the mechanism of copper tolerance variation in Arabidopsis.Plant Physiol.148969–980. 10.1104/pp.108.119933
70
KramerE. M.DoritR. L.IrishV. F. (1999). Molecular evolution of genes controlling petal and stamen development: duplication and divergence within the APETALA3 and PISTILLATA MADS-box gene lineages.Genetics151915–915.
71
KramerE. M.HolappaL.GouldB.JaramilloM. A.SetnikovD.SantiagoP. M. (2007). Elaboration of B gene function to include the identity of novel floral organs in the lower eudicot Aquilegia.Plant Cell19750–766. 10.1105/tpc.107.050385
72
KramerE. M.SuH.-J.WuC.-C.HuJ.-M. (2006). A simplified explanation for the frameshift mutation that created a novel C-terminal motif in the APETALA3 gene lineage.BMC Evol. Biol.6:30. 10.1186/1471-2148-6-30
73
KuittinenH.NiittyvuopioA.RinneP.SavolainenO. (2008). Natural variation in Arabidopsis lyrata vernalization requirement conferred by a FRIGIDA indel polymorphism.Mol. Biol. Evol.25319–329. 10.1093/molbev/msm257
74
LambR. S.IrishV. F. (2003). Functional divergence within the APETALA3/PISTILLATA floral homeotic gene lineages.Proc. Natl. Acad. Sci. U.S.A.1006558–6563. 10.1073/pnas.0631708100
75
LangeA.MillsR. E.LangeC. J.StewartM.DevineS. E.CorbettA. H. (2007). Classical nuclear localization signals: definition, function, and interaction with importin α.J. Biol. Chem.2825101–5105. 10.1074/jbc.R600026200
76
LangeM.OrashakovaS.LangeS.MelzerR.TheissenG.SmythD. R.et al (2013). The seirena B class floral homeotic mutant of California Poppy (Eschscholzia californica) reveals a function of the enigmatic PI motif in the formation of specific multimeric MADS domain protein complexes.Plant Cell25438–453. 10.1105/tpc.112.105809
77
LenserT.TheissenG.DittrichP. (2009). Developmental robustness by obligate interaction of class B floral homeotic genes and proteins.PLoS Comput. Biol.5:e1000264. 10.1371/journal.pcbi.1000264
78
LiC.ZhouA.SangT. (2006). Rice domestication by reducing shattering.Science3111936–1939. 10.1126/science.1123604
79
LiG.-S.MengZ.KongH.-Z.ChenZ.-D.TheissenG.LuA.-M. (2005). Characterization of candidate class A, B and E floral homeotic genes from the perianthless basal angiosperm Chloranthus spicatus (Chloranthaceae).Dev. Genes Evol.215437–449. 10.1007/s00427-005-0002-2
80
LiP.WindJ. J.ShiX.ZhangH.HansonJ.SmeekensS. C.et al (2011). Fructose sensitivity is suppressed in Arabidopsis by the transcription factor ANAC089 lacking the membrane-bound domain.Proc. Natl. Acad. Sci. U.S.A.1083436–3441. 10.1073/pnas.1018665108
81
LittA.IrishV. F. (2003). Duplication and diversification in the APETALA1/FRUITFULL floral homeotic gene lineage: implications for the evolution of floral development.Genetics165821–833.
82
LoudetO.Saliba-ColombaniV.CamilleriC.CalengeF.GaudonV.KoprivovaA.et al (2007). Natural variation for sulfate content in Arabidopsis thaliana is highly controlled by APR2.Nat. Genet.39896–900. 10.1038/ng2050
83
LusserA.KölleD.LoidlP. (2001). Histone acetylation: lessons from the plant kingdom.Trends Plant Sci.659. 10.1016/S1360-1385(00)01839-2
84
LynchM.O’helyM.WalshB.ForceA. (2001). The probability of preservation of a newly arisen gene duplicate.Genetics1591789–1804.
85
LynchV. J.WagnerG. P. (2008). Resurrecting the role of transcription factor change in developmental evolution.Evolution622131–2154. 10.1111/j.1558-5646.2008.00440.x
86
MagnaniE.BartonM. K. (2011). A per-ARNT-sim-like sensor domain uniquely regulates the activity of the homeodomain leucine zipper transcription factor REVOLUTA in Arabidopsis.Plant Cell23567–582. 10.1105/tpc.110.080754
87
MagnaniE.HakeS. (2008). KNOX lost the OX: the Arabidopsis KNATM gene defines a novel class of KNOX transcriptional regulators missing the homeodomain.Plant Cell20875–887. 10.1105/tpc.108.058495
88
MagnaniE.SjölanderK.HakeS. (2004). From endonucleases to transcription factors: evolution of the AP2 DNA binding domain in plants.Plant Cell162265–2277. 10.1105/tpc.104.023135
89
MaizelA.BuschM. A.TanahashiT.PerkovicJ.KatoM.HasebeM.et al (2005). The floral regulator LEAFY evolves by substitutions in the DNA binding domain.Science308260–263. 10.1126/science.1108229
90
MaloofJ. N.BorevitzJ. O.DabiT.LutesJ.NehringR. B.RedfernJ. L.et al (2001). Natural variation in light sensitivity of Arabidopsis.Nat. Genet.29441–446. 10.1038/ng777
91
MartinW.RujanT.RichlyE.HansenA.CornelsenS.LinsT.et al (2002). Evolutionary analysis of Arabidopsis, cyanobacterial, and chloroplast genomes reveals plastid phylogeny and thousands of cyanobacterial genes in the nucleus.Proc. Natl. Acad. Sci. U.S.A.9912246–12251. 10.1073/pnas.182432999
92
MartinW.StoebeB.GoremykinV.HansmannS.HasegawaM.KowallikK. V. (1998). Gene transfer to the nucleus and the evolution of chloroplasts.Nature393162–165. 10.1038/30234
93
MauricioR. (1998). Costs of resistance to natural enemies in field populations of the annual plant Arabidopsis thaliana.Am. Nat.15120–28. 10.1086/286099
94
MoyroudE.KustersE.MonniauxM.KoesR.ParcyF. (2010). LEAFY blossoms.Trends Plant Sci.15346–352. 10.1016/j.tplants.2010.03.007
95
MukherjeeK.BuerglinT. R. (2006). MEKHLA, a novel domain with similarity to PAS domains, is fused to plant homeodomain-leucine zipper III proteins.Plant Physiol.1401142–1150. 10.1104/pp.105.073833
96
MullerH. J. (1932). “Further studies on the nature and causes of gene mutations,” inProceedings of the 6th International Congress of GeneticsIthacaNew York213–255.
97
O’BrienP. J.HerschlagD. (1999). Catalytic promiscuity and the evolution of new enzymatic activities.Chem. Biol.6R91–R105. 10.1016/S1074-5521(99)80033-7
98
OhnoS. (1970). Evolution by Gene Duplication.Berlin: Springer-Verlag.
99
OhtaM.MatsuiK.HiratsuK.ShinshiH.Ohme-TakagiM. (2001). Repression domains of class II ERF transcriptional repressors share an essential motif for active repression.Plant Cell131959–1968.
100
OkamuroJ. K.CasterB.VillarroelR.Van MontaguM.JofukuK. D. (1997). The AP2 domain of APETALA2 defines a large new family of DNA binding proteins in Arabidopsis.Proc. Natl. Acad. Sci. U.S.A.947076–7081. 10.1073/pnas.94.13.7076
101
O’MailleP. E.MaloneA.DellasN.Andes HessB.Jr.SmentekL.SheehanI.et al (2008). Quantitative exploration of the catalytic landscape separating divergent plant sesquiterpene synthases.Nat. Chem. Biol.4617–623. 10.1038/nchembio.113
102
Pabón-MoraN.AmbroseB. A.LittA. (2012). Poppy APETALA1/FRUITFULL orthologs control flowering time, branching, perianth identity, and fruit development.Plant Physiol.1581685–1704. 10.1104/pp.111.192104
103
PelazS.DittaG.BaumannE.WismanE.YanofskyM. (2000). B and C floral organ identity functions require SEPALLATA MADS-box genes.Nature405200–203. 10.1038/35012103
104
PinP. A.BenllochR.BonnetD.Wremerth-WeichE.KraftT.GielenJ. J.et al (2010). An antagonistic pair of FT homologs mediates the control of flowering time in sugar beet.Science3301397–1400. 10.1126/science.1197004
105
PoduskaB.HumphreyT.RedweikAGrbićV. (2003). The synergistic activation of FLOWERING LOCUS C by FRIGIDA and a new flowering gene AERIAL ROSETTE 1 underlies a novel morphology in Arabidopsis.Genetics1631457–1465.
106
PowlesS. B.YuQ. (2010). Evolution in action: plants resistant to herbicides.Annu. Rev. Plant Biol.61317–347. 10.1146/annurev-arplant-042809-112119
107
PrasadK. V. S. K.SongB.-H.Olson-ManningC.AndersonJ. T.LeeC.-R.SchranzM. E.et al (2012). A gain-of-function polymorphism controlling complex traits and fitness in nature.Science3371081–1084. 10.1126/science.1221636
108
PrestonJ. C.WangH.KurselL.DoebleyJ.KelloggE. A. (2012). The role of teosinte glume architecture (tga1) in coordinated regulation and evolution of grass glumes and inflorescence axes.New Phytol.193204–215. 10.1111/j.1469-8137.2011.03908.x
109
PriggeM. J.OtsugaD.AlonsoJ. M.EckerJ. R.DrewsG. N.ClarkS. E. (2005). Class III homeodomain-leucine zipper gene family members have overlapping, antagonistic, and distinct roles in Arabidopsis development.Plant Cell1761–76. 10.1105/tpc.104.026161
110
RaimundoJ.SobralR.BaileyP.AzevedoH.GalegoL.AlmeidaJ.et al (2013). A subcellular tug of war involving three MYB-like proteins underlies a molecular antagonism in Antirrhinum flower asymmetry.Plant J.75527–538. 10.1111/tpj.12225
111
ReimannA.NurhayatiN.BackenköhlerA.OberD. (2004). Repeated evolution of the pyrrolizidine alkaloid-mediated defense system in separate angiosperm lineages.Plant Cell162772–2784. 10.1105/tpc.104.023176
112
RenZ.-H.GaoJ.-P.LiL.-G.CaiX.-L.HuangW.ChaoD.-Y.et al (2005). A rice quantitative trait locus for salt tolerance encodes a sodium transporter.Nat. Genet.371141–1146. 10.1038/ng1643
113
RichardsonA. O.PalmerJ. D. (2007). Horizontal gene transfer in plants.J. Exp. Bot.581–9. 10.1093/jxb/erl148
114
RiechmannJ. L.KrizekB. A.MeyerowitzE. M. (1996a). Dimerization specificity of Arabidopsis MADS domain homeotic proteins APETALA1, APETALA3, PISTILLATA, and AGAMOUS.Proc. Natl. Acad. Sci. U.S.A.934793–4798. 10.1073/pnas.93.10.4793
115
RiechmannJ. L.WangM.MeyerowitzE. M. (1996b). DNA-binding properties of Arabidopsis MADS domain homeotic proteins APETALA1, APETALA3, PISTILLATA and AGAMOUS.Nucleic Acids Res.243134–3141. 10.1093/nar/24.16.3134
116
RiechmannJ. L.MeyerowitzE. M. (1998). The AP2/EREBP family of plant transcription factors.Biol. Chem.379633–646.
117
RockmanM. V. (2012). The QTN program and the alleles that matter for evolution: all that’s gold does not glitter.Evolution661–17. 10.1111/j.1558-5646.2011.01486.x
118
SeoP. J.HongS.-Y.KimS.-G.ParkC.-M. (2011a). Competitive inhibition of transcription factors by small interfering peptides.Trends Plant Sci.16541–549. 10.1016/j.tplants.2011.06.001
119
SeoP. J.KimM. J.RyuJ. Y.JeongE. Y.ParkC. M. (2011b). Two splice variants of the IDD14 transcription factor competitively form nonfunctional heterodimers which may regulate starch metabolism.Nat. Commun.2303. 10.1038/ncomms1303
120
SeoP. J.KimS.-G.ParkC.-M. (2008). Membrane-bound transcription factors in plants.Trends Plant Sci.13550–556. 10.1016/j.tplants.2008.06.008
121
SimonsK. J.FellersJ. P.TrickH. N.ZhangZ.TaiY.-S.GillB. S.et al (2006). Molecular characterization of the major wheat domestication gene Q.Genetics172547–555. 10.1534/genetics.105.044727
122
SmithS. D.RausherM. D. (2011). Gene loss and parallel evolution contribute to species difference in flower color.Mol. Biol. Evol.282799–2810. 10.1093/molbev/msr109
123
SmithS. D.WangS.RausherM. D. (2013). Functional evolution of an anthocyanin pathway enzyme during a flower color transition.Mol. Biol. Evol.30602–612. 10.1093/molbev/mss255
124
SoppeW. J.JacobsenS. E.Alonso-BlancoC.JacksonJ. P.KakutaniT.KoornneefM.et al (2000). The late flowering phenotype of fwa mutants is caused by gain-of-function epigenetic alleles of a homeodomain gene.Mol. Cell.6791–802. 10.1016/S1097-2765(05)00090-0
125
StaudtA.-C.WenkelS. (2010). Regulation of protein function by ‘microProteins’.EMBO Rep.1235–42. 10.1038/embor.2010.196
126
StegemannS.BockR. (2009). Exchange of genetic material between cells in plant tissue grafts.Science324649–651. 10.1126/science.1170397
127
StegemannS.HartmannS.RufS.BockR. (2003). High-frequency gene transfer from the chloroplast genome to the nucleus.Proc. Natl. Acad. Sci. U.S.A.1008828–8833. 10.1073/pnas.1430924100
128
StegemannS.KeutheM.GreinerS.BockR. (2012). Horizontal transfer of chloroplast genomes between plant species.Proc. Natl. Acad. Sci. U.S.A.1092434–2438. 10.1073/pnas.1114076109
129
SternD. L. (2000). Perspective: evolutionary developmental biology and the problem of variation.Evolution541079–1091.
130
StoddardB. L. (2011). Homing endonucleases: from microbial genetic invaders to reagents for targeted DNA modification.Structure197–15. 10.1016/j.str.2010.12.003
131
SunT.-P. (2011). The molecular mechanism and evolution of the GA–GID1–DELLA signaling module in plants.Curr. Biol.21R338–R345. 10.1016/j.cub.2011.02.036
132
SymondsV. V.HatlestadG.LloydA. M. (2011). Natural allelic variation defines a role for ATMYC1: trichome cell fate determination.PLoS Genet.7:e1002069. 10.1371/journal.pgen.1002069
133
TanahashiT.SumikawaN.KatoM.HasebeM. (2005). Diversification of gene function: homologs of the floral regulator FLO/LFY control the first zygotic cell division in the moss Physcomitrella patens.Development1321727–1736. 10.1242/dev.01709
134
TaokaK.-I.OhkiI.TsujiH.KojimaC.ShimamotoK. (2013). Structure and function of florigen and the receptor complex.Trends Plant Sci.18287–294. 10.1016/j.tplants.2013.02.002
135
TodescoM.BalasubramanianS.HuT. T.TrawM. B.HortonM.EppleP.et al (2010). Natural allelic variation underlying a major fitness trade-off in Arabidopsis thaliana.Nature465632–636. 10.1038/nature09083
136
TurnerA.BealesJ.FaureS.DunfordR. P.LaurieD. A. (2005). The pseudo-response regulator Ppd-H1 provides adaptation to photoperiod in barley.Science3101031–1034. 10.1126/science.1117619
137
UlmasovT.MurfettJ.HagenG.GuilfoyleT. J. (1997). Aux/IAA proteins repress expression of reporter genes containing natural and highly active synthetic auxin response elements.Plant Cell91963–1971. 10.1105/tpc.9.11.1963
138
VandenbusscheM.TheissenG.Van De PeerY.GeratsT. (2003). Structural diversification and neo-functionalization during floral MADS-box gene evolution by C-terminal frameshift mutations.Nucleic Acids Res.314401–4409. 10.1093/nar/gkg642
139
VandenbusscheM.ZethofJ.RoyaertS.WeteringsK.GeratsT. (2004). The duplicated B-class heterodimer model: whorl-specific effects and complex genetic interactions in Petunia hybrida flower development.Plant Cell16741–754. 10.1105/tpc.019166
140
VernouxT.BrunoudG.FarcotE.MorinV.Van Den DaeleH.LegrandJ.et al (2011). The auxin signalling network translates dynamic input into robust patterning at the shoot apex.Mol. Syst. Biol.7508. 10.1038/msb.2011.39
141
VierstraR. D. (2003). The ubiquitin/26S proteasome pathway, the complex last chapter in the life of many plant proteins.Trends Plant Sci.8135–142. 10.1016/S1360-1385(03)00014-1
142
WagnerA. (2011). The molecular origins of evolutionary innovations.Trends Genet.27397–410. 10.1016/j.tig.2011.06.002
143
WangH.Nussbaum-WaglerT.LiB.ZhaoQ.VigourouxY.FallerM.et al (2005). The origin of the naked grains of maize.Nature436714–719. 10.1038/nature03863
144
WangQ.SajjaU.RosloskiS.HumphreyT.KimM. C.BombliesK.et al (2007). HUA2 caused natural variation in shoot morphology of A.thaliana. Curr. Biol.171513–1519. 10.1016/j.cub.2007.07.059
145
WangY.-Q.MelzerR.TheissenG. (2010). Molecular interactions of orthologues of floral homeotic proteins from the gymnosperm Gnetum gnemon provide a clue to the evolutionary origin of ‘floral quartets’.Plant J.64177–190. 10.1111/j.1365-313X.2010.04325.x
146
WengJ.-K.LiY.MoH.ChappleC. (2012). Assembly of an evolutionarily new pathway for α-pyrone biosynthesis in Arabidopsis.Science337960–964. 10.1126/science.1221614
147
WenkelS.EmeryJ.HouB.-H.EvansM. M.BartonM. (2007). A feedback regulatory module formed by LITTLE ZIPPER and HD-ZIPIII genes.Plant Cell193379–3390. 10.1105/tpc.107.055772
148
WessingerC. A.RausherM. D. (2012). Lessons from flower colour evolution on targets of selection.J. Exp. Bot.635741–5749. 10.1093/jxb/ers267
149
WhippleC. J.CiceriP.PadillaC. M.AmbroseB. A.BandongS. L.SchmidtR. J. (2004). Conservation of B-class floral homeotic gene function between maize and Arabidopsis.Development1316083–6091. 10.1242/dev.01523
150
WhippleC. J.SchmidtR. J. (2006). “Genetics of Grass Flower Development,” inAdvances in Botanical ResearchedsSoltisD. E.Leebens-MackJ. H.SoltisP. S.CallowJ. A. (Waltham: Academic Press) 385–424.
151
WinterK.-U.WeiserC.KaufmannK.BohneA.KirchnerC.KannoA.et al (2002). Evolution of class B floral homeotic proteins: obligate heterodimerization originated from homodimerization.Mol. Biol. Evol.19587–596. 10.1093/oxfordjournals.molbev.a004118
152
XiZ.WangY.BradleyR. K.SugumaranM.MarxC. J.RestJ. S.et al (2013). Massive mitochondrial gene transfer in a parasitic flowering plant clade.PLoS Genet.9:e1003265. 10.1371/journal.pgen.1003265
153
XingS.LiM.LiuP. (2013). Evolution of S-domain receptor-like kinases in land plants and origination of S-locus receptor kinases in Brassicaceae.BMC Evol. Biol.13:69. 10.1186/1471-2148-13-69
154
YamamotoE.ZengL.BairdW. V. (1998). α-Tubulin missense mutations correlate with antimicrotubule drug resistance in Eleusine indica.Plant Cell10297–308.
155
YasumuraY.Crumpton-TaylorM.FuentesS.HarberdN. P. (2007). Step-by-step acquisition of the gibberellin-DELLA growth-regulatory mechanism during land-plant evolution.Curr. Biol.171225–1230. 10.1016/j.cub.2007.06.037
156
YouM. G.LiuY. B.ZhaoT. J.GaiJ. Y. (1995). Effects of leaf shape on seed yield and its components in soybeans.Soyb. Genet. Newsl.2266–70.
157
ZhangY.WangL. (2005). The WRKY transcription factor superfamily: its origin in eukaryotes and expansion in plants.BMC Evol. Biol.5:1. 10.1186/1471-2148-5-1
158
ZhaoN.FerrerJ. L.RossJ.GuanJ.YangY.PicherskyE.et al (2008). Structural, biochemical, and phylogenetic analyses suggest that indole-3-acetic acid methyltransferase is an evolutionarily ancient member of the SABATH family.Plant Physiol.146455–467. 10.1104/pp.107.110049
159
ZhuY.EllstrandN. C.LuB.-R. (2012). Sequence polymorphisms in wild, weedy, and cultivated rice suggest seed-shattering locus sh4 played a minor role in Asian rice domestication.Ecol. Evol.22106–2113. 10.1002/ece3.318
Summary
Keywords
molecular evolution, structural mutations, protein evolution, coding vs. non-coding changes, plant evo-devo, genotype to phenotype map
Citation
Bartlett ME and Whipple CJ (2013) Protein change in plant evolution: tracing one thread connecting molecular and phenotypic diversity. Front. Plant Sci. 4:382. doi: 10.3389/fpls.2013.00382
Received
01 June 2013
Accepted
06 September 2013
Published
10 October 2013
Volume
4 - 2013
Edited by
Jill Christine Preston, University of Vermont, USA
Reviewed by
Stephan Wenkel, University of Tuebingen, Germany; Elena M. Kramer, Harvard University, USA; Alma Pineyro-Nelson, University of California Berkeley, USA
Copyright
© Bartlett and Whipple.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Clinton J. Whipple, Biology Department, Brigham Young University, 401 WIDB, Provo, UT 84602, USA e-mail: whipple@byu.edu
This article was submitted to Plant Evolution and Development, a section of the journal Frontiers in Plant Science.
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.