Abstract
Gene duplication and divergence is a major contributor to the generation of morphological diversity and the emergence of novel features in vertebrates during evolution. The availability of sequenced genomes has facilitated our understanding of the evolution of genes and regulatory elements. However, progress in understanding conservation and divergence in the function of proteins has been slow and mainly assessed by comparing protein sequences in combination with in vitro analyses. These approaches help to classify proteins into different families and sub-families, such as distinct types of transcription factors, but how protein function varies within a gene family is less well understood. Some studies have explored the functional evolution of closely related proteins and important insights have begun to emerge. In this review, we will provide a general overview of gene duplication and functional divergence and then focus on the functional evolution of HOX proteins to illustrate evolutionary changes underlying diversification and their role in animal evolution.
Introduction
Evolution has brought an incredible range of morphological and physiological novelties to diverse animals. Centuries of classical research has served to catalog diverse novelties in 1.2 million species and sub-divide them into ∼36 phyla and 107 classes, 500 orders, 5500 families and 110000 genera (Mora et al., 2011). These efforts have uncovered the emergence of novelties during the progressive evolution of animals, but we know relatively little about the genetic and genomic changes and mechanisms that underlie this diversity. Technological advances which enabled the systematic sequencing of animal genomes has reenergized this field of research and provided an opportunity for comparative genomics of the diverse animals to probe the underlying genetic causes of their morphological and physiological differences (Rogers and Gibbs, 2014). These genome-wide analyses have highlighted common origins and similar physiological functions but have found it challenging to uncover the genetic changes and mechanisms that underlie animal diversity. Comparative genomic analyses reveal a very similar number of genes in diverse animals (Figure 1), indicating that the total gene number does not reflect diversity (Hahn and Wray, 2002; Copley, 2008). Furthermore, many of the same genes and gene families are present in a broad range of animal species, suggesting there is a shared or common “gene toolkit.”
FIGURE 1
The discovery of a very similar number of genes and a common “gene toolkit” in animal genomes appeared to refute the hypothesis that gene duplication and diversification is a major contributor to animal diversity. This led to a shift in the focus of research from analyses of coding regions to identifying and characterizing diversification of cis-regulatory (non-coding) regions and gene regulatory networks embedded in the genome (Biemont and Vieira, 2006). A broad array of technological advances have also revolutionized this field and enhanced our ability to identify and functionally validate the cis-regulatory code embedded in the genome by the integration of comparative genomics, transgenic analyses, CRISPR/Cas9 genome modifications and genome-wide approaches (i.e., ChIP-seq, ATAC-seq, massive parallel reporter assays and single cell transcriptomics) (Zhen and Andolfatto, 2012; Paul et al., 2014; He et al., 2015; Gasperskaja and Kucinskas, 2017; Avsec et al., 2021). Application of these approaches, has revealed that diversity in non-coding cis-regulatory regions of the genome has played a major role in the emergence of animal diversity (Carroll, 2008; Rubinstein and de Souza, 2013; Reilly and Noonan, 2016; Franchini and Pollard, 2017; Xie et al., 2019; Roberts Kingman et al., 2021a; Roberts Kingman et al., 2021b).
Evaluating how changes in protein sequence impact in vivo function following gene duplications in animal evolution has been challenging to investigate in the absence of technologies for precise manipulation of endogenous genes and quantitative and qualitative functional assays to evaluate activity in an in vivo context. The development of CRISPER/Cas9 gene editing technology for precise manipulation of endogenous genes in the genome of diverse animals has opened the door for more direct cross-species comparisons of homologous protein functions (Doudna and Charpentier, 2014). This approach has the advantage of expressing the proteins being compared at the same physiological levels in their normal spatial, temporal and tissue-specific contexts under control of regulatory components of the endogenous loci. In the past, functional studies have primarily relied on comparing the degree of conservation in amino acid sequences, in silico structure predictions, in vitro assays for activity and ectopic over-expression assays in vivo. The in vitro assays of protein activity, such as ligand binding, enzymatic activity, transcription factor binding properties, can be limited by ex-vivo conditions, which often lack key co-factors or components important in vivo. Hence, they may provide a limited perspective on a subset of functional activities relevant to their in vivo roles. To overcome some of the limitations of in vitro assays, transgenic approaches have been used to ectopically express genes in the animals and compare the in vivo properties of candidate proteins (Mcginnis et al., 1990; Quiring et al., 1994; Hanks et al., 1998). However, this approach often involved broad over-expression of proteins at high levels and ectopic sites, making it difficult to compare activities in normal physiological and developmental contexts. CRISPR/Cas9 technologies now offer possibilities to manipulate the genome to precisely compare function of homologous genes in vivo. As a result, cross-species analyses of gene function are beginning to uncover unexpected changes and mechanisms that contribute to conservation and divergence of protein functions contributing to animal diversity (Enard et al., 2009; Truong and Boeke, 2017; Laurent et al., 2020; Singh et al., 2020). In addition, recent advances in cryo-EM and computational approaches for predicting protein structures are rapidly changing our ability to analyze and compare the properties of proteins (Assaiya et al., 2021; Jumper et al., 2021).
Gene Duplication and Divergence
Sequence analyses have revealed a high level of conservation of many domains in proteins with very diverse functions across the animal phylum (Laity et al., 2001; Ponting and Russell, 2002; Noyes et al., 2008). This implies that during evolution, the appearance of novel functions is not associated with widespread de novo evolution of new genes and that novel functional activities most likely arose by diversification of existing genes (Holland et al., 1994; Hughes, 1994; Friedman and Hughes, 2001; Blomme et al., 2006). Altering the function of an essential gene could be detrimental to the survival or fitness of a species, but gene duplication events provide a mechanism to circumvent this limitation. Generating multiple copies of a gene provides a range of opportunities to maintain essential functions, releasing selective pressure on a single essential gene, while also producing new substrates that can diversify and evolve novel functions. Analyses of gene sets across the animal kingdom revealed that vertebrate genomes have multiple copies of many invertebrate genes, including those that regulate development, differentiation and physiological processes, such as transcription factors, cell signaling pathways, odorant receptor genes etc. (Paps and Holland, 2018; Richter et al., 2018; Fernandez and Gabaldon, 2020). Large genome duplication events followed by gene losses are considered as a critical step in the emergence and evolution of vertebrates. Susumu Ohno suggested that two rounds of whole-genome duplications (2R-WGDs) could be a major source of gene amplification and functional diversification in vertebrate lineage (Ohno, 1970; Panopoulou et al., 2003; Vandepoele et al., 2004; Dehal and Boore, 2005). While this hypothesis was used to explain a major cause of gene duplications, comparing all homologous gene families between Drosophila and humans showed that less than 5% of these families display a predicted 1:4 gene ratio (Friedman and Hughes, 2001). This was not consistent with Ohno’s hypothesis and lead to an alternative idea, suggesting that selective regional duplications (segmental duplications) created a mixed repertoire of duplicated genes in the genome.
Despite these conflicting models, which remain challenging to resolve, there is evidence for whole genome duplications in many plants, yeast, and vertebrate species, such as teleost fish (3R), salmonid fish (4R) and Xenopus laevis (Wendel, 2000; De Bodt et al., 2005; Scannell et al., 2006; Pascual-Anaya et al., 2013; Session et al., 2016) (Figure 2). Advocates of the 2R hypothesis have further refined the model to suggest that during vertebrate evolution frequent gene loss after duplication, as a consequence of redundancy, contributed to the observed digression from the expected 1:4 gene ratio. Analyses of many known genome duplications indicate that gene loss is the most common fate of duplicated genes (Nadeau and Sankoff, 1997; Albalat and Canestro, 2016). Loss of many duplicated genes and nonessential genes could also explain how a similar number of total genes are present in higher animal genomes despite differences in the whole genome or segmental duplication events. Despite the controversy on the underlying mechanisms for widespread amplification of gene families (Braasch et al., 2018; Sandve et al., 2018), the importance of gene duplication in creating a major substrate for functional divergence and emergence of novel functions is widely accepted in the field (Lynch and Conery, 2000).
FIGURE 2
Fate of the Duplicated Genes
The contribution of gene duplication and divergence to the emergence of novel protein functions in animal evolution has gained more traction as gene functions have been characterized in different animals across the phylogenetic spectrum (Blomme et al., 2006). Comparing classes of genes based on their functions revealed that genes involved in animal development and cell signaling pathways are highly amplified during the evolution, while many other classes of genes have been lost. Based on these analyses several hypotheses came to explain the forces that shaped the future of a duplicated genes. Loss of function is the most common fate of the duplicated genes. Studies have shown that only half of duplicated genes are retained while others lose functional activities by processes that include deletion, rearrangement, point mutations, and pseudogene formation (Nadeau and Sankoff, 1997; Blomme et al., 2006; Scannell et al., 2006; Albalat and Canestro, 2016; Guijarro-Clarke et al., 2020a; b). Even in the absence of duplicated genes, a large number of genes (90% in bacteria, 80% in yeast, 65% in C. elegans and 85% in Drosophila) are dispensable for animal survival, which provides a large set of substrates for evolutionary change. Gene duplication events further expand the repertoire of substrates for change and generate opportunities for functional redundancy that allows for non-deleterious functional diversification of genes.
Essential genes and their functional roles need to be retained to ensure animal survival and fitness. Hence, at least one member of a duplicated gene family must retain the key ancestral functions during evolution. Other members are free to accumulate mutations that potentiates diversification and the emergence of novel activities, which is called neofunctionalization (Ohno, 1970; Clark, 1994; Holland et al., 1994; Lundin, 1999; Friedman and Hughes, 2001; Mazet and Shimeld, 2002; Sandve et al., 2018). A variation of this idea is that the ancestral functions of essential genes maybe collectively retained by partitioning sub-sets of the functional roles between different duplicated family members, which is termed subfunctionalization (Ohno, 1970; Force et al., 1999; Lynch and Force, 2000; Sandve et al., 2018). These are not mutually exclusive processes. A study by Le and Zhang demonstrated that neofunctionalization or subfunctionalization alone do not adequately explain the diversification of protein function. They proposed that many duplicated genes may go through a combination of subfunctionalization and neofunctionalization to produce duplicated genes that possess new and retain some ancestral roles (He and Zhang, 2005; Marcussen et al., 2010). Various theoretical and functional studies have explored these ideas and confirmed that the retention of duplicated genes appears to be mediated by a varying combination of these processes (Force et al., 1999; Lynch and Force, 2000; Vandenbussche et al., 2003; Walsh, 2003; Burki and Kaessmann, 2004; Escriva et al., 2006; Perry et al., 2007; Kleinjan et al., 2008; Innan, 2009; Truong and Boeke, 2017; Zimmer et al., 2018; Singh et al., 2020).
Hox Genes as a Paradigm for Duplication and Divergence of Function
Hox genes, encode a broadly conserved family of transcription factors in animals, and represent an interesting paradigm for examining the duplication and divergence of gene functions. The HOX proteins are involved in patterning and specification of the anterior-posterior (AP) axis of all bilaterian animals (Mcginnis and Krumlauf, 1992; Krumlauf, 1994; Carroll, 1995; Pearson et al., 2005). The temporal and spatial order of Hox gene expression and function across the embryo is “colinear” and correlated with their organization along the chromosome (Lewis, 1978; Duboule and Dolle, 1989; Graham et al., 1989; Duboule, 1998; Kmita and Duboule, 2003). These genes are typically found to be tightly clustered in the genome except for some animals where evolution has led to the disintegration of the ancestral complex (Kaufman et al., 1980; Akam et al., 1994; Seo et al., 2004; Sekigami et al., 2017). Each gene in a cluster specifies distinct cellular identities along AP axis during very early embryonic development, which ultimately patterns tissues and structures in adult animals. Evidence of Hox genes in animal genomes is traced back to Cnidarians, however, their role in patterning the AP axis is observed only in bilaterians, as they have roles in patterning radial segmentation in cnidarians (Pascual-Anaya et al., 2013; Arendt, 2018; He et al., 2018). Mutations that affect the expression and function of Hox genes in bilaterians lead to homeotic transformation of one part of the body into another (Lewis, 1994). Furthermore, diversification of Hox gene number and function correlates with increased diversity in the evolution of animals (Wagner et al., 2003; Lemons and Mcginnis, 2006). There are fewer Hox genes in lower invertebrates as compared to higher invertebrates, chordates, and vertebrates, as illustrated by the 5 Hox genes in nematodes (C. elegans), 8 in arthropods (Drosophila), 14 in chordates, and 39 in mammals (Human) (Ikuta, 2011; Pascual-Anaya et al., 2013; Irie et al., 2018) (Figure 3A).
FIGURE 3
Despite their diverse and distinct functional activities, sequence analysis indicates that Hox gene cluster was generated by tandem duplication of a single gene. Analysis of Hox gene clusters in invertebrate genomes suggests that they expanded from a common ancestral gene through tandem regional duplication events to form a maximum of fifteen alloparalogs in cephalochordates (ex. Branchiostoma floridae) (Garcia-Fernandez and Holland, 1994; Koonin, 2005). There are four Hox clusters in mammalian genomes with a maximum of 14 genes in each cluster suggesting two rounds of whole complex duplication (2R) from a common invertebrate ancestor with 14 genes (Maconochie et al., 1996; Hoegg and Meyer, 2005; Lemons and Mcginnis, 2006; Duboule, 2007; Kuraku and Meyer, 2009; Pascual-Anaya et al., 2013; Holland and Ocampo Daza, 2018; Smith et al., 2018). However, there is a maximum of 13 Hox genes in each cluster of tetrapod genomes, such as mouse and human, indicating a loss of 14th paralog during vertebrate diversification (Figure 3A). There is also evidence for two additional lineage specific whole-genome duplications events in vertebrates, one in teleosts (3R) (Pascual-Anaya et al., 2013) and an additional round (4R) in salmonid lineages to further amplify HOX genes (Figure 2) (Soshnikova et al., 2013; Vieux-Rochas et al., 2013). These data suggest that genome duplication events have dramatically increased the number of Hox genes in vertebrate genomes and provide opportunities for evolution of novel functions. Further, functional compensation or redundancy after gene duplication events allowed for loss and diversification of these genes which have a critical role in patterning the AP axis and the properties of tissues in a manner that has been remarkably conserved across the bilateral animals (Wagner et al., 2003).
Conservation and Diversification of Hox Gene Function
The correlation between the expansion of Hox genes, a master regulator of development, and greater complexity of vertebrates highlights the importance of studying the functional evolution of HOX proteins. There are reports to suggest both conservation and diversification of Hox gene function through evolution (Lawrence and Morata, 1994; Saurin et al., 2018). Early work in the field has demonstrated that HOX proteins from different alloparalogs groups defines distinct morphological features along the AP body axis implying that each regulates a distinct set of gene regulatory networks and developmental pathways (Garcia-Bellido et al., 1973; Mcginnis and Krumlauf, 1992; Lawrence and Morata, 1994; Averof, 2002; Maeda and Karch, 2006). However, HOX proteins appear to share many similar biochemical properties. They display nearly identical DNA binding specificities in vitro, as a consequence of the presence of a highly conserved 60 amino acid long homeodomain (HD) (Scott et al., 1989; Gehring et al., 1994a; Piper et al., 1999). HOX proteins have the ability to interact with a PBC class (e.g., PBX, MEIS, PREP) of homeodomain transcription factors, which serve as cofactors in binding DNA and modulating gene expression (Desplan et al., 1988; Gehring et al., 1994a; Knoepfler and Kamps, 1995; Mann and Chan, 1996; Noyes et al., 2008). The interaction of HOX proteins with PBC factors is primarily mediated by a conserved six amino acid domain, referred to as the hexapeptide (HP). HOX-PBX cofactor interactions modify the affinity and specificity of HOX DNA binding on target sites in the genome (Slattery et al., 2011; Merabet and Mann, 2016). These generic DNA binding properties of HOX proteins and interactions with shared cofactors, such as PBX, make it difficult to explain the paralog-specific functions of HOX proteins. Hence, the distinct differences in the functional roles and genome-wide binding preferences of HOX proteins in vivo is likely to be a consequence of additional unidentified features and interactions of HOX proteins that impact and modulate their context-dependent activities.
Conservation
Early studies on Hox genes explored the evolutionary conservation of functions among homologous genes across the animal evolution by using transgenesis to express vertebrate Hox genes in Drosophila (Malicki et al., 1990; Mcginnis et al., 1990; Zhao et al., 1993; Lutz et al., 1996). These studies uncovered deep conservation of Hox gene function in specifying regional identity along the AP body axis over 600 million years of animal evolution. Protein sequences of alloparalogs, formed by tandem duplication of the ancestral Hox genes, show high diversification and each of them are known to drive distinct developmental and differentiation programs to regulate regional identities in specific tissues along the AP axis (Lamka et al., 1992; Mclain et al., 1992; Yokouchi et al., 1995; Carapuco et al., 2005). Despite these differences in functional roles, many alloparalogs show redundancy or overlaps of function in regulating developmental events in some specific tissues (Saurin et al., 2018). For example, it has been observed that ubiquitous expression of many Hox genes is important in Drosophila larval fat body cells, which appears distinctly different from their roles along the AP axis facilitated by their nested collinear expression patterns in embryos. Furthermore, functional analyses in autophagy inhibition uncovered no paralog specificity and suggested redundant functions of many HOX proteins (Banreti et al., 2014). Similarly, Drosophila Hox genes have been also shown to have redundant activity in specification of tritocerebrum identity, endocrine ring gland development, dorsal DA3 muscle lineage specification and head repression etc. (Hirth et al., 2001; Coiffier et al., 2008; Enriquez et al., 2010; Sanchez-Higueras et al., 2014). The redundant role of Hox genes is not limited to Drosophila, vertebrate HOX proteins have also been shown to have overlapping or redundant functions during development of several tissue types (Young et al., 2009; Lacombe et al., 2013; Denans et al., 2015). For example, vertebrate Hox6 paralogs (Hoxa6, Hoxc6, and Hoxb6) are required to specify lateral motor column motoneurons and functional studies displayed that Hox paralogs 5, 7, and 8 can all substitute for this function (Lacombe et al., 2013). These observations suggest that despite evidence for sequence and functional diversification among HOX alloparalogs, which underlies their distinct roles in axial patterning, some aspects of their functional activities have been conserved during evolution and play roles in specific tissue contexts during development.
As expected, the functional redundancy among the Hox genes is more common among the symparalogs formed more recently after whole cluster duplications in vertebrate lineage (Figure 3A). Deletion of a single gene or even a whole cluster does not show dramatic consequences on embryonic development, consistent with the idea of extensive functional redundancy between Hox genes (Medina-Martinez et al., 2000; Suemori and Noguchi, 2000; Spitz et al., 2001; van Den Akker et al., 2001; Hunter and Prince, 2002; Soshnikova et al., 2013). Gene swap experiments in mouse models have also demonstrated that symparalogs, formed after duplication of ancestral invertebrate cluster, are functionally equivalent (Horan et al., 1995; Manley and Capecchi, 1997; Greer et al., 2000; Tvrdik and Capecchi, 2006; Iacovino et al., 2009). An interesting example is a gene swap of mouse Hoxa3 and Hoxd3 that resulted in adult mice with no detectable developmental defects (Greer et al., 2000). These observations suggests that HOX paralogs have retained similarity in their activity through millions of years of evolution, which has been attributed to conservation of DNA binding properties of the homeodomain and a shared hexapeptide domain that mediates interaction with PBC factors.
Diversification
Over expression studies have provided evidence that Hox alloparalogs regulate development of specific organs across the body axis, indicating that they can drive distinct gene regulatory networks (Schneuwly et al., 1987; Lamka et al., 1992; Mclain et al., 1992; Yokouchi et al., 1995; Carapuco et al., 2005). Comparative analyses of gene expression profiles upon ubiquitous expression of Drosophila Hox genes show a very small number (1.3%) of common changes, and most of changes are unique to each gene, suggesting they individually regulate distinct set of targets (Hueber et al., 2007). This data implies that despite the high conservation of homeodomain region, HOX proteins regulate distinct set of genes to specify unique cellular identities across the AP body axis during animal development (Mcginnis and Krumlauf, 1992; Averof and Akam, 1993; Lawrence and Morata, 1994; Patterson et al., 2001; Averof, 2002). The functional diversity among the HOX proteins may be generated in multiple ways. Domain swaps of the HD, the most conserved region of HOX proteins, can result in functionally distinct activity in some developmental contexts. This suggests that even a small number of changes can lead to diversification of the DNA binding properties and transcriptional activity of HOX proteins (Zhao and Potter, 2001; 2002). In addition, studies have shown that amino acid differences among the Hox alloparalogs may not alter DNA binding preference but change their ability to recruit different coactivators or corepressors (Li and Mcginnis, 1999; Gebelein et al., 2004; Joshi et al., 2010). Another mechanism for generating diversity among HOX proteins is through its interaction with PBC group of cofactors, which alters DNA binding specificity. In fact, a high throughput study revealed that DNA binding specificities of HOX-Exd complex are only revealed upon heterodimerization (Slattery et al., 2011). These observations suggest that HOX proteins are subject to a variety of ways that can diversify or modulate their functional properties.
As discussed earlier, there are many examples of functional redundancy among the HOX symparalogs (Horan et al., 1995; Chen and Capecchi, 1997; Greer et al., 2000; Wahba et al., 2001; Tvrdik and Capecchi, 2006; Iacovino et al., 2009). However, several studies have also found functional diversification among symparalogs (Fromental-Ramain et al., 1996; Miguez et al., 2012; Singh et al., 2020; Singh et al., 2021). Analyses of Hoxa9 and Hoxd9 mutants in mouse revealed that these two symparalogous genes have both specific and redundant functions in lumbosacral axial skeleton patterning and in limb morphogenesis (Fromental-Ramain et al., 1996). Similarly, Hoxa2 and Hoxb2 symparalogs show synergistic interactions in regulation of gene expression in the hindbrain, however, during oligodendrogenesis in the mouse hindbrain Hoxb2 antagonizes Hoxa2 function (Davenne et al., 1999; Miguez et al., 2012). These analyses indicate that symparalogs have retained a lot of overlapping functions during evolution, but they have also diversified their functional roles thought changes in the patterns of expression and protein structure. An important understudied question in the field is how changes in amino acid sequences of symparalogous after duplication from the ancestor homolog relate to altered functions. Analyses of HOX1 proteins in Xenopus suggests they have redundant roles of HOXA1, B1 and D1 in hindbrain development (Mcnulty et al., 2005). Gene swaps of HoxA1 and HoxB1 in mouse also suggest they have largely overlapping or redundant roles (Tvrdik and Capecchi, 2006). However, in a recent study, we utilized CRISPR/Cas9 technology to replace the Drosophila Hox gene labial with its mouse homologs HoxA1, HoxB1 and HoxD1 to investigate conservation of ancestral functions and assess diversification during evolution (Singh et al., 2020). Despite similar degrees of protein sequence diversification of mouse HOX1 proteins from Drosophila Labial, our results revealed that among the HOX1 symparalogs (HOXA1, HOXB1 and HOXD1) only HOXA1 is able to rescue labial function, as HOXB1 and HOXD1 failed to do so (Figures 3B, 4A). This demonstrates a remarkable conservation of ancestral activity by HOXA1 and indicates that HOXB1 and HOXD1 have diversified through 600 million years of evolution (Figure 3B). Furthermore, consistent with their share ancestral activities, comparative genome-wide DNA binding properties revealed that HOXA1 and Labial have similar patterns of binding in mouse genome, while HOXB1 binds to a distinctly different set of targets. This adds support for neofunctionalization of HOXB1 by regulating a district gene regulatory program (Figure 3B) (Singh et al., 2020; Singh et al., 2021). Studies on mouse HOXD1 have shown that it has lost ancestral activities and appears to have undergone neofunctionalization through expression in new tissue types and altered activities in regulating novel gene regulatory programs (Guo et al., 2011). These distinct functional properties of mouse HOX1 symparalogs illustrate the diversification of HOXB1 and HOXD1 function and loss of ancestral activity in mammalian lineage. Mapping of the protein sequences that underlie functional diversification of HOXA1 and HOXB1 proteins revealed that a small number of changes across the protein cause functional diversification. It will be important to examine similar changes in other paralogy groups of HOX proteins to determine if this is a common means for modulating functional activities.
FIGURE 4
Critical Regions Underly HOX Protein Function and Functional Diversification
The deep conservation of HOX protein function across bilaterians and their redundant role in some tissue types suggest that the essential roles of HOX proteins in AP patterning has restricted their diversification. Preservation of many of their ancestral functions is likely mediated through conservation of the homeodomain and hexapeptide regions. The homeodomain of Drosophila HOX proteins is known to bind directly to DNA (Galant et al., 2002; Merabet et al., 2003). However, on many in vivo target sites, binding affinity and specificity is enhanced by interaction with PBC factors (Loker et al., 2021). The regions outside these domains are highly diversified among the HOX proteins that can impact activation or repression of transcription of potential target genes (Li et al., 1999; Li and Mcginnis, 1999). In this section of the review, we will discuss evidences on what is known about various domains of HOX proteins and how they may have diversified to adopt novel functions during evolution.
Homeodomain
The HD stands out as the most conserved region of all HOX proteins (Figure 5). Three-dimensional structural studies using X-ray crystallography and NMR spectroscopy have revealed the presence of three alpha-helix regions in the HDs (Gehring et al., 1994b; Passner et al., 1999; Piper et al., 1999). The third helix, also known as the recognition helix, directly contacts the DNA through the major groove of DNA, while the region between the first and second helices establishes specific contact with the phosphate backbone. In addition, sequences adjacent to the N-terminus of the HD, referred to as the N-terminal extension region, also makes specific contact with DNA in the minor groove. These regions that insert into the minor groove have been shown to confer specificity to the HDs of Drosophila Ultrabithorax and Sex combs reduced Hox proteins imparting distinct DNA recognition properties (Joshi et al., 2007). The interactions of the N-terminal extension along with contacts mediated by the third helix are the primary determinates of DNA binding specificity and serve as a key constraint, maintaining the high level of sequence conservation.
FIGURE 5
Genetic and biochemical data suggest that the HD of HOX proteins is not freely interchangeable with each other. Functional assays of chimeric HOX proteins with swapped homeodomains of paralogs have been found to alter their functions, suggesting that homeodomains are not equivalent to each other (Zhao and Potter, 2001; Zhao and Potter, 2002). This indicates that the small differences between HD sequences contribute to differences in their DNA binding properties and functional activities (Furukubo-Tokunaga et al., 1993; Phelan et al., 1994; Zappavigna et al., 1994; Noyes et al., 2008; Breitinger et al., 2012). For example, 28 out of 60 amino acids are identical between all Drosophila HOX proteins and 18/60 are identical between all mouse HOXA paralogs (Figures 5A,B). In addition, many differences among the paralogs are conservative amino acid replacements that would be expected to preserve biochemical properties. This means that there are a series of non-conservative amino acid replacements or changes (19/60 in Drosophila and 22/60 in mouse HOXA paralogs), that have the potential to alter the DNA binding properties of HDs. For example, the crystal structures of HOXB1 and HOXA9 display many paralog specific amino acid contacts with DNA (Figure 5B) (Piper et al., 1999; LaRonde-LeBlanc and Wolberger, 2003). On the basis of sequence alignments and specific amino-acid residues within the homeodomain along with their relative positions in Hox clusters, Hox genes have been assigned to 14 different alloparalogous groups (PG1-14). These fall into three general classes: the anterior class contains PG1-5, the central class PG6-8 and the posterior class genes (PG9-14) (Domsch et al., 2015; Frobius and Funch, 2017). Hence, despite high conservation between homeodomain sequences there are paralog-specific differences that may alter DNA binding properties and preferences of HDs to modulate HOX function. The paralog specific changes in homeodomains may also alter the interaction of conserved amino acids with the DNA (Gruschus et al., 1997).
Hexapeptide
The PBC group of proteins, Drosophila Extradenticle (Exd) and Homothorax (Hth) proteins and their vertebrate orthologs Pre-B-cell leukemia transcription factor (PBX1, PBX 2, PBX3 and PBX4) and myeloid ectopic leukemia virus integration site (MEIS1, MEIS2 and MEIS3) respectively, are well characterized cofactors of HOX proteins (Chan et al., 1994; Chang et al., 1995; Lu et al., 1995; Mann, 1995; Ryoo et al., 1999). Interactions with PBC proteins change DNA binding affinity and specificity of HOX monomer proteins (Slattery et al., 2011). In vitro binding analyses have demonstrated that the interaction between most HOX proteins and PBX are highly dependent on the HP motif, which contains the four core residues YPWM. This interaction alters the DNA binding properties of each protein, leading to the recognition of a bipartite HOX-PBC consensus site, and it enhances affinity and specificity of HOX target selection (van Dijk and Murre, 1994; Chang et al., 1995; Phelan et al., 1995; Popperl et al., 1995; Chan and Mann, 1996; Hudry et al., 2012).
The presence of the HP is important for HOX function, as loss of this motif has been implicated in the eliminating homeotic functions in several genes (fushi-tarazu, zerknüllt, bicoid) formed after duplication of Hox genes (Alonso et al., 2001; Panfilio and Akam, 2007). There is also evidence for sequence diversity among HP regions of different HOX proteins, which may impact the DNA binding properties and function of HOX proteins (Chang et al., 1995; Neuteboom et al., 1995; Medina-Martinez and Ramirez-Solis, 2003; Remacle et al., 2004). While the four core amino acids in the HP motif have been shown to be critical for its activity, only the tryptophan (W) residue in the 4th position is highly conserved. The rest of the amino acids are highly variable among the HOX protein paralogs, which may alter their interactions with PBX (Figures 5A,B) (Neuteboom et al., 1995; Neuteboom and Murre, 1997; Mann et al., 2009). The divergence of the HP region away from YPWM motif in HOX proteins from paralogous groups 8–13 suggests a progressive change in their function through evolution of this motif (Figure 5B).
Consistent with the divergence of the HP regions, in vitro DNA binding experiments show that HOX proteins from paralog groups 1–10 mainly interact with PBX, while paralogy groups 11-13 preferentially interact with MEIS (Chan et al., 1994; Shen W. F. et al., 1997; Shen W.-F. et al., 1997). HOX proteins also form trimeric complex with PBX and MEIS proteins upon binding to target genes (Berthelsen et al., 1998; Shanmugam et al., 1999; Ferretti et al., 2000; Penkov et al., 2013; Amin et al., 2015). The presence of MEIS proteins have also been shown to induce remodeling of HOX-PBX interactions that leads to changes in the requirements for motifs that drive trimeric complex formation (Dard et al., 2019). High throughput SELEX-seq technology, which measures the relative affinities of transcription factor complexes with all possible DNA sequences, has shown that interaction of HOX proteins with Exd-Hth dimers alters the DNA binding properties of all eight Drosophila HOX proteins. Based on this data, the DNA binding specificities of HOX proteins can be subdivided into three main classes with similar DNA binding preferences: 1) anterior, containing Labial and Proboscipedia; 2) middle with Deformed and Sex comb reduced; and posterior, with Antennapedia, Ultrabithorax, Abdominal-A and Abdominal-B. These observations are consistent with the idea that the nature of interactions with PBX and MEIS varies among different HOX proteins and this directly impacts their DNA binding properties and functions.
Recent research has opened new insights into the complex nature of interactions between HOX and PBC proteins. Studies have revealed that novel HOX-PBX cofactor interactions arise through the loss or gain of other interacting domains beyond the HP and these have the potential to further modify HOX DNA binding properties and functions (Shen W. F. et al., 1997; Liu et al., 2008; Noro et al., 2011; Slattery et al., 2011; Hudry et al., 2012; Merabet and Mann, 2016; Dard et al., 2019; Singh et al., 2020). Use of sensitive in vivo assays to quantify HOX-PBX interactions have revealed that, in the presence of MEIS, the HP motif is dispensable in all HOX proteins except those from anterior paralog groups 1 and 2 (Dard et al., 2018). Furthermore, detailed analyses uncovered alternative PBC interaction motifs in human HOXB3, HOXA7 and HOXC8 proteins that are critical for HOX-PBC interaction in specific cell contexts and DNA-binding site topologies. While in the case of HOXA9, the HOXA9-PBX-MEIS interaction is dependent on the activity of the HP motif and two paralog-specific residues of the homeodomain region. These observations suggest that HOX-PBC interactions are not rigid and may behave in a dynamic manner that vary based on specific cellular and genomic contexts. The array of HOX- PBC interactions might have evolved independently in novel ways as a common regulatory node or mechanism to diversify DNA binding properties of HOX proteins. The highly conserved W residue in the HP region has been shown to be required for binding on HOX–PBX consensus motifs, while HOX protein binding on non-consensus motifs and low-affinity binding sites may be altered through HP diversity in combination with other novel interaction domains (Foos et al., 2015; Dard et al., 2018; Singh et al., 2020).
Linker Region
HP region of HOX proteins is connected to the N-terminal of the HD through a linker (LK) region (Figures 5A,B). The sequence and size of the linker region is highly variable among the HOX paralogs, ranging from 3 to 50 amino acids in the vertebrate HOX proteins. Since the HP region of HOX proteins interacts with a highly conserved three amino acid loop extension domain in the HD region of PBX cofactors, the size and sequence of the linker region may constrain these interactions and further modulate DNA binding properties. Structural studies have revealed that the LK region of the Drosophila Sex Combs Reduced (Scr) HOX protein is critical for binding at some target sites, but on other binding targets it is disordered and makes minimal contribution to binding (Joshi et al., 2007). Converting the LK region of Drosophila Antp to that of Scr changes the DNA binding preference of the protein such that it binds very similar targets to those of Scr (Abe et al., 2015). Furthermore, comparative functional studies of mouse HOXA1 with HOXB1 proteins, Drosophila Ubx with Abd-A and Dfd with Scr show that the linker region is required for some aspects of the paralog specific functions of these HOX proteins (Merabet et al., 2003; Joshi et al., 2010; Singh et al., 2020). This indicates that the LK sequence has a role in modulating DNA binding preferences of HOX proteins and diversity in this region may be a determinant that underlies aspects of the paralog-specific functions of HOX proteins.
C-Terminal Region
The C-terminal region flanking the homeodomain of HOX proteins is highly variable in size and sequence. The importance of this part of HOX proteins has been generally ignored because of the high degree of variability among the HOX paralogs. This region has also been left out of the studies that analyzed the three-dimensional structure of HOX proteins with PBX and DNA. Sequence analyses show that this region varies from 7 amino acids in HOXA13 to 168 amino acids in HOXA2 (Figure 5B). This is also highly variable between the HOXA alloparalogs (HOXA1 to HOXA13) and symparalogs among Hox A, B, C and D clusters. Our recent cross-species functional analyses revealed that a highly conserved CTM motif (KEGLLP) is a key determinant involved in maintaining the homologous ancestral functions of Drosophila Labial in the mouse HOXA1 protein (Figures 4A,C) (Singh et al., 2020). Diversification of this motif in mammalian homologs, HOXB1 and HOXD1led to a loss of the ancestral activity. Furthermore, structure prediction analyses suggested that the CTM region may establish another interaction domain with PBX1 on DNA. Consistent with this idea, in vitro DNA binding analyses revealed that the CTM region is not sufficient for HOX1 interaction with PBX1, but it can modulate the ability of HOXA1 to interact with PBX1 when bound to a target site (Singh et al., 2020). Similarly, small, conserved regions in the C-terminal domains of HOX proteins have also been observed in Drosophila HOX proteins Ubx and Abd-A (Lelli et al., 2011). In vitro DNA binding analyses and three-dimensional structures show that this region in Ubx (called Ubd-A region) is required for direct physical interaction with Exd, a homolog of PBX1, and affects DNA binding properties (Foos et al., 2015). Hence, it may play an analogous role to the CTM, identified in HOXA1. The UBD-A region is highly conserved among insect orthologs of Ubx, but it is absent from other arthropods and onychophorans (Galant and Carroll, 2002; Ronshaugen et al., 2002). Functional analyses of this domain with a transgenic reporter line displayed a repressive role on a Distal-less (Dll) cis-regulatory element that is involved in promoting limb development. These results suggests that evolution of the UBD-A domain suppressed limb formation in abdominal segments and provided an evolutionary transition to hexapod limb pattern. Furthermore, in vitro DNA binding and in vivo reporter assays show cooperation between linker and UBD-A region of the Ubx protein, which suggests subtle changes in HOX–PBC complexes have played a major role in the diversification of HOX protein function in evolution (Saadaoui et al., 2011). These data illustrate that flexible extensions outside of the HD helix have the potential to mediate additional contacts between HOX proteins and their cofactors in concert with those mediated by the HP motif on the opposing side of the DNA. Together these findings demonstrate that small differences in sequences outside of HDs, which do not contact DNA themselves, may be a common mechanism for modulating protein-protein interactions that impact DNA binding specificity of HOX proteins.
Summary
A small number of changes in key amino acids may affect DNA binding properties and protein-protein interactions of transcription factors that can influence their DNA binding targets and potential for transcriptional activation or repression (Lai et al., 2001; Johnson et al., 2003; Chi, 2005; Sakazume et al., 2007; Shoubridge et al., 2012; Webb et al., 2012; Jubb et al., 2017; Singh et al., 2020; Chi, 2005). This indicates that sequence conservation alone may be a poor determinant in predicting the functions of HOX proteins. Genome-wide binding and gene expression analyses have revealed both overlapping and paralog-specific targets of HOX proteins (Hueber et al., 2007; Sorge et al., 2012; Beh et al., 2016; Bulajic et al., 2020; Singh et al., 2020). This suggests that there could be many common downstream targets in the genome, but the unique targets might have evolved by diversification of HOX proteins, resulting in selective alterations in their downstream target genes and inputs into novel gene regulatory programs. Paralog specific binding at unique target sites could arise through small differences in DNA binding domain and associated regions that alter interaction with cofactors such as PBX. Studies have shown that HOX-PBX interactions have diversified by altering interactions through the HP region and evolving novel contact points beyond it (Merabet et al., 2011; Saadaoui et al., 2011; Rivas et al., 2013). Altered HOX-PBX interactions may affect both DNA binding specificity of the HOX proteins and the transcriptional state of the target site. The diversification in the function of HOX proteins can be also introduced by changes outside the homeodomain and hexapeptide region (Chauvet et al., 2000; Gebelein et al., 2002; Singh et al., 2020). Several conserved short linear motifs (SLiMs) have been identified in HOX proteins that can often restrain the interaction potential of HOX proteins (Baeza et al., 2015). Deletion of SLiM motifs leads to loss, gain or interestingly enhanced interaction with cofactors that can alter regulatory potential of HOX proteins in a context specific manner. These dynamic changes in interaction with cofactors may alter Hox activity in tissue and cell type-specific manners which vary depending upon the cellular context (Capovilla et al., 1994; Joshi et al., 2010; Jung et al., 2014). These observations illustrate that a small number or subtle changes in multiple regions of HOX proteins can have a dramatic effect on their activity and may be an important feature that underlies the paralog specific functions by modifying DNA binding specificity and/or protein-protein interactions. Investigating the in vivo functional roles and evolution of other domains of HOX proteins beyond the HD should help to unravel how such similar proteins can exert diverse functions and be relevant in determining if this is a general mechanism used by other transcription factor families in the generation of diversity and evolution of novel functional activities of proteins.
Statements
Author contributions
NS and RK researched, wrote, edited and organized the manuscript.
Funding
This work was supported by funds from the Stowers Institute for Medical Research (Grant no. 1001) to RK.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1
AbeN.DrorI.YangL.SlatteryM.ZhouT.BussemakerH. J.et al (2015). Deconvolving the Recognition of DNA Shape from Sequence. Cell161, 307–318. 10.1016/j.cell.2015.02.008
2
AkamM.AverofM.Castelli-GairJ.DawesR.FalcianiF.FerrierD. (1994). The Evolving Role of Hox Genes in Arthropods. Dev. Suppl.1994, 209–215. 10.1242/dev.1994.supplement.209
3
AlbalatR.CañestroC. (2016). Evolution by Gene Loss. Nat. Rev. Genet.17, 379–391. 10.1038/nrg.2016.39
4
AlonsoC. R.Maxton-KuechenmeisterJ.AkamM. (2001). Evolution of Ftz Protein Function in Insects. Curr. Biol.11, 1473–1478. 10.1016/s0960-9822(01)00425-0
5
AminS.DonaldsonI. J.ZanninoD. A.HensmanJ.RattrayM.LosaM.et al (2015). Hoxa2 Selectively Enhances Meis Binding to Change a Branchial Arch Ground State. Develop. Cel32, 265–277. 10.1016/j.devcel.2014.12.024
6
ArendtD. (2018). Hox Genes and Body Segmentation. Science361, 1310–1311. 10.1126/science.aav0692
7
AssaiyaA.BuradaA. P.DhingraS.KumarJ. (2021). An Overview of the Recent Advances in Cryo-Electron Microscopy for Life Sciences. Emerg. Top. Life Sci.5, 151–168. 10.1042/etls20200295
8
AverofM.AkamM. (1993). HOM/Hox Genes of Artemia: Implications for the Origin of Insect and Crustacean Body Plans. Curr. Biol.3, 73–78. 10.1016/0960-9822(93)90158-k
9
AverofM. (2002). Arthropod Hox Genes: Insights on the Evolutionary Forces that Shape Gene Functions. Curr. Opin. Genet. Develop.12, 386–392. 10.1016/s0959-437x(02)00314-3
10
AvsecŽ.WeilertM.ShrikumarA.KruegerS.AlexandariA.DalalK.et al (2021). Base-resolution Models of Transcription-Factor Binding Reveal Soft Motif Syntax. Nat. Genet.53, 354–366. 10.1038/s41588-021-00782-6
11
BaëzaM.VialaS.HeimM.DardA.HudryB.DuffraisseM.et al (2015). Inhibitory Activities of Short Linear Motifs Underlie Hox Interactome Specificity In Vivo. Elife4, 6034. 10.7554/eLife.06034
12
BanretiA.HudryB.SassM.SaurinA. J.GrabaY. (2014). Hox Proteins Mediate Developmental and Environmental Control of Autophagy. Develop. Cel28, 56–69. 10.1016/j.devcel.2013.11.024
13
BehC. Y.El-SharnoubyS.ChatzipliA.RussellS.ChooS. W.WhiteR. (2016). Roles of Cofactors and Chromatin Accessibility in Hox Protein Target Specificity. Epigenetics. Chromatin9, 1. 10.1186/s13072-015-0049-x
14
BerthelsenJ.ZappavignaV.FerrettiE.MavilioF.BlasiF. (1998). The Novel Homeoprotein Prep1 Modulates Pbx-Hox Protein Cooperativity. Embo J.17, 1434–1445. 10.1093/emboj/17.5.1434
15
BiémontC.VieiraC. (2006). Junk DNA as an Evolutionary Force. Nature443, 521–524. 10.1038/443521a
16
BlommeT.VandepoeleK.De BodtS.SimillionC.MaereS.Van De PeerY. (2006). The Gain and Loss of Genes during 600 Million Years of Vertebrate Evolution. Genome Biol.7, R43. 10.1186/gb-2006-7-5-r43
17
BraaschI.BobeJ.GuiguenY.PostlethwaitJ. H. (2018). Reply to: 'Subfunctionalization versus Neofunctionalization after Whole-Genome Duplication'. Nat. Genet.50, 910–911. 10.1038/s41588-018-0163-3
18
BreitingerC.MaethnerE.Garcia-CuellarM.-P.SlanyR. K. (2012). The Homeodomain Region Controls the Phenotype of HOX-Induced Murine Leukemia. Blood120, 4018–4027. 10.1182/blood-2011-10-384685
19
BulajicM.SrivastavaD.DasenJ. S.WichterleH.MahonyS.MazzoniE. O. (2020). Differential Abilities to Engage Inaccessible Chromatin Diversify Vertebrate Hox Binding Patterns. Development147, dev194761. 10.1242/dev.194761
20
BurkiF.KaessmannH. (2004). Birth and Adaptive Evolution of a Hominoid Gene that Supports High Neurotransmitter Flux. Nat. Genet.36, 1061–1063. 10.1038/ng1431
21
CapovillaM.BrandtM.BotasJ. (1994). Direct Regulation of Decapentaplegic by Ultrabithorax and its Role in Drosophila Midgut Morphogenesis. Cell76, 461–475. 10.1016/0092-8674(94)90111-2
22
CarapuçoM.NóvoaA.BobolaN.MalloM. (2005). Hox Genes Specify Vertebral Types in the Presomitic Mesoderm. Genes Dev.19, 2116–2121. 10.1101/gad.338705
23
CarrollS. B. (2008). Evo-devo and an Expanding Evolutionary Synthesis: a Genetic Theory of Morphological Evolution. Cell134, 25–36. 10.1016/j.cell.2008.06.030
24
CarrollS. B. (1995). Homeotic Genes and the Evolution of Arthropods and Chordates. Nature376, 479–485. 10.1038/376479a0
25
ChanS.-K.JaffeL.CapovillaM.BotasJ.MannR. S. (1994). The DNA Binding Specificity of Ultrabithorax Is Modulated by Cooperative Interactions with Extradenticle, Another Homeoprotein. Cell78, 603–615. 10.1016/0092-8674(94)90525-8
26
ChanS. K.MannR. S. (1996). A Structural Model for a Homeotic Protein-Extradenticle-DNA Complex Accounts for the Choice of HOX Protein in the Heterodimer. Proc. Natl. Acad. Sci. U.S.A.93, 5223–5228. 10.1073/pnas.93.11.5223
27
ChangC. P.ShenW. F.RozenfeldS.LawrenceH. J.LargmanC.ClearyM. L. (1995). Pbx Proteins Display Hexapeptide-dependent Cooperative DNA Binding with a Subset of Hox Proteins. Genes Dev.9, 663–674. 10.1101/gad.9.6.663
28
ChauvetS.MerabetS.BilderD.ScottM. P.PradelJ.GrabaY. (2000). Distinct Hox Protein Sequences Determine Specificity in Different Tissues. Proc. Natl. Acad. Sci. U.S.A.97, 4064–4069. 10.1073/pnas.070046997
29
ChenF.CapecchiM. R. (1997). Targeted Mutations inHoxa-9andHoxb-9Reveal Synergistic Interactions. Develop. Biol.181, 186–196. 10.1006/dbio.1996.8440
30
ChiY.-I. (2005). Homeodomain Revisited: a Lesson from Disease-Causing Mutations. Hum. Genet.116, 433–444. 10.1007/s00439-004-1252-1
31
ClarkA. G. (1994). Invasion and Maintenance of a Gene Duplication. Proc. Natl. Acad. Sci. U.S.A.91, 2950–2954. 10.1073/pnas.91.8.2950
32
CoiffierD.CharrouxB.KerridgeS. (2008). Common Functions of central and Posterior Hox Genes for the Repression of Head in the Trunk of Drosophila. Development135, 291–300. 10.1242/dev.009662
33
CopleyR. R. (2008). The Animal in the Genome: Comparative Genomics and Evolution. Phil. Trans. R. Soc. B363, 1453–1461. 10.1098/rstb.2007.2235
34
DardA.JiaY.RebouletJ.BleicherF.LavauC.MerabetS. (2019). The Human HOXA9 Protein Uses Paralog-specific Residues of the Homeodomain to Interact with TALE-Class Cofactors. Sci. Rep.9, 5664. 10.1038/s41598-019-42096-y
35
DardA.RebouletJ.JiaY.BleicherF.DuffraisseM.VanakerJ.-M.et al (2018). Human HOX Proteins Use Diverse and Context-dependent Motifs to Interact with TALE Class Cofactors. Cel Rep.22, 3058–3071. 10.1016/j.celrep.2018.02.070
36
DavenneM.MaconochieM. K.NeunR.PattynA.ChambonP.KrumlaufR.et al (1999). Hoxa2 and Hoxb2 Control Dorsoventral Patterns of Neuronal Development in the Rostral Hindbrain. Neuron22, 677–691. 10.1016/s0896-6273(00)80728-x
37
DebodtS.MaereS.VandepeerY. (2005). Genome Duplication and the Origin of Angiosperms. Trends Ecol. Evol.20, 591–597. 10.1016/j.tree.2005.07.008
38
DehalP.BooreJ. L. (2005). Two Rounds of Whole Genome Duplication in the Ancestral Vertebrate. Plos Biol.3, e314. 10.1371/journal.pbio.0030314
39
DenansN.IimuraT.PourquiéO. (2015). Hox Genes Control Vertebrate Body Elongation by Collinear Wnt Repression. Elife4. 10.7554/eLife.04379
40
DesplanC.TheisJ.O'farrellP. H. (1988). The Sequence Specificity of Homeodomain-DNA Interaction. Cell54, 1081–1090. 10.1016/0092-8674(88)90123-7
41
DomschK.PapagiannouliF.LohmannI. (2015). The HOX-Apoptosis Regulatory Interplay in Development and Disease. Curr. Top. Dev. Biol.114, 121–158. 10.1016/bs.ctdb.2015.07.014
42
DoudnaJ. A.CharpentierE. (2014). The New Frontier of Genome Engineering with CRISPR-Cas9. Science346, 1258096. 10.1126/science.1258096
43
DubouleD.DolléP. (1989). The Structural and Functional Organization of the Murine HOX Gene Family Resembles that of Drosophila Homeotic Genes. EMBO J.8, 1497–1505. 10.1002/j.1460-2075.1989.tb03534.x
44
DubouleD. (2007). The Rise and Fall of Hox Gene Clusters. Development134, 2549–2560. 10.1242/dev.001065
45
DubouleD. (1998). Vertebrate Hox Gene Regulation: Clustering And/or Colinearity?Curr. Opin. Genet. Develop.8, 514–518. 10.1016/s0959-437x(98)80004-x
46
EnardW.GehreS.HammerschmidtK.HölterS. M.BlassT.SomelM.et al (2009). A Humanized Version of Foxp2 Affects Cortico-Basal Ganglia Circuits in Mice. Cell137, 961–971. 10.1016/j.cell.2009.03.041
47
EnriquezJ.BoukhatmiH.DuboisL.PhilippakisA. A.BulykM. L.MichelsonA. M.et al (2010). Multi-step Control of Muscle Diversity by Hox Proteins in the Drosophila Embryo. Development137, 457–466. 10.1242/dev.045286
48
EscrivaH.BertrandS.GermainP.Robinson-RechaviM.UmbhauerM.CartryJ.et al (2006). Neofunctionalization in Vertebrates: the Example of Retinoic Acid Receptors. Plos Genet.2, e102. 10.1371/journal.pgen.0020102
49
FernándezR.GabaldónT. (2020). Gene Gain and Loss across the Metazoan Tree of Life. Nat. Ecol. Evol.4, 524–533. 10.1038/s41559-019-1069-x
50
FerrettiE.MarshallH.PöpperlH.MaconochieM.KrumlaufR.BlasiF. (2000). Segmental Expression of Hoxb2 in R4 Requires Two Separate Sites that Integrate Cooperative Interactions between Prep1, Pbx and Hox Proteins. Development127, 155–166. 10.1242/dev.127.1.155
51
FoosN.Maurel-ZaffranC.MatéM. J.VincentelliR.HainautM.BerengerH.et al (2015). A Flexible Extension of the Drosophila Ultrabithorax Homeodomain Defines a Novel Hox/PBC Interaction Mode. Structure23, 270–279. 10.1016/j.str.2014.12.011
52
ForceA.LynchM.PickettF. B.AmoresA.YanY.-l.PostlethwaitJ. (1999). Preservation of Duplicate Genes by Complementary, Degenerative Mutations. Genetics151, 1531–1545. 10.1093/genetics/151.4.1531
53
FranchiniL. F.PollardK. S. (2017). Human Evolution: the Non-coding Revolution. BMC Biol.15, 89. 10.1186/s12915-017-0428-9
54
FriedmanR.HughesA. L. (2001). Pattern and Timing of Gene Duplication in Animal Genomes. Genome Res.11, 1842–1847. 10.1101/gr.200601
55
FröbiusA. C.FunchP. (2017). Rotiferan Hox Genes Give New Insights into the Evolution of Metazoan Bodyplans. Nat. Commun.8, 9. 10.1038/s41467-017-00020-w
56
Fromental-RamainC.WarotX.LakkarajuS.FavierB.HaackH.BirlingC.et al (1996). Specific and Redundant Functions of the Paralogous Hoxa-9 and Hoxd-9 Genes in Forelimb and Axial Skeleton Patterning. Development122, 461–472. 10.1242/dev.122.2.461
57
Furukubo-TokunagaK.FlisterS.GehringW. J. (1993). Functional Specificity of the Antennapedia Homeodomain. Proc. Natl. Acad. Sci. U.S.A.90, 6360–6364. 10.1073/pnas.90.13.6360
58
GalantR.CarrollS. B. (2002). Evolution of a Transcriptional Repression Domain in an Insect Hox Protein. Nature415, 910–913. 10.1038/nature717
59
GalantR.WalshC. M.CarrollS. B. (2002). Hox Repression of a Target Gene: Extradenticle-independent, Additive Action through Multiple Monomer Binding Sites. Development129, 3115–3126. 10.1242/dev.129.13.3115
60
Garcia-BellidoA.RipollP.MorataG. (1973). Developmental Compartmentalisation of the wing Disk of Drosophila. Nat. New Biol.245, 251–253. 10.1038/newbio245251a0
61
Garcia-FernàndezJ.HollandP. W. H. (1994). Archetypal Organization of the Amphioxus Hox Gene Cluster. Nature370, 563–566. 10.1038/370563a0
62
GasperskajaE.KučinskasV. (2017). The Most Common Technologies and Tools for Functional Genome Analysis. Aml24, 1–11. 10.6001/actamedica.v24i1.3457
63
GebeleinB.CuliJ.RyooH. D.ZhangW.MannR. S. (2002). Specificity of Distalless Repression and Limb Primordia Development by Abdominal Hox Proteins. Develop. Cel3, 487–498. 10.1016/s1534-5807(02)00257-5
64
GebeleinB.MckayD. J.MannR. S. (2004). Direct Integration of Hox and Segmentation Gene Inputs during Drosophila Development. Nature431, 653–659. 10.1038/nature02946
65
GehringW. J.AffolterM.BürglinT. (1994a). Homeodomain Proteins. Annu. Rev. Biochem.63, 487–526. 10.1146/annurev.bi.63.070194.002415
66
GehringW. J.QianY. Q.BilleterM.Furukubo-TokunagaK.SchierA. F.Resendez-PerezD.et al (1994b). Homeodomain-DNA Recognition. Cell78, 211–223. 10.1016/0092-8674(94)90292-5
67
GrahamA.PapalopuluN.KrumlaufR. (1989). The Murine and Drosophila Homeobox Gene Complexes Have Common Features of Organization and Expression. Cell57, 367–378. 10.1016/0092-8674(89)90912-4
68
GreerJ. M.PuetzJ.ThomasK. R.CapecchiM. R. (2000). Maintenance of Functional Equivalence during Paralogous Hox Gene Evolution. Nature403, 661–665. 10.1038/35001077
69
GruschusJ. M.TsaoD. H. H.WangL.-H.NirenbergM.FerrettiJ. A. (1997). Interactions of the vnd/NK-2 Homeodomain with DNA by Nuclear Magnetic Resonance Spectroscopy: Basis of Binding Specificity. Biochemistry36, 5372–5380. 10.1021/bi9620060
70
Guijarro-ClarkeC.HollandP. W. H.PapsJ. (2020a). Publisher Correction: Widespread Patterns of Gene Loss in the Evolution of the Animal Kingdom. Nat. Ecol. Evol.4, 661. 10.1038/s41559-020-1159-9
71
Guijarro-ClarkeC.HollandP. W. H.PapsJ. (2020b). Widespread Patterns of Gene Loss in the Evolution of the Animal Kingdom. Nat. Ecol. Evol.4, 519–523. 10.1038/s41559-020-1129-2
72
GuoT.MandaiK.CondieB. G.WickramasingheS. R.CapecchiM. R.GintyD. D. (2011). An Evolving NGF-Hoxd1 Signaling Pathway Mediates Development of Divergent Neural Circuits in Vertebrates. Nat. Neurosci.14, 31–36. 10.1038/nn.2710
73
HahnM. W.WrayG. A. (2002). The G-Value Paradox. Evol. Dev.4, 73–75. 10.1046/j.1525-142x.2002.01069.x
74
HanksM. C.LoomisC. A.HarrisE.TongC. X.Anson-CartwrightL.AuerbachA.et al (1998). Drosophila Engrailed Can Substitute for Mouse Engrailed1 Function in Mid-hindbrain, but Not Limb Development. Development125, 4521–4530. 10.1242/dev.125.22.4521
75
HeQ.JohnstonJ.ZeitlingerJ. (2015). ChIP-nexus Enables Improved Detection of In Vivo Transcription Factor Binding Footprints. Nat. Biotechnol.33, 395–401. 10.1038/nbt.3121
76
HeS.Del VisoF.ChenC.-Y.IkmiA.KroesenA. E.GibsonM. C. (2018). An Axial Hox Code Controls Tissue Segmentation and Body Patterning in Nematostella vectensis. Science361, 1377–1380. 10.1126/science.aar8384
77
HeX.ZhangJ. (2005). Rapid Subfunctionalization Accompanied by Prolonged and Substantial Neofunctionalization in Duplicate Gene Evolution. Genetics169, 1157–1164. 10.1534/genetics.104.037051
78
HedgesS. B.MarinJ.SuleskiM.PaymerM.KumarS. (2015). Tree of Life Reveals Clock-like Speciation and Diversification. Mol. Biol. Evol.32, 835–845. 10.1093/molbev/msv037
79
HirthF.LoopT.EggerB.MillerD. F. B.KaufmanT. C.ReichertH. (2001). Functional Equivalence of Hox Gene Products in the Specification of the Tritocerebrum during Embryonic Brain Development of Drosophila. Development128, 4781–4788. 10.1242/dev.128.23.4781
80
HoeggS.MeyerA. (2005). Hox Clusters as Models for Vertebrate Genome Evolution. Trends Genet.21, 421–424. 10.1016/j.tig.2005.06.004
81
HollandL. Z.Ocampo DazaD. (2018). A New Look at an Old Question: when Did the Second Whole Genome Duplication Occur in Vertebrate Evolution?Genome Biol.19, 209. 10.1186/s13059-018-1592-0
82
HollandP. W. H.Garcia-FernàndezJ.WilliamsN. A.SidowA. (1994). Gene Duplications and the Origins of Vertebrate Development. Dev. Suppl.1994, 125–133. 10.1242/dev.1994.supplement.125
83
HoranG. S.Ramírez-SolisR.FeatherstoneM. S.WolgemuthD. J.BradleyA.BehringerR. R. (1995). Compound Mutants for the Paralogous Hoxa-4, Hoxb-4, and Hoxd-4 Genes Show More Complete Homeotic Transformations and a Dose-dependent Increase in the Number of Vertebrae Transformed. Genes Dev.9, 1667–1677. 10.1101/gad.9.13.1667
84
HudryB.RemacleS.DelfiniM.-C.RezsohazyR.GrabaY.MerabetS. (2012). Hox Proteins Display a Common and Ancestral Ability to Diversify Their Interaction Mode with the PBC Class Cofactors. Plos Biol.10, e1001351. 10.1371/journal.pbio.1001351
85
HueberS. D.BezdanD.HenzS. R.BlankM.WuH.LohmannI. (2007). Comparative Analysis of Hox Downstream Genes inDrosophila. Development134, 381–392. 10.1242/dev.02746
86
HughesA. L. (1994). The Evolution of Functionally Novel Proteins after Gene Duplication. Proc. Biol. Sci.256, 119–124. 10.1098/rspb.1994.0058
87
HunterM. P.PrinceV. E. (2002). Zebrafish Hox Paralogue Group 2 Genes Function Redundantly as Selector Genes to Pattern the Second Pharyngeal Arch. Develop. Biol.247, 367–389. 10.1006/dbio.2002.0701
88
IacovinoM.HernandezC.XuZ.BajwaG.PratherM.KybaM. (2009). A Conserved Role for Hox Paralog Group 4 in Regulation of Hematopoietic Progenitors. Stem Cell Develop.18, 783–792. 10.1089/scd.2008.0227
89
IkutaT. (2011). Evolution of Invertebrate Deuterostomes and Hox/ParaHox Genes. Genomics. Proteomics. Bioinformatics9, 77–96. 10.1016/s1672-0229(11)60011-9
90
InnanH. (2009). Population Genetic Models of Duplicated Genes. Genetica137, 19–37. 10.1007/s10709-009-9355-1
91
IrieN.SatohN.KurataniS. (2018). The Phylum Vertebrata: a Case for Zoological Recognition. Zoolog. Lett4, 32. 10.1186/s40851-018-0114-y
92
JohnsonD.KanS.-h.OldridgeM.TrembathR. C.RocheP.EsnoufR. M.et al (2003). Missense Mutations in the Homeodomain of HOXD13 Are Associated with Brachydactyly Types D and E. Am. J. Hum. Genet.72, 984–997. 10.1086/374721
93
JoshiR.PassnerJ. M.RohsR.JainR.SosinskyA.CrickmoreM. A.et al (2007). Functional Specificity of a Hox Protein Mediated by the Recognition of Minor Groove Structure. Cell131, 530–543. 10.1016/j.cell.2007.09.024
94
JoshiR.SunL.MannR. (2010). Dissecting the Functional Specificities of Two Hox Proteins. Genes Dev.24, 1533–1545. 10.1101/gad.1936910
95
JubbH. C.PanduranganA. P.TurnerM. A.Ochoa-MontañoB.BlundellT. L.AscherD. B. (2017). Mutations at Protein-Protein Interfaces: Small Changes over Big Surfaces Have Large Impacts on Human Health. Prog. Biophys. Mol. Biol.128, 3–13. 10.1016/j.pbiomolbio.2016.10.002
96
JumperJ.EvansR.PritzelA.GreenT.FigurnovM.RonnebergerO.et al (2021). Highly Accurate Protein Structure Prediction with AlphaFold. Nature596, 583–589. 10.1038/s41586-021-03819-2
97
JungH.MazzoniE. O.SoshnikovaN.HanleyO.VenkateshB.DubouleD.et al (2014). Evolving Hox Activity Profiles Govern Diversity in Locomotor Systems. Develop. Cel29, 171–187. 10.1016/j.devcel.2014.03.008
98
KaufmanT. C.LewisR.WakimotoB. (1980). Cytogenetic Analysis of Chromosome 3 in Drosophila Melanogaster: The Homoeotic Gene Complex in Polytene Chromosome Interval 84a-B. Genetics94, 115–133. 10.1093/genetics/94.1.115
99
KleinjanD. A.BancewiczR. M.GautierP.DahmR.SchonthalerH. B.DamanteG.et al (2008). Subfunctionalization of Duplicated Zebrafish Pax6 Genes by Cis-Regulatory Divergence. Plos Genet.4, e29. 10.1371/journal.pgen.0040029
100
KmitaM.DubouleD. (2003). Organizing Axes in Time and Space; 25 Years of Colinear Tinkering. Science301, 331–333. 10.1126/science.1085753
101
KnoepflerP. S.KampsM. P. (1995). The Pentapeptide Motif of Hox Proteins Is Required for Cooperative DNA Binding with Pbx1, Physically Contacts Pbx1, and Enhances DNA Binding by Pbx1. Mol. Cel Biol15, 5811–5819. 10.1128/mcb.15.10.5811
102
KooninE. V. (2005). Orthologs, Paralogs, and Evolutionary Genomics. Annu. Rev. Genet.39, 309–338. 10.1146/annurev.genet.39.073003.114725
103
KrumlaufR. (1994). Hox Genes in Vertebrate Development. Cell78, 191–201. 10.1016/0092-8674(94)90290-9
104
KurakuS.MeyerA. (2009). The Evolution and Maintenance of Hox Gene Clusters in Vertebrates and the Teleost-specific Genome Duplication. Int. J. Dev. Biol.53, 765–773. 10.1387/ijdb.072533km
105
LacombeJ.HanleyO.JungH.PhilippidouP.SurmeliG.GrinsteinJ.et al (2013). Genetic and Functional Modularity of Hox Activities in the Specification of Limb-Innervating Motor Neurons. Plos Genet.9, e1003184. 10.1371/journal.pgen.1003184
106
LaiC. S. L.FisherS. E.HurstJ. A.Vargha-KhademF.MonacoA. P. (2001). A Forkhead-Domain Gene Is Mutated in a Severe Speech and Language Disorder. Nature413, 519–523. 10.1038/35097076
107
LaityJ. H.LeeB. M.WrightP. E. (2001). Zinc finger Proteins: New Insights into Structural and Functional Diversity. Curr. Opin. Struct. Biol.11, 39–46. 10.1016/s0959-440x(00)00167-6
108
LamkaM. L.BouletA. M.SakonjuS. (1992). Ectopic Expression of UBX and ABD-B Proteins during Drosophila Embryogenesis: Competition, Not a Functional Hierarchy, Explains Phenotypic Suppression. Development116, 841–854. 10.1242/dev.116.4.841
109
Laronde-LeblancN. A.WolbergerC. (2003). Structure of HoxA9 and Pbx1 Bound to DNA: Hox Hexapeptide and DNA Recognition Anterior to Posterior. Genes Dev.17, 2060–2072. 10.1101/gad.1103303
110
LaurentJ. M.GargeR. K.TeufelA. I.WilkeC. O.KachrooA. H.MarcotteE. M. (2020). Humanization of Yeast Genes with Multiple Human Orthologs Reveals Functional Divergence between Paralogs. Plos Biol.18, e3000627. 10.1371/journal.pbio.3000627
111
LawrenceP. A.MorataG. (1994). Homeobox Genes: Their Function in Drosophila Segmentation and Pattern Formation. Cell78, 181–189. 10.1016/0092-8674(94)90289-5
112
LelliK. M.NoroB.MannR. S. (2011). Variable Motif Utilization in Homeotic Selector (Hox)-Cofactor Complex Formation Controls Specificity. Proc. Natl. Acad. Sci. U.S.A.108, 21122–21127. 10.1073/pnas.1114118109
113
LemonsD.McginnisW. (2006). Genomic Evolution of Hox Gene Clusters. Science313, 1918–1922. 10.1126/science.1132040
114
LewisE. B. (1978). A Gene Complex Controlling Segmentation in Drosophila. Nature276, 565–570. 10.1038/276565a0
115
LewisE. B. (1994). Homeosis: the First 100 Years. Trends Genet.10, 341–343. 10.1016/0168-9525(94)90117-1
116
LiX.McginnisW. (1999). Activity Regulation of Hox Proteins, a Mechanism for Altering Functional Specificity in Development and Evolution. Proc. Natl. Acad. Sci. U.S.A.96, 6802–6807. 10.1073/pnas.96.12.6802
117
LiX.MurreC.McginnisW. (1999). Activity Regulation of a Hox Protein and a Role for the Homeodomain in Inhibiting Transcriptional Activation. Embo J.18, 198–211. 10.1093/emboj/18.1.198
118
LiuY.MatthewsK. S.BondosS. E. (2008). Multiple Intrinsically Disordered Sequences Alter DNA Binding by the Homeodomain of the Drosophila Hox Protein Ultrabithorax. J. Biol. Chem.283, 20874–20887. 10.1074/jbc.m800375200
119
LokerR.SannerJ. E.MannR. S. (2021). Cell-type-specific Hox Regulatory Strategies Orchestrate Tissue Identity. Curr. Biol.10.1016/j.cub.2021.07.030
120
LuQ.KnoepflerP.ScheeleJ.WrightD.KampsM. (1995). Both Pbx1 and E2A-Pbx1 Bind the DNA Motif ACCAATCAA Cooperatively with the Products of Mutliple Murine HOX Genes, Some of Which Themselves Are Oncogenes. Mol. Cel Biol15, 3786. 10.1128/mcb.15.7.3786
121
LundinL. G. (1999). Gene Duplications in Early Metazoan Evolution. Semin. Cel Develop. Biol.10, 523–530. 10.1006/scdb.1999.0333
122
LutzB.LuH. C.EicheleG.MillerD.KaufmanT. C. (1996). Rescue of Drosophila Labial Null Mutant by the Chicken Ortholog Hoxb-1 Demonstrates that the Function of Hox Genes Is Phylogenetically Conserved. Genes Dev.10, 176–184. 10.1101/gad.10.2.176
123
LynchM.ConeryJ. S. (2000). The Evolutionary Fate and Consequences of Duplicate Genes. Science290, 1151–1155. 10.1126/science.290.5494.1151
124
LynchM.ForceA. (2000). The Probability of Duplicate Gene Preservation by Subfunctionalization. Genetics154, 459–473. 10.1093/genetics/154.1.459
125
MaconochieM.NonchevS.MorrisonA.KrumlaufR. (1996). Paralogous Hox Genes: Function and Regulation. Annu. Rev. Genet.30, 529–556. 10.1146/annurev.genet.30.1.529
126
MaedaR. K.KarchF. (2006). The ABC of the BX-C: the Bithorax Complex Explained. Development133, 1413–1422. 10.1242/dev.02323
127
MalickiJ.SchughartK.McginnisW. (1990). Mouse Hox-2.2 Specifies Thoracic Segmental Identity in Drosophila Embryos and Larvae. Cell63, 961–967. 10.1016/0092-8674(90)90499-5
128
ManleyN. R.CapecchiM. R. (1997). Hox Group 3 Paralogous Genes Act Synergistically in the Formation of Somitic and Neural Crest-Derived Structures. Develop. Biol.192, 274–288. 10.1006/dbio.1997.8765
129
MannR. S.ChanS.-K. (1996). Extra Specificity from Extradenticle: the Partnership between HOX and PBX/EXD Homeodomain Proteins. Trends Genet.12, 258–262. 10.1016/0168-9525(96)10026-3
130
MannR. S.LelliK. M.JoshiR. (2009). Chapter 3 Hox Specificity. Curr. Top. Dev. Biol.88, 63–101. 10.1016/s0070-2153(09)88003-4
131
MannR. S. (1995). The Specificity of Homeotic Gene Function. Bioessays17, 855–863. 10.1002/bies.950171007
132
MarcussenT.OxelmanB.SkogA.JakobsenK. S. (2010). Evolution of Plant RNA Polymerase IV/V Genes: Evidence of Subneofunctionalization of Duplicated NRPD2/NRPE2-like Paralogs in Viola (Violaceae). BMC Evol. Biol.10, 45. 10.1186/1471-2148-10-45
133
MazetF.M. ShimeldS. (2002). Gene Duplication and Divergence in the Early Evolution of Vertebrates. Curr. Opin. Genet. Develop.12, 393–396. 10.1016/s0959-437x(02)00315-5
134
McginnisN.KuzioraM. A.McginnisW. (1990). Human Hox-4.2 and Drosophila Deformed Encode Similar Regulatory Specificities in Drosophila Embryos and Larvae. Cell63, 969–976. 10.1016/0092-8674(90)90500-e
135
McginnisW.KrumlaufR. (1992). Homeobox Genes and Axial Patterning. Cell68, 283–302. 10.1016/0092-8674(92)90471-n
136
MclainK.SchreinerC.YagerK. L.StockJ. L.Steven PotterS. (1992). Ectopic Expression of Hox-2.3 Induces Craniofacial and Skeletal Malformations in Transgenic Mice. Mech. Develop.39, 3–16. 10.1016/0925-4773(92)90021-b
137
McnultyC. L.PeresJ. N.BardineN.Van Den AkkerW. M. R.DurstonA. J. (2005). Knockdown of the Complete Hox Paralogous Group 1 Leads to Dramatic Hindbrain and Neural Crest Defects. Development132, 2861–2871. 10.1242/dev.01872
138
Medina-MartinezO.BradleyA.Ramírez-SolisR. (2000). A Large Targeted Deletion of Hoxb1-Hoxb9 Produces a Series Single-Segment Anterior Homeotic Transformations. Develop. Biol.222, 71–83. 10.1006/dbio.2000.9683
139
Medina-MartínezO.Ramírez-SolisR. (2003). In Vivo mutagenesis of the Hoxb8 Hexapeptide Domain Leads to Dominant Homeotic Transformations that Mimic the Loss-Of-Function Mutations in Genes of the Hoxb Cluster. Develop. Biol.264, 77–90. 10.1016/j.ydbio.2003.07.020
140
MerabetS.KambrisZ.CapovillaM.BérengerH.PradelJ.GrabaY. (2003). The Hexapeptide and Linker Regions of the AbdA Hox Protein Regulate its Activating and Repressive Functions. Develop. Cel4, 761–768. 10.1016/s1534-5807(03)00126-6
141
MerabetS.Litim-MecheriI.KarlssonD.DixitR.SaadaouiM.MonierB.et al (2011). Insights into Hox Protein Function from a Large Scale Combinatorial Analysis of Protein Domains. Plos Genet.7, e1002302. 10.1371/journal.pgen.1002302
142
MerabetS.MannR. S. (2016). To Be Specific or Not: The Critical Relationship between Hox and TALE Proteins. Trends Genet.32, 334–347. 10.1016/j.tig.2016.03.004
143
MiguezA.DucretS.Di MeglioT.ParrasC.HmidanH.HatonC.et al (2012). Opposing Roles for Hoxa2 and Hoxb2 in Hindbrain Oligodendrocyte Patterning. J. Neurosci.32, 17172–17185. 10.1523/jneurosci.0885-12.2012
144
MoraC.TittensorD. P.AdlS.SimpsonA. G. B.WormB. (2011). How many Species Are There on Earth and in the Ocean?Plos Biol.9, e1001127. 10.1371/journal.pbio.1001127
145
NadeauJ. H.SankoffD. (1997). Comparable Rates of Gene Loss and Functional Divergence after Genome Duplications Early in Vertebrate Evolution. Genetics147, 1259–1266. 10.1093/genetics/147.3.1259
146
NeuteboomS. T.MurreC. (1997). Pbx Raises the DNA Binding Specificity but Not the Selectivity of Antennapedia Hox Proteins. Mol. Cel Biol17, 4696–4706. 10.1128/mcb.17.8.4696
147
NeuteboomS. T.PeltenburgL. T.Van DijkM. A.MurreC. (1995). The Hexapeptide LFPWMR in Hoxb-8 Is Required for Cooperative DNA Binding with Pbx1 and Pbx2 Proteins. Proc. Natl. Acad. Sci. U.S.A.92, 9166–9170. 10.1073/pnas.92.20.9166
148
NoroB.LelliK.SunL.MannR. S. (2011). Competition for Cofactor-dependent DNA Binding Underlies Hox Phenotypic Suppression. Genes Dev.25, 2327–2332. 10.1101/gad.175539.111
149
NoyesM. B.ChristensenR. G.WakabayashiA.StormoG. D.BrodskyM. H.WolfeS. A. (2008). Analysis of Homeodomain Specificities Allows the Family-wide Prediction of Preferred Recognition Sites. Cell133, 1277–1289. 10.1016/j.cell.2008.05.023
150
OhnoS. (1970). Evolution by Gene Duplication. Springer-Verlag Berlin Heidelberg, 160.
151
PanfilioK. A.AkamM. (2007). A Comparison of Hox3 and Zen Protein Coding Sequences in Taxa that Span the Hox3/zen Divergence. Dev. Genes Evol.217, 323–329. 10.1007/s00427-007-0133-8
152
PanopoulouG.HennigS.GrothD.KrauseA.PoustkaA. J.HerwigR.et al (2003). New Evidence for Genome-wide Duplications at the Origin of Vertebrates Using an Amphioxus Gene Set and Completed Animal Genomes. Genome Res.13, 1056–1066. 10.1101/gr.874803
153
PapsJ.HollandP. W. H. (2018). Reconstruction of the Ancestral Metazoan Genome Reveals an Increase in Genomic novelty. Nat. Commun.9, 1730. 10.1038/s41467-018-04136-5
154
Pascual-AnayaJ.D’AnielloS.KurataniS.Garcia-FernàndezJ. (2013). Evolution of Hox Gene Clusters in Deuterostomes. BMC Develop. Biol.13, 26. 10.1186/1471-213x-13-26
155
PassnerJ. M.RyooH. D.ShenL.MannR. S.AggarwalA. K. (1999). Structure of a DNA-Bound Ultrabithorax-Extradenticle Homeodomain Complex. Nature397, 714–719. 10.1038/17833
156
PattersonL. T.PembaurM.PotterS. S. (2001). Hoxa11andHoxd11regulate Branching Morphogenesis of the Ureteric Bud in the Developing Kidney. Development128, 2153–2161. 10.1242/dev.128.11.2153
157
PaulD. S.SoranzoN.BeckS. (2014). Functional Interpretation of Non‐coding Sequence Variation: Concepts and Challenges. Bioessays36, 191–199. 10.1002/bies.201300126
158
PearsonJ. C.LemonsD.McginnisW. (2005). Modulating Hox Gene Functions during Animal Body Patterning. Nat. Rev. Genet.6, 893–904. 10.1038/nrg1726
159
PenkovD.San MartínD. M.Fernandez-DíazL. C.RossellóC. A.TorrojaC.Sánchez-CaboF.et al (2013). Analysis of the DNA-Binding Profile and Function of TALE Homeoproteins Reveals Their Specialization and Specific Interactions with Hox Genes/proteins. Cel Rep.3, 1321–1333. 10.1016/j.celrep.2013.03.029
160
PerryG. H.DominyN. J.ClawK. G.LeeA. S.FieglerH.RedonR.et al (2007). Diet and the Evolution of Human Amylase Gene Copy Number Variation. Nat. Genet.39, 1256–1260. 10.1038/ng2123
161
PhelanM. L.RambaldiI.FeatherstoneM. S. (1995). Cooperative Interactions between HOX and PBX Proteins Mediated by a Conserved Peptide Motif. Mol. Cel Biol15, 3989–3997. 10.1128/mcb.15.8.3989
162
PhelanM. L.SadoulR.FeatherstoneM. S. (1994). Functional Differences between HOX Proteins Conferred by Two Residues in the Homeodomain N-Terminal Arm. Mol. Cel. Biol.14, 5066–5075. 10.1128/mcb.14.8.5066
163
PiperD. E.BatchelorA. H.ChangC.-P.ClearyM. L.WolbergerC. (1999). Structure of a HoxB1-Pbx1 Heterodimer Bound to DNA. Cell96, 587–597. 10.1016/s0092-8674(00)80662-5
164
PontingC. P.RussellR. R. (2002). The Natural History of Protein Domains. Annu. Rev. Biophys. Biomol. Struct.31, 45–71. 10.1146/annurev.biophys.31.082901.134314
165
PöpperlH.BienzM.StuderM.ChanS.-K.AparicioS.BrennerS.et al (1995). Segmental Expression of Hoxb-1 Is Controlled by a Highly Conserved Autoregulatory Loop Dependent upon Exd/pbx. Cell81, 1031–1042. 10.1016/s0092-8674(05)80008-x
166
QuiringR.WalldorfU.KloterU.GehringW. J. (1994). Homology of the Eyeless Gene of Drosophila to the Small Eye Gene in Mice and Aniridia in Humans. Science265, 785–789. 10.1126/science.7914031
167
ReillyS. K.NoonanJ. P. (2016). Evolution of Gene Regulation in Humans. Annu. Rev. Genom. Hum. Genet.17, 45–67. 10.1146/annurev-genom-090314-045935
168
RemacleS.AbbasL.De BackerO.PacicoN.GavalasA.GofflotF.et al (2004). Loss of Function but No Gain of Function Caused by Amino Acid Substitutions in the Hexapeptide of Hoxa1 In Vivo. Mol. Cel Biol24, 8567–8575. 10.1128/mcb.24.19.8567-8575.2004
169
RichterD. J.FozouniP.EisenM. B.KingN. (2018). Gene Family Innovation, Conservation and Loss on the Animal Stem Lineage. Elife7, 34226. 10.7554/eLife.34226
170
RivasM. L.Espinosa-VázquezJ. M.SambraniN.GreigS.MerabetS.GrabaY.et al (2013). Antagonism versus Cooperativity with TALE Cofactors at the Base of the Functional Diversification of Hox Protein Function. Plos Genet.9, e1003252. 10.1371/journal.pgen.1003252
171
Roberts KingmanG. A.LeeD.JonesF. C.DesmetD.BellM. A.KingsleyD. M. (2021a). Longer or Shorter Spines: Reciprocal Trait Evolution in Stickleback via Triallelic Regulatory Changes in Stanniocalcin2a. Proc. Natl. Acad. Sci. U S A.118, 694118. 10.1073/pnas.2100694118
172
Roberts KingmanG. A.VyasD. N.JonesF. C.BradyS. D.ChenH. I.ReidK.et al (2021b). Predicting Future from Past: The Genomic Basis of Recurrent and Rapid Stickleback Evolution. Sci. Adv.7, 5285. 10.1126/sciadv.abg5285
173
RogersJ.GibbsR. A. (2014). Comparative Primate Genomics: Emerging Patterns of Genome Content and Dynamics. Nat. Rev. Genet.15, 347–359. 10.1038/nrg3707
174
RonshaugenM.McginnisN.McginnisW. (2002). Hox Protein Mutation and Macroevolution of the Insect Body Plan. Nature415, 914. 10.1038/nature716
175
RubinsteinM.De SouzaF. S. J. (2013). Evolution of Transcriptional Enhancers and Animal Diversity. Phil. Trans. R. Soc. B368, 20130017. 10.1098/rstb.2013.0017
176
RyooH. D.MartyT.CasaresF.AffolterM.MannR. S. (1999). Regulation of Hox Target Genes by a DNA Bound Homothorax/Hox/Extradenticle Complex. Development126, 5137–5148. 10.1242/dev.126.22.5137
177
SaadaouiM.MerabetS.Litim-MecheriI.ArbeilleE.SambraniN.DamenW.et al (2011). Selection of Distinct Hox-Extradenticle Interaction Modes fine-tunes Hox Protein Activity. Proc. Natl. Acad. Sci. U.S.A.108, 2276–2281. 10.1073/pnas.1006964108
178
SakazumeS.SorokinaE.IwamotoY.SeminaE. V. (2007). Functional Analysis of Human Mutations in Homeodomain Transcription Factor PITX3. BMC Mol. Biol.8, 84. 10.1186/1471-2199-8-84
179
Sánchez-HiguerasC.SotillosS.Castelli-Gair HombríaJ. (2014). Common Origin of Insect Trachea and Endocrine Organs from a Segmentally Repeated Precursor. Curr. Biol.24, 76–81. 10.1016/j.cub.2013.11.010
180
SandveS. R.RohlfsR. V.HvidstenT. R. (2018). Subfunctionalization versus Neofunctionalization after Whole-Genome Duplication. Nat. Genet.50, 908–909. 10.1038/s41588-018-0162-4
181
SaurinA. J.DelfiniM. C.Maurel-ZaffranC.GrabaY. (2018). The Generic Facet of Hox Protein Function. Trends Genet.34, 941–953. 10.1016/j.tig.2018.08.006
182
ScannellD. R.ByrneK. P.GordonJ. L.WongS.WolfeK. H. (2006). Multiple Rounds of Speciation Associated with Reciprocal Gene Loss in Polyploid Yeasts. Nature440, 341–345. 10.1038/nature04562
183
SchneuwlyS.KlemenzR.GehringW. J. (1987). Redesigning the Body Plan of Drosophila by Ectopic Expression of the Homoeotic Gene Antennapedia. Nature325, 816–818. 10.1038/325816a0
184
ScottM. P.TamkunJ. W.Hartzell IIIG. W. (1989). 3dThe Structure and Function of the Homeodomain. Biochim. Biophys. Acta (Bba) - Rev. Cancer989, 25–48. 10.1016/0304-419x(89)90033-4
185
SekigamiY.KobayashiT.OmiA.NishitsujiK.IkutaT.FujiyamaA.et al (2017). Hox Gene Cluster of the Ascidian, Halocynthia Roretzi, Reveals Multiple Ancient Steps of Cluster Disintegration during Ascidian Evolution. Zoolog. Lett3, 17. 10.1186/s40851-017-0078-3
186
SeoH.-C.EdvardsenR. B.MaelandA. D.BjordalM.JensenM. F.HansenA.et al (2004). Hox Cluster Disintegration with Persistent Anteroposterior Order of Expression in Oikopleura dioica. Nature431, 67–71. 10.1038/nature02709
187
SessionA. M.UnoY.KwonT.ChapmanJ. A.ToyodaA.TakahashiS.et al (2016). Genome Evolution in the Allotetraploid Frog Xenopus laevis. Nature538, 336–343. 10.1038/nature19840
188
ShanmugamK.GreenN. C.RambaldiI.SaragoviH. U.FeatherstoneM. S. (1999). PBX and MEIS as Non-DNA-binding Partners in Trimeric Complexes with HOX Proteins. Mol. Cel Biol19, 7577–7588. 10.1128/mcb.19.11.7577
189
ShenW.-F.RozenfeldS.LawrenceH. J.LargmanC. (1997b). The Abd-B-like Hox Homeodomain Proteins Can Be Subdivided by the Ability to Form Complexes with Pbx1a on a Novel DNA Target. J. Biol. Chem.272, 8198–8206. 10.1074/jbc.272.13.8198
190
ShenW. F.MontgomeryJ. C.RozenfeldS.MoskowJ. J.LawrenceH. J.BuchbergA. M.et al (1997a). AbdB-like Hox Proteins Stabilize DNA Binding by the Meis1 Homeodomain Proteins. Mol. Cel Biol17, 6448–6458. 10.1128/mcb.17.11.6448
191
ShoubridgeC.TanM. H.SeibothG.GeczJ. (2012). ARX Homeodomain Mutations Abolish DNA Binding and lead to a Loss of Transcriptional Repression. Hum. Mol. Genet.21, 1639–1647. 10.1093/hmg/ddr601
192
SinghN. P.De KumarB.PaulsonA.ParrishM. E.ScottC.ZhangY.et al (2021). Genome-Wide Binding Analyses of HOXB1 Revealed a Novel DNA Binding Motif Associated with Gene Repression. J. Dev. Biol.9, 6. 10.3390/jdb9010006
193
SinghN. P.De KumarB.PaulsonA.ParrishM. E.ZhangY.FlorensL.et al (2020). A Six-Amino-Acid Motif Is a Major Determinant in Functional Evolution of HOX1 Proteins. Genes Dev.34, 1680–1696. 10.1101/gad.342329.120
194
SlatteryM.RileyT.LiuP.AbeN.Gomez-AlcalaP.DrorI.et al (2011). Cofactor Binding Evokes Latent Differences in DNA Binding Specificity between Hox Proteins. Cell147, 1270–1282. 10.1016/j.cell.2011.10.053
195
SmithJ. J.TimoshevskayaN.YeC.HoltC.KeinathM. C.ParkerH. J.et al (2018). The Sea Lamprey Germline Genome Provides Insights into Programmed Genome Rearrangement and Vertebrate Evolution. Nat. Genet.50, 270–277. 10.1038/s41588-017-0036-1
196
SorgeS.HaN.PolychronidouM.FriedrichJ.BezdanD.KasparP.et al (2012). Thecis-regulatory Code of Hox Function inDrosophila. EMBO J.31, 3323–3333. 10.1038/emboj.2012.179
197
SoshnikovaN.DewaeleR.JanvierP.KrumlaufR.DubouleD. (2013). Duplications of Hox Gene Clusters and the Emergence of Vertebrates. Develop. Biol.378, 194–199. 10.1016/j.ydbio.2013.03.004
198
SpitzF.GonzalezF.PeichelC.VogtT. F.DubouleD.ZákányJ. (2001). Large Scale Transgenic and Cluster Deletion Analysis of the HoxD Complex Separate an Ancestral Regulatory Module from Evolutionary Innovations. Genes Dev.15, 2209–2214. 10.1101/gad.205701
199
SuemoriH.NoguchiS. (2000). Hox C Cluster Genes Are Dispensable for Overall Body Plan of Mouse Embryonic Development. Develop. Biol.220, 333–342. 10.1006/dbio.2000.9651
200
TruongD. M.BoekeJ. D. (2017). Resetting the Yeast Epigenome with Human Nucleosomes. Cell171, 1508–1519. 10.1016/j.cell.2017.10.043
201
TvrdikP.CapecchiM. R. (2006). Reversal of Hox1 Gene Subfunctionalization in the Mouse. Develop. Cel11, 239–250. 10.1016/j.devcel.2006.06.016
202
Van Den AkkerE.Fromental-RamainC.De GraaffW.Le MouellicH.BruletP.ChambonP.et al (2001). Axial Skeletal Patterning in Mice Lacking All Paralogous Group 8 Hox Genes. Development128, 1911–1921. 10.1242/dev.128.10.1911
203
Van DijkM. A.MurreC. (1994). Extradenticle Raises the DNA Binding Specificity of Homeotic Selector Gene Products. Cell78, 617–624. 10.1016/0092-8674(94)90526-6
204
VandenbusscheM.TheissenG.Van De PeerY.GeratsT. (2003). Structural Diversification and Neo-Functionalization during floral MADS-Box Gene Evolution by C-Terminal Frameshift Mutations. Nucleic Acids Res.31, 4401–4409. 10.1093/nar/gkg642
205
VandepoeleK.De VosW.TaylorJ. S.MeyerA.Van De PeerY. (2004). Major Events in the Genome Evolution of Vertebrates: Paranome Age and Size Differ Considerably between ray-finned Fishes and Land Vertebrates. Proc. Natl. Acad. Sci. U.S.A.101, 1638–1643. 10.1073/pnas.0307968100
206
Vieux-RochasM.MascrezB.KrumlaufR.DubouleD. (2013). Combined Function of HoxA and HoxB Clusters in Neural Crest Cells. Develop. Biol.382, 293–301. 10.1016/j.ydbio.2013.06.027
207
WagnerG. P.AmemiyaC.RuddleF. (2003). Hox Cluster Duplications and the Opportunity for Evolutionary Novelties. Proc. Natl. Acad. Sci. U.S.A.100, 14603–14606. 10.1073/pnas.2536656100
208
WahbaG. M.HostikkaS. L.CarpenterE. M. (2001). The Paralogous Hox Genes Hoxa10 and Hoxd10 Interact to Pattern the Mouse Hindlimb Peripheral Nervous System and Skeleton. Develop. Biol.231, 87–102. 10.1006/dbio.2000.0130
209
WalshB. (2003). Population-genetic Models of the Fates of Duplicate Genes. Genetica118, 279–294. 10.1007/978-94-010-0229-5_16
210
WebbB. D.ShaabanS.GasparH.CunhaL. F.SchubertC. R.HaoK.et al (2012). HOXB1 Founder Mutation in Humans Recapitulates the Phenotype of Hoxb1 Mice. Am. J. Hum. Genet.91, 171–179. 10.1016/j.ajhg.2012.05.018
211
WendelJ. F. (2000). Genome Evolution in Polyploids. Plant Mol. Biol.42, 225–249. 10.1007/978-94-011-4221-2_12
212
XieK. T.WangG.ThompsonA. C.WucherpfennigJ. I.ReimchenT. E.MaccollA. D. C.et al (2019). DNA Fragility in the Parallel Evolution of Pelvic Reduction in Stickleback Fish. Science363, 81–84. 10.1126/science.aan1425
213
YokouchiY.NakazatoS.YamamotoM.GotoY.KamedaT.IbaH.et al (1995). Misexpression of Hoxa-13 Induces Cartilage Homeotic Transformation and Changes Cell Adhesiveness in Chick Limb Buds. Genes Dev.9, 2509–2522. 10.1101/gad.9.20.2509
214
YoungT.RowlandJ. E.Van De VenC.BialeckaM.NovoaA.CarapucoM.et al (2009). Cdx and Hox Genes Differentially Regulate Posterior Axial Growth in Mammalian Embryos. Develop. Cel17, 516–526. 10.1016/j.devcel.2009.08.010
215
ZappavignaV.SartoriD.MavilioF. (1994). Specificity of HOX Protein Function Depends on DNA-Protein and Protein-Protein Interactions, Both Mediated by the Homeo Domain. Genes Dev.8, 732–744. 10.1101/gad.8.6.732
216
ZhaoJ. J.LazzariniR. A.PickL. (1993). The Mouse Hox-1.3 Gene Is Functionally Equivalent to the Drosophila Sex combs Reduced Gene. Genes Dev.7, 343–354. 10.1101/gad.7.3.343
217
ZhaoY.PotterS. S. (2002). Functional Comparison of the Hoxa 4, Hoxa 10, and Hoxa 11 Homeoboxes. Develop. Biol.244, 21–36. 10.1006/dbio.2002.0595
218
ZhaoY.PotterS. S. (2001). Functional Specificity of theHoxa13homeobox. Development128, 3197–3207. 10.1242/dev.128.16.3197
219
ZhenY.AndolfattoP. (2012). Methods to Detect Selection on Noncoding DNA. Methods Mol. Biol.856, 141–159. 10.1007/978-1-61779-585-5_6
220
ZimmerC. T.GarroodW. T.SinghK. S.RandallE.LuekeB.GutbrodO.et al (2018). Neofunctionalization of Duplicated P450 Genes Drives the Evolution of Insecticide Resistance in the Brown Planthopper. Curr. Biol.28, 268–274. 10.1016/j.cub.2017.11.060
Summary
Keywords
gene duplication and divergence, protein evolution, HOX proteins, Drosophila, mouse
Citation
Singh NP and Krumlauf R (2022) Diversification and Functional Evolution of HOX Proteins. Front. Cell Dev. Biol. 10:798812. doi: 10.3389/fcell.2022.798812
Received
20 October 2021
Accepted
08 April 2022
Published
13 May 2022
Volume
10 - 2022
Edited by
Edwina McGlinn, Monash University, Australia
Reviewed by
René Rezsohazy, Catholic University of Louvain, Belgium
Jennifer Mansfield, Columbia University, United States
Updates
Copyright
© 2022 Singh and Krumlauf.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Robb Krumlauf, rek@stowers.org
†Present address: Narendra Pratap Singh, Laboratory of Systems Genetics, Systems Biology Center, National Heart Lung and Blood Institute, Bethesda, MD, United States
This article was submitted to Morphogenesis and Patterning, a section of the journal Frontiers in Cell and Developmental Biology
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.