REVIEW article

Front. Cell Dev. Biol., 27 January 2022

Sec. Molecular and Cellular Reproduction

Volume 10 - 2022 | https://doi.org/10.3389/fcell.2022.827454

The Importance of Gene Duplication and Domain Repeat Expansion for the Function and Evolution of Fertilization Proteins

  • Department of Genome Sciences, University of Washington, Seattle, WA, United States

Article metrics

View details

20

Citations

6k

Views

1,3k

Downloads

Abstract

The process of gene duplication followed by gene loss or evolution of new functions has been studied extensively, yet the role gene duplication plays in the function and evolution of fertilization proteins is underappreciated. Gene duplication is observed in many fertilization protein families including Izumo, DCST, ZP, and the TFP superfamily. Molecules mediating fertilization are part of larger gene families expressed in a variety of tissues, but gene duplication followed by structural modifications has often facilitated their cooption into a fertilization function. Repeat expansions of functional domains within a gene also provide opportunities for the evolution of novel fertilization protein. ZP proteins with domain repeat expansions are linked to species-specificity in fertilization and TFP proteins that experienced domain duplications were coopted into a novel sperm function. This review outlines the importance of gene duplications and repeat domain expansions in the evolution of fertilization proteins.

Introduction

The fertilization of oocytes by sperm is an essential function in sexual reproduction, and multiple stages of the fertilization cascade have been described (Vacquier, 1998). First the sperm is drawn to the egg through chemotaxis (Ramírez-Gómez et al., 2019), and it then binds to the egg and releases proteins stored in the acrosome. The sperm then passes through the glycoproteinaceous egg coat (Monne et al., 2008; Wilburn and Swanson, 2016) (named Zona Pellucida in mammals), and proceeds to the oocyte cell membrane to initiate fusion (Siu et al., 2021). Understanding fertilization requires knowledge of both these broad steps of the fertilization cascade and the molecular mechanism underlying them. Research into the evolution and function of gametic proteins has implications for the development of novel contraception or treatments for unexplained human infertility (Gelbaya et al., 2014).

Many fertilization proteins are members of gene families that result from whole gene duplication events, which is a common mechanism for gene birth (Hughes, 1994). There has been extensive research into the relationship between gene duplication and other aspects of reproductive biology, including the neuroendocrine control of reproduction (Dufour et al., 2020), protease activity in the female reproductive tract (Kelleher et al., 2007; Kelleher and Markow, 2009), the resolution of sexual conflict (Gallach et al., 2010, 2011; Connallon and Clark, 2011; Gallach and Betrán, 2011), and hybridization barriers (Ting et al., 2004). This review specifically focuses on our growing knowledge of duplicated protein families implicated in fertilization. These proteins include the Izumo1 and Juno pair of interacting proteins, which each arose from independent gene duplication events and are essential to gamete membrane fusion function in mammals (Bianchi et al., 2014). DCST1 and DCST2 are paralogous proteins expressed in the sperm membrane of some bilateral animals, that are essential for fertilization (Inoue et al., 2021a, 1). Other duplicated proteins that act in fertilization include ADAMs (Primakoff and Myles, 2000; Civetta, 2003; Finn and Civetta, 2010), CRISPs (Busso et al., 2007; Da Ros et al., 2008; Gibbs et al., 2011; Maldera et al., 2014), Catspers (Clapham and Garbers, 2005; Navarro et al., 2008; Speer et al., 2021), and PKDREJ on the male side (Sutton et al., 2008), and tetraspanins (CD9,CD81) (Le Naour et al., 2000, 9; Miyado et al., 2000; Frolikova et al., 2018) and EBR1 on the female side (Kamei and Glabe, 2003; Hart, 2013). Genomic resources suggests that most of these families (ADAMs, tetraspanins, EBR, PKRDEJ, Catsper) have orthologs in other bilateral animals, while CRISP has orthologs in animals and in yeast (Howe et al., 2021).

Duplicated genes can experience further structural diversification, such as the duplication of individual functional protein domains. Proteins containing tandemly duplicated domains constitute a small, but significant portion of the genome (Han et al., 2007; Nacher et al., 2010). Independent tandem duplications of individual functional domains is also a recurrent trend in some protein families (TFP,ZP) (Galindo et al., 2002; Aagaard et al., 2010; Doty et al., 2016). There are several families of reproductive proteins on both the sperm and egg that show a history of being coopted from non-reproductive functions (Figure 1). Three finger proteins (TFPs) have been frequently coopted for fertilization including SPACA4 in tetrapods, Bouncer in fish, and multiple classes of sperm proteins in plethodontid salamanders (PMF, SPFs) (Doty et al., 2016; Fujihara et al., 2021). Salamander SPFs have a duplicated three finger protein domain, and have evolved structural modifications to those domains (Doty et al., 2016). Similarly, the family of ZP proteins (named after the Zona Pellucida), essential components of egg coats across vertebrates and invertebrates (Wilburn and Swanson, 2016), show evidence of independent expansions of ZP-N domains in different lineages (Liang and Dean, 1993; Galindo et al., 2002). These highlight the role of gene duplication and repeat domain expansions in fertilization. An observed trend is rapid sequence evolution in reproductive proteins (Swanson and Vacquier, 2002), and newly duplicated domains can provide novel substrates for evolving new functions at multiple stages of the fertilization cascade.

FIGURE 1

FIGURE 1

A cartoon schematic listing several protein families involved in reproduction. Those with notable repeat expansions are bolded.

The role of duplications in genome evolution is well documented across the tree of life. (Kondrashov et al., 2002; Conant and Wolfe, 2008). Gene duplication (Ponting, 2008) is an important source for new genetic material that facilitates biological innovation. The duplication and differentiation of genomic regions has been linked to the evolution of modularity in organisms (Wagner et al., 2007). Modularity is an abstract concept in which part of an organism (such as a network of protein interactions) functions largely autonomously relative to other aspects of the organisms’ biology (Wagner and Altenberg, 1996; West-Eberhard, 2005). Duplicated genes can participate in existing modular protein interaction networks, which facilitates increasing biological complexity of these networks (Wagner et al., 2007). Such increases in modular network complexity through gene duplication has been linked to adaptations in humans (Perry et al., 2007). Duplicated functional domains can similarly contribute to the evolution of biological complexity. This review will discuss both whole gene duplications and within gene domain duplications, and their role in the evolution of reproductive functions.

When genes duplicate they experience one of three possible fates: pseudogenization, subfunctionalization, and neofunctionalization (Walsh, 2003; Innan, 2009). Due to redundancies in function, the duplicated gene may no longer experience conservation and accumulate silencing mutations, resulting in a non-coding “pseudogene” (Figure 2). New mutations are frequently deleterious, so pseudogenization is hypothesized to be the most common fate of duplicated genes (Lynch and Conery, 2000). However, the other two fates of duplicated genes (subfunctionalization and neofunctionalization) are common mechanisms for biological innovation. Under neofunctionalization, one gene copy maintains its original function while the other experiences positive selection and evolves a novel function. While under subfunctionalization, both copies parse the original function, and neither gene is sufficient (Walsh, 2003; Innan, 2009).

FIGURE 2

FIGURE 2

There are multiple possible combinations of whole gene and domain duplications that can birth new genes and functional domains. Often a whole gene duplication begins the process, and then one of the gene duplicates experiences a domain expansion. These genes can then act as substrates for further duplication and neofunctionalization or subfunctionalization events.

Tandem duplications of individual protein domains within a gene can add greater complexity to the duplication process. Paralogous genes experiencing relaxed selection can have greater freedom for tandem domain duplications. There is strong research interest in the mechanisms underlying domain repeat expansions and how they affect the evolution of protein families (Björklund et al., 2005, 2006; Vogel et al., 2005; Weiner et al., 2006; Moore et al., 2008; Buljan and Bateman, 2009). Repeats can experience concerted evolution where they maintain a high degree of sequence identity (Elder and Turner, 1995; Liao, 1999), through unequal recombination and gene conversion (Schimenti, 1999). Under this scenario, the repeat expansion of highly identical domains is itself an innovation that could allow proteins to evolve novel functions. A repeat domain expansion could also affect dosage or protein interaction networks. Repeated domains could similarly differentiate in amino acid sequence, leading to neofunctionalization or subfunctionalization with the original domain. There are many possible orders and combinations of whole gene duplications and domain duplications that can contribute to the expansion of gene families (Figure 2). The process by which duplicate genes are maintained and experience subfunctionalization or neofunctionalization has been characterized under the duplication-degeneration-complementation model (DDC) (Force et al., 1999). While most classical population genetics models (Walsh, 2003; Innan, 2009) primarily discuss the effect of silencing or beneficial mutations on coding regions, the DDC model focuses on the effect of mutations on regulatory regions and subfunctionalization. Essentially, mutations that can silence certain regulatory regions in a duplicate gene can lead to the two genes partitioning expression and eventually function (Force et al., 1999). Other models have suggested subfunctionalization is primarily important as a transition phase to neofunctionalization (Rastogi and Liberles, 2005). The mechanisms of subfunctionalization and neofunctionalization remain a subject of rich debate, and concepts like the DDC model could have ramifications for protein evolution.

Subfunctionalization and neofunctionalization are foundational to the evolution of increased complexity in genomes and protein networks, and it is worth examining their particular importance in fertilization. Fertilization proteins are some of the most rapidly evolving proteins in genomes, as evidenced by high amino divergence (Swanson and Vacquier, 2002). Their rapid evolution is likely driven by factors such as sexual conflict and molecular arms race dynamics between gametes, which can also contribute to the maintenance of fertilization barriers between species (Gavrilets and Waxman, 2002; Gavrilets, 2014). The general trend of rapid evolution in reproductive proteins could facilitate the subfunctionalization or neofunctionalization of domains.

Izumo/Juno

The fusion of sperm and egg is necessary for fertilization, but there are only a few known pairs of interacting gametic proteins identified at this stage (Wilburn and Swanson, 2016). After years of research the interacting pair Izumo1 and Juno were identified in mammals (Bianchi et al., 2014). Izumo1 is the sperm expressed protein that mediates fusion (Inoue et al., 2005), and it interacts with the egg surface bound folate receptor 4 (known as Juno) (Bianchi and Wright, 2014). Izumo1 and Juno are each part of protein families with multiple paralogues, but only the Izumo1/Juno pair is capable of interacting (Bianchi et al., 2014). There are four members of both the Izumo (Ellerman et al., 2009) and folate receptor families (FOLR) in mammals (Elwood, 1989; Shen et al., 1994; Spiegelstein et al., 2000; Petronella and Drouin, 2014). Despite being part of the folate receptor family, Juno does not actually bind folate, exemplifying how a single member of this gene family has been coopted for a novel reproductive function (Bianchi et al., 2014).

While Juno represents a clear cooption into fertilization, the evolution of the Izumo gene family could also present an interesting example of neofunctionalization. Izumo1-4 all have a highly structurally conserved Izumo domain, but Izumo1 and Izumo4 have a shared pair of β-strands extending from this domain. Izumo1 experienced further structural modifications, as its β-strand extensions act as a hinge between the Izumo domain and a coopted immunoglobulin-like domain (Aydin et al., 2016; Ohto et al., 2016). Such substantial structural changes could be important for the protein’s ability to bind Juno. Research into other Izumo proteins suggests their involvement in fertilization. Izumo1-3 are transmembrane testis expressed proteins (Ellerman et al., 2009), while Izumo4 lacks a transmembrane domain and is expressed in the acrosome (Guasti et al., 2020). Izumo3 shows evidence of positive selection (Grayson and Civetta, 2012), and is necessary for sperm acrosome formation (Inoue et al., 2021b). The parallel histories of structural modifications in Izumo1 and Juno allowed for this essential interaction to evolve.

The relationship between Izumo1, Juno and their paralogs is highlighted by our phylogeny (Figure 3), which contains a long branch leading to Juno (FOLR4). This could reflect the rapid accumulation of mutations in the Juno branch as it was coopted to bind Izumo1 during gametic membrane fusion. Crystal structures confirm that 1:1 binding complexes form between Izumo1 and Juno (Aydin et al., 2016; Ohto et al., 2016). The adhesion of Izumo1 and Juno is conserved in mammals, and after the adhesion event Juno is released from the egg’s surface in vesicles and may act to bind and neutralize acrosome reacted sperm (Bianchi et al., 2014). In mammals, this interaction functions as a block against polyspermy (Bianchi and Wright, 2014). Blocks to polyspermy are essential, because eggs that fuse with multiple sperm are not viable and mammalian blocks to polyspermy exist at both the cell membrane (Evans, 2020) and egg coat (Fahrenkamp et al., 2020).

FIGURE 3

FIGURE 3

Unrooted maximum likelihood phylogenies for Izumo and FOLR gene families in a subset of primates, based on multiple sequence alignments (Katoh and Standley, 2013; Kozlov et al., 2019). Both gene families independently duplicated, but FOLR4 was coopted to bind Izumo1. Crystal structures have been obtained for the Izumo1-Juno complex (Aydin et al., 2016). For other proteins, alphafold predicted structures were used (Jumper et al., 2021). Using predictions of signal peptides and transmembrane domains, and secondary structural alignments, we identified shared izumo domains (Sonnhammer et al., 1998; Krogh et al., 2001; Almagro Armenteros et al., 2019).

Mutations to residues conserved in mammals greatly reduce binding, highlighting that particular changes to amino acid sequence and protein structure facilitated the neofunctionalization of Juno (Aydin et al., 2016). The more variable structural features (Ohto et al., 2016) in Juno may be important for the species-specificity of its binding to Izumo1 (Bianchi et al., 2014; Bianchi and Wright, 2015; Han et al., 2016). Comparative genetic analyses identify positive selection in a subset of mammals (Laurasiatheria) (Grayson and Civetta, 2012), and that Juno is likely rapidly coevolving with Izumo1, which contributes to the specificity of their interactions (Grayson, 2015). This specific binding is essential to both Juno’s function in initiating membrane fusion, and the post-fusion neutralization of acrosome-reacted sperm (Wright and Bianchi, 2016).

DCST

While Izumo1 and Juno are thought to initiate the complex molecular process of gametic membrane fusion in mammals, recent transgenic experiments and complementation studies have demonstrated that DCST1 and DCST2 are also essential (Inoue et al., 2021a). The DCST1/2 proteins are expressed on the sperm surface, and contain variable (4–6) transmembrane helical domains (DC-STAMP) (Inoue et al., 2021a, 1). DC-STAMP (dendritic cell specific transmembrane protein) refers to both the name of the domain and one of the proteins that contains this domain (Hartgers et al., 2000). The originally identified DC-STAMP protein has four transmembrane domains (Hartgers et al., 2001), and it is highly expressed in myeloid dendrocytes (Hartgers et al., 2000, 2001; Eleveld-Trancikova et al., 2005, 2008). The expression of DC-STAMP has been induced in macrophages (Staege et al., 2001) and osteoclasts (Nomiyama et al., 2005). This broad array of functions has motivated much research into the molecular mechanisms of DC-STAMP interactions, which has supported a role in osteoclast fusion (Kukita et al., 2004; Yagi et al., 2005; Jansen et al., 2009). There is also evidence of DC-STAMP related signaling in immune response (Nair et al., 2016). Along with these other diverse functions, it seem that DC-STAMP domains have been coopted into an essential role in sperm-egg membrane fusion.

DCST1/2 are the first known essential fertilization factors that are conserved in both vertebrates and invertebrates (Inoue et al., 2021a). DCST1/2 orthologues have been identified in both Caenorhabditis and Drosophila (Kroft et al., 2005; Wilson et al., 2006, 2018), which is the first known example sperm related factors being conserved this broadly across vertebrates and invertebrates (Inoue et al., 2021a, 1). However, there has been extensive structural diversification of these DCST1/2 across animals (Figure 4), especially between invertebrates and vertebrates. The low sequence identity of DCST1/2 proteins across animals, makes the conservation of reproductive function all the more remarkable. The ubiquitin ligase activity of DCST1 (Nair et al., 2016) raises questions about the function of DCST1/2 in sperm. There is intense research interest into the signal activity of long non-coding RNA produced by DCST1 and its effect on cancer cell progression (Hu et al., 2020; Ai et al., 2021, 1; Wang et al., 2021). More investigation is necessary to understand the function of DC-STAMP domains in a broad range of signaling networks, and how they were neofunctionalized in sperm DCST1/2.

FIGURE 4

FIGURE 4

A schematic of DCST1/2 proteins in multiple species. The number of transmembrane domains and loop lengths differ across species. Transmembrane domains and loops are colored based on conservation (Pei et al., 2008), where red coloration signifies amino acid conservation relative to humans. Therefore, the human examples are all red.

ZP Domains

ZP proteins are an essential class of egg coat proteins. An important feature of ZP proteins is the ZP module that consists of two domains, ZP-N and ZP-C, named after their relative N-terminal and C-terminal positioning. ZP-N and ZP-C domain are immunoglobular domains with characteristic patterns of disulfide bonding and β-sheets (Bokhove and Jovine, 2018), and likely resulted from an ancestral domain duplication. The variability in amino acid sequence, disulfide placement, and loop structures between ZP-N and ZP-C (Lin et al., 2011) suggests differences in their biological function and evolutionary history.

ZP-N domains are of particular interest, because they form asymmetric dimers with their β-sandwich edges which are believed to promote polymerization between ZP modules (Jovine et al., 2002; Wilburn and Swanson, 2017; Bokhove and Jovine, 2018). There are several ZP proteins identified in vertebrates (ZP1-4, ZPAX and ZPD), and there appears to be a history of lineage specific gain and loss of ZP proteins among vertebrates (Galindo et al., 2002; Conner et al., 2005; Goudet et al., 2008; Claw and Swanson, 2012; Meslin et al., 2012; Shu et al., 2015; Killingbeck and Swanson, 2018). Like other families discussed in this review, there also multiple ZP proteins with non-reproductive functions (e.g., uromodulin and tectorin-alpha) (Legan et al., 1997; Brunati et al., 2015; Bokhove et al., 2016). This may be another example of domains being coopted into a reproductive function, and ZP-N polymerization domains may be important for egg coat assembly and structure.

Not only has gene duplication produced an assortment of ZP proteins, there are also examples of independent repeat expansions of ZP-N in both vertebrates and invertebrate egg coat proteins (Figure 5). Some have only one additional ZP-N domain, but there are more dramatic repeat expansion like mammalian ZP2 (4 ZP-Ns) and abalone VERL (23 ZP-Ns) (Galindo et al., 2002). This process of domain duplications helped contribute to the diversity of ZP proteins. Given the ability of ZP-N domains to dimerize (Jovine et al., 2002; Bokhove and Jovine, 2018; Litscher and Wassarman, 2020), their duplications could create opportunities to evolve novel binding functions. Proteins with duplicated ZP-N domains, such as mammalian ZP2 and abalone VERL, are thought to be essential for species-specific in fertilization (Avella et al., 2013, 2014; Raj et al., 2017). Species-specificity in abalone is associated with the coevolution between VERL and the sperm protein lysin (Galindo et al., 2003; Clark et al., 2009), suggesting a cooption of ZP-Ns in sperm-egg interactions during egg coat dissolution.

FIGURE 5

FIGURE 5

Cladograms of ZP-N proteins are based on phylogenies from the literature (Aagaard et al., 2010; Claw and Swanson, 2012). These suggest independent repeat expansion of the ZP-N domain in both abalone and human egg coat genes.

Neofunctionalization of ZP-N domains can also drive new interactions between ZP proteins, such as the evolution of essential intermolecular crosslinks (Nishimura et al., 2019), which affect the physical assemblage of proteins in the supramolecular structure of the egg coat. Indeed, mouse research has suggested the importance of egg coat supramolecular structure in fertilization (Rankin et al., 2003; Avella et al., 2013). The structure of the egg coat is also important for the oocyte’s ability to block polyspermy. Protein cleavage of ZP2 is thought to initiate other egg coat structural modifications, which “harden” the egg coat and prevent sperm binding (Bleil et al., 1981; Gahlay et al., 2010; Fahrenkamp et al., 2020). Gene and domain duplications has produced a family of ZP proteins that contribute to the egg coat supramolecular structure, and are involved in both sperm recognition and polyspermy avoidance.

TFP Superfamily

Three finger proteins are defined by their TFP domains, which have a characteristic disulfide bonding pattern and fold (Galat, 2008; Galat et al., 2008). The broader TFP protein superfamily also includes proteins with structurally modified TFP-like domains (Galat, 2015). While TFPs were originally identified in snake toxins (Low et al., 1976; Tsernoglou and Petsko, 1977), members of the TFP superfamily have been to coopted for reproductive functions into sperm (SPACA4, PMFs, and SPFs), egg (Bouncer), and pheromones (PMFs, and SPFs) (Doty et al., 2016; Fujihara et al., 2021; Wilburn et al., 2022) (Figure 6). Bouncer plays a role in species-specific sperm-egg fusion in teleost fish (Herberg et al., 2018), which raises questions about how other TFPs may function in fertilization. The TFP superfamily includes both soluble and membrane bound proteins, and has great functional diversity across many tissues and taxa (Alape-Girón et al., 1999; Tsetlin, 1999; Kini, 2002; Nirthanan et al., 2003; Kessler et al., 2017). Similar to ZP proteins, we observe a history of gene duplication, repeat expansion of domains, and functional diversification of TFP containing proteins.

FIGURE 6

FIGURE 6

These two cladograms outline the whole gene and domain duplications within the three finger protein superfamilty (TFPs) and their expansions into reproductive systems. An ancestral single domain TFP (1D-TFP), duplicated into multiple vertebrate 1D-TFPs, and also had a domain level duplication which created a lineage of two TFP domain proteins (2D-TFPs). The 1D-TFPs produced tetrapod SPACA4, fish Bouncer, and multiple salamander PMFs. The 2D-TFPs also duplicated throughout vertebrates including salamander SPFs. Both salamander PMF and SPF protein families include both sperm and pheromone expressed members (Wilburn et al., 2022).

An ancestral TFP protein experienced gene duplication to produce an assortment of single TFP-like domain proteins (1D-TFPs). One of these TFP genes experienced a tandem domain expansion to produce the ancestor of proteins with two TFP-like domains (2D-TFPs). Three independent cooption events have produced TFPs in gametes (Figure 6). A cooption of 1D-TFPs occurred in the ancestor of tetrapods and produced both Bouncer in fish, and SPACA4 in amniotes (Figure 6). Despite their protein homology, Bouncer is egg expressed while SPACA4 is sperm expressed and it is implicated in interactions between the sperm and egg coat (Fujihara et al., 2021), highlighting the functional diversification of TFPs. Another independent cooption of 1D-TFPs resulted in the sperm expressed plethodontid modulating factor (PMFs) salamanders, which extensively duplicated producing a diverse family of reproductive molecules (Wilburn et al., 2012, 2014, 2017; Doty et al., 2016). Salamander PMFs are hypervariable proteins expressed in multiple tissues, and while they are structurally similar to other TFPs, they differ in loop length and disulfide bridge patterning, and show evidence of persistent diversification and positive selection (Palmer et al., 2010; Wilburn et al., 2012, 2014).

Among 2D-TFPs there was independent cooption into the sodefrin precursor-like factors (SPFs) of salamander sperm. SPFs then experienced their own history of gene duplications and radiation (Palmer et al., 2007). Both PMFs and SPFs experienced disulfide bond reshuffling relative to the canonical 1D-TFP and 2D-TFP binding patterns, and these changes reflect the neofunctionalization of these molecules (Doty et al., 2016). These striking examples of independent gene duplications and neofunctionalization for reproductive functions raises questions as to whether there a more additional unknown cooptions of TFPs, and whether some protein domains are more susceptible to cooption in diverse biological contexts.

Both PMFs and SPFs are highly duplicated protein families, with some members being coopted into pheromone function and others for sperm expression (Doty et al., 2016; Wilburn et al., 2022). As the sperm paralogs of PMFs and SPFs have only recently been discovered, functional studies have not yet been conducted. Male salamanders produce large number of PMFS and SPFs within their mental glands which promote ritual courtship behavior in females (Doty et al., 2016). Duplications of secreted male-expressed sperm proteins could have provided an evolutionary substrate to evolve new pheromones (Wilburn et al., 2022). Structural changes in PMFs and SPFs, such as disulfide shuffling, may contribute to new functions in both sperm and pheromones. The TFP’s superfamily’s history of gene duplication, domain duplication, and neofunctionalization provides a unique model for the evolution of large gene families involved in fertilization.

Discussion and Conclusion

Within this review we discussed examples of duplicated gene families with roles in fertilization. Gene duplication and neofunctionalization is an essential process for the evolution of greater genomic and functional complexity in organisms. Duplicated paralogous genes have been coopted into both sperm (Izumo1, DCST1/2) and egg (Juno) proteins involved in gamete membrane fusion (Bianchi et al., 2014; Inoue et al., 2021a, 1). Domain duplications within paralogs is also observed in the TFP superfamily and ZPs and has allowed both groups of genes to adopt novel functions at multiple stages of fertilization. As seen with TFPs, duplication events are often followed by notable protein structural changes (Doty et al., 2016) which may be tied to their cooption for novel fertilization functions. It is intriguing to consider hypotheses that account for these patterns of gene family expansion and diversification common in reproductive molecules.

Duplication events can facilitate the rapid evolution and neofunctionalization observed in many families of fertilization proteins. This rapid evolution can also be influenced by multiple factors such as sexual conflict, polyspermy avoidance, or genetic drift (Vacquier et al., 1997). The necessity of pathogen avoidance or blocks to polyspermy can drive oocytes to evolve reduced sperm binding ability. The sperm would then coevolutionarily “chase” the egg, which can contribute to the rapid sequence evolution of gametic proteins, and to the species-specificity of these protein interactions (Gavrilets and Waxman, 2002; Gavrilets, 2014). The rapid evolution of reproductive proteins is explored in terms of amino acid mutations, but the repeat expansion of domains could also be part of this trend. Proteins with repeated domains could experience drift resulting in ever-changing molecular target, that interacting proteins must coevolutionarily chase (Vacquier et al., 1997).

Duplications of reproductive proteins can also contribute to the phenomenon of functional redundancy, in which two duplicated genes have partially overlapping functions and can compensate for each other’s loss (Kafri et al., 2009). Functional redundancy has been observed in the CRISP family of reproductive proteins (Curci et al., 2020), and this property could emerge in other large protein families. While functional redundancy seems like it would be temporary as duplicated genes subfunctionalized or neofunctionalized, it can be a surprisingly evolutionarily stable property. Functional redundancy could confer fitness advantages by maintaining the robusticity of protein interaction networks in spite of stochasticity of expression between cells (Kafri et al., 2009). The rapid evolution of other reproductive proteins in these networks could place even greater value on robustness and stability of essential functions. Robusticity in these protein networks is believed to reduce the fitness cost of new mutations, which would increase the “evolvability” of these proteins and facilitate functional innovation (Kirschner and Gerhart, 2008). The concepts of functional redundancy and robusticity of function may also apply to domain repeat expansions like the ZP-N domains of VERL. The processes of gene duplication, repeat domain expansion, structural modification, and neofunctionalization have been fundamental to the evolution of reproductive molecules across life.

Statements

Author contributions

Both authors were involved in the conception of this review. AR principally conducted the literature review, and the writing of the manuscript. WS provided substantial literature suggestions and editorial feedback.

Funding

The lab is funded by the NIH grant HD105025 awarded to WS.

Acknowledgments

We thank Damien B. Wilburn for sharing his code for visualizing transmembrane proteins, and fellow lab members Jolie Carlisle and Jan Aagaard for engaging in discussions.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

  • 1

    AagaardJ. E.VacquierV. D.MacCossM. J.SwansonW. J. (2010). ZP Domain Proteins in the Abalone Egg Coat Include a Paralog of VERL under Positive Selection that Binds Lysin and 18-kDa Sperm Proteins. Mol. Biol. Evol.27, 193203. 10.1093/molbev/msp221

  • 2

    AiY.LiuS.LuoH.WuS.WeiH.TangZ.et al (2021). lncRNA DCST1-AS1 Facilitates Oral Squamous Cell Carcinoma by Promoting M2 Macrophage Polarization through Activating NF-κB Signaling. J. Immunol. Res.2021, 19. 10.1155/2021/5524231

  • 3

    Alape-GirónA.PerssonB.CederlundE.Flores-DíazM.GutiérrezJ. M.ThelestamM.et al (1999). Elapid Venom Toxins: Multiple Recruitments of Ancient Scaffolds. Eur. J. Biochem.259, 225234. 10.1046/j.1432-1327.1999.00021.x

  • 4

    Almagro ArmenterosJ. J.TsirigosK. D.SønderbyC. K.PetersenT. N.WintherO.BrunakS.et al (2019). SignalP 5.0 Improves Signal Peptide Predictions Using Deep Neural Networks. Nat. Biotechnol.37, 420423. 10.1038/s41587-019-0036-z

  • 5

    AvellaM. A.XiongB.DeanJ. (2013). The Molecular Basis of Gamete Recognition in Mice and Humans. Mol. Hum. Reprod19, 279289. 10.1093/molehr/gat004

  • 6

    AvellaM. A.BaibakovB.DeanJ. (2014). A Single Domain of the ZP2 Zona Pellucida Protein Mediates Gamete Recognition in Mice and Humans. J. Cell. Biol.205, 801809. 10.1083/jcb.201404025

  • 7

    AydinH.SultanaA.LiS.ThavalingamA.LeeJ. E. (2016). Molecular Architecture of the Human Sperm IZUMO1 and Egg JUNO Fertilization Complex. Nature534, 562565. 10.1038/nature18595

  • 8

    BianchiE.WrightG. J. (2014). Izumo Meets Juno. Cell Cycle13, 20192020. 10.4161/cc.29461

  • 9

    BianchiE.WrightG. J. (2015). Cross-species Fertilization: the Hamster Egg Receptor, Juno, Binds the Human Sperm Ligand, Izumo1. Phil. Trans. R. Soc. B370, 20140101. 10.1098/rstb.2014.0101

  • 10

    BianchiE.DoeB.GouldingD.WrightG. J. (2014). Juno Is the Egg Izumo Receptor and Is Essential for Mammalian Fertilization. Nature508, 483487. 10.1038/nature13203

  • 11

    BjörklundÅ. K.EkmanD.LightS.Frey-SköttJ.ElofssonA. (2005). Domain Rearrangements in Protein Evolution. J. Mol. Biol.353, 911923. 10.1016/j.jmb.2005.08.067

  • 12

    BjörklundÅ. K.EkmanD.ElofssonA. (2006). Expansion of Protein Domain Repeats. Plos Comput. Biol.2, e114. 10.1371/journal.pcbi.0020114

  • 13

    BleilJ. D.BeallC. F.WassarmanP. M. (1981). Mammalian Sperm-Egg Interaction: Fertilization of Mouse Eggs Triggers Modification of the Major Zona Pellucida Glycoprotein, ZP2. Dev. Biol.86, 189197. 10.1016/0012-1606(81)90329-8

  • 14

    BokhoveM.JovineL. (2018). Structure of Zona Pellucida Module Proteins. Curr. Topic Dev. Biol.130, 413442. 10.1016/bs.ctdb.2018.02.007

  • 15

    BokhoveM.NishimuraK.BrunatiM.HanL.de SanctisD.RampoldiL.et al (2016). A Structured Interdomain Linker Directs Self-Polymerization of Human Uromodulin. Proc. Natl. Acad. Sci. USA113, 15521557. 10.1073/pnas.1519803113

  • 16

    BrunatiM.PeruccaS.HanL.CattaneoA.ConsolatoF.AndolfoA.et al (2015). The Serine Protease Hepsin Mediates Urinary Secretion and Polymerisation of Zona Pellucida Domain Protein Uromodulin. Elife4, e08887. 10.7554/eLife.08887

  • 17

    BuljanM.BatemanA. (2009). The Evolution of Protein Domain Families. Biochem. Soc. Trans.37, 751755. 10.1042/BST0370751

  • 18

    BussoD.GoldweicN. M.HayashiM.KasaharaM.CuasnicúP. S. (2007). Evidence for the Involvement of Testicular Protein CRISP2 in Mouse Sperm-Egg Fusion1. Biol. Reprod.76, 701708. 10.1095/biolreprod.106.056770

  • 19

    CivettaA. (2003). Positive Selection within Sperm-Egg Adhesion Domains of Fertilin: An ADAM Gene with a Potential Role in Fertilization. Mol. Biol. Evol.20, 2129. 10.1093/molbev/msg002

  • 20

    ClaphamD. E.GarbersD. L. (2005). International Union of Pharmacology. L. Nomenclature and Structure-Function Relationships of CatSper and Two-Pore Channels. Pharmacol. Rev.57, 451454. 10.1124/pr.57.4.7

  • 21

    ClarkN. L.GasperJ.SekinoM.SpringerS. A.AquadroC. F.SwansonW. J. (2009). Coevolution of Interacting Fertilization Proteins. Plos Genet.5, e1000570. 10.1371/journal.pgen.1000570

  • 22

    ClawK. G.SwansonW. J. (2012). Evolution of the Egg: New Findings and Challenges. Annu. Rev. Genom. Hum. Genet.13, 109125. 10.1146/annurev-genom-090711-163745

  • 23

    ConantG. C.WolfeK. H. (2008). Turning a Hobby into a Job: How Duplicated Genes Find New Functions. Nat. Rev. Genet.9, 938950. 10.1038/nrg2482

  • 24

    ConnallonT.ClarkA. G. (2011). The Resolution of Sexual Antagonism by Gene Duplication. Genetics187, 919937. 10.1534/genetics.110.123729

  • 25

    ConnerS. J.LefièvreL.HughesD. C.BarrattC. L. R. (2005). Cracking the Egg: Increased Complexity in the Zona Pellucida. Human Reprod.20, 11481152. 10.1093/humrep/deh835

  • 26

    CurciL.BrukmanN. G.Weigel MuñozM.RojoD.CarvajalG.SulzykV.et al (2020). Functional Redundancy and Compensation: Deletion of Multiple Murine Crisp Genes Reveals Their Essential Role for Male Fertility. FASEB j.34, 1571815733. 10.1096/fj.202001406R

  • 27

    Da RosV. G.MalderaJ. A.WillisW. D.CohenD. J.GouldingE. H.GelmanD. M.et al (2008). Impaired Sperm Fertilizing Ability in Mice Lacking Cysteine-RIch Secretory Protein 1 (CRISP1). Dev. Biol.320, 1218. 10.1016/j.ydbio.2008.03.015

  • 28

    DotyK. A.WilburnD. B.BowenK. E.FeldhoffP. W.FeldhoffR. C. (2016). Co-option and Evolution of Non-olfactory Proteinaceous Pheromones in a Terrestrial Lungless Salamander. J. Proteomics135, 101111. 10.1016/j.jprot.2015.09.019

  • 29

    DufourS.QuératB.TostivintH.PasqualiniC.VaudryH.RousseauK. (2020). Origin and Evolution of the Neuroendocrine Control of Reproduction in Vertebrates, with Special Focus on Genome and Gene Duplications. Physiol. Rev.100, 869943. 10.1152/physrev.00009.2019

  • 30

    ElderJ. F.TurnerB. J. (1995). Concerted Evolution of Repetitive DNA Sequences in Eukaryotes. Q. Rev. Biol.70, 297320. 10.1086/419073

  • 31

    Eleveld-TrancikovaD.TriantisV.MoulinV.LoomanM. W. G.WijersM.FransenJ. A. M.et al (2005). The Dendritic Cell-Derived Protein DC-STAMP Is Highly Conserved and Localizes to the Endoplasmic Reticulum. J. Leukoc. Biol.77, 337343. 10.1189/jlb.0804441

  • 32

    Eleveld-TrancikovaD.JanssenR. A. J.HendriksI. A. M.LoomanM. W. G.MoulinV.JansenB. J. H.et al (2008). The DC-Derived Protein DC-STAMP Influences Differentiation of Myeloid Cells. Leukemia22, 455459. 10.1038/sj.leu.2404910

  • 33

    EllermanD. A.PeiJ.GuptaS.SnellW. J.MylesD.PrimakoffP. (2009). Izumo Is Part of a Multiprotein Family Whose Members Form Large Complexes on Mammalian Sperm. Mol. Reprod. Dev.76, 11881199. 10.1002/mrd.21092

  • 34

    ElwoodP. C. (1989). Molecular Cloning and Characterization of the Human Folate-Binding Protein cDNA from Placenta and Malignant Tissue Culture (KB) Cells. J. Biol. Chem.264, 1489314901. 10.1016/S0021-9258(18)63786-X

  • 35

    EvansJ. P. (2020). Preventing Polyspermy in Mammalian Eggs-Contributions of the Membrane Block and Other Mechanisms. Mol. Reprod. Dev.87, 341349. 10.1002/mrd.23331

  • 36

    FahrenkampE.AlgarraB.JovineL. (2020). Mammalian Egg Coat Modifications and the Block to Polyspermy. Mol. Reprod. Dev.87, 326340. 10.1002/mrd.23320

  • 37

    FinnS.CivettaA. (2010). Sexual Selection and the Molecular Evolution of ADAM Proteins. J. Mol. Evol.71, 231240. 10.1007/s00239-010-9382-7

  • 38

    ForceA.LynchM.PickettF. B.AmoresA.YanY.-l.PostlethwaitJ. (1999). Preservation of Duplicate Genes by Complementary, Degenerative Mutations. Genetics151, 15311545. 10.1093/genetics/151.4.1531

  • 39

    FrolikovaM.Manaskova-PostlerovaP.CernyJ.JankovicovaJ.SimonikO.PohlovaA.et al (2018). CD9 and CD81 Interactions and Their Structural Modelling in Sperm Prior to Fertilization. Ijms19, 1236. 10.3390/ijms19041236

  • 40

    FujiharaY.HerbergS.BlahaA.PanserK.KobayashiK.LarasatiT.et al (2021). The Conserved Fertility Factor SPACA4/Bouncer Has Divergent Modes of Action in Vertebrate Fertilization. Proc. Natl. Acad. Sci. USA118, e2108777118. 10.1073/pnas.2108777118

  • 41

    GahlayG.GauthierL.BaibakovB.EpifanoO.DeanJ. (2010). Gamete Recognition in Mice Depends on the Cleavage Status of an Egg's Zona Pellucida Protein. Science329, 216219. 10.1126/science.1188178

  • 42

    GalatA.GrossG.DrevetP.SatoA.MénezA. (2008). Conserved Structural Determinants in Three-Fingered Protein Domains. FEBS J.275, 32073225. 10.1111/j.1742-4658.2008.06473.x

  • 43

    GalatA. (2008). The Three-Fingered Protein Domain of the Human Genome. Cell. Mol. Life Sci.65, 34813493. 10.1007/s00018-008-8473-8

  • 44

    GalatA. (2015). Multidimensional Drift of Sequence Attributes and Functional Profiles in the Superfamily of the Three-Finger Proteins and Their Structural Homologues. J. Chem. Inf. Model.55, 20262041. 10.1021/acs.jcim.5b00322

  • 45

    GalindoB. E.MoyG. W.SwansonW. J.VacquierV. D. (2002). Full-Length Sequence of VERL, the Egg Vitelline Envelope Receptor for Abalone Sperm Lysin. Gene288, 111117. 10.1016/s0378-1119(02)00459-6

  • 46

    GalindoB. E.VacquierV. D.SwansonW. J. (2003). Positive Selection in the Egg Receptor for Abalone Sperm Lysin. Proc. Natl. Acad. Sci.100, 46394643. 10.1073/pnas.0830022100

  • 47

    GallachM.BetránE. (2011). Intralocus Sexual Conflict Resolved through Gene Duplication. Trends Ecol. Evol.26, 222228. 10.1016/j.tree.2011.02.004

  • 48

    GallachM.ChandrasekaranC.BetránE. (2010). Analyses of Nuclearly Encoded Mitochondrial Genes Suggest Gene Duplication as a Mechanism for Resolving Intralocus Sexually Antagonistic Conflict in Drosophila. Genome Biol. Evol.2, 835850. 10.1093/gbe/evq069

  • 49

    GallachM.DominguesS.BetránE. (2011). Gene Duplication and the Genome Distribution of Sex-Biased Genes. Int. J. Evol. Biol.2011, 120. 10.4061/2011/989438

  • 50

    GavriletsS.WaxmanD. (2002). Sympatric Speciation by Sexual Conflict. Proc. Natl. Acad. Sci.99, 1053310538. 10.1073/pnas.152011499

  • 51

    GavriletsS. (2014). Is Sexual Conflict an "Engine of Speciation". Cold Spring Harbor Perspect. Biol.6, a017723. 10.1101/cshperspect.a017723

  • 52

    GelbayaT. A.PotdarN.JeveY. B.NardoL. G. (2014). Definition and Epidemiology of Unexplained Infertility. Obstet. Gynecol. Surv.69(2), 109115. 10.1097/OGX.0000000000000043

  • 53

    GibbsG. M.OrtaG.ReddyT.KoppersA. J.Martinez-LopezP.Luis de la Vega-BeltranJ.et al (2011). Cysteine-rich Secretory Protein 4 Is an Inhibitor of Transient Receptor Potential M8 with a Role in Establishing Sperm Function. Proc. Natl. Acad. Sci.108, 70347039. 10.1073/pnas.1015935108

  • 54

    GoudetG.MugnierS.CallebautI.MongetP. (2008). Phylogenetic Analysis and Identification of Pseudogenes Reveal a Progressive Loss of Zona Pellucida Genes during Evolution of Vertebrates1. Biol. Reprod.78, 796806. 10.1095/biolreprod.107.064568

  • 55

    GraysonP.CivettaA. (2012). Positive Selection and the Evolution of Izumo Genes in Mammals. Int. J. Evol. Biol.2012, 17. 10.1155/2012/958164

  • 56

    GraysonP. (2015). Izumo1 and Juno: the Evolutionary Origins and Coevolution of Essential Sperm-Egg Binding Partners. R. Soc. Open Sci.2, 150296. 10.1098/rsos.150296

  • 57

    GuastiP. N.SouzaF. F.ScottC.PapaP. M.CamargoL. S.SchmithR. A.et al (2020). Equine Seminal Plasma and Sperm Membrane: Functional Proteomic Assessment. Theriogenology156, 7081. 10.1016/j.theriogenology.2020.06.014

  • 58

    HanJ.-H.BateyS.NicksonA. A.TeichmannS. A.ClarkeJ. (2007). The Folding and Evolution of Multidomain Proteins. Nat. Rev. Mol. Cel. Biol.8, 319330. 10.1038/nrm2144

  • 59

    HanL.NishimuraK.Sadat Al HosseiniH.BianchiE.WrightG. J.JovineL. (2016). Divergent Evolution of Vitamin B9 Binding Underlies Juno-Mediated Adhesion of Mammalian Gametes. Curr. Biol.26, R100R101. 10.1016/j.cub.2015.12.034

  • 60

    HartM. W. (2013). Structure and Evolution of the Sea star Egg Receptor for Sperm Bindin. Mol. Ecol.22, 21432156. 10.1111/mec.12251

  • 61

    HartgersF. C.VissersJ. L. M.LoomanM. W. G.ZoelenC. v.HuffineC.FigdorC. G.et al (2000). DC-STAMP, a Novel Multimembrane-Spanning Molecule Preferentially Expressed by Dendritic Cells. Eur. J. Immunol.30, 35853590. 10.1002/1521-4141(200012)30:12<3585:aid-immu3585>3.0.co;2-y

  • 62

    HartgersF. C.LoomanM. W. G.van der WoningB.MerkxG. F. M.FigdorC. G.AdemaG. J. (2001). Genomic Organization, Chromosomal Localization, and 5′ Upstream Region of the Human DC-STAMP Gene. Immunogenetics53, 145149. 10.1007/s002510100302

  • 63

    HerbergS.GertK. R.SchleifferA.PauliA. (2018). The Ly6/uPAR Protein Bouncer Is Necessary and Sufficient for Species-specific Fertilization. Science361, 10291033. 10.1126/science.aat7113

  • 64

    HoweK. L.AchuthanP.AllenJ.AllenJ.Alvarez-JarretaJ.AmodeM. R.et al (2021). Ensembl 2021. Nucleic Acids Res.49, D884D891. 10.1093/nar/gkaa942

  • 65

    HuS.YaoY.HuX.ZhuY. (2020). LncRNA DCST1-AS1 Downregulates miR-29b through Methylation in Glioblastoma (GBM) to Promote Cancer Cell Proliferation. Clin. Transl. Oncol.22, 22302235. 10.1007/s12094-020-02363-1

  • 66

    HughesA. L. (1994). The Evolution of Functionally Novel Proteins after Gene Duplication. Proc. R. Soc. Lond. B256, 119124. 10.1098/rspb.1994.0058

  • 67

    InnanH. (2009). Population Genetic Models of Duplicated Genes. Genetica137, 1937. 10.1007/s10709-009-9355-1

  • 68

    InoueN.IkawaM.IsotaniA.OkabeM. (2005). The Immunoglobulin Superfamily Protein Izumo Is Required for Sperm to Fuse with Eggs. Nature434, 234238. 10.1038/nature03362

  • 69

    InoueN.HagiharaY.WadaI. (2021a). Evolutionarily Conserved Sperm Factors, DCST1 and DCST2, Are Required for Gamete Fusion. Elife10, e66313. 10.7554/eLife.66313

  • 70

    InoueN.SatouhY.WadaI. (2021b). IZUMO Family Member 3, IZUMO3, Is Involved in Male Fertility through the Acrosome Formation. Mol. Reprod. Dev.88, 479481. 10.1002/mrd.23520

  • 71

    JansenB. J. H.Eleveld-TrancikovaD.SaneckaA.van Hout-KuijerM.HendriksI. A. M.LoomanM. G. W.et al (2009). OS9 Interacts with DC-STAMP and Modulates its Intracellular Localization in Response to TLR Ligation. Mol. Immunol.46, 505515. 10.1016/j.molimm.2008.06.032

  • 72

    JovineL.QiH.WilliamsZ.LitscherE.WassarmanP. M. (2002). The ZP Domain Is a Conserved Module for Polymerization of Extracellular Proteins. Nat. Cell Biol.4, 457461. 10.1038/ncb802

  • 73

    JumperJ.EvansR.PritzelA.GreenT.FigurnovM.RonnebergerO.et al (2021). Highly Accurate Protein Structure Prediction with AlphaFold. Nature596, 583589. 10.1038/s41586-021-03819-2

  • 74

    KafriR.SpringerM.PilpelY. (2009). Genetic Redundancy: New Tricks for Old Genes. Cell136, 389392. 10.1016/j.cell.2009.01.027

  • 75

    KameiN.GlabeC. G. (2003). The Species-specific Egg Receptor for Sea Urchin Sperm Adhesion Is EBR1,a Novel ADAMTS Protein. Genes Dev.17, 25022507. 10.1101/gad.1133003

  • 76

    KatohK.StandleyD. M. (2013). MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability. Mol. Biol. Evol.30, 772780. 10.1093/molbev/mst010

  • 77

    KelleherE. S.MarkowT. A. (2009). Duplication, Selection and Gene Conversion in a Drosophila mojavensis Female Reproductive Protein Family. Genetics181, 14511465. 10.1534/genetics.108.099044

  • 78

    KelleherE. S.SwansonW. J.MarkowT. A. (2007). Gene Duplication and Adaptive Evolution of Digestive Proteases in Drosophila Arizonae Female Reproductive Tracts. Plos Genet.3, e148. 10.1371/journal.pgen.0030148

  • 79

    KesslerP.MarchotP.SilvaM.ServentD. (2017). The Three-finger Toxin Fold: a Multifunctional Structural Scaffold Able to Modulate Cholinergic Functions. J. Neurochem.142, 718. 10.1111/jnc.13975

  • 80

    KillingbeckE. E.SwansonW. J. (2018). Egg Coat Proteins Across Metazoan Evolution. Curr. Top. Dev. Biol.130, 443488. 10.1016/bs.ctdb.2018.03.005

  • 81

    KiniR. M. (2002). Molecular Moulds with Multiple Missions: Functional Sites in Three-finger Toxins. Clin. Exp. Pharmacol. Physiol.29, 815822. 10.1046/j.1440-1681.2002.03725.x

  • 82

    KirschnerM. W.GerhartJ. C. (2008). The Plausibility of Life: Resolving Darwin's Dilemma. New Haven: Yale University Press. 10.12987/9780300128673

  • 83

    KondrashovF. A.RogozinI. B.WolfY. I.KooninE. V. (2002). Selection in the Evolution of Gene Duplications. Genome Biol.3, research0008. 10.1186/gb-2002-3-2-research0008

  • 84

    KozlovA. M.DarribaD.FlouriT.MorelB.StamatakisA. (2019). RAxML-NG: a Fast, Scalable and User-Friendly Tool for Maximum Likelihood Phylogenetic Inference. Bioinformatics35, 44534455. 10.1093/bioinformatics/btz305

  • 85

    KroftT. L.GleasonE. J.L'HernaultS. W. (2005). The Spe-42 Gene Is Required for Sperm-Egg Interactions during C. elegans Fertilization and Encodes a Sperm-specific Transmembrane Protein. Dev. Biol.286, 169181. 10.1016/j.ydbio.2005.07.020

  • 86

    KroghA.LarssonB.von HeijneG.SonnhammerE. L. L. (2001). Predicting Transmembrane Protein Topology with a Hidden Markov Model: Application to Complete genomes11Edited by F. Cohen. J. Mol. Biol.305, 567580. 10.1006/jmbi.2000.4315

  • 87

    KukitaT.WadaN.KukitaA.KakimotoT.SandraF.TohK.et al (2004). RANKL-induced DC-STAMP Is Essential for Osteoclastogenesis. J. Exp. Med.200, 941946. 10.1084/jem.20040518

  • 88

    Le NaourF.RubinsteinE.JasminC.PrenantM.BoucheixC. (2000). Severely Reduced Female Fertility in CD9-Deficient Mice. Science287, 319321. 10.1126/science.287.5451.319

  • 89

    LeganP. K.RauA.KeenJ. N.RichardsonG. P. (1997). The Mouse Tectorins: Modular Matrix Proteins of the Inner Ear Homologous To Components of the Sperm-Egg Adhesion System. J. Biol. Chem.272, 87918801. 10.1074/jbc.272.13.8791

  • 90

    LiangL. F.DeanJ. (1993). Conservation of Mammalian Secondary Sperm Receptor Genes Enables the Promoter of the Human Gene to Function in Mouse Oocytes. Dev. Biol.156, 399408. 10.1006/dbio.1993.1087

  • 91

    LiaoD. (1999). Concerted Evolution: Molecular Mechanism and Biological Implications. Am. J. Hum. Genet.64, 2430. 10.1086/302221

  • 92

    LinS. J.HuY.ZhuJ.WoodruffT. K.JardetzkyT. S. (2011). Structure of Betaglycan Zona Pellucida (ZP)-C Domain Provides Insights into ZP-Mediated Protein Polymerization and TGF- Binding. Proc. Natl. Acad. Sci.108, 52325236. 10.1073/pnas.1010689108

  • 93

    LitscherE. S.WassarmanP. M. (2020). Zona Pellucida Proteins, Fibrils, and Matrix. Annu. Rev. Biochem.89, 695715. 10.1146/annurev-biochem-011520-105310

  • 94

    LowB. W.PrestonH. S.SatoA.RosenL. S.SearlJ. E.RudkoA. D.et al (1976). Three Dimensional Structure of Erabutoxin B Neurotoxic Protein: Inhibitor of Acetylcholine Receptor. Proc. Natl. Acad. Sci.73, 29912994. 10.1073/pnas.73.9.2991

  • 95

    LynchM.ConeryJ. S. (2000). The Evolutionary Fate and Consequences of Duplicate Genes. Science290, 11511155. 10.1126/science.290.5494.1151

  • 96

    MalderaJ. A.Weigel MunozM.ChirinosM.BussoD.Ge RaffoF.BattistoneM. A.et al (2014). Human Fertilization: Epididymal hCRISP1 Mediates Sperm-Zona Pellucida Binding through its Interaction with ZP3. Mol. Hum. Reprod.20, 341349. 10.1093/molehr/gat092

  • 97

    MeslinC.MugnierS.CallebautI.LaurinM.PascalG.PouponA.et al (2012). Evolution of Genes Involved in Gamete Interaction: Evidence for Positive Selection, Duplications and Losses in Vertebrates. PLOS ONE7, e44548. 10.1371/journal.pone.0044548

  • 98

    MiyadoK.YamadaG.YamadaS.HasuwaH.NakamuraY.RyuF.et al (2000). Requirement of CD9 on the Egg Plasma Membrane for Fertilization. Science287, 321324. 10.1126/science.287.5451.321

  • 99

    MonnéM.HanL.SchwendT.BurendahlS.JovineL. (2008). Crystal Structure of the ZP-N Domain of ZP3 Reveals the Core Fold of Animal Egg coats. Nature456, 653657. 10.1038/nature07599

  • 100

    MooreA. D.BjörklundÅ. K.EkmanD.Bornberg-BauerE.ElofssonA. (2008). Arrangements in the Modular Evolution of Proteins. Trends Biochem. Sci.33, 444451. 10.1016/j.tibs.2008.05.008

  • 101

    NacherJ. C.HayashidaM.AkutsuT. (2010). The Role of Internal Duplication in the Evolution of Multi-Domain Proteins. Biosystems101, 127135. 10.1016/j.biosystems.2010.05.005

  • 102

    NairS.BistP.DikshitN.KrishnanM. N. (2016). Global Functional Profiling of Human Ubiquitome Identifies E3 Ubiquitin Ligase DCST1 as a Novel Negative Regulator of Type-I Interferon Signaling. Sci. Rep.6, 36179. 10.1038/srep36179

  • 103

    NavarroB.KirichokY.ChungJ. J.ClaphamD. E. (2008). Ion Channels that Control Fertility in Mammalian Spermatozoa. Int. J. Dev. Biol.52, 607613. 10.1387/ijdb.072554bn

  • 104

    NirthananS.GopalakrishnakoneP.GweeM. C. E.KhooH. E.KiniR. M. (2003). Non-conventional Toxins from Elapid Venoms. Toxicon41, 397407. 10.1016/S0041-0101(02)00388-4

  • 105

    NishimuraK.DioguardiE.NishioS.VillaA.HanL.MatsudaT.et al (2019). Molecular Basis of Egg Coat Cross-Linking Sheds Light on ZP1-Associated Female Infertility. Nat. Commun.10, 3086. 10.1038/s41467-019-10931-5

  • 106

    NomiyamaH.EgamiK.WadaN.TouK.HoriuchiM.MatsusakiH.et al (2005). Short Communication: Identification of Genes Differentially Expressed in Osteoclast-like Cells. J. Interferon Cytokine Res.25, 227231. 10.1089/jir.2005.25.227

  • 107

    OhtoU.IshidaH.KrayukhinaE.UchiyamaS.InoueN.ShimizuT. (2016). Structure of IZUMO1-JUNO Reveals Sperm-Oocyte Recognition during Mammalian Fertilization. Nature534, 566569. 10.1038/nature18596

  • 108

    PalmerC. A.WattsR. A.HouckL. D.PicardA. L.ArnoldS. J. (2007). Evolutionary Replacement of Components in a Salamander Pheromone Signaling Complex: More Evidence for Phenotypic-Molecular Decoupling. Evolution61, 202215. 10.1111/j.1558-5646.2007.00017.x

  • 109

    PalmerC. A.WattsR. A.HastingsA. P.HouckL. D.ArnoldS. J. (2010). Rapid Evolution of Plethodontid Modulating Factor, a Hypervariable Salamander Courtship Pheromone, Is Driven by Positive Selection. J. Mol. Evol.70, 427440. 10.1007/s00239-010-9342-2

  • 110

    PeiJ.KimB. H.GrishinN. V. (2008). PROMALS3D: a Tool for Multiple Protein Sequence and Structure Alignments. Nucleic Acids Res.36, 22952300. 10.1093/nar/gkn072

  • 111

    PerryG. H.DominyN. J.ClawK. G.LeeA. S.FieglerH.RedonR.et al (2007). Diet and the Evolution of Human Amylase Gene Copy Number Variation. Nat. Genet.39, 12561260. 10.1038/ng2123

  • 112

    PetronellaN.DrouinG. (2014). Purifying Selection against Gene Conversions in the Folate Receptor Genes of Primates. Genomics103, 4047. 10.1016/j.ygeno.2013.10.004

  • 113

    PontingC. P. (2008). The Functional Repertoires of Metazoan Genomes. Nat. Rev. Genet.9, 689698. 10.1038/nrg2413

  • 114

    PrimakoffP.MylesD. G. (2000). The ADAM Gene Family: Surface Proteins with Adhesion and Protease Activity. Trends Genet.16, 8387. 10.1016/S0168-9525(99)01926-5

  • 115

    RajI.Sadat Al HosseiniH.DioguardiE.NishimuraK.HanL.VillaA.et al (2017). Structural Basis of Egg Coat-Sperm Recognition at Fertilization. Cell169, 13151326.e17. 10.1016/j.cell.2017.05.033

  • 116

    Ramírez-GómezH. V.TuvalI.GuerreroA.DarszonA. (2019). Analysis of Sperm Chemotaxis. Methods Cell Biol.151, 473486. 10.1016/bs.mcb.2018.12.002

  • 117

    RankinT. L.ColemanJ. S.EpifanoO.HoodbhoyT.TurnerS. G.CastleP. E.et al (2003). Fertility and Taxon-Specific Sperm Binding Persist after Replacement of Mouse Sperm Receptors with Human Homologs. Dev. Cel.5, 3343. 10.1016/s1534-5807(03)00195-3

  • 118

    RastogiS.LiberlesD. A. (2005). Subfunctionalization of Duplicated Genes as a Transition State to Neofunctionalization. BMC Evol. Biol.5, 28. 10.1186/1471-2148-5-28

  • 119

    SchimentiJ. C. (1999). Mice and the Role of Unequal Recombination in Gene-Family Evolution. Am. J. Hum. Genet.64, 4045. 10.1086/302220

  • 120

    ShenF.RossJ. F.WangX.RatnamM. (1994). Identification of a Novel Folate Receptor, a Truncated Receptor, and Receptor Type .Beta. In Hematopoietic Cells: cDNA Cloning, Expression, Immunoreactivity, and Tissue Specificity. Biochemistry33, 12091215. 10.1021/bi00171a021

  • 121

    ShuL.SuterM. J.-F.RäsänenK. (2015). Evolution of Egg coats: Linking Molecular Biology and Ecology. Mol. Ecol.24, 40524073. 10.1111/mec.13283

  • 122

    SiuK. K.SerrãoV. H. B.ZiyyatA.LeeJ. E. (2021). The Cell Biology of Fertilization: Gamete Attachment and Fusion. J. Cell Biol.220, e202102146. 10.1083/jcb.202102146

  • 123

    SonnhammerE. L.von HeijneG.KroghA. (1998). A Hidden Markov Model for Predicting Transmembrane Helices in Protein Sequences. Proc. Int. Conf. Intell. Syst. Mol. Biol.6, 175182.

  • 124

    SpeerK. F.Allen-WallerL.NovikovD. R.BarottK. L. (2021). Molecular Mechanisms of Sperm Motility Are Conserved in an Early-Branching Metazoan. Proc. Natl. Acad. Sci. USA118, e2109993118. 10.1073/pnas.2109993118

  • 125

    SpiegelsteinO.EudyJ. D.FinnellR. H. (2000). Identification of Two Putative Novel Folate Receptor Genes in Humans and Mouse. Gene258, 117125. 10.1016/S0378-1119(00)00418-2

  • 126

    StaegeH.BrauchlinA.SchoedonG.SchaffnerA. (2001). Two Novel Genes FIND and LIND Differentially Expressed in Deactivated and Listeria -infected Human Macrophages. Immunogenetics53, 105113. 10.1007/s002510100306

  • 127

    SuttonK. A.JungnickelM. K.FlormanH. M. (2008). A Polycystin-1 Controls Postcopulatory Reproductive Selection in Mice. Proc. Natl. Acad. Sci.105, 86618666. 10.1073/pnas.0800603105

  • 128

    SwansonW. J.VacquierV. D. (2002). The Rapid Evolution of Reproductive Proteins. Nat. Rev. Genet.3, 137144. 10.1038/nrg733

  • 129

    TingC. T.TsaurS.-C.SunS.BrowneW. E.ChenY.-C.PatelN. H.et al (2004). Gene Duplication and Speciation in Drosophila: Evidence from the Odysseus Locus. Proc. Natl. Acad. Sci.101, 1223212235. 10.1073/pnas.0401975101

  • 130

    TsernoglouD.PetskoG. A. (1977). Three-dimensional Structure of Neurotoxin a from Venom of the Philippines Sea Snake. Proc. Natl. Acad. Sci.74, 971974. 10.1073/pnas.74.3.971

  • 131

    TsetlinV. (1999). Snake Venom Alpha-Neurotoxins and Other 'three-finger' Proteins. Eur. J. Biochem.264, 281286. 10.1046/j.1432-1327.1999.00623.x

  • 132

    VacquierV. D.SwansonW. J.LeeY.-H. (1997). Positive Darwinian Selection on Two Homologous Fertilization Proteins: what Is the Selective Pressure Driving Their Divergence. J. Mol. Evol.44, S15S22. 10.1007/PL00000049

  • 133

    VacquierV. D. (1998). Evolution of Gamete Recognition Proteins. Science281, 19951998. 10.1126/science.281.5385.1995

  • 134

    VogelC.TeichmannS. A.Pereira-LealJ. (2005). The Relationship between Domain Duplication and Recombination. J. Mol. Biol.346, 355365. 10.1016/j.jmb.2004.11.050

  • 135

    WagnerG. P.AltenbergL. (1996). Perspective: Complex Adaptations and the Evolution of Evolvability. Evolution50, 967976. 10.1111/j.1558-5646.1996.tb02339.x

  • 136

    WagnerG. P.PavlicevM.CheverudJ. M. (2007). The Road to Modularity. Nat. Rev. Genet.8, 921931. 10.1038/nrg2267

  • 137

    WalshB. (2003). Population-Genetic Models of the Fates of Duplicate Genes. Genetica118, 279294. 10.1007/978-94-010-0229-5_16

  • 138

    WangJ.LeiC.ShiP.TengH.LuL.GuoH.et al (2021). LncRNA DCST1-AS1 Promotes Endometrial Cancer Progression by Modulating the MiR-665/HOXB5 and MiR-873-5p/CADM1 Pathways. Front. Oncol.11, 3112. 10.3389/fonc.2021.714652

  • 139

    WeinerJ.3rdBeaussartF.Bornberg-BauerE. (2006). Domain Deletions and Substitutions in the Modular Protein Evolution. FEBS J.273, 20372047. 10.1111/j.1742-4658.2006.05220.x

  • 140

    West-EberhardM. J. (2005). Developmental Plasticity and the Origin of Species Differences. Proc. Natl. Acad. Sci.102, 65436549. 10.1073/pnas.0501844102

  • 141

    WilburnD. B.SwansonW. J. (2016). From Molecules to Mating: Rapid Evolution and Biochemical Studies of Reproductive Proteins. J. Proteomics135, 1225. 10.1016/j.jprot.2015.06.007

  • 142

    WilburnD. B.SwansonW. J. (2017). The “ZP Domain” Is Not One, but Likely Two Independent Domains. Mol. Reprod. Dev.84, 284285. 10.1002/mrd.22781

  • 143

    WilburnD. B.BowenK. E.GreggR. G.CaiJ.FeldhoffP. W.HouckL. D.et al (2012). Proteomic and UTR Analyses of a Rapidly Evolving Hypervariable Family of Vertebrate Pheromones. Evolution66, 22272239. 10.1111/j.1558-5646.2011.01572.x

  • 144

    WilburnD. B.BowenK. E.DotyK. A.ArumugamS.LaneA. N.FeldhoffP. W.et al (2014). Structural Insights into the Evolution of a Sexy Protein: Novel Topology and Restricted Backbone Flexibility in a Hypervariable Pheromone from the Red-Legged Salamander, Plethodon shermani. PLOS ONE9, e96975. 10.1371/journal.pone.0096975

  • 145

    WilburnD. B.ArnoldS. J.HouckL. D.FeldhoffP. W.FeldhoffR. C. (2017). Gene Duplication, Co-option, Structural Evolution, and Phenotypic Tango in the Courtship Pheromones of Plethodontid Salamanders. Herpetologica73, 206219. 10.1655/Herpetologica-D-16-00082.1

  • 146

    WilburnD. B.KunkelC. L.FeldhoffR. C.FeldhoffP. W.SearleB. C. (2022). Recurrent Co-option and Recombination of Cytokine and Three finger Proteins in Multiple Reproductive Tissues throughout Salamander Evolution. bioRxiv. 10.1101/2022.01.04.475003

  • 147

    WilsonK. L.FitchK. R.BafusB. T.WakimotoB. T. (2006). Sperm Plasma Membrane Breakdown during Drosophila Fertilization Requires Sneaky, an Acrosomal Membrane Protein. Development133, 48714879. 10.1242/dev.02671

  • 148

    WilsonL. D.ObakpolorO. A.JonesA. M.RichieA. L.MieczkowskiB. D.FallG. T.et al (2018). The Caenorhabditis elegans Spe‐49 Gene Is Required for Fertilization and Encodes a Sperm‐specific Transmembrane Protein Homologous to SPE‐42. Mol. Reprod. Dev.85, 563578. 10.1002/mrd.22992

  • 149

    WrightG. J.BianchiE. (2016). The Challenges Involved in Elucidating the Molecular Basis of Sperm-Egg Recognition in Mammals and Approaches to Overcome Them. Cell Tissue Res.363, 227235. 10.1007/s00441-015-2243-3

  • 150

    YagiM.MiyamotoT.SawataniY.IwamotoK.HosoganeN.FujitaN.et al (2005). DC-STAMP Is Essential for Cell-Cell Fusion in Osteoclasts and Foreign Body Giant Cells. J. Exp. Med.202, 345351. 10.1084/jem.20050645

Summary

Keywords

gene duplication, fertilization, subfunctionalization, neofunctionalization, sperm, egg, reproduction

Citation

Rivera AM and Swanson WJ (2022) The Importance of Gene Duplication and Domain Repeat Expansion for the Function and Evolution of Fertilization Proteins. Front. Cell Dev. Biol. 10:827454. doi: 10.3389/fcell.2022.827454

Received

02 December 2021

Accepted

12 January 2022

Published

27 January 2022

Volume

10 - 2022

Edited by

Enrica Bianchi, University of York, United Kingdom

Reviewed by

Shunsuke Nishio, Karolinska Institutet (KI), Sweden

Esther Betran, University of Texas at Arlington, United States

Updates

Copyright

*Correspondence: Alberto M. Rivera, ,

This article was submitted to Molecular and Cellular Reproduction, a section of the journal Frontiers in Cell and Developmental Biology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics