Original Research ARTICLE
Symbiote transmission and maintenance of extra-genomic associations
- Department of Ecology and Evolutionary Biology, University of Tennessee, Knoxville, TN, USA
Symbiotes can be transmitted from parents to offspring or horizontally from unrelated hosts or the environment. A key question is whether symbiote transmission is similar enough to Mendelian gene transmission to generate and maintain coevolutionary associations between host and symbiote genes. Recent papers come to opposite conclusions, with some suggesting that any horizontal transmission eliminates genetic association. These studies are hard to compare owing to arbitrary differences in modeling approach, parameter values, and assumptions about selection. I show that associations between host and symbiote genes (extra-genomic associations) can be described by the same dynamic model as conventional linkage disequilibria between genes in the same genome. Thus, covariance between host and symbiote genomes depends on population history, geographic structure, selection, and co-transmission rate, just as covariance between genes within a genome. The conclusion that horizontal transmission rapidly erodes extra-genomic associations is equivalent to the conclusion that recombination rapidly erodes associations between genes within a genome. The conclusion is correct in the absence of population structure or selection. However, population structure can maintain spatial associations between host and symbiote traits, and non-additive selection (interspecific epistasis) can generate covariances between host and symbiote genotypes. These results can also be applied to cultural or other non-genetic traits. This work contributes to a growing consensus that genomic, symbiotic, and gene-culture evolution can be analyzed under a common theoretical framework. In terms of coevolutionary potential, symbiotes can be viewed as lying on a continuum between the intimacy of genes and the indifference of casually co-occurring species.
The view that organism phenotypes can be described in terms of a dichotomy between inherited genes and non-inherited environmental factors has been an enormously useful simplification in the development of quantitative genetics and evolutionary theory (Lynch and Walsh, 1998; Futuyma, 2009). However, several fields now present important opportunities to understand the prevalence and importance of additional influences. For example, studies of gene-culture coevolution (Feldman and Zhivotovsky, 1992; Henrich et al., 2008) and symbiosis (Bright and Bulgheresi, 2010; Gilbert et al., 2010) have emphasized the roles of factors with a mixture of horizontal and vertical transmission on development and evolution. Formation of intimate symbioses has contributed to major transitions in eukaryotic evolution and community ecology (Maynard Smith and Szathmary, 1995; Selosse et al., 2006), while gene-culture coevolution can promote speciation in learning animals (Vallin and Qvarnstrom, 2011) and is associated with the emergence of Homo sapiens as a global ecosystem engineer (Maynard Smith and Szathmary, 1995; Vitousek et al., 1997).
The degree to which symbiotic associations and cultural traits are passed vertically from parent to offspring vs. horizontally between unrelated individuals can be critical in determining the evolutionary trajectories of traits affecting cooperation, resource use, and the functional integration of systems (symbiotic or social) more inclusive than the individual organism. Vertical transmission is thought to promote partner fidelity feedbacks between genes (Frank, 1994). That is, when host and symbiote genes maintain a statistical association across generations, the evolution of stable mutualism is likely (Frank, 1994; Doebeli and Knowlton, 1998; Fletcher and Doebeli, 2009; Wyatt et al., 2013). When vertical transmission is perfect, the evolutionary dynamics converge on the dynamics of coevolution between genes within the same genome.
Several studies have sought to extend the tools and concepts of population genetics to analyze host-symbiote and gene-culture coevolution. For example efforts have been very successful in advancing understanding and manipulation of incompatibility-inducing endosymbiotes (Turelli, 1994; Hoffmann et al., 2011). Analyses of gene-culture coevolution have been illuminating but controversial when applied to human culture (Cavalli-Sforza and Feldman, 1981; Boyd and Richerson, 1985; Richerson and Boyd, 2005; Ackland et al., 2007; Claidiere and Andre, 2012; Houkes, 2012). Even genetic models of speciation can be extended to include interactions between host and symbiote genomes (Brucker and Bordenstein, 2012, 2013). Important technical advances include the recognition that gene-culture covariance and interspecific disequilibrium (covariance between host and symbiote genes) resemble conventional linkage disequilibrium and cytonuclear disequilibrium (Feldman and Cavalli-Sforza, 1984; Sanchez et al., 2000). Further, the notion of “generalized epistasis” (Feldman and Cavalli-Sforza, 1984; Feldman and Zhivotovsky, 1992) emphasizes the potential for non-additive interactions between genotypes and cultural traits to determine the direction of evolution.
These results have inspired a generalized concept of “non-genetic inheritance” to include symbiotes, cultural traits, and potentially other non-genetic or epigenetic traits with some possibility of both vertical and horizontal transmission (Bonduriansky and Day, 2009). Proponents of this expanded view of inheritance have generally concluded that there are important similarities between models of non-genetic inheritance and classic genetic models, and that extra-genomic traits can be significant factors in evolution (Feldman and Zhivotovsky, 1992; Jablonka and Lamb, 1998; Bonduriansky and Day, 2009; Day and Bonduriansky, 2011; Odling-Smee et al., 2013). However, attempts to extend classical population and quantitative genetics theory to accommodate non-genetic inheritance have been relatively complex, making it difficult to establish general principles (Feldman and Zhivotovsky, 1992; Santure and Spencer, 2006; Tal et al., 2010; Johannes and Colomé-Tatché, 2011; Bonduriansky, 2012). Moreover, the general importance of non-genetic traits in evolution remains debated (Jablonka and Lamb, 1998; Haig, 2007; Dickins and Dickins, 2008; Bonduriansky, 2012).
The opposing conclusions from recent theoretical studies most likely arise from differing premises, assumptions, and parameter spaces. Early work on gene-culture coevolution adapted classical population genetic models to assumptions about cultural transmission, and tended to emphasize effects of selection on cultural traits (Feldman and Cavalli-Sforza, 1984; Boyd and Richerson, 1985; Feldman and Cavalli-Sforza, 1989; Feldman and Zhivotovsky, 1992). Studies of genomic imprinting have generally used a quantitative genetic framework emphasizing phenotype distributions in families (Santure and Spencer, 2006; Tal et al., 2010; Johannes and Colomé-Tatché, 2011). Day and Bonduriansky (2011) used the Price equation to develop expressions for phenotypic change owing to selection, genetic inheritance, and non-genetic inheritance. Their approach appears to be quite general, but as a trade-off for generality, it does not immediately reveal the importance of any particular mechanism of non-genetic inheritance, nor any particular measure of coevolutionary association. In contrast, Brandvain et al. (2011) made an explicit population genetic model focused on how maternal transmission of a symbiote affects interspecific covariance (disequilibrium) in a panmictic population with no selection.
Under those conditions, Brandvain et al. (2011) showed that genetic covariances (interspecific disequilibria) between neutral organelle and symbiote alleles decay rapidly with even a little horizontal transmission in a panmictic population. They concluded that imperfect vertical transmission would leave negligible statistical signature in molecular marker data and that the potential for interactions between host and symbiote genes to affect evolution would be limited. Qualitatively, the same can be said for conventional nuclear genes or cytoplasmic and nuclear genomes; linkage disequilibria (covariances) decay rapidly in panmictic populations (Lewontin, 1974; Hartl and Clark, 1997). However, conditions favoring persistent covariance between nuclear genes or between cytoplasmic and nuclear genes are common in nature (Arnold, 1993; Hewitt, 2001; Zapata et al., 2001; Laurie et al., 2007). Moreover epistasis is widely acknowledged as an important factor in genome evolution and speciation (Wolf et al., 2000; Coyne and Orr, 2004; Petkov et al., 2005; Weinreich et al., 2005; Muir and Moyle, 2009; Rohlfs et al., 2010), and Drown et al. (2013) showed conditions under which interspecific epistasis would favor an evolutionary transition from horizontal to vertical transmission in a host-symbiote system.
The apparently unresolved question is whether associations between host and symbiote genes have fundamentally different dynamics from associations between genes in the same genome (as implied by Brandvain et al., 2011), or whether fundamental dynamic similarities trump a superficial distinction between genetic and non-genetic inheritance (Day and Bonduriansky, 2011). Here, I attempt to address this question by deriving a few very simple relationships, and then illustrating some of their implications with simulations of a few biologically interesting scenarios. My conclusion might not be surprising to theoretical population geneticists; extra-genomic traits like symbiotes can be studied by extending the tools of classical population genetics (Cavalli-Sforza and Feldman, 1981; Boyd and Richerson, 1985; Sanchez et al., 2000; Day and Bonduriansky, 2011). However, my analysis might be helpful for microbial ecologists. I shed some new light on the early work on associations (disequilibria) between classical alleles and non-genetic traits (Feldman and Cavalli-Sforza, 1984) and contradict the inferred conclusions (though not the mathematical results) of Brandvain et al. (2011). My results help illustrate the value of a unified view of inheritance and coevolution (Day and Bonduriansky, 2011), with symbiosis falling on a continuum of intimacy including genes, organelles, free-living species, and even environmental factors (Lewontin, 1983; Keeling and Archibald, 2008; Odling-Smee et al., 2013).
Brandvain et al. (2011) made two claims against the evolutionary relevance of associations between host organelle and symbiote alleles (interspecific disequilibrium). First, high rates of vertical transmission of symbiotes do not result in high levels of genetic association. Second (consequently), host-symbiote interactions will not respond to natural selection on non-additive fitness effects (interspecific epistasis). The basis of these claims is a model showing rapid decay of covariances between organelle and symbiote genomes in a single randomly mating population with no selection. Therefore, I first evaluate the veracity of their result using a simpler, more traditional population genetics model.
The model follows earlier work generalizing the traditional measure of association between genes in the same genome (linkage disequilibrium) to extra-genomic associations (Feldman and Cavalli-Sforza, 1984; Feldman and Zhivotovsky, 1992), but I provide a new result, a closed-form solution for the evolutionary dynamics of extra-genomic associations. To put a finer point on the question of whether extra-genomic associations can be informative about the transmission process, I explicitly compare the dynamics of symbiote-organelle covariance with the dynamics of symbiote-nuclear covariance. To my knowledge, this has not been done previously.
Second, I evaluate the effects of population structure on patterns of extra-genomic covariance. Population structure is well known to promote within-genome covariance (linkage disequilibrium), but the impact of subdivision and gene flow on the dynamics of extra-genomic traits has rarely been investigated (Ackland et al., 2007). I derive new analytical results for equilibrium covariance in a simple admixture model (Asmussen and Arnold, 1991) and then show numerical results for a more complicated hybrid zone model modified from Dakin (2006).
Finally, I test whether epistatic interactions between host and symbiote genes can affect how a population responds to selection. I illustrate the earlier results of Feldman and Cavalli-Sforza (1989), which demonstrate an evolutionary response to selection on epistatic effects between a gene and a cultural trait. I show conditions under which host-symbiote genetic covariance is built up by selection despite the tendency for horizontal transmission to reduce covariance. I also evaluate effects of hitchhiking between host and symbiote genes in a single population and across a hybrid zone.
3.1. Covariance Between Symbiote and Organelle Genomes
To derive the expected covariance between organelle (cytoplasmic) and symbiote genomes, consider a system where each genome has two types (haplotypes or haplotype groups): C and c for the cytoplasmic organelle, and S and s for the symbiote. If xij is the frequency of individuals with cytotype i and symbiote j, and pi and pj are the marginal frequencies of cytotype i and symbiote j, the covariance (cyto-symbiote disequilibrium, D) is
Equation (1) is the covariance between two binary random variables, and can be applied to phenotypes, genes, symbiotic states or any pair of binary categorical variables (Clark, 1984; Feldman and Cavalli-Sforza, 1984). Statistical association or disequilibrium can also be expressed as a correlation . If the probability of vertical (maternal) transmission of the symbiote is v, the set of frequencies is expected to change across generations according to the transition matrix in Table 1. Therefore, the expected cyto-symbiote covariance in the next generation is
Substitute pC = xCS+xCs
And the general solution is Dt = D0vt. This novel result is directly comparable to the classic result for covariance between nuclear genes Dt = D0ct, where c is the probability of cosegregation (1−recombination rate) (Lewontin, 1974).
If the symbiosis is not obligate and exclusive, i.e., if a host can have zero symbiotes or more than one symbiote genotype, the result is similar. Let xCS be the frequency of individuals with cytotype C and symbiote genotype S, regardless of what other symbiote genotypes are present. Then x′CS = xCSv(1 − hS) + pChS, where hS is the probability of gaining a symbiote with genotype S by horizontal transfer (from the environment or another host). Let D = xCS − pCpS, where pS is the fraction of hosts with symbiote S (rather than the fraction of symbiotes with genotype S). In this case
That is, when the ecological relationship between host and symbiote is facultative, the effective recombination rate is 1 − v(1 − hs), which is greater than the obligate case (1 − v).
3.2. Covariance Between Symbiote and Nuclear Genes
For the covariance between a symbiote genotype and an allele at a nuclear locus (with alleles A and a), the nuclear-symbiote covariance is D = xAAS + xAaS − pApS (Asmussen et al., 1987). The relevant host-symbiote genotype frequencies in the next generation are given by
The nuclear-symbiote covariance in the next generation D′ = x′AAS + x′AaS − pApS simplifies to
And the general solution is . A pertinent special case is perfect maternal transmission (v = 1), which eliminates the distinction between symbiote and organelle. Covariance between a maternally transmitted organelle and a nuclear gene (cyto-nuclear disequilibrium) decays by one half each generation , just like covariance between unlinked nuclear genes (Asmussen et al., 1987).
Even though all covariances decay rapidly in a panmictic population (Lewontin, 1974; Sanchez et al., 2000; Brandvain et al., 2011), cyto-symbiote covariance (Equation 2) decays much more slowly (by a factor of 2) than nuclear-symbiote covariance (Equation 4) whenever there is non-zero maternal transmission (Figure 1). The key difference between host-symbiote transmission and conventional gene or organelle transmission is that completely free recombination corresponds to a cosegregation probability of , whereas completely horizontal transmission corresponds to a vertical transmission probability of zero. Thus, although conventional linkage disequilibrium between unlinked loci is expected to decay no faster than by each generation, host symbiote covariance can drop to zero in one generation if there is no tendency for parent-offspring transmission. Aside from this quantitative difference, the population genetic principles describing the relationship between nuclear and cytoplasmic genomes (Asmussen et al., 1987; Arnold et al., 1988; Arnold, 1993) can be extended to the relationship between hosts and symbiotes. This insight underscores a fundamental continuity between nuclear genes, organelles, and symbiotes.
Figure 1. Decay of cyto-symbiote covariance (solid line, Equation 2) and nuclear-symbiote covariance (dashed line, Equation 4) for vertical transmission probability v = 0.90.
3.3. Host-Symbiote Covariance in Structured Populations
3.3.1. Continent-island structure
Admixture or immigration between populations with divergent allele frequencies is a common cause of within-genome association (conventional linkage disequilibrium) in nature (Conner and Hartl, 2004). To assess the effect of this kind of population structure on host-symbiote genetic associations, first consider a simple continent-island admixture model (Asmussen and Arnold, 1991) in which a focal population receives a proportion m of immigrants from two or more divergent source populations (Figure 2). Let the frequency of immigrants with cytoplasmic allele C and symbiote S be πCS. After immigration, the covariance between cytoplasmic and symbiote genotypes (cyto-symbiote covariance DCS) is
where Dm is the covariance among immigrants. The averages (pC and pS) and covariance of the allele frequencies are over immigrant and resident born sets with weights m and 1 − m. Exactly the same dynamics can be derived for conventional nuclear or cyto-nuclear linkage disequilibria (Asmussen and Arnold, 1991). If we assume the source populations are themselves unchanged by gene flow, so that Dm and the allele frequencies are constant, this difference equation has a non-zero equilibrium at
Figure 2. Intermixture model used to illustrate how covariance between a cytoplasmic genotype C and symbiote genotype S can be maintained by gene flow (Equations 5–7). Relative contribution of source population 1 is represented by f. xCS(i) is the frequency of individuals with cytoplasmic genotype C and symbiote genotype S in Source i, and πCS is the frequency of that genotypic combination in the immigrant pool. x′CS is the frequency of that genotypic combination in native-born residents in the intermixed population, and m is the immigration rate.
Likewise for nuclear-symbiote covariance,
Setting v = 1 in Equation (7) recovers Asmussen and Arnold's (1991) solution for cyto-nuclear disequilibrium. Thus, interspecific genetic covariances can be maintained by immigration. Moreover, ; the expected cyto-symbiote covariance is greater than the expected nuclear-symbiote covariance at equilibrium between immigration and recombination.
3.3.2. Stepping-stone structure
To investigate the effects of more complicated population structure on cyto-symbiote and nuclear-symbiote covariances, I added a symbiote to the stepping stone model analyzed thoroughly for cyto-nuclear covariance by Dakin (2006). The basic framework is a line of demes between two source populations. Dispersal occurs only between adjacent demes and mating is random within demes (Kimura and Weiss, 1964; Goodisman et al., 1998; Dakin, 2006). This framework can be modified to include some other classic models as special cases, including the continent-island (above) and two-population intermixture models (Nei and Li, 1973; Asmussen and Arnold, 1991).
First, I determined how to incorporate a symbiote with vertical transmission probability v into Dakin's (2006) deterministic recursion equations (Appendix Equation 9). I then iterated the system of equations to illustrate transient and equilibrium patterns. I followed Dakin (2006) by initializing ten stepping stones, the first five (nearest source 1) fixed for A, C, and S, and the second five (nearest source 2) fixed for a, c, and s. This simulates secondary contact. Each time step began with symmetrical dispersal between neighboring demes, followed by random mating within demes. In one set of simulations, I followed Dakin (2006) in assuming infinite source populations unaffected by gene flow (remaining fixed for their respective alleles for all time). In a second set, I allowed gene flow into the source populations, assuming they received half the number of immigrants (because they have only one rather than two neighbors). I report the covariances among loci in the adults (after dispersal, before mating); covariances in the offspring are lower in magnitude (by factors of v, , and ) but the differences between cyto-symbiote, nuclear-symbiote, and cyto-nuclear associations are greater.
Dynamics of the stepping stone model over the first 100 generations of contact are illustrated in Figure 3. Over this time frame, the difference between the infinite source and finite source models was negligible. With moderate immigration (m = 0.10 shown) the central demes (at the initial contact front) show transient covariances many times higher than their equilibrium values, as shown previously for cyto-nuclear covariances (Asmussen and Arnold, 1991). Most important, with high but imperfect vertical transmission (v = 0.90 shown), the cyto-symbiote association is greater and shows larger changes in time and space than the nuclear-symbiote or cyto-nuclear associations.
Figure 3. Dynamics of covariances over the first 100 generations of secondary contact with immigration rate m = 0.10 and vertical transmission probability v = 0.90. Only the middle six demes are shown; demes 1, 2, 9, and 10 have essentially flat lines at this scale.
Allele and genotype frequencies were very close to their asymptotic values within a few hundred generations. For the finite source model, all covariances approached zero, as expected (Asmussen and Arnold, 1991). Asymptotic results for the infinite source model (after 1000 generations) presumed to reflect equilibrium are summarized in Figure 4. Cyto-nuclear results for the allelic and genotypic covariances agree with those of Dakin (2006). Cyto-symbiote associations are many times higher, as expected from equations 2 and 4 for a high probability of vertical transmission (v = 0.90 shown).
Figure 4. Asymptotic covariances after 1000 generations of secondary contact with immigration rate m = 0.10 and vertical transmission probability v = 0.90. Circles show averages of 100 replicates of the stochastic version of the model (1000 individuals per deme) and lines show the deterministic (no drift) expectations. Cyto-symbiote associations (black circles, solid line) are several times greater than nuclear-symbiote (open circles, dashed line) and cyto-nuclear (gray) associations. The correlation (r2) is affected more by drift.
To check the effects of genetic drift on these predictions, I also modeled the system with finite populations. At each generation in each deme, I sampled N = 1000 adults from the genotype distribution in focal deme with probability 1 − m and from each neighboring deme with probability m/2. Then I sampled N = 1000 offspring given the adult genotype frequencies and transmission probabilities. Results for this stochastic version were similar to the deterministic results for asymptotic covariances (Figure 4). Drift tends to increase genetic associations, but this is somewhat masked by its effects on allele frequencies. The correlation is more sensitive than the covariance (Figure 4) because r accounts for local allele frequencies.
3.4. Generalized Epistasis and Selection: Cattle as Symbiotes
Feldman and Cavalli-Sforza (1984) pointed out that the covariance between genes and a culturally transmitted trait could be described by a disequilibrium (D or r) in exactly the same way as the standard covariance between genes, and Sanchez et al. (2000) observed that gene-culture covariance and host-symbiote covariance were formally analogous to cytonuclear disequilibrium (all being covariances between binary random variables). My analysis shows that this similarity is deeper than the summary statistics; the evolutionary associations in host-symbiote and gene-culture systems follow the same kind of dynamics as Mendelian genetic systems. It follows that evolutionary relationships between host and symbiote genes should respond to epistatic selection (contra Brandvain et al., 2011). That is, the notion of epistasis can be generalized to include interactions between traits with different inheritance mechanisms (Feldman and Zhivotovsky, 1992), just as the notion of coevolution can be extended to include interactions between genes in the same genome (Lovell and Robertson, 2010).
To underscore this fundamental correspondence, consider the coevolutionary model of human lactose absorption and dairy farming investigated by Feldman and Cavalli-Sforza (1989). Their model included an autosomal (diploid) gene affecting lactase activity, and milk use as a binary cultural trait. This is a model of interspecific “generalized epistasis” because the fitness effect of a genotype depends on the presence or absence of the interaction between humans and cattle (Feldman and Cavalli-Sforza, 1989; Feldman and Zhivotovsky, 1992). The change in frequency of the absorption allele depended strongly on the degree of vertical transmission of the cultural trait (the probability that children of milk users would become milk users). The model applies equally well if we characterize dairy farming as a symbiotic association between humans and cattle. If cattle or farms tended to be passed from parent to offspring, the gene-culture covariance describing an association between human genotype and behavior (milk use) is exactly the same as host-symbiote covariance describing an association between human genotype and symbiote (cattle) presence.
I implemented the Feldman and Cavalli-Sforza (FCS) model as follows (see the online appendix and Feldman and Cavalli-Sforza, 1989 for mathematical and coding details). The first step is random mating with imperfect mother-offspring transmission of the symbiote (U vs. u for present or absent) and Mendelian transmission of an autosomal diploid locus determining whether a host benefits or suffers a net fitness cost from the symbiosis (Table 2). In the second step, following Feldman and Cavalli-Sforza (1989), offspring have a chance of picking up the symbiote via horizontal transmission. That is, each individual without the symbiote has probability fpU of becoming infected (pU is the frequency of infection/symbiosis among other hosts in the population). The third step is selection according to Table 2, and the resulting genotypes mate at random to produce the next generation.
Table 2. Fitness of host genotypes participating in the symbiosis (U) or not (u) in the Feldman and Cavalli-Sforza (FCS) model.
To illustrate that selection can operate on interspecific epistatic effects despite imperfect vertical transmission, I present numerical results for a range of vertical transmission rates given selection s1 = 0.05 and s2 = 0.15, and horizontal transmission rate f = 0.5. These values are among those used by Feldman and Cavalli-Sforza (1989) in their discussion of the coevolution of lactase and dairying, and I chose them by trial and error for the simple purpose of providing a counterexample to the conjecture that imperfect transmission will make interspecific epistatic effects unresponsive to selection (Brandvain et al., 2011).
Perfect vertical transmission is not required. Rather, epistatic fitness effects can drive coevolution between host and symbiote genes if the vertical transmission rate is above some threshold value determined by the strength and mode of selection and the probability of horizontal transmission (Figure 5). In the specific example depicted (Figure 5), there is a strong deleterious effect of the symbiosis (or cultural trait) on the “wild-type” genotype (s2 = 0.2) relative to its benefit for the mutant (s1 = 0.05), that is, the additive effect of the symbiote is negative when the A host allele is rare. Nonetheless, the symbiosis can spread (along with the A allele) if vertical transmission is strong enough to maintain an association between the symbiote and the host allele that makes it advantageous (v > 0.76 in this case, with f = 0.5).
Figure 5. Coevolution of a “host” gene and presence or absence of a cultural/symbiotic trait in the Feldman and Cavalli-Sforza (FCS) model for a single population. Panel (A) shows prevalence of hosts (e.g., humans) with the symbiote (e.g., dairy cattle) over time for six different vertical transmission rates (see legend in B). Blue and black lines approach equilibrium values between 0.5 and 1.0 (Note logarithmic y-axis). (B) shows concurrent changes in frequency of the allele affecting fitness of hosts with and without the symbiotic relationship (e.g., a lactose absorption allele). (C) Shows the correlation (interspecific disequilibrium) between the host allele and symbiote presence.
Feldman and Cavalli-Sforza (1989) allowed transmission (v and f) to vary with genotype. The resulting dynamics differ only quantitatively from the simplified version used here. Day and Bonduriansky (2011) analyzed a version of the FCS model including self-learning. Individuals could (with some probability) choose the cultural practice that best suited their genotype (analogous to partner choice in symbiosis). They solved for a threshold strength of selection above which milk use increases when rare. This threshold was lowered by self-learning. High rates of vertical transmission lowered the threshold when self-learning was weak, but increased it when self-learning was strong.
3.5. Hitchhiking Effects
To evaluate the potential for selection on extra-genomic traits to drive changes in organelle genotype frequencies via hitchhiking, I evaluated the correlated response of organelle genotype frequencies to selection in the basic model, the FCS model, and the hybrid zone model. The effect of an advantageous, maternally transmitted symbiote on a neutral cytoplasmic marker can be found for the basic model (Equation 2 and Table 1) by adding a fitness advantage of 1 + s for individuals with symbiote S. Then the frequency of the cytoplasmic allele C changes as
where is mean fitness.
To evaluate hitchhiking effects of a non-obligate symbiote, I added a maternally inherited cytoplasmic marker to the FCS model. I numerically iterated the FCS model to illustrate hitchhiking effects for three kinds of conditions.
First, for comparison to the standard case of a neutral marker linked to an advantageous mutation (Maynard Smith and Haigh, 1974; Barton, 2000), I used a single panmictic population with a uniformly advantageous symbiote in the FCS framework (s1 = −s2 = s). The symbiote was initially in complete disequilibrium with a cytoplasmic marker. I.e., there were two kinds of individuals: CU and cu. Each CU produced CU offspring with probability v and Cu offspring with probability 1 − v. Cu and cu offspring were converted to CU and cU with probability fpU, where pU was the frequency of individuals with the symbiote at that time. Individuals with the symbiote had relative fitness 1 + s. For comparison to the standard Maynard-Smith and Haigh (MSH) model, I calculated the expected final frequency of the cytoplasmic marker (pC,∞) in terms of initial frequencies of C and U and selection (s) from Barton (2000) Equation (1): pC,∞ = pC,0 + (1 − pC,0)pr/sU,0, with the approximate effective recombination rate r = 1 − v (1 − f) from Equation (3). This MSH expectation is not perfectly identical to the cyto-symbiote hitchhiking model (there are differences in ploidy, initial conditions, and the “effective recombination rate” is only an approximation), but the magnitude of hitchhiking should be comparable if the claim is true that the FCS model produces evolutionary dynamics for host-symbiote gene pairs that are comparable to dynamics for conventional genes.
As expected based on standard population genetics and Equations (2, 3, and 8), the hitchhiking effect of a spreading symbiote depends on the strength of selection, the co-transmission rate, and the initial conditions (Table 3). The examples in Figure 6 illustrate the similarity between cyto-symbiote hitchhiking (or extra-genomic hitchhiking in general) and the hitchhiking effect of one nuclear locus on another (Maynard Smith and Haigh, 1974; Barton, 2000). Graphs for other parameter combinations are available as Appendix A.
Table 3. Cytoplasmic marker frequencies after 100 generations of cyto-symbiote hitchhiking in the FCS model for various vertical transmission rates (v) and fitness effects.
Figure 6. Cyto-symbiote hitchhiking effect over 100 generations with a universally advantageous symbiote in a panmictic population. Panel (A) shows dynamics of symbiote prevalence and cytoplasmic marker frequency for an initially rare symbiote (bottom pair of lines) and for 1:1 admixture (upper pair) under the basic (CS) model with selection s = 0.15 and vertical transmission v = 0.9025. (B) Shows final cytoplasmic marker frequencies (after 100 generations) as a function of the initial frequency (of both cytoplasmic marker and symbiote) for the basic CS model in comparison to the Maynard Smith and Haig (MSH) approximation for a pair of nuclear loci with s = 0.15 and r = 1 − v = 0.0975. (C,D) Show analogous results from the Feldman and Cavalli-Sforza (FCS) model with a universally advantageous symbiote (s1 = 0.15, s2 = −0.15), vertical transmission probability v = 0.95 and horizontal transmission probability f = 0.05. Here I used a recombination rate 1 − v(1 − f) = 0.0975 for the MSH model, making the two sets of results as comparable as possible.
Second, to evaluate the dynamics of secondary contact with a universally advantageous symbiote, I used the stepping-stone model described above with inheritance and selection according to the FCS model after dispersal. I used 100 stepping stones and iterated the deterministic difference equations for 100 generations after secondary contact for several combinations of transmission rates (v and f) and selection strengths (s1 = −s2 = s).
Finally, I repeated the simulations of secondary contact with selection according to the FCS model with sign epistasis (Weinreich et al., 2005). That is the effect of the symbiote is positive or negative according the host genotype, as in the original formulation (Feldman and Cavalli-Sforza, 1989). For the examples depicted, I varied the strength of selection but kept the ratio constant (s2 = s1/2).
Cyto-symbiote hitchhiking can cause significant differential introgression of cytoplasmic DNA (Figure 7, Table 3). The spatial cline for the allele frequency of a neutral nuclear marker is unaffected by the spread of a symbiote across a hybrid zone, but there is a significant spatial displacement of the cline for a neutral cytoplasmic marker owing to the slow decay of cyto-symbiote covariance. As it does for intragenomic hitchhiking (Barton, 2000), spatial structure reduces the overall hitchhiking effect (Table 3) by slowing the response to selection and allowing more time for recombination. Introgression is generally slower with epistasis (Table 3), in part because the overall selective advantage of the symbiote is reduced when s2 > −s1 (Table 3), and in part because the advantage of the symbiote depends on co-introgression of the epistatic nuclear allele, producing a steeper, more slowly advancing wave front (Figure 7).
Figure 7. Cyto-symbiote hitchhiking effect after 100 generations of the Feldman and Cavalli-Sforza (FCS) model in a secondary contact zone with 100 demes in a stepping stone pattern (only the central 20 demes are shown). Selection favoring the symbiote was s1 = 0.15, vertical transmission probability v = 0.95 and horizontal transmission probability f = 0.05. Panel (A) shows results for a universally advantageous symbiote with no epistasis (s2 = −s1). (B) Shows results for sign epistasis with s2 = s1.
The simple models presented here extend previous theoretical work on nuclear, cyto-nuclear, and extra-genomic covariances (Nei and Li, 1973; Asmussen and Arnold, 1991; Feldman and Zhivotovsky, 1992; Goodisman et al., 1998; Sanchez et al., 2000; Dakin, 2006; Brandvain et al., 2011). My results illustrate a very important conceptual and mathematical correspondence between the dynamics of associations between genes in different genomes and associations between genes in the same genome. Although there have been doubts about the stability of coevolutionary associations between host and symbiote genes with less than 100% vertical transmission (Brandvain et al., 2011), such concerns are no more or less valid for extra-genomic associations than for associations between genes within a genome. This inference can be extracted from the general frameworks of dual-inheritance theory (Cavalli-Sforza and Feldman, 1981; Feldman and Cavalli-Sforza, 1984; Boyd and Richerson, 1985; Feldman and Zhivotovsky, 1992) or non-genetic inheritance (Bonduriansky and Day, 2009; Day and Bonduriansky, 2011). My analysis uses more specific models to put a finer point on the idea that coevolution between genomes and within genomes share fundamental similarities.
The study of covariance between genes in symbiotic species is important because assortment between mutually beneficial genotypes is a key to the evolutionary origin and stability of mutualism (Frank, 1994; Fletcher and Doebeli, 2009; Wyatt et al., 2013). Vertical transmission does not guarantee the evolution of mutualism; persistent conflicts are sometimes evident within genomes or between host and organelle or endosymbiote genomes (Burt and Trivers, 2006). However, cross-generation covariance between host and symbiote genes can strongly affect the direction of evolution.
My analysis shows that extra-genomic covariances follow the same dynamics as conventional linkage disequilibria. In situations where conventional linkage disequilibria tend to persist (e.g., structured populations), extra-genomic covariances can also be expected if there is some level of vertical transmission. Therefore, extra-genomic covariances might often be informative about rates of vertical vs. horizontal transmission. In addition, the potential for non-additive fitness effects to generate significant evolutionary change at the level of symbiotic or cultural systems can be similar to the potential for epistasis to affect change at the level of conventional genomes.
4.1. Similarities and Differences Between Symbiote Transmission, Cultural Transmission, and Gene Transmission
The derivations of Equations (2–7) expose a striking dynamic similarity of within-genome and between-genome covariances. Disequilibria (covariances) are expected to decay as exponential functions of time. The consequence in a large, panmictic population is that we expect covariances to become negligible after a few generations (Lewontin, 1974; Sanchez et al., 2000; Brandvain et al., 2011). However, even in such an idealized population, the rate of decay depends on the probability of co-transmission. Therefore, maternally transmitted symbiotes will tend to have higher covariance with maternally transmitted genes (e.g., cytoplasmic genomes) than with nuclear genes (Figure 1).
Two differences between intra-genomic and extra-genomic associations can modify the exponential decay pattern. First, zero linkage corresponds to a cotransmission probability of 0.5 for a pair genes within a genome, whereas zero vertical transmission corresponds to a cotransmission probability of 0.0 for host and symbiote genes. This means the decay of extra-genomic covariance can be instantaneous (when v = 0). Second, if the symbiotic relationship is not obligate, some offspring can be without the symbiote (i.e., they neither inherited it from a parent nor picked it up from someone else or the environment). This effectively increases the decay of extra-genomic covariance and causes a systematic tendency for the symbiosis to be lost in the absence of selection or an environmental source (Feldman and Cavalli-Sforza, 1989; Lipsitch et al., 1995; Sanchez et al., 2000).
4.2. Using Extra-Genomic Covariance to Make Inferences About Vertical Transmission
Given the rapid decay of covariances in panmictic populations, even for physically linked genes, lack of evident covariances cannot be interpreted as evidence against vertical transmission (Lewontin, 1974; Sanchez et al., 2000; Brandvain et al., 2011). However, when present, covariances can be informative. My analysis shows how even imperfect maternal transmission of symbiotes generates greater cyto-symbiote association than cytonuclear or nuclear-symbiote association (Figure 1). This result suggests that extra-genomic covariance can be used to make inferences about rates of vertical transmission in structured populations. Cytonuclear covariance and covariance between nuclear genes with known linkage relationships can be used as benchmarks for the detectability of vertical transmission of symbiotes. In particular, maternally inherited symbiotes are expected to have greater covariance with maternally inherited genes than with biparentally inherited genes (by a factor of 2t in the simple case of neutral decay in a panmictic population). Maternal transmission of bacteria might be common in animals (Funkhouser and Bordenstein, 2013). Nuclear and cytonuclear covariances are frequently observed in nature, particularly in zones of admixture or hybridization (Arnold, 1993; Hewitt, 2001; Zapata et al., 2001; Laurie et al., 2007). Interspecific covariances should be common in those same situations for symbiotes with substantial vertical transmission.
Recently, Thierry et al. (2011) documented extra-genomic covariance between whitefly genotypes and prevalence of several endosymbiotic bacteria in a zone of recent contact between native and introduced whiteflies. These endosymbiotes are maternally transmitted and showed strong correspondence between mtDNA and symbiotypes in hybrids (Thierry et al., 2011). However, the symbiotes have varying degrees of horizontal transmission and only one of three is obligate (Ahmed et al., 2013). Despite imperfect vertical transmission, host mtDNA and symbiote genotypes covary in the admixture zone (Thierry et al., 2011) and at a regional scale (Ahmed et al., 2013). These observations from nature indicate that it will be feasible to test for patterns of extra-genomic covariance consistent with maternal transmission in many wild systems. In the whitefly system, it will be interesting to compare patterns of nuclear gene flow with those of mtDNA, primary endosymbiotes, and secondary endosymbiotes.
4.3. Implications for Symbiosis and Extra-Genomic Coevolution
Selection on non-genetic traits such as symbioses or learned behaviors can produce hitchhiking effects and extra-genomic covariances even when vertical transmission is less than perfect. Even non-additive interactions (generalized epistasis) between genes in different genomes can affect the direction of evolution. This has been shown before, particularly in models of gene-culture evolution (Feldman and Cavalli-Sforza, 1989; Ackland et al., 2007; Day and Bonduriansky, 2011), but the continuity of coevolution within genomes, between genomes, and even between genes and culture is underappreciated (Brandvain et al., 2011).
All forms of coevolution depend on consistent associations between partners to produce reciprocal responses to selection, but intimate symbioses are often thought of as special. Recent enthusiasm over the evolutionary ecology of human-microbe interactions has raised broad questions about whether plants and animals should be conceptualized as “holobionts” or “meta-organisms”, including hosts and associated microbial communities as units or levels of organization in evolution and ecology (Rosenberg et al., 2007; Doolittle and Zhaxybayeva, 2010). The answer depends, in part, on whether coevolutionary dynamics between genes in different genomes (host and microbe) are similar to dynamics between genes in the same genome. The decay of associations between genomes in a panmictic population with imperfect vertical transmission might be interpreted as leaving little opportunity for non-additive interactions between genomes to drive evolution (Brandvain et al., 2011). However, my analysis and several previous studies illustrate how associations between genomes are affected by the same factors and follow the same kinds of dynamics as associations between genes within a genome. Population structure can generate and maintain associations that might affect local coevolution (Doebeli and Knowlton, 1998; Thompson, 2005; Wyatt et al., 2013) and generalized epistasis can generate associations and determine the outcome of coevolution (Feldman and Cavalli-Sforza, 1984; Feldman and Zhivotovsky, 1992; Drown et al., 2013). Interspecific epistasis can even contribute to host speciation (Brucker and Bordenstein, 2012), for example by impacting mating behavior (Sharon et al., 2010) or hybrid viability (Brucker and Bordenstein, 2013). It does not necessarily follow that host-symbiote systems should be conceptualized as “meta-organisms,” but the theoretical continuity between coevolution of genes within genomes and between genomes is encouraging for further synthesis between evolutionary genetics and evolutionary ecology.
The sole author (Benjamin M. Fitzpatrick) is responsible for all aspects of this article.
University of Tennessee.
Conflict of Interest Statement
The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
I am grateful to Y. Brandvain, T. Day, and S. Gavrilets for helpful comments and discussion.
The Online Appendices B–E: the R codes for this article are available in Data Sheet 1 can be found online at: http://www.frontiersin.org/journal/10.3389/fmicb.2014.00046/abstract
Ackland, G. J., Signitzer, M., Stratford, K., and Cohen, M. H. (2007). Cultural hitchhiking on the wave of advance of beneficial technologies. Proc. Natl. Acad. Sci. U.S.A. 104, 8714–8719. doi: 10.1073/pnas.0702469104
Ahmed, M. Z., De Barro, P. J., Ren, S.-X., Greeff, J. M., and Qiu, B.-L. (2013). Evidence for horizontal transmission of secondary endosymbionts in the Bemisia tabaci cryptic species complex. PLoS ONE 8:e53084. doi: 10.1371/journal.pone.0053084
Arnold, J., Asmussen, M. A., and Avise, J. C. (1988). An epistatic mating system model can produce permanent cytonuclear disequilibria in a hybrid zone. Proc. Natl. Acad. Sci. U.S.A. 85, 1893–1896. doi: 10.1073/pnas.85.6.1893
Brandvain, Y., Goodnight, C. J., and Wade, M. J. (2011). Horizontal transmission rapidly erodes disequilibria between organelle and symbiont genomes. Genetics 189, 397–404. doi: 10.1534/genetics.111.130906
Dickins, T. E., and Dickins, B. J. A. (2008). Mother nature's tolerant ways: why non-genetic inheritance has nothing to do with evolution. New Ideas Psychol. 26, 41–54. doi: 10.1016/j.newideapsych.2007.03.004
Feldman, M. W., and Cavalli-Sforza, L. L. (1989). “On the theory of evolution under genetic and cultural transmission with application to the lactose absorption problem,” in Mathematical Evolutionary Theory, eds M. W. Feldman (Princeton: Princeton University Press), 145–173.
Gilbert, S. F., McDonald, E., Boyle, N., Buttino, N., Gyi, L., Mai, M., et al. (2010). Symbiosis as a source of selectable epigenetic variation: taking the heat for the big guy. Philos. Trans. R. Soc. Lond. Ser. B 365, 671–678. doi: 10.1098/rstb.2009.0245
Goodisman, M. A. D., Shoemaker, D. D., and Asmussen, M. A. (1998). Cytonuclear theory for haplodiploid species and x-linked genes. II. Stepping-stone models of gene flow and application to a fire ant hybrid zone. Evolution 52, 1423–1440. doi: 10.2307/2411312
Hoffmann, A. A., Montgomery, B. L., Popovici, J., Iturbe-Ormaetxe, I., Johnson, P. H., Muzzi, F., et al. (2011). Successful establishment of wolbachia in aedes populations to suppress dengue transmission. Nature 476, 454–457. doi: 10.1038/nature10356
Laurie, C. C., Nickerson, D. A., Anderson, A. D., Weir, B. S., Livingston, R. J., Dean, M. D., et al. (2007). Linkage disequilibrium in wild mice. PLoS Genetics 3:e144 doi: 10.1371/journal.pgen.0030144
Lipsitch, M., Nowak, M. A., Ebert, D., and May, R. M. (1995). The population dynamics of vertically and horizontally transmitted parasites. Proc. R. Soc. Lond. B 260, 321–327. doi: 10.1098/rspb.1995.0099
Petkov, P. M., Graber, J. H., Churchill, G. A., DiPetrillo, K., King, B. L., and Paigen, K. (2005). Evidence of a large-scale functional organization of mammalian chromosomes. PLoS Genetics 1:e33. doi: 10.1371/journal.pgen.0010033
Rohlfs, R. V., Swanson, W. J., and Weir, B. S. (2010). Detecting coevolution through allelic association between physically unlinked loci. Am. J. Hum. Genet. 86, 674–685. doi: 10.1016/j.ajhg.2010.03.001
Rosenberg, E., Koren, O., Reshef, L., Efrony, R., and Zilber-Rosenberg, I. (2007). The role of microorganisms in coral health, disease, and evolution. Nat. Rev. Microbiol. 5, 355–362. doi: 10.1038/nrmicro1635
Sharon, G., Segal, D., Ringo, J. M., Hefetz, A., Zilber-Rosenberg, I., and Rosenberg, E. (2010). Commensal bacteria play a role in mating preference of drosophila melanogaster. Proc. Natl. Acad. Sci. U.S.A. 107, 20051–20056. doi: 10.1073/pnas.1009906107
Thierry, M., Becker, N., Hajri, A., Reynaud, B., Lett, J.-M., and Delatte, H. (2011). Symbiont diversity and non-random hybridization among indigenous (ms) and invasive (b) biotypes of bemisia tabaci. Mol. Ecol. 20, 2172–2187. doi: 10.1111/j.1365-294X.2011.05087.x
Appendix A: Expanded Description of Methods for Simulating Extra-Genomic Trait Transmission
Host-symbiont covariance in structured populations
To investigate the effects of population structure on cyto-symbiont and nuclear-symbiont disequilibria, I added a symbiont to the stepping stone model analyzed for cyto-nuclear disequilibria by Dakin (2006). Within each deme, random mating and inheritance of the nuclear locus A, maternally transmitted cytoplasmic locus C and imperfectly transmitted symbiont S follows the set of deterministic recursions given in Equation (9).
The expected frequencies after dispersal are given by the weighted averages of adjacent demes such that a fraction m of each deme is composed of immigrants coming in equal proportions from the two adjacent demes (Kimura and Weiss, 1964; Goodisman et al., 1998; Dakin, 2006). I followed Dakin (2006) by initializing ten stepping stones, the first five (nearest source 1) fixed for A, C, and S, and the second five (nearest source 2) fixed for a, c, and s. This simulates secondary contact. Each time step began with symmetrical dispersal between neighboring demes, followed by random mating within demes. In one set of simulations, I followed Dakin (2006) in assuming infinite source populations unaffected by gene flow (remaining fixed for their respective alleles for all time). In a second set, I allowed gene flow into the source populations, assuming they received half the number of immigrants (because they have only one rather than two neighbors). I report the disequilibria among loci in the adults (after dispersal, before mating); disequilibria in the offspring are lower in magnitude (by factors of v, , and ) but the differences between cyto-symbiont, nuclear-symbiont, and cyto-nuclear associations are greater.
I iterated the system of equations to illustrate transient and equilibrium patterns using the R code in Online Appendix B. To check the effects of genetic drift on these predictions, I also modeled a system of finite populations. At each generation in each deme, I sampled N adults from the focal deme with probability 1 − m and from each neighboring deme with probability m/2. Then I sampled N offspring from the frequency distribution described by Equation (9) given the adult genotype frequencies.
Epistasis and gene-culture coevolution: cattle as symbionts
I implemented the Feldman and Cavalli-Sforza (FCS) model as follows (Feldman and Cavalli-Sforza, 1989). The first step is mating and mother-offspring transmission of the symbiote with probability v (U vs. u represent presence or absence of the symbiote), such that the offspring gene-culture distribution is
where pA and pa are the host allele frequencies. In the second step, following Feldman and Cavalli-Sforza (1989), offspring have a chance of picking up the symbiote via horizontal transmission. That is, each individual with trait u has probability fpU of being converted to trait U (pU is the frequency of the symbiote U among other hosts in the population). The third step is selection according to
R code implementing the FCS model is provided in Online Appendix C.
R code for illustrating hitchhiking effects is provided in Online Appendices D and E. These give examples of commands for iterating the FCS model in a series of stepping stones after secondary contact and keeping track of allele and symbiont frequencies.
Keywords: vertical transmission, horizontal transmission, symbiosis, gene-culture, coevolution, dual-inheritance theory, interspecific disequilibrium, metagenome
Citation: Fitzpatrick BM (2014) Symbiote transmission and maintenance of extra-genomic associations. Front. Microbiol. 5:46. doi: 10.3389/fmicb.2014.00046
Received: 18 December 2013; Paper pending published: 20 January 2014;
Accepted: 21 January 2014; Published online: 24 February 2014.
Edited by:Monica Medina, Pennsylvania State University, USA
Reviewed by:Carl J. Yeoman, Montana State University, USA
Martin Zimmer, Universität Salzburg, Austria
Copyright © 2014 Fitzpatrick. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Benjamin M. Fitzpatrick, Department of Ecology and Evolutionary Biology, University of Tennessee, 569 Dabney, Knoxville, TN 37996, USA e-mail: firstname.lastname@example.org