Population Social Structure Facilitates Indirect Fitness Benefits from Extra-Pair Mating

Despite decades of research, empirical support for the ‘compatible genes’ and ‘good genes’ hypotheses as explanations for adaptive female extra-pair mating remains discordant. One largely un-tested theoretical prediction that could explain equivocal findings is that mating for compatible genes benefits should reduce selection for good genes. However, this prediction does not consider demographic parameters, such as social structuring, that can indirectly influence extra-pair paternity (EPP) outcomes. Drawing on evidence from a previous study, we re-evaluate this hypothesis whilst considering social structuring in a population of tui, Prosthemadera novaeseelandiae, a socially monogamous passerine. Previous research has found possible evidence for mating for good genes because male ornament size correlates with EPP success in this population. Here, we test whether non-random inbreeding of social-pairs indirectly provides the opportunity to gain compatibility benefits from EPP, and thus whether selection for compatible genes and good genes can operate simultaneously. We found that 1) social mates were more closely related than expected through random mating, 2) extra-pair males were less closely related to females than were the females’ within-pair mates, and 3) genetically dissimilar males sired offspring with faster growth rates. These results demonstrate that females gain compatible genes benefits from EPP. However, contrary to the compatible genes hypothesis, females in highly-related pairs did not engage more frequently in extra-pair mating. Together with previous research investigating female mate choice in this population, our findings suggest that social pairings between relatives provide a pathway for females to gain compatible genes benefits whilst engaging in extra-pair mating for good genes. This study provides evidence that some fitness benefits from EPP arise automatically through non-random social mating rather than through direct selection on extra-pair mate choice. We argue that when variance in compatibility benefits is an outcome of population structuring, compatible genes benefits need not be gained at the expense of good genes benefits.


INTRODUCTION
The adaptive function of female multiple mating in socially monogamous species has been variously debated over the past few decades (Westneat et al., 1990;Birkhead and Møller, 1992;Westneat and Stewart, 2003), however so far no obvious conclusion has been reached (Wan et al., 2013). Proposed hypotheses include non-adaptive mechanisms, for example pleiotropic gene effects such as the outcome of selection on males (Forstmeier et al., 2011), or convenience polyandry where females submit to extra-pair copulations to avoid harassment (Forstmeier et al., 2014). Of the adaptive hypotheses proposed to explain female extra-pair mating behavior, the "good genes" and "compatible genes" hypotheses in which females gain indirect fitness benefits have been the most extensively examined (reviewed in Griffith et al., 2002;Neff and Pitcher, 2005;Akçay and Roughgarden, 2007;Griffith and Immler, 2009;Prokop et al., 2012).
At the genic level, the effects of good genes and compatible genes differ in their functional mechanism. Under selection for good genes, a male's genetic quality is additive and conveyed through genetic covariance with phenotypic indicator traits (Zahavi, 1975;Hamilton and Zuk, 1982;Kirkpatrick and Ryan, 1991;Andersson, 1994;Kirkpatrick, 1996;Kokko et al., 2002Kokko et al., , 2003. Therefore, a trait indicating good genes benefits can be used as proxy for genetic quality, and hence for the heritable fitness benefits (in the form of either increased viability or attractiveness) that are conferred to offspring by the mate choice decisions of all individuals regardless of their own genotype. Under the good genes hypothesis, females are therefore predicted to allocate paternity toward males with extreme forms of the trait. In contrast, under the compatible genes hypothesis, not all females stand to derive the same benefits from mating with a particular male (Neff and Pitcher, 2005;Puurtinen et al., 2009). Instead, it is the compatibility of male and female genotypes that act, through either heterozygote overdominance or avoidance of the deleterious effects of inbreeding, to produce genetically superior offspring Zeh, 1996, 1997;Jennions, 1997;Tregenza and Wedell, 2000). Inbreeding depression is a consequence of intragenomic conflict caused by increased shared genetic similarity by common descent and lowered functional genome-wide heterozygosity (Coulson et al., 1998). These genetic consequences have the potential to influence fitness through their detrimental effects on offspring birth weights, growth, and survival (Keller and Waller, 2002). Thus, the compatible genes hypothesis predicts that paternity should be biased toward genetically dissimilar males.
Theory predicts that good genes and compatible genes benefits are not mutually exclusive and therefore that females should be selected to exploit both types of benefits in mate choice (Colegrave et al., 2002;Neff and Pitcher, 2005;Puurtinen et al., 2005Puurtinen et al., , 2009). Yet few empirical studies have demonstrated simultaneous optimization of both types of benefit (but see Foerster et al., 2003). As a possible explanation, Colegrave et al. (2002) hypothesized that when selection for both good genes and compatible genes operates in a population, unless there are large numbers of males of equally high additive genetic quality available to all females, selection for compatibility will weaken directional sexual selection for good genes. It has therefore been argued that females cannot simultaneously select the most ornamented and most genetically compatible male, and that female optimization of both good genes and compatible genes benefits can only operate via plasticity in mate choice in relation to the relative fitness advantages of each strategy (e.g., Roberts and Gosling, 2003;Oh and Badyaev, 2006;Mays et al., 2008). This antagonistic mechanism of selection could explain the so far equivocal support for the good genes and compatible genes hypotheses (reviewed in Griffith et al., 2002;Mays and Hill, 2004;Akçay and Roughgarden, 2007;Mays et al., 2008;Forstmeier et al., 2014).
Because extra-pair paternity (EPP) is purported to be a mechanism for socially monogamous females to circumvent the negative impacts of social inbreeding (Birkhead and Møller, 1998), compatible genes theory predicts that females should allocate paternity to genetically dissimilar extra-pair males (EPMs; Pusey and Wolf, 1996;Keller and Waller, 2002;Wheelwright et al., 2006;Kempenaers, 2007). Although the compatible genes hypothesis is emerging as an important component of EPP through its influence on offspring viability (Blomqvist et al., 2002;Masters et al., 2003;Richardson et al., 2005;Dowling and Mulder, 2006;Freeman-Gallant et al., 2006;Cohas et al., 2007;Rubenstein, 2007;Brouwer et al., 2011;Arct et al., 2015) the exact selective mechanism through which it acts remains unclear (reviewed in Griffith and Immler, 2009). Potential mechanisms can arise through pre-or post-copulatory (i.e., cryptic female choice; Eberhard, 1991;Colegrave et al., 2002;Birkhead and Pizzari, 2002) mate choice, or through sperm selection in which the sperm of more compatible males are more likely to fertilize an egg (Wilson et al., 1997;Zeh and Zeh, 1997;Birkhead and Møller, 1998;Tregenza and Wedell, 2000). These processes may operate disparately in different animal species. For example, while sophisticated olfactory mechanisms present in mammals, as well as visual and acoustic learning in birds with cooperative breeding systems, commonly facilitate the kin recognition mechanism necessary for female choice based on genetic compatibility (e.g., Pusey and Wolf, 1996;Mateo, 2003;Boulet et al., 2009;Leclaire et al., 2013), the role of preor post-copulatory female choice for genetic compatibility in socially monogamous birds remains unconvincing because there is scarce empirical support for the recognition of unfamiliar kin mechanism required for such precise copulatory control (Jamieson et al., 2009; but see Arct et al., 2010).
However, patterns in EPP can also be indirectly affected by demographic parameters such as population social structure; although to date this has been rarely considered in the EPP literature (Reid et al., 2015). Arguably the most common example of population structuring occurs when individuals mate non-randomly with respect to relatedness. In several socially monogamous species, studies have shown that individuals pair with relatives more frequently than expected through random mating (e.g., Foerster et al., 2006;Szulkin et al., 2009;Brekke et al., 2012). If related social-pairs then engage in extra-pair mating, compatible genes benefits could be gained automatically without specific mate choice for more genetically compatible partners, simply because of the high likelihood that the EPM would be less-related to the female than the within-pair male (WPM).
Recently, Reid et al. (2015) demonstrated that female song sparrows, Melospiza melodia, paired to related social males can lower offspring coefficients of inbreeding simply by engaging in random extra-pair mating. This finding suggests that social structuring by itself has the potential to influence the fitness outcomes of EPP without the need for pre-or post-copulatory kin recognition mechanisms. However, Reid et al. (2015) did not include any sexually selected traits in their analyses. This information is important to consider because social structuring could also have potential consequences for the predicted opposing interaction of selection for good genes and compatible genes. For example, in a non-randomly inbreeding population, females engaging in EPP for good genes benefits also stand to gain compatibility benefits as an automatic outcome of extrapair mating. Under these circumstances, antagonistic selection between good genes and compatible genes effects would be minimized. Therefore, social structuring could provide a pathway through which the benefits of compatible genes and good genes could operate simultaneously. We test this hypothesis by investigating population social structure and its effect on the fitness outcomes of EPP in a naturally occurring population of the New Zealand tui, Prosthemadera novaeseelandiae.
Tui are socially monogamous but have a very high rate of EPP (57% of all offspring in 72% of all broods), and therefore the potential exists for females to gain significant indirect benefits from extra-pair mating (Wells et al., 2015). A previous study on a population of this honeyeater (Meliphagidae), revealed that both within-pair and EPP success are positively related to the size of a male's white throat feather ornament, suggesting mating preference for good genes (Wells et al., 2015). However, mate choice for genetic compatibility could also be an important adaptation to avoid inbreeding in this species because natal philopatry is common in both sexes and is exacerbated by competition for breeding territories near nectar resources (Stewart, 1980;Bergquist, 1985;Stewart and Craig, 1985).
In this study, we use offspring growth rates to quantify potential fitness benefits for genetic compatibility gained from extra-pair mating in this same population of tui, and examine how social structure can influence these potential benefits by increasing the occurrence of non-random social mating. To investigate population social structure, we first determine whether social-pairs are more genetically related than expected through random mating. We then test whether within-pair and extra-pair genetic relatedness influences male paternity success in addition to male ornamentation. If EPP functions as an inbreeding avoidance mechanism, females paired to related social partners are predicted to engage in extra-pair copulations and EPP should be biased toward genetically dissimilar males. Finally we test for increased offspring performance as a predicted fitness outcome of mating with genetically dissimilar EPMs (Zeh and Zeh, 1996;Puurtinen et al., 2009). Because growth rates are an important component of offspring performance (Lindstrom, 1999) that are associated with non-additive genetic quality (Keller and Waller, 2002;Kempenaers, 2007;Pitcher and Neff, 2007;Dziminski et al., 2008;Rosivall et al., 2009), we specifically examine the effect of genetic relatedness of biological parents on offspring growth rates, and compare within-pair and extrapair maternal half-siblings to determine whether genetically dissimilar males provide compatibility benefits to females.

Field Methods
This study was conducted on a naturally breeding population of tui at Tawharanui Regional Park (36 • 22 ′ S, 174 • 50 ′ E), on the North Island of New Zealand. The large and open nature of this population (∼1,000 pairs; Wells, unpublished data) means that we were unable to sample all individuals in the population, and thus poses a potential issue with regards to missing data on pairwise relatedness and identification of potential extra-pair sires. To mitigate this problem, we focussed our sampling effort on two main areas of the park (Ecology Bush and Jones Bay) separated by 2.5 km. We captured 390 adult tui using mist-nets between the years 2009 and 2012. Birds were color-banded for individual identification and blood samples were taken for DNA analysis. During the breeding seasons (Oct-Feb) of 2009-2012 we searched for nests belonging to banded pairs. Social pairbonds were assessed by observation of territorial defense around a nest, mate guarding, song matching between pairs, and nestling provisioning. During this period we also caught as many unbanded neighboring males as possible to enable identification of potential EPMs. Blood samples were collected from 152 nestlings from 53 broods; 11 broods in 2009, 23 in 2010, 15 in 2011, and four in 2012.
In 2010 and 2011 twenty broods (N = 56 nestlings) with known hatching dates were followed through to fledging in order to track nestling growth. Nestlings were weighed at three stages of the nestling phase; day 0-1, 4-6, and 9-11 (Figure 1). For eight nestlings from three nests, measurements from days 15 to 16 were also available and were included in the analysis. Sampling was conducted at the same time of day to reduce diurnal variation in nestling mass. Nestlings were ringed with elastic cotton bands on the first measurement to allow future identification. On the second measurement they were provisionally banded with a single plastic color band until they could be permanently colorbanded on the third measurement. Blood samples for paternity analysis were taken during the second measurement. This is study was approved and carried out in accordance with the recommendations of the Auckland Council, the New Zealand Department of Conservation (Research permit #37088_FAU) and the Massey University Animal Ethics Committee. The protocol was approved by the Massey University Animal Ethics Committee (permit #MUAEC09-39).

Paternity Analysis and Genetic Relatedness
Fourteen microsatellites were developed for the tui (Wells et al., 2013) and twelve of these were used to investigate paternity using the program CERVUS v.3.0.3 see Wells et al., 2015 for complete details of paternity analyses). Withinpair males (WPMs) were excluded as being the genetic father if they (a) mismatched at one or more loci within the motheroffspring dyad that could not be attributed to the presence of a low frequency null allele or (b) mismatched at three or more loci where null alleles could be present, but were unlikely due to their low frequencies, and if also (c) the WPM did not receive a positive LOD score (multilocus log-likelihood ratio score). Extra-pair males (EPMs) were assigned to a nestling only if they mismatched at zero loci within the mother-offspring dyad.
In order to calculate pairwise genetic relatedness estimates, the most appropriate estimator for the microsatellite dataset was chosen using COANCESTRY v.1.0.1.1 (Wang, 2011). This is because different marker-based relatedness estimators may perform better on different sets of loci due to variation between population composition and allele frequencies (van de Casteele et al., 2001). COANCESTRY estimates the performance of seven different relatedness estimators conditional on the degree of correlation of each estimator to simulated relatedness values based on allelic information from markers from the real dataset. TrioML, a maximum likelihood estimator (Wang, 2007) displayed the highest correlation to the true values (r = 0.96). Pairwise TrioML values for the Tawharanui population were subsequently calculated in COANCESTRY and used as an estimate of genetic relatedness in all further analyses. TrioML uses the genotype of a third individual as a reference in estimating the pairwise genetic relatedness between the two focal individuals, reducing the chance that genes identical in state are mistaken for genes that are identical by descent (IBD). This estimator calculates the probabilities of the IBD coefficients between two individuals X and Y which are then used to estimate relatedness according to the equation r XY = 2θ XY where θ XY is the coancestry coefficient defined by the IBD coefficients probabilities (Wang, 2007).
Pairwise TrioML values displayed a non-normal distribution with an overrepresentation of zero values ( Figure S1). We therefore used a randomization procedure to investigate whether social pairings occur at random with respect to genetic relatedness. Two-tailed significance was approximated using 10,000 Monte Carlo resamplings. For each permutation the mean expected genetic relatedness was calculated by randomly selecting one male per female. This was repeated 10,000 times to give an approximated null distribution of relatedness values. The observed mean relatedness of social pairs was compared with this distribution and the null hypothesis of random mating was rejected if the observed mean relatedness was below or above the 2.5th and 97.5th percentiles of the null distribution respectively. Because the two areas of the park do not differ in their mean pairwise relatedness of either social pairs (randomization test: p = 0.66) or random male-female pairs within an area (randomization test: p = 0.15), the randomization procedure was performed for both areas together. However, only pairwise genetic relatedness values between females and their potential mates (i.e., males from the same area) were included in the analysis. We conducted this analysis for all years combined because (1) there is a high level of site philopatry across years in this population meaning that the pool of available males was temporally stable, (2) there is no significant difference in relatedness of social pairs between years (randomization test: p = 0.75, Figure S2), and (3) we had limited data on social pairings in some years. Because randomization tests assume independence between observations, if we had data on more than one pairing of a female (N = 21), we included data from only the first social pairing of that female in the analysis. To test the effect of eliminating these nests, and also to test whether a potential deviation from random mating occurs in only some of the observed years, we verified our results with all pairings using mixed-effects model analysis which can account for the non-independence of observations (Pinheiro and Bates, 2000). We examined the probability of male-female pairs from the same area being a social pair using a generalized linear mixed-effects model (GLMM) with a logit link function and binomial error distribution. We input as the binary response variable whether a pair was a social pair or not. Year and the pairwise genetic relatedness of the pair were included as fixed effects, and female identity as a random effect to control for the non-independence of observations. There is likely to have been some minor changes in individual composition of the population between years due to the migration of a small proportion of the breeding population. However, because tui exhibit significant natal and breeding site philopatry (Stewart, 1980;Bergquist, 1985) this was expected to affect only a small proportion of the sampled subpopulations. Therefore, for our analysis we considered that all males from the same area were available for each year.

Effect of Genetic Relatedness on Paternity Success
A previous analysis of this population found that paternity success is correlated to male ornamentation and body size (Wells et al., 2015). Here, we examine whether paternity success is also related to the genetic relatedness of the parents. GLMMs with a binomial error distribution and logit link function were used to test two measures of within-pair paternity success; the probability (as a binary response) of EPP occurring in a nest, and the proportion of extra-pair young (EPY) in a nest with brood size as the binomial denominator. To evaluate if WPMs that are closely related to the female are more likely to be cuckolded we included a predictor specifying the TrioML genetic relatedness value of the social pair. A random intercept for female identity was included to account for the non-independence of multiple broods from the same female (Pinheiro and Bates, 2000). Year was also included as a random effect to control for any annual variation in EPP ( Figure S3).
To investigate whether EPP is biased toward EPMs that are genetically dissimilar to the female, we compared the genetic relatedness of females to their corresponding WPMs and EPMs in broods where both partners could be identified (N = 28). To account for some males being EPMs for more than one female and some broods having more than one EPM, a GLMM with a binomial error distribution and logit link function was used (Stiratelli et al., 1984;Anderson and Aitkin, 1985). The probability of a male being an EPM was modeled as the binary response variable (WPM = 0, EPM = 1), and the TrioML genetic relatedness value of the pair was included as a predictor. Female identity and year were included as random effects.

Effect of Parental Genetic Relatedness on Offspring Growth
To test the hypothesis that compatible genes benefits are gained from mate choice, we investigated offspring performance as a function of parental genetic compatibility. Specifically, we examined whether parental genetic relatedness predicted the pre-fledging offspring growth rates in mass. Nestling mass and tarsus length (i.e., skeletal size) are highly correlated in this population (Pearson's r = 0.98). Therefore, we only provide tests of offspring growth rates in mass. We chose mass rather than skeletal size as a measure of offspring growth because skeletal size is often heritable in birds (e.g., Alatalo and Lundberg, 1986), and therefore we posited that offspring skeletal size might simply be a function of parental size, rather than a genetic compatibility fitness proxy due to the relatedness of the parents. Only offspring for which we knew the identity of the true genetic father were included in the analysis. We also a priori only included offspring which survived to fledging (N = 38). Only fledged offspring were used because offspring that died before fledging includes nestlings with severely reduced growth rates (see Figure S4) that would have died due to causes such as differential nestling provisioning rates by parents when resources are short (i.e., brood reduction; Gebhardt-Henrich and Richner, 1998; e.g., Lack, 1947;Moreno-Rueda et al., 2007;Shizuka and Lyon, 2013), or differential male mortality in sexually size dimorphic species (e.g., Røskaft and Slagsvold, 1985). These factors are expected to have much more severe effects on growth rates (and hence on survival) than the subtle effects of parental relatedness. Thus, by including only fledglings in the analysis we expected that we would be more likely to detect any potential effect of relatedness on performance, free from additional confounding effects on survival. However, to verify the robustness of our results, we also performed all analyses investigating offspring growth on the full dataset of all offspring (N = 46), but also included a factor variable specifying whether or not an offspring survived to fledging (see Supplementary Material). This model specifically tests the effect of relatedness on all offspring whilst accounting for the reduced growth rates of non-fledged offspring.
To obtain a mean growth rate for each offspring we first used a linear mixed-effects model (LMM) to model offspring mass as a function of age (Table S1). To account for significant differences in mass due to sex we fitted sex-specific growth rates by including fixed effects for offspring age, offspring sex, and their interaction. We suppressed a random intercept for each nestling due to the low variation in hatching size, and over-parameterization identified by convergence errors when a random intercept was specified. We included random slopes for female identity, and for each nestling identity nested within female identity. These account for both multiple unbalanced repeated measures from each nestling, as well as non-independence of nestlings from the same female (Pinheiro and Bates, 2000). From this model, we extracted an individual nestling's residual mean growth rate from the model's random slope coefficients. These values were then used as the response variable in the LMM investigating the effect of parental relatedness on offspring growth rates. We included the genetic parents' relatedness values as a predictor and specified random intercepts for female identity and year.

Comparison of Within-Pair and Extra-Pair Young
If females gain genetic benefits from extra-pair mating, EPY should be fitter than within-pair young (WPY) in mixed paternity broods. To test this, we examined whether EPY possessed faster mass growth rates than their WPY maternal half-siblings. We fitted a LMM with offspring residual mean mass growth rates as the response variable and specified a factor variable indicating the nestling's extra-pair status (0 = WPY, 1 = EPY). We included a random intercept for female identity to compare maternal half-siblings (Stiratelli et al., 1984;Anderson and Aitkin, 1985). However, half (7 of 13) of the broods with EPY contained 100% EPP, leaving few mixed paternity broods to test this hypothesis. Therefore, we also conducted this analysis using data from all broods containing EPY (including nests with 100% EPP) and compared the results. Again, we also re-ran these models including all offspring and specifying a factor variable for whether an offspring fledged or not.
Due to our small sample sizes for some tests, we used the R package simr to conduct post-hoc power analyses for significant predictors to assess the power of our models to be able to detect the effect sizes that we observed. Tests were run for 1000 simulations. All analyses were conducted in R v.2.15.1 (R Development Core Team, 2011), using the lme4 package (Bates et al., 2015) for all mixed-model analyses. Significance for mixedmodels was assessed using likelihood ratio tests (LRT) between nested models.

Paternity Analysis
EPP was present in 72% (38 of 53) of all the broods and EPY accounted for 57% (87 of 152) of all offspring studied. Five out of the 28 broods in which the EPMs could be identified contained offspring from two EPMs. For the subset of nests that we had data on growth rates (N = 20), EPP occurred in 65% of all nests, and EPY accounted for 71% of all offspring.

Genetic Relatedness of Social Pairs
Pairwise relatedness between social pairs was often high, with 36% of all pairings being between close kin (equivalent to mating between first cousins or closer, r ≥ 0.125). Eight of the 13 pairs for which we had information on their pairings from multiple years maintained pair-bonds for at least 2 years. All four of the highlyrelated pairs for which we had information on their pairings from multiple years maintained pair-bonds for at least 2 years, and one of these was the only pair known to maintain a pair-bond for all 4 years of this study. Results from the mixed-model analysis indicate that social pairs are more closely related than expected by random mating (β = 4.517 ± 0.839, χ 2 = 22.335, p < 0.001). The randomization test result demonstrated a similar outcome, however this analysis was not quite statistically significant (mean genetic relatedness of social pairs = 0.09 standard deviation (SD) 0.14, mean genetic relatedness of random pairs = 0.05 SD 0.09, p = 0.061). There was no significant difference in the mean genetic relatedness of pairs between years (p = 0.94).

Effect of Parental Genetic Relatedness on Offspring Growth
When offspring mean mass growth rates were examined, less-related genetic parents produced offspring that exhibited significantly faster growth rates than more highly-related parents (β = −0.476 ± 0.234, N = 38, χ 2 = 4.035, p = 0.044; Figure 3). The power for this effect size was 42.1%. When offspring that did not fledge, and a fixed effect taking into account the fact that non-fledged offspring have reduced growth rates, are included in the model, the effect of parental relatedness was highly significant (Table S2). As predicted, non-fledged offspring had significantly lower growth rates than fledged offspring (Table S2). We also tested the effect of relatedness whilst accounting for whether an offspring was a WPY or EPY by including an additional categorical variable specifying the nestling's extra-pair status (Table S3). Although we were limited by sample size for this test, both the relatedness predictor and the EP status predictor remained almost significant in the expected direction, and this test showed that while accounting for genetic relatedness of the parents, EPY tended to have faster growth rates than WPY (Table S3).

Comparison of Within-Pair and Extra-Pair Young
EPY possessed significantly faster growth rates in mass (β = 0.173 ± 0.081, N = 13 nests, χ 2 = 4.527, p = 0.033, power = 55.1%) than WPY. However, because 7 out of the 13 nests contained 100% EPP we consequently analyzed only mixedpaternity nests. Although this test lacked power (46.8%), EPY still tended to have faster growth rates than WPY (Figure 4, β = 0.106 ± 0.058, N = 6 nests, χ 2 = 3.343, p = 0.067). Including all offspring in these models, with a variable specifying whether or FIGURE 3 | Effect of genetic relatedness of the biological parents on an offspring's residual mass growth rate (N = 38). Residual growth rates were extracted from the random slopes of a LMM of nestling mass on nestling age and represent the degree to which a certain nestling's growth rate is above or below the mean growth rate for that sex. Points are raw data and solid line is the fitted values output from a LMM with 95% confidence intervals (dashed lines).
not an offspring survived to fledging, did not change these results (Tables S4, S5).

DISCUSSION
Theory predicts that compatible genes selection should be greatest in populations where the cost of incompatibility is high (Pusey and Wolf, 1996;Colegrave et al., 2002;Michalczyk et al., 2011). This would occur, for example, in inbred populations where inbreeding depression can have detrimental effects on offspring performance (Keller and Waller, 2002). In our study, this prediction is borne out by several lines of evidence. First, social pairings between kin in tui are relatively common and occur more frequently than expected by chance, increasing the risk of inbreeding depression. Second, the offspring of genetically dissimilar biological parents had faster growth rates than the offspring of highly-related pairs, suggesting that inbreeding can reduce fitness under some circumstances. Third, EPMs were significantly less-related to the female than were the WPMs they cuckolded, EPY generally possessed faster growth rates than WPY, and EPY tended to have faster growth rates than their WPY half-siblings, which suggest that extra-pair mating helps females to circumvent the costs of social inbreeding. Together these findings provide empirical evidence that extra-pair mating in tui can provide compatible genes benefits.
The demonstrated adverse effect of inbreeding on offspring fitness in this study suggests that the high rate of EPP in tui could, at least in part, be an inbreeding avoidance mechanism that has evolved in response to significant natal philopatry (Birkhead and Møller, 1998;Foerster et al., 2006). Yet, we found no evidence that social inbreeding was compensated for by a facultative increase in frequency of EPP. Our findings are thus congruent with studies in other passerines in which females mated with genetically dissimilar EPMs and yet the genetic similarity of the FIGURE 4 | Difference in residual growth rates in mass between WPY and EPY maternal half-siblings in mixed paternity broods (N = 19 nestlings in 6 brood comparisons). Where multiple comparisons of half-siblings within broods existed, growth rates within each class have been averaged.
social-pair did not predict the occurrence of EPP (Foerster et al., 2006;Suter et al., 2007;Brekke et al., 2012). Comparably, two of these studies also found some support for non-random social inbreeding (Foerster et al., 2006;Brekke et al., 2012). In our study, this was primarily driven by the social pairings of some highlyrelated individuals. We therefore argue that although female tui gain compatible genes benefits from EPP, this mating pattern is not a consequence of direct selection on pre-or post-copulatory female extra-pair mate choice (Fossøy et al., 2008) but is rather the automatic consequence of within-pair mating decisions. In support of this hypothesis, if highly-related pairs are removed from our analysis, extra-pair mating appears to be random with respect to genetic relatedness. Nevertheless, this mating pattern does not preclude that EPP in tui evolved as an inbreeding avoidance mechanism, because the social structuring of pairs with high genetic relatedness negates the requirement for female extra-pair mate choice based on genetic compatibility in order for EPP to be adaptive.
Indeed, mate choice based on relatedness would require the existence of sophisticated recognition of unfamiliar kin mechanisms. Although kin recognition by phenotype matching is common among mammals with refined olfactory mechanisms (e.g., Pusey and Wolf, 1996;Mateo, 2003;Boulet et al., 2009;Leclaire et al., 2013) and in cooperative breeders where visual or acoustic cues may be important in kin selection (Komdeur and Hatchwell, 1999;Hatchwell et al., 2001;Cornwallis et al., 2009;McDonald and Wright, 2011), few studies have definitively identified the methods through which socially monogamous birds can assess relatedness (Jamieson et al., 2009). Consequently, significant doubt still surrounds the ability of females to select mates based on genetic compatibility through either pre-or post-copulatory mechanisms (see Griffith, 2007 for a review). Instead, the EPP success of genetically dissimilar tui males likely derives from their superior post-copulatory fertilization success (Tregenza and Wedell, 2000;Griffith and Immler, 2009). Critically, this system does not require an active female preference or facultative manipulation of paternity. For example, under the "genetically loaded raffle" model (Ball and Parker, 2003) females may mate with many different males but only sperm from the most compatible males achieve fertilization (Wilson et al., 1997;Birkhead, 1998;Birkhead and Pizzari, 2002;Bernasconi et al., 2004;Griffith and Immler, 2009;Pryke et al., 2010). Foerster et al. (2006) similarly argued that sperm selection was responsible for the EPP success of genetically dissimilar blue tit, Cyanistes caeruleus, males. The tui system could be considered a specific form of the genetically loaded raffle driven by population social structure; if females are paired to highlyrelated social males, any chosen EPM should achieve significant paternity success because of his high likelihood of having more compatible sperm.
Previous studies have shown that population social structuring can influence rates of EPP in some species (e.g., Cohas and Allainé, 2009). However, the extent to which social structure influences the fitness outcomes of extra-pair mating has been little researched. To our knowledge, our study is the first to demonstrate that females increase offspring fitness by gaining compatible genes benefits from EPP simply as a result of population social structuring. Our findings are similar to those of a recent study (Reid et al., 2015) which found that female song sparrows, M. melodia, alter offspring inbreeding coefficients through random extra-pair mating due to their propensity to pair with related social partners more frequently than expected by random mating alone. These two studies demonstrate that population social structure can influence the fitness consequences of extra-pair mating without the overt need for an evolutionary mechanism based on kin recognition. However, Reid et al. (2015) did not include any aspect of sexually selected traits in their study, therefore it was unclear how selection for good genes might be affected. Although we do not include a measure of good genes selection here, this was the focus of another study on this tui population during the same time period (Wells et al., 2015). In contrast to the findings of Reid et al. (2015), the study found that extra-pair mating in this population of tui is not random: EPMs are more ornamented than their WPM counterparts, and males with smaller throat ornaments are more likely to be cuckolded (Wells et al., 2015). Theory predicts that females cannot gain good genes and compatible genes benefits from the same male (Mays et al., 2008), and that when these two mating strategies coexist in a population, selection for one benefit will weaken selection for the other (Colegrave et al., 2002). However, this may not be the case when compatibility benefits are the outcome of social structure, as we argue here. Most tui broods containing EPP are sired by only one EPM (Wells et al., 2015). Therefore, it is unlikely that female tui select different EPMs for good genes and compatible genes benefits, as has been demonstrated in blue tits (Foerster et al., 2003). Instead, any females paired to relatives that seek extra-pair copulations with more ornamented EPMs also stand to gain compatible genes benefits because of the higher likelihood that the chosen EPM will be more genetically dissimilar to her than the WPM. However, we stress that our finding that EPY have faster growth rates than WPY does not alone distinguish whether this is due to good genes or compatible genes benefits, or indeed whether it is a result of both processes, and examination of the fitness outcomes of good genes matings is needed to confirm this. Nevertheless, our findings highlight a potential pathway through which EPP in tui can provide the opportunity to simultaneously gain good genes and compatible genes benefits. Thus, population social structuring can facilitate a means by which selection for good genes and compatible genes can operate through EPP with little opposing selection.
The relatively stable pair-bonds and non-random social mating with respect to relatedness in tui then raises the question exactly what benefits, if any, being paired to a relative might provide (reviewed in Kokko and Ots, 2006)? One possibility is that inclusive fitness benefits gained from inbreeding and/or the costs of avoiding inbreeding may outweigh any cost of inbreeding depression (Lehmann and Perrin, 2003;Parker, 2006;Szulkin et al., 2013;Lehtonen and Kokko, 2015). Alternatively, individuals may not actively choose related social partners, yet inbreeding may provide females with some correlated benefit. For example, females may pair with relatives if local males are more likely than immigrants to procure high quality territories. This hypothesis is particularly relevant to tui, in which natal philopatry (Stewart, 1980;Bergquist, 1985;Stewart and Craig, 1985) exacerbated by social aggregations and competition for territories occurring at nectar resources at the beginning of the breeding season (Stewart, 1980), could render such a strategy adaptive.
Like most studies on a large, open population, our study has some limitations. First, our sample sizes, and hence the power of our tests, in this study are small, and should be replicated with a larger dataset. Second, due to the correlative nature of this study, we are unable to differentiate between the effects of relatedness itself on offspring fitness and between potential confounding factors such as age or condition. For example, individuals of low genetic quality or condition may experience limited breeding opportunities and may therefore be more likely to mate with relatives. Third, we cannot rule out the possibility that the greater fitness observed in the young of less related parents is the result of maternal effects. For example, females may preferentially invest more in extra-pair offspring that are deemed to be of greater reproductive value (Magrath et al., 2009;Tschirren et al., 2012). This is a possibility in our study because females choose EPMs based on phenotypic quality in this population. Finally, in including only fledged offspring in our tests we may have inadvertently created a bias, or reduced the power of our tests by reducing our sample size. However, the outcome of this was deemed to be preferable to potentially reporting results based on offspring that had extremely low growth rates that were unrelated to parental relatedness.
In conclusion, our study provides valuable insight into the effect that population social structure can have on the fitness outcomes of extra-pair mating in a large and naturally occurring population. Whether cryptic female choice or sperm selection is responsible for the demonstrated EPP success of genetically dissimilar males is outside the scope of the present study. Irrespective of the exact selective mechanism, our findings demonstrate that females gain compatible genes benefits from extra-pair mating, but that social inbreeding rather than active extra-pair mate choice is responsible. Moreover, in our study, females from highly-related pairs engaging in extra-pair mating for good genes benefits gained genetic compatibility benefits as an additional aside. Thus contrary to theoretical predictions (Colegrave et al., 2002;Mays et al., 2008) we argue that selection for compatible genes need not always weaken selection for good genes when genetic compatibility is an outcome of population social structuring. Such multiple genetic benefits could compensate for any costs of female extra-pair behavior and extra-pair mating could therefore be adaptive for females under such circumstances.

AUTHOR CONTRIBUTIONS
SW and WJ conceived of and designed the study. SW and WJ collected field data. DG provided funding for molecular work. SW analyzed the data with input from DG and BJ. SW, JD, and WJ interpreted the data. SW wrote the manuscript with contributions from JD and WJ.

FUNDING
This study was funded by Landcare Research, New Zealand, and Auckland Council, New Zealand. SW was supported by a Commonwealth Doctoral Scholarship.