Genomic inbreeding measures applied to a population of mice divergently selected for birth weight environmental variance

This study aimed to compare different inbreeding measures estimated from pedigree and molecular data from two divergent mouse lines selected for environmental birth weight during 26 generations. Furthermore, the performance of different approaches and both molecular and pedigree data sources for estimating Ne were tested in this population. A total of 1,699 individuals were genotyped using a high-density genotyping array. Genomic relationship matrices were used to calculate molecular inbreeding: Nejati-Javaremi (F NEJ ), Li and Horvitz (F L&H ), Van Raden method 1 (F VR1 ) and method 2 (F VR2 ), and Yang (F YAN ). Inbreeding based on runs of homozygosity (F ROH ) and pedigree inbreeding (F PED ) were also computed. F ROH , F NEJ , and F L&H were also adjusted for their average values in the first generation of selection and named F ROH0 , F NEJ0 , and F L&H0 . ∆F was calculated from pedigrees as the individual inbreeding rate between the individual and his parents (∆F PEDt ) and individual increases in inbreeding (∆F PEDi ). Moreover, individual ∆F was calculated from the different molecular inbreeding coefficients (∆F NEJ0 , ∆F L&H , ∆F L&H0 , ∆F VR1 , ∆F VR2 , ∆F YAN , and ∆F ROH0 ). The Ne was obtained from different ∆F, such as Ne PEDt , Ne PEDi , Ne NEJ0 , Ne L&H , Ne L&H0 , Ne VR1 , Ne VR2 , Ne YAN , and Ne ROH0 . Comparing with F PED , F ROH , F NEJ and F VR2 overestimated inbreeding while F NEJ0 , F L&H , F L&H0 , F VR1 and F YAN underestimated inbreeding. Correlations between inbreeding coefficients and ∆F were calculated. F ROH had the highest correlation with F PED (0.89); F YAN had correlations >0.95 with all the other molecular inbreeding coefficients. Ne PEDi was more reliable than Ne PEDt and presented similar behaviour to Ne L&H0 and Ne NEJ0 . Stable trends in Ne were not observed until the 10th generation. In the 10th generation Ne PEDi was 42.20, Ne L&H0 was 45.04 and Ne NEJ0 was 45.05 and in the last generation these Ne were 35.65, 35.94 and 35.93, respectively F ROH presented the highest correlation with F PED , which addresses the identity by descent probability (IBD). The evolution of Ne L&H0 and Ne NEJ0 was the most similar to that of Ne PEDi . Data from several generations was necessary to reach a stable trend for Ne, both with pedigree and molecular data. This population was useful to test different approaches to computing inbreeding coefficients and Ne using molecular and pedigree data.


Introduction
Inbreeding appears due to mating between related individuals and is related to negative consequences because of an increase in homozygosity, such as a reduction in fitness, namely, inbreeding depression (Hedrick, 2012;Alemu et al., 2021).Therefore, the study of inbreeding is essential in many areas, e.g., animal and plant breeding (Villanueva et al., 2021), human genetics (McQuillan et al., 2012), and evolutionary (Roff, 1997) and conservation biology (Frankham et al., 2010).
Traditionally, pedigree data has been used to measure inbreeding (Howard et al., 2017;Villanueva et al., 2021); however, this only provides the expected proportions of the genome that are identical by descent (IBD) and does not capture variation due to Mendelian sampling and linkage during gamete formation (Howard et al., 2017).
The implementation of molecular technologies has led to the development of numerous genomic estimators of inbreeding coefficients (Howard et al., 2017;Alemu et al., 2021;Villanueva et al., 2021), which can distinguish between individuals with the same ascendants and differentiate inbreeding at specific regions in chromosomes (Howard et al., 2017;Villanueva et al., 2021).Different approaches have been developed to measure inbreeding coefficients based on genomic data, including methods based on homozygous stretches of DNA sequences (runs of homozygosity-ROH) (Ceballos et al., 2018).However, ROH detection is highly dependent on the parameters set by the user (Peripolli et al., 2017;Rodríguez-Ramilo et al., 2019;Villanueva et al., 2021); therefore, comparisons between studies are not straightforward.In this context, other methodologies appeared that detected homozygosity by descent (HBD) segments under a probabilistic framework based on hidden Markov models.The probability of a single nucleotide polymorphism (SNP) being in an HBD segment was modelled as a function of genotyping error, intermarker distances and allele frequencies (Druet and Gautier, 2017).In addition, other methods are based on analysing each SNP and are derived from genomic relationship matrices (GRMs) (Li and Horvitz, 1953;Nejati-Javaremi et al., 1997;VanRaden, 2008;Yang et al., 2010;VanRaden et al., 2011).GRMs are used to obtain genomic predictions in animal breeding and can be used to estimate inbreeding given (Villanueva et al., 2021).These measures, however, are designed with different constraints and can lead to very different results with several implications (Villanueva et al., 2021).
One of the indicators most used to assess the genetic diversity of a population is the effective population size (Ne) (Wright, 1931).There are some approaches based on different indicators that have been used to measure Ne: the change in allele frequencies across generations (variance Ne), increase in homozygosity (inbreeding Ne), increase in kinship (coancestry Ne) or the rate of loss of less frequent alleles (eigenvalue Ne).Ne can also be estimated using census parameters, pedigree data, individual genotypes or demographic information (Sjödin et al., 2005;Leroy et al., 2013).The relationship between the increase in inbreeding (ΔF) and Ne is defined by the classic formula: Ne 1 2ΔF .Traditionally, ΔF has been calculated as the increase in the inbreeding rate between two successive generations (Wright, 1931).However, other methods have been proposed to estimate the individual increase in inbreeding based on pedigree knowledge, which has demonstrated less susceptibility to mating method changes (Gutiérrez et al., 2008;Gutiérrez et al., 2009;Cervantes et al., 2011).Moreover, the availability of molecular information has led to the estimation of molecular Ne.Some estimators have been proposed based in heterozygosity excess (Pudovkin et al., 1996), linkage disequilibrium (Hill, 1981), temporal changes in allele frequency (Krimbas and Tsakas, 1971;Nei and Tajima, 1981;Pollak, 1983), half-sib and full-sib (Wang, 2009), or in approximate Bayesian computation that take into account, among other parameters, the number of alleles per locus, the linkage disequilibrium or the mean and variance of multilocus homozygosity (Tallmon et al., 2008).Wang et al., 2016 discussed some concerns about these different Ne estimators as the reliability, the interpretation of each method, or the implicit assumption in some of these methods.
A divergent selection experiment for birth weight variability in mice has been successfully carried out, creating two lines, one homogeneous and the other heterogeneous over 26 generation (Formoso-Rafferty et al., 2016a).The experimental mating design avoided sharing grandparents and was optimised as a function of selection criteria in both selected lines (Formoso-Rafferty et al., 2016b).The homogeneous line presented advantages in production, animal welfare, response to selection, and traits related to robustness, such as longevity, survival and feed efficiency (Formoso-Rafferty et al., 2017;Formoso-Rafferty et al., 2018;Formoso-Rafferty et al., 2019;Formoso-Rafferty et al., 2020;Formoso-Rafferty et al., 2022;Formoso-Rafferty et al., 2023).
This study aimed to compare different inbreeding measures estimated from pedigree and molecular data from divergent mouse lines for environmental variability in birth weight.Furthermore, the performance of Ne estimated using different approaches and both molecular and pedigree data sources was tested in this divergent population.The applications of different methodologies in this particular population structure (strong divergent selection) are intended to evaluate the suitability in populations under selection.

Methods
Pedigree data were obtained from 26 discrete generations of a successful divergent selection experiment for environmental variability of birth weight in mice, including an additional five previous discrete generations randomly mated.In the experiment, in each generation, 43 females were mated to one male to give a maximum of two parturitions, avoiding mating animals sharing grandparents, with an approximately 30% proportion selected within the line.In the five preceding generations, 32 males were mated to 64 females, each male with 2 females.More details of the selection process can be found in Formoso-Rafferty et al. (2016a).
A total of 1824 individuals from the 26 selection generations were genotyped using the Affymetrix Mouse Diversity Genotyping Array with 616,137 SNPs.The individuals genotyped were those selected according to the selection criteria in each generation.During quality control (QC) animals with a call rate lower than 97% were removed, leaving 1,669 for analysis.Of these, 1,587 were females and 112 were males.The first generation of selected mice was used as the reference population (a total of 70 individuals, all females).The QC was also applied to the SNPs: 3% of missing genotypes were allowed, SNPs mapped to sex chromosomes were removed, and 545,656 SNPs were retained.This set was used to detect ROH to ensure maximum genome coverage and, therefore, avoid loss of information: no minor allele frequency (MAF) was applied as recommended other authors recommended (Meyermans et al., 2020).
Additional QC was applied to determine the other genomic inbreeding coefficients estimators: SNPs presenting MAFs lower than 0.05 in the reference population were removed.Additionally, remnant SNPs with MAFs less than 0.05 among the whole population were also removed.Finally, 173,546 SNPs were kept, which were used to obtain inbreeding coefficients from different GRMs.The number of animals genotyped per generation in the selected population is presented in Table 1.

Inbreeding coefficients
Inbreeding coefficients from genomic data were obtained from the diagonal elements of different GRMs and ROH.
Inbreeding based on GRMs was named using the nomenclature chosen by Villanueva et al., (2021); therefore, inbreeding coefficients were calculated as follows: 1) F NEJ uses the diagonal elements of the allelic relationship matrix of Nejati-Javaremi et al. (1997).F NEJ is calculated using all individuals (F NEJT ), those from the high variability line (F NEJH ) and those from the low variability line (F NEJL ). 2) F NEJ0 of an individual is calculated as , where F NEJ is the Nejati-Javaremi inbreeding coefficient and F NEJ1st is the average Nejati-Javaremi inbreeding coefficient of the reference population.3) F L&H estimates the deviation of the observed frequency of homozygotes from the expected frequency of homozygotes in the reference population under Hardy-Weinberg proportions (Li and Horvitz, 1953).4) F L&H0 of an individual k is calculated as , where F L&H is the Li and Horvitz inbreeding coefficient of the individual and F L&H1st is the average F L&H of the reference population.Therefore, F L&H0 is expected to perform similarly to that of F NEJ0 .5) F VR1 is computed from the diagonal elements of the GRM obtained according to Van Raden's method 1 (VanRaden, 2008).6) F VR2 is computed from the diagonal elements of the GRM obtained according to Van Raden's method 2 (Leutenegger et al., 2003;VanRaden, 2008).7) F YAN uses the diagonal elements from the GRM of Yang to compute inbreeding coefficients (Yang et al., 2010).8) Sliding window algorithms were used to detect ROH in each population.The parameters set were as follows: 50 SNPs per window; one heterozygote allowed in a window; no limit of the number of heterozygotes per ROH; five missing SNPs allowed in a window; the minimum length of an ROH was 1,000 kb; the minimum number of homozygous SNPs in an ROH was set at 100; the required minimum density was set at one SNP/50 kb; the window threshold was set at 0.5; and the minimum distance between two ROHs was 1,000 kb.Inbreeding based on ROH was calculated as F ROHi (McQuillan et al., 2012).9) F ROH0 is calculated as , where F ROH is the ROH inbreeding coefficient and F ROH1st is the average ROH inbreeding coefficient of the reference population.10) Pedigree inbreeding coefficients (F PED ) are defined as the probability of an individual having two identical alleles by descent and are computed following Meuwissen and Luo (1992).
In summary, we tested several methods; some were intrinsically adjusted for the allele frequencies of the reference population (F L&H , F VR1 , F VR2 , and F YAN ), others were adjusted for mean inbreeding of the reference population (F NEJ0 , F L&H0 , and F ROH0 ), and, finally, some were not adjusted (F PED , F NEJ , and F ROH ).The genomic inbreeding estimators described above were chosen because they have been already tested in other populations (Alemu et al., 2020;Villanueva et al., 2021;Caballero et al., 2022), except for the adjusted inbreeding estimators 2, 4 and 9, which, not been previously described before.The application of these estimators to this population with a high number of generations, a particular and very structure linked to a specific mating design, represented an opportunity to evaluate the performance of these inbreeding coefficient estimators.
Expected homozygosity (F EXP ) was calculated from the frequency of the reference allele (p) per generation as F EXP 1 − 2p(1 − p).This coefficient was calculated considering both lines together (F EXP ) and within the high variability line (F EXPH ) and the low variability line (F EXPL ).
Our own R code was used to calculate coefficients based on GRMs.PLINK v 1.9 (Chang et al., 2015) was used to detect ROH, and ENDOG v 4.8 (Gutierrez and Goyache, 2005) was used to calculate pedigree inbreeding coefficients.

Individual increase in inbreeding and realised effective population size
Effective population size was computed for each generation from individual increases in inbreeding (Gutiérrez et al., 2008;Gutiérrez et al., 2009) based on the different inbreeding coefficients described above.When pedigree inbreeding was used, two approaches were applied to calculate the individual increase in inbreeding.One is based on the classic formula: , where F PEDt is the inbreeding coefficient of the individual of generation t and F PEDt−1 is the average coefficient of inbreeding of their parents (Falconer and Mackay, 1996).Then, the effective population size was computed as Ne PEDt 1 2ΔFPED , where ΔF PEDt is the average ΔF PEDt of n individuals included in each generation (Falconer and Mackay, 1996).
Moreover, the individual increase in inbreeding coefficient using Gutiérrez et al., 2009), where F PEDi is the individual coefficient of inbreeding of i and t are the number of generations.The individual increase in inbreeding has been proposed as a measure of standardized inbreeding rate per generation (Gonzále-Recio et al., 2007).The realised effective population was computed as Ne PEDi 1 2ΔFPEDi , where ΔF PEDi is the average ΔF PEDi of n individuals included in each generation (Cervantes et al., 2008).
The molecular , where F i is the individual F NEJ0 , F L&H0 , F ROH0 , F VR1 , F VR2 or F YAN coefficient of inbreeding of individual i and t is the number of generations that passed since the first generation of selection where the reference population has been defined.
The expected N e per generation was computed as Ne EXP 1 2ΔF EXP , where ΔF EXP is the expected increase in homozygosity per generation calculated as , where F EXP t is the expected homozygosity of the current generation and F EXPt−1 is the average coefficient of expected homozygosity of the previous generation.This expected N e was obtained considering both lines together (Ne EXP ), also within line for the high variability line (Ne EXPH ) and for the low variability line (Ne EXPH ).
In addition, the number of generations that passed since the founder generation to the reference population could be derived from the classic formula F t 1 − (1 − ΔF) t , where F t is the average inbreeding of the reference population and t is the generation of the reference population.Thus, the estimated generations that had passed from the founder population to the reference population are t ln (1−Ft)  ln (1−ΔF) .Different t values were calculated considering F NEJ (t NEJ ) and F ROH (t ROH ); t NEJ and t ROH were calculated using the average inbreeding coefficients of the reference population, and the increase in inbreeding (ΔF 1 2Ne ) was derived from Ne based on the sex ratio (Ne s ), which was calculated as Nes 4MF M+F , where M is the number of breeding males and F is the number of breeding females (Wright, 1931).Ne s was calculated using the permanent number of males (32) and females (64) in the randomly mated population, which was the origin of the selected divergent lines.
Moreover, t was also calculated using the average F PED of the reference population (t PED ) and the average ΔF PEDi of whole selection generations.In this case, t PED represented the number of generations elapsed if the breeding method of the selection experiment would be applied from the founder population of the recorded pedigree to the first generation of the experiment (reference population).

Results
Figure 1 shows the trend of average inbreeding across generations calculated using molecular and pedigree approaches, which was positive in both cases.Both approaches presented similar performances across generations, except F ROH and F NEJ , which showed the greatest amount of inbreeding in the first and last generations.In the first generation, F ROH was 0.54 and F NEJ was 0.61, and in the last generation, F ROH was 0.68 and F NEJ was 0.73.The lowest individual inbreeding coefficient was identified for F L&H in the first (−0.02)and last generation (0.28) (Supplementary Table S1).There were not differences in the evolution of the different inbreeding coefficients in the two selection lines when analysed separately (results are not shown).We therefore decided to analyse the data together.
The evolution of expected homozygosity and F NEJ for the whole population and selected lines for divergent birth weight variability across generations are represented in Figure 2. Expected homozygosity presented a positive trend when data from all individuals was used and when selected lines were analysed separately.Moreover, F EXPH and F EXPL were similar to F NEJH and F NEJL , with 0.75 F EXPH , 0.74 F EXPL , 0.73 F NEJH , and 0.72 F NEJL in the

FIGURE 2
Evolution of expected homozygosity (F EXP ) and mean inbreeding coefficient obtained from the Nejati-Javaremi genomic relationship matrix (F NEJ ) across the selection generations.F EXP-T was calculated using all individuals.F EXP-H was calculated using individuals from the high variability line.F EXP-L was calculated using individuals from the low variability line.F NEJ-T was calculated using all genotyped individuals.F NEJ-H was calculated using individuals from the high variability line.F NEJ-L was calculated using individuals from the low variability line.last generation.F EXP was greater than F NEJ until the seventh generation.However, after these generations, F EXP was markedly lower than F NEJ and lower than expected homozygosity within line because of the structure created by divergent selection.The highest values of F EXP and F NEJ were reached in the last selection generation, at 0.67 and 0.72, respectively.
Correlations between molecular and pedigree inbreeding coefficients using the whole population are shown in Table 2.All correlations were greater than 0.84.The correlations between F PED and molecular inbreeding coefficients were between 0.84 (F PED -F VR1 ) and 0.89 (F PED -F ROH ).The strongest correlations between molecular inbreeding coefficients were detected between F NEJ and F L&H (1.00).The F YAN showed correlations greater than 0.95 with all other molecular inbreeding coefficients, and the correlation with F PED was 0.88.When only a few generations were used to calculate the correlation coefficients, these correlations were lower between F PED and the molecular inbreeding coefficients (Supplementary Table S2).In the initial generations, F PED showed the highest correlation with F ROH (0.25) and the lowest correlation with F VR1 (0.11).However, in the intermediate generations, F PED showed the highest correlations with F NEJ and F L&H (0.39), and the lowest correlation was observed with F VR2 (0.22).Moreover, in the most recent generations, the highest correlations were also observed between F PED -F NEJ and F PED -F L&H (0.22), and the lowest correlation was observed between F PED -F VR2 (0.13).Finally, in general, F YAN showed the highest correlations with other molecular inbreeding coefficients in the initial, intermediate, and final generations.The evolution of individual inbreeding increases across generations calculated by pedigree and molecular inbreeding coefficients are shown in Figure 3.The ΔF PEDt (values between −0.03 and 0.06, Figure 3A) showed an irregular evolution across generations unlike ΔF PEDi (0.00 and 0.02, Figure 3B), which increased during fifteen generations and became stable thereafter.Individual increases in inbreeding calculated from molecular inbreeding coefficients showed a similar trend, with higher variability among individuals until the 10th generation, after which the trend was stabilised.The ΔF L&H0 (values between −0.13 and 0.13, Figure 3E) performed more like the other molecular increases in inbreeding (ΔF NEJ0, ΔF VR1, ΔF VR2, ΔF YAN and ΔF ROH0 ) than ΔF L&H (values between −0.02 and 0.02, Figure 3D) in the first five generations of selection.
Table 3 shows the correlations between increases in inbreeding calculated using all generations.Correlations greater than 0.90 were identified between ΔF L&H0 -ΔF NEJ0 (1.00), ΔF L&H -ΔF YAN (0.92), ΔF ROH0 -ΔF NEJ0 (0.91), and ΔF ROH0 -ΔF L&H0 (0.91).Moreover, ΔF YAN was strongly correlated with ΔF VR1 (0.89) and ΔF VR2 (0.86).The lowest correlations were detected between ΔF PEDt and all other inbreeding increases being between 0.38 for ΔF PEDi and 0.18 for ΔF VR1 and ΔF VR2 .Interestingly, higher correlations were detected between ΔF L&H and ΔF YAN , ΔF VR1 and ΔF VR2 than between ΔF L&H0 and ΔF YAN , ΔF VR1 and ΔF VR2.When only groups of generations (initial, intermediate and most recent generations) were considered, correlations between individual ΔF values were generally lower than when all generations were used (Supplementary Table S3).The strongest correlations were detected between ΔF L&H0 and ΔF NEJ0 (1.00) in all generation groups.In summary, the molecular ΔF with the strongest correlations among the others in the different generation groups was ΔF YAN, followed by ΔF ROH0 .Nevertheless, when only a few generations were used to calculate ΔF, correlations between molecular and pedigree measures were very low.
Table 4 shows Ne calculated via different methodologies with molecular and pedigree data across the selection generations.The Ne trend stabilises after the 10th generation of selection except for Ne PEDt , which presented high variability across generations.In fact, Ne PEDt showed high values in generations 11 (1,482.27)11).The evolution of Ne NEJ0 and Ne L&H0 was almost identical.The evolution of Ne NEJ0 , Ne L&H0 , Ne VR1 , Ne YAN , and Ne ROH0 as of the 10th generation was similar to that of Ne PEDi , while Ne L&H0 showed greater values than Ne PEDi, and Ne VR2 showed lower values than Ne PEDi .The evolution of Ne EXP was more irregular than that of Ne NEJ0 , with extremely negative and positive values, such as 1852.44 in the 16th generation and −464.38 in the 26th generation (Supplementary Figure S1).When selected lines were analysed separately, Ne EXPH presented a more regular evolution than Ne EXPL .However, extremely negative and positive values were also observed in both (Supplementary Figure S1).
The Ne s was 85.33, which was used to infer the number of generations that had passed from the founder population from the hypothetical foundation to the first generation of selection using F NEJ (t NEJ ) and F ROH( t ROH) as described above: t NEJ was 162 generations, and t ROH was 133 generations.
When pedigree data was used to calculate the number of generations elapsed if the breeding method of the selection experiment would be TABLE 3 Correlation coefficients between molecular and pedigree increases in inbreeding coefficients (ΔF) of all genotyped individuals.ΔF PEDi was derived from pedigree referenced against the number of generations of the individual.ΔF PEDt was derived from the pedigree referring to the inbreeding coefficient of the individual's parents.ΔF NEJ was derived from the diagonal of the Nejati-Javaremi matrix.ΔF NEJ0 was derived from inbreeding coefficients (F) obtained using the Nejati-Javaremi matrix (F NEJ ) adjusted for the mean F NEJ of the first selection generation.ΔF L&H was derived from the diagonal of the Li and Horvitz matrix.ΔF L&H0 was derived from F using the Li and Horvitz matrix (F L&H ) adjusted for the mean F L&H of the first selection generation.ΔF VR1 was derived from the diagonal of the Van Raden type 1 matrix.ΔF VR2 was derived from the diagonal of the Van Raden type 2 matrix.ΔF YAN was derived from the diagonal of the Yang matrix.ΔF ROH was computed using runs of homozygosity.ΔF FROH0 was derived from F obtained using runs of homozygosity (F ROH ) adjusted for the mean F ROH of the first selection generation.

Discussion
This population, formed by two divergent lines selected for birth weight variability during twenty-six discrete generations, allowed us to test different inbreeding measures using molecular and pedigree data and their application to compute Ne.
The inbreeding coefficient has been defined in terms of correlations between parents uniting gametes (Wright, 1931) and as a probability that two genes randomly sampled in the parents' gametes are IBD (Malecot, 1948).Because of F PED is completely defined in terms of IBD probability, the performance of the molecular inbreeding coefficients are going to be compared with F PED , although the use of F PED as the golden standard is not straightforward as assumes that the founders of the pedigree are unrelated (Keller et al., 2011) and provide the expected proportion of inbreeding while ignoring the effects of other forces such as Mendelian sampling, linkage, or natural selection against TABLE 4 Evolution across selection generations, total mean, and standard deviation (SD) of the effective population size (Ne) using pedigree and molecular information.Ne PEDi derived from the individual increase in inbreeding (ΔF) refers to the number of individual generations.Ne PEDt derived from ΔF based on the increase in pedigree inbreeding between two successive generations.Ne NEJ0 derived from the inbreeding coefficient (F) of the Nejati-Javaremi genomic matrix (F NEJ ) adjusted for the mean F NEJ of the first generation of selection.Ne L&H.derived from ΔF of the Li and Horvitz genomic matrix.Ne L&H0 derived from F of the Li and Horvitz genomic matrix (F L&H ) adjusted for mean F L&H of the first generation of selection.Ne VR1 derived from ΔF of the Van Raden type 1 genomic matrix.Ne VR2 derived from ΔF of the Van Raden type 2 genomic matrix.Ne YAN derived from ΔF of the Yang genomic matrix.Ne ROH0 derived from the F of runs of homozygosity (F ROH ) adjusted for mean F ROH of the first generation of selection.

Generation of selection
Ne homozygous alleles (Wright, 1931).The F L&H , F VR1 , F VR2 , and F YAN behave more like correlations and have negative values in some individuals (Supplementary Table S1), as reported by other authors (Alemu et al., 2021;Villanueva et al., 2021).In the ROH definition, a long stretch of homozygosity is expected to be inherited from a common ancestor (Ceballos et al., 2018); F ROH values range from 0 to 1 (Alemu et al., 2021) and present the highest correlations with F PED when all individuals from these populations are used.Therefore, among all the different measures of molecular inbreeding analysed in this population, F ROH better fits the IBD definition when F PED is used as the reference coefficient.However, ROH analysis is an empirical, rulebased approach, and detection relies on the definition of several parameters that must be adjusted as a function of several constraints, such as marker density or the number of genotyping errors (Ferenčaković et al., 2013;Meyermans et al., 2020).Hence, ROH detection varied across different studies.Moreover, F ROH is more representative of IBD than F NEJ , defined as the proportion of homozygous SNPs in the individual and, thus, unable to differentiate between IBD and identity by state (IBS) (Toro et al., 2014); however, F ROH theoretically represents the proportion of long IBS regions in the genome.Furthermore, if F ROH and F NEJ were used to infer the number of generations that had passed since the founder population, t NEJ would be expected to present the highest value due to F NEJ being fully IBS, while F ROH is expected to be more representative of IBD.The high values of t PED and t ROH reflected the history of the founder population originating from the genetic contribution of three inbred mouse lines: BALB/c, C57BL and CBA.This mixed starting population was maintained under panmixia for 40 generations before the start of the selection experiment (Formoso-Rafferty et al., 2016b).The lowest correlations between F PED and molecular inbreeding coefficients were detected with F VR1 and F VR2.In addition, other authors (Caballero et al., 2022) reported that the poorest performance with regard to IBD was detected for F VR2 .
The number of generations that had passed before starting the experiment inferred from pedigree (t PED ) was very low.Pedigrees, however, are usually limited to a few past generations and ignore some of the ancestral relationships (Keller et al., 2011).In this case, F PED was calculated using five generations before initiating the experiment, and the mating method was changed at this start; therefore, t PED was lower than the number of generations that had passed since the founder population in the pedigree.
When all data was used, correlations between different inbreeding measures were high.In other studies in Iberian pigs (Saura et al., 2015), cattle (Alemu et al., 2021;Lozada-Soto et al., 2022) or humans (McQuillan et al., 2012), strong correlations were also detected.However, when a low number of generations was used, the correlations between different pedigrees and genomic inbreeding measures were lower.Therefore, the high correlations observed when using all animals across generations were probably because of the large number of genotypes distributed across generations, resulting in a wide range of inbreeding values in all coefficients.This demonstrates that this experimental population was of particular interest to check the relationship between the different parameters assayed.Moreover, other authors reported that the incompleteness of the pedigree should lead to poor correlations (Schiavo et al., 2022).Furthermore, the correlations between the different increases in inbreeding were lower than between the inbreeding coefficients because, firstly, ΔF PEDi and ΔF molecular are standardised by the depth of pedigree information.Therefore, the evolution of ΔF was expected to be stable if the mating design remains stable across generations, generating less variation than in the case of inbreeding coefficients and lower correlations.In addition, ΔF PEDi was also expected to correlate better with the other coefficients than ΔF PEDt , because the estimating molecular ΔFs were designed following ΔF PEDi .We could not fit the ΔF PEDt formula to the molecular information because we did not genotype the trios.
If the frequencies of the base population are known, the performances of F L&H , F VR1 , F VR2 , and F YAN are in terms of IBD (Caballero et al., 2022).However, other authors have reported that F L&H , F VR1 , F VR2 , and F YAN presented values outside of Malecot and Wright's inbreeding definitions and that F NEJ better fitted these definitions, as its values ranged between 0 and 1 (Saura et al., 2015;Villanueva et al., 2021).Moreover, Villanueva et al. (2021) observed that only F L&H could be interpreted in terms of loss or gain of genetic variability, while F VR1 , F VR2 , and F YAN presented some inconsistencies concerning loss or gain of genetic variability.Regarding the application of inbreeding measures, F ROH and F PED should be preferred to measure the increase in whole genome homozygosity, and F ROH should be preferred over F PED when pedigrees are not deep enough or present many errors (Keller et al., 2011;Alemu et al., 2021).
It has been reported that F YAN and F VR2 , giving higher weight to rare alleles (Villanueva et al., 2021), presented the highest correlation with homozygosity at SNPs with moderate to high MAFs (Alemu et al., 2021).However, alleles with low MAFs are more representative of kinship because of the higher probability of being transmitted when individuals belong to the same family.Hence, F PED and F ROH. did not reflect the segregation of these low-frequency alleles that could be an indicator of kinship.Therefore, to measure global inbreeding, F PED and F ROH were more useful (Alemu et al., 2021;Caballero et al., 2022), and F YAN or F VR2 could be better for measuring population kinship.
The F L&H represents a modification of F NEJ adjusted for the expected homozygosity of the reference population.Hence, these inbreeding coefficients have negative values in some individuals (Supplementary Table S3).For F L&H , this is clearly represented in Figure 1, where expected homozygosity was greater than F NEJ during the first ten generations.This was reflected as negative values of the individual increases in inbreeding and high negative and positive Ne L&H values during the first eleven generations.The F L&H is designed using the allele frequencies of a reference population to express homozygosity in terms of IBD, such as F VR1 , F VR2 , and F YAN .However, ΔF L&H presented more variability during the first five generations of selection (Figure 3) than F VR1 , F VR2 , and F YAN .This was probably because F L&H had a higher influence on the change in the mating design during the first selection generations and was more dependent on Hardy-Weinberg disequilibrium.This problem was solved when F L&H was adjusted for the mean inbreeding coefficient of the first generation of selection (Ne L&H0 ).Arias et al. (2023) described that the adjustment of the inbreeding coefficients by the mean of the reference population did not allow a complete correction of the estimates, however, the use of the allele frequencies of the RP to estimate the IBD evolution in the population studied hides some assumptions, as the reference population is in Hardy-Weinberg equilibrium or that the RP have no molecular kindship, as it is assumed to be the real founder population.Therefore, adjusting the molecular inbreeding estimates for the mean inbreeding of the RP brings these estimates closer to the IBD definition.In this context, F ROH and F NEJ presented the advantage of not being influenced by the allele frequencies of the RP in their estimation and of being easily corrected by the mean F ROH and F NEJ of the RP.Furthermore, the first two generations presented negative values for Ne NEJ0 , Ne L&H0 , and Ne ROH0.due to a negative individual increase in inbreeding produced by a negative trend of inbreeding coefficients in the first two generations, which was probably produced by the abrupt change in mating design.Thus, this negative trend could also be because in the five previous generations before the experiment, individuals were randomly mated, and from the first generation of selection, matings were designed to avoid sharing grandparents to avoid inbreeding.In addition, none of the effective population sizes started to present stable values until the 10th generation.Toro et al. (2020) had previously pointed out that when optimal management of genetic diversity is implemented in a population, molecular estimates of Ne could be meaningless because the increase in genetic diversity resulted in negative Ne.In addition, none of the effective population sizes began to present stable values until the 10th generation.Therefore, in this population, ten generations of selection were needed to reflect the change in mating method by the effective population size, as expected, since Ne is a diversity indicator strongly influenced by the change in mating policy, as seen in the results of this study (Table 3).
The Ne NEJ0 and Ne L&H0 presented almost identical values in this population and similar values to Ne PEDi .Ne NEJ0 presented the advantage of not using the frequencies of a reference population such as Ne FROH0 and, therefore, are less dependent on allele frequencies.
Comparing Ne PEDi and Ne PEDt , the latter presented a higher standard deviation, as other authors reported that Ne PEDt presented high fluctuations over time due to the high impact of breeding method changes, errors in pedigree registration, or sampling effects (Gutiérrez et al., 2008;Gutiérrez et al., 2009;Cervantes et al., 2011;Leroy et al., 2013) and could lead to negative values (Groeneveld et al., 2009;Leroy et al., 2009;Nagy et al., 2010).The Ne PEDi values were high during the 10th-12th first generations.After that, the evolution of Ne PEDi tended to be stable and reflected the history of mating.Moreover, no negative values were observed, and the standard deviation was lower.Furthermore, Ne PEDi evolved to be similar to the number of males of females mated in the selection experiment, accounting for the intensity of selection that could be influencing the evolution of homozygosity in this population, when the weight of the random mating previous to the experiment decreased.Another study performed in French Angora rabbits divergently selected for total fleece weight, showed that the low line had high Ne values calculated from pedigree data.However, this line always showed lower levels of inbreeding than the high line over the years of selection, even though the mating policy was the same in both lines (Rafat et al., 2009).This did not occur in our selection lines, both of which had almost the same level of inbreeding since the beginning of the experiment (results not shown).
The minimum viable population size thresholds have been traditionally defined as 50 for the short term and 500 for the long term (Harmon and Braude, 2010).All Ne values of the last generations obtained for this population were lower than the defined thresholds.Low Ne levels relate to a decrease in genetic variability, allele fixation, and a reduction in the selection response (Domínguez-Viveros et al., 2020).Nevertheless, the use of a strict critical level for Ne is not straightforward, as different factors could affect its interpretation, such as the method used, the species, or population structure (Leroy et al., 2013;Mokhtari et al., 2015).Moreover, in other studies, effective population sizes below 50 were reported in some species that did not present viability problems (Leroy et al., 2013).Therefore, this threshold should be revised and adapted to the particularity of each population.

Conclusion
The results obtained in these studies allowed to understand better the performance of different inbreeding coefficients and effective population sizes applied in a population with many discrete generations and under strong selection.Moreover, these results could be used as a reference for the study of inbreeding and genetic diversity in other populations.The F ROH and F PED presented the strongest correlations and were more representative of inbreeding in terms of IBD.However, other measures such as F YAN were useful in terms of rare alleles, which are more likely inherited from common ancestors.Adjusting the inbreeding estimates to the mean inbreeding of the reference population allowed it to fit more closely to IBD.The Ne PEDi presented a more reliable performance than Ne PEDt .In general, Ne obtained by molecular data showed a similar trend when comparing molecular approaches to each other and Ne PEDi , being more similar to Ne L&H0 and Ne NEJ0 .However, data from several generations was necessary to reach a stable trend for Ne, both with pedigree and molecular data.

FIGURE 1
FIGURE 1Evolution of the different inbreeding coefficients computed from pedigree and molecular information across the selection generation.F PED : inbreeding coefficient computed from pedigree information.F NEJ : inbreeding coefficient computed from the diagonal of the Nejati Javaremi Matrix.F L&H : inbreeding coefficient computed from the diagonal of the Li and Horvitz Matrix.F VR1 : inbreeding coefficient computed from the diagonal of the Van Raden type 1 matrix.F VR2 : inbreeding coefficient computed from the diagonal of the Van Raden type 2 matrix.F YAN : inbreeding coefficient computed from the diagonal of the Yang matrix.F ROH : inbreeding coefficient computed using runs of homozygosity.F PED0 , F NEJ0 , F L&H0 , and F ROH0 were adjusted for the mean inbreeding coefficients of the first selection generation.

FIGURE 3
FIGURE 3 Evolution across selection generations of molecular and pedigree increases in inbreeding coefficients (ΔF) of all genotyped individuals.(A): ΔF PEDt derived from pedigree referenced against the inbreeding coefficient of the individual's parents.(B): ΔF PEDi derived from pedigree referred to the number of generations of the individual.(C): ΔF NEJ0 derived from inbreeding coefficients (F) of the Nejati-Javaremi matrix (F NEJ ) adjusted for the mean F NEJ of the first selection generation.(D): ΔF L&H derived from the diagonal of the Li and Horvitz matrix.(E): ΔF L&H0 derived from F of the Li and Horvitz matrix (F L&H ) adjusted for the mean F L&H of the first selection generation.(F): ΔF VR1 derived from the diagonal of the Van Raden type 1 matrix.(G): ΔF VR2 derived from the diagonal of the Van Raden type 2 matrix.(H): ΔF YAN derived from the diagonal of the Yang matrix.(I): ΔF FROH0 derived from F of runs of homozygosity (F ROH ) adjusted for the mean F ROH of the first selection generation.

TABLE 1
Number of animals sampled per line and generation.H-Line: high variability line.L-Line: low variability line.

TABLE 2
Correlation coefficients between molecular and pedigree inbreeding coefficients of all genotyped individuals.F PED : inbreeding coefficient computed from pedigree.F NEJ : inbreeding coefficient computed from the diagonal of the Nejati-Javaremi matrix.F L&H i: inbreeding coefficient computed from the diagonal of Li and Horvitz Matrix.F VR1 : inbreeding coefficient computed from the diagonal of the Van Raden type 1 matrix.F VR2 : inbreeding coefficient computed from the diagonal of the Van Raden type 2 matrix.F YAN : inbreeding coefficient computed from the diagonal of the Yang matrix.F ROH : inbreeding coefficient computed using runs of homozygosity.
and 26(1,138.82)andanegative value in the second generation(−80.03).The Ne PEDi showed neither values as high as Ne PEDt nor negative values.The Ne PEDi showed the highest values in the first generations (94.72 in the first, 143.44 in the second, and 129.46 in the third).The lowest value of Ne PEDi was identified in the 25th generation (34.37).The mean and standard deviation across generations for Ne PEDt were 125.94 and 346.28, respectively, and 53.58 and 28.38, respectively, for Ne PEDi .Regarding molecular measurements, Ne performance differed across methodologies.Extremely negative values were detected with Ne L&H in the first five generations of selection, the most negative being in the fifth generation (−4,081.60).These extremely negative values translated into a negative mean value (−228.68)with a high standard deviation (988.62).The Ne NEJ0 , Ne L&H0 , Ne ROH0 , and Ne YAN also presented negative values in the first and second generations.The Ne VR2 had lower values compared with other molecular approaches, with a mean of 23.26 compared with 35.87 for Ne NEJ0 , 35.88 for Ne L&H0 , 32.66 for Ne VR1 , 40.76 for Ne YAN and 30.48 for Ne ROH0 .The standard deviation of Ne VR1 was the lowest (6.