ORIGINAL RESEARCH article

Front. Genet., 27 June 2025

Sec. Livestock Genomics

Volume 16 - 2025 | https://doi.org/10.3389/fgene.2025.1576125

This article is part of the Research TopicAdvances in Livestock Genetics: Enhancing Breeding Practices and Improving Animal HealthView all 12 articles

Inbreeding patterns and genetic diversity under selection in Teha sheep

Shunzhe WangShunzhe Wang1Long LiangLong Liang1Dilinigeer ZiyayidingDilinigeer Ziyayiding1Wenjing JiaoWenjing Jiao1Hailati KasimuHailati Kasimu1Sangang He
Sangang He2*Mingjun Liu
Mingjun Liu1*
  • 1Key Laboratory of Animal Biotechnology of Xinjiang, Key Laboratory of Genetics Breeding and Reproduction of Grass Feeding Livestock, Ministry of Agriculture and Rural Affairs of the People’s Republic of China, Institute of Biotechnology, Xinjiang Academy of Animal Science, Urumqi, China
  • 2Institute of Animal Husbandry and Veterinary, Zhejiang Academy of Agricultural Sciences, Hangzhou, China

Background: Inbreeding and genetic diversity are critical factors influencing the adaptability, productivity, and sustainability of livestock populations. Teha sheep, a crossbred line between Texel and Kazakh sheep, are an important meat-producing breed in China, yet their genetic structure and inbreeding status remain underexplored. In this study, we aim to evaluate inbreeding coefficients, genetic diversity, and selection signatures in Teha sheep by integrating pedigree and genomic data.

Results: Analysis of pedigree data from 2,652 individuals revealed a low inbreeding coefficient (FPED = 0.001), whereas analysis of genomic data from 1,271 individuals indicated slightly higher inbreeding coefficients, with the FROH averaging 0.044. Genetic diversity metrics, including Ho = 0.347 and PIC = 0.345, confirmed moderate variability within the population. A significant region of runs of homozygosity (ROH) hotspot was identified on chromosome 2 (112.01–119.89 Mb), encompassing genes such as MSTN, TUBGCP5, and NIPA2, which are associated with muscle growth, fat metabolism, and skeletal development. Notably, CYFIP1, SAP130, and UGGT1 were identified as key genes shared across ROH hotspots, QTL regions, and LD blocks, implicating their roles in growth efficiency, carcass quality, and protein regulation under stress. These findings reveal critical genomic regions contributing to the breed’s productivity and adaptability.

Conclusion: In this study, we highlight the low inbreeding levels and moderate genetic diversity of Teha sheep, emphasizing the integration of pedigree and genomic analyses for sustainable breeding programs. The identification of key genes provides a foundation for optimizing productivity and maintaining genetic variability in this important livestock population.

Introduction

Inbreeding refers to the probability that two alleles at a given locus are identical by descent, resulting from the mating of related individuals. In livestock populations, inbreeding commonly arises due to directional selection, small effective population sizes, or closed breeding systems (Falconer, 1996; Keller et al., 2011). Elevated levels of inbreeding increase genome-wide homozygosity, which may lead to the expression of deleterious recessive alleles and a decline in overall fitness, a phenomenon known as inbreeding depression (Doekes et al., 2020). This condition is typically characterized by reduced fertility, impaired immune function, and decreased production performance in farm animals (Doekes et al., 2020; Forneris et al., 2021). Accurate estimation of inbreeding is therefore essential for controlling genetic risks in breeding programs. Traditionally, inbreeding has been quantified using FPED, which estimates the expected proportion of autozygous loci based on known genealogical relationships (Falconer, 1996). However, the accuracy of FPED is highly dependent on pedigree completeness and depth, and it often fails to reflect actual genomic inbreeding, especially in populations lacking systematic pedigree recording (Howard et al., 2017).

Advancements in genomic technologies have revolutionized the assessment of inbreeding and population structure in livestock. High-throughput SNP genotyping platforms have enabled the precise quantification of genomic inbreeding coefficients by leveraging dense marker coverage to detect homozygous segments across the genome. These detected homozygous segments, particularly runs of homozygosity (ROH), represent regions likely to be IBD, allowing research workers to infer autozygosity (Chang et al., 2015). Unlike pedigree-based approaches, which estimate the probability of IBD based on known relationships and can be limited by pedigree completeness and accuracy, genomic methods directly detect patterns of genetic variation (such as ROH) to assess realized homozygosity arising from both recent and ancient common ancestors, offering robust insights into inbreeding dynamics. These approaches also facilitate the detection of genomic regions under selection and elucidate breed-specific evolutionary trajectories (Xiang et al., 2016). In sheep, high-throughput genotyping and ROH analysis have been instrumental in characterizing genetic diversity and identifying signatures of recent and ancestral inbreeding, informing sustainable breeding practices (He et al., 2020). Additionally, ROH-based metrics provide nuanced perspectives on the genomic distribution of inbreeding, distinguishing short ROH (indicative of ancient inbreeding) from long ROH (reflecting recent inbreeding), thus enhancing the precision of genetic management strategies (Van Der Valk et al., 2019).

In recent years, studies on crossbred populations such as Red Angus × Chinese Red Steppe cattle and Yunnan semi-fine wool sheep have begun to explore their genetic structure using genomic data (Guo et al., 2022; Rostamzadeh Mahdabi et al., 2025). These studies demonstrate that ROH analysis is not only effective for evaluating inbreeding levels but also capable of revealing historical recombination patterns and potential selection regions. This provides a foundation for understanding genetic diversity in crossbred populations and offers insights for genomic evaluation approaches in novel hybrid breeds.

Teha sheep, a crossbred population derived from Texel rams and Kazakh ewes, are predominantly raised in the Yili region of Xinjiang, China, and valued as a meat-producing breed with strong local adaptation. Combining the rapid growth rate and high meat productivity of the Texel breed with the superior resilience and adaptability of Kazakh sheep, Teha sheep have become a key resource for lamb production in the regional sheep industry. However, intensive selection and limited population size may elevate inbreeding levels, potentially leading to inbreeding depression that affects the growth rate and reproductive traits (Falconer, 1996).

The primary objective of this study was to evaluate the genetic diversity and inbreeding levels of the Teha sheep population using pedigree and genomic data. The findings derived from this research are intended to provide valuable insights for guiding the sustainable breeding and genetic management strategies of this breed.

Materials and methods

Study population

Teha sheep, a meat-producing crossbred derived from Texel rams and Kazakh ewes, were sourced from the Sheep Nucleus Stud Farm in Zhaosu County, Yili, Xinjiang, China. The current population has been under selection since 2018, primarily for growth performance and meat yield. A total of 2,652 individuals were analyzed, that is, 321 rams and 2,331 ewes, with complete pedigree records spanning five generations (2018–2023). The number of ewes was evenly distributed across the years, whereas most rams were born between 2022 and 2023. This sampling ensured adequate representation across reproductive years, particularly for maternal lines, enabling reliable estimation of relatedness and inbreeding levels.

Sample collection and data

Genomic analysis involved 1,273 Teha sheep, randomly sampled from the 2,652 pedigree population, retaining potential relatedness to support inbreeding and genetic diversity studies. Blood samples were collected from each sheep, with DNA extracted using a commercial kit. Genomic genotyping was performed using the Illumina OvineSNP50 BeadChip. To enhance the analysis of economic traits, the chip was supplemented with custom single-nucleotide polymorphism (SNP) markers relevant to economic traits, yielding 64,734 markers. Genotyping was conducted by Beijing Compass Biotechnology Co., Ltd.

SNP genotyping and quality control

Genomic data were obtained from 1,273 individuals genotyped with 64,734 SNP markers distributed across 26 autosomes. Quality control was conducted using PLINK v1.90 software, with the following criteria applied (Chang et al., 2015): (1) SNPs with a missing call rate >5% were excluded, (2) SNPs with minor allele frequency (MAF) < 0.01 were removed, (3) SNPs deviating from the Hardy–Weinberg equilibrium (HWE) (p < 1 × 10−6) were excluded, and (4) samples with a genotyping success rate < 90% were removed. After quality control, 1,271 samples and 56,297 high-quality SNPs were retained for subsequent analyses.

Genetic diversity evaluation

The genetic diversity of the Teha sheep population was assessed using six parameters derived from SNP data: Ne, Ho, He, PIC, MAF, and ENA.

Ne, chosen for its ability to track historical population dynamics sensitive to inbreeding, reflects the risk of inbreeding and genetic drift, and it was estimated via LD using the formula (Waples and Do, 2010):

Ne=1/4c×1/r21,

where r2 is the LD value between SNP loci and c is the genetic distance between loci measured in centimorgans (cM). Default parameters of PLINK were used for calculations.

Ho and He were selected to assess heterozygosity loss due to inbreeding by comparing actual and potential genetic variation (Nei, 1973):

Ho=1Nk=1NHkn,He=2n2n11Nk=1N1pki2,

where N is the total number of loci, Hk is the number of heterozygous individuals at locus k, n is the population size, and pki is the frequency of allele i at locus k.

PIC, chosen for its utility in evaluating the informativeness of genetic markers in inbreeding, indicates reduced marker diversity due to inbreeding-driven fixation, and it was calculated as follows (Botstein et al., 1980):

PIC=1i1nPi2i=1n1j=i+1n2Pi2Pj2,

where pi and pj are the allele frequencies at each locus and n is the number of alleles.

MAF, chosen to ensure reliable SNP data for inbreeding analysis by filtering low-frequency alleles, supports accurate detection of inbreeding-related diversity loss, and it was computed with a threshold of >0.01 (Chang et al., 2015).

ENA was calculated using PLINK v1.90. This parameter reflects the number of alleles required to achieve the observed level of heterozygosity in a population, assuming equal allele frequencies.

Detection and classification of ROH

ROH in the sample were identified using PLINK v1.90 software. The criteria for ROH identification included the following: (1) a minimum ROH length of 1 Mb; (2) at least 50 SNPs within an ROH, ensuring result reliability and reducing false positives; (3) allowance for one SNP per 100 kb, aligning with the chip’s average SNP spacing; and (4) a maximum of one missing genotype allowed per sliding window, tolerating minor genotyping errors (e.g., copy number variations) and enhancing detection stability (Meyermans et al., 2020; Arias et al., 2022). All ROH segments were categorized into four classes based on length: 1–5 Mb, 5–10 Mb, 10–20 Mb, and >20 Mb (Ferencakovic et al., 2011; Paim et al., 2020; Zayas et al., 2024).

Assessment of genomic inbreeding coefficients

FPED values were estimated using ASReml-R software based on pedigree information. The relationship matrix (A) was constructed from the pedigree data, and its inverse was utilized to calculate the coefficients. The calculation followed the formula FPED = diag(A)- 1, where A represents the relationship matrix (Butler et al., 2017).

SNP-based genomic inbreeding coefficients (FHAT1, FHAT2, FHAT3, and FHOM) were calculated using PLINK software, reflecting different aspects of genomic inbreeding (Chang et al., 2015; Dadousis et al., 2022). Specifically, FHAT1 estimates inbreeding as the excess variance standardization, whereas FHAT2 quantifies excess homozygosity, which is analogous to Fhet. The FHAT3 coefficient measures the correlation between alleles within gametes. FHOM represents the proportion of observed homozygous genotypes relative to the expected number under the Hardy–Weinberg equilibrium.

The FROH was determined to evaluate inbreeding patterns derived from extended homozygous regions in the genome. The FROH was calculated as FROH = ∑ LRon/LAuT, where ∑ LRon represents the total length of ROH identified in an individual’s genome and LAuT refers to the total length of the autosomal genome. Furthermore, chromosome-specific inbreeding coefficients (FROHOAR) were calculated as FROHOAR = LROHOAR/LoAR, where LROHOAR denotes the ROH length within a specific chromosome and LoAR is the total autosomal length for that chromosome.

Identification of ROH hotspots and functional annotation

To identify potential ROH hotspot regions, the percentage of SNP occurrence within ROHs was calculated by dividing the number of animals in which a specific SNP occurred by the total number of animals analyzed. The SNP positions were plotted across the genome, and genomic regions with the top 1% SNP occurrence were considered as potential ROH hotspots (Ferencakovic et al., 2013).

Annotated genes located within the identified ROH hotspots were identified and annotated based on the Ovis aries reference genome (Oar_v3.1) using GFF annotation files obtained from the National Center for Biotechnology Information (NCBI).

LD block and QTL annotation analysis

The LD structure within the ROH hotspot regions was analyzed using LDBlockShow v1.40 (Dong et al., 2021). LD heatmaps were generated based on SNP data from chromosome 2 (112,011,937–119,890,211 bp) to visualize the patterns of linkage disequilibrium and identify significant LD blocks within the hotspot regions.

To investigate potential associations between the ROH hotspot regions and economically important traits, we first defined the genomic coordinates of the LD regions, QTL regions, and ROH hotspot regions. QTL annotations were retrieved from the Sheep QTL Database (SheepQTLdb) (https://www.animalgenome.org/cgi-bin/QTLdb/OA/index). This database provides curated QTL data based on the O. aries genome assembly OAR3.1. Genomic coordinates of the hotspot regions were cross-referenced with the QTL data to identify relevant QTLs.

Subsequently, to identify genes within these defined regions (including LD regions, QTL regions, and ROH hotspot regions), we performed extraction using bcftools v1.12 (https://samtools.github.io/bcftools/bcftools.html) in conjunction with the genome annotation file (GFF format) for the OAR3.1 genome. This integration facilitated the annotation of traits associated with ROH hotspot regions and further understanding of their potential biological background through the identified genes.

Results

Genetic diversity

Six diversity parameters were calculated based on SNP data (Table 1). The effective population size (Ne) was 214.893. Both the observed heterozygosity (Ho) and expected heterozygosity (He) were 0.347. The mean polymorphic information content (PIC) was 0.345. The effective number of alleles was 1.602. The minor allele frequency (MAF) was 0.264. The PIC distribution showed 40% of SNPs with values 0.45–0.60, 35% with 0.30–0.45, 20% with 0.15–0.30, and 10% with <0.15 (Figure 1A). The MAF distribution is presented in Figure 1B.

Table 1
www.frontiersin.org

Table 1. Genetic diversity parameters calculated in a population of Teha sheep.

Figure 1
www.frontiersin.org

Figure 1. Distributions of genetic diversity parameters among SNPs in the Teha sheep. (A) Distribution of polymorphic information content (PIC) values across SNP markers. PIC values were grouped into defined intervals to reflect marker informativeness. (B) Minor allele frequency (MAF) distribution showing the proportion of SNPs in each frequency bin.

Analysis of ROH

A total of 26,560 ROH segments were identified across 1,271 samples from the Teha sheep population. These segments exhibited individual ROH lengths ranging from 1.01 Mb to 103 Mb. The average ROH length was 3.06 Mb. The number of ROH segments was correlated with the total length covered by ROHs (Figure 2A).

Figure 2
www.frontiersin.org

Figure 2. Patterns of runs of homozygosity (ROH): (A) scatterplot showing the correlation between the number of ROH segments and the total ROH length per individual, (B) proportion of ROH segments falling within different length categories, and (C) distribution of ROH in chromosomes.

ROH segments were distributed as follows: 1–5 Mb (35.6%), 5–10 Mb (29.7%), 10–20 Mb (21.1%), and >20 Mb (13.6%) (Table 2). Variation in the number of ROH segments was observed across chromosomes. Chromosome 1 exhibited the highest number of ROH segments (3,549), whereas chromosome 25 exhibited the lowest (451). Other chromosomes displayed intermediate counts, reflecting differences in chromosomal architecture (Figure 2C).

Table 2
www.frontiersin.org

Table 2. Distribution of runs of homozygosity (ROH) of different lengths in Teha sheep.

Recognizing the potential influence of data filtering, we conducted a complementary ROH analysis wherein MAF and HWE filters were not applied. The outcomes of this parallel investigation largely mirrored the patterns and significant findings observed in the filtered dataset. Our decision to employ MAF and HWE filters in the final analysis was grounded in their established role within genomic studies for data quality control, specifically in eliminating low-quality or potentially erroneous SNPs, thereby ensuring the accuracy and broader applicability of our findings.

Pedigree and genomic inbreeding analysis

The basic statistics for inbreeding coefficients based on pedigree and genomic data are presented in Table 3. The pedigree-based inbreeding coefficient (FPED) ranged from 0 to 0.25, with an average of 0.001, indicating a relatively low level of inbreeding within the Teha sheep population. The inbreeding coefficient derived from FROH ranged from 0.0004 to 0.241, with a mean value of 0.044 and a standard deviation of 0.031, demonstrating that most individuals exhibited low FROH values. However, a small subset exhibited higher FROH values, approaching 0.18. The SNP-based inbreeding coefficients (FHAT1, FHAT2, FHAT3, and FHOM) exhibited distinct distributions, with average values close to zero. Among these, FHAT1 displayed the widest range, which was from −0.228 to 0.342 (Table 3).

Table 3
www.frontiersin.org

Table 3. Statistics of inbreeding coefficients based on pedigree and genomic information.

The violin plot showed the distribution of inbreeding coefficients calculated using different methods. FHAT1 had the largest distribution range, whereas FHOM had the smallest range, and FPED exhibited a concentrated distribution (Figure 3A).

Figure 3
www.frontiersin.org

Figure 3. Distribution and patterns of inbreeding coefficients of Teha sheep: (A) violin plot illustrating the distribution of inbreeding coefficients (inbreeding coefficient based on excess variance standardization (FHAT1), inbreeding coefficient based on excess homozygosity (FHAT2), inbreeding coefficient based on allele correlation within gametes (FHAT3), inbreeding coefficient based on the proportion of homozygous genotypes (FHOM), pedigree-based inbreeding coefficient (FPED), and runs of homozygosity-based inbreeding coefficient (FROH)) calculated using different methods. The width of each violin represents the density of the coefficient values and the white dot indicates the mean. (B) Histogram showing the distribution of FROH values across all individuals. (C) Chromosomal distribution of FROH values, where each point represents the mean FROH in a chromosome.

The histogram of FROH values revealed that the majority of individuals fell within the 0–0.05 range, with only a few reaching 0.2, underscoring the overall low level of inbreeding within the Teha sheep population (Figure 3B). Chromosomal-level analysis showed variability in FROH across chromosomes, with chromosome 25 exhibiting the lowest FROH value, averaging only 0.01 (Figure 3C).

Genomic regions with ROH hotspot

Analysis of the Teha sheep genome identified a prominent ROH hotspot on chromosome 2, spanning positions 112,011,937 to 119,890,211 bp. This region covered 7.88 Mb and contained 184 SNPs (Figure 4; Table 4).

Figure 4
www.frontiersin.org

Figure 4. Manhattan plot of the occurrence of single-nucleotide polymorphisms (SNPs) in runs of homozygosity (ROH) islands across different chromosomes. The horizontal line corresponds to the threshold calculated as SNPs with a p-value of ROH incidence higher than 0.999.

Table 4
www.frontiersin.org

Table 4. Candidate regions and genes identified from the ROH hotspot in Teha sheep.

Key candidate genes identified within this hotspot include myostatin (MSTN), a gene associated with double muscling trait of Texel sheep, and collagen type V alpha 2 chain (COL5A2), which contributes to tissue structure and development. Genes related to immune function were also identified, such as interferon regulatory factor 6 (IRF6) and phosphatidylinositol glycan anchor biosynthesis class P (PIGP). Additionally, nuclear pore complex protein 43 (NUP43) and tubulin gamma complex-associated protein 5 (TUBGCP5) were detected, which are involved in cellular processes and stress response.

LD blocks and QTL annotation analysis of the ROH hotspot

To investigate the 7.88 Mb hotspot region on chromosome 2, an LD heatmap was constructed to illustrate linkage disequilibrium patterns within the region. The analysis revealed significant LD block structures, with 24 LD blocks being identified, ranging in size from 9.6 kb to 199.5 kb (Figure 5).

Figure 5
www.frontiersin.org

Figure 5. Linkage disequilibrium (LD) heatmap of the ROH hotspot region on chromosome 2. The x-axis and y-axis represent SNP positions in the region, and the color key shows the R2 value, with red indicating high LD and yellow indicating low LD.

Based on the LD block analysis, QTL annotation analysis was performed to identify regions associated with economically important traits. Two QTLs were identified, both significantly linked to the body condition score (QTL 263631). These QTLs were located at positions 111,201,892–114,207,253 bp and 114,746,188–116,378,689 bp, with lengths of 3.01 Mb and 1.63 Mb, respectively, encompassing 25 and 36 SNPs (Table 5). Further analysis of the candidate genes within the ROH hotspot, QTL, and LD block regions revealed distinct overlaps and unique genes across the three regions. A total of 36 candidate genes were identified. Red nodes represent three key candidate genes shared across all three regions (ROH hotspot, QTL, and LD blocks): Cytoplasmic FMR1 Interacting Protein 1 (CYFIP1), Sin3A Associated Protein 130 (SAP130), and UDP-Glucose Glycoprotein Glucosyltransferase 1 (UGGT1). Orange nodes indicate genes shared between any two regions, including Non-Imprinted in Prader–Willi/Angelman Syndrome 1 (NIPA1), Non-Imprinted in Prader–Willi/Angelman Syndrome 2 (NIPA2), Heparan Sulfate 6-O-Sulfotransferase 1 (HS6ST1), Tubulin Gamma Complex Associated Protein 5 (TUBGCP5), and APC Membrane Recruitment Protein 3 (AMER3). Blue nodes represent genes unique to a specific genomic region, such as Myostatin (MSTN), Ankyrin Repeat Domain 22 (ANKAR), Collagen Type V Alpha 2 Chain (COL5A2), Engulfment Adapter PTB Domain Containing 1 (GULP1), and Bridging Integrator 1 (BIN1, Figure 6).

Table 5
www.frontiersin.org

Table 5. Summary of QTL loci and candidate genes harbored within the ROH hotspot.

Figure 6
www.frontiersin.org

Figure 6. Overlap of candidate genes identified in ROH hotspot, QTL, and LD block. Red nodes represent key candidate genes shared across all three regions. Orange nodes indicate genes shared between any two of the three regions. Blue nodes represent genes unique to each of these regions.

Discussion

Genetic diversity is a critical factor influencing population adaptability and long-term sustainable production, especially for livestock undergoing artificial selection. In this study, multiple genetic diversity parameters characterized the genetic structure of the Teha sheep population. The Ho (0.347), He (0.347), and PIC (0.345) values collectively indicate balanced and robust genetic variability within the Teha sheep population. Notably, the Ho of Teha sheep surpasses that of several indigenous breeds, including the Yongdeng Qishan sheep (Ho = 0.199) and Lanzhou fat-tailed sheep (Ho = 0.341), suggesting superior allelic diversity (Ma KY. et al., 2024). Populations with Ho values ≥0.30 are widely recognized as genetically healthy, capable of maintaining adaptability, and reducing the risks of inbreeding depression and genetic drift (Allendorf et al., 2012; Harmon and Braude, 2010). Maintaining neutral genetic diversity above certain thresholds is essential for long-term population viability in livestock (Harmon and Braude, 2010). In addition, the low FROH (0.044) and relatively large Ne (214.89) further reflect the genetic robustness of the Teha sheep population, which supports maintaining the genetic health and adaptive capacity essential for ongoing genetic improvement (Ma K. et al., 2024). The moderate PIC (0.345) indicates notable genetic variation within the population. A moderate mean MAF (0.253) across the analyzed loci suggests a substantial proportion of medium-frequency alleles. Collectively, these genetic characteristics indicate that the population maintains a favorable level of genetic variability, providing a basis for sustained breeding progress and enhanced production efficiency (Zhu et al., 2024).

The analysis of inbreeding coefficients in the Teha sheep population revealed consistently low levels across both pedigree-based (FPED = 0.001) and genome-based estimates, including FROH (0.044), FHOM (−0.007), FHAT1 (−0.050), FHAT2 (−0.006), and FHAT3 (−0.006). Whereas FPED and FROH are conventional measures of inbreeding, FHOM and FHAT coefficients, derived from genomic data, provide additional insights into the population’s genetic structure. Notably, the negative values observed for FHOM and FHAT1-3 indicate an excess of heterozygosity compared to the expected heterozygosity under the Hardy–Weinberg equilibrium, which is potentially due to recent population admixture or selection pressures favoring genetic diversity (Alemu et al., 2021; Caballero et al., 2022). Studies confirm that negative inbreeding coefficients can reflect higher than expected heterozygosity under such conditions, often resulting from population mixing or selection (Lukić et al., 2020). The low FROH value (0.044) in Teha sheep is lower than that of crossbred Black Slavonian pigs (FROH = 0.098) and comparable to locally adapted breeds such as Simmental cattle (FROH = 0.059) (Visser et al., 2023), highlighting their genetic diversity. These findings underscore effective breeding management practices that maintain low inbreeding levels and high genetic diversity, which is crucial for Teha sheep’s long-term viability and productivity. Thus, these results provide a foundation for developing sustainable breeding strategies that preserve genetic health.

ROH hotspots are genomic regions marked by high homozygosity and are frequently linked to selection pressures and traits associated with adaptability and fitness (Lee et al., 2013). These regions have been widely identified in livestock species such as cattle and sheep, underscoring their pivotal role in adaptive evolution and economically important production traits. Our work identified a primary ROH hotspot in Teha sheep on chromosome 2, spanning 112,011,937–119,890,211 base pair, with a length of 7.88 Mb and encompassing 184 SNPs. This notable clustering of SNPs underscores the potential role of harbored genes associated with growth and adaptability, offering crucial insights into the genetic framework of Teha sheep.

The ROH hotspot on chromosome 2 contains key genes linked to muscle growth, fat deposition, and skeletal development, all of which are essential for production efficiency and adaptability. MSTN stands out as a well-documented regulator of muscle growth, with its mutations associated with the double muscling phenotype in Texel sheep and Belgian Blue cattle, leading to significantly enhanced meat yields (Garcia et al., 2019; Aiello et al., 2018). Similarly, COL5A2 plays a critical role in connective tissue integrity, contributing to carcass strength and quality, whereas C2H2ORF88 is involved in muscle fiber organization and regeneration, supporting skeletal robustness (Harris et al., 2023; Ramljak et al., 2024). ROH analysis, as an effective tool, has been widely applied in sheep genomics studies to identify selection signatures underlying specific traits. For instance, a prominent ROH island on Ovis aries chromosome 6 (OAR6) was identified in French dairy sheep through ROH analysis, and it was associated with milk production traits (Rodríguez-Ramilo et al., 2021). Such studies indicate that different sheep breeds may experience specific selection pressures on distinct chromosomal regions due to differing breeding objectives. The ROH hotspot identified on chromosome 2 in Teha sheep in our study suggests potential selection targeting its meat traits. These findings further support a strong genetic foundation for production traits in the Teha sheep population and provide a basis for developing sustainable breeding strategies that preserve genetic health.

Fat metabolism-related genes, such as TUBGCP5 and PIGP, were also identified within the hotspot. TUBGCP5 regulates the intramuscular fat content, which is critical for meat quality in Hanwoo cattle, whereas PIGP influences the fat distribution in Merino sheep (Garcia et al., 2019; Aiello et al., 2018). Additionally, skeletal development genes, including NIPA1 and NIPA2, were enriched in this region, underscoring their role in bone mineralization and structural strength (Harris et al., 2023). The functional significance of these genes highlights the selective pressures acting on this region and their importance in Teha sheep’s adaptive and productive traits.

In this study, we further integrated QTL annotation analysis and LD block analysis, revealing two significant QTLs (QTL 263631) strongly associated with the ROH hotspot on chromosome 2. Both QTLs were linked to the body condition score, located at positions 111,201,892–114,207,253 bp and 114,746,188–116,378,689 bp, spanning 3.01 Mb and 1.63 Mb, and containing 25 and 36 SNPs, respectively. The LD block analysis demonstrated substantial overlap with these QTL regions, with LD blocks ranging in size from 9.6 kb to 199.5 kb (Figure 5). This significant overlap underscores the evolutionary importance of these loci under strong selective pressure, particularly for traits related to growth and production (Ramljak et al., 2024).

Through comprehensive candidate gene analysis, CYFIP1, SAP130, and UGGT1 were identified as key genes shared across the ROH hotspot, QTLs, and LD blocks, directly implicating their functions in growth, carcass quality, and fat deposition. For instance, CYFIP1 has been shown to regulate growth efficiency and carcass weight, with evidence from studies in Charolais cattle (Colombi et al., 2024). SAP130 plays a crucial role in adipose metabolism and muscle development, exhibiting strong associations with carcass quality in various cattle breeds (Colombi et al., 2024). UGGT1, on the other hand, is essential for protein quality control under cellular stress and has been linked to meat quality traits in Hanwoo cattle and other livestock species (Mastrangelo et al., 2018).

Although we provide important insights into the genetic diversity and selection signals of the Teha sheep population in this study, certain limitations should be noted. The analysis was based on a single population with a limited sample size, which may not fully represent the genetic structure of the broader population. Previous studies have demonstrated that the population structure and sampling strategy can significantly influence estimates of ROH and genetic variability (Peripolli et al., 2017; Ceballos et al., 2018). Additionally, during the analysis, it is noteworthy that ROH information for chromosomes 24, 26, and the end of chromosome 23 could not be presented in Figures 2C, 3C, 4. This absence does not imply a biological lack of ROHs in these regions but rather stems from various technical limitations encountered during the original SNP chip data processing, such as insufficient local SNP density or high genotype missingness, which precluded reliable ROH detection and visualization for these specific areas. Nevertheless, the core findings of this study, particularly the identification of major ROH hotspots, are based on chromosomal regions with complete and highly reliable SNP data. Consequently, these localized data limitations do not substantially affect the validity and generality of our main conclusions. Future studies incorporating multiple populations and higher-resolution data such as whole-genome sequencing would help validate and extend our findings.

Moreover, the ROH hotspots and candidate genes identified here offer promising targets for genomic selection. Incorporating these regions into marker-assisted or genomic selection frameworks could accelerate genetic gains in traits related to growth, carcass quality, and adaptability. Several studies have already shown the practical use of ROH-based selection in livestock breeding programs, supporting the potential of this approach in Teha sheep and other crossbred populations (Peripolli et al., 2017).

Conclusion

In this study, we provide a comprehensive evaluation of genetic diversity, inbreeding patterns, and selection signals in Teha sheep, a breed adapted to the unique environmental conditions of Xinjiang, China. The results reveal low levels of inbreeding through pedigree and genomic data, highlighting the population’s high genetic variability. A significant ROH hotspot was identified on chromosome 2, harboring key genes associated with economically important traits, such as muscle growth, fat metabolism, and skeletal development. These findings highlight the utility of genomic approaches in dissecting functional variation underlying complex traits. The identified ROH regions and associated candidate genes offer actionable targets for genomic selection and conservation efforts in Teha sheep and related crossbred populations. We advocate for the integration of such genomic insights into the design of data-driven breeding strategies aimed at enhancing long-term productivity and environmental resilience in livestock systems.

Data availability statement

The data analyzed in this study is subject to the following licenses/restrictions: datasets belong to Xinjiang Academy of Animal Science. Requests to access these datasets should be directed to bGl1bWluZ2p1bkB4amFhcy5uZXQ=.

Ethics statement

Ethical approval was not required for the study involving animals in accordance with the local legislation and institutional requirements because the study involved the use of Teha sheep, a breed maintained at the Sheep Nucleus Stud Farm in Zhaosu County, Xinjiang, China. The research primarily utilized pedigree and genomic data, with no invasive procedures or direct interaction with the animals that would require ethical approval. The research adhered to local animal management practices, and the study was conducted in accordance with applicable national and institutional guidelines for animal research.

Author contributions

SW: software, validation, visualization, writing – original draft, and writing – review and editing. LL: investigation and writing – original draft. DZ: investigation and writing – original draft. WJ: investigation and writing – original draft. HK: investigation and writing – original draft. SH: conceptualization, supervision, and writing – review and editing. ML: conceptualization, supervision, writing – review and editing, and funding acquisition.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This research was funded by the Key Program of Breeding of Xinjiang: Genetic improvement and breeding of indigenous sheep breeds of Xinjiang (Grant No. 2022A02001-2) and the Tianshan Talent Training Program (Grant No. 2023TSYCTD0007).

Acknowledgments

The authors would like to thank the staff of the Xinjiang Academy of Animal Science and the Sheep Nucleus Stud Farm in Zhaosu County for their support in sample collection and data provision. Special thanks are extended to the laboratory team for their assistance in SNP genotyping and data analysis.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2025.1576125/full#supplementary-material

Abbreviations

FPED, pedigree-based inbreeding coefficient; FROH, runs of homozygosity-based inbreeding coefficient; Ho, observed heterozygosity; He, expected heterozygosity; PIC, polymorphic information content; ROH, runs of homozygosity; QTL, quantitative trait loci; LD, linkage disequilibrium; MAF, minor allele frequency; Ne, effective population size; ENA, effective number of alleles; FHAT1, inbreeding coefficient based on excess variance standardization; FHAT2, inbreeding coefficient based on excess homozygosity; FHAT3, inbreeding coefficient based on allele correlation within gametes; FROHOAR, chromosome-specific inbreeding coefficients; FHOM, inbreeding coefficient based on the proportion of homozygous genotypes; SNPs, single-nucleotide polymorphisms.

References

Aiello, D., Patel, K., and Lasagna, E. (2018). The myostatin gene: an overview of mechanisms of action and its relevance to livestock animals. Anim. Genet. 49 (6), 505–519. doi:10.1111/age.12696

PubMed Abstract | CrossRef Full Text | Google Scholar

Alemu, S. W., Kadri, N. K., Harland, C., Faux, P., Charlier, C., Caballero, A., et al. (2021). An evaluation of inbreeding measures using a whole-genome sequenced cattle pedigree. Heredity 126 (3), 410–423. doi:10.1038/s41437-020-00383-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Allendorf, F. W., Luikart, G. H., and Aitken, S. N. (2012). Conservation and the genetics of populations[M]. John Wiley and Sons.

Google Scholar

Arias, K. D., Álvarez, I., Gutiérrez, J. P., Fernandez, I., Menéndez, J., Menéndez-Arias, N. A., et al. (2022). Understanding Mendelian errors in SNP arrays data using a Gochu Asturcelta pig pedigree: genomic alterations, family size and calling errors. Sci. Rep. 12 (1), 19686. doi:10.1038/s41598-022-24340-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Botstein, D., White, R. L., Skolnick, M., and Davis, R. W. (1980). Construction of a genetic linkage map in man using restriction fragment length polymorphisms. Am. J. Hum. Genet. 32 (3), 314–331.

PubMed Abstract | Google Scholar

Butler, D. G., Cullis, B. R., Gilmour, A. R., Gogel, B. J., and Thompson, R. (2017). ASReml-R reference manual version 4, HP1 1ES. UK: VSN International Ltd, Hemel Hempstead.

Google Scholar

Caballero, A., Fernández, A., Villanueva, B., and Toro, M. A. (2022). A comparison of marker-based estimators of inbreeding and inbreeding depression. Genet. Sel. Evol. 54 (1), 82. doi:10.1186/s12711-022-00772-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Ceballos, F. C., Joshi, P. K., Clark, D. W., Ramsay, M., and Wilson, J. F. (2018). Runs of homozygosity: windows into population history and trait architecture. Nat. Rev. Genet. 19 (4), 220–234. doi:10.1038/nrg.2017.109

PubMed Abstract | CrossRef Full Text | Google Scholar

Chang, C. C., Chow, C. C., Tellier, L. C. A. M., Vattikuti, S., Purcell, S. M., and Lee, J. J. (2015). Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4 (1), s13742–015. doi:10.1186/s13742-015-0047-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Colombi, D., Rovelli, G., Luigi-Sierra, M. G., Ceccobelli, S., Guan, D., Perini, F., et al. (2024). Population structure and identification of genomic regions associated with productive traits in five Italian beef cattle breeds. Sci. Rep. 14 (1), 8529. doi:10.1038/s41598-024-59269-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Dadousis, C., Ablondi, M., Cipolat-Gotet, C., van Kaam, J. T., Marusi, M., Cassandro, M., et al. (2022). Genomic inbreeding coefficients using imputed genotypes: assessing different estimators in Holstein-Friesian dairy cows. J. dairy Sci. 105 (7), 5926–5945. doi:10.3168/jds.2021-21125

PubMed Abstract | CrossRef Full Text | Google Scholar

Doekes, H. P., Bijma, P., Veerkamp, R. F., de Jong, G., Wientjes, Y. C. J., and Windig, J. J. (2020). Inbreeding depression across the genome of Dutch Holstein Friesian dairy cattle. Genet. Sel. Evol. 52, 64–18. doi:10.1186/s12711-020-00583-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Dong, Y., Xie, M., Jiang, Y., et al. (2021). LDBlockShow: a fast and convenient tool for visualizing linkage disequilibrium and haplotype structure. Hereditas 158, 3.

PubMed Abstract | Google Scholar

Falconer, D. S. (1996). Introduction to quantitative genetics. 4th edn. New Delhi, India: Pearson Education India.

Google Scholar

Ferencakovic, M., Hamzic, E., Gredler, B., et al. (2011). Runs of homozygosity reveal genome-wide autozygosity in the Austrian Fleckvieh cattle. Agric. Conspec. Sci. 76 (4), 325–329.

Google Scholar

Ferencakovic, M., Hamzic, E., Gredler, B., et al. (2013). Runs of homozygosity reveal genome-wide autozygosity in the Austrian Fleckvieh cattle. BMC Genet. 14, 43.

PubMed Abstract | Google Scholar

Forneris, N. S., Garcia-Baccino, C. A., Cantet, R. J. C., and Vitezica, Z. G. (2021). Estimating inbreeding depression for growth and reproductive traits using pedigree and genomic methods in Argentinean Brangus cattle. J. Animal Sci. 99 (11), skab289. doi:10.1093/jas/skab289

PubMed Abstract | CrossRef Full Text | Google Scholar

Garcia, L. E., et al. (2019). LIM2 and body composition in livestock. J. Livest. Stud. 25 (3), 180–195.

Google Scholar

Guo, Y., Liang, J., Lv, C., Wang, Y., Wu, G., Ding, X., et al. (2022). Sequencing reveals population structure and selection signatures for reproductive traits in Yunnan semi-fine wool sheep (Ovis aries). Front. Genet. 13, 812753. doi:10.3389/fgene.2022.812753

PubMed Abstract | CrossRef Full Text | Google Scholar

Harmon, L. J., and Braude, S. (2010). Conservation of small populations: effective population sizes, inbreeding, and the 50/500 rule. An introduction to methods and models in ecology. Evol. conservation Biol. 47, 546–555.

Google Scholar

Harris, A., et al. (2023). NIPA genes in bone development of sheep. J. Bone Mineral Res. 38 (1), 123–134.

Google Scholar

He, S., Di, J., Han, B., Chen, L., Liu, M., and Li, W. (2020). Genome-wide scan for runs of homozygosity identifies candidate genes related to economically important traits in Chinese Merino. Animals 10 (3), 524. doi:10.3390/ani10030524

PubMed Abstract | CrossRef Full Text | Google Scholar

Howard, J. T., Pryce, J. E., Baes, C., and Maltecca, C. (2017). Invited review: inbreeding in the genomics era: inbreeding, inbreeding depression, and management of genomic variability. J. dairy Sci. 100 (8), 6009–6024. doi:10.3168/jds.2017-12787

PubMed Abstract | CrossRef Full Text | Google Scholar

Keller, M. C., Visscher, P. M., and Goddard, M. E. (2011). Quantification of inbreeding due to distant ancestors and its detection using dense single nucleotide polymorphism data. Genetics 189 (1), 237–249. doi:10.1534/genetics.111.130922

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, K. T., Chung, W. H., Lee, S. Y., Choi, J. W., Kim, J., Lim, D., et al. (2013). Whole-genome resequencing of Hanwoo (Korean cattle) and insight into regions of homozygosity. BMC Genomics 14, 1–14. doi:10.1186/1471-2164-14-519

PubMed Abstract | CrossRef Full Text | Google Scholar

Lukić, B., Ferenčaković, M., Šalamon, D., Čačić, M., Orehovački, V., Iacolina, L., et al. (2020). Conservation genomic analysis of the Croatian indigenous Black Slavonian and Turopolje pig breeds. Front. Genet. 11, 261. doi:10.3389/fgene.2020.00261

PubMed Abstract | CrossRef Full Text | Google Scholar

Ma, K., Song, J., Li, D., Li, T., and Ma, Y. (2024b). Genetic diversity and selection signal analysis of hu sheep based on SNP50K BeadChip. Animals 14 (19), 2784. doi:10.3390/ani14192784

PubMed Abstract | CrossRef Full Text | Google Scholar

Ma, K. Y., Song, J. J., Li, D. P., Wu, Y., Wang, C. H., Liu, Z. L., et al. (2024a). Genomic structure analysis and construction of DNA fingerprint for four sheep populations. animal 18 (4), 101116. doi:10.1016/j.animal.2024.101116

PubMed Abstract | CrossRef Full Text | Google Scholar

Mastrangelo, S., Ciani, E., Sardina, M. T., Sottile, G., Pilla, F., Portolano, B., et al. (2018). Runs of homozygosity reveal genome-wide autozygosity in Italian sheep breeds. Anim. Genet. 49 (1), 71–81. doi:10.1111/age.12634

PubMed Abstract | CrossRef Full Text | Google Scholar

Meyermans, R., Gorssen, W., Buys, N., and Janssens, S. (2020). How to study runs of homozygosity using PLINK? A guide for analyzing medium density SNP data in livestock and pet species. BMC genomics 21, 94–14. doi:10.1186/s12864-020-6463-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Nei, M. (1973). Analysis of gene diversity in subdivided populations. Proc. Natl. Acad. Sci. 70 (12), 3321–3323. doi:10.1073/pnas.70.12.3321

PubMed Abstract | CrossRef Full Text | Google Scholar

Paim, T. P., Hay, E. H. A., Wilson, C., Thomas, M. G., Kuehn, L. A., Paiva, S. R., et al. (2020). Genomic breed composition of selection signatures in Brangus beef cattle. Front. Genet. 11, 710. doi:10.3389/fgene.2020.00710

PubMed Abstract | CrossRef Full Text | Google Scholar

Peripolli, E., Munari, D. P., Silva, M., Lima, A. L. F., Irgang, R., and Baldi, F. (2017). Runs of homozygosity: current knowledge and applications in livestock. Anim. Genet. 48 (3), 255–271. doi:10.1111/age.12526

PubMed Abstract | CrossRef Full Text | Google Scholar

Ramljak, J., Špehar, M., Ceranac, D., Držaić, V., Pocrnić, I., Barać, D., et al. (2024). Genomic characterization of local Croatian sheep breeds-effective population size, inbreeding and signatures of selection. Animals 14 (13), 1928. doi:10.3390/ani14131928

PubMed Abstract | CrossRef Full Text | Google Scholar

Rodríguez-Ramilo, S. T., Reverter, A., and Legarra, A. (2021). Islands of runs of homozygosity indicate selection signatures in Ovis aries 6 (OAR6) of French dairy sheep. JDS Commun. 2 (3), 132–136. doi:10.3168/jdsc.2020-0011

PubMed Abstract | CrossRef Full Text | Google Scholar

Rostamzadeh Mahdabi, E., Tian, R., Tian, J., Asadollahpour Nanaie, H., Wang, X., Zhao, M., et al. (2025). Uncovering genomic diversity and signatures of selection in red Angus× Chinese red steppe crossbred cattle population. Sci. Rep. 15 (1), 12977. doi:10.1038/s41598-025-98346-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Der Valk, T., Díez-del-Molino, D., Marques-Bonet, T., Guschanski, K., and Dalén, L. (2019). Historical genomes reveal the genomic consequences of recent population decline in eastern gorillas. Curr. Biol. 29 (1), 165–170. e6. doi:10.1016/j.cub.2018.11.055

PubMed Abstract | CrossRef Full Text | Google Scholar

Visser, B., Van Marle-Köster, E., and Taylor, J. F. (2023). Genomic characterization of runs of homozygosity and inbreeding in Jersey and Holstein cattle. Anim. Genet. 54 (1), 31–42.

Google Scholar

Waples, R. S., and Do, C. (2010). Linkage disequilibrium estimates of contemporary N e using highly variable genetic markers: a largely untapped resource for applied conservation and evolution. Evol. Appl. 3 (3), 244–262. doi:10.1111/j.1752-4571.2009.00104.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Xiang, T., Christensen, O. F., Vitezica, Z. G., and Legarra, A. (2016). Genomic evaluation by including dominance effects and inbreeding depression for purebred and crossbred performance with an application in pigs. Genet. Sel. Evol. 48, 1–14. doi:10.1186/s12711-016-0271-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Zayas, G. A., Rodriguez, E. E., Hernandez, A. S., Rezende, F. M., and Mateescu, R. G. (2024). Exploring genomic inbreeding and selection signatures in a commercial Brangus herd through functional annotation. J. Appl. Genet. 65 (2), 383–394. doi:10.1007/s13353-024-00859-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhu, M., Wang, L., Ma, Z., Mao, Y., Wang, G., Zheng, R., et al. (2024). Analysis of the genetic diversity and genetic structure of jiangshan black pigs using single nucleotide polymorphism (SNP) chips. Animals: an open access journal from MDPI 14 (18), 2660. doi:10.3390/ani14182660

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: genetic diversity, genomic inbreeding coefficient, ROH, Teha sheep, candidate genes

Citation: Wang S, Liang L, Ziyayiding D, Jiao W, Kasimu H, He S and Liu M (2025) Inbreeding patterns and genetic diversity under selection in Teha sheep. Front. Genet. 16:1576125. doi: 10.3389/fgene.2025.1576125

Received: 13 February 2025; Accepted: 28 May 2025;
Published: 27 June 2025.

Edited by:

Fernanda Marcondes de Rezende, University of Florida, United States

Reviewed by:

Raziye Işık, São Paulo State University, Brazil
Gabriel Antonio Zayas, University of Florida, United States

Copyright © 2025 Wang, Liang, Ziyayiding, Jiao, Kasimu, He and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Mingjun Liu, bGl1bWluZ2p1bkB4amFhcy5uZXQ=; Sangang He, aGVzYW5nYW5nM0AxNjMuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.