- 1College of Animal Science, Xinjiang Agricultural University, Urumqi, China
- 2Institute of Animal Science, Xinjiang Academy of Animal Sciences, Urumqi, China
Reproductive traits are among the most important economic characteristics in sheep farming. Structural variations (SVs) are extensively distributed across sheep genomes and can directly or indirectly affect gene expression through a variety of mechanisms, leading to phenotypic variation among individuals or breeds. In this study, we characterized genomic SVs and identified candidate genes related to germplasm traits in seven sheep breeds. Based on the genome sequences of 73 ewes, SVs were detected using Delly, Lumpy, and Manta software tools, and the results were integrated using SURVIVOR software, leading to the identification of 107,166 SVs. The proportions of deletions, duplications, insertions, inversions, and translocations were 48.39, 38.41, 6.96, 6.22, and 0.03%, respectively. Regarding SV distribution, the number of SVs per chromosome decreased with increasing chromosome number. Both shared and breed-specific SVs were identified, with the local Tuva sheep breed showing the highest number of breed-specific SVs. Principal component analysis and phylogenetic tree results revealed a close genetic relationship between Hammari and Kabashi sheep. Selection signal analysis, gene annotation, and enrichment analysis led to the identification of potential functional SVs associated with reproductive traits, including FSHR, ADCY5, and MTNR1A. Experimental validation confirmed the regulatory activity of key SVs and their associations with the expression of target genes. This work characterized SVs in seven sheep breeds, identified genes associated with reproductive traits, and preliminarily validated the regulatory relationships between these SVs and their target genes. These findings provide essential data to support the genetic improvement of local sheep breeds in Xinjiang.
1 Introduction
Sheep (Ovis aries) are economically important animals in global livestock production. Reproductive efficiency is a key indicator of the income level in the livestock industry. Among the key economic traits, litter size is the most significant production characteristic, contributing 70 to 90% of the economic value in the sheep industry (1). The economic benefits of two-lamb litters are more than 1.6 times higher than those of single-lamb ones (2). Seasonal estrus is also a major factor limiting high productivity in sheep (3). Therefore, improving the reproductive capacity of sheep remains a key goal within the sheep industry, and is currently the subject of substantial research. Xinjiang possesses unique conditions for sheep farming and has cultivated many excellent sheep breeds with strong adaptability and disease resistance (4). For example, the local coarse wool breed, Kazakh sheep, is known for its resistance to cold, tolerance to coarse feed, strong adaptability, and good stress resistance. However, Kazakh sheep are seasonal breeders, typically producing one lamb per year, and the two-lamb rate is very low (5). Similarly, the newly developed Yunnan semi-fine wool sheep breed also exhibits seasonal breeding, with most births resulting in single lambs (6). Several foreign breeds, such as the Baikal sheep and Tuva sheep from Russia, and coarse wool breeds such as Kabashi and Hammari sheep from the Sudan, also commonly exhibit single-lamb births and seasonal breeding strategies (7–10). In contrast, the Hu sheep breed is known for its year-round estrus and, consequently, higher lambing rates (11).
Recent advancements in genomics have led to the development of new approaches to uncover the genetic basis of reproductive traits in sheep. Structural variants (SVs), major forms of genetic variation, have been shown to significantly influence important economic traits in livestock. Liang et al. (12) compared genomic SVs between Tibetan sheep, residing at high altitudes, and Hu sheep, typically found at low altitudes, seeking to determine how these variations impact gene expression during the high-altitude adaptation process in Tibetan sheep. They identified numerous SVs in the Tibetan sheep genome, which were widely distributed and had a significant impact on the expression of genes involved in key physiological processes, thus shedding light on the genetic mechanisms underlying high-altitude adaptation. Yang et al. (13) conducted a whole-genome analysis of SVs in sheep and goats and identified numerous SVs that emerged during their evolutionary history. These variations were found to be associated with key production traits and highlighted the convergent evolutionary features of sheep and goats during environmental adaptation. Notably, deletions (DELs) in genes such as BMPR2 and BMPR1B were identified as critical regulators of litter size traits. Combining pan-genome construction, SV detection, and association analysis, Li et al. (14) explored the diversity and complexity of SVs in the sheep genome, and uncovered SVs showing significant associations with tail traits. Qiao et al. (15) employed whole-genome, long-read sequencing technology to systematically detect SVs across multiple sheep breeds. In addition to characterizing the types, sizes, and distribution patterns of these SVs, they also investigated their associations with genetic diversity among breeds. Their findings offer valuable resources and novel insights for genetic research, improvement, and molecular breeding in sheep.
However, research on the genetic basis of reproductive traits in sheep remains limited. In particular, there is a lack of systematic studies focusing on SVs. The aim of this study was to detect SVs in the genome sequences of 73 ewes across seven breeds and characterize these variations using Delly, Manta, and Lumpy software. High-frequency SVs were selected, annotated, and screened for candidate genes associated with reproductive traits. To further explore the function and role of these high-frequency SVs in reproductive traits, experimental validation was carried out. This study not only contributes to the understanding of the genetic mechanisms underlying reproductive traits in Kazakh sheep but also provides a theoretical framework for genetic research and improvements in molecular breeding in sheep.
2 Materials and methods
2.1 Data sources
This study employed whole-genome resequencing data from 73 ewes of seven sheep breeds. Blood samples (5 mL) were collected from six Kazakh sheep obtained from the Yili Kazakh sheep Farm into tubes containing EDTA anticoagulant, preserved on dry ice, and sent to the laboratory for DNA extraction. Whole-genome data for the remaining sheep [20 Yunnan semi-fine wool sheep (BioProject: PRJNA783661), 18 Hu sheep (BioProject: PRJNA1053506), 8 Kabashi sheep and 6 Hammari sheep from Sudan (BioProject: PRJNA849626), 4 Tuva sheep and 8 Baikal sheep from Russia (BioProject: PRJNA656153), and 3 Kazakh sheep (BioProject: PRJNA623062)] were obtained from the NCBI database.1
2.2 Detection of genomic structural variation
2.2.1 DNA extraction and data processing
Genomic DNA was extracted from the samples using the phenol-chloroform method. The concentration and purity of the DNA samples were assessed using a NanoDrop 2000 Microvolume UV Spectrophotometer. Gel electrophoresis was employed to evaluate the integrity of the DNA samples, providing a comprehensive assessment of DNA quality. The DNA samples were stored at −80°C. Library construction and paired-end sequencing (Illumina HiSeq2500 platform; Illumina Inc.) were performed by Tianjin Compass Biotechnology Co., Ltd. (Tianjin, China).
2.2.2 Quality control and alignment of genomic data
The raw data (FASTQ format) from paired-end sequencing were subjected to quality control using fastp software to obtain clean reads, which were then aligned to the sheep reference genome ARS-UI_Ramb_v3.0 using the BWA MEM algorithm (16). The alignment results were sorted and duplicates were removed using the SortSam and MarkDuplicates modules of PICARD software.2 Finally, the sorted BAM files were statistically analyzed using Qualimap software (17).
2.2.3 SV detection and annotation
In this study, three software tools were employed for the detection and genotyping of SVs—Lumpy (18), Delly (19), and Manta (20). The results from Lumpy were further processed using SVTyper for SV genotyping. The results obtained via all three software tools were subsequently filtered and integrated using SURVIVOR (21), generating VCF files. The SVs were then annotated using SnpEff based on the annotation file of the sheep reference genome.
2.3 Population genetic analysis
2.3.1 Principal component analysis
VCF files were converted to PLINK format using VCFtools v.0.1.17 and principal component analysis (PCA) was performed using PLINK (22) v.1.90. The results were visualized with the R package ggplot2.
2.3.2 Phylogenetic tree construction
The phylogenetic tree was constructed using the neighbor-joining method. The Hamming distances between samples were calculated with PLINK software, generating a genetic distance matrix file. A Perl script was written to convert the file into.meg format, and the phylogenetic tree was then constructed using MEGA software. Finally, the graphical visualization of the tree was enhanced using the online tool ITOL.3
2.3.3 Genetic differentiation index (FST)
FST values for each sliding window were calculated using VCFtools, with a window size of 100 kb and a step size of 50 kb. The top 1% of these FST values were considered candidate regions under selection.
2.3.4 Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analysis of genes in selected regions
Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were conducted using DAVID4 and the results were visualized using the R package ggplot2.
2.4 SV validation
To validate the activity of RE-mediated SVs, dual-luciferase reporter gene vectors (pGL3-Basic vector) were constructed. Then, three SV sequences (FSHR-DEL, ADCY5-DEL, and MTNR1A-DEL) were separately cloned upstream of the reporter gene. SV activity was determined using the Luciferase Reporter Gene Assay Kit (Yeasen, Shanghai, China). Dual-luciferase activity was detected after 48 h post-transfection in HeLa cells.
3 Results
3.1 Sequencing, mapping, and SV detection
After quality control and alignment, the BAM files of several breeds were statistically analyzed using Qualimap software. A total of 17.43 GB of clean data were obtained. The alignment rate of clean reads from all the samples to the reference genome was 99.70%, with an average sequencing depth of 13.32×. The GC content was above 42.63% in all samples (Supplementary Table 1).
Using Manta, Lumpy, and Delly software, a total of 107,166 SVs were identified from the seven sheep breeds from different regions (Figure 1A). Among the different types of variation, DELs were the most frequent, accounting for 48.39%, while insertions (INSs) were the least common, with only 27 SVs detected, representing 0.03% of the total. The numbers of duplications (DUPs) and inversions (INVs) were relatively similar (Figure 1B). Regarding their distribution, SVs were unevenly distributed across chromosomes, which was largely related to differences in chromosome length. The highest number of SVs was observed on chromosome 1, followed by chromosome 2. Notably, the number of SVs on the X chromosome was significantly higher than that on the Y chromosome (Figure 1C). Subsequently, minor allele frequencies were determined. The results showed that a large number of SVs were widely present in the sheep population, indicating that they are not rare (Figure 1D). Of the detected SVs, 21,335 DELs (35.11%) and 3,865 INSs (76.79%) had lengths ≥1 kb, and 3,800 DELs (14.38%) and 1,342 INSs (57.67%) had lengths ≥4 kb. Most DELs were in the 50 bp to 1 kb range, with their frequency significantly decreasing as their length increased (Figure 1E).

Figure 1. (A) Global geographic distribution of the seven sheep breeds. (B) Types of structural variants (SVs). (C) Chromosomal distribution of the SVs. (D) Minor allele frequency distribution across the genomes of 73 ewes. (E) Length distribution of the deletions and inversions.
3.2 Distribution of SVs in different sheep breeds
Across the seven sheep breeds, Hu sheep had the highest number of SVs, totaling 59,424, accounting for 20.51% of the total. The SV counts for Yunnan fine wool sheep, Kazakh sheep, Baikal sheep, Hammari sheep, Kabashi sheep, and Tuva sheep were 52,642, 52,455, 43,063, 31,909, 26,325, and 23,938, respectively (Figures 2A,B).

Figure 2. (A) The number of structural variants (SVs) in each breed. (B) The proportion of SVs in each breed. (C) A Venn diagram of the SVs in the seven breeds.
Further analysis showed that 4,857 SVs were shared among all seven breeds. The distribution of breed-specific SVs varied significantly, with Tuva sheep exhibiting the highest number of unique SVs (9,455), followed by Hu sheep (9,193), Yunnan semi-fine wool sheep (6,899), Kazakh sheep (6,741), Baikal sheep (4,758), Hammari sheep (1,907), and Kabashi sheep (1,338) (Figure 2C).
3.3 Population structure analysis
PCA of the SVs was used to explore the relationships among the breeds, plotting the first two principal components (PC1 and PC2). The analysis revealed that among the seven breeds, Hammari and Kabashi sheep clustered together, while the other five breeds showed clear separation based on their SV profiles (Figure 3A). To further investigate population relationships, a phylogenetic tree was constructed based on the SVs, with the results showing that the seven populations grouped into six distinct clusters, with Hammari and Kabashi sheep clustering together, consistent with the PCA results (Figure 3B). This was suggestive of a close genetic relationship between these two breeds.
3.4 Selection signal analysis
3.4.1 Differences in SVs between Hu sheep and other breeds and between Hu sheep and Kazak sheep or Yunnan semi-fine wool sheep
Selection signal analysis using FST detected SV differences between Hu sheep and other breeds, as well as between Hu sheep and Kazakh sheep or Yunnan semi-fine wool sheep. The top 1% of SVs with the highest FST values were selected as candidate regions for further analysis, leading to the identification of 653 loci under selection, annotated with 801 candidate genes. The FST threshold for the selected regions was 0.260 (Figure 4A).

Figure 4. (A) A Manhattan plot for Hu sheep and the other breeds as well as between Hu sheep and Kazak sheep or Yunnan semi-fine wool sheep. (B) A Manhattan plot for Kazakh sheep and five other breeds. (C) A Venn diagram. (D) Genotype frequencies of the 72-bp deletion. (E) Genotype frequencies of the 67-bp deletion. (F) Genotype frequencies of the 731-bp deletion.
3.4.2 SV differences between Kazakh sheep and five other breeds
Next, we analyzed the differences in SVs between Kazakh sheep and five other breeds (Yunnan fine-wool sheep, Baikal sheep, Hammari sheep, Kabashi sheep, and Tuva sheep). The top 1% of SVs with the highest FST values were selected as candidate regions for further analysis, revealing that 358 loci were subject to selection, corresponding to 502 candidate genes. The FST threshold for the selected regions was 0.104 (Figure 4B).
3.4.3 Shared and unique SVs detected
We identified a 67-bp deletion (FST = 0.188) in the intron of the MTNR1A gene, which was a shared high-FST SV between the two groups (Figure 4C). Additionally, a 72-bp deletion (FST = 0.385) in the intron of the ADCY5 gene, which is associated with reproductive traits, showed high differentiation between Hu sheep and the other breeds, as well as among Hu sheep, Kazakh sheep, and Yunnan semi-fine wool sheep. Moreover, a 731-bp deletion (FST = 0.153) upstream of the FSHR gene, which is related to reproduction, was highly differentiated between Kazakh sheep and the other five breeds (Supplementary Table 1). Genotype frequencies for these three SVs were also statistically analyzed (Figures 4D–F).
3.5 Annotation and functional enrichment of candidate genes
GO term and KEGG pathway enrichment analyses were performed on the selected regions using the DAVID and KOBAS websites. GO enrichment analysis between Hu sheep and the other breeds, as well as between Hu sheep and Kazakh sheep or Yunnan semi-fine wool sheep, identified 36 GO terms, mainly related to processes such as protein binding, ATP binding, mitochondrion, and regulation of transcription by RNA polymerase II (Figure 5A). Meanwhile, KEGG pathway enrichment analysis revealed 31 pathways, primarily associated with the WNT signaling pathway, Relaxin signaling pathway, GnRH signaling pathway, and ovarian steroidogenesis (Figure 5B). GO enrichment analysis between Kazakh sheep and the other five breeds identified 35 GO terms, mainly linked to processes such as protein binding, metal ion binding, nucleus, and protein phosphorylation (Figure 5C). Finally, KEGG pathway enrichment analysis revealed 17 pathways, mainly related to ovarian steroidogenesis, steroid hormone biosynthesis, Hippo signaling pathway, and circadian entrainment (Figure 5D).

Figure 5. (A,B) GO functional enrichment analysis and KEGG pathway enrichment analysis of selected genes between Hu sheep and the other breeds, as well as between Hu sheep and Kazak sheep or Yunnan semi-fine wool sheep. (C,D) GO functional enrichment analysis and KEGG pathway enrichment analysis of selected genes between Kazakh sheep and the other five breeds.
3.6 Functional validation of high-frequency SVs associated with reproductive traits
We identified three high-frequency SVs associated with reproductive traits, including a 731-bp DEL upstream of the FSHR gene. This variant was found to be most prevalent in the Yunnan fine wool sheep population (AF = 0.70), followed by the Kazakh (AF = 0.68) and Kabashi (AF = 0.68) sheep populations. The frequencies were lowest in the Tuva sheep (AF = 0.25) and Hammari sheep (AF = 0.25) populations. Dual-luciferase reporter assays demonstrated that the 731-bp DEL led to a reduction in enhancer activity, with significantly higher luciferase expression observed in cells carrying the FSHR-DEL variant than in those with the FSHR-WT genotype, suggesting that the 731-bp sequence has enhancer activity and promotes FSHR gene expression (Figure 6A). Similarly, our findings revealed that both the 67-bp DEL in the intron of the MTNR1A gene and the 72-bp DEL in the intron of the ADCY5 gene resulted in a significant reduction in luciferase expression in cells with the MTNR1A-DEL and the ADCY5-DEL compared to that in WT cells (Figures 6B,C). These results indicated that both DNA fragments act as enhancers in HeLa cells.

Figure 6. (A) Dual-luciferase reporter assay of the 731-bp sequence in HeLa cells. (B) Dual-luciferase reporter assay of the 67-bp sequence in HeLa cells. (C) Dual-luciferase reporter assay of the 72-bp sequence in HeLa cells.
4 Discussion
SVs represent an important source of genetic diversity and have been extensively studied in plants and animals over recent years (23). In this study, we used three software tools—Manta, Lumpy, and Delly—to detect SVs and then generated a SV set by considering only those detected by all three tools. This approach aimed to improve detection accuracy and minimize false positives. A total of 107,166 SVs were identified. In terms of variation types and quantities, DELs accounted for the highest proportion at 48.39%, while TRAs represented only 0.03%. We detected genomic SVs in seven sheep breeds, with the highest number found in Hu sheep and the lowest in Tuva sheep. However, when breed-specific variations were considered, Tuva sheep exhibited the highest number of unique variations, followed by Hu sheep. This high number of unique SVs in Tuva sheep, a local breed with a unique genetic background, likely reflects the accumulation of a large number of breed-specific variations due to long-term natural selection and genetic drift. A Venn diagram analysis indicated that 4,857 SVs were shared among the seven breeds, with additional shared SVs detected between specific breeds. Due to the limitations of currently available methods for the detection of INSs, it is necessary to combine long-read sequencing and third-generation sequencing to improve SV detection in the future.
PCA was performed to assess the genetic relationships among the seven sheep breeds. The results showed that, except for Hammari and Kabashi sheep, which clustered together, the other five breeds exhibited significant genetic differentiation. Further analysis based on phylogeny confirmed the PCA results, with the seven populations grouped into six branches. Hammari and Kabashi sheep were again placed in the same branch, suggestive of a high degree of genetic similarity between the two. The consistency between the PCA and phylogenetic tree results suggests that Hammari and Kabashi sheep may share a closer evolutionary origin or experience more frequent gene flow. This genetic relationship may be linked to their geographic distribution, historical breeding, or ecological adaptations. Moreover, the significant genetic differentiation observed among the other five breeds likely reflects breed-specific selective pressures and the long-term environmental effects on these populations.
In this study, we identified several candidate genes related to reproductive traits, including MTNR1A, FSHR, and ADCY5. Previously, Lukic et al. (24) identified the MTNR1A gene as playing a significant role in regulating seasonal reproduction in sheep using a genome-wide analysis. The MTNR1A gene, which encodes the main receptor for melatonin, regulates melatonin signaling, influencing the activity of the hypothalamic-pituitary-gonadal axis in sheep. Consequently, MTNR1A regulates the reproductive cycle and performance of these animals and enables them to better adapt to seasonal environmental changes. Arjoune et al. (25) used genotyping and statistical analysis to explore the genetic polymorphisms of the MTNR1A gene in two Mediterranean sheep breeds and examined how these polymorphisms affect reproductive performance. The authors reported that certain MTNR1A genotypes were significantly associated with favorable reproductive traits, such as higher lambing rates, better fertility, and more stable estrous cycles. Abuzahra et al. (26) summarized existing studies on the relationship between MTNR1A and lambing rates in sheep, reviewing the role of the MTNR1A gene in reproductive performance, particularly lambing rates, and analyzed how its polymorphisms influence reproductive capacity. Their findings suggested that MTNR1A plays a crucial role in regulating sheep reproductive cycles and efficiency. Employing whole-genome sequencing, Guo et al. (27) found that the FSHR gene is closely associated with reproductive traits (e.g., parity) in sheep and plays a significant role in selective breeding. Ma et al. (28), using specific locus amplified fragment sequencing (SLAF-seq), identified novel genes associated with lambing rates in Xinjiang sheep populations, including FSHR, which influence reproductive performance by regulating reproductive processes. He et al. (29) demonstrated through qRT-PCR that the expression level of the FSHR gene was significantly higher in high-reproduction sheep than in low-reproduction ones and identified several polymorphic sites in this gene that were closely related to reproductive performance. Similarly, Tao et al. (30) found a significant correlation between SNPs in the FSHR gene and reproductive traits in different sheep breeds using genotyping, qPCR, and bioinformatics analysis. Pan et al. (31) further characterized the FSHR gene, analyzed its expression patterns across various tissues and conditions, and systematically explored the potential association between FSHR polymorphisms and lambing rates. They concluded that the expression characteristics and genetic variations of the FSHR gene significantly influenced reproductive performance in sheep. Finally, Du et al. (32), using comparative transcriptomic analysis, identified a seasonal expression pattern of the ADCY5 gene in the adrenal tissues of Sunit sheep, confirming it as a key candidate gene influencing seasonal reproduction.
In this study, we identified 107,166 genomic SVs, greatly expanding our understanding of the genetic variation across different sheep breeds. The identified high-frequency SVs associated with reproductive traits offer valuable insights into potential candidate genes for use in Xinjiang native sheep breeding. Overall, this study provides a valuable genetic resource for understanding sheep domestication and reproductive traits and contributes to the dissection of the genetic basis of important phenotypic traits.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary material.
Ethics statement
The animal studies were approved by the Experimental Animal Care and Use Committee of Xinjiang Agricultural University (2024004). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent was obtained from the owners for the participation of their animals in this study.
Author contributions
JL: Methodology, Data curation, Formal analysis, Writing – original draft, Writing – review & editing. SY: Methodology, Project administration, Writing – original draft, Writing – review & editing, Funding acquisition. WZ: Formal analysis, Software, Supervision, Writing – review & editing. WW: Formal analysis, Visualization, Validation, Writing – review & editing. JB: Data curation, Investigation, Resources, Writing – original draft, Writing – review & editing. FL: Project administration, Writing – original draft, Writing – review & editing, Funding acquisition. MZ: Conceptualization, Project administration, Writing – original draft, Writing – review & editing, Funding acquisition.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This research was funded by the Talent Development Fund of the Autonomous Region’s Tianchi Talent Introduction Plan Project Funding, the Third Xinjiang Scientific Expedition Program (2022xjkk0404), the Youth Science and Technology Innovation Talent of Tianshan Talent Training Program in Xinjiang, the Natural Science Foundation of Xinjiang Uygur Autonomous Region (2024D01D08), and the Autonomous Region’s Major Science and technology Special Projects (2024A02004-4-7 and 2024A02004).
Acknowledgments
The authors are grateful to all the members of the laboratory for their help in sampling and performing the experiments.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Gen AI was used in the creation of this manuscript.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fvets.2025.1551293/full#supplementary-material
SUPPLEMENTARY FIGURE S1 | FSHR fluorescence transfection efficiency.
SUPPLEMENTARY FIGURE S2 | ADCY5 fluorescence transfection efficiency.
SUPPLEMENTARY FIGURE S3 | MTNR1A fluorescence transfection efficiency.
Footnotes
References
1. Tao, L, He, XY, Wang, FY, Pan, LX, Wang, XY, Gan, SQ, et al. Identification of genes associated with litter size combining genomic approaches in Luzhong mutton sheep. Anim Genet. (2021) 52:545–9. doi: 10.1111/age.13078
2. Notter, DR. Genetic improvement of reproductive efficiency of sheep and goats. Anim Reprod Sci. (2012) 130:147–51. doi: 10.1016/j.anireprosci.2012.01.008
3. Martinez-Royo, A, Alabart, JL, Sarto, P, Serrano, M, Lahoz, B, Folch, J, et al. Genome-wide association studies for reproductive seasonality traits in rasa Aragonesa sheep breed. Theriogenology. (2017) 99:21–9. doi: 10.1016/j.theriogenology.2017.05.011
4. Li, X, Yang, J, Shen, M, Xie, X-L, Liu, G-J, Xu, Y-X, et al. Whole-genome resequencing of wild and domestic sheep identifies genes associated with morphological and agronomic traits. Nat Commun. (2020) 11:2815. doi: 10.1038/s41467-020-16485-1
5. Zhu, M, Zhang, H, Yang, H, Zhao, Z, Blair, HT, Liang, H, et al. Targeting GNAQ in hypothalamic nerve cells to regulate seasonal estrus in sheep. Theriogenology. (2022) 181:79–88. doi: 10.1016/j.theriogenology.2022.01.005
6. Zhu, M, Yang, Y, Yang, H, Zhao, Z, Zhang, H, Blair, HT, et al. Whole-genome resequencing of the native sheep provides insights into the microevolution and identifies genes associated with reproduction traits. BMC Genomics. (2023) 24:392. doi: 10.1186/s12864-023-09479-y
7. Khamiruev, TN, Bazaron, BZ, and Volkov, IV. Wool and meat production of Khangil-type trans-Baikal sheep. Russ Agric Sci. (2020) 46:167–70. doi: 10.3103/S1068367420020093
8. Deniskova, TE, Abdelmanova, A, Dotsev, AV, Sambu-Khoo, C, Reyer, H, Selionova, MI, et al. PSX-18 high-density genomic description of Russian native sheep breed of the Republic of Tyva. J Anim Sci. (2020) 98:453–4. doi: 10.1093/jas/skaa278.789
9. Salim, B, Alasmari, S, Mohamed, NS, Ahmed, M-KA, Nakao, R, and Hanotte, O. Genetic variation and demographic history of Sudan desert sheep reveal two diversified lineages. BMC Genomics. (2023) 24:118. doi: 10.1186/s12864-023-09231-6
10. Ahbara, AM, Musa, HH, Robert, C, Abebe, A, Al-Jumaili, AS, Kebede, A, et al. Natural adaptation and human selection of northeast African sheep genomes. Genomics. (2022) 114:110448. doi: 10.1016/j.ygeno.2022.110448
11. Li, R, Yang, P, Li, M, Fang, W, Yue, X, Nanaei, HA, et al. A Hu sheep genome with the first ovine Y chromosome reveal introgression history after sheep domestication. Sci China Life Sci. (2021) 64:1116–30. doi: 10.1007/s11427-020-1807-0
12. Liang, X, Duan, Q, Li, B, Wang, Y, Bu, Y, Zhang, Y, et al. Genomic structural variation contributes to evolved changes in gene expression in high-altitude Tibetan sheep. Proc Natl Acad Sci USA. (2024) 121:e2322291121. doi: 10.1073/pnas.2322291121
13. Yang, J, Wang, D-F, Huang, J-H, Zhu, Q-H, Luo, L-Y, Lu, R, et al. Structural variant landscapes reveal convergent signatures of evolution in sheep and goats. Genome Biol. (2024) 25:148. doi: 10.1186/s13059-024-03288-6
14. Li, R, Gong, M, Zhang, X, Wang, F, Liu, Z, Zhang, L, et al. A sheep pangenome reveals the spectrum of structural variations and their effects on tail phenotypes. Genome Res. (2023) 33:463–77. doi: 10.1101/gr.277372.122
15. Qiao, G, Xu, P, Guo, T, He, X, Yue, Y, and Yang, B. Genome-wide detection of structural variation in some sheep breeds using whole-genome long-read sequencing data. J Anim Breed Genet. (2024) 141:403–14. doi: 10.1111/jbg.12846
16. Li, H, and Durbin, R. Fast and accurate short read alignment with burrows-wheeler transform. Bioinformatics. (2009) 25:1754–60. doi: 10.1093/bioinformatics/btp324
17. García-Alcalde, F, Okonechnikov, K, Carbonell, J, Cruz, LM, Götz, S, Tarazona, S, et al. Qualimap: evaluating next-generation sequencing alignment data. Bioinformatics. (2012) 28:2678–9. doi: 10.1093/bioinformatics/bts503
18. Layer, RM, Chiang, C, Quinlan, AR, and Hall, IM. LUMPY: a probabilistic framework for structural variant discovery. Genome Biol. (2014) 15:R84. doi: 10.1186/gb-2014-15-6-r84
19. Rausch, T, Zichner, T, Schlattl, A, Stütz, AM, Benes, V, and Korbel, JO. DELLY: structural variant discovery by integrated paired-end and split-read analysis. Bioinformatics. (2012) 28:i333–9. doi: 10.1093/bioinformatics/bts378
20. Chen, X, Schulz-Trieglaff, O, Shaw, R, Barnes, B, Schlesinger, F, Källberg, M, et al. Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications. Bioinformatics. (2016) 32:1220–2. doi: 10.1093/bioinformatics/btv710
21. Jeffares, DC, Jolly, C, Hoti, M, Speed, D, Shaw, L, Rallis, C, et al. Transient structural variations have strong effects on quantitative traits and reproductive isolation in fission yeast. Nat Commun. (2017) 8:14061. doi: 10.1038/ncomms14061
22. Purcell, S, Neale, B, Todd-Brown, K, Thomas, L, Ferreira, MAR, Bender, D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. (2007) 81:559–75. doi: 10.1086/519795
23. Li, X, Liu, Q, Fu, C, Li, M, Li, C, Li, X, et al. Characterizing structural variants based on graph-genotyping provides insights into pig domestication and local adaption. J Genet Genomics. (2024) 51:394–406. doi: 10.1016/j.jgg.2023.11.005
24. Lukic, B, Curik, I, Drzaic, I, Galić, V, Shihabi, M, Vostry, L, et al. Genomic signatures of selection, local adaptation and production type characterisation of East Adriatic sheep breeds. J Anim Sci Biotechnol. (2023) 14:142. doi: 10.1186/s40104-023-00936-y
25. Arjoune, A, Alsaleh, AB, Messaoudi, SA, Chelbi, H, Jelassi, R, Assidi, M, et al. Analysis of MTNR1A genetic polymorphisms and their association with the reproductive performance parameters in two Mediterranean sheep breeds. Animals. (2023) 13:448. doi: 10.3390/ani13030448
26. Abuzahra, M, Wijayanti, D, Effendi, MH, Mustofa, I, and Lamid, M. Association of melatonin receptor 1A with litter size in sheep: a review. F1000Res. (2024) 12:900. doi: 10.12688/f1000research.134890.2
27. Guo, Y, Liang, J, Lv, C, Wang, Y, Wu, G, Ding, X, et al. Sequencing reveals population structure and selection signatures for reproductive traits in Yunnan semi-fine wool sheep (Ovis aries). Front Genet. (2022) 13:812753. doi: 10.3389/fgene.2022.812753
28. Ma, H, Fang, C, Liu, L, Wang, Q, Aniwashi, J, Sulaiman, Y, et al. Identification of novel genes associated with litter size of indigenous sheep population in Xinjiang, China using specific-locus amplified fragment sequencing technology. PeerJ. (2019) 7:e8079. doi: 10.7717/peerj.8079
29. He, X, La, Y, Wang, J, Di, R, Liu, Q, Wang, X, et al. Expression and polymorphism of FSHR gene in sheep with different fecundity. Pakistan J. Zool. (2022) 54:667–75. doi: 10.17582/journal.pjz/20190215010208
30. Tao, M, Li, Z, Liu, M, Ma, H, and Liu, W. Association analysis of polymorphisms in SLK, ARHGEF9, WWC2, GAB3, and FSHR genes with reproductive traits in different sheep breeds. Front Genet. (2024) 15:1371872. doi: 10.3389/fgene.2024.1371872
31. Pan, X, Liu, S, Li, F, Wang, W, Li, C, Ma, Y, et al. Molecular characterization, expression profiles of the ovine FSHR gene and its association with litter size. Mol Biol Rep. (2014) 41:7749–54. doi: 10.1007/s11033-014-3666-8
Keywords: sheep, whole-genome resequencing, structural variation, reproductive traits, selection signals
Citation: Li J, Yiming S, Zheng W, Wu W, Bi J, Li F and Zhu M (2025) Identification of candidate genes for reproductive traits in Xinjiang sheep breeds based on genomic structural variation. Front. Vet. Sci. 12:1551293. doi: 10.3389/fvets.2025.1551293
Edited by:
Sachinandan De, National Dairy Research Institute (ICAR), IndiaReviewed by:
Pengju Zhao, China Agricultural University, ChinaCunfang Zhang, Qinghai University, China
Copyright © 2025 Li, Yiming, Zheng, Wu, Bi, Li and Zhu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Mengting Zhu, NzM0NzY5MDg1QHFxLmNvbQ==
†These authors have contributed equally to this work