ORIGINAL RESEARCH article
Sec. Livestock Genomics
Whole Genome Resequencing Reveals Selection Signatures Associated With Important Traits in Ethiopian Indigenous Goat Populations
- 1Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China
- 2College of Veterinary Science, Mekelle University, Mekelle, Ethiopia
Ethiopia is considered as the main gateway for the introduction of livestock species, including goat, to the African continent. Ethiopian goats are characterized by their unique adaptive ability, and different physical characteristics in terms of morphology, body size, coat colors, and other important traits. The comparative population genomic analysis provides useful genomic information associated with important traits. Whole-genome resequencing of 44 Ethiopian indigenous goats produced 16 million single-nucleotide polymorphisms (SNPs) as well as 123,577 insertions and deletions. Specifically, 11,137,576, 10,760,581, 10,833,847, 12,229,657 and 10,749,996 putative SNPs were detected in Abergelle, Afar, Begait, Central Highland and Meafure goat populations, respectively. In this study, we used population differentiation (FST) and pooled heterozygosity (HP) Cbased approaches. From the FST analysis, we identified 480 outlier windows. The HP approach detected 108 and 205 outlier windows for Abergelle, and Begait, respectively. About 11 and 5 genes under selective signals were common for both approaches that were associated with important traits. After genome annotation, we found 41 Gene ontology (GO) terms (12 in biological processes, 8 in cellular components and 11 in the molecular function) and 10 Kyoto Encyclopedia of Genes and Genomes pathways. Several of the candidate genes are involved in the reproduction, body weight, fatty acids, and disease related traits. Our investigation contributes to deliver valuable genetic information and paves the way to design conservation strategy, breed management, genetic improvement, and utilization programs. The genomic resources generated in the study will offer an opportunity for further investigations.
Goats (Capra hircus) are one of the first ruminant animals to be domesticated around 10,000 to 9900 years ago (Zeder and Hesse, 2000) at the dawn of the Neolithic period in the Fertile Crescent. They were subsequently dispersed across the continents, adapting themselves to diverse biophysical and production environments. This also holds true for Ethiopia, which is considered as the main gateway for the introduction of livestock species, including goat, to the African continent (Gifford-Gonzalez and Hanotte, 2011). The country is home for 30.2 million goats (CSA, 2017), with seven genetically characterized groups and 14 local phenotypically classified populations (Getinet, 2016) which are characterized by their unique adaptive ability, and different physical characteristics in terms of morphology, body size, coat colors, and other production traits (Hassen et al., 2012), this implies that natural selection has most probably played a key role in the adaptation of these populations under the diverse range of environments. They are also known as the livelihood of resource-poor farmers and pastoralists, because of their immense economic importance trait, substantial phenotypic diversity and evolved in harsh climatic conditions (Hassen et al., 2012). Ethiopian indigenous goats are primarily used as a source of income, milk, meat, manure and many other sociocultural functions (Abegaz et al., 2013).
Among the Ethiopian goat populations, Begait and Afar are well adapted to the hotter and drier low land areas with a high temperature (27 to 37°C) and low rainfall ranging from 250 to 747 mm/years (Abegaz et al., 2013), and they are likely under selection for thermal stress, while the Abergelle, Central highland, and Meafure goats are mainly distributed from mid to mountainous production environments characterized by high rainfall (650 up to 700 mm) and relatively low temperature (5-30°C) (Zerabruk and Vangen, 2005). Moreover, Begait goat can be identified by their tall, predominantly white coat color and hairy thighs with long drooping ears (Berhane and Eik, 2006). They are merits of tolerance to feed and water shortages, resistance to diseases, and have the ability to produce and reproduce under such conditions and therefore, their genomes should harbor footprints for selection program for multi-traits.
Hence, genomic characterization of the Begait goat breed using modern genomic tools is paramount important to ensure breeding of these hardy goat populations. In this study, we reported genes that are positively selected in Begait goat population associated with important traits by scanning the whole genome resequencing data. The availability of whole genome resequencing data has facilitated the mapping of signatures selection associated with domestication and selection (Lai et al., 2016; Guo et al., 2018). Several investigations of whole genome resequencing have previously been examined genetic divergence among populations, identify potential genes/genomic regions associated with production, reproduction and adaption traits in various livestock species including goats (Benjelloun et al., 2015; Wang et al., 2016; Guo et al., 2018), sheep (Yang et al., 2016), cattle (Taye et al., 2017), pig (Li et al., 2014).
In Africa, studies on goat have been reported to detect positive selection signatures mainly focused with environmental adaptation and thermo-tolerance under heat stress conditions (Kim et al., 2016; Onzima et al., 2018). However, there have been no previous studies conducting in important traits in Ethiopian goat populations. Here we reported a positive selection signature of goat populations that is associated with important traits using whole-genome resequencing of five Ethiopian goat populations.
In this study, we applied a population differentiation (Akey et al., 2010) and pooled heterozygosity (Rubin et al., 2010) based approaches to explore the genetic mechanisms underlying variation across the whole genome resequence and are an actual approach to detect selection signature for populations with unknown phenotypes under natural or artificial selection. This study is, therefore, aiming to reveal and identify unique selection signatures in the genome and the genes under selection in Ethiopian indigenous goat populations using whole genome resequencing data.
Materials and Methods
Animals and Sampling
We sampled a total of 44 goat from five Ethiopian indigenous goat populations (Aberegalle (n = 11), Afar (n = 5), Begait (n = 11), Central highland (n = 12) and Meafure (n = 5). The samples were collected from two provinces (Afar and Tigray) as indicated in (Figure 1). All samples were collected from representative unrelated individual animals. To ensuring the representative sampling of the animals, livestock experts, herdsmen and owners of the animals were consulted. Following this, Afar (AF) goat samples were collected from two villages and five flocks. Aberegelle (AB) goat samples were obtained from one research station and two districts while, the Begait (BG) goat samples were obtained from two villages, one research station, and one university farm. Central highland (CH) goat samples were obtained from 4 villages. Meafure (MR) goat samples were collected from 3 local areas. Among these five goat populations, Begait and Abergelle goat populations are highly selected for different purposes (meat and milk production). Begait goats produce a high daily milk yield (0.6 kg)), and have high productive potential since they have better early birth weight and growth rates compared to the other goat populations (Berhane and Eik, 2006). They are the most preferred breeds by livestock keepers because of their excellent twinning ability, body size, pre-weaning kid survival rate, and milk yield (Abraham, 2018), while the Abergelle goats are a dual purpose and widely distributed in areas dominated by plains and river valleys. They are stocky, compact, and well built (height at withers of 71.4 and 65 cm for males and females, respectively), with a body weight of 34 and 28 kg for males and females, respectively, and these are the key identifying physical features of the these goats (Berhane and Eik, 2006). The majority of these goats have commonly a plain coat colour (56%), with 33% patchy and 11% spotted. They have short and smooth hair (Berhane and Eik, 2006). A detailed description of the appearance, characteristics, and sample distributions of the populations is provided in (Supplementary Table S1). Arising from this different phenotypic characteristics and breeding histories, these two groups of animals could therefore provide appropriate genetic material to analyze genomic differences among the animals.
Figure 1 Geographic distribution and sample collection site of the five indigenous goat populations.
For all samples, blood was obtained from veins using FTA® cards following the procedure of Berry Genomics (www.berrygenomics.com). Until DNA extraction, blood samples were stored at room temperature. DNA extractions were performed at the Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China, based on the manufacturer’s protocol. The library construction and whole genome resequencing was performed for 150 bp paired-end reads using the Hiseq-3000 platform (Illumina, San Diego, CA, USA).
Whole Genome Resequence, Quality Checking and Mapping
The fastQC software (Hadfield et al., 2014) was used to perform a raw sequence data quality check. Using BWA (Li and Durbin, 2010), paired-end read data were processed to filter out the adaptors and low quality reads resulted in clean reads. After alignment to the goat reference assembly(ARS1) (Bickhart et al., 2018), variants were called using SAMtools package 0.1.2 (Li et al., 2009) and were used to convert file format from SAM to BAM. On average, 99.2% of the reads were mapped, resulting in a final average sequencing death of 16 (10× to 30×) per individual. Picard ver. 1.86 (http://picard.sourceforge.net) were used to sort BAM file and remove potential MarkDuplicates if multiple read pairs had identical external coordinates. Read pairs with the highest mapping quality were retained. Single nucleotide polymorphisms (SNPs) calling was performed using both GATK (McKenna et al., 2010) and SAMtools (Li et al., 2009) the output was further filtered using Variant Call Format (VCF) tools (Danecek et al., 2011). SNPs that did not meet the following criteria were excluded: (1) 3× < mean sequencing depth (overall included individuals) < 30×; (2) a minor allele frequency > 0.05; (3) maximum missing rate < 0.1; and (4) only bi-alleles autosomal were used for further analysis.
Population Structure and Admixture Analyses
To determine the genetic relatedness among the study goat samples, whole-genome autosomal SNPs were extracted using PLINK v1.9 (Purcell et al., 2007). Principal component analysis (PCA) was carried out to illustrate the relationship among the populations from the genetic relationship matrix of the first two eigenvalues using the smart PCA program in the package EIGENSOFT version 6.0.1 (Patterson et al., 2006), and plotted in R package version 3.4 (Zheng et al., 2012). ADMIXTURE version.1.3.0 were used to assess the population the genome-wide genetic structure among the populations (Alexander et al., 2009). To examine the most ideal population structure, a cross-validation procedure was undertaken with hypothetical ADMIXTURE runs from K = 1 to 10.
Genome-Wide Selective Sweep Analysis
To identify genome-wide selective sweeps, we used two populations (AB and BG as they have enough sample, better phenotypic characteristics and breeding histories) by using the two statistical approaches; population differentiation (FST) and pooled heterozygosity (HP). Individual FST values derived from pairwise comparisons between two breeds were then combined and averaged across all SNPs using a Linux script to obtain the transformed summary statistic (Akey et al., 2010).
First, the weighted FST values were calculated for each SNP based on where s2 represents the sampling variance of allele frequencies between two populations, p represents the overall average allele frequency across populations and r = 2 represent the number of populations (Weir and Clark Cockerham, 1984) and using a sliding-window approach (100-kb windows with 50-kb increments) as described previously (Yang et al., 2016; Li et al., 2017). The FST values were then Z transformed as . To reduce false positive results, only the top 1% of windows with the highest average ZFST value was positively selected as the candidate outliers under strong selective sweeps for each comparison.
Second, we determined the pooled heterozygosity (HP) values to scan selection signals as follows where ΣnMaj and ΣnMin represent the sums of the numbers of the major and minor alleles at each locus, respectively. Individual HP values were Z transformed as follows: where µHP is the overall average heterozygosity and σHP is the standard deviation for all windows within each group. The extremely low HP scores (ZHP < -4 cut-off) were proposed to be selection signals using 100-kb sliding windows with a step size of 50 kb according to the method used previously (Rubin et al., 2010). All of the outlier windows were assigned to corresponding SNPs and genes.
Based on genome annotation, a gene was deemed to show evidence of being under candidate selection if it overlapped with an outlier genomic window based on both ZFST and ZHP values. Finally, we retrieved the caprine gene (ARS1), Gene transfer format (GTF) file from the Ensembl genome browser (http://www.ensembl.org/) databases. Enrichment and functional annotation of the candidate genes were defined using the Enrichr program (Chen et al., 2013) with default settings on the human gene set. We further analysis the significant over-representation of GO biological processes (GO-BP), molecular function (GO-MF), cellular component (GO-CC) and KEGG-pathway. Only pathways or annotations with P < 0.05 were used. The functions of the candidate genes were consulted based on the annotations in the NCBI (http://www.ncbi.nlm.nih.gov in the PubMed, and literatures.
Genome Sequence Mapping and SNPs Calling
After resequencing and mapping of clean reads, we identified 16.56 million SNPs and 123,577 Indels (insertions and deletions) from 44 individuals. Consequently, for each individual, 99.23% of total clean reads were mapped against the latest goat reference genome (assembly ARS1) with coverage of ∼99.88%. Accordingly, for each individual, more than 99.22% of total clean reads were mapped against the latest goat reference genome assembly (ARS1) with coverage of mapped ∼99.87% (Table 1). These mapped reads also generated an average sequencing depth of 16× per breed ranging (10× to 30×) fold, indicating that high-quality sequences were obtained in this study. Among the five goat populations, we identified 13,061,914 unique SNPs based on these stringent thresholds and used for detection of positive selection signature analysis. The largest number 12,229,657 of SNPs was detected in CH goat followed by the AB goat (11,137,576), which likely reflects a larger number of samples size. In contrast, the MR goat displayed the lowest number of SNPs 10, 749, 99 (Table 2), which might be due to low sequencing depth.
Population Genetic Structure Analyses
As a different approach to characterize divergence, the genetic relationship between animals was analyzed by principal component analysis (PCA). We observed closer clustering pattern between Abergelle and Central highland goat, and between Afar and Meafure goats, whereas Begait goat formed a separate cluster (Figure 2). This clustering pattern is consistent with the geographical locations, genetic composition and domestication history of the breeds. We also observed a clear signature of genetic admixture among all the five goat breeds. However, at K = 2, 3 and 4, an independent genetic admixture was observed in Begait goat (Supplementary Figures 1), which is much consistent with the PCA results. At K = 5-10, high genetic admixture was observed among all the populations (Supplementary Figures 2).
Identification of Positive Selective Sweeps
For the detection of selective sweeps, we only used 2 goat populations containing 23 individuals such as Abergelle (n = 12) and Begait (n = 11). The results of PCA and Admixture, revealed that Begait goats were relatively distinct divergence compared the others (Abergelle and Central highland goats are very close relationships) goat populations.
On the basis of this results, we used selection signature to explore differences among groups of populations that are exposed to different selection pressures by calculating the average ‘ZFST’ values between Abergelle versus Begait (Figure 3A) and between Begait versus Central Highland goats (Figure 3B). In all the comparisons, we obtained 480 outlier windows (Supplementary Table S2). The regions with the highest degree of differentiation were found on CHI1, 2, 3, 6, 10, and 12. In this result, UGT2A2 was found in the selection region of Begait goat (FST = 0.222, Z (FST) BG-AB = 6.456).
Figure 3 Genome-wide distributions of selection signals between Begait and Abergelle (A), between Begait and Central Highland (B) goat populations. Manhattan plot of Z-transformed population differentiation (FST) between any two populations for autosomal 100-kb windows 50-kb increments. The Z-value distribution plotted along goat autosomes 1–26 chromosomes.
From this analysis of pooled heterozygosity (HP), the overall average HP values across all the SNPs were 0.341 and 0.340 in the Abergelle, and Begait (Table 2), respectively. Moreover, only the lowest ZHP scores (ZHP ≤ -4) were deemed as selection regions, and were detected to be 108 and 205 outliers windows in Begait and Abergelle goats, respectively (Supplementary Table S3). Figure 4A shown the putative selection sweeps for Begait (Z (HP) BG) goat whereas, Figure 4B presented for Abergelle (Z (HP) AB) goat population.
Figure 4 Selection sweep analyses identified putative signals in the Begait (A) and Abergelle (B) goat populations. The distributions of Z-transformed pooled heterozygosity (HP) of three populations for autosomal 100-kb windows 50-kb increments. The Z-value distribution plotted along goat autosomes 1–26 chromosomes. The dashed horizontal line indicates the cut-off (Z > 4 or Z < -4) used for extracting outliers.
Overlap Between Candidate Selection Sweep Regions and Functional Enrichment Analyses
Putative genomic signals that overlapped between both the two approaches (high ZFST and low ZHP) were further considered as putative selection signatures and found 11 and 5 genes for Begait and Abergelle goats (Table 3), respectively. Comparing their genomic locations, there is limited overlap between the different breeds showed that most of these selection signals were breed-specific, reflecting distinct phenotypic evolutions under different selection objectives or adaptations to the local environments. Within the putative selection regions identified, the lists of genes were used to perform functional analyses using Enrichr program with the default settings on the human. We identified a total of 39 GO terms (12 in biological processes, 8 in cellular component and 11 in the molecular function) and 10 KEGG pathway were annotated (Supplementary Table S4). The biological processes (BP) terms overrepresented were related to negative regulation of protein polymerization, cellular response to oxidative stress, regulation of histone deacetylase activity, rRNA transport, microtubule nucleation by microtubule organizing center and insulin-like growth factor receptor signaling pathway.
Table 3 top overlapped selection regions identified via ZFST and ZHP in Begait and Abergelle goat populations associated with important traits.
The cellular components (CC) include; microtubule minus-end, centrosome, microtubule organizing center, cytoplasmic exosome, platelet alpha granule membrane, calcium channel complex, cytoplasmic microtubule and cytoplasmic stress granule. Whereas, the molecular functions (MF) were related to cuprous ion binding, sodium channel inhibitor activity, poly (G) binding, phosphofructokinase activity, protein serine/threonine phosphatase inhibitor activity, syntaxin-1 binding, metallocarboxypeptidase activity, cadherin binding and TPase activity, coupled to transmembrane movement of substances. The Long-term potentiation, RNA degradation, Glucagon signaling pathway, AMPK signaling pathway, Adrenergic signaling in cardiomyocytes, Axon guidance, and Galactose metabolism pathways were among the enriched in the KEGG pathways. In this study, we mainly focus on and discuss the genes and pathways that putatively contribute to the economic important traits.
In the present study, we performed a whole-genome resequencing of 44 goats from 5 Ethiopian goat populations. Ethiopian indigenous goat breeds are phenotypically categorized in o 14 main populations (Getinet, 2016). However, many of them are named after the community, which keeps the population, or according to geographical localities they inhabit, and the true genetic relationship between the major populations is not yet well known or documented. Among those indigenous goats, the Begait goats are Nubian type of goat which is widely used as source of meat and milk in the North-west part of Tigray, Ethiopia. Nubian goats came from Asia. It is assumed that the first wave of goats entered Ethiopia from the north between 2000 and 3000 B.C (Zeder and Hesse, 2000). In our study, the Begait goat showed high genetic variation compared to Afar, Abergelle, Central highland and Meafure goats, this might be due to artificial/natural selection. Understanding how natural selection shapes genetic variation in populations is of paramount importance in evolutionary biology (Brito et al., 2015).
This study is, therefore, the first whole genome resequencing to the breed under selection characterize genomic polymorphisms among the indigenous goat populations. The availability of high-throughput genotyping, sequencing technologies and analysis of large datasets offer great opportunities to better understand the mechanisms that underlie traits that have been exposed to intensive natural selection forces in both wild and domestic species (Lai et al., 2016; Li et al., 2017). In recent years, the selection of signature studies have become increasingly popular because they offer a complementary strategy for whole genome resequencing—to scan the goat genome for positions that may have been targeted traits of interest and selection of loci for breeding programs—and are useful for the annotation of significant functional genomic regions and thus help to link phenotypes to gene function, which could be of biotechnological relevance. We did find overlap between the selected regions identified with the FST and HP approaches. Comparing their genomic locations showed that most of these selection signals were breed-specific, reflecting distinct phenotypic evolutions under different selection objectives or adaptations to the local environments. Based on the genetic divergence and patterns of admixture among the studied goat populations, here we only used 3 populations containing 34 individuals for the detection of selection signature. Therefore, in this study we focused the analysis on genes that are known to be associated with reproduction, body weight, and fatty acids traits.
Candidate Genes Associated With Reproduction Traits
Reproduction traits are economic importance traits in livestock breeding in general (Laske et al., 2012) and in the goat industry in particular (Lai et al., 2016). It appears to be controlled by multiple genes and factors, (Hanrahan et al., 2004) including ovarian follicular development, embryogenesis, embryo implantation, oocyte maturation, ovulation, fertilization, and uterine receptivity.
Interestingly, in Begait goats, we identified several genes (CAMK2D, KANK4 NIN, RSPH6A and UGT2A2). The CAMK2D gene is involved in protein serine/threonine kinase activity (GO: 0004674). CAMK2D gene play an important role in bovine embryo development process (Adams et al., 2011). The gene NIN contain a highly differentiated region (FST = 0.183, ZFST = 4.940) and a low heterozygosity (HP = 0.178, ZHP = - 4.148) at 59.40 - 59.50 Mb on chromosome 10, which is involved in early embryo development. Ninein (Nin) gene is the centrosome-associated proteins which play significant roles in microtubule stability (Kowanda et al., 2016). The RSPH6A gene is identified in CHI18 (54.40 - 54.50 Mb) and is associated with fertility (Abbasi et al., 2018). Similarly, the region between 85.70 and 85.80 Mb on chromosome 6 showed high differentiation (FST = 0.222, ZFST = 6.456) and a low heterozygosity (HP = 0.082, ZHP = -6.604). According to genome annotation, this sweep region harbored gene UGT2A2, which is involved in capable of acting on estrogens. Estrogens and estrogen-like substances are widely distributed in plants and animals (Gruber et al., 2002). Estrogens are one class of steroid hormones that includes estrone, estradiol (E2), and estriol (Yaşar et al., 2017). Estradiol, the most potent estrogen hormone in the circulation, is involved in a wide variety of vital physio-logical functions that range from the development and maintenance of reproductive organs to the regulation of cardiovascular, musculoskeletal, immune, and central nervous system homeostasis (Yaşar et al., 2017).
In the genome of Abergelle goat, we identified genetic regions with the highest FST values (FST = 0.132, ZFST = 3.941) and lowest HP values (HP = 0.031, ZHP = -7.537) from 33.20 - 33.30 Mb on chromosome 12 contained strongest candidate genes (SLAIN1). SLAIN1 gene play role in the development of the nervous system and morphogenesis of several embryonic structures. SLAIN1 was expressed at the stem cell (Hirst et al., 2006). Moreover, in the same breed at the starting region (13.00 – 14.00 Mb) of chromosome 8 had the differentiation values (FST = 0.150, ZFST = 4.249) and HP scores (HP = 0.126, ZHP = -5.220) encompassing NEK1 gene. NEK1 (NIMA-related kinase 1) gene is a serine/threonine and tyrosine kinase and is essential roles during meiosis. NEK1 gene has been implicated in in ciliogenesis (White and Quarmby, 2008) and DNA damage response (Higelin et al., 2018). The gene NEK1 is highly expressed in mouse germ cells (Chen et al., 2011).
Candidate Genes Associated With Body Weight Traits
Growth rate and body weight are the most economically important traits in livestock that are specialized for meat production. Most of the time, tropical breeds tend to have small body weight/size and growth rate as compared with temperate breeds (Mwacharo et al., 2017). Body weight is one of the most important traits for meat type animals that can be measured at birth or at other life stages. Hence, natural selection may have left genomic footprints in the underlying genes involved the production traits of the Ethiopian goats. In the Begait goat, we found SCN9A gene at 10.63 - 10.64 Mb on CHI 2 with high FST values (FST = 0.208, ZFST = 3.563) and low HP scores (HP = 0.080, ZHP = -6.665). Previous study indicated the SCN9A gene is highly associated with body size and weight in cattle (Wang et al., 2019).
In the Abergelle goat breed, we identified 3 genes (GIGYF2 and PPP1R1A) at FST and low HP, which are associated with various production traits. GIGYF2 gene associated with growth trait at considerable impacts on lamb survivability and growth performance traits in sheep (Ghasemi et al., 2019). PPP1R1A was significantly associated gene with body mass index (Seo et al., 2016) and it is also known that positively correlated with insulin secretion in human (Taneera et al., 2014) and it also affects body mass index (BMI) in humans (Dina et al., 2007).
Candidate Genes Related to Fatty Acids
It is known that several candidate genes in the putative selection regions are involved in regulating fatty acids in mammals. Fatty acids are required by daily normal metabolism, and is an important trait contributing to meat quality (Zhu et al., 2017). Previous studies have been conducted to examine fatty acids for various goat breeds (Banskalieva et al., 2000). Interestingly, in the Begait goat, we identified CAB39L gene at high FST values (FST = 0.207, ZFST = 3.625) and low HP scores (HP = 0.175, ZHP = -4.218) located at 81.45-81.55Mb on CHI 10, and this gene play role in cell adhesion receptors and growth factor receptors. Growth factor–induced cell proliferation, adhesion, and migration in cultured cell models often depend on specific integrin (Eliceiri, 2001). Other candidate genes identified include TMEM117 (transmembrane protein 11) located on CHI 5 (FST = 0.286, ZFST = 8.674) and (HP = 0.164, ZHP = -4.293) at 35.60 - 35.70Mb in Abergelle goats. TMEM117 gene has strongly associated with saturated fatty acids composition in Simmental cattle (Zhu et al., 2017).
Candidate Genes Related to Disease
Moreover, Metabolic syndrome (MetS) is a set of complex disorders characterized by central obesity, high blood pressure, hypercholesterolemia, hypertriglyceridemia, decreased high-density lipoprotein cholesterol, and glucose intolerance (Lan et al., 2015). In Abergelle goats, we identified two genes, PPP4R3B (37.90 - 38.00 Mb) and PNPT1 (37.95 - 38.05 Mb) on CHI11. PPP4R3B is involved in gluconeogenesis, lipid metabolism and facilitates the uncovering of molecular mechanisms and in the identification of novel therapeutic targets for the disease (Kristiansson et al., 2012). PNPT1 gene was reported to be implicated with multiple metabolic RNA processing-related genes in pig (Park et al., 2010). Reduction of PNPT1 levels results in impaired mitochondrial processing and accumulation of large polycistronic transcripts, possibly due to its connection to the import of RNase P RNA into the mitochondria (Pellegrina et al., 2017).
Using a whole genome resequence data, we studied for the first-time selection signals in Ethiopian indigenous goat populations that may have left genomic footprints in the underlying genes involved in important traits. This genomic data generated several potential candidate genes related to body weight, reproduction and fatty acid traits in goats, reflecting phenotypic evolution under different selection signals and adaptation to the local environments. Furthermore, these results provide a basis for further investigation on the genomic characteristics of Ethiopian goat populations for complex traits.
Data Availability Statement
Whole genome resequencing data of 44 animals representing five Ethiopian goat populations after variant calls and genotype calls used in this paper are deposited and available at (http://bigd.big.ac.cn/gvm/getProjectDetail?project=GVM000049; Accession number: GVM000049).
Whole blood sample from the Ethiopian indigenous goat were collected after agreement from the local authorities and owners of the animals. No further specific permissions were required from the Ethics Committee of the Mekelle University, Ethiopia at the time of the sampling.
LJ, YM and HB designed the study. LJ and YM generated the genome data. YL and HB participated in the data analysis. HB, BG and GG participated in DNA extraction, HB and BG sampled individuals. HB drafted the manuscript with the inputs from all authors. All the authors revised and approved the manuscript.
This work was financial supported by the National Natural Science Foundation of China (31601910 and U1603232).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The authors would also like to acknowledge to Mekelle University, Ethiopia for the financial support during sample collection and processing. The authors thank the goat owners in northern Ethiopia (farmers of Tigray and Afar), the president of the Mekelle University (Professor Kindeya Gebrehiwot), lecturers of College of Veterinary Medics (Mekelle University), Abergelle Research Station, and Geyetse livestock farm center for providing goat samples.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2019.01190/full#supplementary-material
Supplementary Figure 1 | Population structure analysis (K=2-4).
Supplementary Figure 2 | Population structure analysis (K=5-10).
Supplementary Table S1 | Ethiopian indigenous goat population information. The characteristics, including physical characteristics and distributions.
Supplementary Table S2 | List of genes in selected overlapping regions by top 1% of windows with the highest average ZFST value for Aberegelle goat population.
Supplementary Table S3 | List of gene in the sellected lowest score (ZHP < -4 cut off ) for Begait goat population.
Supplementary Table S4 | Gene Ontology analysis.
Abbasi, F., Miyata, H., Shimada, K., Morohoshi, A., Nozawa, K., Matsumura, T., et al. (2018). RSPH6A is required for sperm flagellum formation and male fertility in mice. J. Cell Sci. 131 (19),jcs221648. doi: 10.1242/jcs.221648
Abegaz, S., Sölkner, J., Gizaw, S., Dessie, T., Haile, A., Wurzinger, M. (2013). Description of production systems and morphological characteristics of Abergelle and Western lowland goat breeds in Ethiopia: implication for community-based breeding programmes. Anim. Genet. Resour. 53, 69–78. doi: 10.1017/S2078633613000088
Adams, H. A., Southey, B. R., Everts, R. E., Marjani, S. L., Tian, X. C., Lewin, H. A., et al. (2011). Transferase activity function and system development process are critical in cattle embryo development. Funct. Integr. Genomics 11 (1), 139–150. doi: 10.1007/s10142-010-0189-9
Akey, J. M., Ruhe, A. L., Akey, D. T., Wong, A. K., Connelly, C. F., Madeoy, J., et al. (2010). Tracking footprints of artificial selection in the dog genome. Proc. Natl. Acad. Sci. U.S.A. 107 (3), 1160–1165. doi: 10.1073/PNAS.0909918107
Alexander, David H, November, J., Lange, K. (2009). Supplementary material for fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19 (9), 1655–1664. doi: 10.1101/gr.094052.109
Benjelloun, B., Alberto, F. J., Streeter, I., Boyer, F., Coissac, E., Stucki, S., et al. (2015). Characterizing neutral genomic diversity and selection signatures in indigenous populations of Moroccan goats (Capra hircus) using WGS data. Front. Genet. 6: 107. doi: 10.3389/fgene.2015.00107
Berhane, G., Eik, L. O. (2006). Effect of vetch (Vicia sativa) hay supplementation on performance of Begait and Abergelle goats in northern Ethiopia I. Milk yield and composition. Small Ruminant Res. 64, 225–232. doi: 10.1016/j.smallrumres.2005.04.021
Bickhart, D. M., Rosen, B. D., Koren, S., Sayre, B. L., Alex, R., Chan, S., et al. (2018). Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat. Nat. Genet. 49, (4), 643–650. doi: 10.1038/ng.3802.Single-molecule
Brito, L. F., Jafarikia, M., Grossi, D. A., Kijas, J. W., Porto-Neto, L. R., Ventura, R. V., et al. (2015). Characterization of linkage disequilibrium, consistency of gametic phase and admixture in Australian and Canadian goats. BMC Genet. 16, 67. doi: 10.1186/s12863-015-0220-1
Chen, Y., Chen, C., Chiang, H., Pena, M., Polci, R., Wei, R. L., et al. (2011). Mutation of NIMA-related kinase 1 (NEK1) leads to chromosome instability. Mol. Cancer 10, 5. doi: 10.1186/1476-4598-10-5
Chen, E. Y., Tan, C. M., Kou, Y., Duan, Q., Wang, Z., Meirelles, G. V., et al. (2013). Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool. BMC Bioinf. 14, 128. doi: 10.1186/1471-2105-14-128
CSA (2017). Federal Democratic Republic of Ethiopia: VOLUME II REPORT ON LIVESTOCK AND LIVESTOCK CHARACTERISTICS (PRIVATE PEASANT HOLDINGS). Federal Democratic Republic of Ethiopia. doi: 10.1596/24957
Danecek, P., Auton, A., Abecasis, G., Albers, C. A., Banks, E., DePristo, M. A., et al. (2011). The variant call format and VCFtools. Bioinformatics 27 (15), 2156–2158. doi: 10.1093/bioinformatics/btr330
Dina, C., Meyre, D., Gallina, S., Durand, E., Körner, A., Jacobson, P., et al. (2007). Variation in FTO contributes to childhood obesity and severe adult obesity. Nat. Genet. 39, 724–726. doi: 10.1038/ng2048
Getinet, M. (2016). Molecular characterization of Ethiopian indigenous goat populations: genetic diversity and structure, demographic dynamics and assessment of the kisspeptin gene polymorphism. Available at: https://www.researchgate.net/publication/329946729_MOLECULAR_CHARACTERIZATION_OF_ETHIOPIAN_INDIGENOUS_GOAT_POPULATIONS_GENETIC_DIVERSITY_AND_STRUCTURE_DEMOGRAPHIC_DYNAMICS_AND_ASSESSMENT_OF_THE_KISSPEPTIN_GENE_POLYMORPHISM.
Guo, J., Tao, H., Li, P., Li, L., Zhong, T., Wang, L., et al. (2018). Whole-genome sequencing reveals selection signatures associated with important traits in six goat breeds. Sci. Rep. 8, 10405. doi: 10.1038/s41598-018-28719-w
Hadfield, J., Eldridge, M. D., Taylor, S. (2014). Multi-genome alignment for quality control and contamination screening of next-generation sequencing data. Front. In Genet. 5, 31. doi: 10.3389/fgene.2014.00031
Hanrahan, J. P., Gregan, S. M., Mulsant, P., Mullen, M., Davis, G. H., Powell, R., et al. (2004). Mutations in the genes for Oocyte-derived growth factors GDF9 and BMP15 are associated with both increased ovulation rate and sterility in Cambridge and Belclare sheep (Ovis aries)1. Biol. Reprod. 70 (4), 900–909. doi: 10.1095/biolreprod.103.023093
Hassen, H., Lababidi, S., Rischkowsky, B., Baum, M., Tibbo, M. (2012). Molecular characterization of Ethiopian indigenous goat populations. Trop. Anim. Health Production 44 (6), 1239–1246. doi: 10.1007/s11250-011-0064-2
Higelin, J., Catanese, A., Semelink-sedlacek, L. L., Oeztuerk, S., Lutz, A., Bausinger, J., et al. (2018). NEK1 loss-of-function mutation induces DNA damage accumulation in ALS patient-derived motoneurons. Stem Cell Res. 30, 150–162. doi: 10.1016/j.scr.2018.06.005
Hirst, C. E., Ng, E. S., Azzola, L., Voss, A. K., Thomas, T., Stanley, E. G., et al. (2006). Transcriptional profiling of mouse and human ES cells identifies SLAIN1, a novel stem cell gene. Dev. Biol. 293, 90–103. doi: 10.1016/j.ydbio.2006.01.023
Kim, E. S., Elbeltagy, A. R., Aboul-Naga, A. M., Rischkowsky, B., Sayre, B., Mwacharo, J. M., et al. (2016). Multiple genomic signatures of selection in goats and sheep indigenous to a hot arid environment. Heredity 116, 255–264. doi: 10.1038/hdy.2015.94
Kowanda, M., Bergalet, J., Wieczorek, M., Brouhard, G., Lécuyer, É., Lasko, P. (2016). Loss of function of the drosophila ninein-related centrosomal protein Bsg25D causes mitotic defects and impairs embryonic development. Biol. Open 5 (8), 1040–1051. doi: 10.1242/bio.019638
Kristiansson, K., Perola, M., Tikkanen, E., Surakka, I., Havulinna, A. S., Tech, D., et al. (2012). Europe PMC funders group genome-wide screen for metabolic syndrome susceptibility loci reveals strong lipid gene contribution but no evidence for common genetic basis for clustering of metabolic syndrome traits. Circ. Cardiovasc. Genet. 5 (2), 242–249. doi: 10.1161/CIRCGENETICS.111.961482.Genome-Wide
Lai, F. N., Zhai, H. L., Cheng, M., Ma, J. Y., Cheng, S. F., Ge, W., et al. (2016). Whole-genome scanning for the litter size trait associated genes and SNPs under selection in dairy goat (Capra hircus). Sci. Rep. 6, 1–12. doi: 10.1038/srep38096
Lan, X., Li, D., Zhong, B., Ren, J., Wang, X., Sun, Q., et al. (2015). Identification of differentially expressed genes related to metabolic syndrome induced with high-fat diet in E3 rats. Exp. Biol. Med. 240, 235–241. doi: 10.1177/1535370214554531
Laske, C. H., Teixeira, B. B. M., Dionello, N. J. L., Cardoso, F. F. (2012). Breeding objectives and economic values for traits of low input familybased beef cattle production system in the State of Rio Grande do Sul. Rev. Bras. Zootecn. 41 (2), 298–305. doi: 10.1590/S1516-35982012000200010
Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., et al. (2009). The sequence alignment/map format and SAMtools. Bioinformatics 25 (16), 2078–2079. doi: 10.1093/bioinformatics/btp352
Li, M., Tian, S., Yeung, C. K. L., Meng, X., Tang, Q., Niu, L., et al. (2014). Whole-genome sequencing of Berkshire (European native pig) provides insights into its origin and domestication. Sci. Rep. 4, 4678. doi: 10.1038/srep04678
Li, X., Su, R., Zhang, W., Jiang, H., Qiao, X., Li, J. (2017). Identification of selection signals by large-scale whole-genome resequencing of cashmere goats. Sci. Rep. 7, 15142. doi: 10.1038/s41598-017-15516-0
McKenna, A., Hanna, M., Banks, E., Sivachenko, A., Cibulskis, K., Kernytsky, A., et al. (2010). The genome analysis toolkit: a mapreduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20 (9), 1297–303. doi: 10.1101/gr.107524.110.20
Mwacharo, J. M., Kim, E. S., Elbeltagy, A. R., Aboul-Naga, A. M., Rischkowsky, B. A., Rothschild, M. F. (2017). Genomic footprints of dryland stress adaptation in Egyptian fat-tail sheep and their divergence from East African and western Asia cohorts. Sci. Rep. 7, 1–10. doi: 10.1038/s41598-017-17775-3
Onzima, R. B., Upadhyay, M. R., Doekes, H. P., Brito, L. F., Bosse, M., Kanis, E., et al. (2018). Genome-wide characterization of selection signatures and runs of homozygosity in Ugandan goat breeds. Front. In Genet. 9 (AUG), 1–13. doi: 10.3389/fgene.2018.00318
Park, J.-Y., Park, M.-R., Hwang, K.-C., Chung, J.-S., Bui, H.-T., Kim, T., et al. (2010). Comparative gene expression analysis of somatic cell nuclear transfer-derived cloned pigs with normal and abnormal umbilical cords1. Biol. Reprod. 84 (1), 189–199. doi: 10.1095/biolreprod.110.085779
Pellegrina, D., Severino, P., Barbeiro, H., de Souza, H., Machado, M., Pinheiro-da-Silva, F., et al. (2017). Insights into the function of long noncoding RNAs in sepsis revealed by gene co-expression network analysis. Non-Coding RNA 3, 5. doi: 10.3390/ncrna3010005
Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M. A. R., Bender, D., et al. (2007). PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81 (3), 559–575. doi: 10.1371/journal.pgen.0020190
Rubin, C., Zody, M. C., Eriksson, J., Meadows, J. R. S., Sherwood, E., Besnier, F., et al. (2010). Whole-genome resequencing reveals loci under selection during chicken domestication. Nature 464, 587–591. doi: 10.1038/nature08832
Taneera, J., Fadista, J., Ahlqvist, E., Atac, D., Ottosson-Laakso, E., Wollheim, C. B., et al. (2014). Identification of novel genes for glucose metabolism based upon expression pattern in human islets and effect on insulin secretion and glycemia. Hum. Mol. Genet. 24, 1945–1955. doi: 10.1093/hmg/ddu610
Taye, M., Lee, W., Caetano-Anolles, K., Dessie, T., Hanotte, O., Mwai, O. A., et al. (2017). Whole genome detection of signature of positive selection in African cattle reveals selection for thermotolerance. Anim. Sci. J. 88 (12), 1889–1901. doi: 10.1111/asj.12851
Wang, X., Liu, J., Zhou, G., Guo, J., Yan, H., Niu, Y., et al. (2016). Whole-genome sequencing of eight goat populations for the detection of selection signatures underlying production and adaptive traits. Sci. Rep. 6, 38932. doi: 10.1038/srep38932
Wang, Z., Ma, H., Xu, L., Zhu, B., Liu, Y., Bordbar, F., et al. (2019). Genome-wide scan identifies selection signatures in Chinese Wagyu cattle using a high-density. Animals 9 (6), E296. doi: 10.3390/ani9060296.
Yang, J., Li, W.-R., Lv, F.-H., He, S.-G., Tian, S.-L., Peng, W.-F., et al. (2016). Whole-genome sequencing of native sheep provides insights into rapid adaptations to extreme environments. Mol. Biol. Evol. 33 (10), 2576–2592. doi: 10.1093/molbev/msw129.
Zheng, X., Levine, D., Shen, J., Gogarten, S. M., Laurie, C., Weir, B. S. (2012). A high-performance computing toolset for relatedness and principal component analysis of SNP data. Bioinformatics 28 (24), 3326–3328. doi: 10.1093/bioinformatics/bts606.
Zhu, B., Niu, H., Zhang, W., Wang, Z., Liang, Y., Guan, L., et al. (2017). Genome wide association study and genomic prediction for fatty acid composition in Chinese Simmental beef cattle using high density SNP array. BMC Genomics 18 (1), 1–15. doi: 10.1186/s12864-017-3847-7
Keywords: candidate genes, Capra hircus, pooled heterozygosity, population differentiation, positive selection signature
Citation: Berihulay H, Li Y, Gebrekidan B, Gebreselassie G, Liu X, Jiang L and Ma Y (2019) Whole Genome Resequencing Reveals Selection Signatures Associated With Important Traits in Ethiopian Indigenous Goat Populations. Front. Genet. 10:1190. doi: 10.3389/fgene.2019.01190
Received: 01 March 2019; Accepted: 28 October 2019;
Published: 28 November 2019.
Edited by:Farai Catherine Muchadeyi, Agricultural Research Council of South Africa (ARC-SA), South Africa
Reviewed by:Roger Vallejo, Cool and Cold Water Aquaculture Research (USDA-ARS), United States
Xiangdong Ding, China Agricultural University (CAU), China
Copyright © 2019 Berihulay, Li, Gebrekidan, Gebreselassie, Liu, Jiang and Ma. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.