- 1Qingdao Agricultural University, Qingdao, China
- 2Qingdao Institute of Animal Husbandry and Veterinary Medicine, Qingdao, China
- 3Inner Mongolia Shengjian Biotechnology Co., Ltd, Hohhot, China
- 4Inner Mongolia Academy of Agricultural and Animal Husbandry Sciences, Hohhot, China
Introduction: Identifying genetic markers associated with economically important traits in dairy goats helps enhance breeding efficiency, thereby increasing industry value. However, the potential genetic structure of key economic traits in dairy goats is still largely unknown.
Methods: This study used three genome-wide association study (GWAS) models (GLM, MLM, FarmCPU) to analyze dairy goat milk production traits (milk yield, fat percentage, protein percentage, lactose percentage, ash percentage, total dry matter, and somatic cell count). The goal was to identify SNPs and positional and functional candidate genes significantly associated with these traits.
Results: The GWAS analysis results identified a total of 242 significant SNPs. Among these, 45 SNPs exhibited genome-wide significance, while 197 SNPs demonstrated suggestive associations, corresponding to 99 positional candidate genes within a 50 kb upstream and downstream range. 15 significant SNP loci were consistently identified across all three models, corresponding to 18 candidate genes.The integrated analysis of three models detected 2, 19, 17, 4, 115, 23, and 62 significant SNPs associated with milk yield, ash percentage, protein percentage, lactose percentage, somatic cell count, fat percentage, and total dry matter percentage, respectively. Correspondingly, 6, 24, 9, 12, 37, 14, and 30 candidate genes were identified for these traits. Additionally, several new candidate genes related to milk production traits were proposed (LCORL, TNFRSF1A, VWF, SPATA6, MAN1C1, MASP1, BRCA2).
Discussion: In summary, the results of this study provide an important reference for further exploration of the genetic mechanisms underlying dairy goat milk production traits and the development of molecular breeding markers.
1 Introduction
China is a major country in goat farming and livestock product production, ranking first in the world for both the number of goats and the volume of goats slaughtered (Zhang et al., 2020). It ranks fourth in the world for goat milk production (Zhang et al., 2020). According to statistics, China has nearly 5 million dairy goats, producing 1.07 million tons of goat milk, which accounts for 5.7% of the world’s goat milk production and 3.4% of the national milk production (Guo et al., 2023). In the ranking of average milk yield per dairy goat, China is 37th, with an average of 183.6 kg per goat, which is above the world average level (Luo et al., 2019). Although China has one of the largest numbers of dairy goats in the world, the average milk yield per goat is still quite low. Therefore, improving the productivity of Chinese dairy goats is crucial for the development of the dairy goat industry in China.
Goat milk, as the main economic product of dairy goats, has always been an important component of human nutrition. Compared to cow milk, goat milk is more similar to human breast milk, offering higher nutritional value, better digestibility, and greater nutritional utilization (Feng et al., 2019). However, the production of goat milk is significantly lower than that of cow milk, and research on the components of goat milk is also much less than that on cow milk. Therefore, identifying important genetic markers and candidate genes associated with milk yield and milk components in dairy goats holds significant research value.
Genome-wide association studies (GWAS) are a method for exploring the genetic architecture of important economic traits and have now become a crucial method for detecting candidate genes for complex quantitative traits in livestock and poultry (Wittenburg et al., 2020). To date, several countries have conducted GWAS on different breeds of dairy goats and achieved certain results. For example, France has conducted GWAS on the milk production traits of Saanen and Alpine dairy goats and identified candidate genes related to these traits (Martin et al., 2017; Martin et al., 2018; Massender et al., 2023; Talouarn et al., 2020), Spain (Luigi-Sierra et al., 2024), New Zealand (Scholtens et al., 2020), Canada (Massender et al., 2023), the United States (Tilahun et al., 2020), the United Kingdom (Mucha et al., 2018), and Russia (Selionova et al., 2024). Several genes have been reported to be associated with milk production traits, significantly affecting milk composition. For example, Eid et al., 2020 detected a significant mutation T713C in the DGAT1 gene in Zaraibi goats, which can significantly regulate total milk solids and milk yield. Martin et al. (2017) found that the R251L and R396W mutations impact fat content reduction. Dettori et al. (2023) discovered that the CSN1S2 gene is not only related to milk protein and fat content but also to milk yield and other milk traits (pH value, SCS, NaCl). Massender et al. (2023), through GWAS, found that casein genes are important for milk production traits in Saanen and Alpine dairy goats.
The three models used in this study are GLM, MLM, and FarmCPU. The GLM model uses PCA as a covariate, which can reduce false positives caused solely by population structure (Price et al., 2006). The MLM model uses PCA and the K matrix to reduce false positives arising from both population structure and familial relationships (Yu et al., 2006). FarmCPU is a multi-locus model that uses a multi-locus linear mixed model (MLMM) and gradually incorporates multiple markers as covariates in the MLM to partially eliminate the confounding between the test marker and kinship (Liu et al., 2016). It has been reported that GLM and MLM are superior to ANOVA in controlling false positives, with MLM being superior to GLM (Yu et al., 2006). The MLM model can suppress the excessive inflation of p-values, thereby reducing false positives, but it also leads to an increased probability of false positives (Zhang et al., 2010). The FarmCPU model has been reported to better control the occurrence of false positives and false negatives compared to other models (Liu et al., 2016).
Therefore, the aim of this study is to use three models of genome-wide association analysis to identify SNPs and important candidate genes that are significantly associated with milk production traits.
2 Methods
2.1 Sample collection and phenotypic measurements
A total of 208 healthy lactating Guanzhong dairy goats (parity 2-4, lactation days 60–150) were randomly selected using a random number table at Shenda Ranch in Hohhot, Inner Mongolia. Jugular venipuncture was performed to collect 5 mL of EDTA-anticoagulated whole blood, which was immediately stored at −80 °C until genomic DNA extraction using the MagCore® Genomic DNA Blood Kit. The phenotypic data collected from dairy goats encompassed 22-day lactation milk production records, milk composition analysis data, and somatic cell count (SCC) in milk. Milk yield was recorded daily using milking machines, milk composition was analyzed by MilkoScan FT + infrared analyzer, and somatic cell count (SCC) was determined via Fossomatic 5,000 flow cytometry. Each milk sample underwent individual analysis to ensure the accuracy of the data pertaining to 7 milk production traits. These traits are as follows: milk yield (MY), fat percentage (FP), protein percentage (PP), lactose percentage (LP), ash percentage (AP), total dry matter (TDM), and somatic cell count (SCC).
2.2 Genotyping and quality control
Genomic DNA was extracted from tissue samples using the MagPure Tissue DNA Kit (Thermo Fisher), with quality assessed by Qubit 4.0 fluorometry and 1% agarose gel electrophoresis (OD260/280 = 1.8–2.0). DNA fragmentation to 300–500 bp was performed using a Covaris M220 ultrasonicator, followed by end repair, A-tailing, and adapter ligation to construct Illumina-compatible libraries. Library quality was verified via Agilent 2,100 Bioanalyzer (fragment size deviation ±10%). Sequencing was conducted on the MGI-Seq 2000 platform using probe-anchor synthesis technology (PE150 mode), with raw data quality control performed by FASTP v0.23.2. Chen et al. (2018), following these detailed steps (Zhang et al., 2020): Low-quality read filtering: Reads were excluded if more than 50% of the bases had a quality score (Q ≤ 20) (Guo et al., 2023). Adapter trimming: Removal of adapter sequences from the reads (Luo et al., 2019). Excessive N base filtering: Reads containing more than 5 N bases were discarded (Feng et al., 2019). Length filtering: Reads shorter than 100 bases were filtered out. Subsequently, alignment and variant detection of the sequencing data were performed using Sentieon software version 202,308.03 (Kendig et al., 2019). Quality control of the filtered data was performed using Plink 1.9 (Chang et al., 2015), adhering to the following stringent criteria: samples with a call rate below 95% were excluded, and SNPs that did not meet quality standards were filtered out. Specifically, all genotypes had a call rate of no less than 90%, the minor allele frequency (MAF) was no less than 5%, and the Hardy-Weinberg equilibrium test for SNP genotypes with a significance level of P < 1 × 10−6 (Tuersuntuoheti et al., 2023). Additionally, the retained SNPs must also meet the conditions of an average sequencing depth ≥5× and calling quality ≥20.
2.3 Population stratification
Genetic distances between samples were calculated, and a phylogenetic tree was constructed using Plink 1.9 for 15,511,550 SNP sites. Principal component analysis (PCA) was also performed using Plink 1.9 to cluster the sample populations based on these sites. The population genetic structure of 208 dairy goats was analyzed using Admixture software version 1.3.0 (Kars et al., 2021).
2.4 Genome-wide association studies
To determine the association between SNP and milk production traits, we conducted a genome-wide association study (GWAS) using the rMVP package in R software version 4.0.4 (Yin et al., 2021). Three models were employed: the General Linear Model (GLM), the Mixed Linear Model (MLM), and FarmCPU. Milk yield and milk composition data are utilized as the phenotype, with sample age incorporated as a covariate in the models. While GLM, MLM, and FarmCPU are widely used in GWAS analysis, they each have distinct differences.
The following three models, GLM, MLM, and FarmCPU, were used respectively for genome-wide association analysis:
Where y is the vector of phenotypes for the analyzed trait; n samples correspond to n traits; W is the fixed effects matrix; α is the corresponding coefficient including the intercept; x represents the SNP genotype; β represents the effect size of the marker; μ is the random effect; ε represents the residuals; S represents the quantitative trait SNP.
To confirm the significant impact of SNPs and identify important regions within the studied goat genome, a Bonferroni correction was performed. The P-value threshold of 3.22 × 10−9 indicates genome-wide significance, while a p-value threshold for suggestive associations of 6.45 × 10−8. To avoid false negatives caused by the stringent Bonferroni correction, we adjusted the P-value threshold for milk yield traits to 5 × 10−9 for genome-wide significance and 1 × 10−7 for suggestive associations. Manhattan plots and QQ plots were generated using R software version 4.0.4, with Manhattan plots illustrating the distribution of significant SNPs across the chromosomes, and QQ plots demonstrating the normal distribution characteristics of the data.
2.5 Functional analysis
Based on the gene annotation information of ARS1 in the Ensembl database (Index of/pub/release-113/gtf/capra_hircus), the GALLO package in R software version 4.2.2 is used to retrieve positional candidate genes within a 50 kb upstream and downstream range of a significant SNP. The online tools Metascape (https://metascape.org/gp/index.html) and KOBAS (https://bioinfo.org/kobas) are utilized to perform GO and KEGG enrichment analysis on the positional candidate genes to determine their biological functions and identify common pathways involved. Finally, the online tool STRING (https://cn.string-db.org) is used to construct a gene interaction network based on the predicted proteins of the positional candidate genes.
3 Results
3.1 Whole genome resequencing
The sequencing output data (clean data) for all samples totaled 6,317.42 Gb, with an average of 30.37 Gb per sample. The average Q30 value for all samples was 94.97%. The sequencing depth of 208 dairy goats was greater than 5×, with an average coverage of 90.60%. Only biallelic SNPs located on chromosomes were retained, resulting in a dataset comprising 15,511,550 SNPs for subsequent analysis.
3.2 Phenotypic statistics
Table 1 presents the descriptive statistics for milk yield and milk parameters of dairy goats used in this study. Analysis of the obtained data indicates that lactose percentage and total solids ratio are the parameters with the least variability, with coefficients of variation (CV) not exceeding 15%. The coefficients of variation for milk yield, milk fat percentage, milk protein percentage, and ash percentage range between 20% and 35%, which is consistent with data from other studies. Somatic cell count exhibits significant variability, with a CV exceeding 120%.
3.3 Population stratification
The phylogenetic tree constructed based on quality-controlled SNP markers is shown in Supplementary Figure S1. It can be observed that there is a certain similarity in the genetic characteristics of the sample populations. Principal component analysis (PCA) of the dairy goat populations revealed that the sample populations are relatively concentrated. According to the first and second principal components, the individual differences within the dairy goat population are small (Supplementary Figure S2). To determine the appropriate number of subgroups (K), we assumed K values ranging from 1 to 9 and calculated the cross-validation error for each breed assigned to the K-th subgroup. The results showed that when K = 2, the CV error stabilized, so K = 2 was chosen as the optimal number of subgroups. The distribution of cross-validation errors at different K values is shown in Supplementary Figure S3, and the population structure is illustrated in Supplementary Figure S4.
3.4 Significant SNP
Figure 1 and Supplementary Figure S6 shows the visualization of statistically significant polymorphic positions on 30 chromosomes for certain milk traits in dairy goats.

Figure 1. Manhattan plots for milk yield traits in dairy goats across three different models. Each plot, from top to bottom, represents the GLM, MLM, and FarmCPU models. On the Y-axis, the red line indicates the significance level of p ≤ 0.00001.
Using genome-wide association analysis, a total of 242 significant SNPs were identified across all traits. Among these, 45 SNPs exhibit genome-wide significance, while 197 SNPs demonstrate suggestive associations. The three models identified significant SNP for milk yield, ash percentage, protein percentage, lactose percentage, somatic cell count, fat percentage, and total dry matter percentage, with 2, 19, 17, 4, 115, 23, and 62 significant SNP respectively (Table 2). These SNP are distributed across 27 chromosomes (1, 2, 3, 4, 5, 6, 7, 8, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, X).

Table 2. Number of significant SNP and annotated genes within 50 kb upstream and downstream for each trait in dairy goats across different models.
In the GLM model for the somatic cell count trait, the highest number of significant SNP was identified, though there may be false positives among these significant SNP. For the milk yield trait, no significant SNP were identified for genome-wide significance, and no significant SNP were identified in the MLM model. In the FarmCPU model for the ash percentage trait, the most significant SNP locus (15:59125909, P = 3.10 × 10−22) was located on chromosome 15. Additionally, the FarmCPU model identified extremely significant SNP for somatic cell count, fat percentage, and total dry matter percentage traits, located on chromosome 13 (13:2275920, P = 4.14 × 10−16), chromosome 3 (3:14938807, P = 5.75 × 10−18), and chromosome 20 (20:66155928, P = 1.03 × 10−18), respectively.
3.5 Positional candidate genes
To further investigate the molecular markers and functional genes related to economic traits in goat milk, we identified genes within a 50 kb range upstream and downstream of the significant SNP as positional candidate genes. Based on annotation information, we found a total of 99 positional candidate genes. Supplementary Table S1 lists the positional candidate genes identified from the significant SNP found for each trait across the three different models. In dairy goats, 6, 24, 9, 12, 37, 14, and 30 positional candidate genes were identified for milk yield, ash percentage, protein percentage, lactose percentage, somatic cell count, fat percentage, and total dry matter percentage traits, respectively. These genes are located on 20 different chromosomes (1, 2, 3, 4, 5, 6, 7, 8, 12, 13, 14, 16, 17, 18, 20, 21, 22, 25, 27, 28). As shown in Supplementary Tables S1 15 significant SNP loci were consistently identified across all three models, corresponding to 18 positional candidate genes. Additionally, 70 significant SNP loci were shared between any two models, which were associated with 35 candidate genes.
Among the identified positional candidate genes, we found that 15 genes were identified across multiple traits. The genes CCDC154, CLCN7, GNPTG, PTX4, TELO2, UNKL, and UQCC4 were identified in traits for ash percentage, protein percentage, fat percentage, and total dry matter percentage. Other genes were identified in different traits. For instance, the genes BLVRB, HIPK4, PRX, SERTAD1, SERTAD3, and SPTBN4 were identified in ash percentage and total dry matter percentage traits; LCORL was identified in protein percentage and total dry matter percentage traits; SELENOF was identified in fat percentage and total dry matter percentage traits. The same genes were identified in the four traits of ash percentage, protein percentage, fat percentage, and total dry matter percentage, while no genes were identified as common between milk yield, somatic cell count, and lactose percentage traits. All genes related to multiple traits were identified in the total dry matter percentage trait.
3.6 Functional analyses
We summarized the key biological functions of the positional candidate genes through GO enrichment (Figure 2), with calcium ion binding (GO:0005509) and intracellular membrane-bounded organelle (GO:0043231) having the highest number of candidate genes, each with 7 genes. The three most significant biological functions based on P-values (P < 5 × 10−4) include regulation of RNA splicing (PTBP3, PRX, RBM38), cardiac conduction (SPTBN4, KCNH2, ZMPSTE24), and spindle pole (NEDD9, FRY, PLEKHG6, RAD21).
In the KEGG enrichment analysis (Figure 3), the highest number of candidate genes, six in total (NOS3, AGXT, MAN1C1, ITPKB, MGAT4C, BLVRB), are involved in Metabolic pathways. Additionally, three candidate genes are associated with Breast cancer (DLL3, BRCA2, WNT2B), the mTOR signaling pathway (WNT2B, TELO2, TNFRSF1A), and Pathways in cancer (DLL3, BRCA2, WNT2B).
Using 99 candidate genes from all traits, a gene interaction network was constructed to visualize potential functional candidate genes (Figure 4). 41 candidate genes were predicted to have interactions, forming a total of twelve connected groups. The largest gene group consists of 11 genes. This gene network shows high connectivity (P = 2.34 × 10−3). Among these, RHOC and SUPT5H co-regulate four genes each, while INTS12 interacts with three genes. These genes warrant prioritized investigation in subsequent functional studies.
4 Discussion
The milk production trait, being an essential characteristic in dairy goats, has been extensively studied in various breeds through high-density genotyping and GWAS to identify significant genetic markers and candidate genes related to economically important traits in goat milk. According to previous studies, genes such as CSN1S1 (Massender et al., 2023), CSN1S2 (Massender et al., 2023), SOCS3 (Song et al., 2024), SLC7A5 (Ni et al., 2024), DGAT1 (Eid et al., 2020), and INSIG1 (Wang et al., 2016) have all been associated with milk production traits in dairy goats. Among these, CSN1S1, CSN1S2, and DGAT1 have been widely reported in studies on milk production traits in dairy goats. However, these genes were not linked to the traits in our results. We hypothesize that this is due to milk production being a complex quantitative trait influenced by both genetic and environmental factors, leading to varying association results across different dairy goat breeds. There may be genetic differences between dairy goat breeds, and the genomic structure, metabolic mechanisms, and development of the mammary glands in different breeds may have varying effects on milk production (Massender et al., 2023). The choice of statistical models and data analysis methods can also have a significant impact on the research results.
In GWAS analysis, the choice of model is crucial for the accuracy of the results. This study employed three models—GLM, MLM, and FarmCPU—to conduct GWAS analysis on milk production traits in dairy goats. Utilizing these three models is beneficial as different models may have varying sensitivities to different types of gene effects, such as additive effects, dominant effects, and epistatic effects. By using multiple models, the probability of detecting single nucleotide polymorphisms (SNPs) associated with the traits can be increased. Each model has different assumptions and algorithms for processing data. Employing various models helps to reduce bias and errors that may arise from the failure of assumptions in a single model, thus making the results more robust. The combined use of these models can fully leverage their respective strengths, thereby enhancing the overall effectiveness of the analysis (Visscher et al., 2017).
Each of the three analytical models presents distinct advantages and limitations. While the GLM detects a greater number of statistically significant SNP loci, it demonstrates an elevated probability of false positives (Price et al., 2006). The MLM, incorporating a kinship matrix, provides superior control over false positive results, though this comes at the cost of increased risk of false negatives (Yu et al., 2006). The FarmCPU model, with its built-in covariate correction capability, enables more comprehensive identification of significant SNP loci and candidate genes associated with complex traits; however, its stringent data requirements may lead to reduced detection of true positive loci (Liu et al., 2016). Integrating GLM, MLM, and FarmCPU models significantly expands the discovery of novel functional SNPs and enhances the detection of true positive loci, thereby identifying more candidate genes for downstream screening. However, this multi-model approach concomitantly increases false positive risks, necessitating rigorous experimental validation in subsequent studies. According to previous studies, the Bonferroni method for determining the threshold is highly restrictive, which may lead to the exclusion of some useful loci and result in false-negative outcomes (Armstrong, 2014). The QQ plots for milk yield trait analysis across all three models demonstrate systematic deviations of observed values from the expected probability distributions, indicating elevated risks of false negatives (Figure 5). Therefore, based on the actual situation, we adjusted the P-value threshold for milk yield traits to 5x10−9 for genome-wide signification and 1x10−7 for suggestive associations.

Figure 5. QQ plots for three different models across milk yield traits in dairy goats. The first column represents the GLM model (blue), the second column represents the MLM model (yellow), and the third column represents the FarmCPU model (purple). The red line represents the distribution of expected values, while the blue, yellow, and purple lines represent the actual distribution of observed values.
The lack of variability in lactose and total solids may be due to the biological characteristics of Guanzhong Dairy Goat, warranting further investigation. The range of variation in milk yield, milk fat percentage, milk protein percentage, and ash percentage suggests that these traits follow a normal distribution in the study samples. The high variability in somatic cell count may be attributed to the presence of mastitis in some dairy goats, leading to a significant increase in the number of somatic cells. In the milk of dairy goats tested in this group, the maximum somatic cell count exceeded 10000k/mL, while the minimum was 32k/mL, indicating significant differences between individuals (Tvarožková et al., 2023). However, it is known that the normal level of somatic cell count in dairy goats is below 1000k/mL, with excessive counts indicating the presence of mastitis (Smistad et al., 2025).
The genetic distances between all samples in the phylogenetic tree ranged from 0.1 to 0.3, indicating high genetic similarity among them (Supplementary Figure S1). This confirms that all samples belong to the same species. Principal component analysis (PCA) further supported this conclusion, as the samples clustered closely together (Supplementary Figure S2). However, slight dispersion of some samples suggests individual genetic variation within the population. Population structure analysis revealed that the optimal number of subgroups (K) was 2, reflecting genetic divergence among individuals (Supplementary Figures S3,S4). This may be attributed to the hybrid origin of Guanzhong dairy goats, which resulted from crossbreeding between Saanen goats and indigenous Chinese goats. The individual genetic differences observed in both PCA and population structure analyses could lead to population stratification, a major challenge in GWAS. Moreover, population stratification may introduce spurious associations, potentially generating numerous false-positive results in GWAS analyses (Haldar and Ghosh, 2012). Appropriate application of statistical models can effectively correct for population stratification and reduce false-positive associations (Sharma et al., 2015). Including principal components (PCs) from PCA as covariates in association analysis can effectively mitigate the impact of population stratification. The QQ plot showed minimal deviation of observed p-values from the expected distribution (λ = 1.06), while the genomic inflation factor (λ) remained within the acceptable threshold (0.94 < λ < 1.05) (Nazar et al., 2022). This approach can significantly reduce false-positive associations caused by population stratification.
For the milk yield trait (Figure 4), the QQ plot of the model indicates that the observed values for the MLM model deviate downward from the expected probability compared to the GLM and FarmCPU models, thereby increasing false negatives in the MLM model. In the QQ plot for the lactose trait, the p-values in the MLM model also show a slight downward deviation, indicating the presence of false negatives. In the QQ plot for somatic cell count using the GLM model, the p-values tilt upward prematurely at the front end, increasing the probability of SNP false positives (Supplementary Figure S5). The QQ plots for the FarmCPU model in traits such as ash content, somatic cell count, fat, and total solids show a line close to 1:1 with a sharp deviation at the tail (Supplementary Figure S5). In the QQ plot for the ash content trait in the FarmCPU model, the p-values exhibit the highest deviation, indicating the presence of SNP highly associated with this trait (Supplementary Figure S5).
We believe that the 18 candidate genes (CCDC154, CLCN7, GNPTG, PTX4, TELO2, UNKL, UQCC4, BLVRB, HIPK4, PRX, SERTAD1, SERTAD3, SPTBN4, LCORL, SELENOF, RHOC, SUPT5H, INTS12) identified in multiple traits are of significant importance for further research. Liao et al. (2012) found that upregulating CCDC154 in HEK293 cells can inhibit cell proliferation by inducing G2/M arrest. Hong et al. (2023) found that CCDC154 can promote the proliferation and metastasis of colorectal cancer cells. CLCN7 was previously identified as a positional candidate gene for acute anterior uveitis (Huang et al., 2020). CLCN7 expression is upregulated by the pain- and inflammation-associated mediator bradykinin (BK), thereby promoting osteoclast differentiation (Srivastava et al., 2014). GNPTG exerts a downregulatory effect on subclinical endometritis in dairy cows, effectively suppressing inflammatory responses (Hoelker et al., 2012). The pentraxin family includes PTX1 and PTX2 (NPTX1 and NPTX2, respectively), PTX3, and PTX4 (Li et al., 2023). These genes serve as a link between immunity and inflammation, and as a bridge between innate and adaptive immunity (Li et al., 2023). However, PTX4 is less studied and requires more research to elucidate its function in immunity (Li et al., 2023). The PRX gene has been identified to be strongly associated with sheep wool follicle development and wool shedding, playing an important role in promoting follicle formation, epidermal differentiation, and follicle stem cell development (Lei et al., 2021). Extracellular PRX family proteins released from necrotic brain cells activate Toll-like receptor 2 (TLR2) and TLR4 in macrophages, upregulating inflammatory cytokines such as IL-23 and subsequently promoting neural cell death (Ito et al., 2013). Using SERTAD1 knockout mice and various molecular methods revealed that SERTAD1 can regulate NLRP3-mediated inflammasome activation (Ha et al., 2024). Studies have reported the inhibitory effect of SERTAD1 on apoptosis/anoikis (Nguyen et al., 2022). In the GWAS analysis of milk production traits in Thai dairy cattle, LCORL was identified as a positional candidate gene for milk total solids percentage (Buaban et al., 2022). LCORL gene-associated quantitative trait loci (QTL) are related to the somatic cell score in Valle del Belice Dairy Sheep (Mohammadi et al., 2022). SELENOF can affect the proliferation and death of normal epithelial and breast cancer cells by regulating p21 and p27 (Ekyalongo et al., 2023). Lipopolysaccharide (LPS) induces inflammatory responses in cells while upregulating SELENOF expression (Lee et al., 2022). According to research results and related gene studies, most genes are associated with cancer, inflammation, and cell proliferation. Mastitis, as a common inflammation, also leads to cell proliferation and may phenotypically affect milk composition traits (Paixão et al., 2017). Therefore, the genes CCDC154, CLCN7, GNPTG, PTX4, PRX, SERTAD1, RHOC, SUPT5H, INTS12 and SELENOF may potentially be associated with somatic cell count (SCC) and mastitis susceptibility. Although the genes TELO2, UNKL, BLVRB, HIPK4, SERTAD3, SPTBN4, and UQCC4 demonstrated significant associations with multiple traits, no published studies have yet established their relationship with milk production traits, warranting further experimental validation. Among them, the LCORL gene is related to milk production traits, and further research may help to explore its potential function in dairy goat milk production traits.
Among the 99 positional candidate genes identified, we found 5 genes associated with milk production traits through literature review: TNFRSF1A, SPATA6, MAN1C1, MASP1, and BRCA2. In cattle, higher mRNA expression levels of TNFRSF1A correlate with lower proliferation rates of mammary cells but higher apoptosis rates (Orellana Rivas et al., 2021). Lopreiato et al. (2020) identified high transcript abundance of TNFRSF1A through MIXED model variance analysis, indicating its important role in immune response and inflammation. SPATA6 was identified as a positional candidate gene for somatic cell score traits in Romanian dairy cows through GWAS (Ilie et al., 2021). MAN1C1 was determined as a target candidate gene for bovine milk oligosaccharides (Poulsen et al., 2019) and lactation persistence (Do et al., 2017) according to GWAS. A polymorphism in MASP1, SNP g.5766A>G, is associated with milk protein percentage (Zhang et al., 2019). BRCA2 increases the risk of breast cancer in women and promotes the proliferation and differentiation of mammary epithelial cells (Vidarsson et al., 2002). Mammary epithelial cells secrete milk and synthesize its major components, and their proliferation and differentiation impact milk production traits.
5 Conclusion
This study identified SNPs and positional and functional candidate genes significantly associated with milk production traits in dairy goats using three GWAS models: GLM, MLM, and FarmCPU. The GWAS analysis results showed a total of 242 significant SNPs corresponding to 99 positional candidate genes within a 50 kb upstream and downstream range. 15 significant SNP loci were consistently identified across all three models, corresponding to 18 candidate genes. The integrated analysis of three models detected 2, 19, 17, 4, 115, 23, and 62 significant SNPs associated with milk yield, ash percentage, protein percentage, lactose percentage, somatic cell count, fat percentage, and total dry matter percentage, respectively. Correspondingly, 6, 24, 9, 12, 37, 14, and 30 candidate genes were identified for these traits. Additionally, we proposed several new candidate genes related to milk production traits (LCORL, TNFRSF1A, SPATA6, MAN1C1, MASP1, BRCA2). In summary, this study provides an important basis for subsequent research.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://ngdc.cncb.ac.cn/gsa, CRA031559.
Ethics statement
The animal studies were approved by Qingdao Agricultural University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent was obtained from the owners for the participation of their animals in this study.
Author contributions
ZH: Writing – original draft, Writing – review and editing, Data curation, Formal analysis, Methodology, Validation, Visualization. YT: Writing – original draft, Data curation, Methodology. JaZ: Writing – original draft, Methodology. XL: Writing – original draft, Methodology. DX: Writing – original draft, Methodology. MC: Writing – review and editing, Methodology. JW: Writing – review and editing, Methodology. QZ: Writing – review and editing, Methodology. JH: Writing – review and editing, Methodology. XG: Writing – review and editing, Methodology. JnZ: Writing – review and editing, Conceptualization, Funding acquisition, Methodology, Project administration, Supervision. HL: Writing – original draft, Writing – review and editing, Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Supervision, Validation.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This study was supported by Biological Breeding-National Science and Technology Major Project (2022ZD04014), Shandong Natural Science Foundation (ZR2022MC094, ZR2020MC164), Projects of Qingdao People’s Livelihood Science and Technology (23-2-8-xdny-9-nsh, 21-1-4-ny-8-nsh), Technical System of Sheep and Goat Industry in Shandong Province (SDAIT-10-02) and Postgraduate Education Reform Project of Shandong Province (SDYJSJGC2023061).
Conflict of interest
Author JW was employed by Inner Mongolia Shengjian Biotechnology Co., Ltd.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2025.1650836/full#supplementary-material
SUPPLEMENTARY FIGURE S1 | Phylogenetic tree of 208 dairy goats.
SUPPLEMENTARY FIGURE S2 | Population structure demonstrated by principal component analysis.
SUPPLEMENTARY FIGURE S3 | Cross-validation error values at different K values.
SUPPLEMENTARY FIGURE S4 | Population structure plot of 208 dairy goats. The value of K ranges from 2 to 5, in descending order.
SUPPLEMENTARY FIGURE S5 | QQ plots for three different models across six traits in dairy goats. The first column represents the GLM model (blue), the second column represents the MLM model (yellow), and the third column represents the FarmCPU model (purple). AP = Ash Percentage; PP = Protein Percentage; LP = Lactose Percentage; SCC = Somatic Cell Count; FP = Fat Percentage; TDM = Total Dry Matter Percentage. The red line represents the distribution of expected values, while the blue, yellow, and purple lines represent the actual distribution of observed values.
SUPPLEMENTARY FIGURE S6 | Manhattan plots for six traits in dairy goats across three different models. (a) — AP; (b) — PP; (c) — LP; (d) — SCC; (e) — FP; (f) — TDM. Each plot, from top to bottom, represents the GLM, MLM, and FarmCPU models. On the Y-axis, the red line indicates the significance level of p ≤ 0.00001.
References
Armstrong, R. A. (2014). When to use the Bonferroni correction. Ophthalmic Physiological Opt. 34, 502–508. doi:10.1111/opo.12131
Buaban, S., Lengnudum, K., Boonkum, W., and Phakdeedindan, P. (2022). Genome-wide association study on milk production and somatic cell score for Thai dairy cattle using weighted single-step approach with random regression test-day model. J. Dairy Sci. 105, 468–494. doi:10.3168/jds.2020-19826
Chang, C. C., Chow, C. C., Tellier, L. C., Vattikuti, S., Purcell, S. M., and Lee, J. J. (2015). Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7. doi:10.1186/s13742-015-0047-8
Chen, S., Zhou, Y., Chen, Y., and Gu, J. (2018). fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890. doi:10.1093/bioinformatics/bty560
Dettori, M. L., Pazzola, M., Noce, A., Landi, V., Vacca, G. M., Manca, N., et al. (2023). Exploring the genotype at CSN3 gene, milk composition, coagulation and cheese-yield traits of the sardo-modicana, an autochthonous cattle breed from the sardinia region, Italy. Anim. (Basel) 10, 1995. doi:10.3390/ani10111995
Do, D. N., Bissonnette, N., Lacasse, P., Miglior, F., Sargolzaei, M., Zhao, X., et al. (2017). Genome-wide association analysis and pathways enrichment for lactation persistency in Canadian Holstein cattle. J. Dairy Sci. 100, 1955–1970. doi:10.3168/jds.2016-11910
Eid, J. I., Teleb, D. F., Mohamed, S. A., and El-Ghor, A. A. (2020). DGAT1 polymorphism in Egyptian Zaraibi goat breed and their association with milk yield and composition. J. Basic Appl. Zoology 81, 38–7. doi:10.1186/s41936-020-00176-w
Ekyalongo, R. C., Flowers, B., Sharma, T., Zigrossi, A., Zhang, A., Quintanilla-Arteaga, A., et al. (2023). SELENOF controls proliferation and cell death in breast-derived Immortalized and cancer cells. Cancers (Basel) 15, 3671. doi:10.3390/cancers15143671
Feng, C., Wang, B., Zhao, A., Wei, L., Shao, Y., Wang, Y., et al. (2019). Quality characteristics and antioxidant activities of goat milk yogurt with added jujube pulp. Food Chem. 277, 238–245. doi:10.1016/j.foodchem.2018.10.104
Guo, L., Li, S., Zhao, G., Zhang, W., and Bai, Y. (2023). Analysis on the development of dairy industry of goat in China under the background of modern agriculture. China Dairy Cattle 8, 48–51.
Ha, J., Kim, M., Park, J. S., Lee, Y., Lee, J. Y., Shin, J. C., et al. (2024). SERTAD1 initiates NLRP3-mediated inflammasome activation through restricting NLRP3 polyubiquitination. Cell Rep. 43, 113752. doi:10.1016/j.celrep.2024.113752
Haldar, T., and Ghosh, S. (2012). Effect of population stratification on false positive rates of population-based association analyses of quantitative traits. Ann. Hum. Genet. 76, 237–245. doi:10.1111/j.1469-1809.2012.00708.x
Hoelker, M., Salilew-Wondim, D., Drillich, M., Christine, G. B., Ghanem, N., Goetze, L., et al. (2012). Transcriptional response of the bovine endometrium and embryo to endometrial polymorphonuclear neutrophil infiltration as an indicator of subclinical inflammation of the uterine environment. Reprod. Fertil. Dev. 24, 778–793. doi:10.1071/rd11171
Hong, W. F., Zhu, D. X., Chen, Y. J., Shen, X. Z., Cui, Y. H., Du, S. S., et al. (2023). Coiled-coil domain-containing 154 promotes colorectal cancer proliferation and metastasis via interacting with minichromosome maintenance complex component 2. Cancer Lett. 578, 216460. doi:10.1016/j.canlet.2023.216460
Huang, X. F., Li, Z., De Guzman, E., Robinson, P., Gensler, L., Ward, M. M., et al. (2020). Genomewide association study of acute anterior uveitis identifies new Susceptibility loci. Invest Ophthalmol. Vis. Sci. 61, 3. doi:10.1167/iovs.61.6.3
Ilie, D. E., Mizeranschi, A. E., Mihali, C. V., Neamț, R. I., Goilean, G. V., Georgescu, O. I., et al. (2021). Genome-wide association studies for milk somatic cell score in Romanian dairy cattle. Genes (Basel). 12, 1495. doi:10.3390/genes12101495
Ito, M., Kondo, T., Shichita, T., and Yoshimura, A. (2013). Post-ischemic innate immunity and its application for novel therapeutic strategy targeting brain inflammation. Nihon Rinsho 71, 1291–1301.
Kars, M. E., Başak, A. N., Onat, O. E., Bilguvar, K., Choi, J., Itan, Y., et al. (2021). The genetic structure of the Turkish population reveals high levels of variation and admixture. Proc. Natl. Acad. Sci. U. S. A. 118, e2026076118. doi:10.1073/pnas.2026076118
Kendig, K. I., Baheti, S., Bockol, M. A., Drucker, T. M., Hart, S. N., Heldenbrand, J. R., et al. (2019). Sentieon DNASeq variant calling workflow demonstrates strong computational performance and accuracy. Front. Genet. 10, 736. doi:10.3389/fgene.2019.00736
Lee, T. J., Nettleford, S. K., McGlynn, A., Carlson, B. A., Kirimanjeswara, G. S., and Prabhu, K. S. (2022). The role of selenoproteins in neutrophils during inflammation. Arch. Biochem. Biophys. 732, 109452. doi:10.1016/j.abb.2022.109452
Lei, Z., Sun, W., Guo, T., Li, J., Zhu, S., Lu, Z., et al. (2021). Genome-wide selective signatures reveal candidate genes associated with hair follicle development and wool shedding in sheep. Genes (Basel) 12, 1924. doi:10.3390/genes12121924
Li, Y., Zhang, S., Liu, J., Zhang, Y., Zhang, N., Cheng, Q., et al. (2023). The pentraxin family in autoimmune disease. Clin. Chim. Acta 551, 117592. doi:10.1016/j.cca.2023.117592
Liao, W., Zhao, R., Lu, L., Zhang, R., Zou, J., Xu, T., et al. (2012). Overexpression of a novel osteopetrosis-related gene ccdc154 suppresses cell proliferation by inducing g2/m arrest. Cell Cycle 11, 3270–3279. doi:10.4161/cc.21642
Liu, X., Huang, M., Fan, B., Buckler, E. S., and Zhang, Z. (2016). Iterative usage of fixed and random effect models for powerful and efficient genome-wide association studies. PLoS Genet. 12, e1005767. doi:10.1371/journal.pgen.1005767
Lopreiato, V., Minuti, A., Morittu, V. M., Britti, D., Piccioli-Cappelli, F., Loor, J. J., et al. (2020). Short communication: inflammation, migration, and cell-cell interaction-related gene network expression in leukocytes is enhanced in Simmental compared with Holstein dairy cows after calving. J. Dairy Sci. 103, 1908–1913. doi:10.3168/jds.2019-17298
Luigi-Sierra, M. G., Martínez, A., Macri, M., Delgado, J. V., Castelló, A., Alvarez, J. F., et al. (2024). Single and longitudinal genome-wide association studies for dairy traits available in goats with three recorded lactations. Anim. Genet. 55, 257–264. doi:10.1111/age.13391
Luo, J., Shi, H., Wang, J., Zhang, F., Zheng, H., Ji, Z., et al. (2019). A summary of the development of Chinese dairy goat industry—development trends and characteristics. China Dairy Cattle 9, 1–11.
Martin, P., Palhière, I., Maroteau, C., Bardou, P., Canale-Tabet, K., Sarry, J., et al. (2017). A genome scan for milk production traits in dairy goats reveals two new mutations in Dgat1 reducing milk fat content. Sci. Rep. 7, 1872. doi:10.1038/s41598-017-02052-0
Martin, P., Palhière, I., Maroteau, C., Clément, V., David, I., Klopp, G. T., et al. (2018). Genome-wide association mapping for type and mammary health traits in French dairy goats identifies a pleiotropic region on chromosome 19 in the Saanen breed. J. Dairy Sci. 101, 5214–5226. doi:10.3168/jds.2017-13625
Massender, E., Oliveira, H. R., Brito, L. F., Maignel, L., Jafarikia, M., Baes, C. F., et al. (2023). Genome-wide association study for milk production and conformation traits in Canadian Alpine and Saanen dairy goats. J. Dairy Sci. 106, 1168–1189. doi:10.3168/jds.2022-22223
Mohammadi, H., Farahani, A. H. K., Moradi, M. H., Mastrangelo, S., Di Gerlando, R., Sardina, M. T., et al. (2022). Weighted Single-Step Genome-Wide Association Study Uncovers Known and Novel Candidate Genomic Regions for Milk Production Traits and Somatic Cell Score in Valle del Belice Dairy Sheep. Anim. (Basel) 12, 1155. doi:10.3390/ani12091155
Mucha, S., Mrode, R., Coffey, M., Kizilaslan, M., Desire, S., and Conington, J. (2018). Genome-wide association study of conformation and milk yield in mixed-breed dairy goats. J. Dairy Sci. 101, 2213–2225. doi:10.3168/jds.2017-12919
Nazar, M., Abdalla, I. M., Chen, Z., Ullah, N., Liang, Y., Chu, S., et al. (2022). Genome-wide association study for udder conformation traits in Chinese holstein cattle. Anim. (Basel). 12, 2542. doi:10.3390/ani12192542
Nguyen, H. A., Vu, S. H., Jung, S., Lee, B. S., Nguyen, T. N. Q., Lee, H., et al. (2022). SERTAD1 sensitizes breast cancer cells to doxorubicin and promotes lysosomal protein biosynthesis. Biomedicines 10, 1148. doi:10.3390/biomedicines10051148
Ni, M., Yue, Z., Tian, M., Luo, X., Wang, W., Shi, H., et al. (2024). Leucine-mediated SLC7A5 promotes milk protein and milk fat synthesis through mTOR signaling pathway in goat mammary epithelial cells. J. Agric. Food Chem. 72, 13728–13739. doi:10.1021/acs.jafc.4c02087
Orellana Rivas, R. M., Marins, T. N., Weng, X., Monteiro, A. P. A., Guo, J., Gao, J., et al. (2021). Effects of evaporative cooling and dietary zinc source on heat shock responses and mammary gland development in lactating dairy cows during summer. J. Dairy Sci. 104, 5021–5033. doi:10.3168/jds.2020-19146
Paixão, M. G., Abreu, L. R., Richert, R., and Ruegg, P. L. (2017). Milk composition and health status from mammary gland quarters adjacent to glands affected with naturally occurring clinical mastitis. J. Dairy Sci. 100, 7522–7533. doi:10.3168/jds.2017-12547
Poulsen, N. A., Robinson, R. C., Barile, D., Larsen, L. B., and Buitenhuis, B. (2019). A genome-wide association study reveals specific transferases as candidate loci for bovine milk oligosaccharides synthesis. BMC Genomics 20, 404. doi:10.1186/s12864-019-5786-y
Price, A. L., Patterson, N. J., Plenge, R. M., Weinblatt, M. E., Shadick, N. A., and Reich, D. (2006). Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38, 904–909. doi:10.1038/ng1847
Scholtens, M., Jiang, A., Smith, A., Littlejohn, M., Lehnert, K., Snell, R., et al. (2020). Genome-wide association studies of lactation yields of milk, fat, protein and somatic cell score in New Zealand dairy goats. J. Anim. Sci. Biotechnol. 11, 55. doi:10.1186/s40104-020-00453-2
Selionova, M., Trukhachev, V., Aibazov, M., Sermyagin, A., Belous, A., Gladkikh, M., et al. (2024). Genome-wide association study of milk composition in karachai goats. Anim. (Basel) 14, 327. doi:10.3390/ani14020327
Sharma, A., Lee, J. S., Dang, C. G., Sudrajad, P., Kim, H. C., Yeon, S. H., et al. (2015). Stories and challenges of genome wide association studies in livestock - a review. Asian-Australas J. Anim. Sci. 28, 1371–1379. doi:10.5713/ajas.14.0715
Smistad, M., Inglingstad, R. A., Vatne, M. K., Franklin, F. V., Hansen, B. G., Skeie, S., et al. (2025). Somatic cell count in dairy goats II: udder health monitoring at goat and herd level. BMC Vet. Res. 21, 157. doi:10.1186/s12917-025-04556-8
Song, N., Ma, C., Guo, Y., Cui, S., Chen, S., Chen, Z., et al. (2024). Identifying differentially expressed genes in goat mammary epithelial cells induced by overexpression of SOCS3 gene using RNA sequencing. Front. Vet. Sci. 11, 1392152. doi:10.3389/fvets.2024.1392152
Srivastava, S., Sharma, K., Kumar, N., and Roy, P. (2014). Bradykinin regulates osteoblast differentiation by Akt/ERK/NFκB signaling axis. J. Cell Physiol. 229, 2088–2105. doi:10.1002/jcp.24668
Talouarn, E., Bardou, P., Palhière, I., Oget, C., Clément, V., Tosser-Klopp, G., et al. (2020). Genome wide association analysis on semen volume and milk yield using different strategies of imputation to whole genome sequence in French dairy goats. BMC Genet. 21, 19. doi:10.1186/s12863-020-0826-9
Tilahun, Y., Gipson, T. A., Alexander, T., McCallum, M. L., and Hoyt, P. R. (2020). Genome-wide association study towards genomic predictive power for high production and quality of milk in American alpine goats. Int. J. Genomics. 2020, 6035694. doi:10.1155/2020/6035694
Tuersuntuoheti, M., Zhang, J., Zhou, W., Zhang, C. L., Liu, C., Chang, Q., et al. (2023). Exploring the growth trait molecular markers in two sheep breeds based on Genome-wide association analysis. PLoS One 18, e0283383. doi:10.1371/journal.pone.0283383
Tvarožková, K., Tančin, V., Uhrinčať, M., Oravcová, M., Hleba, L., Gancárová, B., et al. (2023). Pathogens in milk of goats and their relationship with somatic cell count. J. Dairy Res. 90, 173–177. doi:10.1017/s0022029923000237
Vidarsson, H., Mikaelsdottir, E. K., Rafnar, T., Bertwistle, D., Ashworth, A., Eyfjord, J. E., et al. (2002). BRCA1 and BRCA2 bind Stat5a and suppress its transcriptional activity. FEBS Lett. 532, 247–252. doi:10.1016/s0014-5793(02)03684-0
Visscher, P. M., Wray, N. R., Zhang, Q., Sklar, P., McCarthy, M. I., Brown, M. A., et al. (2017). 10 Years of GWAS discovery: biology, function, and translation. Am. J. Hum. Genet. 101, 5–22. doi:10.1016/j.ajhg.2017.06.005
Wang, H., Luo, J., Zhang, T., Tian, H., Ma, Y., Xu, H., et al. (2016). MicroRNA-26a/b and their host genes synergistically regulate triacylglycerol synthesis by targeting the INSIG1 gene. RNA Biol. 13, 500–510. doi:10.1080/15476286.2016.1164365
Wittenburg, D., Bonk, S., Doschoris, M., and Reyer, H. (2020). Design of experiments for fine-mapping quantitative trait loci in livestock populations. BMC Genet. 21, 66. doi:10.1186/s12863-020-00871-1
Yin, L., Zhang, H., Tang, Z., Xu, J., Yin, D., Zhang, Z., et al. (2021). rMVP: a memory-efficient, visualization-enhanced, and parallel-accelerated tool for genome-wide association study. Genomics Proteomics Bioinforma. 19, 619–628. doi:10.1016/j.gpb.2020.10.007
Yu, J., Pressoir, G., Briggs, W. H., Vroh, Bi I., Yamasaki, M., Doebley, J. F., et al. (2006). A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat. Genet. 38, 203–208. doi:10.1038/ng1702
Zhang, Z., Ersoz, E., Lai, C. Q., Todhunter, R. J., Tiwari, H. K., Gore, M. A., et al. (2010). Mixed linear model approach adapted for genome-wide association studies. Nat. Genet. 42, 355–360. doi:10.1038/ng.546
Zhang, H., Wei, Y., Zhang, F., Liu, Y., Wang, H., Li, Y., et al. (2019). Polymorphisms of mannose-binding lectin-associated serine protease 1 (MASP1) and its relationship with milk performance traits and complement activity in Chinese Holstein cattle. Res. Vet. Sci. 124, 346–351. doi:10.1016/j.rvsc.2019.04.017
Keywords: genome-wide association study, candidate genes, milk production, dairy goat, significant SNPs
Citation: Huang Z, Tang Y, Zhou J, Xu D, Lin X, Cheng M, Wang J, Zhao Q, He J, Gao X, Zhao J and Li H (2025) Genome-wide association study on dairy goat milk production traits using three models. Front. Genet. 16:1650836. doi: 10.3389/fgene.2025.1650836
Received: 20 June 2025; Accepted: 05 August 2025;
Published: 22 August 2025.
Edited by:
Duy Ngoc Do, Dalhousie University, CanadaReviewed by:
Oshin Togla, National Dairy Research Institute, IndiaZuhal Gunduz, Adnan Menderes University, Türkiye
Copyright © 2025 Huang, Tang, Zhou, Xu, Lin, Cheng, Wang, Zhao, He, Gao, Zhao and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jinshan Zhao, MjAxNTAxMDA1QHFhdS5lZHUuY24=; Hegang Li, MjAxNzAxMDE4QHFhdS5lZHUuY24=
†These authors have contributed equally to this work