High-Throughput Sequencing With the Preselection of Markers Is a Good Alternative to SNP Chips for Genomic Prediction in Broilers

Liu, Tianfei; Luo, Chenglong; Ma, Jie; Wang, Yan; Shu, Dingming; Su, Guosheng; Qu, Hao

doi:10.3389/fgene.2020.00108

ORIGINAL RESEARCH article

Front. Genet., 27 February 2020

Sec. Livestock Genomics

Volume 11 - 2020 | https://doi.org/10.3389/fgene.2020.00108

High-Throughput Sequencing With the Preselection of Markers Is a Good Alternative to SNP Chips for Genomic Prediction in Broilers

Tianfei Liu^1†

Chenglong Luo^1†

Jie Ma²

Yan Wang¹

Dingming Shu¹

Guosheng Su³

Hao Qu^1*

¹State Key Laboratory of Livestock and Poultry Breeding, Institute of Animal Science, Guangdong Academy of Agricultural Sciences, Guangzhou, China
²Guangdong Provincial Key Laboratory of Animal Breeding and Nutrition, Institute of Animal Science, Guangdong Academy of Agricultural Sciences, Guangzhou, China
³Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, Tjele, Denmark

The choice of a genetic marker genotyping platform is important for genomic prediction in livestock and poultry. High-throughput sequencing can produce more genetic markers, but the genotype quality is lower than that obtained with single nucleotide polymorphism (SNP) chips. The aim of this study was to compare the accuracy of genomic prediction between high-throughput sequencing and SNP chips in broilers. In this study, we developed a new SNP marker screening method, the pre-marker-selection (PMS) method, to determine whether an SNP marker can be used for genomic prediction. We also compared a method which preselection marker based results from genome-wide association studies (GWAS). With the two methods, we analysed body weight at the12^th week (BW) and feed conversion ratio (FCR) in a local broiler population. A total of 395 birds were selected from the F2 generation of the population, and 10X specific-locus amplified fragment sequencing (SLAF-seq) and the Illumina Chicken 60K SNP Beadchip were used for genotyping. The genomic best linear unbiased prediction method (GBLUP) was used to predict the genomic breeding values. The accuracy of genomic prediction was validated by the leave-one-out cross-validation method. Without SNP marker screening, the accuracies of the genomic estimated breeding value (GEBV) of BW and FCR were 0.509 and 0.249, respectively, when using SLAF-seq, and the accuracies were 0.516 and 0.232, respectively, when using the SNP chip. With SNP marker screening by the PMS method, the accuracies of GEBV of the two traits were 0.671 and 0.499, respectively, when using SLAF-seq, and 0.605 and 0.422, respectively, when using the SNP chip. Our SNP marker screening method led to an increase of prediction accuracy by 0.089–0.250. With SNP marker screening by the GWAS method, the accuracies of genomic prediction for the two traits were also improved, but the gains of accuracy were less than the gains with PMS method for all traits. The results from this study indicate that our PMS method can improve the accuracy of GEBV, and that more accurate genomic prediction can be obtained from an increased number of genomic markers when using high-throughput sequencing in local broiler populations. Due to its lower genotyping cost, high-throughput sequencing could be a good alternative to SNP chips for genomic prediction in breeding programmes of local broiler populations.

Introduction

Genomic prediction is a new generation breeding technology, and it has been widely implemented in animal and plant breeding (Meuwissen et al., 2001; Su et al., 2016; Wang et al., 2019). Genomic prediction, which uses information from markers throughout the whole genome, can achieve accurate early selection, especially for those traits that are difficult or costly to measure, such as sex-limited traits and slaughter traits. Many studies have shown the advantages of genomic prediction in poultry (Sitzenstock et al., 2013; Wolc et al., 2013; Wolc et al., 2015; Zhang et al., 2017; Liu et al., 2019). However, the high cost of genome marker genotyping limits the application of genomic prediction in poultry. With the development of low-cost and high-throughput sequencing, various marker genotyping platforms have provided alternatives to chip-based genotyping.

The choice of marker genotyping platform is a key factor affecting the accuracy of genomic estimated breeding values (GEBV) (Tan et al., 2017; Wang et al., 2019; Whalen et al., 2019). Single nucleotide polymorphism (SNP) chips are currently the most common choice in livestock and poultry (Wolc et al., 2011; Wang et al., 2013; Zhang et al., 2017). Recently, with the development of high-throughput sequencing technology, reduced-representation genome sequencing (RRGS) has been developed. RRGS uses restriction endonucleases to digest genomic DNA and then sequence the digested fragments, such as restriction site associated DNA (RAD) (Baird et al., 2008), genotyping-by-sequencing (GBS) (Elshire et al., 2011; Wang et al., 2017), and specific-locus amplified fragment sequencing (SLAF-seq) (Sun et al., 2013).

RRGS can produce a large number of genomic markers at a low price and thus be used as an alternative genotyping platform, but the genotype quality of the markers is lower than that obtained with SNP chips (Gorjanc et al., 2015). In animal breeding, the application of RRGS in genomic prediction has attracted great attention and led to many studies (Tan et al., 2017; Wang et al., 2019; Whalen et al., 2019). Gorjanc et al. (Gorjanc et al., 2015) used simulation data to show that the use of GBS for genotyping has great potential for genomic selection in livestock populations. Tan et al. (2017) used GBS for genotyping in a Duroc boar population, and the accuracy of genomic prediction for teat number was 0.435, but this study did not compare their results to those from SNP chips.

There are many local broiler poultry breeds with unique characteristics around the world, such as Chinese yellow feather chickens and French Label Rouge chickens. Compared to SNP chips, a sequencing approach can obtain genetic variations specific to local breeds and, thus, may achieve higher accuracy than SNP chips.

The accuracy of genomic prediction does not increase dramatically as the number of markers increases exponentially (Heidaritabar et al., 2016; Ni et al., 2017). Heidaritabar et al. (Heidaritabar et al., 2016) compared the differences of the accuracy of genomic prediction for the number of eggs between whole genome sequencing data and 60K gene chip data in a commercial white layers line. It was found that the accuracy of genomic prediction from the sequencing data was only increased by ~1%. Most of the important economic traits are quantitative traits that are controlled by multiple genes. We believe that not all SNP sites have an effect on traits, and the number of markers that affect traits is limited. Thus, sequencing data can increase the number of associated markers and introduces a large number of unimportant markers that are not related to the traits of interest, which interferes with the estimation of breeding values. Therefore, selecting effective markers for genomic prediction is expected to improve the prediction accuracy. How to select markers from high-throughput sequencing data for genomic prediction is an important issue.

The aim of this study was to: (1) propose a novel method to screen markers for genomic prediction, and (2) compare the accuracy of genomic prediction between high-throughput sequencing and SNP chips in broilers.

Materials and Methods

Data

The broiler population used in the current study was established by crossing the “High Quality chicken Line A” (HQLA) with the Huiyang Beard chicken (HB)(Sheng et al., 2013). The HQLA line has been under selection for growth traits and high meat quality tailored to Chinese tastes for more than 10 generations. The HB line is a Chinese indigenous breed with the characteristics of slow growth and high meat quality. In this study, 395 individuals (212♂+ 183♀) were selected from 8 half-sib families of 511 F2 birds, which originated from 20 F0 ancestors (6♂+ 14♀), and GBS with 10X specific-locus amplified fragment sequencing (SLAF-seq) (Sun et al., 2013) and the Illumina Chicken 60K SNP Beadchip (Groenen et al., 2011) were used for genotyping. Twenty-eight autosomes and a sex chromosome (chrZ) were extracted for the further analyses. To ensure the integrity of the SNP marker coverage in the sequencing data, only the markers covering more than 70% of the genotype were retained. Then, the marker data were edited by deleting markers with a minor allele frequency (MAF) lower than 0.01. After quality control, 121,132 SLAF-seq markers and 46,690 chip markers were obtained, respectively. Missing genotypes were ignored for preselection of markers and were replaced by the expected genotype score (i.e. 0 after centering) for Gmatrix program, and the average number of missing genotypes per individual were 6,233 and 232 for the SLAF-seq and chip data, respectively.

As shown in Table 1, two of the most important traits in broilers were analysed, namely body weight at the 12^th week (BW) and the feed conversion ratio (FCR). Body weight and feed intake were measured during the period from the beginning of the 7^th to the end of the 12^th week (42 d). FCR was calculated as the ratio of average daily feed intake to average daily gain, as described in Liu et al. (2017). Corrected phenotypic values (y_c), instead of original observations (y), were used as response variables to calculate the difference between phenotypes of the two homozygous genotypes and to predict breeding values using SNP markers. The reason for using y_c as response variables was to reduce noise by removing fixed effects which could be estimated much more accurately using a larger dataset, rather than using only genotyped animals with the two genotyping platforms. The fixed effects were estimated using linear least squares regression including sex (two levels) and batch (six levels), and y_c = y – sex effect – batch effect.

TABLE 1

Table 1 Number of observations (N), mean, standard deviation (SD), minimum value (Min), and maximum value (Max) for body weight (BW) at 12^th week and feed conversion ratio (FCR) during the period from 7^th to 12^th week.

Statistical Models

SNP Marker Screening Method

In this study, we provide a new method, the pre-marker-selection (PMS) method, to screen informative markers for genomic prediction based on the difference between phenotypes of the two homozygous genotypes at the marker with the data of the reference population, and the marks which have no homozygote or only have one homozygote will be deleted. The model is

d = | {\bar{x}}_{A_{1} A_{1}} - {\bar{x}}_{A_{2} A_{2}} |,

where d is absolute value of the difference between phenotypes of the two homozygous genotypes A₁A₁ and A₂A₂; ${\bar{x}}_{A_{1} A_{1}}$ and ${\bar{x}}_{A_{2} A_{2}}$ are the mean of corrected phenotypic values of the genotypes A₁A₁ and A₂A₂, respectively.

To generalize the PMS method, a transformation was applied, d′ = d/max (d). After transformation, d ′ was in the interval [0,1]. To explore the suitable cutoff value with PMS method, four different d ′ value cutoffs (0.001, 0.01, 0.05, 0.1) were set to prune markers of genome data, the peak value of genomic prediction accuracy were on the cutoff value of 0.05, the summary of this preselection is presented in Tables 2 and 3. In the current study, all SNP markers with d′ greater than 0.05 were retained in leave-one-out cross-validation method.

TABLE 2

Table 2 No. of single nucleotide polymorphism (SNP) markers after preselection using the premarker-selection (PMS) method with different cutoff values.

TABLE 3

Table 3 Accuracy of genomic prediction using the markers preselected by premarker-selection (PMS) method with different cutoff values.

In this study, genome-wide association studies (GWAS) using single marker regression was also used as a control method to preselect marker. The markers were selected based on the p values from GWAS results with the data of the reference population. The GWAS model is

y = 1 μ + x q + ε

where y is the vector of corrected phenotypic values of body weight at the 12^th week and FCR, μ is the intercept, q is the effect of the marker in the model, which is treated as a fixed regression of observation on genotype, x is a vector containing genotypes of the marker with 0 for A₁A₁, 1 for A₁A₂ and 2 for A₂A₂, and ε is a vector of random deviates, which is assumed that $ε ~ N (0, I σ_{ε}^{2})$

To explore the suitable cutoff value with GWAS method, three different cutoff p values (0.001, 0.01, 0.1) were set to prune markers of genome data, the peak value of genomic prediction were on the cutoff value of 0.01, the summary of this preselection is presented in Tables 4 and 5. In the current study, all SNP markers with p value less than 0.01 were retained in leave-one-out cross-validation method.

TABLE 4

Table 4 No. of single nucleotide polymorphism (SNP) markers with preselection using genome-wide association studies (GWAS) method for preevaluation the cutoff value.

TABLE 5

Table 5 Accuracy of genomic prediction using the markers preselected by genome-wide association studies (GWAS) method with different cutoff values.

Genomic Prediction Model

In the current study, breeding values were estimated using a genomic best linear unbiased prediction model (GBLUP). The GBLUP model is

y = 1 μ + Z g + e

where the definitions of y and μ are the same as above, g is the vector of genomic breeding values to be estimated, Z is the incidence matrix of g, and e is the vector of random residuals. It is assumed that $g ~ N (0, G σ_{g}^{2})$ and $e ~ N (0, I σ_{e}^{2})$ , where G is the additive genomic relationship matrix based on SNP markers (Vanraden, 2008), G=MM′/Σ2p_i(1−p_i), the coefficients of the i^th column in the M matrix are (0–2p_i) for genotype A₁A₁, (1–2p_i) for A₁A₂, and (2–2p_i) for A₂A₂, where p_i is the allele frequency of A₂ at locus i, and $σ_{g}^{2}$ is the genomic additive genetic variance.

Cross-Validation Method

To eliminate all problems associated with the random partitioning variation with n-fold cross-validation, the accuracy of genomic prediction was verified by leave-one-out cross-validation method (Allen, 1971). All 395 individuals were used for pre-evaluation of the cutoff values of the SNP marker screening methods. For each leave-one-out validation, the used to preselect markers with PMS and GWAS methods were in line with the reference population of leave-one-out validation, i.e., the leave-out individual was excluded from the data for preselecting marker. For example, for BW, the validation repeated 395 times, and each time one bird was masked in preselection the SNP markers and then the bird was also masked in the leave-one-out cross-validation. In this study, the accuracy of prediction was defined as the correlation between the predicted and corrected phenotypic value (y_c), the differences between the correlations were analysed by a bootstrapped paired t-test (Efron, 1979), the sample was repeat 1,000 times. Unbiasedness of the GEBV was measured using the regression of y_c on GEBV. The regression will not differ significantly from one if GEBV is an unbiased estimate of true breeding value (Su et al., 2012).

The G matrix was calculated using Gmatrix package (https://dmu.ghpc.au.dk/DMU). The variance and covariance components were estimated using restricted maximum likelihood (REML) based on the mixed() function of “mixedFunction.R”, and leave-one-out cross-validations were performed with “mixedCV.R”. The R codes “mixedFunction.R” and “mixedCV.R” can be downloaded from the website in (Xu, 2017). The bootstrapped paired t-tests were performed with sample() and t.test() functions in R (https://www.r-project.org/).

Results

Distribution of Genomic Markers in the SLAF-seq and SNP Chip

As shown in Figure 1, in the scenarios using all high-throughput sequencing markers (Sep-ALL) and all chip markers (Chip-ALL) without preselection, the uniformity of the number of genomic markers in MAF intervals was lower for SNPs obtained with SLAF-seq than for those obtained with the SNP chip. The coefficient of variation for the SNP markers in the MAF intervals, which was calculated as the ratio of the standard deviation to the mean, was 0.355 for SLAF-seq and 0.154 for the SNP chip.

FIGURE 1

Figure 1 The distribution of genomic markers in the different intervals of minor allele frequency (MAF) without preselection. Chip-ALL, the scenario using all high-throughput sequencing markers without preselection; Sep-ALL, the scenario using all chip markers without preselection.

All 395 individuals were used for assessing cutoff values in preselection of markers. As shown in Tables 2 and 3, with the PMS method, after unimportant markers were removed, the numbers of SNP markers obtained with the two genotyping platforms were both drastically reduced. The accuracies of genomic prediction with preselection of marker by all the cutoff values were higher than the accuracies using all markers. The peak values of genomic prediction accuracy were in the scenario with cutoff value of 0.1 for BW and 0.05 for FCR, respectively. As shown in Tables 4 and 5, with the GWAS method, after unimportant markers were removed, the number of SNP markers were greater reduced. The peak values of genomic prediction accuracy were in the scenario with cutoff value of 0.01 for the two traits.

Using SLAF-seq markers preselected by PMS method with best cutoff values, the number of SNPs was reduced from 121,132 to 13,423 for BW and 10,976 for FCR, respectively. As shown in Figures 2 and 3, the selected markers were mainly concentrated in the range of MAF from 0.05 to 0.25 for BW and FCR.

FIGURE 2

Figure 2 The distribution of genomic markers preselected by the premarker-selection (PMS) method for body weight in the different intervals of minor allele frequency (MAF). Chip-PMS, the scenario using high-throughput sequencing markers preselected by the PMS method; Sep-PMS, the scenario using chip markers preselected by the PMS method.

FIGURE 3

Figure 3 The distribution of genomic markers preselected by the premarker-selection (PMS) method for feed conversion ratio in the different intervals of minor allele frequency (MAF). Chip-PMS, the scenario using high-throughput sequencing markers preselected by the PMS method; Sep-PMS, the scenario using chip markers preselected by the PMS method.

Using the SNP chip markers preselected by with PMS method with the best cutoff values, the number of markers was reduced from 46,690 to 15,748 and 11,506 for BW and FCR at the peak value of genomic prediction, respectively. As shown in Figures 2 and 3, the selected markers were mainly concentrated in the range of MAF from 0.1 to 0.3 for BW and FCR.

Accuracy of Genomic Prediction Using the Two Genotyping Platforms

Without preselection of markers, the estimates of heritability for BW were 0.703 ± 0.087 using SLAF-seq and 0.634 ± 0.076 using the SNP chip. The estimates of heritability for FCR were 0.265 ± 0.099 and 0.266 ± 0.093, respectively.

As shown in Table 6, without marker screening, the genomic prediction model did not benefit from the increased number of genomic markers obtained by high-throughput sequencing, and the accuracy of genomic prediction using SLAF-seq was not always higher than the accuracy using the SNP chip. For BW, the accuracy of genomic prediction using SLAF-seq was lower than that using the SNP chip, and the difference between them was 0.007 (P < 0.05). For FCR, the accuracy of genomic prediction using sequencing was 0.017 higher than that using the gene chip (P < 0.05).

TABLE 6

Table 6 Accuracy of genomic prediction for body weight and feed conversion ratio using markers preselected with the best cutoff value.

With the PMS method, the accuracy of genomic prediction for the two traits was improved, regardless of whether SLAF-seq or the SNP chip was used. When using SLAF-seq, the accuracy of genomic prediction was increased by 0.162 for BW and 0.250 for FCR, and the gains were significant (P < 0.05). Using the SNP chip, the accuracy of genomic prediction was increased by 0.089 for BW and 0.190 for FCR and the gains were also significant (P < 0.05). In addition, the accuracy of genomic prediction using SLAF-seq was higher than that using the SNP-chip by 0.066 for BW and 0.077 for FCR. As shown in Table 7, when the PMS method was applied, the regression coefficients for the two genotyping platforms were similar, genomic predictions for FCR had a slightly larger bias than those for BW.

TABLE 7

Table 7 Unbiasedness of genomic prediction for body weight and feed conversion ratio using markers preselected with the best cutoff value.

As shown in Table 6, with marker screening by the GWAS method, the accuracy of genomic prediction for the two traits was also improved, but the gains of accuracy were less than the gains with PMS method for all traits. When using SLAF-seq, the accuracy of genomic prediction was increased by 0.098 for BW and 0.068 for FCR, and the gains were significant (P < 0.05). When using the SNP chip, the accuracy of genomic prediction was increased by 0.083 for BW and 0.020 for FCR and the gains were also significant (P < 0.05).

Discussion

In this study, the genomic prediction of growth and feed efficiency traits in a small broiler chicken population was compared between high-throughput sequencing and SNP chip platforms. The results showed that when markers were not screened, the use of high-throughput sequencing data did not result in a higher accuracy than the use of chip data. Our method for marker screening, improved the accuracy of genomic prediction for both genotyping platforms, and high-throughput sequencing achieved higher accuracy for both traits.

With the rapid decline in the price of high-throughput sequencing, its application in genomic selection has received a high level of attention (Meuwissen and Goddard, 2010; Gorjanc et al., 2015; Iheshiulor et al., 2016). Meuwissen and Goddard (2010) used simulation data to study the accuracy of genomic selection based on high-throughput sequencing. The results showed that when using sequencing data, the accuracies of prediction of genetic values were 40% increased relative to the use of dense 30K SNP chips. Iheshiulor et al. (2016) showed that the accuracy of genomic selection using sequencing data can be increased by up to 92% in a simulation study. However, when using real data, researchers can hardly achieve such attractive results (Heidaritabar et al., 2016; Ni et al., 2017; Elbasyoni et al., 2018). In plant breeding, Elbasyoni et al. (2018) studied four traits in a winter wheat population and showed that high-throughput sequencing could achieve only comparable or even better accuracy than an SNP chip. In a commercial brown layer line, Ni et al. (2017) compared genomic predictions for three egg-laying traits using genome-wide sequencing and a 336K SNP chip and reported that little or no benefit was gained when using all sequencing SNPs for genomic prediction.

To improve the accuracy of genomic prediction, some previous studies tried to add causative mutations to chip data (Meuwissen and Goddard, 2010; Brøndum et al., 2015; Teissier et al., 2018). Many studies investigated the effect on the reliability of genomic prediction when a small number of important variants obtained from single marker GWAS or SNP annotation based on whole genome sequence data were added to the regular 54K SNP chip data (Van Binsbergen et al., 2015; Ma et al., 2019). In the current study, the SNP markers obtained by sequencing were twice as abundant as those of the SNP chip, but the prediction accuracy was not increased when all of the markers were used for genomic prediction. One of possible reasons is that the sequencing data increase the number of associated markers but also contribute a large number of unimportant markers that are not related to the traits of interest, which may interfere with the estimation of breeding values. Therefore, we proposed a new method to select effective markers; using our method for screening markers, the sequencing data had higher accuracy of genomic prediction than the SNP chip data for all traits.

The SNP markers farther from the causative mutations may negatively affect the accuracy of genomic prediction. As mentioned above, many previous studies (Zhang et al., 2014; Zhang et al., 2015; Van Den Berg et al., 2016; Ye et al., 2019) have shown that preselected markers from genomic data can improve genomic prediction. However, Ye et al. also performed genomic prediction using markers preselected from imputed whole-genome sequencing (WGS) data based on the p value of GWAS as a control method, and the results showed that using preselected variants resulted in almost no increase for most traits and even increased the bias of the predictions (Ye et al., 2019). The authors argued that one of the possible reasons could be the difficulty of detecting causal variants based on GWAS due to the large number of variants and the high LD between variants. In our study, we provided a method to select SNP markers based on the difference between phenotypes associated with two allelic homozygous genotypes, GWAS method to select markers was also performed as a comparison. Our results showed that when screening markers with the two methods, the accuracies of genomic prediction for the two traits were improved, and the gains of accuracy with PMS method were larger than the gains with GWAS method for all traits. Whether the PMS method can improve the accuracy of genomic prediction in different populations needs further verification.

The cost of genotyping is an important factor limiting its application in poultry breeding. With the continuous development of sequencing technology, the reduced price of sequencing may have advantages in regard to the cost of genotyping. De Donato et al. showed that the cost of sequencing data for the same number of markers is approximately 1/3 that of the SNP chip (De Donato et al., 2013). Among the major livestock and poultry breeds, the chicken genome is more advantageous with regard to the cost of sequencing. The genome of the chicken is only 1043.19 Mb (https://www.ncbi.nlm.nih.gov/genome), which is less than 1/2 the size of the genomes of cattle (2716 Mb) and pigs (2548 Mb), which means that chickens can achieve the same density of genome coverage at a lower cost. In addition, sequencing technology can flexibly select the depth of sequencing for a single sample, which can further reduce the cost of genotyping.

Local breeds usually have characteristic traits that are preferred by local people, allow the birds to adapt well to the local environment, and usually exist in the form of small populations with a small effective population size. Druet et al. showed that the accuracy of genomic prediction depends largely on the coverage of key genes affecting target traits by genotyping platforms (Druet et al., 2014). However, the markers of conventional SNP chips may not cover all of the genetic variation of specific traits in the local breeds, which may limit the efficiency of genomic prediction for these traits. High-throughput sequencing, such as SLAF-seq, can improve the accuracy of genomic prediction by optimizing the selection of suitable restriction enzymes to cover large fragments of specific mutation regions for local breeds and to select the sequencing depth of each individual, which has great potential for genomic prediction in local breeds breeding.

Conclusions

The results from this study indicate that with our PMS marker screening method, the accuracy of genomic prediction obtained using high-throughput sequencing, such as SLAF-seq, is higher than the accuracy obtained using SNP chips in local broiler populations. With accurate prediction and a low cost, the PMS method is a promising method for the use of high-throughput sequencing data for genomic prediction in breeding programmes of local broiler populations.

Data Availability Statement

Genotype data and trait data for the chickens used in this study are not publicly available, but are available from the corresponding author upon reasonable request.

Ethics Statement

This study was approved by the Animal Care Committee of the Institute of Animal Science, Guangdong Academy of Agricultural Sciences (Guangzhou, People’s Republic of China)(No. GAAS-IAS2009-73).

Author Contributions

TL, CL, and HQ conceived and designed the experiments. TL, CL, JM, YW, DS, GS, and HQ discussed and interpreted the results. TL drafted the manuscript. CL, JM, YW, DS, GS, and HQ revised the manuscript. All authors read and approved the final manuscript.

Funding

This work was supported by the Earmarked Fund for Modern Agro-Industry Technology Research System (CARS-41), the Science and Technology Program of Guangdong (2017B020201006, 2017B020206003) and Key-Area Research and Development Program of Guangdong Province (2018B020203001). The funders had no role in study design, data collection and analysis.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We would like to thank Guangdong Public Laboratory of Animal Breeding and Nutrition.

References

Allen, D. M. (1971). Mean square error of prediction as a criterion for selecting variables. Technometrics 13, 469–475. doi: 10.1080/00401706.1971.10488811

CrossRef Full Text | Google Scholar

Baird, N. A., Etter, P. D., Atwood, T. S., Currey, M. C., Shiver, A. L., Lewis, Z. A., et al. (2008). Rapid SNP discovery and genetic mapping using sequenced RAD markers. PloS One 3, e3376. doi: 10.1371/journal.pone.0003376

PubMed Abstract | CrossRef Full Text | Google Scholar

Brøndum, R. F., Su, G., Janss, L., Sahana, G., Guldbrandtsen, B., Boichard, D., et al. (2015). Quantitative trait loci markers derived from whole genome sequence data increases the reliability of genomic prediction. J. Dairy Sci. 98, 4107–4116. doi: 10.3168/jds.2014-9005

PubMed Abstract | CrossRef Full Text | Google Scholar

De Donato, M., Peters, S. O., Mitchell, S. E., Hussain, T., Imumorin, I. G. (2013). Genotyping-by-sequencing (GBS): a novel, efficient and cost-effective genotyping method for cattle using next-generation sequencing. PloS One 8, e62137–e62137. doi: 10.1371/journal.pone.0062137

PubMed Abstract | CrossRef Full Text | Google Scholar

Druet, T., Macleod, I. M., Hayes, B. J. (2014). Toward genomic prediction from whole-genome sequence data: impact of sequencing design on genotype imputation and accuracy of predictions. Heredity 112, 39–47. doi: 10.1038/hdy.2013.13

PubMed Abstract | CrossRef Full Text | Google Scholar

Efron, B. (1979). Bootstrap methods: another look at the jackknife. Ann. Statist. 7, 1–26. doi: 10.1214/aos/1176344552

CrossRef Full Text | Google Scholar

Elbasyoni, I. S., Lorenz, A. J., Guttieri, M., Frels, K., Baenziger, P. S., Poland, J., et al. (2018). A comparison between genotyping-by-sequencing and array-based scoring of SNPs for genomic prediction accuracy in winter wheat. Plant Sci. 270, 123–130. doi: 10.1016/j.plantsci.2018.02.019

PubMed Abstract | CrossRef Full Text | Google Scholar

Elshire, R. J., Glaubitz, J. C., Sun, Q., Poland, J. A., Kawamoto, K., Buckler, E. S., et al. (2011). A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PloS One 6, e19379. doi: 10.1371/journal.pone.0019379

PubMed Abstract | CrossRef Full Text | Google Scholar

Gorjanc, G., Cleveland, M., Houston, R., Hickey, J. (2015). Potential of genotyping-by-sequencing for genomic selection in livestock populations. Genet. Selection Evol. 47, 12. doi: 10.1186/s12711-015-0102-z

CrossRef Full Text | Google Scholar

Groenen, M., Megens, H.-J., Zare, Y., Warren, W., Hillier, L., Crooijmans, R., et al. (2011). The development and characterization of a 60K SNP chip for chicken. BMC Genomics 12, 274. doi: 10.1186/1471-2164-12-274

PubMed Abstract | CrossRef Full Text | Google Scholar

Heidaritabar, M., Calus, M. P. L., Megens, H. J., Vereijken, A., Groenen, M., Bastiaansen, J. W. M. (2016). Accuracy of genomic prediction using imputed whole-genome sequence data in white layers. J. Anim. Breed. Genet. 133, 167–179. doi: 10.1111/jbg.12199

PubMed Abstract | CrossRef Full Text | Google Scholar

Iheshiulor, O. O. M., Woolliams, J. A., Yu, X., Wellmann, R., Meuwissen, T. H. E. (2016). Within- and across-breed genomic prediction using whole-genome sequence and single nucleotide polymorphism panels. Genet. Selection Evol. 48, 1–15. doi: 10.1186/s12711-016-0193-1

CrossRef Full Text | Google Scholar

Liu, T., Luo, C., Wang, J., Ma, J., Shu, D., Lund, M. S., et al. (2017). Assessment of the genomic prediction accuracy for feed efficiency traits in meat-type chickens. PloS One 12, e0173620. doi: 10.1371/journal.pone.0173620

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, R., Zheng, M., Wang, J., Cui, H., Li, Q., Liu, J., et al. (2019). Effects of genomic selection for intramuscular fat content in breast muscle in Chinese local chickens. Anim. Genet. 50, 87–91. doi: 10.1111/age.12744

PubMed Abstract | CrossRef Full Text | Google Scholar

Ma, P., Lund, M. S., Aamand, G. P., Su, G. (2019). Use of a Bayesian model including QTL markers increases prediction reliability when test animals are distant from the reference population. J. Dairy Sci. 102, 7237–7247. doi: 10.3168/jds.2018-15815

PubMed Abstract | CrossRef Full Text | Google Scholar

Meuwissen, T., Goddard, M. (2010). Accurate prediction of genetic values for complex traits by whole-genome resequencing. Genetics 185, 623–631. doi: 10.1534/genetics.110.116590

PubMed Abstract | CrossRef Full Text | Google Scholar

Meuwissen, T. H. E., Hayes, B. J., Goddard, M. E. (2001). Prediction of total genetic value using genome-wide dense marker maps. Genetics 157, 1819–1829.

PubMed Abstract | Google Scholar

Ni, G., Cavero, D., Fangmann, A., Erbe, M., Simianer, H. (2017). Whole-genome sequence-based genomic prediction in laying chickens with different genomic relationship matrices to account for genetic architecture. Genet. Selection Evol. 49, 8. doi: 10.1186/s12711-016-0277-y

CrossRef Full Text | Google Scholar

Sheng, Z., Pettersson, M., Hu, X., Luo, C., Qu, H., Shu, D., et al. (2013). Genetic dissection of growth traits in a Chinese indigenous x commercial broiler chicken cross. BMC Genomics 14, 151. doi: 10.1186/1471-2164-14-151

PubMed Abstract | CrossRef Full Text | Google Scholar

Sitzenstock, F., Ytournel, F., Sharifi, A., Cavero, D., Taubert, H., Preisinger, R., et al. (2013). Efficiency of genomic selection in an established commercial layer breeding program. Genet. Selection Evol. 45, 29. doi: 10.1186/1297-9686-45-29

CrossRef Full Text | Google Scholar

Su, G., Brondum, R. F., Ma, P., Guldbrandtsen, B., Aamand, G. R., Lund, M. S. (2012). Comparison of genomic predictions using medium-density (similar to 54,000) and high-density (similar to 777,000) single nucleotide polymorphism marker panels in nordic holstein and red dairy cattle populations. J. Dairy Sci. 95, 4657–4665. doi: 10.3168/jds.2012-5379

PubMed Abstract | CrossRef Full Text | Google Scholar

Su, G., Ma, P., Nielsen, U. S., Aamand, G. P., Wiggans, G., Guldbrandtsen, B., et al. (2016). Sharing reference data and including cows in the reference population improve genomic predictions in Danish Jersey. Animal 10, 1067–1075. doi: 10.1017/S1751731115001792

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, X., Liu, D., Zhang, X., Li, W., Liu, H., Hong, W., et al. (2013). SLAF-seq: an efficient method of large-scale de novo snp discovery and genotyping using high-throughput sequencing. PloS One 8, e58700. doi: 10.1371/journal.pone.0058700

PubMed Abstract | CrossRef Full Text | Google Scholar

Tan, C., Wu, Z., Ren, J., Huang, Z., Liu, D., He, X., et al. (2017). Genome-wide association study and accuracy of genomic prediction for teat number in Duroc pigs using genotyping-by-sequencing. Genet. Selection Evol. 49, 35. doi: 10.1186/s12711-017-0311-8

CrossRef Full Text | Google Scholar

Teissier, M., Sanchez, M. P., Boussaha, M., Barbat, A., Hoze, C., Robert-Granie, C., et al. (2018). Use of meta-analyses and joint analyses to select variants in whole genome sequences for genomic evaluation: an application in milk production of French dairy cattle breeds. J. Dairy Sci. 101, 3126–3139. doi: 10.3168/jds.2017-13587

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Binsbergen, R., Calus, M., Bink, M., Van Eeuwijk, F., Schrooten, C., Veerkamp, R. (2015). Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle. Genet. Selection Evol. 47, 71. doi: 10.1186/s12711-015-0149-x

CrossRef Full Text | Google Scholar

Van Den Berg, I., Boichard, D., Guldbrandtsen, B., Lund, M. S. (2016). Using sequence variants in linkage disequilibrium with causative mutations to improve across-breed prediction in dairy cattle: a simulation study. G3: Genes|Genomes|Genetics 6, 2553–2561. doi: 10.1534/g3.116.027730

CrossRef Full Text | Google Scholar

Vanraden, P. M. (2008). Efficient methods to compute genomic predictions. J. Dairy Sci. 91, 4414–4423. doi: 10.3168/jds.2007-0980

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, C., Habier, D., Peiris, B. L., Wolc, A., Kranis, A., Watson, K. A., et al. (2013). Accuracy of genomic prediction using an evenly spaced, low-density single nucleotide polymorphism panel in broiler chickens. Poultry Sci. 92, 1712–1723. doi: 10.3382/ps.2012-02941

CrossRef Full Text | Google Scholar

Wang, Y., Cao, X., Zhao, Y., Fei, J., Hu, X., Li, N. (2017). Optimized double-digest genotyping by sequencing (ddGBS) method with high-density SNP markers and high genotyping accuracy for chickens. PloS One 12, e0179073. doi: 10.1371/journal.pone.0179073

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, X., Lund, M. S., Ma, P., Janss, L., Kadarmideen, H. N., Su, G. (2019). Improving genomic predictions by correction of genotypes from genotyping by sequencing in livestock populations. J. Anim. Sci. Biotechnol. 10, 8. doi: 10.1186/s40104-019-0315-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Whalen, A., Gorjanc, G., Hickey, J. M. (2019). Parentage assignment with genotyping-by-sequencing data. J. Anim. Breed. Genet. 136, 102–112. doi: 10.1111/jbg.12370

PubMed Abstract | CrossRef Full Text | Google Scholar

Wolc, A., Stricker, C., Arango, J., Settar, P., Fulton, J., O'sullivan, N., et al. (2011). Breeding value prediction for production traits in layer chickens using pedigree or genomic relationships in a reduced animal model. Genet. Selection Evol. 43, 5. doi: 10.1186/1297-9686-43-5

CrossRef Full Text | Google Scholar

Wolc, A., Arango, J., Settar, P., Fulton, J. E., O'sullivan, N. P., Preisinger, R., et al. (2013). Analysis of egg production in layer chickens using a random regression model with genomic relationships. Poultry Sci. 92, 1486–1491. doi: 10.3382/ps.2012-02882

CrossRef Full Text | Google Scholar

Wolc, A., Zhao, H., Arango, J., Settar, P., Fulton, J., O'sullivan, N., et al. (2015). Response and inbreeding from a genomic selection experiment in layer chickens. Genet. Selection Evol. 47, 59. doi: 10.1186/s12711-015-0133-5

CrossRef Full Text | Google Scholar

Xu, S. (2017). Predicted residual error sum of squares of mixed models – an application to genomic prediction. G3: Genes|Genomes|Genetics 7, 895–909. doi: 10.1534/g3.116.038059

CrossRef Full Text | Google Scholar

Ye, S., Gao, N., Zheng, R., Chen, Z., Teng, J., Yuan, X., et al. (2019). Strategies for obtaining and pruning imputed whole-genome sequence data for genomic prediction. Front. Genet. 10, 673. doi: 10.3389/fgene.2019.00673

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Z., Ober, U., Erbe, M., Zhang, H., Gao, N., He, J., et al. (2014). Improving the accuracy of whole genome prediction for complex traits using the results of genome wide association studies. PloS One 9, e93017. doi: 10.1371/journal.pone.0093017

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Z., Erbe, M., He, J., Ober, U., Gao, N., Zhang, H., et al. (2015). Accuracy of whole-genome prediction using a genetic architecture-enhanced variance-covariance matrix. G3: Genes|Genomes|Genetics 5, 615–627. doi: 10.1534/g3.114.016261

CrossRef Full Text | Google Scholar

Zhang, Z., Xu, Z.-Q., Luo, Y.-Y., Zhang, H.-B., Gao, N., He, J.-L., et al. (2017). Whole genomic prediction of growth and carcass traits in a Chinese quality chicken population. J. Anim. Sci. 95, 72–80. doi: 10.2527/jas2016.0823

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: genomic prediction, high-throughput sequencing, marker screening method, feed conversion ratio, chickens

Citation: Liu T, Luo C, Ma J, Wang Y, Shu D, Su G and Qu H (2020) High-Throughput Sequencing With the Preselection of Markers Is a Good Alternative to SNP Chips for Genomic Prediction in Broilers. Front. Genet. 11:108. doi: 10.3389/fgene.2020.00108

Received: 12 September 2019; Accepted: 30 January 2020;
Published: 27 February 2020.

Edited by:

Peng Xu, Xiamen University, China

Reviewed by:

Zhe Zhang, South China Agricultural University, China
Ismo Strandén, Natural Resources Institute Finland (Luke), Finland

Copyright © 2020 Liu, Luo, Ma, Wang, Shu, Su and Qu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Hao Qu, cWh3MDNAMTYzLmNvbQ==

^†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.