AUTHOR=Wang Shiyi , Wei Yingjia , Liu Dengying , Zhang Xiangzhe , Wang Qishan , Pan Yuchun , Ma Peipei TITLE=Impact of different genomic relationship matrix construction methods on the accuracy of genomic prediction in different species JOURNAL=Frontiers in Genetics VOLUME=Volume 16 - 2025 YEAR=2025 URL=https://www.frontiersin.org/journals/genetics/articles/10.3389/fgene.2025.1576248 DOI=10.3389/fgene.2025.1576248 ISSN=1664-8021 ABSTRACT=ObjectiveGenomic best linear unbiased prediction (GBLUP) is a key method in genomic prediction, relying on the construction of a genomic relationship matrix (G-matrix). Although various methods for G-matrix construction have been proposed, the performance of these methods across different species has not been thoroughly compared.MethodsThis study systematically evaluated the performance of six genomic relationship matrix (G-matrix) construction methods in improving the prediction accuracy of GBLUP models across four species: pigs, bulls, wheat, and mice. The methodological framework included: (1) an initial unscaled matrix; (2) five scaled methods utilizing allele frequency centralization. The scaled methods comprised: (a) three variance-weighted approaches using allele frequencies fixed at 0.5 (G05), observed frequencies (GOF), or average minor allele frequencies (GMF); (b) two centralized methods with weighting by either the trace of the numerator matrix (GN) or reciprocals of each locus’s expected variance (GD).ResultsThe GD matrix demonstrated significant prediction accuracy improvements for pig traits. Conversely, most scaled G-matrices showed minimal effects on mice, wheat, and bull, even with underperforming unscaled baselines in prediction accuracy compared to the original unscaled matrix. The learning curve for bull data showed the choice of G-matrix had minimal impact on prediction accuracy when the reference population size and genetic marker density reached a certain threshold.DiscussionThe study concluded that the optimal G-matrix construction method varies across species, with population structure being a key factor. These findings highlight the importance of species-specific optimization in genomic prediction and suggest that the influence of G-matrix construction diminishes in large-scale, high-density genomic datasets.