AUTHOR=Sun Bo , Guo Rui , Liu Zhi , Shi Xiaolei , Yang Qing , Shi Jiayao , Zhang Mengchen , Yang Chunyan , Zhao Shugang , Zhang Jie , He Jianhan , Zhang Jiaoping , Su Jianhui , Song Qijian , Yan Long TITLE=Genetic variation and marker−trait association affect the genomic selection prediction accuracy of soybean protein and oil content JOURNAL=Frontiers in Plant Science VOLUME=Volume 13 - 2022 YEAR=2022 URL=https://www.frontiersin.org/journals/plant-science/articles/10.3389/fpls.2022.1064623 DOI=10.3389/fpls.2022.1064623 ISSN=1664-462X ABSTRACT=Genomic selection (GS) is a potential breeding approach for soybean improvement. In this study, GS was performed on soybean protein and oil content using the Ridge Regression Best Linear Unbiased Predictor (RR-BLUP) based on 1,007 soybean accessions. The SoySNP50K SNP dataset of the accessions was obtained from the USDA-ARS, Beltsville, MD lab, and the protein and oil content of the accessions were obtained from GRIN. Our results showed that the prediction accuracy of oil content was higher than that of protein content. When the training population size was 100, the prediction accuracies for protein content and oil content were 0.60 and 0.79, respectively. The prediction accuracy increased with the size of the training population. Training populations with similar phenotype or with close genetic relationships to the prediction population exhibited better prediction accuracy. A greatest prediction accuracy for both protein and oil content was observed when approximately 3,000 markers with -log10(P) greater than 1 were included. This information will help improve GS efficiency and facilitate the application of GS.