Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Plant Sci.

Sec. Functional and Applied Plant Genomics

Genome-Wide Association Study and Fine-Mapping Identify a Major Quantitative Trait Locus Controlling Hundred-Seed Weight in Soybean

Provisionally accepted
Chunlei  ZhangChunlei Zhang1Huilong  HongHuilong Hong2Rongqiang  YuanRongqiang Yuan1Shiyao  ZhangShiyao Zhang1Tianjiao  GaoTianjiao Gao1Shuping  YanShuping Yan1Sobhi  F LamlomSobhi F Lamlom1,3Honglei  RenHonglei Ren1*Liu  ZhangxiongLiu Zhangxiong2*Jiajun  WangJiajun Wang1*
  • 1Soybean Research Institute, Heilongjiang Academy of Agricultural Sciences, Harbin, China
  • 2Chinese Academy of Agricultural Sciences Institute of Crop Sciences, Beijing, China
  • 3Alexandria University Faculty of Agriculture Sababasha, Alexandria, Egypt

The final, formatted version of the article will be published soon.

Background: Hundred-seed weight (HSW) is a critical yield component in soybean that directly influences productivity and seed quality. Despite its agronomic importance, the genetic architecture underlying natural variation in seed weight remains incompletely understood. Methods: We conducted a comprehensive genome-wide association study (GWAS) using 554 globally diverse soybean accessions, comprising 453 Chinese varieties (81.8%) and 101 international accessions (18.2%) from 15 countries. Accessions were evaluated across three consecutive years (2022-2024) and genotyped with 78,050 high-quality single-nucleotide polymorphisms (SNPs). Results: Mixed linear model (MLM) analysis revealed a major QTL on Chr.20 that consistently explained the largest proportion of phenotypic variation across all environments. This QTL demonstrated exceptional temporal stability, maintaining genome-wide significance with peak - log₁₀(P) values of 13.4, 12.1, and 10.2 across the three evaluation years. Fine mapping narrowed the critical interval to 493.69 kb containing 25 annotated genes. The lead SNP within Glyma.20G223200 explained 8-12% of phenotypic variance, while multi-SNP models incorporating five high-priority candidates cumulatively explained 14-18% of variance. Expression analysis of candidate genes revealed differential patterns between large-seeded and small-seeded varieties during seed development, with up to 32-fold expression differences. Conclusions: The environmentally stable Chr. 20 QTL provides immediate opportunities for marker-assisted selection (MAS) in soybean breeding programs. Genomic prediction modeling suggests 35% greater genetic gain compared to phenotypic selection alone, supporting broad applicability for global soybean improvement programs.

Keywords: Soybean, Hundred-seed weight, Genome-Wide Association Study, Quantitative Trait Loci, Fine-mapping, marker-assisted selection, seed development

Received: 30 Sep 2025; Accepted: 14 Nov 2025.

Copyright: © 2025 Zhang, Hong, Yuan, Zhang, Gao, Yan, Lamlom, Ren, Zhangxiong and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Honglei Ren, renhonglei2022@163.com
Liu Zhangxiong, liuzhangxiong@caas.cn
Jiajun Wang, junjiawang@163.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.