ORIGINAL RESEARCH article
Front. Plant Sci.
Sec. Plant Systematics and Evolution
Volume 16 - 2025 | doi: 10.3389/fpls.2025.1675815
Optimizing core collections for genetic studies: a worldwide flax germplasm case study
Provisionally accepted- 1ARVALIS Institut du Végétal, 110 chemin de la côte vieille, 31450 Baziège, France, Baziège, France
- 2Arvalis Station Experimentale de Boigneville, Boigneville, France
- 3ARVALIS Institut du Végétal, 45 voie Romaine - Ouzouer-le-Marché, 41240 Beauce La Romaine, France, Ouzouer-le-Marché, France
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Core collections provide a strategic approach to reducing population size while retaining genetic diversity and allele frequencies, serving as key resources for genetic research. Although various sampling and selection strategies have been proposed, most of them focused on either diversity or representativeness, rarely both, and none fully integrated these with QTL detection optimization. The first part of our study focuses on a genetic diversity analysis of a flax germplasm (Linum usitatissimum L.) maintained by the Arvalis Institute, a prerequisite for the development of core collections. This germplasm is a worldwide flax collection comprising 1,593 accessions originating from 42 countries, encompassing all major flax-growing regions. It includes both spring-and winter-type lines, as well as oilseed and fiber types. The results revealed a pronounced genetic structure within the germplasm with six clusters, strongly influenced by cultivation purposes (fiber vs. oilseed flax), growth cycle (winter vs. spring), and then geographic origin. Overall genetic diversity was moderate (Hₑ = 0.22), with oilseed flax clusters displaying greater diversity (He from 0.21 to 0.27) than fiber flax (He < 0.17) . In a second step we evaluated distinct strategies for core-collection development, including approaches -originally developed for core collection construction and others-developed for optimizing genomic‐selection calibration panels. We introduced an approach based on QTL detection performance via extensive simulations of QTLs distributed across the genome. We observed a fundamental trade-off between maximizing diversity and ensuring representativeness in core collection design. Diversity-oriented approaches may overemphasize rare or outlier genotypes, compromising representativeness, while representativeness-focused strategies leaded to overlooking rare alleles, thus limiting diversity. In our results we have found that particular combinations of selection criteria achieved a favorable balance between genetic diversity and representativeness, while concurrently maintaining a robust capacity to capture QTL signals across the genome. Finally, the approach using the Shannon index combined with the allelic coverage led to optimal core collection design adapted for GWAS applications in a structured population; and was used to select a core collection of 409 accessions useful for further genetic studies. These results provide knowledge for the development of optimized core collections tailored to GWAS applications.
Keywords: core collection, Optimization criteria, quantitative trait loci (QTLs), genetic diversity, Flax (linum usitatissimum l.)
Received: 29 Jul 2025; Accepted: 29 Sep 2025.
Copyright: © 2025 GOUY, BOGARD, MOHAMADI and Demenou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Boris Brice Demenou, b.demenou@arvalis.fr
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.