ORIGINAL RESEARCH article

Front. Plant Sci., 18 August 2022

Sec. Plant Breeding

Volume 13 - 2022 | https://doi.org/10.3389/fpls.2022.968466

Genome wide association study to detect genetic regions related to isoflavone content in a mutant soybean population derived from radiation breeding

  • 1. Advanced Radiation Technology Institute, Korea Atomic Energy Research Institute, Jeongeup, South Korea

  • 2. Division of Plant Biotechnology, Chonnam National University, Gwangju, South Korea

  • 3. Department of Horticulture, College of Industrial Sciences, Kongju National University, Yesan, South Korea

  • 4. Department of Horticultural Biotechnology, Institute of Life Sciences & Resources, Kyung Hee University, Yongin, South Korea

Article metrics

View details

11

Citations

4,7k

Views

1,8k

Downloads

Abstract

Isoflavones are major secondary metabolites that are exclusively produced by legumes, including soybean. Soy isoflavones play important roles in human health as well as in the plant defense system. The isoflavone content is influenced by minor-effect quantitative trait loci, which interact with polygenetic and environmental factors. It has been difficult to clarify the regulation of isoflavone biosynthesis because of its complex heritability and the influence of external factors. Here, using a genotype-by-sequencing-based genome-wide association mapping study, 189 mutant soybean genotypes (the mutant diversity pool, MDP) were genotyped on the basis of 25,646 high-quality single nucleotide polymorphisms (SNPs) with minor allele frequency of >0.01 except for missing data. All the accessions were phenotyped by determining the contents of 12 isoflavones in the soybean seeds in two consecutive years (2020 and 2021). Then, quantitative trait nucleotides (QTNs) related to isoflavone contents were identified and validated using multi-locus GWAS models. A total of 112 and 46 QTNs related to isoflavone contents were detected by multiple MLM-based models in 2020 and 2021, respectively. Of these, 12 and 5 QTNs were related to more than two types of isoflavones in 2020 and 2021, respectively. Forty-four QTNs were detected within the 441-Kb physical interval surrounding Gm05:38940662. Of them, four QTNs (Gm05:38936166, Gm05:38936167, Gm05:38940662, and Gm05:38940717) were located at Glyma.05g206900 and Glyma.05g207000, which encode glutathione S-transferase THETA 1 (GmGSTT1), as determined from previous quantitative trait loci annotations and the literature. We detected substantial differences in the transcript levels of GmGSTT1 and two other core genes (IFS1 and IFS2) in the isoflavone biosynthetic pathway between the original cultivar and its mutant. The results of this study provide new information about the factors affecting isoflavone contents in soybean seeds and will be useful for breeding soybean lines with high and stable concentrations of isoflavones.

Introduction

Soybean [Glycine max (L.) Merr] is a major source of protein and oil as well as secondary metabolites that are beneficial for human health (Liang et al., 2010). Soy isoflavones primarily consist of daidzein, glycitein, genistein, daidzin, glycitin, genistin, 6-o-malonyldaidzin, 6-o-malonylglycitin, 6-o-malonylgenistin, 6-o-acetyldaidzin, 6-o-acetylglycitin, and 6-o-acetylgenistin in seeds, and coumestrol in leaves and roots (Kudou et al., 1991; Yuk et al., 2011). Soy isoflavones are synthesized and conjugated through a branch of the phenylpropanoid pathway, and are stored in the central vacuole of cells (Dastmalchi and Dhaubhadel, 2014). Phytochemicals are metabolites that are naturally derived from plants. Some of them, such as phenolic compounds and anthocyanins, have antioxidant and antiviral properties (Messina, 1999; Lee et al., 2005). Soy isoflavones have diverse functions. They interact with rhizobia to promote nodulation in the roots, root nodule symbiosis for nitrogen fixation, and are involved in resistance to pathogens and stresses (Graham et al., 1990; Sreevidya et al., 2006; Li et al., 2013). Recent studies have shown that consumption of products and medicines derived from soybean is beneficial for human health (Kang et al., 2002; Collison, 2008; Lobato et al., 2012). Soy isoflavones, as a class of phytoestrogens, have antitumor activity and the ability to ameliorate the symptoms of cardiovascular disease and climacteric syndrome, as well as relieving post-menopausal symptoms (Farina et al., 2006; Ma et al., 2008).

The detailed mechanism of the regulation of isoflavone biosynthesis is still unclear because complex quantitative traits such as isoflavone content, tocopherol content, and saponin content are interrelated, and are affected by genotype, year, and location (Rupasinghe et al., 2003; Gutierrez-Gonzalez et al., 2009; Shaw et al., 2016). Both additive and epistatic effects are affected by genotype × environment interactions, and because many alleles make small contributions to overall phenotypic variations, it is difficult to identify quantitative trait loci (QTLs) related to isoflavone content (Gutierrez-Gonzalez et al., 2011; Fan et al., 2015). To address these problems, high-density genotyping platforms and advanced next-generation sequencing (NGS) technologies have been developed and applied to detect genetic polymorphisms related to traits of interest in many crops (Thomson, 2014; Rasheed et al., 2017). Genotype-by-sequencing (GBS) can be applied in situations where there are multiple samples with high genetic diversity and large genomes. Compared with whole genome genotyping, GBS is more cost-effective, simpler, highly reproducible, and can partially capture important parts of the genome that are otherwise inaccessible. This approach has been used to discover huge numbers of single nucleotide polymorphisms (SNPs). It is anticipated that high-density SNP genotyping will be compatible with genome-wide association study (GWAS), marker-assisted selection, and genomic selection for crop improvement (Elshire et al., 2011; He et al., 2014).

To map QTLs associated with isoflavone content, previous studies conducted QTL mapping using an F2 population and recombinant inbred lines (RILs) as a bi-parental population. Primomo et al. (2005) identified QTLs and epistatic interactions underlying isoflavone contents in soybean seed using RILs with 99 polymorphic simple sequence repeat (SSR) markers and detected 17 QTLs spanning multiple loci. Cai et al. (2018) detected 48 additive QTLs underlying variations in seed isoflavone content using SNP and SSR markers. Using specific-locus amplified fragment sequencing (SLAF-seq) methods, Pei et al. (2018) constructed a high-density genetic map using 3,541 SLAF markers and data from multiple environments and detected 24 stable QTLs for isoflavone components. Although linkage mapping has revealed some core QTLs underlying qualitative and quantitative traits such as agronomic traits (Brummer et al., 1997; Chen et al., 2007), stress tolerance (Concibido et al., 2004; Lee et al., 2004), and some functional components (Warrington et al., 2015; Li et al., 2017), the low crossover rates of the two parents’ alleles in the F2 and RIL populations leads to limited segregation and recombination frequency, resulting in genetic distortion, low allelic diversity, and low mapping resolution (Zhang et al., 2010; Xu et al., 2017).

As an alternative to linkage mapping, GWAS has some advantages as a tool because of its high-density resolution, its suitability for analyses of large sample and population sizes, and the large amount of markers. In addition, it involves powerful statistical methods that analyze the association mapping (AM) between traits of interests and the amount of SNPs markers based on natural populations and the structure of linkage disequilibrium (LD) (Korte and Farlow, 2013; Ibrahim et al., 2020). In previous AM studies on soy isoflavones, Chu et al. (2017) detected a gene encoding an R2R3-type MYB transcription factor (GmMYB29) that affected isoflavone content via GWAS and 207,608 SNPs using the NJAU 355K SoySNP array. This transcription factor binds to the promoters of two major genes in the isoflavone biosynthetic pathway, IFS2 and CHS8, and activates their expression. Meng et al. (2016) detected 44 QTLs on chromosome 16 related to seed isoflavone content using restricted two-stage multi-locus GWAS with 55,340 SNPs derived from restriction-site-association DNA sequencing. Recently, Wu et al. (2020) identified 87 SNPs and 37 QTLs underlying isoflavone content using both GWAS and linkage analysis based on 28,926 SNPs and 4412 bins, respectively, and one major locus (qISO8-1) of GmMPK1 was significantly co-localized on chromosome 8.

The spontaneous mutation rate is as low as one mutation per 100 million to 1 billion nucleotides per generation. Ionizing radiation can increase the mutation rate by up to 1,000-fold to 1 million-fold compared with the natural mutation rate (Johnson et al., 2000; Jiang and Ramachandran, 2010). Therefore, mutation breeding has been used by breeders and researchers to rapidly generate elite cultivars and high-quality, large mutant populations. This method has been used to genetically improve many crops. To date, approximately 3,365 varieties of more than 210 species have been developed using mutation technology (Malek et al., 2014; Chaudhary et al., 2019).1 Using gamma-ray radiation, Kim et al. (2019) developed a high-stearic acid mutant soybean Hfa180, with a stearic acid content more than 8.57-fold that in the original cultivar. Joshi et al. (2016) selected two mutant rice lines, D100-200 and D100-209, with increased salt tolerance; they showed increases of 18 and 34% in yield and related parameters compared with wild-type plants under saline conditions. Lee et al. (2014) generated a lipoxygenase-free soybean mutant H70 with reduced undesirable flavors. This mutant had a deletion and single-point mutations of Lox1, Lox2, and Lox3 genes compared with the original cultivar. The soybean mutant population used in this study was developed as a mutant diversity pool (MDP) by our group. Its agronomic traits, contents of functional components, and expression levels of genes related to certain phytochemicals have been described in other studies (Kim et al., 2020, 2021).

No previous studies have conducted AM using multiple GWAS models and a mutant soybean population. In this study, the first aim was to dissect causative QTLs related to isoflavone content in soybean seeds using an MDP and multiple GWAS methods; MLM and mrMLM (mrMLM, FastmrMLM, FASTmrEMMA, pLAR-mEB, pKWmEB, and ISIS EM-BLASSO) under year-to-year variation. A total of 25,646 high-quality SNPs in 189 mutant soybean genotypes with phenotypic data were used for this GBS-based GWAS analysis to detect important SNPs. The results of this study provide comprehensive insights into the factors that affect isoflavone contents in soybean seeds.

Materials and methods

Mapping population

The association mapping population consisting of 189 soybean lines from the MDP was selected and successfully validated with target region amplification polymorphism (TRAP) markers. Briefly, for constructing a soybean MDP, 1000 M1 seeds of each accession were derived from seven cultivars by 250 Gy of gamma-irradiation (Co60) and planted at Radiation Breeding Research Farm in the Korea Atomic Energy Research Institute (KAERI, 35.5699°N 126.9722°E, Jeongeup, Jeollabuk, Republic of Korea). Subsequently, 1695 M1:2 seeds single-seed descent planted in soybean filed at the KAERI during M1–M5 generation and then we selected 523 accessions based on agronomic traits. To homozygous population, 523 M5:6 seeds were propagated as bulks until M12 generation to remove redundant phenotypes. Finally, we selected 208 soybean mutants showing genetically fixed morphological characteristics. Previous GWAS of this population have revealed details of its genetic diversity, important agronomic traits, and functional genes (Kim et al., 2020, 2021). The seeds of the MDP were obtained from the (KAERI). To evaluate year-to-year variations in isoflavone contents, field trials were conducted in 2020 and 2021 at the Radiation Breeding Research Farm at the KAERI. In each year, the 189-member MDP was planted with a randomized complete block design with three replications. Each genotype was sown in one row per plot, and each row was 300 cm long, with 80 cm spacing between rows. Field management during the growing season, including fertilization, weed management, and insecticide-fungicide application were based on a standard cropping system. After maturity, three individual plants were randomly harvested from each plot and the seeds were collected to quantify isoflavones.

Analysis of isoflavone contents in soybean seeds

Isoflavones were extracted and quantified as described by Chen et al. (2020) and Lim et al. (2021), with slight modifications. Briefly, 10 dried seeds from each genotype were ground into a fine powder using a 2010 GENO/GRINDER (SPEX samplePrep LLC, Metuchen, NJ, United States). Then, 20 mg of the powder was placed in a 2-mL microcentrifuge tube with 1 mL of 80% (v/v) methanol. The mixture was incubated in a shaking incubator for 24 h (room temperature, 150 rpm). After centrifugation at 13,000 rpm for 5 min, the supernatant (1-mL) was passed through a syringe filter (NORM-JECT, Henke-Sass Wolf, Tuttlingen, Germany) equipped with a 0.22 μm PTFE membrane (GVS, Sanford, ME, United States). The filtered extract was placed in a 2-mL vial and subjected to ultra-performance liquid chromatography (UPLC) analysis. The UPLC was equipped with a reversed-phase UPLC column (ACQUITY BEH C18 Column; 2.1 × 50 mm, 1.7 μm particle size, Waters Corp., Milford, MA, United States). The mobile phase consisted of A (water with 0.1% v/v formic acid) and B (acetonitrile with 0.1% v/v formic acid). The elution gradient was as follows: 2 min of equilibration at 13%, followed by a linear increase in mobile phase B to 17% in 3 min, to 22% in 3 min, to 60% in 6 min, then held for 1 min (total run time, 20 min). The flow rate was 0.25 mL min–1, the injection volume was 1 μL, and the oven temperature was 25°C. Twelve isoflavone standards (daidzein (DZE), glycitein (GLE), Genistein (GNE), daidzin (DZI), glycitin (GLI), genistin (GNI), malonyl daidzin (MDZI), malonyl glycitin (MGLI), malonyl genistin (MGNI), acetyl daidzin (ADGI), acetyl glycitin (AGLI), and acetyl genistin (AGNI)) were purchased from Sigma-Aldrich for identification and quantification of each isoflavone in the soybean seeds.

Statistical analysis of isoflavone contents

Descriptive statistics such as maximum, minimum, mean, standard deviation (SD), skewness and kurtosis were calculated for data from each of the two years using Microsoft EXCEL 2016. Two-way analysis of variance (two-way ANOVA) and Pearson’s correlation coefficients among individuals and total isoflavone contents with each year were estimated to determine the effects of genotype, year, and genotype × year using R with the following basic codes; aov (formula, data = NULL, projections = FALSE, qr = TRUE, contrasts = NULL) function and corr_coef (data, verbose = TRUE). These analyses were conducted using the METAN package. The broad-sense heritability (h2) was calculated as follows:

where σ2G is genotypic variance, σ2GE is the variance of the genotype × environment interaction, σ2e is error variance, n is the number of environments, and r is the number of replications within each environment. The frequency distribution of phenotypic data was displayed using ggpubr and cowplot in the R package.

Single nucleotide polymorphisms genotyping and genotype-by-sequencing-based genome-wide association study

The 189 members of the MDP were genotyped using 37,249 SNPs derived by GBS as described by Kim (2021). The raw GBS data have been deposited at the NCBI sequence Read Archive (SRA) database (accession no. PRJNA845013). For the GWAS, a total of 25,646 high-quality SNPs in the 189 MDP with minor allele frequency (MAF) > 0.01 were filtered from raw SNPs using Plink (command settings: geno 0.5, mind 0.2). First, when applying the mixed linear model (MLM) and multi-locus random-SNP-effect mixed linear model (mrMLM) to reduce the false-positive rate in GWAS, a principal component analysis (PCA) was conducted and a kinship matrix (K) was calculated using Tassel 5.2 software to estimate the population structure. The Bonferroni threshold (p-value = 1.95 × 10–6) was calculated by negative log10 (0.05/n, n is the number of SNPs) and its threshold was used to detect important quantitative trait nucleotides (QTNs). A MLM based on single-locus effects was applied by Tassel using phenotype and genotype data with PC as a fixed effect and K as a random effect. In this study, we used six approaches: mrMLM, FastmrMLM, FASTmrEMMA, pLAR-mEB, pKWmEB, and ISIS EM-BLASSO within the multi-locus random-SNP-effect mixed linear model (mrMLM).2 The logarithm of the odds (LOD) score was set to 3 for determining significant QTNs and all options were set to default values. Information about the position of QTNs was retrieved from the Phytozome v9.13 and Soybase4 databases, based on the cultivar Williams 82.a2.v1. The Manhattan and QQ plots were drawn using CMplot in the R package.

RNA extraction and cDNA synthesis

The immature soybean seeds of wild type (Danbaek) and its mutant (DB-088) were collected into conical tubes, immediately frozen in liquid nitrogen, and then finely ground using a mortar and pestle. Total RNA was isolated using TRIZOL reagent (Invitrogen, Carlsbad, CA, United States). To adjust the concentration in the next two steps (RNA DNase treatment and cDNA synthesis), RNA quantity and quality were determined using a NanoDrop ND-1000 spectrophotometer (Thermo Fisher Scientific, Waltham, MA, United States). First, 20 μg total RNA was digested to remove contaminating DNA, and then the first-strand cDNA was synthesized using SuperScript III First-Strand synthesis SuperMix (Invitrogen). All of these procedures were conducted according to the manufacturer’s instructions.

Gene transcript analysis quantitative real-time PCR

The transcript levels of GmGSTT1, IFS1, and IFS2 were determined using the Bio-Rad CFX96 Real-time PCR system (Bio-Rad, Hercules, CA, United States) with iTaq Universal SYBR Green Supermix (Bio-Rad). The reference gene was ELF1B (Jian et al., 2008; Le et al., 2012). The thermal cycling conditions were as follows: 95°C for 10 min, followed by 40 cycles of 95°C for 15 s, 50°C for 15 s, and 72°C for 30 s. Each reaction had five replicates. The relative transcript level of each gene was calculated as follows:

The primer sets used in this study are listed in Supplementary Table 1.

Results

Quantification of isoflavone content in soybean seeds across 2 years

To evaluate and validate the range of variation in isoflavone contents among genotypes and across years, the 189 mutant soybean genotypes were grown for 2 consecutive years (2020 and 2021) for GWAS analysis. We quantified the total isoflavones (TI) content and the contents of twelve types of isoflavones in soybean seeds: aglycones (DZE, GLE, and GNE), glucosides (DZI, GLI, and GNI), malonyl-glucosides (MDZI, MGLI, and MGNI), and acetyl-glucosides (ADGI, AGLI, and AGNI).

The average contents of isoflavones in each year were as follows: DZE [2020 (6.14 μg g–1), 2021 (4.67 μg g–1)], GLE [2020 (14.66), 2021 (2.80)], GNE [2020 (7.85), 2021 (6.75)], DZI [2020 (309.14), 2021 (168.45)], GLI [2020 (128.96), 2021 (109.00)], GNI [2020 (421.80), 2021 (252.62)], MDZI [2020 (796.50), 2021 (602.93)], MGLI [2020 (229.07), 2021 (226.86)], MGNI [2020 (1070.55), 2021 (885.91)], ADZI [2020 (117.03), 2021 (88.59)], AGNI [2020 (3.08), 2021 (2.41)], and TI [2020 (3104.80), 2021 (2350.98)].

Overall, isoflavone contents were lower in 2021 than in 2020. The average contents of aglycones, glucosides, malonyl-glucosides, acetyl-glucosides, and total isoflavones decreased from 9.55 μg g–1 in 2020 to 4.74 μg g–1 in 2021 (decreased by 50.39%), from 286.63 μg g–1 in 2020 to 176.69 μg g–1 in 2021 (decreased by 38.36%), from 698.71 μg g–1 in 2020 to 571.90 μg g–1 in 2021 (decreased by 18.15%), from 60.06 μg g–1 to 45.50 μg g–1 (decreased by 24.24%) and from 3104.80 μg g–1 in 2020 to 2350.98 μg g–1 in 2021 (decreased by 24.28%), respectively. The isoflavone contents showed continuous variations with normal distribution in the 2 years (Figure 1), indicating that isoflavone contents are quantitative traits that are regulated by numerous genes with smaller effects of environmental factors.

FIGURE 1

As shown in Table 1, the ANOVA revealed significant effects of genotype, year, and genotype × year on isoflavone contents. The contents of all isoflavones except for MGLI differed significantly between 2020 and 2021 (P < 0.001). This result suggested that the isoflavone content was strongly affected by genetic factors and environmental factors. Based on variance among genotypes, years, and genotype × year, the broad-sense heritability values calculated for isoflavone contents ranged from high (e.g., 86% for MGNI and for ADZI) to low (e.g., 15% for GLE). This indicated that heritability varied depending on the type of isoflavone, and that all of the studied isoflavones except for GLE and AGNI were more affected by genetic factors than by environmental factors.

TABLE 1

TraitYearMinaMaxaSDaMeanaEffects
SkewnessKurtosisH2
gbycg × yd
DZE20201.3323.263.406.14*********1.714.280.51
20210.4832.664.474.673.7318.05
GLE20205.6130.404.0014.66*********0.861.560.15
20210.3331.533.662.804.2823.77
GNE20202.6222.353.157.85*********1.151.920.47
20211.0044.045.616.753.1914.91
DZI202088.15866.08142.15309.14*********1.111.370.81
202133.82525.19102.75168.451.251.28
GLI202020.83352.5068.98128.96*********0.760.190.78
202128.86282.5351.71109.000.870.41
GNI2020119.381097.23165.12421.80*********1.151.680.81
202162.30822.62138.84252.621.331.97
MDZI2020264.502254.16341.12796.50*********1.322.520.82
2021177.221757.39302.93602.931.021.12
MGLI202022.98681.26115.17229.07***ns***0.900.720.77
202152.15535.40101.86226.860.61–0.21
MGNI2020256.152628.58377.591070.55*********1.091.910.85
2021172.842295.43404.65885.910.810.64
ADZI202026.89271.7140.56117.03*********0.961.590.86
202120.68217.1839.1488.590.760.31
AGNI20200.897.431.313.08*********0.800.490.25
2021017.461.932.413.1620.21
TI20201032.956993.061129.293104.80*********0.900.810.86
2021617.355844.521036.562350.980.740.17

Descriptive statistics, ANOVA analysis, and broad-sense heritability of isoflavone contents in MDP.

DZE = daidzein; GLE, glycitein; GNE, Genistein; DZI, daidzin; GLI, glycitin; GNI, genistin; MDZI, malonyl daidzin; MGLI, malonyl glycitin; MGNI, malonyl genistin; ADGI, acetyl daidzin; AGLI, acetyl glycitin; AGNI, acetyl genistin; ns, not significant; H2, broad-sense heritability. *** significant at p < 0.001.

a μg g–1.

b g means the genotype effects.

c y means the year effects.

d g × y means the genotype by year effects.

Correlation analysis between individual isoflavones in 2020 and 2021

To determine the relationships between total isoflavones and individual isoflavones, and between pairs of individual isoflavones in the two years, we conducted Pearson’s correlation analysis using data for 12 individual isoflavones (Figure 2). Strong positive correlations were detected between TI and ADZI [2020 (r = 0.95, P < 0.001) and 2021 (r = 0.96, P < 0.001)], MGNI [2020 (r = 0.94, P < 0.001) and 2021 (r = 0.96, P < 0.001)], MDZI [2020 (r = 0.94, P < 0.001) and 2021 (r = 0.94, P < 0.001)], GNI [2020 (r = 0.94, P < 0.001) and 2021 (r = 0.89, P < 0.001)], and DZI [2020 (r = 0.93, P < 0.001) and 2021 (r = 0.89, P < 0.001)]. This result indicated that the main contributors to TI were ADZI, MGNI, MDZI, GNI, and DZI in both years.

FIGURE 2

We also detected positive correlations between DZI and GNI, between DZI and MDZI (r = 0.85–0.96, P < 0.001) and between GLI and MGLI (r = 0.92–0.95, P < 0.001). GNI and MDZI were positively correlated with MGNI and ADZI (r = 0.84–0.96, P < 0.001). Although GLE and GNE were positively correlated with MGNI and ADZI (r = 0.60–0.91, P < 0.001), GLE and GNE were positively correlated with GNI (r = 0.7–0.88, P < 0.001), and DZI was positively correlated with others (r = 0.63–0.68, P < 0.001) in 2020, these correlations were not significant or weak in 2021. This result indicated that correlations among isoflavones were subject to year effects as well as genetic factors.

Identification of quantitative trait nucleotides for total and individual isoflavone contents

The 25,646 SNPs (MAF < 0.01) were distributed across all the chromosomes, with frequencies ranging from 865 SNPs (3.37%, Chr. 12) to 2182 SNPs (8.51%, Chr. 18) per chromosome, with an average of 1,282 SNPs per chromosome. Thus, the SNPs were evenly distributed across all 20 soybean chromosomes in the MDP, and their number and distribution were sufficient for GWAS analysis. Of the 158 QTNs detected, 112 and 46 significant QTNs (LOD ≥ 3, p-value = 1.95 × 10–6) related to isoflavone contents were detected in 2020 and 2021, respectively. They were distributed across all chromosomes except for Chr. 12 and Chr. 17. The QTNs were detected using MLM and mrMLM methods (Figures 3, 4). Information for all QTNs is shown in Supplementary Tables 2, 3.

FIGURE 3

FIGURE 4

We detected 112 QTNs related to isoflavone contents in 2020. These QTNs were located on 18 of the 20 soybean chromosomes (all except Chr. 12 and 17). The number of QTNs per chromosome ranged from one (Chr. 1, Chr. 6, Chr. 7, and Chr. 16) to 19 (Chr. 5 and Chr. 20). Of the QTNs significantly related to isoflavones detected in 2020 (LOD ≥ 3, -log10(P) ≥ 5.7), nine QTNs were related to DZE and explained 1.71–27.89% of the variation in DZE content (–1.80 to 16.51 μg g–1). They were located on Chr. 2, 5, 10, 14, 18, and 20. The 10 QTNs for GLE were located on chromosomes 2, 5, 8, 9, 10, 11, and 20, and explained 1.67%–17.25% of the variation in GLE content (–3.05 to 8.93 μg g–1). Ten QTNs for GNE were located on Chr. 3, 5, 10, 19, and 20, and explained 2.52–22.62% of the variation in GNE content (–2.25 to 12.91 μg g–1). The nine QTNs for DZI were located on chromosomes 4, 5, 7, 14, 15, 18, and 20 and explained 6.50–18.18% of variation in QTN content (–117.49 to 385.37 μg g–1). The three QTNs for GLI were located on Chr. 14, 18, and 20 and explained 5.86–21.53% of the variation in QTN content (22.45–73.22 μg g–1). The 13 QTNs for GNI were located on chromosomes 2, 5, 10, 11, 15, 18, and 20 and explained 0.16–22.36% of the variation in GNI content (–324.75 to 525.77 μg g–1). The 12 QTNs for MDZI were located on Chr. 4, 5, 8, 9, 13, 18, and 20, and explained 2.07–16.67% of the variation in MDZI content (–316.48 to 891.52 μg g–1). The 8 QTNs for MGLI were located on chromosomes 1, 2, 3, 5, 14, 18, and 20 and explained 2.29–16.74% of the variation in MGLI content (–64.00 to 106.65 μg g–1). The 11 QTNs for MGNI were located on chromosomes 2, 4, 5, 11, 15, and 20, and explained 2.12–21.77% of the variation in MGNI content (–999.45 to 1261.72 μg g–1). The 12 QTNs for ADZI were located on chromosomes 2, 4, 5, 6, 8, 11, 15, and 20, and explained 3.80–19.31% of the variation in ADZI content (–31.76 to 77.35 μg g–1). The seven QTNs for AGNI were located on chromosomes 11, 13, 16, 18 and 20, and explained 4.21–14.11% of the variation in AGNI content (–0.60 to 0.57 μg g–1). The 8 QTNs for TI were located on chromosomes 2, 3, 4, 5, 9, 15, and 20, and explained 2.81–18.31% of the variation in TI content (–3115.80 to 2665.47 μg g–1).

Of the 122 QTNs detected in 2020, 12 QTNs were detected for two or more types of isoflavones and using more than one GWAS analysis method. These 12 QTNs were located on Chr. 2, 4, 5, 8, 9, 10, 11, 14, 15, and 20 (Table 2).

TABLE 2

QTNTraitChrPosition (bp)Effect-log10(P)R2MAFAlleleMethodCandidate geneSymbolDescriptionReferences
Gm02:38740302GLE2387403024.34 ∼6.416.25 ∼10.261.67 ∼17.250.50C/A1, 2, 4, 6Glyma.02g202300Pseudouridine synthase family protein
GNI263.71 ∼307.915.78 ∼12.1613.07 ∼22.360.02 ∼0.504, 6, 7
MGNI465.57, 504.726.07, 7.482.12, 9.90.501, 4
ADZI59.9611.409.750.504
TI1199.89 ∼1680.986.40 ∼9.152.82 ∼15.730.501, 2, 4, 6
Gm04:26325280DZI426325280–58.857.607.480.20A/T1Glyma.04g144300ENO1Enolase 1Arias et al., 2022
MDZI–179.2712.735.740.201
TI–428.116.963.640.201
Gm04:30558742DZI430558742351.75, 385.376.13 ∼7.2310.04 ∼10.720.50C/T2, 4, 6
MDZI750.06 ∼891.526.25, 10.98.02, 8.560.502, 4
MGNI831.46, 843.025.78, 5.937.01, 9.870.504,6
ADZI77.356.144.120.504
TI2331.81 ∼2665.476.54 ∼8.496.04 ∼9.90.502, 4, 6
Gm05:35170270GNI535170270525.775.870.160.03G/A7Glyma.05g160000CESA8Cellulose synthase family proteinNawaz et al., 2017
MGNI1261.726.050.170.037
Gm05:38940662GNE538940662–0.955.795.740.50T/C2Glyma.05g207000GSTT1Glutathione S-transferase THETA 1Knizia et al., 2021
DZI–117.49 ∼50.676.22 ∼8.418.76 ∼14.280.502, 3, 4
GNI–83.91 ∼64.229.82, 10.489.47, 13.470.501, 2
MDZI–316.48 ∼85.116.32 ∼11.743.09 ∼10.230.501, 2, 3, 4, 5, 6
MGNI–318.26 ∼125.766.33 ∼8.704.27 ∼14.980.501, 2, 3, 4, 6
ADZI–10.93, 13.816.47, 9.3211.54, 19.170.505, 6
TI–837.84to ∼318.645.96 ∼12.956.22 ∼13.840.501, 2, 3, 4, 5
Gm05:39039365GLE539039365–3.05, 1.467.21, 10.006.26, 13.640.50C/T3, 4
GNI–152.06to ∼47.365.84 ∼12.938.33 ∼19.880.503, 4, 5, 6
ADZI–31.76 ∼13.625.80 ∼9.936.47 ∼14.380.501, 3, 4
Gm05:41272024GLE541272024–2.5910.609.130.08A/T4
GNI–324.755.7717.040.087
MGNI–831.485.7517.470.087
Gm08:12085018MDZI812085018–115.855.704.850.36T/G2Glyma.08g156300
ADZI–12.79, 16.475.73, 9.944.65, 7.920.35, 0.361, 2
Gm09:38479648GLE938479648–1.77.2510.710.16T/C2Glyma.09g160600RKL1Receptor-like kinase 1
MDZI–151.537.869.280.166
Gm10:45467630DZE10454676306.46, 8.695.01, 6.814.36, 8.210.50A/C4, 6Glyma.10g223800Basic-leucine zipper (bZIP) transcription factor family proteinGuo et al., 2021
GLE8.936.1711.190.505
GNE8.15 ∼9.527.54 ∼8.038.41 ∼9.520.501, 4, 6
GNI397.33, 452.236.68, 7.825.82, 10.910.051, 2
Gm10:49323094DZE104932309412.44 ∼16.516.23 ∼21.6716.85 ∼27.890.01 ∼0.50C/T1, 2, 4, 6, 7Glyma.10g271000Protein of unknown function (DUF1218)
GNE9.88 ∼12.919.56 ∼12.2812.97 ∼22.620.051, 2, 4, 6
Gm11:508733GNI11508733–84.236.593.420.05A/G4Glyma.11g007200
ADZI–28.115.9010.000.051
Gm11:589619GLE11589619–2.6712.7114.880.07T/A2
GNI–115.9410.8417.650.076
MGNI–207.58, 250.587.47, 10.6611.55, 11.610.074, 6
Gm14:38439549DZI1438439549128.52, 147.196.08, 7.3411.55, 13.480.50A/G4,6
GLI73.226.5620.440.502
Gm15:24347624GNI1524347624–137.639.524.640.04C/T1Glyma.15g200200NHX2Sodium hydrogen exchanger 2
TI–745.3810.765.320.044
Gm15:24347662MGNI1524347662–206.83 ∼223.686.59 ∼8.436.76 ∼11.110.06T/C2, 4, 6Glyma.15g200200NHX2Sodium hydrogen exchanger 2
ADZI–21.535.959.230.066
Gm15:47928380GNI1547928380170.095.788.600.13T/C3Glyma.15g251200CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein
MGNI287.69, 438.176.34, 7.570.20, 10.910.06, 0.132, 7
ADZI23.82, 45.196.09, 7.440.19, 10.060.06, 0.133, 7
Gm20:8188732ADZI20818873237.237.549.660.17G/C3
TI425.487.9318.310.175
Gm20:8460978GNE2084609781.5110.1611.180.30T/G1
GNI56.957.424.320.301
MGNI304.246.838.270.303
Gm20:8666294DZE2086662940.977.963.720.20G/A4Glyma.20g046800ATCDPMEK4-(cytidine 5’-phospho)-2-C-methyl-D-erithritol kinaseJahan, 2019
DZI50.37,124.236.56, 6.958.99, 10.750.203, 5
MDZI123.666.4016.670.205
MGNI168.6511.3721.770.205
ADZI17.308.8414.510.206
Gm20:16191481GNI201619148153.12, 68.887.24, 8.184.91, 13.550.20A/G4, 6
MDZI122.93, 136.867.21, 13.507.46, 8.230.204, 6
ADZI14.32 ∼15.487.78 ∼10.486.74 ∼10.270.20 ∼0.211, 2, 4

Information about co-detected QTNs for isoflavone contents in 2020 using MLM-based models.

DZE, daidzein; GLE, glycitein; GNE, Genistein; DZI, daidzin; GLI, glycitin; GNI, genistin; MDZI, malonyl daidzin; MGLI, malonyl glycitin; MGNI, malonyl genistin; ADGI, acetyl daidzin; AGLI, acetyl glycitin; AGNI, acetyl genistin; QTN, Quantitative trait nucleotide; Chr, chromosome; Effect, QTN effect for isoflavone content (μg g–1); R2, phenotype variation; MAF, minor allele frequency. Method 1–7 refers to mrMLM, FastmrMLM, FASTmrEMMA, pLAR-mEB, pKWmEB, and MLM, respectively.

We detected 46 QTNs related to isoflavones content in 2021. They were located on 16 of the 20 soybean chromosomes (all except Chr. 2, 8, 12, and 15). The one QTNs related to DZE were located on Chr. 3 and explained 12.30% of the variation in DZE content (16.30 μg g–1). The three QTNs for GLE were located on Chr. 3, 7 and 18 and explained 6.07–58.24% of the variation in GLE content (9.22–29.05 μg g–1). The two QTNs for GNE were located on Chr. 3 and 10, and explained 17.00–57.37% of the variation in GNE content (16.00–39.51 μg g–1). The six QTNs for DZI were located on Chr. 4, 5, 6, 9, and 20, and explained 2.95–19.67% of the variation in DZI content (–41.34 to 366.70 μg g–1). The three QTNs for GLI were located on Chr. 17, 18, and 19, and explained 5.74–25.26% of the variation in GLI content (–27.80 to 19.54 μg g–1). The five QTNs for GNI were located on Chr. 3, 4, 6, 18, and 20, and explained 4.66–26.02% of the variation in GNI content (–75.44 to 617.89 μg g–1). The four QTNs for MDZI were located on Chr. 4, 7, 13, and 20, and explained 6.04–22.13% of the variation in MDZI content (–116.13 to 1241.39 μg g–1). One QTN for MGLI was located on Chr. 1, and explained 4.01% of the variation in MGLI content (–140.64 μg g–1). The four QTNs for MGNI were located on Chr. 1, 4, 5, and 20, and explained 6.37–21.93% of the variation in MGNI content (–245.86 to 1199.61 μg g–1). The five QTNs for ADZI were located on Chr. 4, 5, 18, and 20, and explained 5.23–21.18% of the variation in ADZI content (–19.42 to 105.52 μg g–1). The seven QTNs for AGNI were located on Chr. 3, 11, 14, 16, and 20, and explained 8.13–34.48% of the variation in AGNI content (–1.60 to 3.31 μg g–1). The five QTNs for TI were located on Chr. 1, 4, 5, and 20 and explained 4.34–21.35% of the variation in TI content (–524.76 to 3212.93 μg g–1).

Of the 46 QTN detected in 2021, 5 QTNs were detected for two or more types of isoflavones and using more than one GWAS analysis method. These 5 QTNs were located on Chr. 3, 4, 5, 18, and 20 (Table 3).

TABLE 3

QTNTraitChrPosition (bp)Effect-log10(P)R2MAFAlleleMethodCandidate geneSymbolDescriptionReferences
Gm03:40639077DZE34063907716.307.1212.300.50C/A1Glyma.03g196800Nucleic acid-binding, OB-fold-like protein
GLE26.84 ∼29.0511.33 ∼34.530.35 ∼58.240.01 ∼0.501, 2, 4, 7
GNE33.69 ∼39.5110.28 ∼20.1531.69 ∼57.370.501, 2, 4, 7
GNI613.86 ∼617.897.09 ∼15.7823.00 ∼26.020.01 ∼0.501, 2, 4, 7
Gm04:30558742DZI430558742281.11 ∼366.706.36 ∼9.9413.85 ∼19.360.50C/T1, 2, 4, 6
GNI325.03 ∼374.796.50 ∼7.507.03 ∼12.120.502, 4, 6
MDZI971.73 ∼1241.399.03 ∼10.2614.44 ∼22.130.501, 2, 4, 6
MGNI975.91 ∼1199.617.15 ∼7.967.38 ∼15.970.501, 2, 4, 6
ADZI89.22 ∼105.526.97 ∼9.145.86 ∼9.460.501, 2, 4, 6
TI2839.91 ∼3212.937.09 ∼10.7211.79 ∼14.440.501, 2, 4, 6, 7
Gm05:41272024MGNI541272024–245.86 to ∼319.446.64 ∼10.239.43 ∼18.830.08A/T1, 2, 4, 6
ADZI–19.42, 23.106.82, 9.505.69, 12.880.081, 6
TI–524.76 to ∼603.996.81 ∼9.366.86 ∼13.420.082, 4, 6
Gm18:56278301GNI1856278301–46.47, ∼38.915.88, 6.844.66, 6.070.33T/C2, 4Glyma.18g281800Homeodomain-like superfamily protein
ADZI–12.969.225.230.331
Gm20:32811819DZI203281181942.23 ∼107.066.66 ∼8.6912.78 ∼14.840.30 ∼0.31C/T1, 2, 6
GNI59.81, 68.258.79, 10.4411.59, 19.260.30 ∼0.311, 6
MDZI93.33 ∼279.115.88 ∼8.006.04 ∼11.870.30 ∼0.311, 2, 3, 4, 6
MGNI153.07 ∼204.059.47 ∼12.316.37 ∼21.930.30 ∼0.311, 2, 4, 6
ADZI14.10 ∼26.375.75 ∼10.695.46 ∼21.180.30 ∼0.311, 2, 3, 6
TI476.50 ∼866.847.86 ∼12.008.42 ∼21.350.30 ∼0.311, 2, 3, 4, 6

Information about co-detected QTNs for isoflavone contents in 2021 using MLM-based models.

DZE, daidzein; GLE, glycitein; GNE, Genistein; DZI, daidzin; GLI, glycitin; GNI, genistin; MDZI, malonyl daidzin; MGLI, malonyl glycitin; MGNI, malonyl genistin; ADGI, acetyl daidzin; AGLI, acetyl glycitin; AGNI, acetyl genistin; QTN, 1Quantitative trait nucleotide; Chr, chromosome; Effect, QTN effect for isoflavone content (μg g–1); R2, phenotype variation; MAF, minor allele frequency. Method 1–7 refers to mrMLM, FastmrMLM, FASTmrEMMA, pLAR-mEB, pKWmEB, and MLM, respectively.

Dissection of stable quantitative trait nucleotides and candidate genes

To explore the genetic basis of isoflavone biosynthesis, we estimated the genome-wide average of LD decay using the genotype dataset of 25,646 SNPs. The LD half decay distance was approximately 441 kb (Figure 5A). Significant QTNs located around Gm05:38940662 (Glyma.05g207000) on chromosome 5 were detected by multiple models (MLM, mrMLM, FASTmrMLM, FASTmrEMMA, pLARmEB, pKWmEB, and ISIS EM-BLASSO) in 2020, and were related to the contents of six individual isoflavones (GNE, DZI, GNI, MDZI, MGNI, and ADZI) and TI. A total of 44 QTNs were detected in a 441-kb interval surrounding Gm05:38940662. Of these, four QTNs (Gm05:38936166, Gm05:38936167, Gm05:38940662 and Gm05:38940717) were significantly associated with variations in isoflavone contents (Figure 5B). Two of those QTNs (Gm05:38936166 and Gm05:38936167) were located at the 3’-untranslated region (UTR) of Glyma.05g206900 (a gene encoding glutathione S-transferase THETA1; GSTT1), and the other two (Gm05:38940662 and Gm05:38940717) were located at the 3’-UTR and caused non-synonymous substitutions in the coding sequence (CDS) of Glyma.05g207000 (a gene on chromosome 5 encoding another GSTT1). The QTNs at Gm05:38936166 and Gm05:38940662 caused transitions (A/G and C/T) and the QTN at Gm05:38936167 resulted in a transversion. The QTN at Gm05:38940717 resulted in a predicted amino acid substitution from threonine to isoleucine (Figure 5C).

FIGURE 5

To compare the isoflavone contents based on these four QTNs, a subset of 137 soybean mutants with variable isoflavone contents was divided into two haplotypes (Hap1 and Hap2) (Figure 5D). The mean contents of the following seven isoflavones were significantly different between Hap 1 (58 genotypes) and Hap 2 (79 genotypes): TI [hap1 (3806.44 μg g–1) and hap2 (2448.48 μg g–1), p < 0.001], GNE [hap1 (8.59 μg g–1) and hap2 (7.03 μg g–1), p < 0.01], DZI [hap1 (390.40 μg g–1) and hap2 (236.20 μg g–1), p < 0.001], GNI [hap1 (519.64 μg g–1) and hap2 (325.42 μg g–1), p < 0.001], MDZI [hap1 (974.80 μg g–1) and hap2 (632.11 μg g–1), p < 0.001], MGNI [hap1 (1281.23 μg g–1) and hap2 (857.82 μg g–1), p < 0.001], ADZI [hap1 (141.21 μg g–1) and hap2 (92.73 μg g–1), p < 0.001].

Expression pattern the candidate gene GmGSTT1 in a mutant

To explore the transcriptional patterns of GmGSTT1 (Glyma.05g206900 and Glyma.05g207000) and those of the core isoflavone biosynthesis genes IFS1 and IFS2, we conducted quantitative real-time PCR (qRT-PCR) analyses using seeds of Danbaek (DB) and its mutant (DB-088) with low and high isoflavone contents, respectively. These two lines were selected from the 189 members of the MDP as shown in Figure 6A. Consistent with the predicted effects of the QTNs, the contents of the isoflavones GNE, DZI, DZI, MDZI, MGNI, ADZI, and TI were significantly higher in seeds of DB088 than in those of DB. The qRT-PCR analyses confirmed differences in gene transcript levels between the two genotypes (Figure 6B). The relative transcript levels of GST1-1 (Glyma.05g206900) and GST1-2 (Glyma.05g207000) were 1.96-fold (p < 0.01) and 1.76-fold (p < 0.05) higher, respectively, in the seeds of DB088 than in the seeds of DB. Interestingly, IFS1 and IFS2 also showed higher transcript levels in the seeds of DB088 than in the seeds of DB, with similar transcriptional patterns to that of GmGSTT1. The transcript level of IFS1 in seeds of DB-088 was 3.71-fold (p < 0.001) that in seeds of DB, and the transcript level of IFS2 in DB-088 was 2.32-fold (p < 0.001) that in DB. These results suggested that GmGSTT1 is involved in isoflavone synthesis in soybean seeds.

FIGURE 6

Discussion

This core collection constructed by mutation breeding has previously been used in GWAS and linkage mapping studies to dissect important QTLs and candidate genes associated with quantitative and qualitative traits for crop improvement. Kim et al. (2019) constructed a bi-parental mapping population derived from the high-stearic acid mutant soybean Hfa180 with a three-base-pair deletion in exon 1 of GmSACPD-C. This mutation leads to a high stearic acid content and was developed using mutated alleles. In the population derived from this mutant, Kompetitive allele-specific PCR markers were used to identify a subset of high-stearic acid soybean lines. Hwang et al. (2020) performed GWAS analysis of a core rice mutant population (2,561 individuals) induced by gamma-ray irradiation. This population was subjected to abiotic stress treatments. Then, whole-genome resequencing analyses revealed a candidate gene for stress tolerance (Os02g0528900, encoding a PDR-like-ABC transporter) with two SNPs. Ma et al. (2022) used QTL mapping with bulk segregant analysis and GBS methods to detect the QTL for the dwarf phenotype of the mutant upland cotton line Ari1327. Thus, the use of GWAS or linkage mapping combined with a mutation breeding population is ideal for genome-wide mapping studies to explore the basis of complex agronomic traits.

To reveal causal genes associated with isoflavone content, we first conducted a GBS-based GWAS using multiple statistical methods, including MLM and mrMLM, to analyze data from a mutant mapping population consisting of 189 soybean mutant genotypes. The multi-locus random-SNP-effect mrMLM, which was developed by Zhang et al. (2020), includes six methods: mrMLM, FASTmrMLM, FASTmrEMMA, pLARmEB, pKWmEB, and ISIS EM-BLASSO. Unlike the conservative MLM model, the mrMLM method has been widely used to detect many important QTNs. The mrMLM estimated potentially significant QTNs based on empirical Bayes. Even if the significance threshold is less stringent, mrMLM have robust statistics, accuracy, short running time, and a low false positive rate (Cui et al., 2018; Guan et al., 2019; Zhang et al., 2020).

The isoflavone contents in the mutant population varied significantly between 2020 and 2021 because of complex heritability as well as external factors. In this study, the largest decrease was detected for aglycones (reduced by 50.39% in 2021 compared with that in 2020), followed by glucosides, total isoflavones, acetyl-glucosides, and malonyl-glucosides (reduced by 38.36, 24.28, 24.24, and 18.15% in 2021 compared with those in 2020, respectively). Thus, all of the isoflavones detected were affected by genotype, year, and the genotype × year interaction (Table 1). In previous studies, one soybean cultivar showed a 3.6-fold variation in isoflavone contents among different sites in the same year, and variations of up to 1.5-fold across different years (Eldridge and Kwolek, 1983); and the contents of individual isoflavones showed variations between 2 years ranging from 16.6% (MDZI) to 400% (GLE), and variations among eight planting locations ranging from 20.9% (MDZI) to 400% (GLE) (Hoeck et al., 2000). Factors that affect the contents of secondary metabolites in soybean seeds include temperature, location, sowing date, the type of fertilizer used, and the developmental stage of the seeds. Previous studies have demonstrated that high temperatures can lead to decreased contents of isoflavones and fatty acids (but not saponins) in soybean seeds. Similarly, isoflavone contents were found to be 5.8-fold lower in soybean varieties cultivated with early sowing dates (April and May) than in those cultivated with later sowing dates (June and July) (Tsukamoto et al., 1995). Another study showed that a shift from high to low temperature increased total isoflavone contents by 2.5-fold, with daidzein showing the largest increase of 3-fold (Lozovaya et al., 2005).

In our study, for those isoflavones showing a normal distribution in their contents in the MDP population, the broad-sense heritability ranged from 15% (GLE) to 86% (TI and ADZI) across the 2 years. Consistent with our results, a previous study detected heritability of TI ranging from 87.2 to 93.8% in individual and multiple environments. Meng et al. (2016) and Chu et al. (2017) detected broad-sense heritability of major isoflavone components ranging from 67.80% (GNE) to 83.80% (GLE) and from 87.2 to 93.8% (TI) across years and locations, respectively. However, Primomo et al. (2005) and Zeng et al. (2009) estimated the heritability for isoflavones to be lower than 60% based on multiple interactions. Overall, our results and those of other studies suggest that isoflavone contents in soybean seeds are more influenced by genetic effects than by year effects, indicating that isoflavone contents in soybean seeds are traits that can be genetically manipulated. We conducted correlation analyses for 12 types of isoflavones across two years (2020 and 2021), and detected strong positive correlations between total isoflavones and glucosides [DZI, GLI, and GNI (r = 0.64–0.94 in 2020 and r = 0.67–0.89 in 2021)]; and between total isoflavones and malonyl glucosides [MDZI, MGLI and MGNI (r = 0.64–0.94 in 2020 and r = 0.67–0.89 in 2021)]. We detected weaker positive correlations between total isoflavones and aglycones [DZE, GLE, and GNE (r = 0.46–0.82 in 2020 and r = 0.12–0.20 in 2021)]; and between total isoflavones and acetyl glucosides [ADZI and AGNI (r = 0.26–0.96 in 2020 and r = 0.47–0.95 in 2021)]. Across both years, DZI (r = 0.93***, r = 0.89***), GNI (r = 0.94***, r = 0.89***), MDZI (r = 0.94***, r = 0.94***), MGNI (r = 0.94***, r = 0.96***) and ADZI (r = 0.95***, r = 0.96***) showed significantly strong positive correlations with TI and with each other, consistent with the results of other studies (Azam et al., 2020; Yoon et al., 2021).

Previously, GWAS and QTL mapping have been used to detect candidate genes for quantitative and qualitative traits using natural populations and bi-parental populations, respectively. Recently, QTLs related to isoflavones have been detected by QTL mapping using a bi-parental population, an F2 population, and RILs. Cai et al. (2018) identified a novel QTL underlying isoflavone content using an RIL mapping population with 3469 bin markers based on restriction site-associated DNA tag sequencing. They detected 15 stable QTLs mapped onto ten chromosomes, among which qIF5-1 explained the largest proportion of variation in isoflavone content across environments (6.37–59.95%). In another study, specific-locus amplified fragment (SLAF) sequencing was used for high-density linkage mapping using 3,541 SLAF markers in multiple environments. That study detected 24 stable QTLs for isoflavone components, which explained 4.2–21.2% of the variations in their contents. They included four novel QTLs (qG8, qMD19, QMG18, and qTIF19) for GNI, MDZI, MGNE, and TI, respectively (Pei et al., 2018). A GWAS identified GmMYB29, a gene encoding a R2R3-type MTB transcription factor, which binds to the promoters of IFS2 and CHS8 to activate their expression. When GmMYB29 was over-expressed or silenced by RNAi in soybean hairy roots, the isoflavone contents were increased by 1.6-fold to 3.3-fold or decreased by 2-fold, respectively (Chu et al., 2017). GmMPK1, encoding a mitogen-activated protein kinase, was detected in GWAS and linkage mapping studies using a bi-parental mapping population of soybean. This gene was found to be significantly associated with isoflavone contents and resistance to Phytophthora root rot (Wu et al., 2020). Another study detected GmMYB176, which encodes a transcription factor that binds to the CHS8 promoter to regulate isoflavone biosynthesis. In further analyses, the isoflavone content decreased when GmMYB176 was suppressed by RNAi, but it was not increased when GmMYB176 was overexpressed (Yi et al., 2010). Overexpression of GmMYB29 and GmMPK1 led to increased isoflavone contents and enhanced tolerance to biotic and abiotic stresses, indicating that isoflavone biosynthesis is closely associated with resistance to external stress factors. Together, the results of those studies show that the expression of isoflavone biosynthetic genes, such as PAL, 4CL, CHS8, CHR, and IFS2, is activated by MYB, zinc, bZIP, and PLATZ transcription factors and their interactions in soybean.

In our study, we first screened for QTNs and candidate genes related to isoflavone contents using a mutant soybean population. and confirmed that isoflavone contents were significantly different (p < 0.001) between two haplotype groups. We identified QTNs in genes encoding transcription factors such as bZIP (Glyma.05g.223800), WRKY (Glyma.04g238300) and bHLH (Glyma.13g291400). We also detected a stable QTN (Gm05:38940662) on chromosome 5 in a gene encoding glutathione S-transferase THETA 1 (GmGSTT1), which was found to be associated with seven types of isoflavones (GNE, DZI, GNI, MDZI, MGNI, AZI, TI) in 2020 using multiple-GWAS models (Table 2). In the LD analysis based on Gm05:38940662, we detected 44 QTNs (Gm05:38513238 to Gm05:39363243) within a 441-kb surrounding Gm05:38940662. These four QTNs were located in the 3’-UTR and within non-synonymous coding regions in two GmGSTT1 genes, and resulted in nucleotide changes (A/G, A/C, and C/T) resulting in an amino acid substitution in the first exon (Thr/Ile). We compared the contents of seven types of isoflavones between two haplotype groups: Hap1 with high isoflavone contents, and Hap2 with low isoflavone contents. Within these groups, the top 20% of Hap1 and bottom 20% of Hap2, the TI, GNE, DZI, MDZI, MGNI and ADZI of the 20% of Hap1 extremely increased up to 3.02-fold, 1.94-fold, 3.32-fold, 3.05-fold, 3.15-fold, 2.77-fold and 2.79-fold (p < 0.001) compared with the bottom 20% of Hap2, respectively. These data suggest that the four QTNs will be useful as informative molecular markers in breeding programs to increase the isoflavone content in soybean seeds.

Ionizing radiation induces mutations, including chromosomal alterations, SNPs, and insertions/deletions. Thus, it has been used extensively for crop improvement (Lee et al., 2014; Kim et al., 2019; Hwang et al., 2020; Ma et al., 2022). To confirm that the gene mutation detected in this study affected the transcript levels of GmGSTT1, we performed qRT-PCR analyses for DB-088, the mutant with the greatest increase in isoflavone content compared with the original cultivar DB (see Figure 6A). The genes IFS1 and IFS2 are known to be important determinants of isoflavone content, and were also up-regulated in DB-088 compared with those in DB. Previous studies have reported that non-synonymous SNPs in exons of IFS1 and IFS2 not only greatly reduced isoflavone contents in soybean seeds, but also affected responses to biotic and abiotic stresses (Cheng et al., 2008, 2010). In the present study, the expression profile of GmGSTT1 was similar to that of two IFS genes, suggesting that GmGSTT1 is also related to variations in isoflavone content. The four QTNs in GmGSTT1 were related to increased contents of seven types of isoflavones (up to 1.65-fold) (Figure 5D).

The NCBI database lists 133 soybean GST genes. Members of this family are well characterized detoxification enzymes that are involved in stress tolerance and nodulation in soybean roots. They are a ubiquitous class of enzymes that participate in the antioxidant system, thereby decreasing the accumulation of reactive oxygen species (Dalton et al., 2009; Dixon et al., 2010). Overexpression of GsGST genes has been shown to increase tolerance to abiotic stresses such as salt, drought, cold and heavy metals, and biotic stresses such as insect attack and virus infection (Takesawa et al., 2002; Ji et al., 2010). Members of the GST family also play roles in the accumulation of secondary metabolites such as anthocyanins and cinnamic acid. Some GSTs have been reported to contribute to the formation of other natural products such as the sulfur-containing phytochemicals in Brassicaceae species (Dixon et al., 2010; Czerniawski and Bednarek, 2018). In maize, the anthocyanin biosynthetic gene Bronze-2 is involved the accumulation of anthocyanins in the vacuole. During this process, anthocyanins conjugate with GST, and the anthocyanin-GSH complex is transported into the vacuole via a glutathione pump (Marrs et al., 1995). Activation of GST catalyzes the binding of GSH to ABC transporters, enabling ATP hydrolysis to direct translocation across the membrane. ATP-binding cassette (ABC) transporters are well known for role in sequestration of flavonoids glycosides, glucuronides and glutathione conjugates into the vacuole (Petrussa et al., 2013). As mentioned above, members of the GST family are involved in many aspects of growth and development during the whole life span of plants. Isoflavones are precursors of phytoalexins, and their accumulation is induced in response to hormones and external stimuli such as insect attack. Like anthocyanins, isoflavones accumulate in the vacuole. Thus, it is possible that GmGSTT1 conjugates with isoflavones for transport and storage, like it does with anthocyanins.

This is the first GWAS analysis associated with isoflavone content using a mutant soybean population with multiple GWAS-models across two years. We have identified many significant QTNs, and detected four stable QTNs in the gene GmGSTT1. Our findings provide evidence that GmGSTT1 affects isoflavone contents in soybean seeds. The results of this study include genetic information that will be useful for crop improvement, specifically for the production of soybean lines with stable and high isoflavone contents.

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary material.

Author contributions

S-JK, B-KH, and J-WA conceived and revised the manuscript. JMK designed the experiment and drafted the manuscript. JMK, YJL, SHE, JSS, and NNH phenotyped the isoflavone content. JIL and D-GK genotyped the GBS data. JMK, JIL, and D-GK analyzed the GWAS data. All authors contributed to the article and approved final manuscript.

Funding

This work was supported by the research program of Korea Atomic Energy Research Institute (Project No. 523320-22) and National Research Foundation of Korea (NRF) grant funded by the Korea government (RS-2022-00156231).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2022.968466/full#supplementary-material

References

  • 1

    AriasC. L.QuachT.HuynhT.NguyenH.MorettiA.ShiY.et al (2022). Expression of AtWRI1 and AtDGAT1 during soybean embryo development influences oil and carbohydrate metabolism.Plant Biotechnol. J.2013271345. 10.1111/pbi.13810

  • 2

    AzamM.ZhangS.AbdelghanyA. M.ShaibuA. S.FengY.LiY.et al (2020). Seed isoflavone profiling of 1168 soybean accessions from major growing ecoregions in China.Food Res. Intl.130:108957. 10.1016/j.foodres.2019.108957

  • 3

    BrummerE.GraefG.OrfJ.WilcoxJ.ShoemakerR. (1997). Mapping QTL for seed protein and oil content in eight soybean populations.Crop Sci.37370378. 10.2135/cropsci1997.0011183X003700020011x

  • 4

    CaiZ.ChengY.MaZ.LiuX.MaQ.XiaQ.et al (2018). Fine-mapping of QTLs for individual and total isoflavone content in soybean (Glycine max L.) using a high-density genetic map.Theo. Appl. Genet.131555568. 10.1007/s00122-017-3018-x

  • 5

    ChaudharyJ.AlishaA.BhattV.ChandanshiveS.KumarN.MirZ.et al (2019). Mutation breeding in tomato: Advances, applicability and challenges.Plants8:128. 10.3390/plants8050128

  • 6

    ChenM.-Y.HungS.-W.TsaiW.-H.LeeM.-C.ChangK.-C. (2020). A method for determination of twelve isoflavones in soybean by ultra-performance liquid chromatography.Adapt. Med.12410.

  • 7

    ChenQ.-S.ZhangZ.-C.LiuC.-Y.XinD.-W.QiuH.-M.ShanD.-P.et al (2007). QTL analysis of major agronomic traits in soybean.Agric. Sci. China6399405. 10.1016/S1671-2927(07)60062-5

  • 8

    ChengH.YangH.ZhangD.GaiJ.YuD. (2010). Polymorphisms of soybean isoflavone synthase and flavanone 3-hydroxylase genes are associated with soybean mosaic virus resistance.Mol. Breed.251324. 10.1007/s11032-009-9305-8

  • 9

    ChengH.YuO.YuD. (2008). Polymorphisms of IFS1 and IFS2 gene are associated with isoflavone concentrations in soybean seeds.Plant Sci.175505512. 10.1016/j.plantsci.2008.05.020

  • 10

    ChuS.WangJ.ZhuY.LiuS.ZhouX.ZhangH.et al (2017). An R2R3-type MYB transcription factor. GmMYB29, regulates isoflavone biosynthesis in soybean.PLoS Genet.13:e1006770. 10.1371/journal.pgen.1006770

  • 11

    CollisonM. W. (2008). Determination of total soy isoflavones in dietary supplements, supplement ingredients, and soy foods by high-performance liquid chromatography with ultraviolet detection: Collaborative study.J. AOAC Int.91489500. 10.1093/jaoac/91.3.489

  • 12

    ConcibidoV. C.DiersB. W.ArelliP. R. (2004). A decade of QTL mapping for cyst nematode resistance in soybean.Crop Sci.4411211131. 10.2135/cropsci2004.1121

  • 13

    CuiY.ZhangF.ZhouY. (2018). The application of multi-locus GWAS for the detection of salt-tolerance loci in rice.Front. Plant Sci.9:1464. 10.3389/fpls.2018.01464

  • 14

    CzerniawskiP.BednarekP. (2018). Glutathione S-transferases in the biosynthesis of sulfur-containing secondary metabolites in Brassicaceae plants.Front. Plant Sci.9:1639. 10.3389/fpls.2018.01639

  • 15

    DaltonD. A.BonifaceC.TurnerZ.LindahlA.KimH. J.JelinekL.et al (2009). Physiological roles of glutathione S-transferases in soybean root nodules.Plant Physiol.150521530. 10.1104/pp.109.136630

  • 16

    DastmalchiM.DhaubhadelS. (2014). “Soybean seed isoflavonoids: Biosynthesis and regulation,” in Phytochemicals–Biosynthesis, function and application, (Berlin: Springer), 121. 10.1007/978-3-319-04045-5_1

  • 17

    DixonD. P.SkipseyM.EdwardsR. (2010). Roles for glutathione transferases in plant secondary metabolism.Phytochemistry71338350. 10.1016/j.phytochem.2009.12.012

  • 18

    EldridgeA. C.KwolekW. F. (1983). Soybean isoflavones: Effect of environment and variety on composition.J. Agric. Food Chem.31394396. 10.1021/jf00116a052

  • 19

    ElshireR. J.GlaubitzJ. C.SunQ.PolandJ. A.KawamotoK.BucklerE. S.et al (2011). A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species.PLoS One6:e19379. 10.1371/journal.pone.0019379

  • 20

    FanS.LiB.YuF.HanF.YanS.WangL.et al (2015). Analysis of additive and epistatic quantitative trait loci underlying fatty acid concentrations in soybean seeds across multiple environments.Euphytica206689700. 10.1007/s10681-015-1491-3

  • 21

    FarinaH. G.PomiesM.AlonsoD. F.GomezD. E. (2006). Antitumor and antiangiogenic activity of soy isoflavone genistein in mouse models of melanoma and breast cancer.Oncol. Rep.16885891. 10.3892/or.16.4.885

  • 22

    GrahamT.KimJ.GrahamM. (1990). Role of constitutive isoflavone conjugates in the accumulation of glyceollin in soybean infected with Phytophthora megasperma.Mol. Plant Microbe. Interact.3157166. 10.1094/MPMI-3-157

  • 23

    GuanM.HuangX.XiaoZ.JiaL.WangS.ZhuM.et al (2019). Association mapping analysis of fatty acid content in different ecotypic rapeseed using mrMLM.Front. Plant Sci.9:1872. 10.3389/fpls.2018.01872

  • 24

    GuoB.DaiY.ChenL.PanZ.SongL. (2021). Genome-wide analysis of the soybean root transcriptome reveals the impact of nitrate on alternative splicing.G3 (Bethesda)11:jkab162. 10.1093/g3journal/jkab162

  • 25

    Gutierrez-GonzalezJ. J.VuongT. D.ZhongR.YuO.LeeJ.-D.ShannonG.et al (2011). Major locus and other novel additive and epistatic loci involved in modulation of isoflavone concentration in soybean seeds.Theo. Appl. Genet.12313751385. 10.1007/s00122-011-1673-x

  • 26

    Gutierrez-GonzalezJ. J.WuX.ZhangJ.LeeJ.-D.EllersieckM.ShannonJ. G.et al (2009). Genetic control of soybean seed isoflavone content: Importance of statistical model and epistasis in complex traits.Theo. Appl. Genet.11910691083. 10.1007/s00122-009-1109-z

  • 27

    HeJ.ZhaoX.LarocheA.LuZ.-X.LiuH.LiZ. (2014). Genotyping-by-sequencing (GBS), an ultimate marker-assisted selection (MAS) tool to accelerate plant breeding.Front. Plant Sci.5:484. 10.3389/fpls.2014.00484

  • 28

    HoeckJ. A.FehrW. R.MurphyP. A.WelkeG. A. (2000). Influence of genotype and environment on isoflavone contents of soybean.Crop Sci.404851. 10.2135/cropsci2000.40148x

  • 29

    HwangS.-G.LeeS. C.LeeJ.LeeJ. W.KimJ.-H.ChoiS. Y.et al (2020). Resequencing of a core rice mutant population induced by gamma-ray irradiation and its application in a genome-wide association study.J. Plant Biol.63463472. 10.1007/s12374-020-09266-2

  • 30

    IbrahimA. K.ZhangL.NiyitangaS.AfzalM. Z.XuY.ZhangL.et al (2020). Principles and approaches of association mapping in plant breeding.Trop. Plant Biol.13212224. 10.1007/s12042-020-09261-4

  • 31

    JahanM. (2019). Genetic regulation of the elicitation of glyceollin biosynthesis in soybean. [dissertation/doctral thesis].West Virginia: West Virginia University.

  • 32

    JiW.ZhuY.LiY.YangL.ZhaoX.CaiH.et al (2010). Over-expression of a glutathione S-transferase gene, GsGST, from wild soybean (Glycine soja) enhances drought and salt tolerance in transgenic tobacco.Biotechnol. Lett.3211731179. 10.1007/s10529-010-0269-x

  • 33

    JianB.LiuB.BiY.HouW.WuC.HanT. (2008). Validation of internal control for gene expression study in soybean by quantitative real-time PCR.BMC Mol. Biol.9:59. 10.1186/1471-2199-9-59

  • 34

    JiangS.-Y.RamachandranS. (2010). Natural and artificial mutants as valuable resources for functional genomics and molecular breeding.Int. J. Biol. Sci.6:228. 10.7150/ijbs.6.228

  • 35

    JohnsonR. E.WashingtonM. T.PrakashS.PrakashL. (2000). Fidelity of human DNA polymerase η.J. Biol. Chem.27574477450. 10.1074/jbc.275.11.7447

  • 36

    JoshiR.PrashatR.SharmaP. C.Singla-PareekS. L.PareekA. (2016). Physiological characterization of gamma-ray induced mutant population of rice to facilitate biomass and yield improvement under salinity stress.Indian J. Plant Physiol.21545555. 10.1007/s40502-016-0264-x

  • 37

    KangH.AnsbacherR.HammoudM. (2002). Use of alternative and complementary medicine in menopause.Int. J. Gynecol. Obstetr79195207. 10.1016/S0020-7292(02)00297-7

  • 38

    KimD.-G. (2021). Studies on the construction of Mutant Diversity Pool (MDP) Lines, and their genomic and metabolic characterization in soybean. [dissertation/doctral thesis].Suncheon: Sunchon National University.

  • 39

    KimD.-G.LyuJ. I.LeeM.-K.KimJ. M.HungN. N.HongM. J.et al (2020). Construction of soybean mutant diversity pool (MDP) lines and an analysis of their genetic relationships and associations using TRAP markers.Agronomy10:253. 10.3390/agronomy10020253

  • 40

    KimD.-G.LyuJ.-I.LimY.-J.KimJ.-M.HungN.-N.EomS.-H.et al (2021). Differential gene expression associated with altered isoflavone and fatty acid contents in soybean mutant diversity pool.Plants10:1037. 10.3390/plants10061037

  • 41

    KimS. H.RyuJ.KimW. J.KangR.SeoE.KimG.et al (2019). Identification of a new GmSACPD-C allele in high stearic acid mutant Hfa180 derived from gamma-ray irradiation.Mol. Breed.39112. 10.1007/s11032-019-0928-0

  • 42

    KniziaD.YuanJ.BellalouiN.VuongT.UsovskyM.SongQ.et al (2021). The soybean high density ‘Forrest’by ‘Williams 82’SNP-based genetic linkage map identifies QTL and candidate genes for seed isoflavone content.Plants10:2029. 10.3390/plants10102029

  • 43

    KorteA.FarlowA. (2013). The advantages and limitations of trait analysis with GWAS: A review.Plant Methods919. 10.1186/1746-4811-9-29

  • 44

    KudouS.FleuryY.WeltiD.MagnolatoD.UchidaT.KitamuraK.et al (1991). Malonyl isoflavone glycosides in soybean seeds (Glycine max Merrill).Agric. Biol. Chem.5522272233. 10.1271/bbb1961.55.2227

  • 45

    LeD. T.AldrichD. L.ValliyodanB.WatanabeY.HaC. V.NishiyamaR.et al (2012). Evaluation of candidate reference genes for normalization of quantitative RT-PCR in soybean tissues under various abiotic stress conditions.PLoS One7:e46487. 10.1371/journal.pone.0046487

  • 46

    LeeC. H.YangL.XuJ. Z.YeungS. Y. V.HuangY.ChenZ.-Y. (2005). Relative antioxidant activity of soybean isoflavones and their glycosides.Food Chem.90735741. 10.1016/j.foodchem.2004.04.034

  • 47

    LeeG.BoermaH.VillagarciaM.ZhouX.CarterT.LiZ.et al (2004). A major QTL conditioning salt tolerance in S-100 soybean and descendent cultivars.Theo. Appl. Genet.10916101619. 10.1007/s00122-004-1783-9

  • 48

    LeeK. J.HwangJ. E.VelusamyV.HaB.-K.KimJ.-B.KimS. H.et al (2014). Selection and molecular characterization of a lipoxygenase-free soybean mutant line induced by gamma irradiation.Theo. Appl. Genet.12724052413. 10.1007/s00122-014-2385-9

  • 49

    LiB.FanS.YuF.ChenY.ZhangS.HanF.et al (2017). High-resolution mapping of QTL for fatty acid composition in soybean using specific-locus amplified fragment sequencing.Theo. Appl. Genet.13014671479. 10.1007/s00122-017-2902-8

  • 50

    LiQ.-G.ZhangL.LiC.DunwellJ. M.ZhangY.-M. (2013). Comparative genomics suggests that an ancestral polyploidy event leads to enhanced root nodule symbiosis in the Papilionoideae.Mol. Biol. Evol.3026022611. 10.1093/molbev/mst152

  • 51

    LiangH.-Z.YuY.-L.WangS.-F.YunL.WangT.-F.WeiY.-L.et al (2010). QTL mapping of isoflavone, oil and protein contents in soybean (Glycine max L. Merr.).Agric. Sci. China911081116. 10.1016/S1671-2927(09)60197-8

  • 52

    LimY. J.LyuJ. I.KwonS.-J.EomS. H. (2021). Effects of UV-A radiation on organ-specific accumulation and gene expression of isoflavones and flavonols in soybean sprout.Food Chem.339:128080. 10.1016/j.foodchem.2020.128080

  • 53

    LobatoL. P.Iakmiu Camargo PereiraA. E.LazarettiM. M.BarbosaD. S.CarreiraC. M.MandarinoJ. M. G.et al (2012). Snack bars with high soy protein and isoflavone content for use in diets to control dyslipidaemia.Int. J. Food Sci. Nutr.634958. 10.3109/09637486.2011.596148

  • 54

    LozovayaV. V.LyginA. V.UlanovA. V.NelsonR. L.DaydéJ.WidholmJ. M. (2005). Effect of temperature and soil moisture status during seed development on soybean seed isoflavone concentration and composition.Crop Sci.4519341940. 10.2135/cropsci2004.0567

  • 55

    MaC.RehmanA.LiH. G.ZhaoZ. B.SunG.DuX. M. (2022). Mapping of dwarfing QTL of Ari1327, a semi-dwarf mutant of upland cotton.BMC Plant Biol.22:5. 10.1186/s12870-021-03359-x

  • 56

    MaD.-F.QinL.-Q.WangP.-Y.KatohR. (2008). Soy isoflavone intake increases bone mineral density in the spine of menopausal women: Meta-analysis of randomized controlled trials.Clin. Nutr.275764. 10.1016/j.clnu.2007.10.012

  • 57

    MalekM.RafiiM. Y.AfrozS. S.NathU. K.MondalM. (2014). Morphological characterization and assessment of genetic variability, character association, and divergence in soybean mutants.Sci. World J.2014:968796. 10.1155/2014/968796

  • 58

    MarrsK. A.AlfenitoM. R.LloydA. M.WalbotV. (1995). A glutathione S-transferase involved in vacuolar transfer encoded by the maize gene Bronze-2.Nature375397400. 10.1038/375397a0

  • 59

    MengS.HeJ.ZhaoT.XingG.LiY.YangS.et al (2016). Detecting the QTL-allele system of seed isoflavone content in Chinese soybean landrace population for optimal cross design and gene system exploration.Theo. Appl. Genet.12915571576. 10.1007/s00122-016-2724-0

  • 60

    MessinaM. J. (1999). Legumes and soybeans: Overview of their nutritional profiles and health effects.Am. J. Clin. Nutr.70439s450s. 10.1093/ajcn/70.3.439s

  • 61

    NawazM. A.RehmanH. M.BalochF. S.IjazB.AliM. A.KhanI. A.et al (2017). Genome and transcriptome-wide analyses of cellulose synthase gene superfamily in soybean.J. Plant Physiol.215163175. 10.1016/j.jplph.2017.04.009

  • 62

    PeiR.ZhangJ.TianL.ZhangS.HanF.YanS.et al (2018). Identification of novel QTL associated with soybean isoflavone content.Crop J.6244252. 10.1016/j.cj.2017.10.004

  • 63

    PetrussaE.BraidotE.ZancaniM.PeressonC.BertoliniA.PatuiS.et al (2013). Plant flavonoids—biosynthesis, transport and involvement in stress responses.Int. J. Mol. Sci.141495014973. 10.3390/ijms140714950

  • 64

    PrimomoV. S.PoysaV.AblettG. R.JacksonC.-J.GijzenM.RajcanI. (2005). Mapping QTL for individual and total isoflavone content in soybean seeds.Crop Sci.4524542464. 10.2135/cropsci2004.0672

  • 65

    RaoX.HuangX.ZhouZ.LinX. (2013). An improvement of the 2 (–delta delta CT) method for quantitative real-time polymerase chain reaction data analysis.Biostat. Bioinform. Biomath.3:71.

  • 66

    RasheedA.HaoY.XiaX.KhanA.XuY.VarshneyR. K.et al (2017). Crop breeding chips and genotyping platforms: Progress, challenges, and perspectives.Mol. Plant1010471064. 10.1016/j.molp.2017.06.008

  • 67

    RupasingheH. V.JacksonC.-J. C.PoysaV.Di BerardoC.BewleyJ. D.JenkinsonJ. (2003). Soyasapogenol A and B distribution in soybean (Glycine max L. Merr.) in relation to seed physiology, genetic variability, and growing location.J. Agric. Food Chem.5158885894. 10.1021/jf0343736

  • 68

    ShawE. J.KakudaY.RajcanI. (2016). Effect of genotype, environment, and genotype× environment interaction on tocopherol accumulation in soybean seed.Crop Sci.564050. 10.2135/cropsci2015.02.0069

  • 69

    SreevidyaV.Srinivasa RaoC.SulliaS.LadhaJ. K.ReddyP. M. (2006). Metabolic engineering of rice with soybean isoflavone synthase for promoting nodulation gene expression in rhizobia.J. Exp. Bot.5719571969. 10.1093/jxb/erj143

  • 70

    TakesawaT.ItoM.KanzakiH.KameyaN.NakamuraI. (2002). Over-expression of ζ glutathione S-transferase in transgenic rice enhances germination and growth at low temperature.Mol. Breed.993101. 10.1023/A:1026718308155

  • 71

    ThomsonM. J. (2014). High-throughput SNP genotyping to accelerate crop improvement.Plant Breed. Biotechnol.2195212. 10.9787/PBB.2014.2.3.195

  • 72

    TsukamotoC.ShimadaS.IgitaK.KudouS.KokubunM.OkuboK.et al (1995). Factors affecting isoflavone content in soybean seeds: Changes in isoflavones, saponins, and composition of fatty acids at different temperatures during seed development.J. Agric. Food Chem.4311841192. 10.1021/jf00053a012

  • 73

    WarringtonC. V.Abdel-HaleemH.HytenD.CreganP.OrfJ.KillamA.et al (2015). QTL for seed protein and amino acids in the Benning× Danbaekkong soybean population.Theo. Appl. Genet.128839850. 10.1007/s00122-015-2474-4

  • 74

    WuD.LiD.ZhaoX.ZhanY.TengW.QiuL.et al (2020). Identification of a candidate gene associated with isoflavone content in soybean seeds using genome-wide association and linkage mapping.Plant J.104950963. 10.1111/tpj.14972

  • 75

    XuY.LiP.YangZ.XuC. (2017). Genetic mapping of quantitative trait loci in crops.Crop J.5175184. 10.1016/j.cj.2016.06.003

  • 76

    YiJ.DerynckM. R.LiX.TelmerP.MarsolaisF.DhaubhadelS. (2010). A single-repeat MYB transcription factor, GmMYB176, regulates CHS8 gene expression and affects isoflavonoid biosynthesis in soybean.Plant J.6210191034. 10.1111/j.1365-313X.2010.04214.x

  • 77

    YoonH.YiJ.DestaK.tShinM.-J.LeeY.LeeS.et al (2021). Yearly variation of isoflavone composition and yield-related traits of 35 Korean soybean germplasm.Korean J. Breed. Sci.53411423. 10.9787/KJBS.2021.53.4.411

  • 78

    YukH. J.LeeJ. H.Curtis-LongM. J.LeeJ. W.KimY. S.RyuH. W.et al (2011). The most abundant polyphenol of soy leaves, coumestrol, displays potent α-glucosidase inhibitory activity.Food Chem.12610571063. 10.1016/j.foodchem.2010.11.125

  • 79

    ZengG.LiD.HanY.TengW.WangJ.QiuL.et al (2009). Identification of QTL underlying isoflavone contents in soybean seeds among multiple environments.Theo. Appl. Genet.11814551463. 10.1007/s00122-009-0994-5

  • 80

    ZhangL.WangS.LiH.DengQ.ZhengA.LiS.et al (2010). Effects of missing marker and segregation distortion on QTL mapping in F2 populations.Theo. Appl. Genet.12110711082. 10.1007/s00122-010-1372-z

  • 81

    ZhangY.-W.TambaC. L.sWenY.-J.LiP.RenW.-L.NiY.-L.et al (2020). mrMLM v4. 0.2: An R platform for multi-locus genome-wide association studies.Genom. Proteom. Bioinform.18481487. 10.1016/j.gpb.2020.06.006

Summary

Keywords

Isoflavone, SNPs, QTLs, GBS, GWAS, MDP

Citation

Kim JM, Lyu JI, Kim D-G, Hung NN, Seo JS, Ahn J-W, Lim YJ, Eom SH, Ha B-K and Kwon S-J (2022) Genome wide association study to detect genetic regions related to isoflavone content in a mutant soybean population derived from radiation breeding. Front. Plant Sci. 13:968466. doi: 10.3389/fpls.2022.968466

Received

14 June 2022

Accepted

29 July 2022

Published

18 August 2022

Volume

13 - 2022

Edited by

Antonio Costa De Oliveira, Federal University of Pelotas, Brazil

Reviewed by

Yuan-Ming Zhang, Huazhong Agricultural University, China; Ripa Akter Sharmin, Nanjing Agricultural University, China; Sungwoo Lee, Chungnam National University, South Korea

Updates

Copyright

*Correspondence: Soon-Jae Kwon, Bo-Keun Ha,

This article was submitted to Plant Breeding, a section of the journal Frontiers in Plant Science

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics