ORIGINAL RESEARCH article

Front. Plant Sci., 17 April 2025

Sec. Functional and Applied Plant Genomics

Volume 16 - 2025 | https://doi.org/10.3389/fpls.2025.1572319

Identification of novel candidate genes for regulating oil composition in soybean seeds under environmental stresses

  • 1School of Plant and Environmental Sciences, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
  • 2Translational Plant Science Center, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
  • 3Fralin Life Science Institute, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States

Introduction: A key objective of soybean breeding programs is to enhance nutritional quality for human and animal consumption, with improved fatty acid (FA) composition for health benefits, and expand soybean use for industrial applications.

Methods: We conducted a metabolite genome-wide association study (mGWAS) to identify genomic regions associated with changes in FA composition and FA ratios in soybean seeds influenced by environmental factors. This mGWAS utilized 218 soybean plant introductions (PIs) grown in two field locations in Virginia over two years.

Results: The mGWAS revealed that 20 SNPs were significantly associated with 21 FA ratios, while additional suggestive SNPs were found for 36 FA ratios, highlighting potential quantitative trait loci linked to FA composition.

Discussion: Many of these SNPs are located near or within the genes related to phytohormone-mediated biotic and abiotic stress responses, suggesting the involvement of environmental factors in modulating FA composition in soybean seeds. Our findings provide novel insights into the genetic and environmental factors influencing FA composition in oilseeds. This research also lays the foundation for developing stable markers to develop soybean cultivars with tailored FA profiles for different practical applications under variable growth conditions.

1 Introduction

Since its domestication in East Asia 6,000-9,000 years ago, genetic diversity in soybeans has decreased by approximately 50%, with an 81% loss of rare alleles (Sedivy et al., 2017). A similar reduction in nucleotide diversity was observed for genes involved with fatty acid (FA) metabolism, which can be attributed to desirable trait selection and retention, genetic bottlenecks, and/or genetic drift (Derbyshire et al., 2023). Now, soybean oil typically contains approximately 10% palmitic acid (PA), 20% oleic acid (OA), 55% linoleic acid (LA), and 10% alpha-linolenic acid (ALA) representing the major FAs (Song et al., 2023). With its low saturated FAs (SFAs) content and elevated levels of monounsaturated (MUFAs) and polyunsaturated FAs (PUFAs), soybean oil is often considered as health promoting in humans and animals (Martins et al., 2023; Natnan et al., 2023; Tutunchi et al., 2020). Due to the health-promoting properties of OA and because less saturation in FAs is desirable for oil stability, breeding programs have focused on creating soybean cultivars with seed oil enriched in OA (Baer et al., 2021; Huth et al., 2015; Sakurai et al., 2003), contributing to the loss of genetic diversity in soybean cultivars.

However, high-OA soybean oil may not be suitable for other applications as it typically has lower levels of LA and ALA. For example, these two FAs are precursors for high-value ω-3 FAs [eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA)] that are essential for human and animal diets and health (Banaszak et al., 2024; Li et al., 2021; Swanson et al., 2012). The major EPA and DHA dietary sources are algae and fish oil. While fish acquire these FAs predominantly from their diets, some algae possess several FA desaturases and elongases capable of EPA and DHA biosynthesis from LA and ALA (Mühlroth et al., 2013). Plants, on the other hand, do not produce detectable levels of these very-long chain PUFAs, but they can be engineered to biosynthesize them (Petrie et al., 2012; Ruiz-Lopez et al., 2015; Usher et al., 2017; Walsh et al., 2016). Identifying soybean PIs with high LA and ALA content or specific FA ratios provides the basis for future soybean breeding or engineering for diverse specific practical applications.

Although soybean cultivars have been selected based on specific FA content, FA composition of soybean oil can vary in response to environment. Soybean seed FA composition is influenced by heat and drought stress as well as insect feeding (Bellaloui et al., 2015; Kanobe et al., 2015; Quoc Thien et al., 2016; Sehgal et al., 2018). Interestingly, these types of abiotic and biotic stresses have synergistic effects on FA composition in membrane lipids and seed oil as they reduce PUFAs like LA and ALA while increasing PA and OA not only in soybean (Schmidt et al., 2015), but also in other plant species (Mousavi et al., 2022; Wang et al., 2022a). These changes in FA composition are essential for maintaining membrane fluidity and function under drought and heat (Henschel et al., 2024; Higashi and Saito, 2019; Sharma et al., 2023). In the case of insect feeding, they are a result of an insect defense mechanism to block jasmonic acid (JA) signaling by limiting the availability of ALA for JA synthesis (Kanobe et al., 2015; Thompson and Goggin, 2006). Considering that JA is derived from ALA (Sharma et al., 2023), there is also a direct metabolic relationship between JA and FA composition in addition to regulatory interactions. JA and salicylic acid (SA) are important components of molecular defense mechanisms against herbivory (Mostafa et al., 2022). Application of JA and SA to soybean leaves resulted in changes in FA composition (Ghassemi-Golezani and Farhangi-Abriz, 2018), suggesting their involvement in an overall regulation of FA composition during herbivory and plant diseases.

In addition to JA, other phytohormones are also involved in modulating FA composition in plants (He and Ding, 2020; Shi et al., 2023). As a response to the low abscisic acid (ABA) levels in soybean seeds engineered to produce β-carotene (a precursor for ABA synthesis), changes in the expression of several genes encoding ABA-responsive transcription factors and FA desaturases resulted in an elevated OA/LA ratio while the levels of other FAs remained unchanged (Schmidt et al., 2015). External ABA application to Siberian apricot seeds resulted in an up-regulation of ABA-responsive genes and an increase in the OA/PA ratio (Huo et al., 2020). Increased levels of ABA and auxin in a protein kinase mutant resulted in increased OA levels, while the levels of PUFAs were decreased in B. napus seeds. The decreased sensitivity of this protein kinase mutant to ABA and glucose is suggestive of interactions between ABA and sugar signaling (Wang et al., 2022a). Auxin is also involved in modulating FA composition by regulating expression of FA desaturase genes in soybean as part of signaling and responses to heat and drought stress (He and Ding, 2020; Ozga et al., 2016; Ram Kumar et al., 2021). Foliar applications of auxin resulted in an increase in PA and OA levels in safflower plants under drought stress, a common response to lower membrane fluidity as in heat stress (Mousavi et al., 2022).

Here, we explored the genetic landscape of diverse soybean PIs for different FA compositions in the context of environmental stresses. Specifically, we conducted mGWAS to identify genomic regions linked to variations in FA compositions in 218 soybean PIs grown under field environmental conditions. Our focus was on the FA compositions and ratios relevant to both human and animal nutrition and compositional changes in membrane lipids and oil during abiotic and biotic stresses.

2 Materials and methods

2.1 Plant materials and environmental data

A total of 218 soybean PIs were selected from a diverse germplasm collection (Singer et al., 2022) and grown in Blacksburg and Warsaw in Virginia in 2020 and 2021, using a randomized complete block design. Each location had two similar plots, with each PI grown in both plots, allowing for two replicate measurements for each location and year. Collectively, eight seed samples were taken from each PI line. At each location, plants were grown in two 3-m, two-row plots with a 76-cm row spacing. Commercial cultivars Ellis and AG4403 were included at both locations as a reference for seed FA composition. Average daily temperature and precipitation data for the soybean growing seasons in Blacksburg and Warsaw in 2020 and 2021 were obtained from the National Oceanic and Atmospheric Administration’s (NOAA) National Centers for Environmental Information database (Menne et al., 2012). Least squares means were calculated, and Tukey’s HSD test was used to compare means at α = 0.05. Analysis of variance (ANOVA) was performed on monthly temperatures using a least squares model in JMP (JMP Pro, version 17.2.0, SAS Institute Inc.) with year, location, and year · location interaction as fixed effects.

2.2 FA and statistical analyses

A total of 1,744 seed samples were ground into a homogenous powder using a water-cooled 1095 KnifetechTM sample mill (Foss Analytical, MN, USA). Oil was extracted and FA levels were analyzed as described (Collakova et al., 2013). Briefly, 5 mg of dry powder for each seed sample was extracted using the Bligh and Dyer biphasic extraction method (Bligh and Dyer, 1959). Heptadecanoic acid was used as an internal standard to correct for % recovery for quantification purposes. FA methyl esters were prepared by acidic transesterification and analyzed on an Agilent 7890A gas chromatograph equipped with a flame ionization detector (GC-FID) as described (Collakova et al., 2013). Replicate seed samples from selected PIs and Ellis were also analyzed with GC coupled to an Agilent 5975C series single quadrupole mass spectrometer to confirm the identity of FAs using the NIST14 spectral library (National Institute of Standards and Technology, NIST).

Composition and ratios of different FAs were calculated using FA content data for each seed sample (n = 8 per PI line, n = 2 per PI line per each location and year combination). FA composition was expressed in two ways for each analyzed seed sample: (i) as a percentage of seed dry weight, reflecting seed FA composition with a sum of % of total FA as a proxy for the % seed oil and (ii) as a percentage of total FAs, approximating FA composition in oil. Then, Tukey’s HSD test was used to compare means of FA composition and FA ratios across all 218 PIs at α = 0.05. ANOVA and Principal Component Analysis (PCA) were performed in JMP (JMP Pro, version 17.2.0, SAS Institute Inc.) using FA composition in a least squares model with genotype, year, location, and year · location interaction as fixed effects and FA levels for each sample, respectively.

2.3 Genotypic data

Gene model and gene annotation information was obtained for the PIs used in this study from the SoyBase database (Brown et al., 2020). The genotype matrix was constructed using 42,291 SNPs from the SoySNP50K Illumina Infinium BeadChip (Song et al., 2013). Genotype data preprocessing was performed using base R (version 4.3.2) and RStudio (version 2023.9.1.494). SNP alleles were converted from alphabetic to numerical representations using a custom SNPtoNum() function, which encodes genotypes as {0, 1, 2} based on homozygous and heterozygous states. Monomorphic SNPs were identified using the sapply() function to check for columns with only a single unique value. Allele frequency (p) was calculated using colMeans() function, and minor allele frequency was determined using the pmin() function. After filtering out monomorphic SNPs and SNPs with a minor allele frequency of 0.05, 36,037 SNPs were retained for further analysis.

After filtering, the genomic relationship matrix was constructed from the resulting genotype matrix containing encoded SNP information. The genotype matrix (W) was centered using the scale() function from base R. The genomic relationship matrix (G) was derived following VanRaden’s method (VanRaden, 2008) after computing the cross-product of the centered genotype matrix (WWt) using the formula G = WWt/(2·sum(p(1-p))), where p is the frequency of the reference allele. Finally, the genomic relationship matrix was normalized to ensure proper scaling of the genetic variance component by dividing all elements by the mean of the diagonal elements. This procedure produces a genomic relationship matrix where the average diagonal element equals 1, scaling the genetic variance component for downstream analyses.

Best Linear Unbiased Estimates (BLUEs) for phenotypic traits were calculated using a weighted two-stage approach, which is common in multi-environment trials (Endelman, 2023; Verbyla, 2023; Möhring and Piepho, 2009). Briefly, trait values were modeled separately for each environment (year-location combination) using the model: trait ~ genotype + replicate. For each environment-specific model, reliability weights were calculated based on the residual variance and the number of replicates per genotype. BLUEs from individual environments were then combined using a weighted average approach, with weights proportional to their reliability. This approach accounts for potential heterogeneity in variance across environments while properly capturing genotype-by-environment interactions.

Narrow-sense heritability (h2) was estimated by using a bootstrap approach with 1,000 resamples as described (Piepho and Mohring, 2007; Guidalevich et al., 2025). Mixed models were fitted with the genomic relationship matrix using restricted maximum likelihood (REML) within the mixed.solve() function from the GWAS package in R (RStudio, version 2023.9.1.494), calculating heritability as the ratio of additive genetic variance (VA) to total phenotypic variance (VP) where VP = VA + Ve and Ve represents the residual variance not attributable to additive genetic effects. The formula used was h2=VA/(VP). Standard error (SE) was calculated as the standard deviation of bootstrap-generated h2 values.

2.4 Metabolite genome-wide association study

mGWAS was performed using 44 different combinations of FA compositions and ratios. Compositions and ratios of different FAs were expressed as % of seed or % of oil using FA content data for each PI line. SNPs were then analyzed for the association with FA compositions or ratios using the GWAS() function from the rrBLUP package in R (RStudio, version 2023.9.1.494). Year, location, and block number were included as fixed effects to account for the variance attributed to environmental effects. A kinship matrix was included to account for relatedness between individuals and to reduce false positive associations due to population structure. Quantile-quantile (QQ) plots were generated for each trait by plotting the observed -log10(p) values against the expected -log10(p) values from a uniform distribution. The genomic inflation factor (λ) was calculated as the median of the χ2 distribution of observed p-values divided by the expected median χ2 value (Stich and Melchinger, 2009; Kaler et al., 2019).

To visualize the SNP p-values for each ratio, Manhattan plots were generated using the manhattan() function from the qqman() package (version 0.1.9) in R. The effective number of markers for each chromosome was calculated using the meff() function from the poolr package (version 1.1-1) (Li and Ji, 2005). Effective markers by chromosome were calculated to account for chromosome-specific linkage disequilibrium patterns between markers, then summed to obtain the total number of effective markers. This approach provides an accurate estimate of the number of independent tests, avoiding overly conservative multiple testing correction that would occur if treating all markers as independent. The sum of effective markers was used to determine a significance threshold for selecting significant SNPs with 95% confidence as described (Singer et al., 2022). Briefly, the significance threshold was determined using the formula: threshold = 1 - (1 - α)^(1/Meff), where Meff represents the total number of effective markers (5,067) and α is the desired significance level (0.05 in this case).The resulting significance threshold was found to be approximately 1.01 · 10-5. A suggestive threshold was also set using 75% confidence (α = 0.25) and was determined to be approximately 5.68 · 10-5.

SNPs with p-values above the suggestive threshold were further analyzed to extract effect sizes (β) and SEs. For each SNP, a mixed linear model that mirrored the model set by the GWAS() function was fitted. β coefficients were estimated using mixed model solving using the mixed.solve() function, with the coefficient representing the change in the trait value associated with one copy of the alternative allele. SEs were calculated to assess the precision of these effect estimates. To reduce potential redundancy and identify independent significant loci, a post-GWAS clumping procedure was performed using a custom R function. SNPs were clustered within a 250-kb window and grouped based on linkage disequilibrium (r2 threshold of 0.5) (Misra et al., 2017). For each chromosome, lead SNPs and their correlated neighboring SNPs were identified, keeping the most significant SNP as the representative for each linkage group. This approach helps to minimize the reporting of highly correlated SNPs and provides a more concise representation of the genetic associations with FA compositions. Finally, candidate genes were identified from the gene models for Williams 82 (Glyma.Wm82.a2.v1 genome) available in the SoyBase database (Brown et al., 2020) within 10-kb flanking regions of the significant or suggestive SNPs. 10-kb regions are commonly used for identifying genes in soybean GWAS (Ayalew et al., 2022; Ravelombola et al., 2021; Steketee et al., 2020).

3 Results

3.1 Environmental differences between locations and years

Heat and drought are known to affect FA composition in oilseeds (Bellaloui et al., 2015; Hernández et al., 2019; Narayanan et al., 2020; Sehgal et al., 2018). To evaluate potential differences in environmental conditions between the two locations and years that could affect seed FA composition, average temperatures and rainfall from the NOAA National Centers for Environmental Information database (Menne et al., 2012) were compared for the two locations and years during the growing season. Average season-long temperatures followed similar trends in both locations. However, Warsaw is a warmer location than Blacksburg, and the average monthly temperatures were higher there than in Blacksburg regardless of the year (Figure 1A). These temperature differences were statistically significant (Tukey’s HSD test, α < 0.05) for most months (Supplementary Table S1A). Average daily temperatures were the highest in Warsaw in July 2020 when soybeans flower and initiate seed development (Supplementary Table S1B). Year 2020 was the rainiest for both locations (Supplementary Table S1A). As expected, rainfall did not follow any specific trend during both growing seasons (Figure 1B), and there were variations in cumulative precipitation by month across years and locations with the highest cumulative monthly precipitation seen in Warsaw in August 2020 (Supplementary Table S1A) due to several rainy days (Supplementary Table S1B). There was a drought period in Warsaw from May to July in 2020 and in Blacksburg from May to June in 2021 (Supplementary Table S1B). Overall, the Warsaw 2020 growing season was the most different compared to other seasons during the seed development period. Collectively, these environmental variations happening during the reproductive stages of soybean seed development created environments that could impact FA composition in soybean seeds.

Figure 1
www.frontiersin.org

Figure 1. Environmental data for the soybean growing seasons in Virginia. (A) Average monthly temperatures and (B) cumulative rainfall per month are shown in the context of key stages in soybean growth and development. In both 2020 and 2021, soybean seeds were planted in Blacksburg (B) and Warsaw (W) in May and mature pods harvested in October. Approximate time ranges for different soybean developmental stages are indicated by double arrows. Environmental data for this period was collected from NOAA’s National Center for Environmental Information (Menne et al., 2012) for both locations and years. Images of different parts of a soybean plant were licensed from BioRender.

3.2 Effects of genotype, location and year on variance in soybean FA composition

To obtain statistically significant results, GWAS requires sufficient differences in phenotypic data for correlating SNPs with specific phenotypes, which makes evaluating the variance in FA composition data essential. Considering the low genetic diversity in soybeans, we first investigated if different environmental conditions affected FA composition in diverse genetic backgrounds of soybean PIs in a global way. Averaging composition of the individual FAs from 218 PIs revealed that seed FA composition and ratios were similar regardless of the location, year, and the method used for expressing FA composition, though there was a high degree of variance (Figure 2; Supplementary Table S2). Tukey’s HSD test was also conducted to determine if there were statistically significant differences (α = 0.05) in the means of seed FA compositions (Table 1) and ratios (Supplementary Table S3) for different locations and years. Overall, seeds of soybean PIs grown in Warsaw in 2020 showed the most statistically significant differences for most FA compositions and ratios, particularly favoring the high PA/LA ratio, relative to the other location and year combinations regardless of how FA composition was expressed. As for the individual FAs, OA composition for both OA as % seed weight and % oil varied the most and was significantly different for all four combinations of locations and years (Table 1). There was an expected degree of variance for this type of data when levels or % of specific metabolites are averaged in genetically diverse lines without considering potential differences due to genotypes.

Figure 2
www.frontiersin.org

Figure 2. Average FA compositions in 218 soybean PIs grown in two locations and two years in Virginia. Soybean PIs were grown in a randomized block design in Blacksburg and Warsaw in 2020 and 2021 and FA analysis was performed as described in Materials and methods. FA composition was expressed as FA % of seed weight and FA % in oil of individual PIs and shown as an average ± SD of all tested PIs grown at each location and year with two replications. (A) Average seed FA composition and (B) Average oil FA composition. Based on ANOVA, no statistically significant differences (p < 0.05) were identified among the specific FAs in the PIs from both locations and years.

Table 1
www.frontiersin.org

Table 1. Statistical analysis of means of FA composition in soybean PIs grown in two different locations and years in Virginia.

To include genotypes and potential location and year interactions, we also explored the fixed effects of these different factors on FA composition in soybean seeds by using both types of FA composition data. Specifically, we used ANOVA to investigate if genotype, location, year, and year · location interaction had significant effects on FA composition in the seeds of soybean PIs by comparing least squares means. ANOVA revealed significant effects for all fixed factors for most FA compositions and ratios (Supplementary Table S4). Only some factors had no statistically significant effects on specific FA traits and included the effects of: (i) location on PA and OA composition (% seed weight) and on the OA/PA ratio, (ii) year on ALA (% seed weight) and the OA/ALA and (OA+ALA)/LA ratios, and (iii) the interaction of year · location on the OA/(PA+ALA) and (OA+ALA)/(LA+PA) ratios.

To further evaluate the variance in these datasets (Supplementary Table S2) in the context of specific genotypes and to identify the PIs contributing the most to this variance, PCA was performed on both types of FA composition datasets. This multivariate statistical method is used to evaluate if samples are similar or different from each other based on specific traits. Here, we used PCA to assess whether there is sufficient variance among the PIs based on FA composition due to different environmental conditions for mGWAS. PCA confirmed the lack of major differences in FA composition regardless of location or year in many PIs (Figure 3; Supplementary Figure S1). Based on the PCA results, many PIs from all four locations and year combinations clustered together, suggesting that these PIs were not different from each other in terms of FA composition. However, some distinction between two years can be observed based on the clustering of PIs from different locations and the same year. This is evident from the overlapping blue and black dots representing some PIs from Blacksburg and Warsaw in 2020 that are separated from overlapping green and red dots representing some PIs from the two locations in 2021 (Figure 3; Supplementary Figure S1). In addition, many outliers, particularly for the seed-based FA composition, were identified (Supplementary Figure S1). PCA on the oil-based FA composition showed only several extreme outliers and specifically for PIs grown in Warsaw in 2020, with only four PIs with both replicate measurements as outliers. This was evident from the separation of PI424611B, PI543793, PI548160, and PI548547 from the rest of the PIs based on PCs 1 and 2, but not PC3 and several other outlier PIs with another replicate clustering with the rest of the PIs (Figure 3). It appears that these PIs responded consistently to different environmental conditions specifically in Warsaw in 2020 compared to the remaining plants. Overall, there may be enough PIs with environmentally induced differences in FA seed composition for mGWAS.

Figure 3
www.frontiersin.org

Figure 3. PCA on correlations of oil FA composition of PIs from two different locations in Virginia and years. Soybean PIs were grown, and their seed FA content analyzed as for Figure 1. Most PIs cluster together regardless of location or year, suggesting their oil FA compositions are similar. PC 1 and 2 account for nearly 90% of variance and explain the differences in oil FA composition in some PIs, particularly those grown in Warsaw in 2020. Although most PIs grown in Warsaw in 2020 cluster with the rest of PIs, many are outliers at the borderline of being separated from the main cluster. The four PIs with both replicates as outliers are shown in different colors.

3.3 Metabolite genome-wide association study

Our goal was to identify SNPs associated with genes encoding proteins likely regulating FA composition in soybean seeds through stress-related signaling pathways. We expected regulatory proteins to be responsive to the environment and have weaker effects on FA composition in soybean seeds than metabolic enzymes (FA elongases and desaturases), which would be reflected by weaker associations of SNPs with the corresponding genes. Therefore, to increase the chances of identifying SNPs, suggestive in addition to statistically significant p-value cutoffs were used. All combinations of complex FA ratios regardless of any relevance to biological processes and/or practical applications were also included to help with SNP identification. Because of the rather large environmental influence on these FA phenotypes, we also estimated the relative contribution of genetic and environmental variance to the observed FA phenotypes by determining the narrow-sense heritability for each FA composition and ratio. The heritability ranged from 0.015 to 0.7 for FA ratios; low heritability was observed for several FA ratios involving PA and high heritability for ALA-related FA ratios (Supplementary Table S5). FA ratios relevant to abiotic and biotic stresses such as PA/(OA + LA + ALA) and (PA + OA)/(LA + ALA) had low heritability.

mGWAS was performed using 44 different combinations of FA compositions and ratios at the significant (p < 10-5 at 95% confidence) and suggestive (p < 5.68 · 10-5 at 75% confidence) p-value cutoffs to identify 20 and 90 SNPs for 21 and 36 FA composition and ratio phenotypes, respectively (Supplementary Table S6). Many of the identified SNPs were associated with more than one FA composition and/or ratio that included these FAs in a way that a particular FA was present in every related FA ratio within that group. This FA was viewed as specific to the corresponding SNP and FA ratio group. In addition, many of the complex FA ratios represent similar ratios and can be considered redundant. Collectively, these 93 SNPs were located within 10-kb regions containing 124 unique genes and some of them were linked (17 different linkage groups of at least two SNPs), while 34 SNPs were independent (Supplementary Table S7). Because SNPs adjacent to the SNP associated with the specific phenotype are inherited together and form a linkage group, focusing on linked SNPs is expected to increase chances of finding the genes responsible for the observed metabolic phenotypes. Therefore, our focus was mostly on significant and suggestive linked SNPs for composition of individual FAs. As expected, some SNPs were associated with an increase, while the others with a decrease in the corresponding FA traits, as evident from their respective β coefficients (Supplementary Table S8).

To ensure that our models appropriately control false positives and negatives, QQ plots were generated and λ calculated for each FA trait. The resulting QQ plots for oil-based PA and OA composition, and for the (PA + OA)/(LA + ALA) ratio suggest the existence of significant SNPs associated with these FA traits (Figure 4). In all cases, λ was close to 1 and most QQ plots for the remaining FA compositions and ratios had the expected straight line until reaching the suggestive and significant p-value thresholds where the observed probability was higher than the expected one (Supplementary Figure S2). QQ plots for seed-based FA composition and some FA ratios, particularly those containing ALA in the nominator were indicative of the lack of significant or suggestive SNPs. Consistent with the QQ plots, no significant SNPs were identified for any FAs when FA composition was expressed as a percentage of FA content per seed weight and for ALA when ALA composition was expressed as a percentage of ALA in oil. However, several significant, mostly unlinked SNPs were identified for PA, OA, and LA with oil-based FA composition and specific FA ratios, and the largest linkage groups were found on chromosomes 1, 2, and 19 (Figures 5, 6, Supplementary Figures S3, S4, Tables S6, S7).

Figure 4
www.frontiersin.org

Figure 4. QQ plots for the selected FA composition and ratio traits. (A) Oil-based PA composition, (B) Oil-based OA composition, and (C) The (PA + OA)/(LA + ALA) ratio. The QQ plots for these selected FA traits were generated by plotting the observed and expected -log10(p) values from a uniform distribution and the corresponding λ factors were calculated as described in Materials and Methods. False positives and negatives were properly controlled as evident from the QQ curve shapes and λ factors (0.989, 0.985, and 0.969 for the PA, OA, and (PA + OA)/(LA + ALA) ratio, respectively).

Figure 5
www.frontiersin.org

Figure 5. Manhattan plots showing significant and suggestive SNPs for FA composition as a percentage of oil. SNP associations are shown as the negative logarithm of the p-values obtained from the GWAS model. Alternating dark red and light red points show the localization of SNPs on different chromosomes. Orange and black lines show the significance thresholds for significant (p-value < 10-5) and suggestive (p-value < 5.68 · 10-5) SNPs, respectively. For PA (A), significant and suggestive SNPs were identified on chromosomes 2, 3, 9, and 18, while for OA (B), they were located on chromosomes 1, 8, 16, and 19. (C) LA-associated SNPs were found on chromosomes 8, 15, 16, and 20. SNPs ss715602319 and ss715624375 on chromosomes 8 and 16 were the same as for OA.

Figure 6
www.frontiersin.org

Figure 6. Manhattan plots showing significant and suggestive SNPs for selected chromosomes and multiple simple and complex FA ratios. SNP associations and significance levels (also represented by the orange and blue lines for significant and suggestive SNPs, respectively) are the same as in Figure 4. SNPs associated with individual FA ratios are color coded. Only the last third of chromosome 2 (A) is shown as no significant or suggestive SNPs were found on this chromosome before 35 Mbs. Pink, purple, and mustard colors for the clusters of significant and suggestive SNPs between 40 and 41 Mbs correspond to the SNPs for several PA-related FA ratios, including (PA + OA)/(LA + ALA). (B) For chromosome 1, pink and blue color for the significant and suggestive linked SNPs between 50 and 55 Mbs correspond to PA- and OA-related FA ratios.

Two major linkage groups were identified for PA (2.3 and 3.4, where the first number represents the chromosome and the second one the linkage group; see Supplementary Table S7). In linkage group 2.3, nine linked SNPs (ss715582608 - 44) were associated with eight genes encoding regulatory and metabolic proteins and were suggestive for the elevated PA/LA and (PA + ALA)/LA ratios with PA as the FA driving these genetic associations. One of the metabolism-related genes (Glyma.02g218700) near SNP ss715582631 encodes a glycerol-3-phosphate dehydrogenase known to influence FA composition towards FA unsaturation in soybean seeds (Zhao et al., 2021). Near this group of linked SNPs, two individual SNPs (ss715582649 and ss715582700) were also significant for high PA as a % of oil and many other FA ratios involving PA and OA and associated with a methyl transferase or a transporter gene and two uncharacterized genes. Five metabolic and regulatory genes (Glyma.03g229300 - Glyma.03g229600) were associated with a group of SNPs (ss715586385 – 9) that were related to elevated PA/OA ratios and were part of linkage group 3.4 (Supplementary Table S7B).

For OA as a % of oil, four major linkage groups (1.4, 1.5, 16.1, and 19.2) were identified (Supplementary Table S7). In the adjacent linkage groups 1.4 and 1.5, three significant and seven suggestive SNPs (ss715580258 – 60 and ss715580261 - 315), respectively, were also associated with several complex FA ratios like the OA/(LA + ALA) ratio in group 1.4. The high OA/(LA + ALA) ratio is of interest as it reflects the MUFA/PUFA ratio relevant to human and animal nutrition. The β coefficients were always positive for the OA/LA-related ratios and always negative for LA/OA-ratios (Supplementary Table S8), conferring to the increased simple and complex MUFA/PUFA ratios. These OA-associated SNPs were near or within 15 adjacent metabolic and regulatory genes (Glyma.01g190700 - Glyma.01g194200). However, information on specific biological processes these genes are involved in is not available, except for the strigolactone biosynthetic gene Glyma.01g191200 (Liu et al., 2020). Similarly, linkage group 16.1 is also associated with various OA/LA-related ratios, including OA/(LA + ALA). In this case, the OA/LA-related ratios are low, as indicated by the negative β coefficients for these ratios (Supplementary Table S8). Group 16.1 contains four linked SNPs (ss715624374 - 8) associated with four adjacent genes (Glyma.16g151000 - Glyma.16g151300). Glyma.16g151000 encodes a protein that binds phospholipids, while Glyma.16g151200 syntaxin involved in vesicle trafficking. TatD-related DNAse and aquaporin AQP67 are encoded by Glyma.16g151100 and Glyma.16g151300 and these types of proteins are known to be involved in stress responses (Dasmandal et al., 2020; Feng et al., 2019). However, AQP67 (designated as the stress-induced protein GmSIP1;2) was shown as unresponsive to drought stress (Feng et al., 2019).

The OA- and LA/OA-driven linkage group 19.2 contains eight SNPs (ss715635406 - 22). Group 19.2 is also associated with several low OA/LA-related ratios, including OA/(LA + ALA). There are nine consecutive genes (Glyma.19g192600 - Glyma.19g194300) associated with the SNPs within this linkage group. This group of genes was also identified by GWAS to be involved in phosphorus efficiency in maturing soybean plants (Wang et al., 2022b). In Arabidopsis, homologs of these genes are either developmental or related to phosphate metabolism (Cerise et al., 2023; Jung et al., 2009; Muller et al., 2007; Skinner and Gasser, 2009), though Glyma.19g193800 encodes an AN1-type zinc finger protein that could be drought stress-related (Ding et al., 2013). For low LA as a % of oil, only a small linkage group (20.1) of two suggestive SNPs (ss715638312 and ss715638314) associated with four genes of unknown or unrelated biological functions was identified. Glyma.20g187000 is a developmental gene involved in plastidic RNA editing and chloroplast function (Zhu et al., 2020).

The (PA + OA)/(LA + ALA) ratio is related to compositional changes in FAs when the levels of PA and OA increase at the expense of ALA and LA to decrease membrane fluidity as part of adaptation to drought and heat stress (Henschel et al., 2024; Higashi and Saito, 2019; Sharma et al., 2023). This complex FA ratio was not well represented in our data, but it was increased for all relevant SNPs (Supplementary Table S8). This FA ratio is associated with only two linkage groups (15.1 and 19.1) and two individual SNPs (ss715582171 and ss715627896). The first SNP is not near any gene, while the second one is in vicinity of three regulatory genes, one of which (Glyma.17g053700) encodes a heat-shock transcription factor. For 15.1, three suggestive SNPs (ss715620394, ss715620407, and ss715620417), are related to the (PA + OA)/(LA + ALA) ratio. There was only one such SNP (ss715635104) linked to two uncharacterized genes in group 19.1. SNP ss715620394 is associated with DNA-J domain chaperone protein encoded by Glyma.15G146900 and ss715635104 with two uncharacterized proteins.

4 Discussion

4.1 Environmental influence on FA composition in soybean seeds

Soybean domestication has led to a significant loss of genetic diversity during the past 9,000 years, leading to relatively stable FA composition in the seeds of different soybean cultivars and breeding lines (Derbyshire et al., 2023; Sedivy et al., 2017; Song et al., 2023). As expected, seed FA composition was consistent across most tested soybean PIs regardless of the location and year, with LA predominating and followed by OA, PA, and ALA, which is consistent with previously reported FA composition values (Abdelghany et al., 2020; Di et al., 2022; Messina, 2016; Song et al., 2023). Although seed FA composition was stable in the majority of soybean PIs, environmental conditions were sufficiently different, especially in Warsaw in 2020 and between the two years, and coincided with soybean reproductive stages to induce changes in FA composition in many PIs. The observed higher PA and lower PUFA levels could be related to higher temperatures and lower rainfall during key stages of soybean seed development in Warsaw 2020. However, the two locations and years likely vary in other environmental aspects (occurrence and interactions of other abiotic and biotic stressors) that were not accounted for in our study and could have had an impact on FA levels and composition. The important point is that the environmental conditions varied enough during key stages of soybean seed development to induce expected changes in seed FA composition so that statistically significant genotype-to-phenotype relationships could be identified by mGWAS.

The unsaturation level of FAs can be influenced by various environmental stresses, including drought and heat. Double bonds in cis configuration create kinks in the FA chains disrupting hydrophobic interactions between FAs within biological membranes, thus increasing membrane fluidity, which is not desirable at high temperatures (Rustgi et al., 2021). A common stress response during drought and heat stress includes lowering the unsaturation level by increasing the proportion of SFAs and sometimes also monounsaturated FAs such as OA at the expense of PUFAs such as ALA and LA (Schmidt et al., 2015; Mousavi et al., 2022; Wang et al., 2022a). Seeds from PIs grown in Warsaw 2020 showed the lowest levels of PUFAs and the highest levels of PA. Similarly, higher levels of PA and lower levels of OA were detected in the seeds of PIs from Blacksburg in 2020 than seeds from Blacksburg 2021, which could be attributed to the higher temperatures and lower rainfall in Blacksburg in July 2020. However, soybean PIs grown in Blacksburg in 2020 had the highest seed ALA levels, suggesting that other environmental factors that were not accounted for may have been interacting and affecting FA composition in an unpredictable way.

The molecular mechanisms driving membrane remodeling in terms of FA composition are complex and involve various metabolic and signaling pathways, including, but not limited to reduced expression of genes encoding FA elongases and desaturases for de novo FA synthesis (Hernández et al., 2019; Nahar et al., 2016; Narayanan et al., 2020) and lipid and FA exchange between membrane lipids and oil (Kanai et al., 2019; Bates, 2016; Bhandari and Bates, 2021). During seed filling, these processes occur naturally in soybean regardless of stress that a plant might be experiencing as diacylglycerol and FAs from membrane lipids are used for oil assembly to keep up with high demands of carbon for oil synthesis (Kanai et al., 2019; Bates, 2016; Bhandari and Bates, 2021). During this time, FA composition in oil will depend on the type of stress and the required response to overcome it. Other protective mechanisms, including up-regulation of stress-related genes and production of protective metabolites involving different types of regulatory mechanisms, are also in place (Zhang et al., 2015; Gelaw and Sanan-Mishra, 2021; Haak et al., 2017; Punzo et al., 2020; Razi and Muneer, 2021; Zandalinas and Mittler, 2022).

We expected the genotype-phenotype associations for regulatory and signaling genes involved in modulating FA composition to be weaker than for metabolic genes directly involved in FA elongation and desaturation. In addition, domesticated soybeans have relatively low genetic diversity, and many traits can be highly influenced by the environment. As expected, the heritability of FA ratios relevant to heat and drought stress such as PA/(OA + LA + ALA) and (PA + OA)/(LA + ALA) was low, suggesting that these traits are influenced more by environment than genetics. Using all possible FA ratios proved useful, as we were able to identify several significant and suggestive SNPs for different FA compositions and simple and complex FA ratios. In many cases, a SNP was associated with more than one FA ratio, suggesting that these ratios were at least partially redundant. In the case of PA, OA, and to some extent LA, a clear prevalence of these FAs was observed in groups of these partially redundant FA ratios associated with specific linked SNPs, suggesting that these individual FAs were driving the statistical differences in these ratios for mGWAS.

4.2 Metabolic genes influencing FA composition

Identification of metabolic genes that are known or implicated to affect soybean seed FA composition directly or indirectly can serve as a type of validation for our approach. Since we identified such genes, our approach is suitable for the identification of other genes potentially involved in modulating FA composition in a response to different environmental conditions. As part of the central carbon metabolism, plastidic glycerol-3-phosphate dehydrogenase (GmGPDHp1) and glucose-6-phosphate 1-dehydrogenase 1 (GmG6PDH1) produce carbon substrates and reductant (NADPH) used for FA synthesis and desaturation during seed development when light may be limiting (Allen et al., 2009; Hu et al., 2023; Zhao et al., 2021). These genes were identified as candidate genes based on several SNPs that were strongly associated with PA, LA or OA and the related FA ratios that would lower FA desaturation consistent with drought and heat stress. Overexpression of the GmGPDHp1 gene resulted in an overall increase in the levels of unsaturated FAs, particularly OA, thus altering an overall FA composition towards unsaturation in soybean seeds (Zhao et al., 2021). NAD kinases could also be part of this mechanism, as they control the NAD(H) to NADP(H) ratio, therefore contributing to the maintenance of redox homeostasis in plants under various abiotic and biotic stresses (Li et al., 2018). Activities of the plastidic dehydrogenases could rely on the availability of NADP produced from NAD+ by these NAD kinases. For the LA-driven FA ratios, the SNP ss715612955 is associated with Glyma.12g234000 encoding a putative NAD kinase potentially involved in modulating FA composition through this mechanism.

Interestingly, the GmG6PDH1 gene was associated with the same groups of linked SNPs as Glyma.03g229600 encoding an uncharacterized protein homologous to the Arabidopsis At1g09390 gene encoding a zinc finger FYVE domain-containing GDSL lipase/acylhydrolase (Lai et al., 2017). These enzymes either hydrolyze lipids to diacylglycerol and free FAs or attach FAs to lipids and have been associated with high-oil phenotypes in soybean seeds and maize kernels (Lai et al., 2017; Akoh et al., 2004; Liao et al., 2024; Luo et al., 2023). GDSL lipases/acylhydolases have potential to modulate the desaturation level in oil depending on the desaturation degree of membrane lipid-derived diacylglycerols and which FAs are available for oil synthesis. However, they can also have non-lipid substrates and specific metabolic functions of At1g09390 and Glyma.03g229600 gene products remain to be determined. The Glyma.16g151000 gene encoding a phosphatidylethanolamine-binding protein is associated with two SNPs predicted to lower OA composition and OA-related FA ratios. Different alleles of this gene were shown to be associated with regulating flowering time (Chen et al., 2020; Liu et al., 2022; Ogiso-Tanaka et al., 2019). Differences in flowering time could shift the timing when plants are exposed to different stresses during membrane remodeling, FA synthesis, and oil assembly as part of seed filling, which could lead to differences in FA composition. Dehydrogenases and membrane-modifying enzymes represent molecular components of two different mechanisms contributing to modulating FA composition.

4.3 Effects of abiotic and biotic stress signaling on FA composition

In addition to regulating many diverse aspects of plant growth, development, and abiotic and/or biotic stress responses (Duhan and Pasrija, 2025; Gupta et al., 2024), the phytohormones ABA, auxin, ethylene, JA, and SA are involved in regulating FA and lipid metabolism (Bajguz and Piotrowska-Niczyporuk, 2023; Waadt et al., 2022; Yi et al., 2024). Interactions of the corresponding signaling pathways in plants under combinations of abiotic and biotic stresses are complex and can be synergistic or contradictory (Gupta et al., 2024; Singh and Roychoudhury, 2023; Waadt et al., 2022). Except for ethylene, which has opposite effects on FA composition (Li et al., 2023), these phytohormones were shown to alter PA, OA, and/or LA composition towards elevated PA and OA levels in leaves during drought and heat stress or in developing seeds of various species (Ghassemi-Golezani and Farhangi-Abriz, 2018; Huo et al., 2020; Wang et al., 2022a), including soybean (Ram Kumar et al., 2021; Schmidt et al., 2015). These FA composition changes are associated with changes in expression of genes encoding ABA-responsive genes and FA desaturases (Ghassemi-Golezani and Farhangi-Abriz, 2018; Ram Kumar et al., 2021; Schmidt et al., 2015; He and Ding, 2020). Similarly, aphid feeding is also accompanied by significant increases in the ratios of PA and OA to LA and ALA ratio, lowering the levels of ALA that can be used for JA synthesis as a mechanism to block JA signaling in soybean leaves (Kanobe et al., 2015).

Therefore, there appears to be a common trend of increasing the levels of FAs that have no and/or low degree of unsaturation such as PA and OA at the expense of the PUFAs LA and ALA during abiotic and biotic stresses, such as drought, heat, and herbivory. We were able to identify several SNPs associated with the increased (PA + OA)/(LA + ALA), and the related PA/(LA + ALA) and/or OA/(LA + ALA) ratios that represent these types of changes in FA composition, with primarily PA and/or OA as the driving factors for significance of these FA ratios. These SNPs were within or near the genes encoding proteins of diverse molecular functions (protein-molecule interactions, transporters, transcription factors, and protein- and membrane-modifying enzymes). Most of these proteins are uncharacterized in respect to specific biological processes, while others are known to function in processes unrelated to regulating FA composition. Some of these proteins are known as components of phytohormone-mediated signaling pathways that could be involved in regulating FA composition under various environmental stresses, or their genes have been shown to be stress responsive. All these genes represent potential candidates for experimental validation and some of them will be discussed in the context of their association with relevant phytohormone-mediated signaling pathways and/or stress responsiveness.

4.3.1 Abiotic stress-related candidate genes

For the OA-related FA ratios, two adjacent linkage groups (1.4 and 1.5) with three and seven SNPs (ss715580258 – 60 and ss715580261 – 315) on chromosome 1 had three SNPs significantly associated with several OA/LA-related ratios and seven SNPs above the suggestive threshold for the PA/OA ratio. Three genes (Glyma.01g191000 - Glyma.01g191300) were linked with most of these SNPs and encode a tetratricopeptide repeat (TPR) domain-containing protein, KAI2-related esterase, and protein phosphatase 2C. TPR repeat-containing proteins are involved in protein-protein interactions, while protein phosphatases regulate activities of phosphorylated proteins by dephosphorylation. These types of proteins act as regulatory components of phytohormone-mediated abiotic and biotic stress responses in plants (Jung et al., 2020; Jurkiewicz and Batoko, 2018; Schapire et al., 2006; Sharma and Pandey, 2016; Zhou et al., 2021). The biological function of the KAI2-related esterase is known; this enzyme is involved in the biosynthesis of strigolactones (Liu et al., 2020), phytohormones found in root exudates promoting mycorrhizal associations and germination of parasitic plants as well as many aspects of plant development and responses to drought and other environmental stresses (Jardim-Messeder et al., 2025). Within linkage group 1.5 on chromosome 1, ss715580267 was also associated with a gene encoding a DnaJ domain-containing protein. These proteins function as molecular chaperones by maintaining protein stability under heat and other environmental stresses (Dong et al., 2025; Nagaraju et al., 2020; Zhao et al., 2022). We have identified several genes for these DnaJ domain-containing heat shock proteins that were associated with SNPs on other chromosomes. For example, Glyma.15g146900 is associated with ss715620393 and ss715620394 and the increased PA/ALA and (PA+ OA)/(LA + ALA) ratios. This gene is a homologue of the Arabidopsis At3g06778 gene encoding a heat-shock transcription factor involved in drought and salt stress by ABA-mediated signaling (Hwang et al., 2014). While the involvement of these DnaJ domain-containing heat-shock proteins in heat and drought stress signaling is well documented, whether they also modulate FA composition remains to be explored.

Two LA- and ALA-driven linked SNP (ss715631993 and ss715631995, linkage group 18.4) are in the coding region of the gene Glyma.18g255000 and in the vicinity of Glyma.18g255100. The first gene encodes an enzyme that synthesizes inositol pyrophosphates involved in auxin, JA, and SA signaling (Wu et al., 2024), while the second gene encodes an AN1-type zinc finger protein that has been identified as a stress-associated protein (GmSAP23) (Zhang et al., 2019). An AN1-type zinc finger protein encoded by Glyma.19g193800 also could be drought stress-related as the homologous At2g36320 gene was identified as a drought stress memory gene in Arabidopsis (Ding et al., 2013). Glyma.19g193800 has ss715635416 in its coding sequence and is the only stress-related candidate gene belonging to the large linkage group 19.2. The second SNP (ss715631995) was identified as part of a QTL for the soybean maturity timing (Zimmer et al., 2021). Early maturing soybean accessions show increased levels of PA and OA and lower levels of LA and ALA that could be connected to warm environments in early fall (Abdelghany et al., 2020).

4.3.2 Biotic stress-related candidate genes

We identified several pathogen- and insect feeding-related candidate genes associated with SNPs predicted to modulate PA, OA, and/or ALA composition. This suggests that differences in environmental conditions in Blacksburg and Warsaw during the 2020 and 2021 growing season could have provided differential conditions conducive to plant diseases and herbivory. The SNP ss715582649 that we identified as significant for increased PA compositions and PA-related ratios was previously reported as associated with soybean resistance to Fusarium subglutinans (Rafi et al., 2024). This SNP is located within an intron of Glyma.02g219700, a homologue of the Arabidopsis gene for methyl esterase 17 responsible for converting an inactive methylated form of auxin to the active form (Yang et al., 2008; Zhang and Peer, 2017). The involvement of this gene in regulating auxin homeostasis and FA composition remains to be experimentally validated in soybean.

The Glyma.03g229500 gene encoding a NIM1-INTERACTING 1-like isoform X2 protein is located between two other candidate lipid metabolism-related genes (Glyma.03g229400 and Glyma.03g229600 encoding GmG6PDH1 and a lipase/acylhydrolase, respectively, that were already discussed) for modulating FA composition. NIM1-INTERACTING proteins are known to be involved in JA and SA signaling, and plant defense against aphid feeding (Thompson and Goggin, 2006), but their involvement in altering FA composition has not been demonstrated. All three genes represent potential candidates for experimental validation of their involvement in regulating FA composition. Two genes (Glyma.13g194500 and Glyma.13g194600) encoding leucine-rich repeat containing proteins were both linked to the same two SNPs (ss715615032 and 34) for the high FA/ALA-related ratios. The resulting proteins may be involved in aphid resistance by lowering the ALA availability for JA synthesis during aphid feeding (Yang et al., 2022). Glyma.13g194500 is a homolog of the CYR-1 gene involved in resistance to yellow mosaic disease and Phytophthora sojae (Rahman et al., 2023; Schneider et al., 2016), while both genes may confer resistance to a root-knot nematode (Alekcevetch et al., 2021).

5 Conclusions

Differential environmental conditions in Blacksburg and Warsaw in 2020 and 2021 sufficiently affected FA composition in soybean seeds for mGWAS. We were able to identify several candidate SNPs and genes associated with various FA ratios in soybean oil in the context of abiotic and biotic stresses, offering new insights into potential regulatory mechanisms governing these traits. The discovery of SNPs associated with specific FA ratios and linked to genes related to various stress responses also highlights the complex interplay between the environment, phytohormone-mediated signaling, and FA composition phenotypes. The observed enrichment towards low levels of unsaturation in soybean oil as a response to a combination of several environmental stresses is desirable for developing health-promoting high-OA cultivars. However, for other practical applications and for seed germination in colder environments, high PUFA content in seed oil is preferrable. Understanding how these genes influence FA composition could have significant implications for breeding efforts aimed at improving oil quality in response to industry needs and climate change.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author. mGWAS data was also submitted to the SoyBase database.

Author contributions

PB: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review & editing. PF: Formal analysis, Investigation, Methodology, Writing – review & editing. BZ: Funding acquisition, Methodology, Project administration, Resources, Supervision, Writing – review & editing. EC: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Visualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. The funding for this project came from the Virginia Agricultural Council, project #803. EC’s salary is funded through USDA HATCH project # VA-160157 supporting this study.

Acknowledgments

We would like to acknowledge the USDA-ARS for providing soybean germplasm and publicly accessible genotypic information through SoyBase.org. Our thanks also go to Drs. David Haak, Zhibo Wang, and Gota Morota for sharing their R codes and GWAS tools as well as for useful stimulating discussions about GWAS.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2025.1572319/full#supplementary-material

References

Abdelghany, A. M., Zhang, S., Azam, M., Shaibu, A. S., Feng, Y., Qi, J., et al. (2020). Natural Variation in Fatty Acid Composition of Diverse World Soybean Germplasms Grown in China. Agronomy 10, 24. doi: 10.3390/agronomy10010024

Crossref Full Text | Google Scholar

Akoh, C. C., Lee, G. C., Liaw, Y. C., Huang, T. H., Shaw, J. F. (2004). GDSL family of serine esterases/lipases. Prog. Lipid Res. 43, 534–552. doi: 10.1016/j.plipres.2004.09.002

PubMed Abstract | Crossref Full Text | Google Scholar

Alekcevetch, J. C., De Lima Passianotto, A. L., Ferreira, E. G. C., Dos Santos, A. B., Da Silva, D. C. G., Dias, W. P., et al. (2021). Genome-wide association study for resistance to the Meloidogyne javanica causing root-knot nematode in soybean. Theor. Appl. Genet. 134, 777–792. doi: 10.1007/s00122-020-03723-9

PubMed Abstract | Crossref Full Text | Google Scholar

Allen, D. K., Ohlrogge, J. B., Shachar-Hill, Y. (2009). The role of light in soybean seed filling metabolism. Plant J. 58, 220–234. doi: 10.1111/j.1365-313X.2008.03771.x

PubMed Abstract | Crossref Full Text | Google Scholar

Ayalew, H., Schapaugh, W., Vuong, T., Nguyen, H. T. (2022). Genome-wide association analysis identified consistent QTL for seed yield in a soybean diversity panel tested across multiple environments. Plant Genome 15, e20268. doi: 10.1002/tpg2.20268

PubMed Abstract | Crossref Full Text | Google Scholar

Baer, D. J., Henderson, T., Gebauer, S. K. (2021). Consumption of High-Oleic Soybean Oil Improves Lipid and Lipoprotein Profile in Humans Compared to a Palm Oil Blend: A Randomized Controlled Trial. Lipids 56, 313–325. doi: 10.1002/lipd.12298

PubMed Abstract | Crossref Full Text | Google Scholar

Bajguz, A., Piotrowska-Niczyporuk, A. (2023). Biosynthetic Pathways of Hormones in Plants. Metabolites 13, 884. doi: 10.3390/metabo13080884

PubMed Abstract | Crossref Full Text | Google Scholar

Banaszak, M., Dobrzyńska, M., Kawka, A., Górna, I., Woźniak, D., Przysławski, J., et al. (2024). Role of Omega-3 fatty acids eicosapentaenoic (EPA) and docosahexaenoic (DHA) as modulatory and anti-inflammatory agents in noncommunicable diet-related diseases &x2013; Reports from the last 10 years. Clin. Nutr. ESPEN 63, 240–258. doi: 10.1016/j.clnesp.2024.06.053

PubMed Abstract | Crossref Full Text | Google Scholar

Bates, P. D. (2016). Understanding the control of acyl flux through the lipid metabolic network of plant oil biosynthesis. Biochim. Biophys. Acta 1861, 1214–1225. doi: 10.1016/j.bbalip.2016.03.021

PubMed Abstract | Crossref Full Text | Google Scholar

Bellaloui, N., Reddy, K. N., Mengistu, A. (2015). “Chapter 45 - Drought and Heat Stress Effects on Soybean Fatty Acid Composition and Oil Stability,” in Processing and Impact on Active Components in Food. Ed. Preedy, V. (Academic Press, San Diego).

Google Scholar

Bhandari, S., Bates, P. D. (2021). Triacylglycerol remodeling in Physaria fendleri indicates oil accumulation is dynamic and not a metabolic endpoint. Plant Physiol. 187, 799–815. doi: 10.1093/plphys/kiab294

PubMed Abstract | Crossref Full Text | Google Scholar

Bligh, E. G., Dyer, W. J. (1959). A rapid method of total lipid extraction and purification. Can. J. Biochem. Physiol. 37, 911–917. doi: 10.1139/y59-099

PubMed Abstract | Crossref Full Text | Google Scholar

Brown, A. V., Conners, S. I., Huang, W., Wilkey, A. P., Grant, D., Weeks, N. T., et al. (2020). A new decade and new data at SoyBase, the USDA-ARS soybean genetics and genomics database. Nucleic Acids Res. 49, D1496–D1501. doi: 10.1201/9781003065302

PubMed Abstract | Crossref Full Text | Google Scholar

Cerise, M., Da Silveira Falavigna, V., Rodriguez-Maroto, G., Signol, A., Severing, E., Gao, H., et al. (2023). Two modes of gene regulation by TFL1 mediate its dual function in flowering time and shoot determinacy of Arabidopsis. Development 150, dev202089. doi: 10.1242/dev.202089

PubMed Abstract | Crossref Full Text | Google Scholar

Chen, L., Cai, Y., Qu, M., Wang, L., Sun, H., Jiang, B., et al. (2020). Soybean adaption to high-latitude regions is associated with natural variations of GmFT2b, an ortholog of FLOWERING LOCUS T. Plant Cell Environ. 43, 934–944. doi: 10.1111/pce.13695

PubMed Abstract | Crossref Full Text | Google Scholar

Collakova, E., Aghamirzaie, D., Fang, Y., Klumas, C., Tabataba, F., Kakumanu, A., et al. (2013). Metabolic and Transcriptional Reprogramming in Developing Soybean (Glycine max) Embryos. Metabolites 3, 347–372. doi: 10.3390/metabo3020347

PubMed Abstract | Crossref Full Text | Google Scholar

Dasmandal, T., Rao, A., Sahu, S. (2020). Identification and characterization of circular RNAs regulating genes responsible for drought stress tolerance in chickpea and soybean. Indian J. Of Genet. And Plant Breed. 80, 1–8. doi: 10.31742/IJGPB.80.1.1

Crossref Full Text | Google Scholar

Derbyshire, M. C., Marsh, J., Tirnaz, S., Nguyen, H. T., Batley, J., Bayer, P. E., et al. (2023). Diversity of fatty acid biosynthesis genes across the soybean pangenome. Plant Genome 16, e20334. doi: 10.1002/tpg2.20334

PubMed Abstract | Crossref Full Text | Google Scholar

Di, Q., Piersanti, A., Zhang, Q., Miceli, C., Li, H., Liu, X. (2022). Genome-Wide Association Study Identifies Candidate Genes Related to the Linoleic Acid Content in Soybean Seed. Int. J. Mol. Sci. 23, 454. doi: 10.3390/ijms23010454

PubMed Abstract | Crossref Full Text | Google Scholar

Ding, Y., Liu, N., Virlouvet, L., Riethoven, J. J., Fromm, M., Avramova, Z. (2013). Four distinct types of dehydration stress memory genes in Arabidopsis thaliana. BMC Plant Biol. 13, 229. doi: 10.1186/1471-2229-13-229

PubMed Abstract | Crossref Full Text | Google Scholar

Dong, A., Wang, N., Zenda, T., Zhai, X., Zhong, Y., Yang, Q., et al. (2025). ZmDnaJ-ZmNCED6 module positively regulates drought tolerance via modulating stomatal closure in maize. Plant Physiol. Biochem. 218, 109286. doi: 10.1016/j.plaphy.2024.109286

PubMed Abstract | Crossref Full Text | Google Scholar

Duhan, L., Pasrija, R. (2025). Unveiling exogenous potential of phytohormones as sustainable arsenals against plant pathogens: molecular signaling and crosstalk insights. Mol. Biol. Rep. 52, 98. doi: 10.1007/s11033-024-10206-3

PubMed Abstract | Crossref Full Text | Google Scholar

Endelman, J. B. (2023). Fully efficient, two-stage analysis of multi-environment trials with directional dominance and multi-trait genomic selection. Theor. Appl. Genet. 136, 65. doi: 10.1007/s00122-023-04298-x

PubMed Abstract | Crossref Full Text | Google Scholar

Feng, Z.-J., Liu, N., Zhang, G.-W., Niu, F.-G., Xu, S.-C., Gong, Y.-M. (2019). Investigation of the AQP Family in Soybean and the Promoter Activity of TIP2;6 in Heat Stress and Hormone Responses. Int. J. Mol. Sci. 20, 262. doi: 10.3390/ijms20020262

PubMed Abstract | Crossref Full Text | Google Scholar

Gelaw, T. A., Sanan-Mishra, N. (2021). Non-Coding RNAs in Response to Drought Stress. Int. J. Mol. Sci. 22, 12519. doi: 10.3390/ijms222212519

PubMed Abstract | Crossref Full Text | Google Scholar

Ghassemi-Golezani, K., Farhangi-Abriz, S. (2018). Changes in Oil Accumulation and Fatty Acid Composition of Soybean Seeds under Salt Stress in Response to Salicylic Acid and Jasmonic Acid. Russian J. Plant Physiol. 65, 229–236. doi: 10.1134/S1021443718020115

Crossref Full Text | Google Scholar

Guidalevich, V., Berro, I., Aparicio, A. G., Azpilicueta, M. M., López, A. S., Arias-Rios, J. A., et al. (2025). Genetic variation and heritability of agronomic traits in a native perennial forage species from drylands: breeding potential of Festuca pallescens. Sci. Rep. 15, 6896. doi: 10.1038/s41598-025-90875-7

PubMed Abstract | Crossref Full Text | Google Scholar

Gupta, S., Kaur, R., Upadhyay, A., Chauhan, A., Tripathi, V. (2024). Unveiling the secrets of abiotic stress tolerance in plants through molecular and hormonal insights. 3 Biotech. 14, 252. doi: 10.1007/s13205-024-04083-7

PubMed Abstract | Crossref Full Text | Google Scholar

Haak, D. C., Fukao, T., Grene, R., Hua, Z., Ivanov, R., Perrella, G., et al. (2017). Multilevel Regulation of Abiotic Stress Responses in Plants. Front. Plant Sci. 8, 1564. doi: 10.3389/fpls.2017.01564

PubMed Abstract | Crossref Full Text | Google Scholar

He, M., Ding, N.-Z. (2020). Plant Unsaturated Fatty Acids: Multiple Roles in Stress Response. Front. Plant Sci. 11. doi: 10.3389/fpls.2020.562785

PubMed Abstract | Crossref Full Text | Google Scholar

Henschel, J. M., Andrade, A. N. D., Dos Santos, J. B. L., Da Silva, R. R., Da Mata, D. A., Souza, T., et al. (2024). Lipidomics in Plants Under Abiotic Stress Conditions: An Overview. Agronomy 14, 1670. doi: 10.3390/agronomy14081670

Crossref Full Text | Google Scholar

Hernández, M. L., Sicardo, M. D., Alfonso, M., Martínez-Rivas, J. M. (2019). Transcriptional Regulation of Stearoyl-Acyl Carrier Protein Desaturase Genes in Response to Abiotic Stresses Leads to Changes in the Unsaturated Fatty Acids Composition of Olive Mesocarp. Front. Plant Sci. 10. doi: 10.3389/fpls.2019.00251

PubMed Abstract | Crossref Full Text | Google Scholar

Higashi, Y., Saito, K. (2019). Lipidomic studies of membrane glycerolipids in plant leaves under heat stress. Prog. Lipid Res. 75, 100990. doi: 10.1016/j.plipres.2019.100990

PubMed Abstract | Crossref Full Text | Google Scholar

Hu, Y., Liu, Y., Wei, J.-J., Zhang, W.-K., Chen, S.-Y., Zhang, J.-S. (2023). Regulation of seed traits in soybean. aBIOTECH 4, 372–385. doi: 10.1007/s42994-023-00122-8

PubMed Abstract | Crossref Full Text | Google Scholar

Huo, K., Shui, L., Mai, Y., Zhou, N., Liu, Y., Zhang, C., et al. (2020). Effects of exogenous abscisic acid on oil content, fatty acid composition, biodiesel properties and lipid components in developing Siberian apricot (Prunus sibirica) seeds. Plant Physiol. Biochem. 154, 260–267. doi: 10.1016/j.plaphy.2020.06.020

PubMed Abstract | Crossref Full Text | Google Scholar

Huth, P. J., Fulgoni, V. L., 3rd, Larson, B. T. (2015). A systematic review of high-oleic vegetable oil substitutions for other fats and oils on cardiovascular disease risk factors: implications for novel high-oleic soybean oils. Adv. Nutr. 6, 674–693. doi: 10.3945/an.115.008979

PubMed Abstract | Crossref Full Text | Google Scholar

Hwang, S. M., Kim, D. W., Woo, M. S., Jeong, H. S., Son, Y. S., Akhter, S., et al. (2014). Functional characterization of rabidopsis HsfA6a as a heat-shock transcription factor under high salinity and dehydration conditions. Plant Cell Environ. 37, 1202–1222. doi: 10.1111/pce.2014.37.issue-5

PubMed Abstract | Crossref Full Text | Google Scholar

Jardim-Messeder, D., De Souza-Vieira, Y., Sachetto-Martins, G. (2025). Dressed Up to the Nines: The Interplay of Phytohormones Signaling and Redox Metabolism During Plant Response to Drought. Plants 14, 208. doi: 10.3390/plants14020208

PubMed Abstract | Crossref Full Text | Google Scholar

Jung, B., Florchinger, M., Kunz, H. H., Traub, M., Wartenberg, R., Jeblick, W., et al. (2009). Uridine-ribohydrolase is a key regulator in the uridine degradation pathway of Arabidopsis. Plant Cell 21, 876–891. doi: 10.1105/tpc.108.062612

PubMed Abstract | Crossref Full Text | Google Scholar

Jung, C., Nguyen, N. H., Cheong, J.-J. (2020). Transcriptional Regulation of Protein Phosphatase 2C Genes to Modulate Abscisic Acid Signaling. Int. J. Mol. Sci. 21, 9517. doi: 10.3390/ijms21249517

PubMed Abstract | Crossref Full Text | Google Scholar

Jurkiewicz, P., Batoko, H. (2018). Protein degradation mechanisms modulate abscisic acid signaling and responses during abiotic stress. Plant Sci. 267, 48–54. doi: 10.1016/j.plantsci.2017.10.017

PubMed Abstract | Crossref Full Text | Google Scholar

Kaler, A. S., Gillman, J. D., Beissinger, T., Purcell, L. C. (2019). Comparing Different Statistical Models and Multiple Testing Corrections for Association Mapping in Soybean and Maize. Front. Plant Sci. 10, 1794. doi: 10.3389/fpls.2019.01794

PubMed Abstract | Crossref Full Text | Google Scholar

Kanai, M., Yamada, T., Hayashi, M., Mano, S., Nishimura, M. (2019). Soybean (Glycine max L.) triacylglycerol lipase GmSDP1 regulates the quality and quantity of seed oil. Sci. Rep. 9, 8924. doi: 10.1038/s41598-019-45331-8

PubMed Abstract | Crossref Full Text | Google Scholar

Kanobe, C., Mccarville, M. T., O’neal, M. E., Tylka, G. L., Macintosh, G. C. (2015). Soybean Aphid Infestation Induces Changes in Fatty Acid Metabolism in Soybean. PloS One 10, e0145660. doi: 10.1371/journal.pone.0145660

PubMed Abstract | Crossref Full Text | Google Scholar

Lai, C. P., Huang, L. M., Chen, L. O., Chan, M. T., Shaw, J. F. (2017). Genome-wide analysis of GDSL-type esterases/lipases in Arabidopsis. Plant Mol. Biol. 95, 181–197. doi: 10.1007/s11103-017-0648-y

PubMed Abstract | Crossref Full Text | Google Scholar

Li, J., Ji, L. (2005). Adjusting multiple testing in multilocus analyses using the eigenvalues of a correlation matrix. Heredity 95, 221–227. doi: 10.1038/sj.hdy.6800717

PubMed Abstract | Crossref Full Text | Google Scholar

Li, H., Ma, X., Wang, W., Zhang, J., Liu, Y., Yuan, D. (2023). Enhancing the accumulation of linoleic acid and α-linolenic acid through the pre-harvest ethylene treatment in Camellia oleifera. Front. Plant Sci. 14. doi: 10.3389/fpls.2023.1080946

PubMed Abstract | Crossref Full Text | Google Scholar

Li, J., Pora, B. L. R., Dong, K., Hasjim, J. (2021). Health benefits of docosahexaenoic acid and its bioavailability: A review. Food Sci. Nutr. 9, 5229–5243. doi: 10.1002/fsn3.2299

PubMed Abstract | Crossref Full Text | Google Scholar

Li, B.-B., Wang, X., Tai, L., Ma, T.-T., Shalmani, A., Liu, W.-T., et al. (2018). NAD Kinases: Metabolic Targets Controlling Redox Co-enzymes and Reducing Power Partitioning in Plant Stress and Development. Front. Plant Sci. 9. doi: 10.3389/fpls.2018.00379

PubMed Abstract | Crossref Full Text | Google Scholar

Liao, W., Guo, R., Li, J., Liu, N., Jiang, L., Whelan, J., et al. (2025). CRISPR/Cas9-mediated mutagenesis of SEED FATTY ACID REDUCER genes significantly increased seed oil content in soybean. Plant Cell Physiol. 66, 273–284. doi: 10.1093/pcp/pcae148

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, F., Rice, J. H., Lopes, V., Grewal, P., Lebeis, S. L., Hewezi, T., et al. (2020). Overexpression of Strigolactone-Associated Genes Exerts Fine-Tuning Selection on Soybean Rhizosphere Bacterial and Fungal Microbiome. Phytobiomes J. 4, 239–251. doi: 10.1094/PBIOMES-01-20-0003-R

Crossref Full Text | Google Scholar

Liu, J., Xie, H., Lin, T., Tie, C., Luo, H., Yang, B., et al. (2022). Putative variants, genetic diversity and population structure among Soybean cultivars bred at different ages in Huang-Huai-Hai region. Sci. Rep. 12, 2372. doi: 10.1038/s41598-022-06447-6

PubMed Abstract | Crossref Full Text | Google Scholar

Luo, M., Lu, B., Shi, Y., Zhao, Y., Liu, J., Zhang, C., et al. (2023). Genetic basis of the oil biosynthesis in ultra-high-oil maize grains with an oil content exceeding 20. Front. Plant Sci. 14, 1168216. doi: 10.3389/fpls.2023.1168216

PubMed Abstract | Crossref Full Text | Google Scholar

Martins, N., Magalhães, R., Vieria, L., Couto, A., Serra, C. R., Maia, M. R. G., et al. (2023). Dietary oleic acid supplementation improves feed efficiency and modulates fatty acid profile and cell signaling pathway in European sea bass (Dicentrarchus labrax) juveniles fed high-lipid diets. Aquaculture 576, 739870. doi: 10.1016/j.aquaculture.2023.739870

Crossref Full Text | Google Scholar

Menne, M. J., Durre, I., Vose, R. S., Gleason, B. E., Houston, T. G. (2012). An Overview of the Global Historical Climatology Network-Daily Database. J. Atmos Ocean Technol. 29, 897–910. doi: 10.1175/JTECH-D-11-00103.1

Crossref Full Text | Google Scholar

Messina, M. (2016). Soy and Health Update: Evaluation of the Clinical and Epidemiologic Literature. Nutrients 8, 754. doi: 10.3390/nu8120754

PubMed Abstract | Crossref Full Text | Google Scholar

Misra, G., Badoni, S., Anacleto, R., Graner, A., Alexandrov, N., Sreenivasulu, N. (2017). Whole genome sequencing-based association study to unravel genetic architecture of cooked grain width and length traits in rice. Sci. Rep. 7, 12478. doi: 10.1038/s41598-017-12778-6

PubMed Abstract | Crossref Full Text | Google Scholar

Möhring, J., Piepho, H.-P. (2009). Comparison of Weighting in Two-Stage Analysis of Plant Breeding Trial. Crop Sci. 49, 1977–1988. doi: 10.2135/cropsci2009.02.0083

Crossref Full Text | Google Scholar

Mostafa, S., Wang, Y., Zeng, W., Jin, B. (2022). Plant Responses to Herbivory, Wounding, and Infection. Int. J. Mol. Sci. 23, 7031. doi: 10.3390/ijms23137031

PubMed Abstract | Crossref Full Text | Google Scholar

Mousavi, B. S., Sayfzadeh, S., Jabbari, H., Valadabadi, A. S., Masouleh, H. E. (2022). Effect of auxin foliar application on seed yield and fatty acids composition of two safflower genotypes under late-season drought. Plant Soil Environ. 68, 82–88. doi: 10.17221/329/2021-PSE

Crossref Full Text | Google Scholar

Mühlroth, A., Li, K., Røkke, G., Winge, P., Olsen, Y., Hohmann-Marriott, M. F., et al. (2013). Pathways of lipid metabolism in marine algae, co-expression network, bottlenecks and candidate genes for enhanced production of EPA and DHA in species of Chromista. Mar. Drugs 11, 4662–4697. doi: 10.3390/md11114662

PubMed Abstract | Crossref Full Text | Google Scholar

Muller, R., Morant, M., Jarmer, H., Nilsson, L., Nielsen, T. H. (2007). Genome-wide analysis of the Arabidopsis leaf transcriptome reveals interaction of phosphate and sugar metabolism. Plant Physiol. 143, 156–171. doi: 10.1104/pp.106.090167

PubMed Abstract | Crossref Full Text | Google Scholar

Nagaraju, M., Kumar, A., Rajasheker, G., Manohar Rao, D., Kavi Kishor, P. B. (2020). DnaJs, the critical drivers of Hsp70s: genome-wide screening, characterization and expression of DnaJ family genes in Sorghum bicolor. Mol. Biol. Rep. 47, 7379–7390. doi: 10.1007/s11033-020-05793-w

PubMed Abstract | Crossref Full Text | Google Scholar

Nahar, K., Hasanuzzaman, M., Fujita, M. (2016). “Heat stress responses and thermotolerance in soybean,” in Abiotic and Biotic Stresses in Soybean Production. Ed. Miransari, M. (Academic Press, San Diego).

Google Scholar

Narayanan, S., Zoong-Lwe, Z. S., Gandhi, N., Welti, R., Fallen, B., Smith, J. R., et al. (2020). Comparative Lipidomic Analysis Reveals Heat Stress Responses of Two Soybean Genotypes Differing in Temperature Sensitivity. Plants 9, 457. doi: 10.3390/plants9040457

PubMed Abstract | Crossref Full Text | Google Scholar

Natnan, M. E., Low, C.-F., Chong, C.-M., Bunawan, H., Baharum, S. N. (2023). Oleic acid as potential immunostimulant in metabolism pathways of hybrid grouper fingerlings (Epinephelus fuscoguttatus × Epinephelus lanceolatus) infected with Vibrio vulnificus. Sci. Rep. 13, 12830. doi: 10.1038/s41598-023-40096-7

PubMed Abstract | Crossref Full Text | Google Scholar

Ogiso-Tanaka, E., Shimizu, T., Hajika, M., Kaga, A., IshimotO, M. (2019). Highly multiplexed AmpliSeq technology identifies novel variation of flowering time-related genes in soybean (Glycine max). DNA Res. 26, 243–260. doi: 10.1093/dnares/dsz005

PubMed Abstract | Crossref Full Text | Google Scholar

Ozga, J. A., Kaur, H., Savada, R. P., Reinecke, D. M. (2016). Hormonal regulation of reproductive growth under normal and heat-stress conditions in legume and other model crop species. J. Exp. Bot. 68, 1885–1894. doi: 10.1093/jxb/erw464

PubMed Abstract | Crossref Full Text | Google Scholar

Petrie, J. R., Shrestha, P., Zhou, X.-R., Mansour, M. P., Liu, Q., Belide, S., et al. (2012). Metabolic Engineering Plant Seeds with Fish Oil-Like Levels of DHA. PloS One 7, e49165. doi: 10.1371/journal.pone.0049165

PubMed Abstract | Crossref Full Text | Google Scholar

Piepho, H. P., Mohring, J. (2007). Computing heritability and selection response from unbalanced plant breeding trials. Genetics 177, 1881–1888. doi: 10.1534/genetics.107.074229

PubMed Abstract | Crossref Full Text | Google Scholar

Punzo, P., Grillo, S., Batelli, G. (2020). Alternative splicing in plant abiotic stress responses. Biochem. Soc. Trans. 48, 2117–2126. doi: 10.1042/BST20200281

PubMed Abstract | Crossref Full Text | Google Scholar

Quoc Thien, N., Anna, K., Peter, A., Emery, R. J. N., Suresh, N. (2016). Soybean Seed Development: Fatty Acid and Phytohormone Metabolism and Their Interactions. Curr. Genomics 17, 241–260. doi: 10.2174/1389202917666160202220238

PubMed Abstract | Crossref Full Text | Google Scholar

Rafi, N., Dominguez, M., Okello, P., Mathew, F. M. (2024). No Common Candidate Genes for Resistance to Fusarium graminearum, F. proliferatum, F. sporotrichioides and F. subglutinans in Soybean Accessions from Maturity Groups 0 and I: Findings from Genome-wide Association Mapping. Plant Dis. 108, 2722–2730. doi: 10.1094/PDIS-02-24-0477-RE

PubMed Abstract | Crossref Full Text | Google Scholar

Rahman, S. U., Raza, G., Naqvi, R. Z., Mccoy, E., Hammad, M., Lafayette, P., et al. (2023). A source of resistance against yellow mosaic disease in soybeans correlates with a novel mutation in a resistance gene. Front. Plant Sci. 14, 1230559. doi: 10.3389/fpls.2023.1230559

PubMed Abstract | Crossref Full Text | Google Scholar

Kumar, M. R., Abhishek, K., Baburajendra Prasad, V., Kathiresan, S. (2021). IAA combine with kinetin elevates the α-linolenic acid in callus tissues of soybean by stimulating the expression of FAD3 gene. Plant Gene 28, 100336. doi: 10.1016/j.plgene.2021.100336

Crossref Full Text | Google Scholar

Ravelombola, W., Qin, J., Shi, A., Song, Q., Yuan, J., Wang, F., et al. (2021). Genome-wide association study and genomic selection for yield and related traits in soybean. PloS One 16, e0255761. doi: 10.1371/journal.pone.0255761

PubMed Abstract | Crossref Full Text | Google Scholar

Razi, K., Muneer, S. (2021). Drought stress-induced physiological mechanisms, signaling pathways and molecular response of chloroplasts in common vegetable crops. Crit. Rev. Biotechnol. 41, 669–691. doi: 10.1080/07388551.2021.1874280

PubMed Abstract | Crossref Full Text | Google Scholar

Ruiz-Lopez, N., Haslam, R. P., Usher, S., Napier, J. A., Sayanova, O. (2015). An alternative pathway for the effective production of the omega-3 long-chain polyunsaturates EPA and ETA in transgenic oilseeds. Plant Biotechnol. J. 13, 1264–1275. doi: 10.1111/pbi.12328

PubMed Abstract | Crossref Full Text | Google Scholar

Rustgi, S., Kakati, J. P., Jones, Z. T., Zoong Lwe, Z. S., Narayanan, S. (2021). Heat tolerance as a function of membrane lipid remodeling in the major US oilseed crops (soybean and peanut). J. Plant Biochem. Biotechnol. 30, 652–667. doi: 10.1007/s13562-021-00729-2

Crossref Full Text | Google Scholar

Sakurai, H., Yoshihashi, T., Nguyen, H. T. T., Pokorný, J. (2003). A new generation of frying oils. Czech J. Food Sci. 21, 145–151. doi: 10.17221/3491-CJFS

Crossref Full Text | Google Scholar

Schapire, A. L., Valpuesta, V., Botella, M. A. (2006). TPR Proteins in Plant Hormone Signaling. Plant Signal Behav. 1, 229–230. doi: 10.4161/psb.1.5.3491

PubMed Abstract | Crossref Full Text | Google Scholar

Schmidt, M. A., Parrott, W. A., Hildebrand, D. F., Berg, R. H., Cooksey, A., Pendarvis, K., et al. (2015). Transgenic soya bean seeds accumulating β-carotene exhibit the collateral enhancements of oleate and protein content traits. Plant Biotechnol. J. 13, 590–600. doi: 10.1111/pbi.2015.13.issue-4

PubMed Abstract | Crossref Full Text | Google Scholar

Schneider, R., Rolling, W., Song, Q., Cregan, P., Dorrance, A. E., Mchale, L. K. (2016). Genome-wide association mapping of partial resistance to Phytophthora sojae in soybean plant introductions from the Republic of Korea. BMC Genomics 17, 607. doi: 10.1186/s12864-016-2918-5

PubMed Abstract | Crossref Full Text | Google Scholar

Sedivy, E. J., Wu, F., Hanzawa, Y. (2017). Soybean domestication: the origin, genetic architecture and molecular bases. New Phytol. 214, 539–553. doi: 10.1111/nph.2017.214.issue-2

PubMed Abstract | Crossref Full Text | Google Scholar

Sehgal, A., Sita, K., Siddique, K. H. M., Kumar, R., Bhogireddy, S., Varshney, R. K., et al. (2018). Drought or/and Heat-Stress Effects on Seed Filling in Food Crops: Impacts on Functional Biochemistry, Seed Yields, and Nutritional Quality. Front. Plant Sci. 9. doi: 10.3389/fpls.2018.01705

PubMed Abstract | Crossref Full Text | Google Scholar

Sharma, P., Lakra, N., Goyal, A., Ahlawat, Y. K., Zaid, A., Siddique, K. H. M. (2023). Drought and heat stress mediated activation of lipid signaling in plants: a critical review. Front. Plant Sci. 14. doi: 10.3389/fpls.2023.1216835

PubMed Abstract | Crossref Full Text | Google Scholar

Sharma, M., Pandey, G. K. (2016). Expansion and Function of Repeat Domain Proteins During Stress and Development in Plants. Front. Plant Sci. 6. doi: 10.3389/fpls.2015.01218

PubMed Abstract | Crossref Full Text | Google Scholar

Shi, Y., Yang, S., Zhao, Z., An, L. (2023). Phytohormones regulate the non-redundant response of ω-3 fatty acid desaturases to low temperatures in Chorispora bungeana. Sci. Rep. 13, 2799. doi: 10.1038/s41598-023-29910-4

PubMed Abstract | Crossref Full Text | Google Scholar

Singer, W. M., Shea, Z., Yu, D., Huang, H., Mian, M. A. R., Shang, C., et al. (2022). Genome-Wide Association Study and Genomic Selection for Proteinogenic Methionine in Soybean Seeds. Front. Plant Sci. 13. doi: 10.3389/fpls.2022.859109

PubMed Abstract | Crossref Full Text | Google Scholar

Singh, A., Roychoudhury, A. (2023). Abscisic acid in plants under abiotic stress: crosstalk with major phytohormones. Plant Cell Rep. 42, 961–974. doi: 10.1007/s00299-023-03013-w

PubMed Abstract | Crossref Full Text | Google Scholar

Skinner, D. J., Gasser, C. S. (2009). Expression-based discovery of candidate ovule development regulators through transcriptional profiling of ovule mutants. BMC Plant Biol. 9, 29. doi: 10.1186/1471-2229-9-29

PubMed Abstract | Crossref Full Text | Google Scholar

Song, Q., Hyten, D. L., Jia, G., Quigley, C. V., Fickus, E. W., Nelson, R. L., et al. (2013). Development and Evaluation of SoySNP50K, a High-Density Genotyping Array for Soybean. PloS One 8, e54985. doi: 10.1371/journal.pone.0054985

PubMed Abstract | Crossref Full Text | Google Scholar

Song, H., Taylor, D. C., Zhang, M. (2023). Bioengineering of Soybean Oil and Its Impact on Agronomic Traits. Int. J. Mol. Sci. 24, 2256. doi: 10.3390/ijms24032256

PubMed Abstract | Crossref Full Text | Google Scholar

Steketee, C. J., Schapaugh, W. T., Carter, T. E., Jr., Li, Z. (2020). Genome-Wide Association Analyses Reveal Genomic Regions Controlling Canopy Wilting in Soybean. G3 (Bethesda) 10, 1413–1425. doi: 10.1534/g3.119.401016

PubMed Abstract | Crossref Full Text | Google Scholar

Stich, B., Melchinger, A. E. (2009). Comparison of mixed-model approaches for association mapping in rapeseed, potato, sugar beet, maize, and Arabidopsis. BMC Genomics 10, 94. doi: 10.1186/1471-2164-10-94

PubMed Abstract | Crossref Full Text | Google Scholar

Swanson, D., Block, R., Mousa, S. A. (2012). Omega-3 Fatty Acids EPA and DHA: Health Benefits Throughout Life. Adv. Nutr. 3, 1–7. doi: 10.3945/an.111.000893

PubMed Abstract | Crossref Full Text | Google Scholar

Thompson, G. A., Goggin, F. L. (2006). Transcriptomics and functional genomics of plant defence induction by phloem-feeding insects. J. Exp. Bot. 57, 755–766. doi: 10.1093/jxb/erj135

PubMed Abstract | Crossref Full Text | Google Scholar

Tutunchi, H., Ostadrahimi, A., Saghafi-Asl, M. (2020). The Effects of Diets Enriched in Monounsaturated Oleic Acid on the Management and Prevention of Obesity: a Systematic Review of Human Intervention Studies. Adv. Nutr. 11, 864–877. doi: 10.1093/advances/nmaa013

PubMed Abstract | Crossref Full Text | Google Scholar

Usher, S., Han, L., Haslam, R. P., Michaelson, L. V., Sturtevant, D., Aziz, M., et al. (2017). Tailoring seed oil composition in the real world: optimising omega-3 long chain polyunsaturated fatty acid accumulation in transgenic Camelina sativa. Sci. Rep. 7, 6570. doi: 10.1038/s41598-017-06838-0

PubMed Abstract | Crossref Full Text | Google Scholar

VanRaden, P. M. (2008). Efficient methods to compute genomic predictions. J. Dairy Sci. 91, 4414–4423. doi: 10.3168/jds.2007-0980

PubMed Abstract | Crossref Full Text | Google Scholar

Verbyla, A. (2023). On two-stage analysis of multi-environment trials. Euphytica 219, 121. doi: 10.1007/s10681-023-03248-4

Crossref Full Text | Google Scholar

Waadt, R., Seller, C. A., Hsu, P.-K., Takahashi, Y., Munemasa, S., Schroeder, J. I. (2022). Plant hormone regulation of abiotic stress responses. Nat. Rev. Mol. Cell Biol. 23, 680–694. doi: 10.1038/s41580-022-00479-6

PubMed Abstract | Crossref Full Text | Google Scholar

Walsh, T. A., Bevan, S. A., Gachotte, D. J., Larsen, C. M., Moskal, W. A., Merlo, P. A., et al. (2016). Canola engineered with a microalgal polyketide synthase-like system produces oil enriched in docosahexaenoic acid. Nat. Biotechnol. 34, 881–887. doi: 10.1038/nbt.3585

PubMed Abstract | Crossref Full Text | Google Scholar

Wang, Q., Ning, L., Yu, W., Zhao, W., Huang, F., Yu, D., et al. (2022b). Detection of Candidate Loci and Genes Related to Phosphorus Efficiency at Maturity through a Genome-Wide Association Study in Soybean. Agronomy 12, 2031. doi: 10.3390/agronomy12092031

Crossref Full Text | Google Scholar

Wang, N., Tao, B., Mai, J., Guo, Y., Li, R., Chen, R., et al. (2022a). Kinase CIPK9 integrates glucose and abscisic acid signaling to regulate seed oil metabolism in rapeseed. Plant Physiol. 191, 1836–1856. doi: 10.1093/plphys/kiac569

PubMed Abstract | Crossref Full Text | Google Scholar

Wu, T., Wang, C., Han, B., Liu, Z., Yang, X., Wang, W., et al. (2024). Emerging roles of inositol pyrophosphates in signaling plant phosphorus status and phytohormone signaling. Plant Soil 505, 127–145. doi: 10.1007/s11104-023-05976-x

Crossref Full Text | Google Scholar

Yang, J., Liu, G., Tang, J., Wang, X., Diao, Y., Su, Y., et al. (2022). Fine Mapping and Characterization of an Aphid-Resistance Gene in the Soybean Landrace Fangzheng Moshidou. Front. Plant Sci. 13, 899212. doi: 10.3389/fpls.2022.899212

PubMed Abstract | Crossref Full Text | Google Scholar

Yang, Y., Xu, R., Ma, C.-J., Vlot, A. C., Klessig, D. F., Pichersky, E. (2008). Inactive Methyl Indole-3-Acetic Acid Ester Can Be Hydrolyzed and Activated by Several Esterases Belonging to the AtMES Esterase Family of Arabidopsis. Plant Physiol. 147, 1034–1045. doi: 10.1104/pp.108.118224

PubMed Abstract | Crossref Full Text | Google Scholar

Yi, R., Li, Y., Shan, X. (2024). OPDA/dn-OPDA actions: biosynthesis, metabolism, and signaling. Plant Cell Rep. 43, 206. doi: 10.1007/s00299-024-03286-9

PubMed Abstract | Crossref Full Text | Google Scholar

Zandalinas, S. I., Mittler, R. (2022). Plant responses to multifactorial stress combination. New Phytol. 234, 1161–1167. doi: 10.1111/nph.v234.4

Crossref Full Text | Google Scholar

Zhang, J., Peer, W. A. (2017). Auxin homeostasis: the DAO of catabolism. J. Exp. Bot. 68, 3145–3154. doi: 10.1093/jxb/erx221

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, L., Zhao, H.-K., Dong, Q.-L., Zhang, Y.-Y., Wang, Y.-M., Li, H.-Y., et al. (2015). Genome-wide analysis and expression profiling under heat and drought treatments of HSP70 gene family in soybean (Glycine max L.). Front. Plant Sci. 6. doi: 10.3389/fpls.2015.00773

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, X.-Z., Zheng, W.-J., Cao, X.-Y., Cui, X.-Y., Zhao, S.-P., Yu, T.-F., et al. (2019). Genomic Analysis of Stress Associated Proteins in Soybean and the Role of GmSAP16 in Abiotic Stress Responses in Arabidopsis and Soybean. Front. Plant Sci. 10. doi: 10.3389/fpls.2019.01453

PubMed Abstract | Crossref Full Text | Google Scholar

Zhao, Y., Cao, P., Cui, Y., Liu, D., Li, J., Zhao, Y., et al. (2021). Enhanced production of seed oil with improved fatty acid composition by overexpressing NAD+-dependent glycerol-3-phosphate dehydrogenase in soybean. J. Integr. Plant Biol. 63, 1036–1053. doi: 10.1111/jipb.13094

PubMed Abstract | Crossref Full Text | Google Scholar

Zhao, L., Jia, T., Jiao, Q., Hu, X. (2022). Research Progress in J-Proteins in the Chloroplast. Genes 13, 1469. doi: 10.3390/genes13081469

PubMed Abstract | Crossref Full Text | Google Scholar

Zhou, X. N., Zheng, Y., Cai, Z., Wang, X., Liu, Y., Yu, A., et al. (2021). Identification and Functional Analysis of Tomato TPR Gene Family. Int. J. Mol. Sci. 22, 758. doi: 10.3390/ijms22020758

PubMed Abstract | Crossref Full Text | Google Scholar

Zhu, X., Pan, Y., Liu, Z., Liu, Y., Zhong, D., Duan, Z., et al. (2020). Mutation of YL Results in a Yellow Leaf with Chloroplast RNA Editing Defect in Soybean. Int. J. Mol. Sci. 21, 4275. doi: 10.3390/ijms21124275

PubMed Abstract | Crossref Full Text | Google Scholar

Zimmer, G., Miller, M. J., Steketee, C. J., Jackson, S. A., De Tunes, L. V. M., Li, Z. (2021). Genetic control and allele variation among soybean maturity groups 000 through IX. Plant Genome 14, e20146. doi: 10.1002/tpg2.20146

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: abiotic and biotic stresses, fatty acids, mGWAS, oil composition, phytohormone signaling, SNPs, soybean seeds

Citation: Bewick P, Forstner P, Zhang B and Collakova E (2025) Identification of novel candidate genes for regulating oil composition in soybean seeds under environmental stresses. Front. Plant Sci. 16:1572319. doi: 10.3389/fpls.2025.1572319

Received: 06 February 2025; Accepted: 26 March 2025;
Published: 17 April 2025.

Edited by:

Yi-Hong Wang, University of Louisiana at Lafayette, United States

Reviewed by:

Zhengge Zhu, Hebei Normal University, China
Bo-Keun Ha, Chonnam National University, Republic of Korea
Nadezhda Potapova, Institute for Information Transmission Problems (RAS), Russia

Copyright © 2025 Bewick, Forstner, Zhang and Collakova. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Patrick Bewick, cHdiQHZ0LmVkdQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.