- 1Plant Biotechnology Center, College of Agronomy, Jilin Agriculture University, Changchun, Jilin, China
- 2Gezira Research Station, Agricultural Research Corporation (ARC), Wad Madani, Sudan
Salinized soil can significantly hinder soybean growth, leading to a reduction in overall yield. To address this issue, identifying key genes related to salt tolerance in soybeans is essential for improving their resistance to salinity and ensuring sustainable development of soybean production. While current research predominantly focuses on salt tolerance during the seedling stage, there is still a lack of comprehensive studies on the genes involved in salt tolerance during the germination stage. This study established the optimal screening criteria by phenotyping the salt-tolerant variety R063 and the salt-sensitive variety W82 during the germination stage under salt stress. RNA-seq analysis was performed on 24 samples from both varieties at 36 and 48 hours under two different salt concentrations (0 and 150 mM/L NaCl). Differential expression analysis revealed that the salt-tolerant variety R063 exhibited the fewest differentially expressed genes (DEGs) compared to its control after 48 hours of salt stress. A total of 305 DEGs were commonly identified between the salt-tolerant variety R063 and the salt-sensitive variety W82 under salt stress at both time points. Additionally, 187 DEGs were commonly identified between R063 under salt stress and its corresponding control group across the two time points. Gene ontology (GO) enrichment analysis revealed that the differentially expressed genes were significantly enriched in ADP binding, monooxygenase activity, oxidoreductase activity, defense response, and protein phosphorylation signaling pathways. The weighted gene co-expression network analysis (WGCNA) method was employed to identify modules strongly correlated with salt tolerance during soybean germination. Candidate genes associated with soybean sprouting salt tolerance were identified by evaluating the connectivity and expression profiles of genes within these modules. These findings provide a theoretical foundation for further elucidating the molecular mechanisms underlying salt tolerance during soybean germination and present new genetic resources for studying this trait.
Introduction
According to incomplete statistics from UNESCO and the Food and Agriculture Organization of the United Nations (FAO, 2024), saline soils cover 1,381 million hectares, accounting for 10.7% of the world’s total land area. In China, the area of saline-alkaline land is approximately 3.6 × 107 hectares, accounting for 4.88% of the country’s total available land resources (Li et al., 2014). Soda saline-alkaline land in the western part of the Songnen Plain in northeastern China represents one of the world’s three largest concentrations of soda saline-alkaline soils (Wei et al., 2020). Soil salinization not only restricts the expected growth of plants but also significantly impacts the development of China’s national economy. Screening and breeding salt-tolerant crop varieties is one of the most effective strategies to mitigate the inhibitory effects of saline-alkaline soils on plant growth (Zou et al., 2022).
To cope with these stresses, plants have developed complex adaptive mechanisms through long-term evolutionary processes. While extensive research has been conducted on salt stress tolerance in soybeans, most studies have focused on tolerance during later stages of growth, with fewer investigations addressing salt tolerance during the germination stage. Ma et al. (2021) found that overexpression of the gene GmNFYA13 improved salt tolerance in transgenic soybean plants, while Li et al. (2017) reported that overexpression of the gene GmFDL19 enhanced salt tolerance in soybean seedlings. Hu et al. (2022) conducted transcriptome sequencing of both salt-tolerant and salt-sensitive soybean varieties during the seedling stage at three time points, identifying 6644 differentially expressed genes. Enrichment analysis revealed that these genes were involved in phytohormone signaling, redox processes, phenylpropanoid biosynthesis, mitogen-activated protein kinase (MAPK) signaling, and ribosome metabolism, all of which may play key roles in salt stress response. Additionally, through interlocking localization and genome-wide association studies, Zhang et al. (2019) identified the major QTL qST-8, which was significantly associated with salt tolerance during soybean seed germination.
Previous studies have shown that the salt tolerance mechanisms during germination and the corresponding genes differ from those identified at later growth stages. Additionally, the number of studies and reference identifications related to salt tolerance at the germination stage is limited, and the evaluation criteria for this stage do not fully align with those for other growth periods. Therefore, further investigation into the salt tolerance mechanisms during germination is required. For example, GmSALT3, a salt tolerance gene identified during the seedling stage, is unrelated to salt tolerance during the emergence stage (Liu et al., 2016). The focus on mining salt tolerance genes only during the seedling stage does not lead to a comprehensive improvement in soybean’s overall salt tolerance. If seeds lack salt tolerance during germination, the seedling emergence rate will be reduced, which can seriously affect soybean cultivation in saline soils.
The use of transcriptome sequencing technology is increasingly prevalent in the identification of plant salt tolerance genes. This technology enables more precise qualitative and quantitative analysis of the types and abundance of genes expressed in specific cells or tissues under defined conditions. It has been widely applied in crop research for gene mining and the exploration of mechanisms such as disease resistance and stress tolerance, providing novel insights and methodologies for functional genomics studies across different organisms (Imadi et al., 2015; Afzal et al., 2020). While transcriptome sequencing has become a well-established tool for gene discovery, fewer studies have integrated this technique with others to identify pivotal genes. WGCNA (Weighted Gene Co-expression Network Analysis) is a systems biology approach that utilizes transcriptomic sequencing data to construct gene co-expression networks. It primarily involves grouping genes with similar expression patterns into modules, building co-expression networks, and identifying key hub genes. WGCNA has been successfully applied to identify genes in rice (Wang et al., 2022), wheat (Luo et al., 2019), and maize (Ma et al., 2021). Yao et al. (2023)employed transcriptomics and WGCNA analyses to identify hub genes regulating seed size and oil content during the developmental stages of both wild and cultivated soybeans. Similarly, Tang et al. (2023) utilized transcriptomic and WGCNA analyses to uncover the involvement of hub genes related to drought stress in the tillering seedlings of sugarcane.
As one of the major food crops in China, soybean consumption has been steadily increasing in recent years (He et al., 2019). Seed germination represents a critical developmental stage in the life cycle of soybeans (Zhang et al., 2021), and abiotic stresses encountered during seed germination can slow down soybean seed germination, reduce germination rate, or lead to seed rot and death, resulting in large-scale seedling shortages and crop failure and causing significant economic losses to soybean production (Liu et al., 2020). Screening salt-tolerant soybean varieties and mining salt-tolerant genes are of great significance to the development and utilization of soybean germplasm resources. In this study, we used transcriptome and WGCNA technologies to study soybean seeds at the germination stage after salt stress. K-means clustering analysis and gene ontology (GO) enrichment analysis were performed for differential genes co-existing between salt-tolerant varieties and their controls at 2-time points and for differentially expressed genes (DEGs) co-existing between salt-tolerant and salt-sensitive varieties at 2-time points, respectively, and co-expression analyses were performed to identify the pivotal genes involved in salt stress tolerance in soybean during the germination period (Figure 1). This study provides a new perspective on understanding.

Figure 1. Technology roadmap. The workflow investigates salt stress responses in two rice varieties (salt-tolerant R063 and salt-sensitive Williams 82) through a multi-step approach. Key stages include:(1) Physiological and Agronomic Assessments: Measuring indices and traits to identify critical timepoints for transcriptomic analysis. (2) Transcriptome Profiling: RNA-seq identifies differentially expressed genes (DEGs), followed by GO enrichment and WGCNA to detect co-expressed gene modules. (3) Validation and Network Analysis: RT-qPCR confirms RNA-seq results, while co-expression networks highlight hub genes central to salt tolerance. (4) Integration of Data: Combining physiological, transcriptomic, and network analyses to pinpoint key regulatory genes, providing molecular insights for crop improvement under salinity stress.
This study has the following contributions:
● By analyzing the transcriptome data of salt-tolerant and salt-sensitive varieties of germinating soybeans, key genes related to salt tolerance were identified.
● By combining high-throughput transcriptome data and WGCNA analysis, it was possible to systematically identify hub genes that play key roles in the process of salt stress tolerance during the germination period and which play central roles in regulating the response to salt stress.
● WGCNA can help to reveal the co-expression patterns among genes, and further provide a theoretical basis for understanding the salt tolerance mechanism and improving the salt resistance of soybean.
Materials and methods
The “Plant Materials and Stress Treatment” subsection provides the foundation for the experiment, detailing the types of soybeans used, the treatment conditions, and the phenotyping approach. The “Water Absorption Rate”, “SOD Activity”, “POD Activity” and “MDA content” subsections focus on physiological measurements that assess the plants’ responses to salt stress, providing crucial data on the impact of salt stress at the biochemical level. RNA sequencing and WGCNA provide molecular insights into gene expression changes and network analysis, which are further validated through RT-qPCR. This multi-level approach—spanning phenotypic, physiological, biochemical, and molecular analyses—ensures a robust and comprehensive study of salt stress tolerance in soybeans.
Plant materials and stress treatment
Salt-tolerant soybean variety ‘R063’ as well as salt-sensitive soybean variety ‘W82’ were used in this study. Fifty soybean seeds were selected and placed flat in petri dishes with three replications in each group. The soybean seeds were sterilized with alcohol for 30 seconds and then washed with distilled water three times. The dark culture was incubated for two days and then shifted to the light culture environment. Seeds were subjected to salt stress treatments at varying concentrations (0, 100, 150, and 200 mM NaCl) to assess germination responses. Seed mass was measured at 12-hour intervals for a total of 72 hours. To maintain experimental conditions, water was replenished and filter paper was replaced every 24 hours, as described by Kan et al. (2015). The optimal salt concentration (150 mM NaCl) was determined through phenotypic screening of germination traits.
Imbibition rate
Select 10 seeds of uniform size for each sample, with three replicates per group. Initially, After absorbing water, soybean seeds were placed in an oven at 105°C, inactivated for 30 min, and then dried at 80°C to constant weight as . After soaking, gently blot the seeds with filter paper or cloth to remove excess surface moisture, then weigh the soaked seeds and record this total mass as (Smith et al., 1961). The formula to calculate the imbibition rate is given by:
. P1 is the dry weight of the seed, is the total weight of the seed after imbibition.
Superoxide dismutase activity
The activity of Superoxide Dismutase (SOD) was measured using the nitro blue tetrazolium (NBT) method (Fridovich, 1986). The reaction mixture consisted of 1.5 mL of 0.05 mol/L phosphate buffer, 0.3 mL of 130 mM methionine (Met) solution, 0.3 mL of 750 μM NBT solution, 0.3 mL of 100 μM EDTA-Na2, 0.3 mL of 20 μM riboflavin, and 0.05 mL of enzyme extract, making up a total reaction volume of 3 mL. The tubes containing the reaction mixture were incubated under a light intensity of 4000 lux at 25°C for 30 minutes. One unit of SOD activity is defined as the amount of enzyme required to inhibit the reduction rate of NBT by 50% under assay conditions. SOD activity can be calculated using the following formula:
. is the absorbance of the light control (the tube exposed to light). is the absorbance of the sample. is the total volume of the diluted sample (mL). is the sample used for measurement (mL). is the fresh weight of the sample (g). The total SOD activity is expressed in enzyme units per gram of fresh weight. Protein concentration is given as milligrams of protein per gram of fresh weight (mg/g).
Peroxidase activity assay using Guaiacol method
Determination of POD activity (Guaiacol method) (Gao et al., 2021).
1. Reagent Preparation: 100 mmol/L Phosphate Buffer Solution (PBS, pH 6.0): Prepare by mixing 61.5 mL of 1 M sodium phosphate dibasic (Na2HPO4) (Solution A) with 438.5 mL of 1 M sodium phosphate monobasic (NaH2PO4) (Solution B), then dilute to 1,000 mL with deionized water. The final pH is adjusted to 6.0 using a calibrated pH meter.
2. Reaction Mixture Preparation: Combine 50 mL of PBS (100 mmol/L, pH 6.0) with 28 μL of guaiacol (2-methoxyphenol) in a beaker. Heat gently with continuous stirring to dissolve the guaiacol. After cooling, add 19 μL of 30% (v/v) hydrogen peroxide (H2O2) and mix thoroughly. Store the solution at 4°C for subsequent use.
3. Sample Extraction: Accurately weigh 1.0 g of germinating soybean seeds and homogenize them in a pre-cooled mortar using an appropriate volume of PBS (pH 6.0). Centrifuge the homogenate at 4,000 × g for 15 min at 4°C. Transfer the supernatant to a 100 mL volumetric flask and dilute to the mark with PBS. Store the enzyme extract at 4°C until analysis.
Spectrophotometric Measurement: For each sample, prepare three test tubes: Blank Tube: Add 3.0 mL of reaction mixture and 1.0 mL of PBS. Test Tubes: Add 3.0 mL of reaction mixture and 1.0 mL of enzyme extract. Immediately initiate the reaction by adding the enzyme solution and record the absorbance at 470 nm at 30 s intervals for 5–10 measurements. Perform measurements sequentially to avoid interference from simultaneous reactions.
Enzyme Activity Calculation: POD activity is expressed as the change in optical density (ΔOD470) per minute, normalized to a standard unit. One unit (U) of POD activity is defined as the amount of enzyme required to catalyze the oxidation of guaiacol, resulting in a ΔOD470 of 0.01 per minute under the specified assay conditions. The activity is calculated using the formula:
. In the equation, ΔA470 represents the change in absorbance during the reaction period; W denotes the fresh weight (g) of the sample; t indicates the reaction time (min); Vt refers to the total volume of the enzyme extraction solution; and Vs corresponds to the volume of enzyme solution used during measurement.
Malondialdehyde content assay
MDA content was measured using a commercial MDA assay kit (Sangon Biotech, Shanghai, China).
(1) Reagent Addition and Reaction Protocol
The reaction was performed in two tubes: a test tube (containing the sample) and a blank tube (without the sample). Reagent volumes were as follows: MDA detection working solution: 600 μL (test and blank tubes); Sample supernatant: 200 μL (test tube only); Distilled water: 200 μL (blank tube only); Reagent Three: 200 μL (both test and blank tubes).
The mixture was incubated in a 100 °C water bath for 60 minutes (with lids tightened to prevent evaporation) and then cooled rapidly in an ice bath. After cooling, the samples were centrifuged at 10,000 × g for 10 minutes at room temperature, and the supernatant was transferred to 1 mL quartz cuvettes for absorbance measurement.
(2) Spectrophotometric Analysis and Calculation
Absorbance values were measured using a visible spectrophotometer at 532 nm and 600 nm. The corrected absorbance (ΔA) was calculated as follows:
A532a, A600a: Absorbance values of the test tube at 532 nm and 600 nm, respectively; A532b, A600b: Absorbance values of the blank tube at 532 nm and 600 nm, respectively.
The MDA content (nmol/g mass) was quantified using the formula:
The variables in the formula are defined as follows: V1: Total reaction volume, fixed at 1.0 × 10⁻³ L; ϵ: The MDA molar absorption coefficient, 1.55 × 105L/mol/cm; V2: Sample volume added, 0.2 ML. d: Cuvette light path diameter, 1 cm; V3: Volume of extract added, 1 ml; Cpr: Sample protein concentration, mg/L. W: Sample weight, g.
RNA sequencing
Transcriptome analysis was performed on 36-h and 48-h samples of salt-tolerant versus salt-sensitive seeds. Each treatment consisted of three replicates, yielding a total of 24 samples for RNA sequencing. Trizol reagent (Invitrogen, USA) was used to isolate total RNA from the samples. DNase I was used to remove contaminating genomic DNA from the RNA.RNA concentration and purity were detected using the UV spectrophotometric NanoDrop (NanoDropND-1000 UV-Vis spectrophotometer) (Smith et al., 1961). Qualified RNA was processed for RNA sequencing to construct RNA-seq libraries, which was performed by Biomarker Technologies Co., Ltd. (Beijing, China) on the Illumina sequencing platform. The raw image data generated by sequencing was converted from base detection to sequence data, called raw data/raw reads. The raw data from sequencing was quality assessed by FastQC, quality clipped by Trimmomatic to obtain relatively accurate valid data, and the valid data from the samples were aligned to the reference genome using HISAT2. The Glycine max cv. Williams 82 reference genome (Wm82.a2.v1) and corresponding gene annotation files were obtained from SoyBase [https://www.soybase.org/dlpages/index.php], a curated soybean genomic resource. Genes with |log2 fold-change| ≥ 1 and FDR< 0.05 were considered differentially expressed. GO enrichment analysis of differentially expressed genes was performed using the Database for Annotation, Visualization, and Integrated Discovery (DAVID) (Sherman et al., 2022). Selection pathway enriched to differential genes greater than 6 and less than 1500, and all false discovery rate (FDR) adjusted p-values (q-values) with q< 0.05 were considered significant (Jhan et al., 2023).
WGCNA
WGCNA was performed on the transcriptome data using R software version 4.2.2 and the WGCNA package (Langfelder and Horvath, 2008; Wang et al., 2023). The soft threshold was determined based on the scale-free network distribution principle. The threshold parameter β for creating the adjacency matrix was selected when the fit curve first approached 0.8 (Zhao et al., 2024). Set the soft threshold to β = 15 to realize a scale-free network. The adjacency matrix was transformed into a Topological Overlap Matrix (TOM) using the TOM similarity function, thereby constructing the gene connectivity network (Dileo et al., 2011). Finally, gene modules were generated based on high gene significance (GS, the association between gene expression and traits) and module membership (MM, the correlation between gene expression and module eigengenes) (Azam et al., 2023). These gene modules were clustered using the dynamic tree-cut method, and the co-expression networks were visualized using Cytoscape version 3.10.0 (Shannon et al., 2003). Chromosomal localization of hub genes using TBtools in R (Chen and Xia, 2022).
Quantitative real-time PCR verification
RNA was extracted from salt-tolerant and salt-sensitive varieties treated with zero mM and 150 mM NaCl at 36 hours and 48 hours post-treatment using Trizol and then reverse-transcribed into cDNA. Specific quantitative primers were designed using Primer Premier 5 software (Supplementary Table S1) for RT-qPCR reactions, with Glyma.12G051100 as the reference gene. Each sample included three biological replicates. The relative expression levels of the genes were calculated using the 2-ΔΔCt method (Amorim et al., 2018), and graphs were generated using GraphPad Prism 8 software.
Statistical analysis
All data were analyzed using one-way analysis of variance (ANOVA) to assess significant differences among treatments and time points. Duncan’s multiple range test was applied for post hoc comparisons at a significance level of p< 0.05. All statistical analyses and visualizations were performed using GraphPad Prism 8. Data are presented as means ± standard error (n = 3).
Results
Phenotypic response of soybean seeds to salt stress
Taking R063 and W82 as materials, the seeds were treated with 150 mmol/L NaCl for three days, respectively, during the seed germination period; with the increase of salt stress time, the germination rate of the two varieties appeared to be significantly changed, and it was observed that the seeds showed significant changes in germination rate between 36 h and 48 h, and the germination rate of W82 was affected by the salt stress more than the germination rate of R063 was affected seriously (Figures 2A, B). Analysis of water absorption rates revealed that over time, the rate for R063 gradually increased, whereas that for W82 showed a gradual decline. This indicates that salt stress significantly restricted the water absorption rate of W82 but had a lesser effect on R063 (Figures 2C, D).

Figure 2. The critical stages of soybean seed response to salt stress at 36h and 48h. (A) Seed phenotypes of W82 (left) and R063 (right) varieties under control conditions and salt stress at 36 hours. (B) Seed phenotypes of W82 (left) and R063 (right) varieties under control conditions and salt stress at 48 hours. (C) Dynamic changes in water absorption rates over time for the control groups of W82 and R063 soybean varieties. (D) Dynamic changes in water absorption rates over time for W82 and R063 soybean varieties under salt stress. Scale bar, 1 cm. Data are presented as means ± standard error (n = 3). Different lowercase letters indicate statistically significant differences among time points within the same treatment group at p< 0.05, as determined by one-way ANOVA followed by Duncan’s multiple range test.
Analysis of SOD activity
SOD is a key antioxidant enzyme for plants to cope with oxidative stress, which reduces the accumulation of reactive oxygen species (ROS) and mitigates the oxidative damage of salt stress on the cell membrane system by catalyzing the conversion of superoxide anion radicals (O2⁻) to H2O2 and O2. In this study, salt-tolerant variety R063 and salt-sensitive variety W82 were subjected to salt stress treatment (150 mmol/L NaCl) by hydroponics, and the samples were collected at six time points—12, 24, 36, 48, 60, and 72 h, respectively—and the SOD activity was determined using the nitrogen blue tetrazolium (NBT) photoreduction method. The results showed that under normal conditions, the SOD activity of R063 was significantly higher than that of W82 and reached a peak at 36 h and then stabilized, while the SOD activity of W82 reached a peak at 48 h and then gradually decreased. Under salt stress, the SOD activity of R063 was significantly higher than that of W82, and its trend was consistent with that of the control, while the activity of W82 fluctuated greatly after salt stress. This result indicated that R063 effectively alleviated salt stress-induced ROS accumulation by maintaining a higher and stable SOD activity (Figures 3A, B).

Figure 3. Changes in SOD activity during germination in soybean with different tolerances. (A) Changes in SOD activity of W82 and R063 at six time points under normal conditions. (B) Changes in SOD activity at six time points under salt stress conditions in W82 and R063. Data are presented as means ± standard error (n = 3). Different lowercase letters indicate statistically significant differences among time points within the same treatment group at p< 0.05, as determined by one-way ANOVA followed by Duncan’s multiple range test.
Analysis of POD activity
POD is an important enzyme for H2O2 scavenging in plants, and changes in its activity can reflect the intensity of plant response to oxidative stress. The results of this study showed that the POD activities of both R063 and W82 peaked at 24 h under normal conditions. Under salt stress, the POD activities of R063 and W82 had the lowest activity at 24 h and the peak activity at 48 h. The POD activities of R063 and W82 were also found to be the lowest at 24 h and the peak activity at 48 h (Figures 4A, B).

Figure 4. Changes in POD activity during germination in soybean with different tolerances. (A) Changes in POD activity of W82 and R063 at six time points under normal conditions. (B) Changes in POD activity at six time points under salt stress conditions in W82 and R063. Data are presented as means ± standard error (n = 3). Different lowercase letters indicate statistically significant differences among time points within the same treatment group at p< 0.05, as determined by one-way ANOVA followed by Duncan’s multiple range test.
Analysis of MDA content
MDA is a hallmark product of membrane lipid peroxidation, and its content can reflect the degree of cell membrane damage. The results of this study showed that, under normal conditions, the MDA content of R063 showed a tendency to increase and then decrease from 12 to 36 h, and then increased again and then decreased from 48 to 72 h. The MDA content of W82 decreased continuously from 12 to 36 h, but increased significantly after 36 h. The MDA content of G68 was significantly lower than that of Williams82 under salt stress, and its fluctuation was lower than that of Williams82. Under salt stress, the MDA content of R063 was significantly lower than that of W82, and the fluctuation of MDA content was small, indicating that the degree of membrane lipid peroxidation was low (Figures 5A, B).

Figure 5. Changes in MDA content during germination of different tolerant soybeans. (A) Changes in MDA content of W82 and R063 at six time points under normal conditions. (B) Changes in MDA content at six time points under salt stress conditions in W82 and R063. Data are presented as means ± standard error (n = 3). Different lowercase letters indicate statistically significant differences among time points within the same treatment group at p< 0.05, as determined by one-way ANOVA followed by Duncan’s multiple range test.
Comprehensive analysis of physiological indicators and selection of transcriptome sequencing time points
Based on a detailed analysis of antioxidant enzyme activity under salt stress, we selected 36 hours and 48 hours as the key time points for transcriptome sequencing. Although POD activity peaked at 24 hours under control conditions, its trend differed markedly under salt stress. Specifically, POD activity was lowest at 24 hours and increased substantially afterward, reaching a peak at 48 hours in both the salt-tolerant (R063) and salt-sensitive (W82) varieties. This delayed response suggests that the activation of peroxidase-dependent antioxidant defense mechanisms occurs later during salt stress.
In contrast, SOD activity showed the largest divergence between R063 and W82 at 36 hours, reflecting early-stage antioxidant activation. At 48 hours, the increase in MDA content in W82 indicated substantial oxidative damage, while POD activity reached its maximum, this suggests that a stress-induced enhancement of peroxidative enzyme activity serving to restore redox homeostasis. Therefore, 36 hours represents the initiation of the stress response, and 48 hours reflects the transition to stress adaptation or damage control. These two stages provide meaningful insight into the physiological and molecular mechanisms of salt tolerance.
In summary, 36 h and 48 h were selected for subsequent transcriptome sequencing, which provided key data support for revealing the molecular mechanism of salt tolerance and the mining of resistance genes in soybean during the germination period.
Transcriptome data analysis and quality assessment
Transcriptome sequencing was performed on 24 cDNA libraries constructed from R063 and W82 varieties and treated with 0 and 150 mmol/L NaCl at two time points (36 and 48 h). After filtering the 36-hour sequencing data, a total of 480,730,516 clean reads were obtained, with a Q30 ratio exceeding 95.81%, a GC content greater than 44.42%, and an overall alignment rate greater than 98% (Supplementary Table S2). Similarly, after filtering the 48-hour sequencing data, a total of 457,842,976 clean reads were obtained, with a Q30 ratio exceeding 95.82%, a GC content greater than 44.67%, and an overall alignment rate exceeding 98%. These results confirm that the transcriptome data were of high quality and suitable for downstream analysis (Supplementary Table S3). Group B corresponds to the 36-hour salt-treated Williams82 group, Group C represents the 36-hour control R063 group, and Group D is the 36-hour salt-treated R063 group (Supplementary Table S4). Group F corresponds to the 48-hour salt-treated Williams82 group, Group G represents the 48-hour control R063 group, and Group H is the 48-hour salt-treated R063 group (Supplementary Table S5).
Differential expression gene analysis
To investigate the regulatory mechanisms of salt-tolerant genes during soybean seed germination, pairwise comparisons were conducted among four cDNA libraries: D-VS-C, D-VS-B, H-VS-G, and H-VS-F. The groups were defined as follows: B group: W82 treated with salt at 36 hours; C group: R063 control at 36 hours; D group: R063 treated with salt at 36 hours; F group: W82 treated with salt at 48 hours; G group: R063 control at 48 hours; H group: R063 treated with salt at 48 hours. The comparison results revealed the following: D-VS-C identified 6,591 DEGs, including 3,157 upregulated and 3,434 downregulated genes (Figure 6B). D-VS-B identified 656 DEGs, with 415 upregulated and 241 downregulated genes (Figure 6A). H-VS-G showed 438 DEGs, comprising 118 upregulated and 320 downregulated genes (Figure 6D). H-VS-F exhibited 826 DEGs, including 509 upregulated and 317 downregulated genes (Figure 6C).

Figure 6. RNA-seq analysis of W82 and R063 under salt stress at 36h and 48h stages. (A) Volcano plot of DEGs for D-VS-B. (B) Volcano plot of DEGs for D-VS-C. (C) Volcano plot of DEGs for H-VS-F. (D) Volcano plot of DEGs for H-VS-G. (E) Venn diagram showing the overlap of DEGs among the four comparison groups. (F) Venn diagram of DEGs shared between 36 h and 48 h salt stress treatments in R063, relative to controls. (G) Venn diagram of DEGs shared between salt-tolerant and salt-sensitive genotypes under salt stress at 36 h and 48 h.
DC refers to the DEGs identified between the salt-tolerant soybean variety R063 under salt stress and its corresponding control group at 36 hours. HG refers to the DEGs identified between R063 under salt stress and its control group at 48 hours. DC-VS-HG represents the common DEGs identified between the salt-stressed R063 and its control group at both 36 and 48 hours. DB refers to the DEGs identified between the salt-tolerant variety R063 and the salt-sensitive variety W82 under salt stress at 36 hours. HF refers to the DEGs identified between R063 and W82 under salt stress at 48 hours. DB-VS-HF denotes the DEGs commonly identified between R063 and W82 under salt stress at both 36 and 48 hours. DC-VS-HG identified 187 DEGs (Figure 6G), with 47 upregulated and 139 downregulated genes across both time points. Notably, Glyma.17G041200 was upregulated at 36 hours but downregulated at 48 hours. DB-VS-HF identified 305 DEGs (Figure 6F), with 164 upregulated and 141 downregulated genes. A summary of the four sets of comparisons revealed no co-occurring differential genes (Figure 6E).
Enrichment analysis of differentially expressed genes
The identified DEGs were functionally categorized into three main classes: Molecular Function (MF), Biological Process (BP), and Cellular Component (CC). Gene Ontology (GO) enrichment analysis of the D-VS-B comparison revealed the top 10 significantly enriched pathways, as follows (Figure 7A): MF includes six pathways: ADP binding (GO:0043531), signaling receptor activity (GO:0038023), abscisic acid binding (GO:0010427), monooxygenase activity (GO:0004497), oxidoreductase activity (GO:0016705), protein phosphatase inhibitor activity(GO:0004864); BP includes four pathways: defense response to other organisms (GO:0098542), defense response (GO:0006952), abscisic acid-activated signaling pathway (GO:0009738), response to biotic stimulus (GO:0009607). Among these, the most significantly enriched pathway was ADP binding (GO:0043531). In a study by Wang et al. (2020) investigating salt-responsive miRNAs and phasiRNAs in soybean roots, GO enrichment analysis of candidate genes also revealed significant enrichment in ADP binding, indicating that this pathway plays a critical role not only in soybean root responses to salt stress but also in soybean seed germination under salt stress.

Figure 7. Functional enrichment and clustering trends of DEGs under salt stress in soybean. This figure illustrates the GO enrichment analysis and expression trend clustering of DEGs identified between tolerant and sensitive soybean varieties under salt stress. (A) GO enrichment circle plot for the D-VS-B comparison (salt-tolerant R063 vs. salt-sensitive W82 at 36 hours under salt stress). (B) GO enrichment circle plot for the H-VS-F comparison (R063 vs. W82 at 48 hours under salt stress). (C) K-means clustering analysis of co-expressed DEGs shared across comparisons. Heatmaps on the left depict gene expression trends over time; corresponding enriched GO terms on the right reveal temporal dynamics of stress-response processes.
In the H-VS-F comparison, the top 10 significantly enriched pathways (Figure 7B) were categorized as follows: MF includes four pathways: ADP binding (GO:0043531), oxidoreductase activity (GO:0016705), monooxygenase activity (GO:0004497), transmembrane receptor protein tyrosine kinase activity (GO:0004714); BP includes four pathways: DNA unwinding involved in DNA replication (GO:0006268), defense response to other organisms (GO:0098542), double-strand break repair via break-induced replication (GO:0000727), and defense response (GO:0006952); CC includes two pathways: THO complex (GO:0000347), MCM complex (GO:0042555). The most significantly enriched pathway was ADP binding (GO:0043531).
It was observed that at both time points, the D-VS-B and H-VS-F comparisons were enriched in the same pathways, including ADP binding (GO:0043531), monooxygenase activity (GO:0004497), oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (GO:0016705), defense response (GO:0006952), defense response to other organism (GO:0098542). Notably, ADP binding (GO:0043531) was the most significantly enriched pathway in both comparisons. To enhance the understanding of the dynamic expression trends of differential genes, using K-means clustering. The differential genes in each category were subsequently analyzed for GO enrichment. Differential genes shared between the salt-tolerant and salt-sensitive varieties at both time points were grouped into three categories (Figure 7C). The first category consisted of 110 DEGs significantly enriched in ADP binding (GO:0043531). The second category, comprising 107 DEGs, was significantly enriched in the assembly of the large subunit precursor of preribosomes. The third category included 88 DEGs, was significantly enriched in the oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (GO:0016705).
In the D-VS-C comparison, the top 10 significantly enriched pathways (Figure 8A) were categorized as follows: MF includes three pathways: lactoperoxidase activity (GO:0140825), structural constituent of chromatin (GO:0030527), peroxidase activity (GO:0004601); BP includes seven pathways: protein complex oligomerization (GO:0051259), response to heat (GO:0009408), response to hydrogen peroxide (GO:0042542), response to salt stress (GO:0009651), protein folding (GO:0006457), hydrogen peroxide catabolic process (GO:0042744); CC includes one pathway: nucleosome (GO:0000786). The most significantly enriched pathway was protein complex oligomerization (GO:0051259), while the pathway with the highest number of DEGs was protein folding (GO:0006457). In a study by Gangadhar et al. (2014), which utilized a large-scale yeast functional screening system to explore genes responsive to high-temperature stress in potatoes, it was found that DEGs were significantly enriched in the protein folding pathway. This suggests that the protein folding pathway plays a crucial role not only in plant responses to high-temperature stress but also in salt stress tolerance.

Figure 8. GO functional enrichment and temporal clustering of DEGs in the salt-tolerant soybean genotype R063 under salt stress. This figure displays the functional enrichment results and expression trend dynamics of DEGs in R063 under salt treatment at two time points. (A) D-VS-C differentially expressed gene GO enrichment circle plot. (B) H-VS-G differentially expressed gene GO enrichment circle plot. (C) Clustering trend analysis of differentially expressed genes using the k-means algorithm. The trend plots and the corresponding GO annotations are shown on the left and right sides of the heatmaps, respectively.
In the H-VS-G comparison, the top 10 significantly enriched pathways (Figure 8B) were categorized as follows: MF includes seven pathways: fatty acid binding (GO:0005504); BP includes seven pathways: systemic acquired resistance (GO:0009627), DNA replication initiation (GO:0006270), response to chitin (GO:0010200), deoxyribonucleotide biosynthetic process (GO:0009263), DNA unwinding involved in DNA replication (GO:0006268), DNA replication (GO:0006260), and response to stress (GO:0006950); CC includes one pathways: extracellular region (GO:0005576). The most significantly enriched pathway was systemic acquired resistance (GO:0009627), while the pathway with the highest number of DEGs was the extracellular region (GO:0005576). In a study by Li et al. (2021), which combined transcriptomics and metabolomics analyses on the salt-tolerant plant Nitraria sibirica Pall., GO enrichment analysis of the identified DEGs revealed significant enrichment in the extracellular region pathway. This suggests that this pathway plays a crucial role in salt stress tolerance.
To enhance the understanding of the dynamic expression trends of differential genes, we categorized the differential genes that co-occurred between salt-tolerant varieties and their controls at both time points, using k-means clustering. The differential genes shared between the salt-tolerant varieties and their controls at both time points were divided into four categories (Figure 8C). The first category included 62 DEGs, which were significantly enriched in DNA unwinding involved in DNA replication (GO:0006268). The second category comprised 32 DEGs, which were significantly enriched in the deoxyribonucleotide biosynthetic process (GO:0009263). The third category, consisting of 62 DEGs, was nucleosome (GO:0000786). Finally, the fourth category contained 69 DEGs with no significant enrichment in any signaling pathway.
RT-qPCR analysis of gene expression
To validate the accuracy of the sequencing results, four candidate genes were randomly selected from the DEGs genes (Glyma.05G000050 and Glyma.05G219900) and two downregulated genes (Glyma.14G213600 and Glyma.19G135600). Similarly, four candidate genes were randomly selected from the DEGs identified in the comparison between Group H and Group F, including two upregulated genes (Glyma.14G223700 and Glyma.19G072702) and two downregulated genes (Glyma.05G175500 and Glyma.17G172400). A heatmap of the FPKM values for these eight candidate genes was generated (Figure 9). RT-qPCR analysis was performed on all eight candidate genes. Glyma.12G051100 was used as the internal reference gene. The relative mRNA expression levels of the target genes were quantified using the Stratagene Mx3005P Real-Time PCR System (Agilent Technologies) and calculated using the 2-ΔΔCt method. The RT-qPCR results were highly consistent with the transcriptome data, indicating a high level of reliability in the RNA-Seq results. Notably, Glyma.17G172400 exhibited a relatively high expression level in the W82 salt-treated group, whereas Glyma.05G219900 showed a relatively low expression level in the same group. These two genes may serve as promising candidates for future transgenic experiments and functional validation studies.

Figure 9. Candidate gene expression validation(A–H). FPKM heatmap of genes(I). The abscissa in the figure is the sample name, and the ordinate is the relative expression. (** means P ≤ 0.01 in T test, *** means P ≤ 0.005in T-test, **** means P ≤ 0.0001 in T-test).
WGCNA analysis
WGCNA is a widely employed systems biology methodology for constructing gene co-expression networks, utilizing high-throughput messenger RNA (mRNA) expression data. The underlying assumption of the algorithm is that the gene network follows a scale-free distribution. The process begins by defining a gene co-expression correlation matrix, followed by the computation of an adjacency function to establish the gene network. Next, the algorithm calculates the topological dissimilarity between nodes and constructs a hierarchical clustering tree based on these dissimilarity measures. The branches of the resulting tree correspond to distinct gene modules, with genes within the same module exhibiting high co-expression, while genes from different modules demonstrate low co-expression (Zhu et al., 2021; Zheng et al., 2023; Zhou et al., 2024).
In this experiment, we performed WGCNA on RNA sequencing (RNA-seq) data derived from 24 samples representing two soybean varieties at two-time points under various concentrations of salt treatments during the germination phase. A total of 52,872 differential genes were analyzed. Initially, gene expression data were screened, and the log2-transformed expression values (after adding 1 to each value) were used to filter out genes. To construct a robust co-expression network, we prioritized the top 5,000 genes with the highest median absolute deviation (MAD) across all samples. This selection aimed to focus the WGCNA analysis on genes exhibiting the most pronounced expression variability, a well-established filtering strategy that enhances module detection accuracy and minimizes background noise. WGCNA was then applied to construct a hierarchical clustering tree based on gene expression correlations, dividing the genes into distinct modules. Each module is represented by a different color in the clustering tree. Genes within the same module exhibit similar expression patterns, suggesting they may be functionally related and can be grouped accordingly. In the upper half of the tree, the vertical distance between two nodes (genes) represents their dissimilarity, while the horizontal distance is not relevant.
DEGs were organized into 11 co-expression modules, each assigned a unique color for visualization (Figure 10A). The correlation between these modules and DEGs was evaluated based on expression pattern similarity. As depicted in Figure 10B, the correlation coefficients and their corresponding P-values between modules and phenotypes were quantified, with larger absolute values indicating stronger relationships (red for positive correlations, blue for negative correlations). Specifically, labels A–D correspond to the 36-hour time point for the W82 control group, W82 salt-treated group, R063 control group, and R063 salt-treated group, respectively, while E–H represent the 48-hour time point for these groups.

Figure 10. WGCNA. (A) Gene Cluster Dendrogram and Module Detection: Displays the hierarchical clustering tree of genes and the detected modules. (B) Module-Trait Correlation Heatmap: Illustrates the correlations between modules and sample traits. Each box represents a module-trait relationship, with red indicating a positive correlation and blue indicating a negative correlation. (C) Co-expression Network of Top 20 Hub Genes in the Red Module: Visualizes the co-expression relationships among the top 20 hub genes within the red module. (D) GO Enrichment Bubble Plot for DEGs in the Red Module: Depicts the GO enrichment analysis results for differentially expressed genes within the red module. (E) Chromosomal Localization of the Top 10 Hub Genes: Maps the chromosomal positions of the top 10 hub genes. The x-axis represents different treatments in these figures, and the y-axis represents different modules. Candidate hub genes are highlighted in red. Each node represents a gene, and each edge represents the co-expression correlation between genes.
Ten hub genes were identified as central nodes in the co-expression network using Cytoscape: Glyma.01G238800, Glyma.11G051800, Glyma.06G319520, Glyma.16G212600, Glyma.10G253100, Glyma.20G013400, Glyma.18G148700, Glyma.15G232200, Glyma.07G243500, and Glyma.08G277000 (Figure 10C). Notably, genes in the red module exhibited distinct expression dynamics: they were upregulated in groups D and H (R063 salt-treated at both time points) but downregulated in groups B and F (W82 salt-treated at both time points).
GO enrichment analysis of red module DEGs (Figure 10D) revealed significant overrepresentation in biological processes related to symporter activity (GO:0015293), monosaccharide transmembrane transporter activity (GO:0015145), and monosaccharide transmembrane transport (GO:0015749). Chromosomal localization of the top 10 hub genes is illustrated in Figure 10E. Further comparative analysis between the red module and the DB-VS-HF dataset identified 24 overlapping DEGs, among which Glyma.16G212600, Glyma.08G277000, and Glyma.15G232200 were also classified as hub genes. Quantitative fluorescence assays confirmed the expression patterns of these three core genes, aligning with transcriptomic data (Supplementary Figure 1).
In this study, Gene Ontology enrichment analysis indicated that Glyma.08G277000 is involved in “transketolase or transaldolase activity” (GO:0016744), the “1-deoxy-D-xylulose-5-phosphate biosynthetic process” (GO:0052865), and “1-deoxy-D-xylulose-5-phosphate synthase activity” (GO:0008661). These functions are key components of the methylerythritol phosphate (MEP) pathway (Cordoba et al., 2009), which is responsible for the plastid-localized biosynthesis of isoprenoid precursors such as carotenoids and abscisic acid (ABA). The observed salt-induced expression of Glyma.08G277000 in the tolerant genotype suggests that it may contribute to salt stress adaptation by enhancing the metabolic flux through the MEP pathway. Collectively, these findings imply that Glyma.08G277000 may play a pivotal role in coordinating metabolic and hormonal responses to salt stress during soybean seed germination.
Discussion
Salt stress significantly hampers plant development, with seed germination being particularly vulnerable due to the sensitivity of early-stage cellular processes (Shi et al., 2022). In soybeans, successful germination under salinity conditions is a prerequisite for stand establishment and yield stability in saline soils. To survive salt-induced osmotic and oxidative stress, soybeans have evolved multilayered adaptive mechanisms involving ion homeostasis, osmotic adjustment, antioxidative defense, and stress-responsive gene regulation (Leung et al., 2023). Previous studies have identified several key genes that play fundamental roles in the salt tolerance mechanisms of soybean varieties, including Glyma.02G228100, Glyma.03G226000, Glyma.03G031000, Glyma.03G031400, Glyma.04G180300, Glyma.04G180400, Glyma.05G204600, Glyma.08G189600, Glyma.13G042200, and Glyma.17G17320 (Zeng et al., 2019). In this study, our data revealed that Glyma.03G031400 and Glyma.13G042200 were present in both the DB-VS-HF comparison, while Glyma.04G180400 was only found in the D-VS-B comparison.
Notably, we observed a sharp reduction in the number of DEGs in the salt-tolerant R063 between 36 hours and 48 hours post salt treatment, from 6,591 to only 438. This rapid contraction in transcriptomic activity may suggest that R063 possesses a fast-acting regulatory mechanism, enabling swift stress perception and subsequent stabilization of gene expression. Based on these dynamics, we hypothesize that the 36-hour time point may represent an “activation phase” in which the plant initiates extensive transcriptional reprogramming in response to salt stress, while the 48-hour point may correspond to a “dynamic equilibrium phase” reflecting a return toward homeostasis. We emphasize that these terms represent our interpretation of the observed gene expression patterns, rather than definitive physiological states. This proposed framework aligns with models of rapid adaptation and transcriptional stabilization that have been previously associated with abiotic stress tolerance.
Interestingly, when comparing the tolerant and sensitive genotypes under salt stress (D-VS-B and H-VS-F), the DEG numbers were surprisingly low (656 and 826, respectively), which implies that R063’s advantage may not arise from an amplified inducible response, but rather from pre-configured gene expression or post-transcriptional regulation. This observation supports the notion of a ‘primed’ or ‘readiness’ state in salt-tolerant seeds, a concept previously reported in priming studies (Johnson and Puthur, 2021).
The physiological responses of the salt-tolerant genotype R063 under stress conditions were consistent with its transcriptomic profile. Compared to the sensitive variety, R063 showed significantly elevated levels of SOD and POD activities, while malondialdehyde MDA content was markedly lower. These results suggest that R063 is better equipped to neutralize reactive ROS and minimize lipid peroxidation in cell membranes—key mechanisms for surviving salinity stress. Interestingly, the physiological data align with the transcriptomic findings. For instance, GO terms such as “oxidoreductase activity” and “defense response” were enriched among DEGs, directly linking to antioxidant functions.
Through statistical analysis of the GO enrichment results for multiple groups of DEGs, we identified ten key pathways significantly associated with soybean tolerance to salt stress. These pathways include ADP binding (GO:0043531), monooxygenase activity (GO:0004497), oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (GO:0016705), defense response (GO:0006952), defense response to other organism (GO:0098542), oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (GO:0016705), DNA unwinding involved in DNA replication (GO:0006268), deoxyribonucleotide biosynthetic process (GO:0009263), nucleosome (GO:0000786). These pathways play critical roles in soybean’s resistance to salt stress, and our findings are consistent with previous reports. For instance, Feng et al. (2017) conducted a GO enrichment analysis on DEGs from wild emmer wheat under salt stress, identifying significant enrichment in protein phosphorylation, ATP binding, and ADP binding pathways. Similarly, Li et al. (2019) analyzed DEGs in transgenic poplars under salt stress and found a prominent expression of monooxygenase and oxidoreductase activities. The enrichment of GO terms such as “ADP binding” among DEGs under salt stress suggests a mechanistic connection between energy metabolism and stress adaptation. ADP-binding proteins, including H+-ATPases and kinases, are central to maintaining energy-demanding processes critical for salt tolerance (Guo et al., 2022). This analysis bridges broad GO enrichments to functional insights, advancing our understanding of salinity resilience at the molecular level.
WGCNA is a powerful tool for identifying gene sets that exhibit highly coordinated variation and for mining candidate genes through the interconnectivity and associations between gene sets and phenotypes. WGCNA has been widely applied across various species to identify candidate genes regulating target traits (Lu et al., 2019; Chen et al., 2020; Ma et al., 2021). In order to identify hub genes associated with salt tolerance in soybeans, we performed WGCNA using FPKM values of DEGs. The results revealed 11 distinct modules (Figure 10B). In the red module, we identified 10 core genes, among which Glyma.18G148700 and Glyma.08G277000 were found to be upregulated under acidity stress and downregulated under drought stress (Jahan et al., 2019). These genes were also expressed following salt stress treatment. A comparison between the differentially expressed genes obtained from the DB-VS-HF dataset and those identified in the red module revealed 24 overlapping genes, including Glyma.16G212600, Glyma.08G277000, and Glyma.15G232200, all of which appeared among the top 10 core genes. At present, this study has not conducted further validation of these core genes. Future work will involve more in-depth functional analysis of these core genes, as well as a comprehensive investigation into the signaling pathways that were significantly enriched with differential genes. Based on GO annotations and prior literature, we propose the following hypotheses: Glyma.08G277000 encodes a transketolase involved in the methylerythritol phosphate (MEP) pathway, which regulates plastidial isoprenoid synthesis, including abscisic acid (ABA)—a critical hormone for salt response. Its upregulation may enhance ABA production during germination.
This study integrated RNA-seq technology with WGCNA to systematically investigate the transcriptomic response mechanisms of soybean under salt stress, providing molecular insights into the genetic basis of salt tolerance in soybean. The results revealed that salt stress enhances salt tolerance in soybean by regulating multiple key genes and signaling pathways. GO enrichment analysis indicated that DEGs were significantly enriched in the ADP-binding pathway. Furthermore, WGCNA analysis identified three hub genes closely associated with salt stress responses: Glyma.16G212600, Glyma.08G277000, and Glyma.15G232200, which serve as molecular targets for future studies on improving soybean salt tolerance. This study offers testable molecular targets for improving salt resilience in early-stage soybean development.
Conclusion
This study comprehensively examined salt stress effects on soybean seed germination using physiological assays and transcriptomic profiling. The findings revealed that SOD plays a crucial regulatory role in salt stress tolerance during soybean germination. In addition, two potential salt-tolerant genes, Glyma.03G031400 and Glyma.13G04220, were identified and verified in this study, building upon previous research data. Through a combined analysis of transcriptome sequencing and WGCNA, we identified key genes associated with salt tolerance during soybean germination. GO enrichment analysis of differentially expressed genes revealed that the ADP binding (GO:0043531) signaling pathway is integral to salt stress tolerance during soybean germination. Further WGCNA analysis identified hub genes, all of which play significant roles in regulating salt stress tolerance during soybean germination. Among these, Glyma.16G212600 was found to be the most critical gene. This study provides valuable theoretical insights and practical experience for the identification of genes related to salt stress tolerance during soybean germination, offering a robust genetic resource for future research in this area.
Data availability statement
The data presented in the study are deposited in the NCBI SRA repository, accession number PRJNA 1252398.
Author contributions
LP: Writing – original draft, Methodology, Software. YC: Investigation, Software, Writing – original draft. ZR: Conceptualization, Methodology, Supervision, Writing – original draft. DK: Methodology, Software, Writing – original draft. SW: Investigation, Methodology, Writing – original draft. YL: Methodology, Software, Writing – original draft. SI: Investigation, Software, Writing – original draft. SF: Conceptualization, Formal Analysis, Supervision, Writing – review & editing. YS: Formal Analysis, Supervision, Writing – review & editing. ZZ: Conceptualization, Funding acquisition, Project administration, Writing – review & editing. JW: Conceptualization, Formal Analysis, Supervision, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This research was funded by the Key Agricultural Technology Research and Development Program of Jilin Province, China (Grant No. 20240303011NC). Additional support was provided by the Science and Technology Research Program of the Jilin Provincial Department of Education (Grant No. JJKH20240437KJ). The project also received funding from the Changchun Science and Technology Bureau through the initiative titled “Integration and Demonstration of Screening and Supporting Technologies for Soda Salt-Tolerant Varieties of Northern Spring Soybeans” (Project Code: 24GXYSZZ31), as part of the Pilot Agricultural Project Selection Program for Universities and Colleges.
Conflict of interest
Authors DK and SI were employed by the company Agricultural Research Corporation ARC.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2025.1569565/full#supplementary-material
References
Afzal, M., Alghamdi, S. S., Migdadi, H. H., Khan, M. A., Mirza, S. B., and El-Harty, E. (2020). Legume genomics and transcriptomics: From classic breeding to modern technologies. Saudi J. Biol. Sci. 27, 543–555. doi: 10.1016/j.sjbs.2019.11.018
Amorim, L. L. B., Ferreira-Neto, J. R. C., Bezerra-Neto, J. P., Pandolfi, V., De Araújo, F. T., Da Silva Matos, M. K., et al. (2018). Cowpea and abiotic stresses: identification of reference genes for transcriptional profiling by qPCR. Plant Methods 14, 1–17. doi: 10.1186/s13007-018-0354-z
Azam, M., Zhang, S., Li, J., Ahsan, M., Agyenim-Boateng, K. G., Qi, J., et al. (2023). Identification of hub genes regulating isoflavone accumulation in soybean seeds via GWAS and WGCNA approaches. Front. Plant Sci. 14, 1120498. doi: 10.3389/fpls.2023.1120498
Chen, C. and Xia, R. (2022). “Interactive data analyses using TBtools,” in Integrative bioinformatics: history and future. (Springer), 343–375. doi: 10.1007/978-981-16-6795-4_17
Chen, X., Mei, Q., Liang, W., Sun, J., Wang, X., Zhou, J., et al. (2020). Gene mapping, genome-wide transcriptome analysis, and WGCNA reveals the molecular mechanism for triggering programmed cell death in rice mutant pir1. Plants 9, 1607. doi: 10.3390/plants9111607
Cordoba, E., Salmi, M., and León, P. (2009). Unravelling the regulatory mechanisms that modulate the MEP pathway in higher plants. J. Exp. Bot. 60, 2933–2943. doi: 10.1093/jxb/erp190
Dileo, M. V., Strahan, G. D., Den Bakker, M., and Hoekenga, O. A. (2011). Weighted correlation network analysis (WGCNA) applied to the tomato fruit metabolome. PLoS One 6, e26683. doi: 10.1371/journal.pone.0026683
FAO (2024). FAO launches first major global assessment of salt-affected soils in 50 years. Available online at: https://www.fao.org/newsroom/detail/fao-launches-first-major-global-assessment-of-salt-affected-soils-in-50-years/en (Accessed June 10, 2025).
Feng, K., Nie, X., Cui, L., Deng, P., Wang, M., and Song, W. (2017). Genome-wide identification and characterization of salinity stress-responsive miRNAs in wild emmer wheat (Triticum turgidum ssp. dicoccoides). Genes 8, 156. doi: 10.3390/genes8060156
Fridovich, I. (1986). Superoxide dismutases. Adv. enzymology related areas Mol. Biol. 58, 61–97. doi: 10.1002/9780470123041.ch2
Gangadhar, B. H., Yu, J. W., Sajeesh, K., and Park, S. W. (2014). A systematic exploration of high-temperature stress-responsive genes in potato using large-scale yeast functional screening. Mol. Genet. Genomics 289, 185–201. doi: 10.1007/s00438-013-0795-z
Gao, T., Zhang, Y., Shi, J., Mohamed, S. R., Xu, J., and Liu, X. (2021). The antioxidant guaiacol exerts fungicidal activity against fungal growth and deoxynivalenol production in Fusarium graminearum. Front. Microbiol. 12, 762844. doi: 10.3389/fmicb.2021.762844
Guo, M., Wang, X.-S., Guo, H.-D., Bai, S.-Y., Khan, A., Wang, X.-M., et al. (2022). Tomato salt tolerance mechanisms and their potential applications for fighting salinity: A review. Front. Plant Sci. 13, 2022. doi: 10.3389/fpls.2022.949541
He, G., Zhao, Y., Wang, L., Jiang, S., and Zhu, Y. (2019). China’s food security challenge: Effects of food habit changes on requirements for arable land and water. J. Cleaner Production 229, 739–750. doi: 10.1016/j.jclepro.2019.05.053
Hu, J., Zhuang, Y., Li, X., Li, X., Sun, C., Ding, Z., et al. (2022). Time-series transcriptome comparison reveals the gene regulation network under salt stress in soybean (Glycine max) roots. BMC Plant Biol. 22, 157. doi: 10.1186/s12870-022-03541-9
Imadi, S. R., Kazi, A. G., Ahanger, M. A., Gucel, S., and Ahmad, P. (2015). Plant transcriptomics and responses to environmental stress: an overview. J. Genet. 94, 525–537. doi: 10.1007/s12041-015-0545-6
Jahan, M. A., Harris, B., Lowery, M., Coburn, K., Infante, A. M., Percifield, R. J., et al. (2019). The NAC family transcription factor GmNAC42–1 regulates biosynthesis of the anticancer and neuroprotective glyceollins in soybean. BMC Genomics 20, 1–21. doi: 10.1186/s12864-019-5524-5
Jhan, L.-H., Yang, C.-Y., Huang, C.-M., Lai, M.-C., Huang, Y.-H., Baiya, S., et al. (2023). Integrative pathway and network analysis provide insights on flooding-tolerance genes in soybean. Sci. Rep. 13, 1980. doi: 10.1038/s41598-023-28593-1
Johnson, R. and Puthur, J. T. (2021). Seed priming as a cost effective technique for developing plants with cross tolerance to salinity stress. Plant Physiol. Biochem. 162, 247–257. doi: 10.1016/j.plaphy.2021.02.034
Kan, G., Zhang, W., Yang, W., Ma, D., Zhang, D., Hao, D., et al. (2015). Association mapping of soybean seed germination under salt stress. Mol. Genet. Genomics 290, 2147–2162. doi: 10.1007/s00438-015-1066-y
Langfelder, P. and Horvath, S. (2008). WGCNA: an R package for weighted correlation network analysis. BMC Bioinf. 9, 1–13. doi: 10.1186/1471-2105-9-559
Leung, H.-S., Chan, L.-Y., Law, C.-H., Li, M.-W., and Lam, H.-M. (2023). Twenty years of mining salt tolerance genes in soybean. Mol. Breed. 43, 45. doi: 10.1007/s11032-023-01383-3
Li, Y., Chen, Q., Nan, H., Li, X., Lu, S., Zhao, X., et al. (2017). Overexpression of GmFDL19 enhances tolerance to drought and salt stresses in soybean. PLoS One 12, e0179554. doi: 10.1371/journal.pone.0179554
Li, J., Pu, L., Han, M., Zhu, M., Zhang, R., and Xiang, Y. (2014). Soil salinization research in China: Advances and prospects. J. Geographical Sci. 24, 943–960. doi: 10.1007/s11442-014-1130-2
Li, J., Sun, P., Xia, Y., Zheng, G., Sun, J., and Jia, H. (2019). A stress-associated protein, PtSAP13, from Populus trichocarpa provides tolerance to salt stress. Int. J. Mol. Sci. 20, 5782. doi: 10.3390/ijms20225782
Li, H., Tang, X., Yang, X., and Zhang, H. (2021). Comprehensive transcriptome and metabolome profiling reveal metabolic mechanisms of Nitraria sibirica Pall. to salt stress. Sci. Rep. 11, 12878. doi: 10.1038/s41598-021-92317-6
Liu, Z., Li, H., Gou, Z., Zhang, Y., Wang, X., Ren, H., et al. (2020). Genome-wide association study of soybean seed germination under drought stress. Mol. Genet. Genomics 295, 661–673. doi: 10.1007/s00438-020-01646-0
Liu, Y., Yu, L., Qu, Y., Chen, J., Liu, X., Hong, H., et al. (2016). GmSALT3, which confers improved soybean salt tolerance in the field, increases leaf Cl-exclusion prior to Na+ exclusion but does not improve early vigor under salinity. Front. Plant Sci. 7, 1485. doi: 10.3389/fpls.2016.01485
Lu, C., Pu, Y., Liu, Y., Li, Y., Qu, J., Huang, H., et al. (2019). Comparative transcriptomics and weighted gene co-expression correlation network analysis (WGCNA) reveal potential regulation mechanism of carotenoid accumulation in Chrysanthemum× morifolium. Plant Physiol. Biochem. 142, 415–428. doi: 10.1016/j.plaphy.2019.07.023
Luo, Y., Pang, D., Jin, M., Chen, J., Kong, X., Li, W., et al. (2019). Identification of plant hormones and candidate hub genes regulating flag leaf senescence in wheat response to water deficit stress at the grain-filling stage. Plant Direct 3, e00152. doi: 10.1002/pld3.152
Ma, L., Zhang, M., Chen, J., Qing, C., He, S., Zou, C., et al. (2021). GWAS and WGCNA uncover hub genes controlling salt tolerance in maize (Zea mays L.) seedlings. Theor. Appl. Genet. 134, 3305–3318. doi: 10.1007/s00122-021-03897-w
Shannon, P., Markiel, A., Ozier, O., Baliga, N. S., Wang, J. T., Ramage, D., et al. (2003). Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504. doi: 10.1101/gr.1239303
Sherman, B. T., Hao, M., Qiu, J., Jiao, X., Baseler, M. W., Lane, H. C., et al. (2022). DAVID: a web server for functional enrichment analysis and functional annotation of gene lists, (2021 update). Nucleic Acids Res. 50, W216–W221. doi: 10.1093/nar/gkac194
Shi, M.-Q., Liao, X.-L., Qian, Y., Zhang, W., Li, Y.-K., Bhat, J. A., et al. (2022). Linkage and association mapping of wild soybean (Glycine soja) seeds germinating under salt stress. J. Integr. Agric. 21, 2833–2847. doi: 10.1016/j.jia.2022.07.031
Smith, A. K., Nash, A. M., and Wilson, L. I. (1961). Water absorption of soybeans. J. Am. Oil Chem. Soc. 38, 120–123. doi: 10.1007/BF02641230
Tang, Y., Li, J., Song, Q., Cheng, Q., Tan, Q., Zhou, Q., et al. (2023). Transcriptome and WGCNA reveal hub genes in sugarcane tiller seedlings in response to drought stress. Sci. Rep. 13, 12823. doi: 10.1038/s41598-023-40006-x
Wang, X., Liu, H., Zhang, D., Zou, D., Wang, J., Zheng, H., et al. (2022). Photosynthetic carbon fixation and sucrose metabolism supplemented by weighted gene co-expression network analysis in response to water stress in rice with overlapping growth stages. Front. Plant Sci. 13, 864605. doi: 10.3389/fpls.2022.864605
Wang, R., Wang, Y., Yao, W., Ge, W., Jiang, T., and Zhou, B. (2023). Transcriptome sequencing and WGCNA reveal key genes in response to leaf blight in poplar. Int. J. Mol. Sci. 24, 10047. doi: 10.3390/ijms241210047
Wang, Q., Yang, Y., Lu, G., Sun, X., Feng, Y., Yan, S., et al. (2020). Genome-wide identification of microRNAs and phased siRNAs in soybean roots under long-term salt stress. Genes Genomics 42, 1239–1249. doi: 10.1007/s13258-020-00990-0
Wei, Y., Li, L., Hu, W., Ju, H., Zhang, M., Qin, Q., et al. (2020). Suppression of rice blast by bacterial strains isolated from cultivated soda saline-sodic soils. Int. J. Environ. Res. Public Health 17, 5248. doi: 10.3390/ijerph17145248
Yao, Y., Xiong, E., Qu, X., Li, J., Liu, H., Quan, L., et al. (2023). WGCNA and transcriptome profiling reveal hub genes for key development stage seed size/oil content between wild and cultivated soybean. BMC Genomics 24, 494. doi: 10.1186/s12864-023-09617-6
Zeng, A., Chen, P., Korth, K. L., Ping, J., Thomas, J., Wu, C., et al. (2019). RNA sequencing analysis of salt tolerance in soybean (Glycine max). Genomics 111, 629–635. doi: 10.1016/j.ygeno.2018.03.020
Zhang, R., Hussain, S., Wang, Y., Liu, Y., Li, Q., Chen, Y., et al. (2021). Comprehensive evaluation of salt tolerance in rice (Oryza sativa L.) germplasm at the germination stage. Agronomy 11, 1569. doi: 10.3390/agronomy11081569
Zhang, W., Liao, X., Cui, Y., Ma, W., Zhang, X., Du, H., et al. (2019). A cation diffusion facilitator, GmCDF1, negatively regulates salt tolerance in soybean. PLoS Genet. 15, e1007798. doi: 10.1371/journal.pgen.1007798
Zhao, X., Zhang, Y., Wang, J., Zhao, X., Li, Y., Teng, W., et al. (2024). GWAS and WGCNA analysis uncover candidate genes associated with oil content in soybean. Plants 13, 1351. doi: 10.3390/plants13101351
Zheng, R., Deng, M., Lv, D., Tong, B., Liu, Y., and Luo, H. (2023). Combined BSA-seq and RNA-seq reveal genes associated with the visual stay-green of maize (Zea mays L.). Int. J. Mol. Sci. 24, 17617. doi: 10.3390/ijms242417617
Zhou, S., Wu, T., Li, X., Wang, S., and Hu, B. (2024). Identification of candidate genes controlling cold tolerance at the early seedling stage from Dongxiang wild rice by QTL mapping, BSA-Seq and RNA-Seq. BMC Plant Biol. 24, 649. doi: 10.1186/s12870-024-05369-x
Zhu, Q., Gao, S., and Zhang, W. (2021). Identification of key transcription factors related to bacterial spot resistance in pepper through regulatory network analyses. Genes 12, 1351. doi: 10.3390/genes12091351
Keywords: soybean, germination, salt stress tolerance, transcriptome profiling, WGCNA
Citation: Pan L, Chen Y, Ren Z, Khojely DM, Wang S, Li Y, Ibrahim SE, Fan S, Song Y, Zhang Z and Wei J (2025) Using WGCNA and transcriptome profiling to identify hub genes for salt stress tolerance in germinating soybean seeds. Front. Plant Sci. 16:1569565. doi: 10.3389/fpls.2025.1569565
Received: 01 February 2025; Accepted: 04 July 2025;
Published: 08 August 2025.
Edited by:
Baris Uzilday, Ege University, TürkiyeReviewed by:
Jiyi G. E. Zhang, Harpe Bioherbicide Solutions Inc., United StatesShi-Hong Zhang, Jilin University, China
Mian Faisal Nazir, Lushan Botanical Garden (CAS), China
Jian Ning Liu, Shandong Agricultural University, China
Copyright © 2025 Pan, Chen, Ren, Khojely, Wang, Li, Ibrahim, Fan, Song, Zhang and Wei. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Zhuo Zhang, emhhbmd6aHVvQGpsYXUuZWR1LmNu; Jian Wei, d2VpamlhbkBqbGF1LmVkdS5jbg==