Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Plant Sci., 08 January 2026

Sec. Plant Physiology

Volume 16 - 2025 | https://doi.org/10.3389/fpls.2025.1708851

Integrated transcriptome and metabolome analyses reveal key genes regulating jujuboside biosynthesis in Ziziphus jujuba var. spinosa

Dali GengDali GengXiaozhou YangXiaozhou YangXinhong WangXinhong WangXiaojun ChiXiaojun ChiXiaohan TangXiaohan TangXiaojun MaXiaojun MaXuexiang LiXuexiang LiMenglin PuMenglin PuJing Shu*Jing Shu*
  • Department of Forestry Engineering, Shandong Agriculture and Engineering University, Zibo, China

The seeds of Ziziphus jujuba var. spinosa are a traditional Chinese medicine for treating insomnia, with jujuboside A and jujuboside B as their core bioactive components. However, the biosynthetic pathways and regulatory mechanisms of these triterpenoid saponins remain poorly understood. In this study, we investigated the accumulation patterns of jujube saponins across 10 varieties at 45, 65, and 85 days after flowering (DAF). Integrated transcriptomic and metabolomic analyses revealed 2,040 differentially expressed genes (DEGs) and 1,593 differentially accumulated metabolites (DAMs). Among these, 35 core DEGs were consistently significant across comparisons, enriched in saponin metabolism, monooxygenase activity, and glycosylation processes. A Uridine Diphosphate (UDP)-glucosyltransferase and two CYP450 family members exhibited expression patterns positively correlated with saponin accumulation, showing upregulation during 45 and 65 DAF and suppression by 85 DAF. qRT-PCR validation confirmed their higher expression in high-saponin varieties. Additionally, downregulated stress–response genes in high-saponin varieties suggested a potential link between abiotic stress tolerance and enhanced saponin production. These findings identify key glycosyltransferase and cytochrome P450 genes potentially governing jujuboside biosynthesis, providing critical insights for metabolic engineering and quality improvement of Z. jujuba.

Introduction

The number of individuals with insomnia disorders in China is rapidly increasing. This growth in the population suffering from sleep disorders has led to a soaring market demand for sedative drugs (Zhao et al., 2020). Seeds of Ziziphus jujuba var. spinosa (Z. jujuba), a traditional Chinese medicinal herb used for treating insomnia, have consequently experienced rapidly increasing demand and price (Hua et al., 2021). However, despite the strong demand for Z. jujuba, its production has long faced challenges. At the germplasm level, the production of Z. jujuba has historically relied on the collection of wild resources, lacking systematic genetic improvement and standardized cultivation techniques. Additionally, long-term neglect of this traditional Chinese medicine means that research on Z. jujuba breeding and the mechanisms underlying its medicinal efficacy remains insufficient. The biosynthetic pathways of its core active components, jujuboside A and jujuboside B, remain unclear, and key enzyme genes and regulatory networks within the related secondary metabolic pathways have yet to be elucidated. This quality uncertainty severely hinders the development of standardized formulations and the stability of pharmaceutical and health product efficacy, thereby impeding the promotion and popularization of Z. jujuba-based sedative drugs and health products and presenting a significant challenge for the healthy development of the Z. jujuba industry.

The key to overcoming these challenges lies in deciphering the molecular basis of Z. jujuba’s medicinal efficacy formation. Unraveling the biosynthetic mechanisms of Z. jujuba’s active components requires focusing on the discovery of its key rate-limiting enzyme systems. Although the aglycone structure of jujuboside A/B shares homology with dammarane-type triterpenes, the formation mechanism of its characteristic ketal ring structure is specific (Li et al., 2023). Current research indicates that this molecule undergoes oxygenation at the C-16 and C-30 positions of dammarenediol-II, followed by aldol condensation and ketal cyclization to form the C16–C30 and C16–C23 rings. This series of oxidative modification reactions is highly dependent on hydroxylase catalysis. Notably, in Panax ginseng, the CYP450 superfamily (particularly members of the CYP716A subfamily) has been confirmed to precisely regulate the hydroxylation sites and reaction sequence of dammarane-type triterpenes. Its functional diversity provides an important reference for deciphering the jujuboside biosynthetic network (Han et al., 2011, 2012, 2013) and has been successfully applied in the artificial cellular synthesis of ginsenosides, achieving the production of intermediates (Zhu et al., 2014; Zhao et al., 2016; Song et al., 2020), aglycones (Li D. et al., 2019), or saponins (Wei et al., 2015; Wang et al., 2015; Zhuang et al., 2017) in microbial cells such as Escherichia coli, Bacillus subtilis, Pichia pastoris, Yarrowia lipolytica, and Saccharomyces cerevisiae. However, despite also being dammarane-type tetracyclic triterpenoids, the functional divergence of CYP450 homologs corresponding to those in ginseng and their catalytic specificity for the characteristic ketal structure in jujuboside biosynthesis remain unknown. The absence of these key enzymes directly hinders the complete mapping of the molecular regulatory network governing the jujuboside biosynthetic pathway.

Biosynthesis of dammarane-type triterpenoids is related to Uridine Diphosphate (UDP)-glucosyltransferase and CYP450 (Shin and Oh, 2016). CYP450 enzymes perform complex modifications such as hydroxylation and oxidation on the carbon ring framework of triterpenoids and are key enzymes in the dammarane-type triterpenoid biosynthesis pathway. In P. ginseng, 11 CYP450 genes related to ginsenoside biosynthesis were discovered and cloned by Zeng et al. (2018). Among these genes, CYP716A47 can increase expression in response to methyl jasmonate induction. Furthermore, when transgenic ginseng plants with an overexpressed SS gene are produced, the yield of saponins in the roots increases. By introducing CYP716A47 into brewing yeast, the expressed recombinant protein can catalyze the C-12 hydroxylation of dammarane-2,4-diol and convert it into protospondiols. When DS and CYP716A47 are simultaneously introduced into brewing yeast, protospondiol production was detected in the recombinant strain (Han et al., 2011).

In Panax notoginseng, 15 CYP450 genes were discovered by Luo et al. (2011). Among these genes, Pn00158 is a homolog of CYP716A47 in P. ginseng (Li D. et al., 2019). UDP-glucosyltransferase catalyzes the glycosylation reaction, the final step in jujube saponin biosynthesis. This process involves transferring sugar groups to the aglycone of the saponin to form glycosidic bonds. In P. ginseng, UDP-glucuronosyltransferase Pg1 (UGTPg1) catalyzes the biosynthesis of ginsenoside C-K (Yan et al., 2014); UGTPg45 and UDP-glucuronosyltransferase Pg29 (UGTPg29) catalyze the biosynthesis of ginsenoside Rh2 and Rh3 (Wang et al., 2015); and UGTPg1 and UGTPg100 catalyze the biosynthesis of ginsenoside Rh1 and F1 (Wei et al., 2015).

In this paper, we investigated the accumulation patterns of jujube saponins across 10 Z. jujuba varieties at 45, 65, and 85 days after flowering (DAF). We performed integrated transcriptomic and metabolomic analyses on varieties exhibiting contrasting saponin profiles. Differential expression analysis identified 2,040 differentially expressed genes (DEGs) and 1,593 differentially accumulated metabolites (DAMs), with 35 core DEGs consistently significant across comparisons. Gene Ontology (GO) enrichment revealed that these DEGs were associated with saponin metabolism, monooxygenase activity, and glycosylation processes. Co-expression analysis highlighted three candidate genes whose temporal expression patterns correlated with saponin accumulation. These genes were upregulated during the active saponin biosynthesis stages (45–65 DAF) but suppressed by 85 DAF. Their involvement in jujuboside biosynthesis was further supported by quantitative reverse transcription polymerase chain reaction (qRT-PCR) validation and by observed correlations with precursor metabolite levels. Finally, the potential linkage between abiotic stress tolerance and enhanced saponin production was indicated by downregulated stress–response genes in high-saponin varieties.

Methods

Plant materials

Z. jujuba var. spinosa were planted in Guangrao County, Dongying City, Shandong Province, China (118.45°E, 37.08°N) with 1.2‰ soil salinity. We selected 5-year-old plants for seed collection. Labels indicating the date were hung on buds when they were about to bloom, and fruits were then harvested and frozen in liquid nitrogen at 35, 45, 55, 65, 75, and 85 DAF. Epicarp, sarcocarp, and endocarp were removed with a nutcracker in liquid nitrogen to maintain frozen conditions. Seeds were collected and stored at − 80°C. Ten varieties of the hybrid from the crossbreeding of “Madu” with the “Pufeng” variety were selected; for each variety, five individuals were chosen, and 10 seeds were harvested from each individual.

Jujube saponin content measurement

Seeds were crushed in a mixer mill (MM 400, Retsch) containing small steel balls in liquid nitrogen for 1.5 min at 30 Hz. The powder was divided into equal parts. One part was stored at − 80°C, and the other part was freeze-dried in a vacuum freeze dryer (LGJ-18D, The Fourth Ring, Beijing, China). The lyophilized powder was stored at − 80°C. Three seeds from different individuals were used as biological replicates.

Jujuboside A, jujuboside B, and jujubogenin were purchased from Desite Biotech (Chengdu, Sichuan Province, China). Lyophilized powder was used for the High Performance Liquid Chromatography with Evaporative Light Scattering Detector (HPLC-ELSD) analysis. A 100-mg sample was extracted with 1.5 mL of 50% methanol in a sonicator bath for 1 h and centrifuged at 12,000 × g for 5 min to remove debris. The supernatant was filtered through a 0.2-μm filter before injection. High Performance Liquid Chromatography with Evaporative Light Scattering Detector (HPLC-ELSD) was performed with the Vanauish Core Duo (Thermo Fisher, Waltham, MA, USA). Separation was achieved with a 150 × 2.1 mm, 3 µm C18–120 column (Shimadzu, Tokyo, Japan) using the following gradient: 0.2% acetic acid in water (32%) vs. methanol (68%) run at 1 mL min−1 and a column temperature of 30°C. The ELSD detection shows an atomization temperature of 45°C and a gas flow rate of 1.6 L min−1. Each group had three biological repeats.

Transcriptome and metabolome analyses

Seeds of 45 and 65 lyophilized powder were used for metabolome and transcriptome analyses. The metabolome analysis was performed by Metware Biotechnology (Wuhan, Hubei Province, China) using methods described by (Chen et al., 2018). Metabolite profiling was conducted using an ExionLC™ AD UPLC system coupled to a QTRAP® 6500+ mass spectrometer (SCIEX, Danaher Corporation, USA). Chromatographic separation was performed on an Agilent SB-C18 column (1.8 µm, 2.1 × 100 mm) with a mobile phase comprising 0.1% formic acid in water (solvent A) and 0.1% formic acid in acetonitrile (solvent B). The gradient program initiated at 95% A/5% B, linearly transitioned to 5% A/95% B over 9 min, held at this composition for 1 min, then re-equilibrated to 95% A/5% B within 1.1 min, followed by a 2.9-min stabilization period. Separation was performed at a flow rate of 0.35 mL min−1, column temperature of 40°C, and injection volume of 2 µL. Electrospray ionization parameters included a source temperature of 500°C, a spray voltage of ± 5,500 V (positive/negative mode), and gas pressures of 50 psi (GS1), 60 psi (GS2), and 25 psi (CUR). Mass spectrometry utilized high CAD energy with medium-pressure nitrogen collision gas. Data acquisition occurred in time-scheduled MRM mode, where declustering potentials and collision energies were optimized for each metabolite transition and monitored according to the corresponding elution windows. Transcriptome sequencing analysis was performed by Metware Biotechnology (Wuhan, China) following the protocol described by Chen et al. (2018). Total RNA from plant tissues was extracted using the CTAB-PBIOZOL method combined with ethanol precipitation, while RNA from animal tissues was processed using Trizol reagent. Following extraction, RNA pellets were dissolved in 50 μL of DEPC-treated nuclease-free water. RNA concentration was precisely quantified using a Qubit fluorometer, and RNA integrity was assessed using a Qsep400 biofragment analyzer to ensure samples met library construction requirements.

For mRNA library preparation, polyadenylated mRNAs were first enriched using Oligo(dT) magnetic beads based on the polyA tail structure characteristic of eukaryotic mRNAs. The purified mRNAs were then fragmented into appropriate lengths under controlled temperature conditions using a fragmentation buffer. First-strand complementary DNA (cDNA) synthesis was performed via reverse transcription with random hexamer primers, followed by second-strand cDNA synthesis, during which dUTP was incorporated instead of dTTP during end repair and dA-tailing. This strategic substitution enables strand-specific sequencing, as subsequent PCR amplification selectively degrades uracil-containing strands. Sequencing adapters were ligated to the double-stranded cDNA fragments, followed by purification and size selection using SPRI beads to obtain libraries with 250–350 bp inserts. The adapter-ligated products underwent PCR amplification and final purification, with elution performed in nuclease-free water.

After library construction, concentration was quantified using Qubit, and fragment size distribution was validated using the Qsep400 analyzer. Qualified libraries were pooled according to effective concentration and target sequencing output, then subjected to paired-end 150 bp (PE150) sequencing on an Illumina platform. The sequencing-by-synthesis principle involves bridge amplification of DNA fragments on the flow cell to generate clusters. During each cycle, fluorescently labeled dNTPs and DNA polymerase are introduced, with nucleotide incorporation releasing characteristic optical signals. The sequencer captures these signals and converts them into base calls through dedicated software, generating complete sequence reads for the target fragments. Raw data were subjected to adapter trimming and low-quality read filtration prior to downstream gene expression analysis.

Samples at 45 and 65 DAF were used for metabolome and transcriptome analyses. Each group had three biological repeats, with nine seeds from three different individuals per replicate.

qRT-PCR analysis, gene cloning, and phylogenetic analysis

The nonlyophilized powder of all samples was used for the qRT-PCR analysis. A 500-mg portion of this powder was used to extract RNA with the Polysaccharides and Polyphenolics-rich RNAprep Pure Plant Kit (Cat. No. DP441, Tiangen Biotech, Beijing, China). A total of 2 µg of RNA was used to synthesize the first-strand cDNA with the PrimeScript™ First Strand cDNA Synthesis Kit (Takara, Shiga, Japan). The cDNA reaction mixture was diluted five times, and 5 µL was used in the 20-µL PCR reaction. PCR reactions included a preincubation step at 95°C for 2 min, followed by 45 cycles of denaturation at 95°C for 15 s, annealing at 54°C for 30 s, and extension at 72°C for 30 s. All reactions were performed in the QuantStudio™ 5 Food Safety Real-Time PCR System using TB Green Fast qPCR Mix (Takara) with ROX reference dye. Each experiment had nine replicates (three technical replicates for each biological replicate). The relative expression levels were calculated using the 2−ΔΔCt method.

The cDNA reaction mixture was used for cloning. Gene cloning was performed with TaKaRa Ex Taq® (Takara, Japan). The PCR products were subcloned into pDONR207 using the Gateway BP Clonase II enzyme mix (Thermo Fisher).

Sequenced genes were translated into protein sequences, aligned, and phylogenetic trees were built using Molecular Evolutionary Genetics Analysis version 7.0 (Kumar et al., 2016) with maximum likelihood methods and 1,000 bootstrap replicates.

All primers used are shown in Supplementary Table S1.

Statistics

All data are presented as mean ± SD. Paired or unpaired two-tailed Student’s t-tests were used to compare group differences. p-values < 0.05 were considered significant. Three biological repeats were used for all analyses.

Results

Jujube saponins show different accumulation patterns in the seeds of different jujube varieties

The accumulation pattern of jujube saponins in seeds of different jujube varieties may vary. To verify this, we selected 10 different jujube varieties and detected the contents of jujube saponin A, jujube saponin B, and jujubogenin in seeds at 45, 65, and 85 DAF, respectively. The results showed that in seven of the 10 varieties, the content of jujubogenin decreased, while the content of jujube saponin A and B increased over time (Figure 1). However, varieties No. 1, No. 4, and No. 8 showed a different accumulation pattern. In these three varieties, accumulation of jujube saponins increased abnormally at 65 DAF compared with 45 DAF (Figure 1), and the content of jujube saponin A and B was relatively higher than in the other varieties during the red ripening stage These results indicate that the regulation of accumulation of jujube saponin A, jujube saponin B, and jujubogenin in the seeds of No. 1, No. 4, and No. 8 was different.

Figure 1
Bar charts labeled A, B, and C show jujubogenin, jujube saponin A, and jujube saponin B content in micrograms per gram dry weight, respectively, across ten samples (#1 to #10) at 45, 65, and 85 days after flowering (DAF). Each chart uses blue, orange, and gray bars to represent data at 45th, 65th, and 85th DAF. Error bars indicate variability, and asterisks denote significant differences.

Figure 1. (A) Content of jujubogenin, (B) jujube saponin (A, C) jujube saponin B in 10 different varieties at the 45th, 65th, and 85th DAF. *p < 0.05—significant level (n = 3).

Transcriptome and metabolome analyses reveal a significant difference

To determine differentially expressed genes related to the accumulation of jujube saponins, we performed combined transcription–metabolome analysis in the seed of No. 1 and No. 3 jujube varieties at 45 and 65 DAF, which have significantly different contents of jujubogenin at 65 DAF. At the whole metabolome level, the difference caused by DAF is the main source of variation, while the difference caused by varieties is less pronounced, since PC1(40.6%) separated the 45 and 65 DAF groups, and PC2 (19.62%) separated the No. 1 and No. 3 groups (Figure 2A). Similar patterns were observed in PCA results at the whole transcriptome level (Figure 2B) and in the correlation matrix of metabolome and transcriptome analysis (Figures 2C, D).

Figure 2
Figure A displays a heatmap with a gradient from red to green, representing correlation values between different samples. Figure B shows a similar heatmap with a gradient indicating correlation strengths. Figures C and D are scatter plots of principal component analysis (PCA) results with color-coded points representing groups: No.1_45th_DAF, No.1_65th_DAF, No.3_45th_DAF, and No.3_65th_DAF. The axes are labeled PC1 and PC2, with variance percentages noted.

Figure 2. (A) Pearson correlation analysis of metabolome results. (B) Pearson correlation analysis of transcriptome results. (C) Principal component analysis of metabolome results. (D) Principal component analysis of transcriptome results.

However, the content of jujube saponins in No. 1 and No. 3 varieties was significantly different at 45 and 65 DAF. Thus, we proceeded to determine which genes and metabolites were differentially expressed by varieties and DAF. Results showed 2,040 DEGs and 1,593 DAMs (Supplementary Figures S1, S2). Of all these DEGs and DAMs, 668 DAMs and 1,097 DEGs were discovered in No. 1–45 DAF vs. No. 1–65 DAF groups; 498 DAMs and 529 DEGs were discovered in No. 1–45 DAF vs. No. 3–45 DAF groups; 979 DAMs and 1,295 DEGs were discovered in No. 3–45 DAF vs. No. 3–65 DAF groups; and 283 DAMs and 433 DEGs were discovered in No. 1–65 DAF vs. No. 3–65 DAF groups (Figures 3A, 4A). The datasets of metabolome and transcriptome results are shown in Supplementary Tables S2S6, respectively.

Figure 3
A set of analyses including (A) a Venn diagram with overlapping colored areas showing different datasets at various developmental stages, labeled #1, #3, #4, and #5 with corresponding numbers. (B, C, D, E) Bubble plots showing gene ontology terms on the y-axis and rich factors on the x-axis, with bubbles representing different gene counts and color gradients indicating Q-values. Each plot shows various biological processes or responses with bubble sizes indicating counts from 8 to 80 across different panels.

Figure 3. (A) Venn diagram of DEGs from all groups. GO analysis results. (B) No. 1 45th DAF vs. No. 1 65th DAF groups; (C) No. 3 45th DAF vs. No. 3 65th DAF groups. (D) No. 1 45th DAF vs. No. 3 45th DAF groups. (E) No. 1 65th DAF vs. No. 3 65th DAF groups.

Figure 4
A multi-panel image displays various data visualizations. Panel A features a Venn diagram with four overlapping sets, labeled with identifiers and metrics. Panels B, C, D, and E show scatter plots with bubble charts illustrating different biosynthesis and metabolic pathways. The x-axis represents the Rich Factor, while the y-axis lists pathways. Bubble size corresponds to the count of genes, and color indicates the p-value. Each plot includes a color scale for p-values and a legend for the count.

Figure 4. (A) Venn diagram of DAMs from all groups. KEGG analysis results of (B) No. 1 45th DAF vs. No. 1 65th DAF groups, (C) No. 3 45th DAF vs. No. 3 65th DAF groups, (D) No. 1 45th DAF vs. No. 3 45th DAF groups, and (E) No. 1 65th DAF vs. No. 3 65th DAF groups.

GO and KEGG analyses revealed significant differences between varieties in seeds

To discover the function of significant DEGs, GO and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses were performed on all DEGs. GO analysis results showed differences between varieties. DEGs in the No. 1–45 DAF vs. No. 1–65 DAF groups and No. 3–45 DAF vs. No. 3–65 DAF groups were annotated as related to cell wall biogenesis, photosystem, lipid droplet, and cysteine-type peptidase activity (Figures 3C, D). DEGs in the No. 1–45 DAF vs. No. 3–45 DAF groups and No. 1–65 DAF vs. No. 3–65 groups were annotated differently. In No. 1–45 DAF vs. No. 3–45 DAF groups, DEGs were annotated as associated with protein folding, photosystem, and response to abiotic stress (Figure 3E). In the No. 1–65 DAF vs. No. 3–65 DAF groups, DEGs were annotated as associated with protein folding, photosystem, response to abiotic stress, and enzyme inhibitor activity (Figure 3E).

KEGG analysis revealed significant differences between varieties. DEGs in the No. 1–45 DAF vs. No. 1–65 DAF comparison were annotated as associated with flavonoid biosynthesis, triterpenoid biosynthesis, and nucleotide metabolism (Figure 4B). DEGs in the No. 3–45 DAF vs. No. 3–65 DAF comparison were annotated as associated with flavonoid biosynthesis, vitamin biosynthesis, amino acid biosynthesis, taurine metabolism, and porphyrin metabolites (Figure 4C). By contrast, DEGs in the No. 1–45 DAF vs. No. 3–45 DAF and No. 1–65 DAF vs. No. 3–65 DAF exhibited similar trends: KEGG annotated these DEGs as associated with flavonoid biosynthesis, vitamin biosynthesis, amino acid biosynthesis, and triterpenoid biosynthesis (Figures 4D, E). These results indicate that the differential accumulation of jujube saponins was the result of differential expression of genes related to triterpenoid biosynthesis.

To confirm the DEG results, we randomly selected 10 DEGs with high FPKM and significantly higher relative expression levels for qRT-PCR analysis (Figure 5). These results are consistent with our transcriptome data.

Figure 5
Bar graphs show relative expression levels of different genes (LOC107404388, LOC107432186, etc.) at two stages, 45th and 65th days after flowering (DAF), across two samples (No.1 and No.3). Each graph indicates changes in expression with varying significance levels denoted by asterisks.

Figure 5. q-RT PCR analysis to confirm transcriptome results. *p < 0.05; **p < 0.01—significant levels (n = 3).

Potential genes related to jujube saponin biosynthesis discovered by GO analysis

Among all DAMs, 20 DAMs were significant in all groups, including jujube saponin A, jujube saponin B, and jujubogenin, which were located in the center of the DAMs Venn diagram (Figure 4A). Thus, 35 DEGs significant in all groups might contribute to the different accumulation of jujube saponin A, jujube saponin B, and jujubogenin between varieties and time points (Figure 3A). To functionally analyze these 35 DEGs, GO analysis was performed. The results showed that the DEGs were enriched in saponin metabolic process, monooxygenase activity, heme binding, iron ion binding, and glycoside metabolic process (Figure 6A). The heatmap showed that nine of these genes were upregulated in variety No. 3 compared with variety No. 1 and downregulated over time; two of these genes were upregulated in variety No. 3 compared with variety No. 1 and upregulated over time; and 24 genes were downregulated in variety No. 3 compared with variety No. 1 and upregulated over time (Figure 6B). These 35 genes were highly likely related to the biosynthesis of jujube saponin.

Figure 6
Panel A shows a bubble plot analyzing various biological processes, with the x-axis representing the Rich Factor and the y-axis listing processes. Bubble size indicates count, while color intensity indicates Q-value. Panel B displays a heatmap with rows representing different gene loci, and columns representing sample groups. Colors range from green to red, indicating expression levels from low to high, with a legend for group differentiation.

Figure 6. (A) GO analysis results and (B) heatmap of filtered genes.

Three genes involved in the saponin metabolic process were induced at 45 DAF and 65 DAF, but suppressed at 85 DAF

As dammaran-type triterpenes, jujube saponins share a synthetic pathway with ginsenosides. The main synthetic strategy that has not yet been elucidated is the conversion of dammarenediol-II to jujube saponins, which mainly involves the oxygenation and glycosylation of dammarenediol-II. These two-step reactions are usually catalyzed by enzymes of the CYP450 and UGT gene families, so we hypothesize that the CYP450 gene family catalyzes the oxidation of dammarenediol-II to jujubogenin.

To discover genes that catalyze the biosynthesis of jujubogenin, the expression level of three genes involved in the saponin metabolic process, LOC107404388, LOC125422327, and LOC107406771, were analyzed by qRT-PCR (Figure 5). Among these three genes, LOC107404388 was annotated as a UDP-glucosyltransferase 29-like gene, and LOC125422327 and LOC107406771 were annotated as CYP450 family genes. The expression levels of these three genes showed similar trends compared with 45 DAF in both varieties: increased at 65 DAF but suppressed at 85 DAF. Additionally, expression of these three genes was higher in variety No. 1 than in No. 3. These results indicate that these three genes might positively contribute to the biosynthesis of jujube saponins.

Based on the annotation, the CDS of LOC107404388, LOC125422327, and LOC107406771 were cloned, and a maximum likelihood tree of these three genes was constructed to confirm their annotation (Figure 7). Results showed that LOC107404388 is a UGT superfamily gene, while LOC125422327 and LOC107406771 are CYP716 family genes.

Figure 7
Three phylogenetic trees labeled A, B, and C, showing relationships among various plant species and proteins. Tree A includes various UGT superfamily genes from plants like Nicotiana benthamiana and Panax ginseng. Tree B shows relationships involving CYP716A1 and other genes from plants such as Arabidopsis thaliana. Tree C involves CYP716A17-like genes and others from species like Rosa chinensis and Panax notoginseng. Each tree displays bootstrap or distance values at nodes.

Figure 7. Phylogenetic trees of (A) LOC107404388, (B) LOC125422327, and (C) LOC107406771. Maximum likelihood (ML) was used to construct this tree with 1,000 replicate bootstrap support.

Furthermore, the trend of expression levels of these three genes is consistent with the trend of jujube seed saponin content in variety No. 5 (Figure 8). From 35 to 85 DAF, expression of these three genes was significantly increased in 45, 55, 65, and 75 DAF compared with 35 DAF, then significantly decreased in 85 DAF. In contrast, LOC107404388 was only decreased at 85 DAF compared with 35 DAF. The relative contents of jujube saponin A, jujube saponin B, and jujubogenin showed similar trends: the content of jujubogenin was significantly increased at 45 DAF compared with 35 DAF, then decreased at 55, 65, 75, and 85 DAF compared with 45 DAF. The contents of jujube saponin A and jujube saponin B were significantly increased at 45, 55, 65, and 75 DAF compared with 35 DAF, then slightly increased at 85 DAF compared with 75 DAF. These results indicate that these three genes participate in the biosynthesis of jujube saponins.

Figure 8
Bar charts display relative expression levels and content for different genes and compounds at various days after flowering (DAF). The left chart shows gene expression levels of LOC107404388, LOC125422327, and LOC107406771 at five DAF stages. The right chart indicates relative content of Jujube saponin A, Jujube saponin B, and Jujubogenin. Different colors represent distinct DAF stages: blue for 35th, orange for 45th, gray for 55th, yellow for 65th, light blue for 75th, and green for 85th. Error bars and significance markers (*) are included.

Figure 8. (A) Relative expression of LOC107404388, LOC125422327, and LOC107406771 and (B) relative content of jujube saponin (A) jujube saponin (B) and jujubogenin in variety No. 5. *p < 0.05; **p < 0.01—significant levels (n = 5).

Discussion

Stress response genes appear correlated with the biosynthesis of jujube saponins

Our data suggest that differences in the content of jujube saponins between varieties may be related to different stress tolerance. In No. 1–45 DAF vs. No. 3–45 DAF groups and No. 1–65 DAF vs. No. 3–65 DAF groups, genes related to response to abiotic stress were majorly downregulated in variety No. 3 compared with variety No. 1 (Figure 3), whereas the content of jujube saponins was higher in variety No. 1. These results indicate that abiotic stress tolerance appears to be correlated with the biosynthesis of jujube saponins.

Similar results were also reported in many studies on other Chinese medicinal plants. In Panax ginseng, stress-related genes ERDL6 were upregulated in older wild-simulated ginseng, which is rich in ginsenoside varieties and amounts (Kim et al., 2023). In Scutellaria baicalensis, drought and salt stress can induce the content of baicalin and baicalein (Su et al., 2018). Long-term stress induced expression of SbWRKY34, leading to a flavonoid index ingredient level in Scutellaria baicalensis that initially decreased, then rose as the drought duration extended, followed by a notable postrehydration increase in baicalin, wogonoside, and baicalein content and a decrease in levels of wogonin and oroxylin A (Zhang et al., 2025). In Glycyrrhiza glabra, controlled drought stress upregulates the expression of key genes like squalene synthase, β-amyrin synthase, and cycloartenol synthase involved in the biosynthesis of triterpenoid saponins and directly enhances the production of glycyrrhizin (Nasrollahi et al., 2014). In Eucommia ulmoides, EuRBG10 was induced by salt and drought stress and was proposed to play a hub role in regulating the biosynthesis of alkaloids (Zuo et al., 2022).

Potential UDP-glucosyltransferase and CYP450s related to jujube saponin biosynthesis

As triterpenoids, intermediate metabolites in the biosynthesis of jujube saponins include mevalonate, farnesyl pyrophosphate (FPP), squalene, and dammarenediol-II. Based on our results, three genes related to the biosynthesis of jujube saponin were discovered by transcriptome–metabolome combined analysis. LOC125422327 and LOC107406771 were annotated as CYP716A15-like genes, and LOC107404388 was annotated as a UDP-glucosyltransferase 29-like gene. CYP716 subfamily members are multifunctional oxidases in triterpenoid biosynthesis. In Medicago truncatula, expression of CYP716 subfamily genes was highly correlated with that of β-amyrin synthase, which catalyzes the biosynthesis of oleanolic acid (Fukushima et al., 2013). In grape, CYP716A15 and CYP716A17 are involved in triterpenoid biosynthesis (Fukushima et al., 2011). CYP716A enzymes, including CYP716A752V2 isolated from olive, sugar beet, coffee, and P. notoginseng, were characterized as multifunctional C-28 oxidases and are involved in triterpenoid biosynthesis (Suzuki et al., 2018). CYP716A47 responds to JA and induces biosynthesis of ginsenoside in P. ginseng (Han et al., 2011) and P. notoginseng (Li et al., 2017). UDP-glucosyltransferase 29 (UGT29) is involved in the biosynthesis of triterpenoid saponin. In P. ginseng, UGTPg29 and its homolog Pq3-O-UGT2 can transfer glucose onto the C-2′ hydroxyl group of the first glucose residue at C-3 of ginsenoside Rh2 to produce ginsenoside Rg3 (Jung et al., 2014; Wang et al., 2015; Lu et al., 2017; Yang et al., 2020). These findings suggest that LOC125422327, LOC107406771, and LOC107404388 might be involved in the biosynthesis of jujube saponins.

In summary, we performed integrated transcriptomic and metabolomic analyses of seeds from 10 Z. jujuba varieties at 45, 65, and 85 DAF. Our analysis revealed that three candidate genes, LOC107404388, LOC125422327, and LOC107406771, exhibited expression patterns positively correlated with jujube saponin accumulation. These genes were significantly upregulated from 45 to 65 DAF but suppressed by 85 DAF. qRT-PCR validation confirmed their higher expression in high-saponin varieties vs. low-saponin varieties. Metabolite profiling further demonstrated coordinated accumulation of mevalonate, squalene, dammarenediol-II, and saponins in high-expression genotypes. Notably, varieties exhibiting enhanced saponin production showed downregulation of abiotic stress–response genes. Taken together, these results identify key glycosyltransferase and cytochrome P450 genes potentially governing jujuboside biosynthesis in Z. jujuba.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://ngdc.cncb.ac.cn/search/specific?db=bioproject&q=PRJCA046538, PRJCA046538.

Author contributions

DG: Project administration, Visualization, Funding acquisition, Resources, Formal Analysis, Conceptualization, Data curation, Methodology, Writing – review & editing, Writing – original draft. XY: Writing – review & editing, Project administration, Funding acquisition, Supervision. XW: Supervision, Formal Analysis, Data curation, Writing – review & editing, Conceptualization, Software, Project administration. XC: Project administration, Funding acquisition, Writing – review & editing, Formal Analysis. XT: Writing – review & editing, Data curation, Methodology. XM: Formal Analysis, Resources, Writing – review & editing. XL: Writing – review & editing, Software, Validation. MP: Data curation, Writing – review & editing. JS: Funding acquisition, Writing – review & editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This work was supported by the Natural Science Foundation of Shandong Province, China (ZR2023QH170), the Shandong Province Modern Agricultural Industrial Technology System Chinese Herbal Medicine System (SDAIT-20-13), and the Jinan Agricultural Science and Technology Research Project (GG202402).

Acknowledgments

We thank Prof. Guan for providing the pDONR222 vector.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2025.1708851/full#supplementary-material

Supplementary Figure 1 | Heatmap of DEGs from all groups.

Supplementary Figure 2 | Heatmap of DAMs from all groups.

Supplementary Table 1 | Primers used in this paper.

Supplementary Table 2 | Data sets of metabolome results.

Supplementary Table 3 | Data sets of Different expression genes in No. 1–45 DAF vs No. 1–65 DAF group.

Supplementary Table 4 | Data sets of Different expression genes in No. 1–65 DAF vs No. 3–65 DAF group.

Supplementary Table 5 | Data sets of Different expression genes in No. 1–45 DAF vs No. 3–45 DAF group

Supplementary Table 6 | Data sets of Different expression genes in No. 3–45 DAF vs No. 3–65 DAF group.

References

Chen, S., Zhou, Y., Chen, Y., and Gu, J. (2018). Fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 34, i884–90. doi: 10.1093/bioinformatics/bty560

PubMed Abstract | Crossref Full Text | Google Scholar

Fukushima, E. O., Seki, H., Ohyama, K., Ono, E., Umemoto, N., Mizutani, M., et al. (2011). CYP716A subfamily members are multifunctional oxidases in triterpenoid biosynthesis. Plant Cell Physiol. 52, 2050–2061. doi: 10.1093/pcp/pcr146

PubMed Abstract | Crossref Full Text | Google Scholar

Fukushima, E. O., Seki, H., Sawai, S., Suzuki, M., Ohyama, K., Saito, K., et al. (2013). Combinatorial biosynthesis of legume natural and rare triterpenoids in engineered yeast. Plant Cell Physiol. 54, 740–749. doi: 10.1093/pcp/pct015

PubMed Abstract | Crossref Full Text | Google Scholar

Han, J. Y., Hwang, H. S., Choi, S. W., Kim, H. J., and Choi, Y. E. (2012). Cytochrome P450 CYP716A53v2 catalyzes the formation of protopanaxatriol from protopanaxadiol during ginsenoside biosynthesis in Panax ginseng. Plant Cell Physiol. 53, 1535–1545. doi: 10.1093/pcp/pcs106

PubMed Abstract | Crossref Full Text | Google Scholar

Han, J. Y., Kim, M. J., Ban, Y. W., Hwang, H. S., and Choi, Y. E. (2013). The involvement of β-amyrin 28-oxidase (CYP716A52v2) in oleanane-type ginsenoside biosynthesis in Panax ginseng. Plant Cell Physiol. 54, 2034–2046. doi: 10.1093/pcp/pct141

PubMed Abstract | Crossref Full Text | Google Scholar

Han, J. Y., Kim, H. J., Kwon, Y. S., and Choi, Y. E. (2011). The Cyt P450 enzyme CYP716A47 catalyzes the formation of protopanaxadiol from dammarenediol-II during ginsenoside biosynthesis in Panax ginseng. Plant Cell Physiol. 52, 2062–2073. doi: 10.1093/pcp/pcr150

PubMed Abstract | Crossref Full Text | Google Scholar

Hua, Y., Guo, S., Xie, H., Zhu, Y., Yan, H., Tao, W. W., et al. (2021). Z.jujuba Mill. var. spinosa (Bunge) Hu ex H. F. Chou Seed Ameliorates Insomnia in Rats by Regulating Metabolome and Intestinal Flora Composition. Front. Pharmacol. 12. doi: 10.3389/fphar.2021.653767

PubMed Abstract | Crossref Full Text | Google Scholar

Jung, S. C., Kim, W., Park, S. C., Jeong, J., Park, M. K., and Lim, S. (2014). Two ginseng UDP-glycosyltransferases synthesize ginsenoside Rg3 and Rd. Plant Cell Physiol. 55, 2177–2188. doi: 10.1093/pcp/pcu147

PubMed Abstract | Crossref Full Text | Google Scholar

Kim, J., Yun, Y., Huh, J., Um, Y., and Shim, D. (2023). Comparative transcriptome analysis on wild-simulated ginseng of different age revealed possible mechanism of ginsenoside accumulation. Plant Physiol. Biochem. 201, 107870. doi: 10.1016/j.plaphy.2023.107870

PubMed Abstract | Crossref Full Text | Google Scholar

Kumar, S., Stecher, G., and Tamura, K. (2016). MEGA7: Molecular Evolutionary Genetics Analysis version 7.0 for bigger datasets. Molecular Biology and Evolution. 33, 1870–1874.

PubMed Abstract | Google Scholar

Li, Y., Wang, J., Li, L., Song, W., Li, M., Hua, X., et al. (2023). Natural products of pentacyclic triterpenoids: from discovery to heterologous biosynthesis. Nat. Prod Rep. 40, 1303–1353. doi: 10.1039/d2np00063f

PubMed Abstract | Crossref Full Text | Google Scholar

Li, J., Wang, J., Wu, X., Liu, D., Li, J., Li, J., et al. (2017). Jasmonic acid and methyl dihydrojasmonate enhance saponin biosynthesis as well as expression of functional genes in adventitious roots of Panax notoginseng F.H. Chen. Biotechnol. Appl. Biochem. 64, 225–238. doi: 10.1002/bab.1477

PubMed Abstract | Crossref Full Text | Google Scholar

Li, D., Wu, Y., Zhang, C., Sun, J., Zhou, Z., and Lu, W. (2019). Production of triterpene ginsenoside compound K in the non-conventional yeast yarrowia lipolytica. J. Agric. Food Chem. 67, 2581–2588. doi: 10.1021/acs.jafc.9b00009

PubMed Abstract | Crossref Full Text | Google Scholar

Lu, C., Zhao, S. J., and Wang, X. S. (2017). Functional regulation of a UDP-glucosyltransferase gene (Pq3-O-UGT1) by RNA interference and overexpression in Panax quinquefolius. Plant Cell Tissue Organ Cult. 129, 445–456. doi: 10.1007/s11240-017-1190-y

Crossref Full Text | Google Scholar

Luo, H., Sun, C., Sun, Y., Wu, Q., Li, Y., Song, J., et al. (2011). Analysis of the transcriptome of Panax notoginseng root uncovers putative triterpene saponin-biosynthetic genes and genetic markers. BMC Genomics 12 Suppl 5, S5. doi: 10.1186/1471-2164-12-S5-S5

PubMed Abstract | Crossref Full Text | Google Scholar

Nasrollahi, V., Mirzaie-Asl, A., Piri, K., Nazeri, S., and Mehrabi, R. (2014). The effect of drought stress on the expression of key genes involved in the biosynthesis of triterpenoid saponins in liquorice (Glycyrrhiza glabra). Phytochemistry 103, 32–37. doi: 10.1016/j.phytochem.2014.03.004

PubMed Abstract | Crossref Full Text | Google Scholar

Shin, K. C. and Oh, D. K. (2016). Classification of glycosidases that hydrolyze the specific positions and types of sugar moieties in ginsenosides. Crit. Rev. Biotechnol. 36, 1036–1049. doi: 10.3109/07388551.2015.1083942

PubMed Abstract | Crossref Full Text | Google Scholar

Song, Y., Guan, Z., van Merkerk, R., Pramastya, H., Abdallah, I. I., Setroikromo, R., et al. (2020). Production of squalene in bacillus subtilis by squalene synthase screening and metabolic engineering. J. Agric. Food Chem. 68, 4447–4455. doi: 10.1021/acs.jafc.0c00375

PubMed Abstract | Crossref Full Text | Google Scholar

Su, H., Song, S., Yan, X., Fang, L., Zeng, B., and Zhu, Y. (2018). Endogenous salicylic acid shows different correlation with baicalin and baicalein in the medicinal plant Scutellaria baicalensis Georgi subjected to stress and exogenous salicylic acid. PloS One 13, e0192114. doi: 10.1371/journal.pone.0192114

PubMed Abstract | Crossref Full Text | Google Scholar

Suzuki, H., Fukushima, E. O., Umemoto, N., Ohyama, K., Seki, H., and Muranaka, T. (2018). Comparative analysis of CYP716A subfamily enzymes for the heterologous production of C-28 oxidized triterpenoids in transgenic yeast. Plant Biotechnol. (Tokyo Japan) 35, 131–139. doi: 10.5511/plantbiotechnology.18.0416a

PubMed Abstract | Crossref Full Text | Google Scholar

Wang, P., Wei, Y., Fan, Y., Liu, Q., Wei, W., and Yang, C. (2015). Production of bioactive ginsenosides Rh2 and Rg3 by metabolically engineered yeasts. Metab. Eng. 29, 97–105. doi: 10.1016/j.ymben.2015.03.003

PubMed Abstract | Crossref Full Text | Google Scholar

Wei, W., Wang, P., Wei, Y., Liu, Q., Yang, C., Zhao, G., et al. (2015). Characterization of panax ginseng UDP-glycosyltransferases catalyzing protopanaxatriol and biosyntheses of bioactive ginsenosides F1 and rh1 in metabolically engineered yeasts. Mol. Plant 8, 1412–1424. doi: 10.1016/j.molp.2015.05.010

PubMed Abstract | Crossref Full Text | Google Scholar

Yan, X., Fan, Y., Wei, W., Wang, P., Liu, Q., Wei, Y., et al. (2014). Production of bioactive ginsenoside compound K in metabolically engineered yeast. Cell Res. 24, 770–773. doi: 10.1038/cr.2014.28

PubMed Abstract | Crossref Full Text | Google Scholar

Yang, C., Li, C., Wei, W., Wei, Y., Liu, Q., and Zhao, G. (2020). The unprecedented diversity of UGT94-family UDP-glycosyltransferases in Panax plants and their contribution to ginsenoside biosynthesis. Sci. Rep. 10, 15394. doi: 10.1038/s41598-020-72278-y

PubMed Abstract | Crossref Full Text | Google Scholar

Zeng, X., Luo, T., Li, J., Li, G., Zhou, D., Liu, T., et al. (2018). Transcriptomics-based identification and characterization of 11 CYP450 genes of Panax ginseng responsive to MeJA. Acta Biochim. Biophys. Sin. 50, 1094–1103. doi: 10.1093/abbs/gmy120

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, T., Zhang, C., Wang, W., Hu, S., Tian, Q., Li, Y., et al. (2025). Effects of drought stress on the secondary metabolism of Scutellaria baicalensis Georgi and the function of SbWRKY34 in drought resistance. Plant Physiol. biochemistry: PPB 219, 109362. doi: 10.1016/j.plaphy.2024.109362

PubMed Abstract | Crossref Full Text | Google Scholar

Zhao, C., Gao, X., Liu, X., Wang, Y., Yang, S., Wang, F., et al. (2016). Enhancing biosynthesis of a ginsenoside precursor by self-assembly of two key enzymes in pichia pastoris. J. Agric. Food Chem. 64, 3380–3385. doi: 10.1021/acs.jafc.6b00650

PubMed Abstract | Crossref Full Text | Google Scholar

Zhao, M., Tuo, H., Wang, S., and Zhao, L. (2020). The effects of dietary nutrition on sleep and sleep disorders. Mediators Inflammation 2020, 3142874. doi: 10.1155/2020/3142874

PubMed Abstract | Crossref Full Text | Google Scholar

Zhu, F., Zhong, X., Hu, M., Lu, L., Deng, Z., and Liu, T. (2014). In vitro reconstitution of mevalonate pathway and targeted engineering of farnesene overproduction in Escherichia coli. Biotechnol. Bioeng. 111, 1396–1405. doi: 10.1002/bit.25198

PubMed Abstract | Crossref Full Text | Google Scholar

Zhuang, Y., Yang, G. Y., Chen, X., Liu, Q., Zhang, X., Deng, Z., et al. (2017). Biosynthesis of plant-derived ginsenoside Rh2 in yeast via repurposing a key promiscuous microbial enzyme. Metab. Eng. 42, 25–32. doi: 10.1016/j.ymben.2017.04.009

PubMed Abstract | Crossref Full Text | Google Scholar

Zuo, Y., Li, B., Guan, S., Jia, J., Xu, X., Zhang, Z., et al. (2022). EuRBG10 involved in indole alkaloids biosynthesis in Eucommia ulmoides induced by drought and salt stresses. J. Plant Physiol. 278, 153813. doi: 10.1016/j.jplph.2022.153813

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: Ziziphus jujuba var. spinosa, jujuboside biosynthesis, triterpenoid biosynthesis, transcriptome and metabolic analyses, CYP450, UDP- glucuronosyltransferase

Citation: Geng D, Yang X, Wang X, Chi X, Tang X, Ma X, Li X, Pu M and Shu J (2026) Integrated transcriptome and metabolome analyses reveal key genes regulating jujuboside biosynthesis in Ziziphus jujuba var. spinosa. Front. Plant Sci. 16:1708851. doi: 10.3389/fpls.2025.1708851

Received: 19 September 2025; Accepted: 08 December 2025; Revised: 20 November 2025;
Published: 08 January 2026.

Edited by:

Jian You Wang, Academia Sinica, Taiwan

Reviewed by:

Rufeng Wang, Shanghai University of Traditional Chinese Medicine, China
Shulin Zhang, Anyang Institute of Technology, China

Copyright © 2026 Geng, Yang, Wang, Chi, Tang, Ma, Li, Pu and Shu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jing Shu, c2h1amluZzc5QDE2My5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.