The chemical profiling of Salvia plebeia during different growth periods and the biosynthesis of its main flavonoids ingredients

Salvia plebeia (Lamiaceae) is a valuable medicinal plant widely distributed across Asia and Oceania. However, the composition and accumulation patterns of its active ingredients in different organs during the growth and their biosynthetic mechanism remain unknown. Therefore, we conducted metabolite profiling, transcriptomic analysis, and biological functional verification to explore the distribution, accumulation, and biosynthesis mechanisms of flavonoids in S. plebeia. We identified 70 metabolites including 46 flavonoids, 16 phenolic acids, seven terpenoids, and one organic acid, of which 21 were previously unreported in S. plebeia. Combining metabolomic-transcriptomic analysis and biological functional verification, we identified the key genes involved in biosynthesis of its main active ingredients, hispidulin and homoplantaginin, including SpPAL, SpC4H, Sp4CL2, Sp4CL5, SpCHS1, SpCHI, SpFNS, SpF6H1, SpF6OMT1, SpF6OMT2, SpUGT1, SpUGT2, and SpUGT3. Using the identified genes, we reconstructed the hispidulin and homoplantaginin biosynthesis pathways in Escherichia coli, and obtained a yield of 5.33 and 3.86 mg/L for hispidulin and homoplantaginin, respectively. Our findings provide valuable insights into the changes in chemical components in different organs of S. plebeia during different growth and harvest stages and establishes a foundation for identifying and synthesizing its active components.


Introduction
Salvia plebeia R. Br. is a valuable medicinal plant belonging to the genus Salvia in Labiatae, which widely distributed in China, Japan, Korea, Australia, and India.It has long history used for traditional herb medicine in the Compendium of Materia Medica above 490 years ago in the Ming Dynasty of China.Traditional Chinese Medicines (TCMs) believes that S. plebeia can clear heat and detoxify the toxins.Consequently, it has been included in the China Pharmacopoeia cure for bronchitis, hemorrhoids, and nephritis (Liang et al., 2020).Modern pharmaceutical findings have further reported that S. plebeia extract possesses anticancer, immunomodulatory, anti-inflammatory, antioxidant, and antiviral activities (Choi et al., 2020;Shin et al., 2020;Liang et al., 2021).
TCMs pharmacologists acknowledged that the different pharmacological effects of plant medicines are attributed to their potent secondary metabolites.In addition, the chemical components of TCMs are influenced by various factors, such as genetic background, organ and tissue specificity, growth stages, cultivation methods, harvest times, and processing/storage condition (Zhu et al., 2014).While the information is limited about the compositions and dynamic accumulation patterns of its active ingredients in different organs during the growth and development of S. plebeia.According to China Pharmacopoeia, the dried aerial part of S. plebeia should be harvested in summer (Commission, C.P., 1997).So that, most of the medicinal materials of the herb in Chinese herbal materials markets are originated from the dried aerial part of the plant possessing the flowers or fruits (Wang et al., 2010).However, literature also reported that in some place of Eastern China, the fresh seedlings of the plant without elongating stem, those are mainly basal leaves collected in spring, were used as folk medicine for treating senile chronic bronchitis and mastitis (Jiang Su Food and Drug Administration, 2016) and had good cure effects.Why do different medicinal parts and harvesting time of the same plant produce different therapeutic effects, and what are the molecular mechanisms involved in, which aroused our research interest.
The phytochemical investigations have showed that S. plebeia contains sesquiterpenoids, flavonoids, phenylpropanoids, phenolic acids and diterpenoids, etc (Liang et al., 2020).Previous reports indicated that flavonoids, especially hispidulin and homoplantaginin, are the main bioactive compounds of S. plebeia.Recent studies have demonstrated that hispidulin and homoplantaginin have multiple pharmacological effects, such as anticancer, anti-inflammatory, neuroprotective, anti-epileptic, alleviated vascular endothelial cell apoptosis (Lin et al., 2015;He et al., 2016;An et al., 2019;Liu et al., 2020;Fan et al., 2023).However, the difficulties in purification and the challenges in chemical synthesize limit their further development and wider application in clinical studies.Biosynthesis is expected to solve this problem with its advantages of low cost, high yield and environmental friendliness.Whereas, the information on flavonoid biosynthesis in S. plebeia, including relevant biosynthetic enzymes, genes and their regulation mechanisms is also unclear.Based on previous reports, the biosynthetic pathway of homoplantaginin can be predicted (Li et al., 2019;Nabavi et al., 2020;Pei et al., 2022).The homoplantaginin biosynthesis pathway is initiated by PAL, PAL converts phenylalanine to cinnamic acid.Subsequently, cinnamic acid reacts with cinnamon-4-hydroxylase (C4H) and 4-coumaryl-CoA ligase (4CL) to produce 4-coumaryl-CoA, the precursor compound of naringin chalcone.Chalcone synthase (CHS) is the starting enzyme for the synthesis of flavonoids, and its product naringin chalcone is transformed into naringenin by chalcone isomerase (CHI).Then apigenin is produced by flavone synthase II (FNS).Apigenin is converted to hispidulin under the flavone 6-hydroxylase (F6H) and flavone 6-Omethyltransferase (F6OMT).Eventually, UDP-glycosyltransferase (UGT) converts hispidulin into homoplantaginin.Although the key enzymes for homoplantaginin biosynthesis can be predicted from other species plant biosynthetic pathway, the complete biosynthetic pathway of homoplantaginin has not been reported in plants.
Here, we systematically analyzed the chemical profiles of different organs in S. plebeia at different developmental stages using UPLC-Q-TOF-MS/MS and HPLC-DAD methods.In addition, a transcriptome dataset of S. plebeia was constructed for the first time to analyze the expression levels of genes related to flavonoid biosynthesis in different organs at different developmental stages.Finally, all candidate genes were functionally verified, and a synthetic biology platform was constructed to obtain main flavonoids of S. plebeia.This result provides insights into the chemical profiling of S. plebeia and would help in the further exploitation.

UPLC-Q-TOF-MS/MS and HPLC-DAD conditions
All samples were crushed, weighed 0.5 g into 10 mL of 70% methanol (0.02 mg/mL baicalin and 0.01 mg/mL irigenin as internal standards for UPLC-Q-TOF-MS/MS assay).The mixed solution was sonicated for 30 min.After ultrasonic extraction, the extraction solution centrifuged at 10,000 rpm for 30 min.Then, the supernatant was filtered by a 0.22µm membrane filter.The solutions were injection into the column for UPLC-Q-TOF-MS/ MS and HPLC-DAD analysis.

Illumina sequencing and transcriptomic analysis
Total RNA was isolated from S. plebeia using the RNA simple total RNA kit (TIANGEN, Beijing, China).A cDNA library was then constructed and sequenced on an Illumina NovaSeq 6000 platform (Illumina, San Diego, CA, USA).All raw reads were uploaded to NCBI at SRA database (PRJNA952802) and include 21 accession items (SRR24128449−SRR24128469). FPKM (the number of fragments per kilobase transcript per million fragment map) was used to calculate the gene expression level.Genes with | log2FC|≥1 and Benjamini−Hochberg-adjusted P-values<0.05 were determined as differentially expressed genes (DEGs).Using the following databases for gene functional annotation: NCBI nonredundant protein sequences (Nr), Kyoto Encyclopedia of Genes and Genomes (KEGG), NCBI non-redundant nucleotide sequences (Nt), Gene Ontology (GO), a manually annotated and reviewed protein sequence database (Swiss-Prot), a manually annotated and reviewed protein sequence database (Pfam), and Clusters of Orthologous Groups of Proteins (KOG/COG).
Based on the results of gene functional annotation, the candidate key genes PAL, C4H, 4CL, CHS, CHI, F6OMT, and UGT were selected in the homoplantaginin biosynthesis pathway.The sequences of FNS and F6H were obtained by a local BLAST algorithm-based search and relevant literature (Chen et al., 2014;Zhao et al., 2018).Phylogenetic tree construction with MEGA11 (https://megasoftware.net/).The Neighbor-Joining method was used to construct this tree with bootstrap (n=1000).

Quantitative real-time PCR analysis
Total RNA from leaves, roots, stems, and flowers of S. plebeia was extracted as the protocol described above.Reverse transcription was done by using HiScript II Q RT SuperMix for qPCR (Code: R223-1, Vazyme, Nanjing, China).The ChemQ SYBR qPCR Master Mix (Code: 341-02, Vazyme, Nanjing, China) was used for qRT-PCR analysis and gene-specific primer pairs (Table S2).The relative expression levels of the target genes were evaluated using the 2 -△△Ct approach with Spb-actin as the reference gene (Livak and Schmittgen, 2001).Each sample was analyzed in three biological replicates.

Reconstitution of hispidulin and homoplantaginin biosynthesis pathway in bacterial and product analysis
The genes of homoplantaginin pathway with known function (SpFNS, SpF6H1, SpF6OMT2, and SpUGT1) were inserted in different combinations into the vectors pET32a and pACYCDuet-1 downstream of T7 promoters.The 3′ end of SpFNS and SpF6H1 were fused to the 5′ end of AtCPR via the linker sequence ACTAGTGGTTCTACCTCTTCTGGTTCTGGT.The 5′ end of SpUGT1 connects the MBP sequence of pSJ8-MBP vector.All the recombinant E. coli strains used in this study are shown in Table S3.The recombinant plasmids were transformed into E. coli after sequencing verified.The recombinant strains were cultured in LB medium with antibiotics at 37°C and 200 rpm until the OD 600 reached 0.6-0.8.Then, 0.5 mM IPTG and 100 mM naringenin were added and cultured at 20°C and 135 rpm.After 24 h, 10 mL fermentation liquor was taken and mixed with a double volume of ethyl acetate to extract the reactions.The ethyl acetate layer was dried under reduced pressure, and dissolved with 100 mL methanol for HPLC analysis.

Chemical profiling of different tissues of S. plebeia at developmental stages
The chemical compositions of different organs of S. plebeia during growth and development were analyzed using UPLC-Q-TOF-MS/MS method.The total ion chromatograms (TICs) of different organs and developmental stages of S. plebeia are shown in Figures 2, S1.Combined with information such as the parent ion, molecular formula, retention time, secondary fragmentation, and relevant literature, a total of 70 compounds including 46 flavonoids, 16 phenolic acids, seven terpenoids, and one organic acid, were accurately or tentatively identified (Table 1; Figure 3).We identified 46 flavonoids, including 24 flavones, 17 flavonones, 4 flavanones and one flavanonol, of which 21 compounds were discovered in S. plebeia for the first time (Table 1).Especially, the peaks F2 and F5 were tentatively identified as new flavanone diglycosides from S. plebeia.The characteristic product ions and neutral losses are shown in the Figure S2.From the TICs and Venn graph (Figures 2,  S3), we can see that there was small difference in the number of the identified compounds in the samples of different developmental stages, which 43 compounds were common.However, the number of the identified compounds in different organs varies greatly.For instance, 68 compounds were identified in the leaf, while only 21 compounds were identified in the root.Principal component analysis (PCA) also revealed that the stems and roots, flowers, and leaves clustered into three groups (Figure 4A).The distribution of compounds in different organs of S. plebeia is quite different.There was little difference in the compounds present in the roots and stems at different stages.Therefore, we selected leaves from different periods and different organs from the flowering stage for differential metabolite analysis.The groups showed clear separation in the OPLS-DA score plots of S. plebeia with satisfactory goodness of fit and statistical significance (Figure S4).Twenty-four of the 70 annotated metabolites were found to be differential metabolites (DEMs) (Table S4).Based on the relative contents of the DEMs in the different parts at different developmental stages, a heatmap was created (Figure 4B) and revealed differences.The compounds were mainly distributed in leaves and flowers and less existed in the stems and roots.Specifically, the flavonoids were the main DEMs and mainly distributed in the leaves, while terpenoids mostly existed in the flowers.Besides, the flavonoid metabolites in different developmental stages showed differences.The main DEMs were flavonoid aglycones in L1, and glycosides in L2, L3, and L4.
To further accurately explore the contents of the main flavonoids in different parts at different plant developmental stages, 10 flavonoids (6-hydroxyluteolin 7-O-b-glucoside, nepitrin, luteolin 7-O-b-glucoside, apigenin 7-O-b-glucoside, homoplantaginin, luteolin, nepetin, 6-methoxynaringenin, apigenin, and hispidulin) were quantitatively analyzed using HPLC-DAD.Results indicated that flavonoid content varied among different organs and at different growth periods of the plant (Figures 4C, D).Regarding the distribution of flavonoids among different organs, the flavonoids were mainly distributed in leaves and flowers and to a less extent in the stems and roots.In different growth and development periods of the plant, flavonoid content peaked at the stem elongation stage.Specifically, flavonoid content in the leaves followed a parabolic trend and peaked when the stem elongated (L2, 18.21 ± 0.33 mg/g).Similarly, the flavonoid content in the stems was also the highest in S2 (8.97 ± 0.25 mg/g).In addition, the flavonoid content in flowers was up to 16.73 ± 0.16 mg/g (F3).A fluctuating trend of increase, decrease, and increase trend was observed in the roots; however, the highest yield was only 0.94 ± 0.08 mg/g at the ripening stage (R4).
Notably, hispidulin and homoplantaginin exhibited the highest levels in all the organs at each developmental stages of the plant.Our results showed that they accumulated differently in different growth stages (Figure S5).The content of hispidulin reached the highest level of 6.55 ± 1.44 mg/g in the leaves at the basal leaf stage, and dropped to about 1 mg/g from vegetative growth to reproductive stage, but remained abundant in flowers (6.07 ± 0.43 mg/g).Homoplantaginin content in leaves gradually increased from 3.63 ± 0.59 to 6.38 ± 0.68 mg/g from the basal leaf to the ripening stage.A similar trend was observed for accumulation of 6-hydroxyluteolin 7-O-b-glucoside, luteolin 7-O-b-glucoside, and nepitrin.As the glycosides are maybe more stable than aglycones in the plant (Bowles et al., 2005), aglycones might transform into glycosides during the accumulation process.The homoplantaginin content in the stem was high in the vegetative growth period, but decreased during the reproductive stage.In addition, similar to hispidulin, homoplantaginin was abundant in flowers (5.73 ± 0.59 mg/g).

Transcriptomic analysis of different organs and developmental stages of S. plebeia
To further study the mechanisms of flavonoid biosynthesis in different organs and developmental stages of S. plebeia, 21 samples were used for transcriptomic analysis.The analysis produced approximately 161.44 Gb of clean data and 58,905 unigenes (140,509 transcripts) with a mean length of 1161.286nt and an N50 of 1754 nt, following de novo assembly.The Q30 and GC percentages ranged from 94.46% to 95.15% and from 47.3% to 49.96%, respectively (Tables S5, S6).The KEGG, NR, GO, Swissprot, KOG, COG, TrEMBL, Pfam, and eggNOG databases were used to annotate the predicted protein sequences.Finally, 44,185 unigenes were functionally annotated (Table S7).Results from qPCR validation were mostly consistent with those obtained using RNA-seq, indicating the accuracy of the sequencing data (Figure S6).Accordingly, the RNA-seq data were used for further analysis.We then screened the differentially expressed genes (DEGs) in different organs and developmental stages of S. plebeia.There were 7896 DEGs in F3, 6366 DEGs in R3, 4471 DEGs in S3, 5112 DEGs in L1, 3798 DEGs in L2, 4211 DEGs in L4 compared with those in L3 (Figure 5A; Table S8).The number of DEGs in different tissues was higher than that in the developmental stages, which were consistent with the trend of metabolite accumulation.The statistical significance of the differences in gene expression in S. plebeia in different tissues and developmental stages was also represented using the volcano plots (Figure S7).The number of DEGs varied in different groups, and there were more special DEGs in the flowers and roots (Figure 5B).DEGs were further assigned to the KEGG pathway.Interesting, the metabolites involved in "flavonoid biosynthesis" and "phenylpropanoid biosynthesis" were significantly enriched in different tissues and developmental stages (Figure S8).1915 301, 283, 267, 241, 215, 201 2,11,12-Trihydroxy-7,20-epoxy-8,11,13-abietatriene F, L (2,3,4)  Chemical structures of the compounds identified in S. plebeia.

Identification and functional validation of candidate biosynthetic enzymes involved in hispidulin and homoplantaginin biosynthesis
The biosynthetic pathways of flavonoids have been extensively studied in Arabidopsis thaliana, Scutellaria baicalensis, Carthamus tinctorius (Saito et al., 2013;Xu et al., 2020;Wang et al., 2021).Hence, the biosynthetic pathway from L-phenylalanine to homoplantaginin was predicted, comprising nine consecutive steps (Figure 6A).To fully screen for the possible unigenes involved in the biosynthesis of hispidulin and homoplantaginin, we mined the candidate genes by combining differential transcript analysis with transcriptome annotation.A total of 49 candidate genes, including two PAL, 17 4CL, two C4H, three CHS, two CHI, five F6OMT, and 18 UGT, were obtained.However, FNS and F6H were not annotated in the transcriptome analysis.Therefore, we searched for these two genes using the local BLAST and relevant literature.FNS has been reported to be divided into FNS I and FNS II.FNS I is mainly found in Umbelliferae, whereas FNS II is widely distributed in plants containing flavonoids.Therefore, we chose FNS I (GenBank: AY817680.1)from Petroselinum crispum (Umbelliferae) and FNS II (GenBank: KP337723.1)from Salvia miltiorrhiza (Labiatae) as templates to conduct a similarity search against the S. plebeia RNA-sequencing (RNA-Seq) datasets.However, only one gene (c127704.graph_c0)with high similarity to FNS II was found.The phylogenetic analysis demonstrates that SpFNS clustered with members of the FNS II clade and was closest in affinity to the CYP93B25 from Salvia officinalis, and FNS II from Ziziphora clinopodioides (Figure S9).F6H is mainly from the CYP71D family of Leguminosae and the CYP82D family of Labiatae.Considering the phylogenetic relationships among the species, the F6H of S. plebeia is more likely to have originated from the CYP82D family.Hence, c120693.graph_c0,c125994.graph_c1,and c127863.graph_c4were annotated as CYP82D family in our datasets, which named SpF6H1, SpF6H2, and SpF6H3.These three genes cluster in family CYP82D with CYP82D1.1 from Scutellaria baicalensis, which has been shown to have broad catalytic activity for 6-hydroxyflavonoids such as apigenin (Figure S10).
Finally, 53 unigenes were selected as candidate genes (Table S10).The sequences of the 53 genes were used as the queries in a BLAST algorithm-based search of NCBI databases to predict their possible functions and the open reading frame (ORF) sequences.Finally, 28 candidate genes were selected to validate their catalytic functions (Table S1).
Specifically, SpPAL, SpC4H, Sp4CL1-6, SpCHS1-3, SpCHI, SpFNS, SpF6H1-2, SpF6OMT1-3, and SpUGT1-10 were expressed in E. coli, respectively.The recombinant proteins were extracted and purified for biochemical analysis in vitro.HPLC-DAD and LC-MS analyses showed that SpPAL can catalyze the formation of cinnamic acid from phenylalanine (Figure 6B).When AtCPR proteins and NADPH were added, SpC4H was able to catalyze the production of 4-coumaric acid from cinnamic acid (Figure 6C).As we do not have the standard of 4-coumaryl-CoA, we identified the reaction products of Sp4CL2 and Sp4CL5 by comparing the different peak areas of 4-coumaryl-CoA at 330 and 260 nm according to a previous report (Figure 6D) (Li et al, 2010).Similarly, in order to verify the enzymatic activity of SpCHS1-3, we used a binary plasmid with pCDFDuet-RtTAL-Pt4CL and pET28a-SpCHS1-3 co-expressed in E. coli.The co-expression of RtTAL and Pt4CL can directly catalyze tyrosine into 4-coumaryl CoA, providing a sufficient substrate for the catalytic reaction of SpCHS.Upon feeding with tyrosine, naringenin chalcones were found in the culture extracts (Figure 6E).Although naringenin chalcone can be converted to naringenin spontaneously, it can be converted almost completely instantaneously in the presence of CHI.As shown in Figure 6F, some naringenin was also generated in the control group, but naringenin chalcone was completely consumed in the sample group to generate naringenin, indicating the catalytic activity of SpCHI.FNS and F6H, which belong to the CYP450 family and require the NADPH-CPR to provide electrons, are similar to C4H.AtCPR was co-expressed with SpFNS and SpF6H1-2, respectively.SpFNS produced apigenin from naringenin (Figure 6G).SpF6H1 produced scutellarein from apigenin, a precursor of hispidulin (Figure 6H).SpF6OMT1 and SpF6OMT2 converted scutellarein into hispidulin (Figure 6I).Thus, SpPAL, SpC4H, Sp4CL2, Sp4CL5, SpCHS1, SpCHI, SpFNS, SpF6H1, SpF6OMT1, and SpF6OMT2 formed a complete biosynthetic pathway for hispidulin.In the final step of homoplantaginin biosynthesis, SpUGT1, SpUGT2, and SpUGT3 produced homoplantaginin from hispidulin in vitro using UDPglucose as a sugar donor (Figure 6J).In sum, all the enzymes involved in the hispidulin and homoplantaginin biosynthetic pathways were identified.

Heterologous production of hispidulin and homoplantaginin in E. coli
To further investigate the key enzymes involved in the hispidulin and homoplantaginin pathway in the S. plebeia, we reconstructed the pathway leading from naringenin to homoplantaginin in E. coli (Figures 7A).Simultaneously introduced SpF6H1 and SpUGT1 into E. coli to harbor AtCPR, the recombinant E. coli HP1 was able to produce scutellarein and homoplantaginin (Figures 7B, C).When SpFNS and SpF6OMT2 were simultaneously introduced into E. coli, which harbored the AtCPR, the resulting recombinant E. coli HP2 was able to produce apigenin and hispidulin (Figures 7D, E).The pET32a vector carrying SpF6H1 and SpUGT1 and the pACYCDuet-1 vector carrying SpFNS and SpF6OMT2 were transferred into E. coli to construct HP3. HP3 was able to produce hispidulin and homoplantaginin under naringenin induction.After 48 h of cultivation, the accumulation of hispidulin and homoplantaginin reached to 5.33 and 3.86 mg/L, respectively (Figure 7F).Although the yield was not desirable, it could be increased through metabolic engineering and fermentation optimization to make the full use of the E. coli fermentation system in the future.

Discussion
Flavonoids were found to be the main components of S. plebeia, which are mainly present in leaves and flowers owing to their photosynthetic function and UV protection (Mathesius, 2018).Flavonoid levels vary during plant growth.During the basal leaf period in early March (stage 1), flavonoids levels in the leaves reached a peak during winter.As the plant matures, growth accelerates.The flavonoid content reached its maximum value in early April (stage 2).In early May, S. pebeia enters the reproductive stage.During the flowering stage (stage 3), the flavonoid content in the stems and leaves decreased to some extent, whereas the flavonoid content in flowers was as high as 16.73 ± 0.16 mg/g, which was higher than that in the leaves.This might be due to the fact that flavonoids are mostly synthesized in breeding organs for reproductive growth at the bloom stage, while synthesis of flavonoids in vegetative organs may be suppressed.The transcriptomic analysis showed that most of the genes involved in flavonoid synthesis were highly expressed in flowers (Figure 5D).At the later stage of reproduction (stage 4, early June), the flavonoid content in the stems and leaves decreased to its lowest.These findings suggest that late April (before the bloom stage) is the optimum harvest period for S. plebeia.Contrastingly, the contents of phenolic diterpenes, such as rosmanol and carnosol, were higher in flowers and leaves at the flowering stage (Figure 4B).Rosmanol and carnosol also possess anti-inflammatory and anti-tumor properties (Loussouarn et al., 2017;Jiang et al., 2021).Therefore, harvesting S. plebeia at the flowering stage is supported by pharmacopeial guidelines.Overall, determining the optimal harvest time for S. plebeia should be based on the desired chemical constituents and their intended use.If flavonoids are the main evaluation index, late April (before the bloom stage) is the optimum harvest time.If both flavonoids and terpenoids are in demand, May (the flower stage) is the optimum harvest time for S. plebeia.
We also characterized the complete biosynthetic pathway of hispidulin and homoplantaginin, which are main components of S. plebeia.The biosynthetic pathway from L-phenylalanine to scutellarein has been reported (Pei et al., 2022).PaF6OMT from the liverwort species Plagiochasma appendiculatum can convert scutellarein into hispidulin (Zhang et al., 2016).Although the key enzymes involved in hispidulin biosynthesis have been reported in other plant biosynthetic pathways, the complete pathway of hispidulin biosynthesis in S. plebeia has not been reported.Homoplantaginin is a 7-O-glycosylated hispidulin product.Glycosylation is commonly post-modified of flavonoids, which is beneficial to improve the stability and water solubility of flavonoids.Thus, it can help the regulation of bioactivity and the storage and detoxification of xenobiotics in plants (Bowles et al., 2005).In addition, glycosylation can be better absorbed by the human body compared to aglycones and has the potential to improve pharmacokinetic and pharmacodynamic profiles, making it better prospects for clinical use (Goel et al., 2021).While flavonoid 7-Oglycosyltransferase has been identified in many plants (Han et al., 2021;Pei et al., 2022), the UGT responsible for catalyzing hispidulin to homoplantaginin has not been reported.In this study, SpUGT1, SpUGT2, and SpUGT3 were confirmed to be the flavonoid-7-Oglycosyltransferase.SpUGT1, SpUGT2 and SpUGT3 were submitted to the UGT Nomenclature Committee and designated as UGT88A47, UGT88T1 and UGT71AP4, respectively.The UGT88 family was previously reported as a unique UGT family in Lamiales (Noguchi et al., 2009).Based on this, downstream genes were reconstructed in E. coli to enable biosynthesis of hispidulin and homoplantaginin using naringin as a precursor.However, biosynthesis was not initiated from tyrosine or glucose, and further work by synthetic biologists is necessary.Future measures to increase precursor supply, enhance cofactor levels, establish specific metabolic channels, and use co-culture technology will be taken.

Conclusion
This study presents a comprehensive and systematic analysis of the chemical profile of various tissues of S. plebeia at different developmental stages, which is a novel contribution to the scientific literature.Furthermore, for the first time, we generated a transcriptome dataset for S. plebeia across different developmental stages and tissues.Through the integration of metabolomictranscriptomic analysis and in vitro functional verification, we identified the enzymes involved in the biosynthetic pathways of hispidulin and homoplantaginin.In addition, we successfully reconstructed hispidulin and homoplantaginin in a heterologous host.These findings enhance our understanding of the distribution, accumulation, and biosynthesis mechanisms of flavonoids and provide a series of candidate genes for synthetic biology applications aimed at the production of natural products.
FIGURE 2 The total ion chromatograms (TIC) of S. plebeia at the flower stage in negative ion mode.(A) TIC of methanol extracts from leaves.(B) TIC of methanol extracts from flowers.(C) TIC of methanol extracts from stems.(D) TIC of methanol extracts from roots.
FIGURE 5 Differentially expressed genes (DEGs) of S. plebeia with different tissues and developmental stages.(A) The changes of upregulated DEGs and downregulated DEGs in different groups, with L3 as control groups.(B) Venn diagram for all DEGs in different groups, with L3 as control groups.(C) The heatmap was drawn based on relative expression of annotated flavonoid biosynthesis genes in KEGG pathway in different developmental stages.(D) The heatmap was drawn based on relative expression of annotated flavonoid biosynthesis genes in KEGG pathway in different developmental tissues.
FIGURE 7Reconstitution of the biosynthesis pathway of homoplantaginin in E. coli.(A) Schematic of the recombinant E. coli strain HP3.(B) HPLC analysis of apigenin (AG) and scutellarein (SA) standards and the fermented products of the recombinant E. coli strain HP1 with apigenin as a precursor.(C) HPLC analysis of hispidulin (HD) and homoplantaginin (HG) standards and the fermented products of the recombinant E. coli strain HP1 with hispidulin as a precursor.(D) HPLC analysis of naringenin (NG) and apigenin (AG) standards and the fermented products of the recombinant E. coli strain HP2 with naringenin as a precursor.(E) HPLC analysis of scutellarein (SA) and hispidulin (HD) standards and the fermented products of the recombinant E. coli strain HP2 with scutellarein as a precursor.(F) HPLC analysis of the activity of the recombinant E. coli strain HP3 by feeding naringenin (NG).

TABLE 1
Characterization of chemical constituents of S. plebeia by UPLC-Q-TOF-MS/MS.