Randomized trial and multi-omics, machine learning–based mechanistic exploration of daixie decoction granules in type 2 diabetes

Zheng, Zhong; Zhang, Kepei; Ma, Xiaogang; Qian, Yuezhou; Wang, Miao

doi:10.3389/fphar.2025.1723584

ORIGINAL RESEARCH article

Front. Pharmacol., 05 January 2026

Sec. Integrative and Regenerative Pharmacology

Volume 16 - 2025 | https://doi.org/10.3389/fphar.2025.1723584

This article is part of the Research TopicTargeting Dysregulation in Metabolic Diseases: Potential Therapeutic InterventionView all 3 articles

Randomized trial and multi-omics, machine learning–based mechanistic exploration of daixie decoction granules in type 2 diabetes

Zhong Zheng^†

Kepei Zhang^†

Xiaogang Ma

Yuezhou Qian*

Miao Wang*

Traditional Chinese Medicine Department, Longhua Hospital, Shanghai University of Traditional Chinese Medicine, Shanghai, China

Introduction: Traditional Chinese Medicine (TCM) offers multi-target strategies for Type 2 Diabetes Mellitus (T2DM), but its mechanisms are unclear. This study combined a randomized controlled trial (RCT) with a multi-omics approach to evaluate the efficacy of Daixie Decoction granules (DDG) as an add-on therapy to metformin and to generate mechanistic hypotheses using a multi-omics framework.

Methods: We conducted a randomized, double-blind, placebo-controlled trial involving 136 randomized and 128 completed with DDG plus metformin or placebo plus metformin for 6 months. Mechanistic prediction was based on network pharmacology, integration of T2DM-related genes from public databases (GeneCards, DisGeNET, OMIM), and transcriptomic differentially expressed genes (DEGs) from GEO. Seven machine learning algorithms were applied to prioritize core targets from the overlapping candidate list. A nested serum proteomics sub-study within the randomized trial, with tissue-specific expression profiling (GTEx), was then used to explore the consistency of these computational predictions at the protein and tissue levels. Statistical analysis was performed using appropriate parametric and nonparametric tests, including ANCOVA where applicable.

Results: DDG reduced HbA1c compared with placebo (−0.32%, P=0.032). Fasting plasma glucose showed a borderline reduction (P=0.050). Network pharmacology identified 617 potential targets intersecting with 2,652 DEGs, yielding 29 candidates. Using machine-learning combined with protein–protein interaction topology and literature support, we further prioritized eight core targets (P2RX7, IL1B, PTPN1, AKT2, CD38, NFE2L2, NOS3, and MERTK). Enrichment analyses of these candidates, together with serum proteomic profiling, implicated PI3K–Akt signaling, inflammatory and oxidative stress responses, and focal adhesion–related pathways.

Conclusion: Clinically, DDG used as add-on therapy to metformin produced a modest but statistically significant improvement in glycemic control in patients with inadequately controlled T2DM. Our findings are consistent with the hypothesis that DDG may act through a multi-target network spanning inflammatory (P2RX7, IL1B), insulin/metabolic (PTPN1, AKT2, CD38), oxidative–endothelial (NFE2L2, NOS3) and vascular-resolution (MERTK) axes, generating testable mechanistic hypotheses for future experimental studies.

1 Introduction

Type 2 diabetes mellitus (T2DM) is a chronic, progressive metabolic disorder characterized by hyperglycemia resulting from insulin resistance and β-cell dysfunction (NCD Risk Factor Collaboration NCD-RisC, 2024). Its global prevalence continues to rise (Sun et al., 2022). While metformin remains the first-line therapy (ElSayed et al., 2023), many patients fail to achieve satisfactory glycemic control (Davies et al., 2022; Heyward et al., 2021). Hence, effective and safe adjunctive options are needed.

Traditional Chinese Medicine (TCM) is increasingly explored for metabolic disease (Chen et al., 2025; Liu et al., 2023). Compound herbal formulas, in particular, may modulate multiple pathological processes, aligning with T2DM´s multifactorial nature (Ni et al., 2024). Recent studies have shown promising effects of certain formulas in regulating glucose metabolism, inflammation, and oxidative stress (Meng et al., 2023; Zhang Q. et al., 2024). However, current research often focuses on isolated compounds or single mechanisms, neglecting the synergistic actions of whole formulas (Zhang B. et al., 2024). Yet integrative, translational multi-omics studies remain limited (Zhai et al., 2025).

The Daixie Decoction granules (DDG) is a classical TCM prescription for metabolic disorders. We first conducted a randomized controlled trial (RCT) to evaluate its efficacy on glycemic outcomes in patients with Type 2 Diabetes Mellitus (T2DM). An integrated multi-omics strategy-combining network pharmacology, transcriptomic analysis, machine learning, proteomics, and molecular docking-was then applied to elucidate its underlying mechanisms. This study is among the first to link randomized clinical evidence with multi-layer omics validation in a TCM formula for T2DM, offering novel mechanistic insights and translational value.

2 Methods

2.1 The workflow of the study

This study comprises three sequential phases: (1) clinical validation through a double-blind, randomized, placebo-controlled trial; (2) mechanistic prediction via network pharmacology,followed by machine-learning- based target prioritization; (3) a nested serum proteomics sub-study within the RCT, combined with tissue-specific expression profiling from GTEx, to explore molecular correlates of the intervention. The overall pipeline from clinical endpoints to computationally inferred mechanistic hypotheses is shown in Figure 1.

Figure 1

Multi-phase integrative workflow diagram for T2DM target identification. Phase 1 involves a randomized controlled trial with 136 participants focusing on DDG efficacy, showing significant HbA1c reduction. Phase 2 includes computational target prediction with 29 overlapping candidate targets identified through network pharmacology, disease databases, and GEO transcriptomics. Processes involve machine learning ranking, PPI network analysis, and literature validation, resulting in eight prioritized core targets: P2RX7, IL1B, PTPN1, AKT2, CD38, NFE2L2, NOS3, MERTK. Phase 3 emphasizes multi-omics exploration, nested proteomic sub-study, cross-omics concordance, and GTEx tissue expression profiling, ensuring biological plausibility.

Figure 1. Workflow overview.

2.2 Randomized controlled trial (RCT)

2.2.1 Study design and oversight

This was a prospective, two-arm, parallel-group randomized controlled trial (ChiCTR2000036290), conducted at Longhua Hospital, Shanghai University of Traditional Chinese Medicine, between September 2020 and March 2022, and was approved by the Ethics Committee of Longhua Hospital (2020LHSB026).

2.2.2 Participants

From September 2020 to March 2022, 238 patients were screened, 136 participants were enrolled and randomized, and 128 were included in the final analysis.

Key inclusion criteria were as follows: (1) Age between 18 and 65 years; (2) Met the diagnostic criteria for type 2 diabetes; (3) New diagnosis of T2DM within the past 12 months with no prior use of any anti-diabetic medication; (4) A fasting plasma glucose (FPG) level between 7.0 mmol/L and 13.9 mmol/L after a 2-week run-in period of diet and exercise control.

Key Exclusion Criteria: (1) Glycated hemoglobin (HbA1c) level >9%; (2) Patients requiring insulin therapy; (3) Patients with severe hepatic insufficiency (transaminase levels more than twice the upper limit of normal), renal failure (serum creatinine >115 μmol/L), or heart failure (NYHA class III-IV); (4) Use of weight-loss supplements, antidepressants, or hormonal medications within the past 3 months or during the study period; (5) Pregnancy or lactation.

2.2.3 Randomization and blinding

A centralized, stratified block randomization sequence was generated using SAS by an independent statistician. Allocation concealment was maintained using sequentially numbered opaque envelopes.

Double-blinding was ensured for all stakeholders. Placebo granules (identical in appearance and packaging) contained only excipients (sucrose, lactose, dextrin) and were supplied by the same manufacturer as DDG.

2.2.4 Interventions

All participants received standardized guidance on diet and exercise. Participants were randomly assigned in a 1:1 ratio to either the intervention group or the control group for a 6-month treatment period.

Intervention Group (DDG + Metformin):

DDG granules: One 10 g packet, twice daily, dissolved in boiling water. Each packet of DDG consisted of the following herbal components: Atractylodes lancea 18 g, Atractylodes macrocephala 18 g, Salvia miltiorrhiza 15 g, Polygonum cuspidatum 12 g, Folium ilicis cornutae 15 g, Scutellaria 12 g, and Linderae radix 9 g. Metformin: Treatment was initiated at 0.5 g once daily after dinner for the first week. The dose was then titrated over 4 weeks based on patient tolerance to a maximum of 2 g daily (1 g twice daily) and maintained until the end of the treatment period.

Control Group (Placebo + Metformin):

Placebo granules: One packet, twice daily. The usage, dosage, and appearance were identical to the DDG granules.

Metformin: The dosage and titration schedule were identical to that of the intervention group.

The use of any other hypoglycemic, lipid-lowering, or weight-loss medications was prohibited during the trial.

2.2.5 Outcomes and safety assessment

The primary endpoint of this study was the change in HbA1c from baseline to 6 months. Secondary endpoints included changes in FPG, fasting insulin, C-peptide levels, HOMA-IR and other related parameters. Safety was assessed by monitoring liver and renal function tests.

2.2.6 Statistical analysis

Continuous variables were compared between groups using Student’s t-test; categorical variables were compared using the chi-square test. Normality of baseline variables was evaluated using the Shapiro–Wilk test and visual inspection of Q–Q plots. Given the sample size (N > 60), t-tests were considered robust even when minor deviations from normality were present.

Between-group differences in post-treatment outcomes were assessed using analysis of covariance (ANCOVA), with the corresponding baseline value included as a covariate. To address the modest baseline imbalance in BMI, additional post hoc sensitivity analyses were conducted using ANCOVA models further adjusted for baseline BMI. For skewed variables, log-transformed ANCOVA models were also performed as supplementary sensitivity analyses.

Missing data were handled using multiple imputation with predictive mean matching (PMM, m = 10), and estimates were pooled using Rubin’s rules. All analyses were two-sided with a significance threshold of P < 0.05.

2.3 Network pharmacology and bioinformatics analysis

2.3.1 Identification of active compounds and potential targets

Active compounds in the DDG formula were retrieved from the Traditional Chinese Medicine Systems Pharmacology Database (TCMSP, http://tcmspw.com) (Ru et al., 2014) using predefined thresholds of oral bioavailability (OB) ≥ 30% and drug-likeness (DL) ≥ 0.18 (Xu et al., 2012). Compound–target associations were obtained from TCMSP or, if unavailable, predicted using SwissTargetPrediction (http://www.swisstargetprediction.ch/) (Gfeller et al., 2014). All targets were standardized via UniProt (https://www.uniprot.org) (UniProt Consortium, 2021), with species restricted to Homo sapiens.

2.3.2 Identification of T2DM-associated disease targets

T2DM-related genes were collected from GeneCards (Stelzer et al., 2016), DisGeNET (Piñero et al., 2020), and OMIM (Amberger et al., 2019). Using the search term “Type 2 Diabetes Mellitus”. The union of all retrieved gene entries was compiled into a non-redundant disease target list.

2.3.3 Differentially expressed genes (DEGs) analysis

Transcriptomic datasets from the Gene Expression Omnibus (GEO) database (Barrett et al., 2013) were integrated, that share the comparable biological contrast of T2DM versus healthy status. When merging multiple sets of transcriptome data, filtering low-abundance genes is a key preprocessing step. Specifically, genes with counts ≥10 in at least three samples are retained.

The merged count data was analyzed using the DESeq2 package (Love et al., 2014) in R (Team R. R, 2014). Batch effects originating from the distinct GEO series were controlled by incorporating a Batch factor into the design formula:{design} = {Batch} + {Group}. DEGs were identified using the DESeq2 with adjusted p-value <0.05 and |log2FC| ≥ 0.5 as cutoffs (Schurch et al., 2016), which is commonly used to balance sensitivity and biological relevance, avoiding overly stringent filters that may eliminate moderate but functionally meaningful changes.

2.3.4 Identification of core therapeutic targets

We intersected: (1) DDG-predicted targets, (2) T2DM disease genes, and (3) GEO DEGs. To identify core therapeutic targets, we employed a three-step evidence strategy.

2.3.4.1 Machine learning prioritization

The aim of the machine-learning analysis was to prioritize candidate genes according to their contribution to discriminating T2DM patients from non-diabetic controls. LASSO (Regression Shrinkage and Selection Via the Lasso, 1996), SVM-RFE (Guyon et al., 2002), random forest (RF) (Breiman, 2001), generalized linear model (GLM) (Nelder and Wedderburn, 1972), K-nearest neighbors (KNN) (Cover and Hart, 1967), neural network (NNET) (Rumelhart et al., 1986), and decision tree (DT) (Breiman et al, 2017)—were applied for target prioritization in R. The dependent variable was the binary T2DM status (T2DM vs. non-diabetic control), and the predictors were the expression levels of the overlapping candidate genes in the merged GEO expression matrix.

Samples were randomly split into a training set (70%) and an internal test set (30%), stratified by outcome. All models were trained using repeated 5-fold cross-validation (5 repeats) with hyperparameter tuning via grid search, using the area under the receiver operating characteristic curve (AUC) as the primary optimization metric. Model performance was summarized in both the cross-validation resamples and the test set using AUC, accuracy, sensitivity, specificity, and F1 score. For each algorithm, feature importance was extracted, and genes were ranked within each model. A consensus ranking was then obtained based on selection frequency and average importance ranking across the seven algorithms, and genes that consistently appeared among the top contributors were considered machine-learning- prioritized targets. Details of algorithm settings and tuning grids are provided in Supplementary Material 3.

2.3.4.2 PPI analysis

Protein-protein interaction (PPI) analysis was conducted using the STRING database (version 12.0) with a confidence score >0.7. The resulting network was visualized in Cytoscape 3.10.0. Targets with high network centrality values were regarded as topological hubs.

2.3.4.3 Literature-based and functional evidence

For each target, literature mining was performed using PubMed and GeneCards to evaluate biological relevance to T2DM, insulin resistance, oxidative stress, and inflammation.

Integrating all three evidence layers—machine learning importance, PPI hubness, and literature validation—candidates were assessed, and those meeting the multi-layered criteria were designated as core therapeutic targets for mechanistic interpretation and exploration.

2.3.5 Functional enrichment analysis

GO and KEGG pathway enrichment analyses were conducted via DAVID (version 6.8) (Sherman et al., 2022), and results (p < 0.05) were visualized via Bioinformatics. com.cn.

2.4 Proteomic validation

The proteomic validation was conducted as a substudy within the RCT. Briefly, a subset of 25 participants from the DDG + metformin arm was randomly selected, and fasting serum samples were collected at baseline and at the 6-month visit. An additional group of age- and sex-matched healthy volunteers without diabetes (n = 25) was recruited as non-diabetic controls.

A data-independent acquisition (DIA)-based quantitative proteomic analysis was applied to evaluate the protein-level effects of the intervention (Ludwig et al., 2018). Sample preparation was conducted using the iST sample preparation kit (PreOmics, Germany) (PreOmics GmbH, 2023) according to the manufacturer’s protocol. Briefly, protein samples were lysed, denatured, reduced, alkylated, digested with trypsin, and desalted prior to LC-MS/MS analysis.

Peptides were analyzed using a timsTOF HT mass spectrometer (Meier et al., 2020) coupled with a nanoElute 2 LC system (Bruker Daltonik, Germany). A 20-min gradient was used for chromatographic separation on an AUR3-15075C18 analytical column (15 cm × 75 μm i. d., 1.7 μm particle size). DIA data were acquired in diaPASEF mode (Meier et al., 2018) with 24 * 25 Th isolation windows, covering an m/z range of 400–1000. Collision energy was ramped linearly during ion mobility scanning.

Raw data were processed using Spectronaut 18 (Biognosys, Switzerland) (Bruderer et al., 2015) with the human UniProt reference database. Peptide and protein FDR thresholds were set at 1% (Elias and Gygi, 2007). Quantification was prformed using the MaxLFQ algorithm (Cox et al., 2014). Differentially expressed proteins (DEPs) were defined as those with |fold change| > 1.5 and adjusted p-value <0.05 (Benjamini–Hochberg correction) (Benjamini and Hochberg, 1995). GO and KEGG enrichment analyses were conducted for DEPs.

To align with transcriptomic validation, downstream analyses emphasized the comparison between post-treatment T2DM and healthy controls.

2.5 Tissue-specific expression of core genes

The expression patterns of the identified core targets across human tissues were examined using data from the GTEx (https://www.gtexportal.org) (Consortium, 2020).

3 Results

3.1 Clinical efficacy of DDG in the randomized controlled trial (RCT)

A total of 136 participants were randomized (1:1), and 128 completed the study and were included in the final analysis (Figure 2). Data completeness was high, with missingness generally low (highest 12.1% in the treatment group). Missing values were handled using multiple imputation via predictive mean matching (m = 10). Sensitivity analyses,including complete-case analyses and log-transformed models for skewed variables, showed minimal impact on the key outcomes (Supplementary Figures S1–S3).

Figure 2

CONSORT flow diagram of a randomized, double-blind, placebo-controlled trial. Of 238 assessed individuals, 102 were excluded. A total of 136 participants were randomized 1:1 to receive DDG granules plus metformin (n=68) or placebo granules plus metformin (n=68) for 6 months. Losses to follow-up and discontinuations occurred during follow-up. The final analysis included 63 participants in the DDG group and 65 in the placebo group. The diagram summarizes screening, randomization, follow-up, and analysis populations.

Figure 2. Participant flow through the study according to CONSORT.

The assumption of normality of the baseline was formally tested using the Shapiro-Wilk test, and distributions were also visually inspected with Q-Q plots and histograms (Supplementary Table S1; Supplementary Figures S4-S5). Although some variables were not normally distributed according to the Shapiro-Wilk test, the t-test was considered robust due to the large sample size (N > 60) based on the Central Limit Theorem. Categorical variables were compared using the Chi-squared test. Baseline characteristics were assessed in the 128 participants contributing primary-outcome data. Most demographic and metabolic variable were broadly similar between groups (Table 1), indicating successful randomization. However, notable baseline imbalances were observed in BMI (28.38 vs. 27.29 kg/m², P = 0.047).

Table 1

Table 1. Baseline characteristics of participants.

3.1.1 Primary endpoint

After 6 months, the DDG group showed a significantly greater reduction in adjusted mean HbA1c compared to the control group (between-group difference: −0.32%, P = 0.032; Table 2).

Table 2

Table 2. ANCOVA of end-of-study outcomes in the treatment and control groups.

As an exploratory analysis, we further stratified participants by baseline HbA1c of 7.5%, a threshold commonly used to indicate suboptimal glycemic control in clinical practice. In the subgroup, the benefit was primarily driven by participants with a baseline HbA1c ≥ 7.5% (P = 0.045), whereas no significant effect was observed in the <7.5% stratum (P = 0.441).

3.1.2 Secondary endpoints

Fasting plasma glucose (FPG) showed a borderline reduction favoring the treatment group (P = 0.050; Table 2).

There were no significant overall between-group differences in HOMA-IR, C-peptide, or fasting insulin. In the same exploratory stratification by baseline HbA1c ≥ 7.5% subgroup, fasting insulin levels were significantly higher in the DDG group (P = 0.027), suggesting a potential improvement in β-cell function in patients with poor baseline control. Other metabolic markers, including lipids, BMI, and CRP, did not differ significantly between groups.

Log-transformed sensitivity analyses yielded results consistent with the primary analysis (Supplementary Tables S2, S3), supporting the robustness of the findings.

3.1.3 Safety

No hypoglycemic episodes were observed, and no intervention-related SAEs occurred. Occasional, mild, self-limiting diarrhea was reported in both groups; a contribution from metformin and/or dietary factors cannot be excluded. Routine safety laboratories (hepatic/renal) remained overall stable without clinically meaningful abnormalities.

3.1.4 Baseline-adjusted sensitivity analysis

Given the modest baseline imbalance in BMI between groups, we performed post hoc sensitivity analyses using ANCOVA models further adjusted for baseline BMI. In these BMI-adjusted models, the between-group differences in HbA1c and FPG remained similar in magnitude to the primary analyses (BMI-adjusted LS means 6.58% vs. 6.87% for HbA1c and 5.98 vs. 6.43 mmol/L for FPG in the DDG and control groups, respectively), although the P values became borderline (P = 0.053 and P = 0.052, Supplementary Table S4). No between-group differences were observed for HOMA-IR, lipid parameters, CRP, liver enzymes or creatinine after additional BMI adjustment.

3.2 Prediction of core targets

3.2.1 Identification of potential therapeutic targets

We first retrieved a total of 144 active compounds and 617 corresponding drug targets from the TCMSP database, which includes the seven herbal components of DDG. Concurrently, we collected 11,574 T2DM-related genes from three public disease databases: GeneCards, DisGeNET, and OMIM. Differential gene expression analysis was then performed by integrating the GEO datasets GSE137317, GSE153315, and GSE153792, yielding 12,353 genes. After Batch effects originating, we apply the thresholds of adjusted p-value (padj) < 0.05 and |log2FoldChange| ≥ 0.5, we identified 2,652 differentially expressed genes (DEGs). By intersecting the drug targets, disease-related genes, and DEGs, we obtained 29 overlapping candidate targets, which were considered potential therapeutic targets through which DDG may exert its effects on T2DM (Figure 3A). The full lists of drug targets, disease-related genes, DEGs, and overlapping candidate targets are provided in Supplementary Material 2 (Excel), with each dataset presented in a separate sheet.

Figure 3

Diagram showing three visualizations: A) A Venn diagram with overlapping circles representing three data sets: DDG, Database, and another set, with numbers indicating elements in each section. B) A network diagram with IL1B at the center, connected to nodes like HIF1A and ICAM1. C) A complex network graph with interconnected groups labeled HQ, DS, CZ, RE, GGY, WY, and BZ, linked through various connections.

Figure 3. Analysis of DDG targets in T2DM. (A) Venn diagram of 29 core targets from compound, disease, and DEG overlap. (B) PPI network of candidate targets. (C) Herb–compound–target network.

3.2.2 Construction of protein–protein interaction and herb–compound–target networks

The 29 intersected candidate genes were imported into the STRING database to construct a PPI network, which was subsequently visualized using Cytoscape (Figure 3B). The resulting network revealed significant interactions among the targets, with IL1B, HIF1A, PTPRC, NOS3, AKT2, CD38, NFE2L2 and ICAM1 identified as central hub genes due to their high degree of connectivity. We further constructed a herb–compound–target network based on the seven herbs, 144 active compounds, and the 29 candidate targets (Figure 3C). Among these, scutellaria baicalensis (Huang Qin) and radix salviae (Dan Shen) contained the greatest number of active compounds and regulated the largest number of targets. Notably, dan-shexinkum d (PubChem CID: 124307626) was identified as the most central compound, targeting 21 genes, followed by quercetin (PubChem CID: 5280343) with 17 targets.

3.2.3 Machine-learning–based prioritization and integrative definition of core targets

To further prioritize the 29 overlapping candidate genes, we applied seven supervised machine-learning algorithms (LASSO, SVM-RFE, RF, GLM, KNN, NNET, and DT) using T2DM status (T2DM vs. control) as the dependent variable and the expression levels of the 29 genes as predictors. As detailed in Section 2.3.4, all models were trained with repeated 5-fold cross-validation (5 repeats), and their performance in both the cross-validation resamples and the internal test set was summarized by AUC(Figure 4A), accuracy, sensitivity, specificity, and F1 score (Supplementary Table S1). The ROC and precision–recall curves (Figures 4A,B) showed that all seven models captured discriminative signal from the 29-gene panel, supporting their use as feature-ranking tools. Residual diagnostics (Figure 4C) did not reveal major systematic bias, with residuals approximately centered around zero and comparable dispersion across models.

Figure 4

Panel A shows ROC curves for various models, with AUC values ranging from 0.555 to 0.8. Panel B presents precision-recall curves for the same models. Panel C displays residuals density plots, indicating model performance. Panel D includes variable importance bar charts for k-nearest neighbors, neural net, support vector machine, generalized linear model, decision tree, random forest, and lasso, showing the top contributing variables for each model.

Figure 4. Evaluation of machine learning models for core gene selection. (A) ROC curves of seven algorithms. (B) Precision–recall curves. (C) Residual histograms. (D) Top 10 feature genes ranked by model importance.

We next focused on the feature-importance patterns generated by these models. Across the seven algorithms, several genes—including P2RX7, PTPN1, CD38, KDM3A, CAT, NOS3, MERTK, and AKT2—repeatedly appeared among the top-ranked features, suggesting robust and model-independent contributions to the T2DM–control separation (Figure 4D). When these ML-derived importance rankings were compared with the PPI network structure, we observed considerable overlap: for example, NOS3, AKT2, CD38, and NFE2L2 were not only highly ranked by multiple ML models but also occupied central positions in the PPI network (Figure 3B).

To define a focused set of core therapeutic targets, we integrated three layers of evidence: (1) high and consistent feature importance across multiple ML algorithms, (2) high connectivity (hub-like behavior) in the PPI network, and (3) strong prior experimental or clinical evidence linking each gene to T2DM, metabolic dysregulation, and inflammation/oxidative stress.

Through this integrative filtering, we prioritized a set of key targets, including P2RX7 and IL1B (inflammasome- and NLRP3-related inflammatory axis), PTPN1, AKT2, and CD38 (insulin resistance and metabolic signaling axis), NFE2L2 and NOS3 (oxidative-stress and endothelial-protective axis), and MERTK (inflammation-resolution and plaque-stabilizing axis). These genes were carried forward as core therapeutic targets of DDG in subsequent mechanistic analyses. The main literature supporting the involvement of each target in T2DM is summarized in Supplementary Table S2.

3.3 Multi-omics exploration of putative mechanisms

To explore whether the computationally prioritized targets and pathways are consistent with independent data sources, we conducted a series of exploratory evaluation analyses. These included (1) quantitative proteomic profiling to confirm protein-level alterations, (2) tissue-specific expression profiling to assess biological plausibility, and (3) molecular docking simulations to evaluate the binding potential between DDG-derived compounds and representative targets.

3.3.1 Proteomic profiling in the nested RCT sub-cohort

A total of 920 protein groups were identified across the serum samples. Between the post-treatment T2DM group and healthy controls, 180 proteins were differentially expressed based on the criteria of |fold change| > 1.5 and adjusted p-value <0.05, including 146 upregulated and 34 downregulated proteins. PCA revealed clear clustering between the two groups, indicating a distinct protein expression profile (Figure 5A). The volcano plot further illustrated the distribution of DEPs in terms of log2 fold change and statistical significance (Figure 5B). The complete differential expression matrix for proteomics is provided in Supplementary Material 4 (Excel).

Figure 5

Panel A shows a PCA plot with two groups:

Figure 5. Proteomic and transcriptomic analyses. (A) PCA plot showing separation between post-treatment T2DM samples and healthy controls; the percentages in the axis labels denote the variance explained by each component (effect size). (B) Volcano plot of differentially expressed proteins; the x-axis displays the effect size (log2 fold change), and the y-axis shows statistical significance (–log10 adjusted p value). (C,E) GO enrichment analyses for proteomic DEPs (C) and transcriptomic candidate genes (E); bar length represents the enrichment effect size (enrichment score), with terms grouped by Biological Process (BP), Cellular Component (CC), and Molecular Function (MF). (D,F) KEGG enrichment analyses for proteomic DEPs (D) and transcriptomic candidates (F); bar length indicates the effect size, defined as the number of genes enriched in each pathway (Gene Count).

GO enrichment analysis (Figure 5C) revealed that the DEPs were enriched in biological processes such as oxidative stress response, insulin signaling (e.g., “positive regulation of protein kinase B signaling”), glycolysis/gluconeogenesis, and protein deubiquitination. In terms of cellular components, they were mainly localized to cytoplasmic vesicles, secretory granules, extracellular matrix, and platelet-related structures. Molecular functions were associated with protein binding, oxidoreductase activity, and cytoskeletal structural components.

KEGG pathway enrichment (Figure 5D) indicated that DEPs were primarily involved in metabolic pathways such as carbon metabolism, amino acid biosynthesis, glycolysis/gluconeogenesis. Significant enrichment was also found in pathways related to platelet activation, focal adhesion, pancreatic secretion, and the complement and coagulation cascades. Morever, the “fluid shear stress and atherosclerosis” pathway was highlighted.

3.3.2 Cross-omics pathway concordance between transcriptomic candidates and proteomic DEPs

To assess cross-omics pathway consistency, we compared enrichment profiles of 29 GEO-derived transcriptomic targets and 180 serum DEPs.

GO and KEGG (Figures 5E,F) analyses of the 29 transcriptomic candidate genes highlighted several key biological programs, including inflammatory and immune signaling (“cytokine–cytokine receptor interaction”, “NF-κB signaling”), insulin-related metabolic regulation (“PI3K–Akt signaling pathway”), oxidative stress responses, and cytoskeletal remodeling and adhesion (“focal adhesion”, “regulation of actin cytoskeleton”).

Strikingly, these pathway themes were independently recapitulated in the proteomic DEPs. Proteomic enrichment revealed oxidative stress–related processes, PI3K–Akt signaling, glycolysis/gluconeogenesis, complement and coagulation cascades, platelet activation, and the “fluid shear stress and atherosclerosis” pathway—overlapping with the transcriptomic signatures of vascular regulation and cytoskeletal organization.

Together, this cross-omics concordance indicates that similar inflammatory, metabolic, oxidative stress–related, and vascular/cytoskeletal pathways are enriched at both the transcriptomic and proteomic levels. These observations suggest that the biological processes highlighted in the candidate-target analysis are reproducible across omics layers.

3.3.3 Tissue expression profiles of core target genes

To explore the tissue context and biological plausibility of the eight prioritized targets, we retrieved transcripts-per-million (TPM) expression data for the core genes from the GTEx database.

For P2RX7, IL1B, PTPN1, AKT2, CD38, NFE2L2, NOS3 and MERTK, GTEx profiles showed higher expression in metabolically and vasculature-relevant tissues, including adipose tissue, liver, skeletal muscle, arterial segments and whole blood (Figures 6A-G). These patterns indicate that the multi-omics–derived targets are preferentially expressed in organs and cell types implicated in insulin resistance, vascular dysfunction and chronic inflammation, supporting their biological relevance to T2DM.

Figure 6

Violin plots labeled A to H show bulk tissue gene expression for different genes, including PMCH, LPL, PTGDS, AVP, CD209, WFDC2, NOS3, and AZGP1 across several organs like brain, testis, and liver. TPM values are on the y-axis, indicating gene expression levels, with varying widths reflecting data distribution.

Figure 6. Tissue-specific expression profiles of the core target genes.(A) P2RX7; (B) IL1B; (C) PTPN1; (D) AKT2; (E) CD38; (F) NFE2L2; (G) NOS3; (H) MERTK.

4 Discussion

4.1 Clinical efficacy of DDG

Our trial demonstrates that DDG provides a modest but statistically significant reduction in HbA1c, particularly for patients with poorer glycemic control who are most in need of adjunctive therapies. The observed increase in fasting insulin within this subgroup, despite no overall change in HOMA-IR, could be consistent with enhanced insulin secretion, but in the absence of stimulated insulin or C-peptide measurements and given the unchanged HOMA-IR, this should not be interpreted as direct evidence of β-cell functional improvement. Nonetheless, as subgroup analyses were exploratory and the overall insulin comparison was null, this finding should be interpreted cautiously and viewed as hypothesis-generating.

Other metabolic markers, including C-peptide, lipids, and CRP, showed no significant between-group differences over 6 months. Given that CRP represents only a crude measure of systemic inflammation, these findings do not exclude more subtle anti-inflammatory or antioxidant effects, which are warrant further mechanistic investigation.

4.2 Exploratory, multi-omics interpretation of the core DDG targets

4.2.1 Hypothesis-generating mechanistic axes suggested by the core targets

4.2.1.1 P2RX7–IL1B: inflammasome-related inflammatory axis

P2RX7 and IL1B were jointly prioritized by the integrative filtering and cluster in a classical NLRP3–inflammasome pathway (Meier et al., 2025; Kong et al., 2022). Prior experimental and clinical work has already linked P2X7 signaling and IL-1β production to low-grade inflammation, β-cell stress and vascular injury in T2DM (Wang et al., 2020; Sun et al., 2012), so their emergence here is consistent with a contributory inflammatory background on which DDG was tested, and further targeted experiments are needed to determine whether the formula additionally exerts a specific anti-inflammasome action.

4.2.1.2 PTPN1–AKT2–CD38: insulin signaling and glucose-metabolic axis

PTPN1, AKT2 and CD38 are all involved in insulin receptor signaling, downstream Akt activation, and β-cell/NAD⁺-dependent metabolic regulation (Florez et al., 2005; Escande et al., 2013; Cho et al., 2001). These genes have been implicated in insulin resistance and glycemic control in previous studies, and their joint prioritization in insulin-responsive tissues provides a plausible, though still indirect, link between the modest HbA1c improvement observed in our trial and potential effects of DDG on insulin-related pathways.

4.2.1.3 NFE2L2–NOS3: oxidative-stress and endothelial axis

NFE2L2 (Nrf2) coordinates antioxidant responses (Wu et al., 2020; David et al., 2017), whereas NOS3 (eNOS) is a key determinant of endothelial nitric oxide bioavailability and vascular homeostasis (Janaszak-Jasiecka et al., 2023). Both have been associated with oxidative stress, endothelial dysfunction and diabetic vascular complications, so their appearance among the core targets fits with a working hypothesis that DDG may influence redox and endothelial biology, in line with—but not conclusively demonstrated by—the oxidative/vascular pathways enriched in our omics analyses.

4.2.1.4 MERTK: inflammation-resolution and plaque-stabilizing axis

MERTK has been linked to more stable atherosclerotic plaque phenotypes (Cai et al., 2017). Its selection as a core target, together with the enrichment of extracellular matrix, complement and coagulation-related pathways, suggests a possible connection between DDG and vascular remodeling or thrombo-inflammatory processes, which remains to be tested in studies with appropriate vascular endpoints.

Taken together, these four axes provide a conceptual organization of the core targets that is compatible with existing literature on T2DM pathophysiology, but they should be regarded as hypothesis-generating rather than mechanistically definitive.

4.2.2 Consistency between transcriptomic targets and serum proteomics

The pathway enrichment results from serum proteomics showed a similar pattern to the biological themes suggested by the transcriptomic candidate targets. Although the core genes were not always directly detected at the protein level, the proteomic DEPs were enriched for immune/inflammatory processes, metabolism-related pathways and vascular/ECM remodeling, which correspond to the inflammatory (P2RX7–IL1B), insulin/metabolic (PTPN1–AKT2–CD38), oxidative/endothelial (NFE2L2–NOS3) and pro-resolving (MERTK) axes described above. We therefore regard this agreement at the pathway level as supportive that the prioritized targets reflect biologically relevant programs in T2DM, while acknowledging that the present data cannot link individual genes to specific protein changes or separate direct from downstream effects of DDG.

4.2.3 Tissue expression context from GTEx and remaining uncertainties

GTEx analysis places the core targets in a tissue context that is compatible with their putative roles in T2DM: several genes show appreciable expression in liver, adipose tissue, skeletal muscle and pancreas, as well as in vascular and immune-rich tissues, in line with involvement in insulin resistance, low-grade inflammation and vascular dysfunction. However, GTEx is based on non-diabetic donors and bulk tissues, and thus cannot capture disease-specific regulation, cellular heterogeneity or any pharmacological modulation by DDG. Accordingly, the GTEx findings mainly provide biological plausibility and anatomical context for the highlighted axes, and future work using patient-derived, single-cell or spatial datasets will be needed to refine these hypotheses.

4.3 Study limitations and future directions

Several limitations should be acknowledged. Clinically, the modest sample size and trial duration limit conclusions regarding long-term durability and diabetic complications. Although sensitivity analysis adjusting for baseline BMI showed consistent effect sizes for glycemic improvement, the observed imbalance in baseline BMI between groups may have contributed to the strength of the statistical significance in the primary analysis. Future studies should ensure balanced randomization or consider stratification by BMI to confirm these findings. Furthermore, the absence of specific biomarkers for oxidative or inflammatory pathways in the RCT prevents direct confirmation of the proposed axes. Mechanistically, our findings rely on integrative bioinformatics; while these rigorous computational approaches prioritize targets, they remain hypothesis-generating and lack direct experimental validation.

Future research should focus on: (1) mechanistic validation through targeted gene knockout/overexpression studies; (2) identification and isolation of specific active compounds responsible for observed effects; (3) larger-scale clinical trials with extended follow-up; and (4) investigation of optimal dosing regimens and patient selection criteria.

5 Conclusion

In this randomized trial, adding DDG to metformin yielded a modest improvement in HbA1c, particularly in patients with higher baseline levels, without inducing broader short-term metabolic remodeling. While the clinical magnitude was limited, these findings provided a crucial phenotypic anchor for our multi-omics analysis, which mapped the observed effects to a prioritized network of eight core targets (e.g., P2RX7, AKT2, NFE2L2) spanning inflammatory, metabolic, and vascular axes. Collectively, this study offers preliminary support for DDG as an adjunctive option and, more importantly, validates a strategy for decoding complex herbal interventions by bridging empirical clinical signals with data-driven mechanistic hypotheses.

Data availability statement

The proteomics data has been deposited in the iProX repository, accession number PXD069763 (Project ID IPX0013864002). The processed datasets supporting the findings of this study are available in the Supplementary Materials. Further requests can be directed to the corresponding authors.

Ethics statement

The studies involving humans were approved by Ethics Committee of Longhua Hospital (2020LHSB026). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

ZZ: Conceptualization, Writing – review and editing, Formal Analysis, Software, Writing – original draft. KZ: Investigation, Writing – review and editing, Validation, Visualization. XM: Writing – review and editing, Software, Methodology. YQ: Writing – review and editing, Validation, Data curation, Supervision. MW: Methodology, Supervision, Writing – review and editing, Funding acquisition, Project administration.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This study is supported by the Longhua Hospital Affiliated to Shanghai University of Traditional Chinese Medicine and is funded by the Clinical Research Plan of Shanghai Hospital Development Center (No. SHDC2020CR3028A). The funding source had no role in the design of this study and will not have any role during its execution, analyses, interpretation of the data, or decision to submit results.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was used in the creation of this manuscript. Generative AI was used to refine the language and improve the clarity of the manuscript. Specifically, Chat-gpt was employed to assist with grammar, spelling, and stylistic suggestions. The authors have reviewed and take full responsibility for all content, ensuring that the AI’s suggestions were appropriately integrated and that the final manuscript accurately represents our research and findings.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphar.2025.1723584/full#supplementary-material

References

Amberger, J. S., Bocchini, C. A., Scott, A. F., and Hamosh, A. (2019). OMIM.org: leveraging knowledge across phenotype-gene relationships. Nucleic Acids Res. 47 (D1), D1038–D1043. doi:10.1093/nar/gky1151

PubMed Abstract | CrossRef Full Text | Google Scholar

Barrett, T., Wilhite, S. E., Ledoux, P., Evangelista, C., Kim, I. F., Tomashevsky, M., et al. (2013). NCBI GEO: archive for functional genomics data sets--update. Nucleic Acids Res. 41 (Database issue), D991–D995. doi:10.1093/nar/gks1193

PubMed Abstract | CrossRef Full Text | Google Scholar

Benjamini, Y., and Hochberg, Y. (1995). Controlling the false discovery rate: a practical and powerful approach to multiple testing. 57, 289–300. doi:10.1111/j.2517-6161.1995.tb02031.x

CrossRef Full Text | Google Scholar

Breiman, L. (2001). Random forests. Mach. Learn. 45 (1), 5–32. doi:10.1023/a:1010933404324

CrossRef Full Text | Google Scholar

Breiman, L., Friedman, J., Olshen, R. A., and Stone, C. J. (2017). Classification and regression trees. New York: Chapman and Hall/CRC, 368.

Google Scholar

Bruderer, R., Bernhardt, O. M., Gandhi, T., Miladinović, S. M., Cheng, L. Y., Messner, S., et al. (2015). Extending the limits of quantitative proteome profiling with data-independent acquisition and application to acetaminophen-treated three-dimensional liver microtissues. Mol. Cell Proteomics 14 (5), 1400–1410. doi:10.1074/mcp.M114.044305

PubMed Abstract | CrossRef Full Text | Google Scholar

Cai, B., Thorp, E. B., Doran, A. C., Sansbury, B. E., Daemen, MJAP, Dorweiler, B., et al. (2017). MerTK receptor cleavage promotes plaque necrosis and defective resolution in atherosclerosis. J. Clin. Invest. 127 (2), 564–568. doi:10.1172/JCI90520

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, C., Gao, H., Wei, Y., and Wang, Y. (2025). Traditional Chinese medicine in the prevention of diabetes mellitus and cardiovascular complications: mechanisms and therapeutic approaches. Front. Pharmacol. 16, 1511701. doi:10.3389/fphar.2025.1511701

PubMed Abstract | CrossRef Full Text | Google Scholar

Cho, H., Mu, J., Kim, J. K., Thorvaldsen, J. L., Chu, Q., Crenshaw, E. B., et al. (2001). Insulin resistance and a diabetes mellitus-like syndrome in mice lacking the protein kinase Akt2 (PKBβ). Science. 292 (5522), 1728–1731. doi:10.1126/science.292.5522.1728

PubMed Abstract | CrossRef Full Text | Google Scholar

Consortium, G. T.Ex (2020). The GTEx consortium atlas of genetic regulatory effects across human tissues. Science. 369 (6509), 1318–1330. doi:10.1126/science.aaz1776

PubMed Abstract | CrossRef Full Text | Google Scholar

Cover, T., and Hart, P. (1967). Nearest neighbor pattern classification, in IEEE Journals and Magazine. IEEE Xplore. Available online at: https://ieeexplore.ieee.org/document/1053964.

Google Scholar

Cox, J., Hein, M. Y., Luber, C. A., Paron, I., Nagaraj, N., and Mann, M. (2014). Accurate proteome-wide label-free quantification by delayed normalization and maximal peptide ratio extraction, termed MaxLFQ. Mol. Cell Proteomics 13 (9), 2513–2526. doi:10.1074/mcp.M113.031591

PubMed Abstract | CrossRef Full Text | Google Scholar

David, J. A., Rifkin, W. J., Rabbani, P. S., and Ceradini, D. J. (2017). The Nrf2/Keap1/ARE pathway and oxidative stress as a therapeutic target in type II diabetes mellitus. J. Diabetes Res. 2017 (1), 4826724. doi:10.1155/2017/4826724

PubMed Abstract | CrossRef Full Text | Google Scholar

Davies, M. J., Aroda, V. R., Collins, B. S., Gabbay, R. A., Green, J., Maruthur, N. M., et al. (2022). Management of hyperglycemia in type 2 diabetes, 2022. A consensus report by the American diabetes association (ADA) and the European association for the study of diabetes (EASD). Diabetes Care 45 (11), 2753–2786. doi:10.2337/dci22-0034

PubMed Abstract | CrossRef Full Text | Google Scholar

Elias, J. E., and Gygi, S. P. (2007). Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry. Nat. Methods 4 (3), 207–214. doi:10.1038/nmeth1019

PubMed Abstract | CrossRef Full Text | Google Scholar

ElSayed, N. A., Aleppo, G., Aroda, V. R., Bannuru, R. R., Brown, F. M., Bruemmer, D., et al. (2023). 2. Classification and diagnosis of diabetes: standards of care in Diabetes-2023. Diabetes Care 46 (Suppl. 1), S19–S40. doi:10.2337/dc23-S002

PubMed Abstract | CrossRef Full Text | Google Scholar

Escande, C., Nin, V., Price, N. L., Capellini, V., Gomes, A. P., Barbosa, M. T., et al. (2013). Flavonoid apigenin is an inhibitor of the NAD+ase CD38: implications for cellular NAD+ metabolism, protein acetylation, and treatment of metabolic syndrome. Diabetes 62 (4), 1084–1093. doi:10.2337/db12-1139

PubMed Abstract | CrossRef Full Text | Google Scholar

Florez, J. C., Agapakis, C. M., Burtt, N. P., Sun, M., Almgren, P., Råstam, L., et al. (2005). Association testing of the protein tyrosine phosphatase 1B gene (PTPN1) with type 2 diabetes in 7,883 people. Diabetes 54 (6), 1884–1891. doi:10.2337/diabetes.54.6.1884

PubMed Abstract | CrossRef Full Text | Google Scholar

Gfeller, D., Grosdidier, A., Wirth, M., Daina, A., Michielin, O., and Zoete, V. (2014). SwissTargetPrediction: a web server for target prediction of bioactive small molecules. Nucleic Acids Res. 42 (Web Server issue), W32–W38. doi:10.1093/nar/gku293

PubMed Abstract | CrossRef Full Text | Google Scholar

Guyon, I., Weston, J., Barnhill, S., and Vapnik, V. (2002). Gene selection for cancer classification using support vector machines. Mach. Learn. 46 (1), 389–422. doi:10.1023/a:1012487302797

CrossRef Full Text | Google Scholar

Sun, H., Saeedi, P., Karuranga, S., Pinkepank, M., Ogurtsova, K., Duncan, B. B., Stein, C., et al. (2022). IDF diabetes atlas: global, regional and country-level diabetes prevalence estimates for 2021 and projections for 2045. Diabetes Res. Clinical Practice 183, 109119. Available online at: https://pubmed.ncbi.nlm.nih.gov/34879977/.

PubMed Abstract | CrossRef Full Text | Google Scholar

Heyward, J., Christopher, J., Sarkar, S., Shin, J. I., Kalyani, R. R., and Alexander, G. C. (2021). Ambulatory noninsulin treatment of type 2 diabetes mellitus in the United States, 2015 to 2019. Diabetes Obes. Metab. 23 (8), 1843–1850. doi:10.1111/dom.14408

PubMed Abstract | CrossRef Full Text | Google Scholar

Ru, J., Li, P., Wang, J., Zhou, W., Li, B., Huang, C, Li, P., et al. (2014). TCMSP: a database of systems pharmacology for drug discovery from herbal medicines. J. Cheminformatics 6, 13. Available online at: https://pubmed.ncbi.nlm.nih.gov/24735618/.

PubMed Abstract | CrossRef Full Text | Google Scholar

Janaszak-Jasiecka, A., Płoska, A., Wierońska, J. M., Dobrucki, L. W., and Kalinowski, L. (2023). Endothelial dysfunction due to eNOS uncoupling: molecular mechanisms as potential therapeutic targets. Cell Mol. Biol. Lett. 28 (1), 21. doi:10.1186/s11658-023-00423-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Kong, H., Zhao, H., Chen, T., Song, Y., and Cui, Y. (2022). Targeted P2X7/NLRP3 signaling pathway against inflammation, apoptosis, and pyroptosis of retinal endothelial cells in diabetic retinopathy. Cell Death Dis. 13 (4), 336. doi:10.1038/s41419-022-04786-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, J., Yao, C., Wang, Y., Zhao, J., and Luo, H. (2023). Non-drug interventions of traditional Chinese medicine in preventing type 2 diabetes: a review. Chin. Med. 18 (1), 151. doi:10.1186/s13020-023-00854-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Love, M. I., Huber, W., and Anders, S. (2014). Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15 (12), 550. doi:10.1186/s13059-014-0550-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Ludwig, C., Gillet, L., Rosenberger, G., Amon, S., Collins, B. C., and Aebersold, R. (2018). Data-independent acquisition-based SWATH-MS for quantitative proteomics: a tutorial. Mol. Syst. Biol. 14 (8), e8126. doi:10.15252/msb.20178126

PubMed Abstract | CrossRef Full Text | Google Scholar

Meier, F., Brunner, A. D., Koch, S., Koch, H., Lubeck, M., Krause, M., et al. (2018). Online parallel accumulation-serial fragmentation (PASEF) with a novel trapped ion mobility mass spectrometer. Mol. Cell Proteomics 17 (12), 2534–2545. doi:10.1074/mcp.TIR118.000900

PubMed Abstract | CrossRef Full Text | Google Scholar

Meier, F., Brunner, A. D., Frank, M., Ha, A., Bludau, I., Voytik, E., et al. (2020). diaPASEF: parallel accumulation-serial fragmentation combined with data-independent acquisition. Nat. Methods 17 (12), 1229–1236. doi:10.1038/s41592-020-00998-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Meier, D. T., De Paula Souza, J., and Donath, M. Y. (2025). Targeting the NLRP3 inflammasome–IL-1β pathway in type 2 diabetes and obesity. Diabetologia 68 (1), 3–16. doi:10.1007/s00125-024-06306-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Meng, X., Liu, X., Tan, J., Sheng, Q., Zhang, D., Li, B., et al. (2023). From Xiaoke to diabetes mellitus: a review of the research progress in traditional Chinese medicine for diabetes mellitus treatment. Chin. Med. 18 (1), 75. doi:10.1186/s13020-023-00783-z

PubMed Abstract | CrossRef Full Text | Google Scholar

NCD Risk Factor Collaboration (NCD-RisC) (2024). Worldwide trends in diabetes prevalence and treatment from 1990 to 2022: a pooled analysis of 1108 population-representative studies with 141 million participants. Lancet. 404 (10467), 2077–2093. doi:10.1016/S0140-6736(24)02317-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Nelder, J. A., and Wedderburn, R. W. M. (1972). Generalized linear models. J. R. Stat. Soc. Ser. A 135, 370. doi:10.2307/2344614

CrossRef Full Text | Google Scholar

Ni, Y., Wu, X., Yao, W., Zhang, Y., Chen, J., and Ding, X. (2024). Evidence of traditional Chinese medicine for treating type 2 diabetes mellitus: from molecular mechanisms to clinical efficacy. Pharm. Biol. 62 (1), 592–606. doi:10.1080/13880209.2024.2374794

PubMed Abstract | CrossRef Full Text | Google Scholar

Piñero, J., Ramírez-Anguita, J. M., Saüch-Pitarch, J., Ronzano, F., Centeno, E., Sanz, F., et al. (2020). The DisGeNET knowledge platform for disease genomics: 2019 update. Nucleic Acids Res. 48 (D1), D845–D855. doi:10.1093/nar/gkz1021

PubMed Abstract | CrossRef Full Text | Google Scholar

PreOmics GmbH (2023). iST sample preparation kit user manual. Germany: Martinsried.

Google Scholar

Regression Shrinkage and Selection Via the Lasso. (1996). Journal of the royal statistical society series B: statistical methodology. Oxf. Acad. doi:10.1111/J.2517-6161.1996.TB02080.X

CrossRef Full Text | Google Scholar

Rumelhart, D. E., Hinton, G. E., and Williams, R. J. (1986). Learning representations by back-propagating errors. Nature 323 (6088), 533–536. doi:10.1038/323533a0

CrossRef Full Text | Google Scholar

Schurch, N. J., Schofield, P., Gierliński, M., Cole, C., Sherstnev, A., Singh, V., et al. (2016). How many biological replicates are needed in an RNA-seq experiment and which differential expression tool should you use? RNA 22 (6), 839–851. doi:10.1261/rna.053959.115

PubMed Abstract | CrossRef Full Text | Google Scholar

Sherman, B. T., Hao, M., Qiu, J., Jiao, X., Baseler, M. W., Lane, H. C., et al. (2022). DAVID: a web server for functional enrichment analysis and functional annotation of gene lists (2021 update). Nucleic Acids Res. 50 (W1), W216–W221. doi:10.1093/nar/gkac194

PubMed Abstract | CrossRef Full Text | Google Scholar

Stelzer, G., Rosen, N., Plaschkes, I., Zimmerman, S., Twik, M., Fishilevich, S., et al. (2016). The GeneCards suite: from gene data mining to disease genome sequence analyses. Curr. Protoc. Bioinforma. 54 (1.30.1-1.30.33), 1. doi:10.1002/cpbi.5

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, S., Xia, S., Ji, Y., Kersten, S., and Qi, L. (2012). The ATP-P2X7 signaling axis is dispensable for obesity-associated inflammasome activation in adipose tissue. Diabetes 61 (6), 1471–1478. doi:10.2337/db11-1389

PubMed Abstract | CrossRef Full Text | Google Scholar

Team R. R (2014). A language and environment for statistical computing. MSOR Connections. Available online at: https://www.R-project.org/.

Google Scholar

UniProt Consortium (2021). UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res. 49 (D1), D480–D489. doi:10.1093/nar/gkaa1100

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, D., Wang, H., Gao, H., Zhang, H., Zhang, H., Wang, Q., et al. (2020). P2X7 receptor mediates NLRP3 inflammasome activation in depression and diabetes. Cell Biosci. 10 (1), 28. doi:10.1186/s13578-020-00388-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, J., Sun, X., Jiang, Z., Jiang, J., Xu, L., Tian, A., et al. (2020). Protective role of NRF2 in macrovascular complications of diabetes. J. Cell. Mol. Med. 24 (16), 8903–8917. doi:10.1111/jcmm.15583

PubMed Abstract | CrossRef Full Text | Google Scholar

Xu, X., Zhang, W., Huang, C., Li, Y., Yu, H., Wang, Y., et al. (2012). A novel chemometric method for the prediction of human oral bioavailability. Int. J. Mol. Sci. 13 (6), 6964–6982. doi:10.3390/ijms13066964

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhai, Y., Liu, L., Zhang, F., Chen, X., Wang, H., Zhou, J., et al. (2025). Network pharmacology: a crucial approach in traditional Chinese medicine research. Chin. Med. 20 (1), 8. doi:10.1186/s13020-024-01056-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Q., Hu, S., Jin, Z., Wang, S., Zhang, B., and Zhao, L. (2024a). Mechanism of traditional Chinese medicine in elderly diabetes mellitus and a systematic review of its clinical application. Front. Pharmacol. 15, 1339148. doi:10.3389/fphar.2024.1339148

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, B., Zhou, L., Chen, K., Fang, X., Li, Q., Gao, Z., et al. (2024b). Investigation on phenomics of traditional Chinese medicine from the diabetes. Phenomics 4 (3), 257–268. doi:10.1007/s43657-023-00146-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Glossary

AKT2 AKT Serine/Threonine Kinase 2

ALT Alanine aminotransferase

ANCOVA Analysis of covariance

AST Aspartate aminotransferase

ATP Adenosine triphosphate

AUC Area under the curve

BMI Body mass index

CD38 Cluster of Differentiation 38

CRP C-reactive protein

DDG Daixie Decoction granules

DEGs Differentially expressed genes

DEPs Differentially expressed proteins

DIA Data-independent acquisition

DisGeNET Database of gene-disease associations

DL Drug-likeness

DT Decision tree

FDR False discovery rate

FPG Fasting plasma glucose

GEO Gene Expression Omnibus

GLM Generalized linear model

GO Gene Ontology

GTEx Genotype-Tissue Expression

HbA1c Glycated hemoglobin

HDL-C High-density lipoprotein cholesterol

HOMA-IR Homeostatic Model Assessment for Insulin Resistance

IL1B Interleukin 1 Beta

KEGG Kyoto Encyclopedia of Genes and Genomes

KNN K-nearest neighbors

LASSO Least absolute shrinkage and selection operator

LC-MS/MS Liquid chromatography-tandem mass spectrometry

LDL-C Low-density lipoprotein cholesterol

MERTK MER Tyrosine Kinase

NLRP3 NLR family pyrin domain containing 3

NNET Neural network

NFE2L2 Nuclear factor erythroid 2-related factor 2

NOS3 Nitric Oxide Synthase 3 (eNOS)

NYHA New York Heart Association

OB Oral bioavailability

OMIM Online Mendelian Inheritance in Man

P2RX7 P2X purinoceptor 7

PCA Principal component analysis

PMM Predictive mean matching

PTPN1 Protein Tyrosine Phosphatase Non-Receptor Type 1

PPI Protein-protein interaction

RCT Randomized controlled trial

RF Random forest

ROC Receiver operating characteristic

SVM-RFE Support vector machine-recursive feature elimination

T2DM Type 2 diabetes mellitus

TC Total cholesterol

TCM Traditional Chinese Medicine

TCMSP Traditional Chinese Medicine Systems Pharmacology Database

TG Triglycerides

TPM Transcript per million

Keywords: machine learning, multi-omics integration, network pharmacology, proteomics, traditional chinese medicine, type 2 diabetes mellitus

Citation: Zheng Z, Zhang K, Ma X, Qian Y and Wang M (2026) Randomized trial and multi-omics, machine learning–based mechanistic exploration of daixie decoction granules in type 2 diabetes. Front. Pharmacol. 16:1723584. doi: 10.3389/fphar.2025.1723584

Received: 12 October 2025; Accepted: 10 December 2025;
Published: 05 January 2026.

Edited by:

Vítor Samuel Fernandes, Complutense University of Madrid, Spain

Reviewed by:

Yong Yang, The Affiliated Hospital of Changchun University of Traditional Chinese Medicine, China
Saitharani Arumugam, Chettinad Hospital and Research Institute, India

Copyright © 2026 Zheng, Zhang, Ma, Qian and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yuezhou Qian, bG9uZ2h1YXF5ekAxNjMuY29t; Miao Wang, d2FuZ21pYW9fMTI2QDEyNi5jb20=

†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.