Your new experience awaits. Try the new design now and help us make it even better

SYSTEMATIC REVIEW article

Front. Pharmacol., 10 December 2025

Sec. Ethnopharmacology

Volume 16 - 2025 | https://doi.org/10.3389/fphar.2025.1675793

This article is part of the Research TopicHerbal Medicine for the Treatment of Chronic Metabolic Diseases, Volume IIView all 54 articles

Traditional Chinese medicine for non-alcoholic fatty liver disease: an overview of systematic reviews with evidence mapping and metabolic outcome assessment

Juanjuan Li&#x;Juanjuan Li1Ruimin Jiao&#x;Ruimin Jiao1Wenquan Su&#x;Wenquan Su1Wei ChenWei Chen2Xiangyu HuXiangyu Hu3Shuai XuShuai Xu1Lie XuLie Xu1Weiwei YaoWeiwei Yao1Kejia LiuKejia Liu1Hong You,
Hong You4,5*Jingjie Zhao,
Jingjie Zhao1,5*
  • 1Department of Traditional Chinese Medicine, Beijing Friendship Hospital, Capital Medical University, Beijing, China
  • 2Department of Acupuncture, Beijing Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
  • 3Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
  • 4Liver Research Center, National Clinical Research Center of Digestive Diseases, Beijing Friendship Hospital, Capital Medical University, Beijing, China
  • 5Clinical Center for Metabolic Associated Fatty Liver Disease, Capital Medical University, Beijing, China

The paradigm shift from non-alcoholic fatty liver disease (NAFLD) to metabolic dysfunction-associated fatty liver disease (MAFLD) emphasizes metabolic pathogenesis, yet the efficacy of Traditional Chinese Medicine (TCM) under this framework remains unevaluated. Prior reviews focused on NAFLD with outdated data (<2020), lacking clinical translation tools and methodological standards for TCM systematic reviews and meta-analyses (SRs/MAs). This overview integrates NAFLD criteria, visualizes TCM efficacy via evidence mapping, proposes a methodological framework to standardize TCM SRs/MAs, and focuses on evaluating metabolism-related indicators. Nine databases were searched (from database inception to December 30, 2024) for TCM SRs/MAs in NAFLD. Methodological quality was assessed via AMSTAR-2, PRISMA/PRISMA-CHM, and GRADE. Evidence mapping visualized outcomes (liver enzymes, metabolism) to identify clinical priorities. Standardized reporting guidelines for TCM preparations were adhered to, and a ConPhYMP tool assessment evaluated botanical drugs composition and processing disclosure. Thirty-seven SRs/MAs (35 low/critically low quality) reported trends of reduced ALT (−8.2 U/L, 95% CI: −10.1 to −6.3), improved metabolic parameters (e.g., TG: −0.5 mmol/L), and enhanced B-ultrasound resolution (RR: 1.62), though these findings are limited by methodological flaws and low-quality evidence. Evidence mapping highlighted Xiaoyao Powder and Danning Tablet as top-performing formulas. A methodological framework addressing TCM heterogeneity (formula standardization, dosage) and reporting biases was proposed. This is the first overview integrating NAFLD criteria to visualize TCM-related evidence, offering preliminary observations on potential associations between TCM interventions and metabolic outcomes in NAFLD (interpreted with caution due to low evidence quality). The evidence map and methodological guidelines provide a foundation for standardized future TCM research, while clinical translation of current findings is not recommended due to insufficient high-quality evidence.

1 Introduction

Non-alcoholic fatty liver disease (NAFLD) is presently recognized as the most prevalent chronic liver disease in the world (Gofton et al., 2023; Han et al., 2023). Recent academic discourse has led to the proposal of a nomenclature update from NAFLD to metabolic dysfunction–associated fatty liver disease (MAFLD) or Metabolic dysfunction-associated steatotic liver disease (MASLD) (Hsu and Loomba, 2024). In China, experts contend that there is a significant overlap in the essence of MAFLD and MASLD (Fan et al., 2024), and that the two terms can be adopted concurrently. This study adopts the term NAFLD to align with primary literature, while acknowledging its relevance to MAFLD/MASLD metabolic mechanisms. Beyond metabolic dysfunction, MAFLD introduces more explicit diagnostic criteria, integrating metabolic syndrome features (e.g., obesity, insulin resistance, dyslipidemia) to enable precise patient stratification. Furthermore, to maintain consistency with the original literature, this paper continues to use the NAFLD nomenclature and focuses on the analysis of metabolism-related indicators.

Consequently, accumulated evidence from NAFLD-related research remains clinically relevant and applicable for ongoing scientific investigations under the new nomenclature. NAFLD affects approximately 30% of adults globally, with metabolic dysfunction as a key driver (Younossi et al., 2023). Current therapeutic approaches for the disease emphasize increased physical activity (Smart et al., 2018), dietary modification (Tsompanaki et al., 2023), the use of liver-protective (Eslam et al., 2020) and lipid-lowering (Rinella et al., 2023) pharmacological agents, and bariatric surgery (Rinella et al., 2023; Fernando and Pedro, 2021). Uncertainty about the clinical efficacy of these approaches and the potential for adverse event occurrence limits their clinical application (Rinella et al., 2023; Cusi et al., 2022). Persistent uncertainty regarding the clinical efficacy of current NAFLD/MAFLD/MASLD therapies is partly attributable to the condition’s profound heterogeneity in progression. The disease exhibits a broad spectrum of clinical outcomes, ranging from indolent, asymptomatic disease to the development of nonalcoholic steatohepatitis (NASH), fibrosis, and cirrhosis. A major limitation of conventional pharmacotherapy is its inconsistent ability to halt disease progression in advanced stages, yielding dichotomous responses among different patient subgroups (Kumar et al., 2021). This variability underscores the rationale for exploring alternative modalities, including TCM, which potentially targets unique metabolic and hepatic pathways. A critical imperative exists to evaluate such complementary and alternative medical approaches within the NAFLD management paradigm.

TCM has demonstrated significant efficacy in the treatment of NAFLD (Chen et al., 2021; Yang et al., 2019), primarily via the enhancement of liver function, glucose regulation, lipid metabolism and the promotion of weight loss (Kim et al., 2024; Peng et al., 2022; Zhang L. et al., 2022; Zhang Y. F. et al., 2022). Numerous SRs/MAs have indicated that TCM confers certain benefits in the treatment of the disease. One similiar overview of SRs/MAs (Dai L. et al., 2021) synthesized TCM efficacy for NAFLD but reported results: for ultrasound-visualized improvements, 4/7 included SRs showed significant benefits (e.g., liver/spleen CT ratio reduction, RR = 1.28, 95% CI: 1.05–1.56), while 3/7 found no significant differences (RR = 1.06, 95% CI: 0.90–1.25); for ALT normalization, effect sizes were small and heterogeneous (pooled MD = −3.2 U/L, 95% CI: −6.5 to 0.1; I2 = 68%), with 5/8 SRs/MAs showing non-significant trends. These inconsistencies-attributed to variable TCM formulations (e.g., Xiaoyao Powder vs. Danning Tablet) and small sample sizes (median n = 286 per SR)-highlight the need for updated synthesis incorporating 2020–2024 data to resolve discrepancies. To enhance the reliability and validity of findings, we updated the data and comprehensively evaluated the scientific quality of SRs/MAs of TCM for NAFLD. We identified methodological limitations of existing SRs to guide future research in this area. The results can inform clinicians, patients with NAFLD, and policymakers.

2 Methods

2.1 Protocol and registration

The protocol for this overview was registered in the PROSPERO database (CRD42024554175) on 10 June 2024.

2.2 Search strategy

Nine electronic databases (PubMed, The Cochrane Library, Web of Science, the China National Knowledge Infrastructure, the China Biology Medicine disc, Wanfang Data, the VIP Database for Chinese Technical Periodicals, the Excerpta Medica Database, and the Scopus abstract database) were searched for relevant SRs/MAs published in English or Chinese from database inception to December 30, 2024, to ensure inclusion of the most recent evidence. Clinical trial registries (the World Health Organization’s International Clinical Trials Registry Platform, Clinical Trials, and the Chinese Clinical Trial Registry) and the Allied and Complementary Medicine Database were also searched. The search strategy involved the integrated use of subject headings with free text terms. The key terms included “nonalcoholic fatty liver disease,” “Chinese medicine,” and “systematic review.” The detailed search strategy used for each database is shown in Supplementary Table S1.

2.3 Inclusion and exclusion criteria

SRs/MAs covering the use of TCM to treat NAFLD that were published in peer-reviewed journals were eligible for study inclusion. Eligible studies involved participants of any sex, age, and race who were diagnosed with NAFLD (Eslam et al., 2020), treatment with TCM alone or in combination with Biomedicine (Biomed), and comparison with Biomed treatment or placebo. TCM interventions, defined as standardized TCM formulas complying with the Chinese Pharmacopoeia (excluding single-botanical drug extracts and injectables. Non-pharmacological TCM interventions (e.g., acupuncture) were excluded to focus on botanical drug efficacy. These modalities differ in mechanism (physical stimulation vs. pharmacological effects) and standardization (practitioner-dependent vs. formula-based), preventing conflation of results and ensuring conclusions specifically reflect botanical drug efficacy. Eligible studies must report at least one of these outcomes, including clinical total effective rate, laboratory test parameters related to liver function [alanine aminotransferase (ALT)/aspartate aminotransferase (AST) for hepatocellular injury, gamma-glutamyl transferase (GGT) for biliary dysfunction, total bilirubin (TBil) for cholestasis], laboratory test parameters related to glucose and lipid metabolism [total cholesterol (TC)/triglyceride (TG) for dyslipidemia, fasting blood glucose (FBG)/homeostatic model assessment for insulin resistance (HOMA-IR) for insulin resistance, glycated hemoglobin (HbA1c) for long-term glycemic control], imaging examination-based parameters [liver/spleen computed tomography (CT) ratio and B-ultrasound for steatosis quantification, liver stiffness for fibrosis], anthropometric parameters related to metabolism [body mass index (BMI) for adiposity], and safety outcomes (adverse events).

2.4 Reporting standards for TCM preparations

All botanical ingredients in the TCM formulations included in this study underwent rigorous taxonomic validation to ensure scientific accuracy and reproducibility. The validation was performed using two authoritative taxonomic databases: Kew Plants of the World Online (http://www.plantsoftheworldonline.org) — a globally recognized resource for plant taxonomic information, and Flora of China, which provides region-specific taxonomic confirmation for Chinese medicinal plants. For each botanical ingredient, the complete taxonomic information was verified and standardized, including the accepted scientific name (genus + species), authority (naming author), and family classification. Ingredients with initially incomplete taxonomic records (e.g., undefined family or missing authority, such as Polygonum cuspidatum and Lonicera japonica) were supplemented through cross-referencing with taxonomic monographs, peer-reviewed literature, and official pharmacopoeial records. For example, Polygonum cuspidatum was validated as Polygonum cuspidatum Sieb. & Zucc. (Polygonaceae), and Lonicera japonica was confirmed as Lonicera japonica Thunb. (Caprifoliaceae), consistent with the latest taxonomic revisions.

All botanical ingredients are presented in a unified standard format: Chinese common name (validated Latin name + authority) [family; pharmacopoeial drug name]. A representative example is Lamiaceae (Salvia miltiorrhiza Bunge)[Lamiaceae; Salviae Miltiorrhizae Radix et Rhizoma]. The pharmacopoeial drug names and corresponding quality standards were adopted from the Chinese Pharmacopoeia (2020 Edition, Volume Ⅰ), with specific volume and page references provided for each ingredient to facilitate quality verification (see Supplementary Table S7).

2.5 Study selection

After the removal of duplicate publications using EndNote X 9.1 (Clarivate Analytics, Philadelphia, PA, USA), the titles and abstracts of all potentially eligible publications were screened and irrelevant studies were excluded. Then, the full texts of the remaining studies were read to compile the final study sample. Two reviewers (Shuai Xu and Wei Chen) independently reviewed all studies. Any disagreement was resolved by a third reviewer (Hong You or Jingjie Zhao). Inter-rater agreement quantified using Cohen’s kappa (κ) coefficients:κ = 0.82 (95% CI: 0.71–0.93), indicating substantial agreement.

2.6 Data extraction

Two reviewers (Lie Xu and Kejia Liu) independently extracted the following data from the included studies using a standardized form: basic information (title and first author name, year of publication, number of included trials, total sample size, intervention and control treatments, outcome measures, effect measures for the outcomes of this overview), methodological quality, reporting quality, evidence quality, and adverse events. A third reviewer (Hong You or Jingjie Zhao) checked the accuracy of the data and resolved any inconsistencies through discussion with the two reviewers. Inter-rater agreement quantified using Cohen’s kappa (κ) coefficients: κ = 0.78 (95% CI: 0.65–0.91), indicating substantial agreement.

2.7 Assessment of methodological quality

The Assessing the Methodological Quality of Systematic Reviews 2 (AMSTAR-2) tool (Shea et al., 2017) was used to assess the methodological quality of the included SRs/MAs. This tool has shown good inter-rater agreement, test-retest reliability, and face and construct validity (Shea et al., 2017). It consists of 16 items, with the response options of “yes,” “partial yes,” and “no.” Seven of the items (2, 4, 7, 9, 11, 13, and 15) are critical for SR evaluations. According to the AMSTAR-2 guidelines, the overall confidence in each evaluated SR was classified as “high,” “moderate,” “low,” or “critically low.” Two reviewers (Xiangyu Hu and Weiwei Yao) independently assessed all studies. Any disagreement was resolved by a third reviewer (Hong You or Jingjie Zhao).

2.8 Assessment of reporting quality

The Preferred Reporting Items for Systemic Reviews and Meta-Analyses (PRISMA) 2020 (Page et al., 2021) and PRISMA Extension for Chinese Herbal Medicines (CHM) (Zhang X. et al., 2020) statements were used to assess the quality of reporting on interventions in the SRs/MAs. The PRISMA 2020 statement consists of 27 items in 7 modules, and the PRISMA-CHM statement consists of 27 extension items and subitems. All items are graded as “completely reported” (“yes”; 1 point), “partially reported” (“partial yes”; 0.5 points), or unreported (“no”; 0 points), with a maximum possible total score for each instrument of 27 points. Scores <16 indicate critical information failure, scores of 16–21 reflect certain information deficits, and scores >21 indicate relatively complete reporting. Two reviewers (Juanjuan Li and Ruimin Jiao) independently assessed all studies. Any disagreement was resolved by a third reviewer (Hong You or Jingjie Zhao).

2.9 Assessment of evidence quality

The Grading of Recommendations Assessment, Development, and Evaluation (GRADE) tool (Balshem et al., 2011) was used to evaluate the quality of evidence for each of our outcome measures. Evidence quality was assessed using GRADE, with downgrading for: (1) risk of bias, (2) inconsistency, (3) imprecision, (4) indirectness, and (5) publication bias. No downgrading of any criterion reflects high-level evidence, the downgrading of one criterion indicates moderate-level evidence, the downgrading of two criteria indicates low-level evidence, and the downgrading of three or more criteria reflects very low-level evidence. Two reviewers (Wei Chen and Xiangyu Hu) independently assessed all studies. Any disagreement was resolved by a third reviewer (Hong You or Jingjie Zhao).

2.10 Compilation and graphical visualization of results

The basic information, clinical characteristics, and quality metrics for all included SRs/MAs were descriptively synthesized and summarized. In addition, bubble plots based on the GRADE (Balshem et al., 2011) results were created. The plots depict the outcomes (x axis), P values for the overall TCM treatment effects (y axis), number of randomized controlled trials (RCTs) included (bubble size), and SR/MA quality according to GRADE assessment (color).

2.11 Evaluation of inter-rater agreement

Agreement between reviewers for each variable was evaluated by calculating Cohen’s κ values using SPSS (version 20; IBM Corporation, Armonk, NY, USA). It was characterized as good (κ > 0.8), substantial (0.6 < κ ≤ 0.8), moderate (0.4 < κ ≤ 0.6), fair (0.2 < κ ≤ 0.4), or poor (κ ≤ 0.2). The agreement of AMSTAR 2 between the two researchers was judged strong or relatively strong (κ > 0.60). The agreement of PRISMA2020 and PRISMA-CHM between the two researchers were judged strong or relatively strong (κ > 0.60). Inter-rater agreement for AMSTAR-2 assessments was substantial to almost perfect (κ > 0.60), indicating strong consistency between reviewers. Similarly, substantial agreement was observed for PRISMA 2020 and PRISMA-CHM evaluations (κ > 0.60), confirming reliable application of reporting quality criteria across reviewers.

3 Results

3.1 Study selection

In total, 1,204 relevant studies were identified across all nine databases. Following the application of the inclusion and exclusion criteria, the full texts of 84 studies were screened. Ultimately, 37 SRs/MAs were included in this overview (Figure 1).

Figure 1
Flowchart depicting the identification of studies via databases and registers. Initially, 1204 records were identified from databases such as CNKI, CBM, VIP, Wanfang, PubMed, Cochrane Library, Embase, Web of Science, and Scopus. After removing 158 duplicates, 1046 records were screened. Following title and abstract exclusions, 84 records were screened, of which 47 were excluded due to criteria such as incorrect diagnosis, non-TCM therapy intervention, non-journal articles, lack of full-text, or data issues. Finally, 37 records were included.

Figure 1. PRISMA diagram of study selection process.

3.2 Study characteristics

The characteristics of the SRs/MAs are summarized in Table 1. All of the studies were published between 2010 and 2024. Ten were published in English (Cai et al., 2019; Dulmini et al., 2024; Kim et al., 2024; Liu et al., 2022; Peng et al., 2016; Peng et al., 2022; Shi et al., 2012; Wang et al., 2021; Zhang L. et al., 2022; Zhang X. et al., 2024), and 27 were published in Chinese (Ding et al., 2021; Dong et al., 2023; Gao and Xue, 2020; Gong et al., 2014; Han et al., 2024; He and Jiang, 2010; Huang et al., 2017; Li et al., 2011; Liu et al., 2023; Ma et al., 2018; Mou et al., 2017; Qi et al., 2015; Qin et al., 2022; Shen and Gong, 2024; Shi et al., 2022; Wei and Ji, 2012; Wu et al., 2018; Wu et al., 2017; Xie et al., 2018; Yang and G, 2019; Yi et al., 2018; Zhang and Zhang, 2014; Zhang et al., 2014; Zhang P. P. et al., 2024; Zhang X. W. et al., 2020; Zhao et al., 2022; Zhou and Gao, 2018). Sample sizes ranged from 4 to 62, and the numbers of participants ranged from 500 to 13,741. Four of the SRs/Mas (Peng et al., 2022; Xie et al., 2018; Zhang P. P. et al., 2024; Zhou and Gao, 2018) focused on populations with NAFLD and diabetes mellitus. Interventions consisting of TCM alone were examined in 17 SRs/MAs (Cai et al., 2019; Dong et al., 2023; Gao and Xue, 2020; Gong et al., 2014; He and Jiang, 2010; Liu et al., 2022; Mou et al., 2017; Peng et al., 2016; Qi et al., 2015; Qin et al., 2022; Shi et al., 2022; Wei and Ji, 2012; Wu et al., 2017; Zhang and Zhang, 2014; Zhang et al., 2014; Zhang X. W. et al., 2020; Zhao et al., 2022), and TCM plus Biomed interventions were examined in 20 studies (Ding et al., 2021; Dulmini et al., 2024; Han et al., 2024; Huang et al., 2017; Kim et al., 2024; Li et al., 2011; Liu et al., 2023; Ma et al., 2018; Peng et al., 2022; Shen and Gong, 2024; Shi et al., 2012; Wang et al., 2021; Wu et al., 2018; Xie et al., 2018; Yang and G, 2019; Yi et al., 2018; Zhang L. et al., 2022; Zhang P. P. et al., 2024; Zhang X. et al., 2024; Zhou and Gao, 2018). Control treatments were placebo in 1 SR/MA (Peng et al., 2016), placebo plus Biomed in 2 SRs/MAs (Dulmini et al., 2024; Kim et al., 2024), placebo or Biomed in 1 SR/MA (Wei and Ji, 2012), and Biomed in 33 SRs/Mas (Cai et al., 2019; Ding et al., 2021; Dong et al., 2023; Gao and Xue, 2020; Gong et al., 2014; Han et al., 2024; He and Jiang, 2010; Huang et al., 2017; Li et al., 2011; Liu et al., 2022; Liu et al., 2023; Ma et al., 2018; Peng et al., 2022; Qi et al., 2015; Qin et al., 2022; Shen and Gong, 2024; Shi et al., 2022; Shi et al., 2012; Wang et al., 2021; Wei and Ji, 2012; Wu et al., 2018; Wu et al., 2017; Xie et al., 2018; Yang and G, 2019; Yi et al., 2018; Zhang and Zhang, 2014; Zhang et al., 2014; Zhang P. P. et al., 2024; Zhang X. et al., 2024; Zhang X. W. et al., 2020; Zhang Y. F. et al., 2022; Zhao et al., 2022; Zhou and Gao, 2018). Quality was assessed using the Cochrane Risk of Bias tool (Sterne et al., 2019) in 26 SRs/MAs (Cai et al., 2019; Ding et al., 2021; Dong et al., 2023; Gao and Xue, 2020; Han et al., 2024; Huang et al., 2017; Kim et al., 2024; Liu et al., 2022; Liu et al., 2023; Ma et al., 2018; Mou et al., 2017; Peng et al., 2016; Peng et al., 2022; Qi et al., 2015; Qin et al., 2022; Shen and Gong, 2024; Shi et al., 2022; Wang et al., 2021; Wu et al., 2017; Yi et al., 2018; Zhang and Zhang, 2014; Zhang et al., 2014; Zhang P. P. et al., 2024; Zhang X. et al., 2024; Zhang X. W. et al., 2020; Zhao et al., 2022), the Jadad scale (Jadad et al., 1996) in 9 studies (Dulmini et al., 2024; He and Jiang, 2010; Li et al., 2011; Wei and Ji, 2012; Wu et al., 2018; Xie et al., 2018; Yang and G, 2019; Zhang et al., 2022; Zhou and Gao, 2018), and both tools in 1 study (Gong et al., 2014); the quality assessment method was not reported in 1 SR/MA (Shi et al., 2012). Thirty-six of the 37 SRs/Mas demonstrated that TCM is effective in treating NAFLD; the results of the remaining study (Wei and Ji, 2012) did not support the efficacy of TCM for NAFLD treatment.

Table 1
www.frontiersin.org

Table 1. Characteristics of included SRs/MAs.

3.3 Methodological quality

AMSTAR-2 results are presented in Figure 2. The overall confidence in methodological quality was graded as high for 1 SR/MA (Shi et al., 2012), moderate for 1 SR/MA (Ma et al., 2018), and low or critically low for 35 SRs/Mas (Cai et al., 2019; Ding et al., 2021; Dong et al., 2023; Dulmini et al., 2024; Gao and Xue, 2020; Gong et al., 2014; Han et al., 2024; He and Jiang, 2010; Huang et al., 2017; Kim et al., 2024; Li et al., 2011; Liu et al., 2022; Liu et al., 2023; Mou et al., 2017; Peng et al., 2016; Peng et al., 2022; Qi et al., 2015; Qin et al., 2022; Shen and Gong, 2024; Shi et al., 2022; Wang et al., 2021; Wei and Ji, 2012; Wu et al., 2018; Wu et al., 2017; Xie et al., 2018; Yang and G, 2019; Yi et al., 2018; Zhang L. et al., 2022; Zhang et al., 2014; Zhang P. P. et al., 2024; Zhang X. et al., 2024; Zhang X. W. et al., 2020; Zhang Y. F. et al., 2022; Zhao et al., 2022; Zhou and Gao, 2018). With the following most frequently missed critical items (items 2, 4, 7, 9, 11, 13, 15; Supplementary Material 3). Among critical items, protocol registration (Item 2): only 3/37 (8.1%) SRs reported pre-specified protocols with registration numbers (e.g., PROSPERO) (Ma et al., 2018; Peng et al., 2022; Shi et al., 2012), and protocol registration numbers were provided in only two SRs/MAs (Liu et al., 2022; Peng et al., 2022). Comprehensive search strategy (Item 4): 32/37 (86.5%) studies (Cai et al., 2019; Ding et al., 2021; Dong et al., 2023; Gao and Xue, 2020; Gong et al., 2014; Han et al., 2024; Huang et al., 2017; Kim et al., 2024; Liu et al., 2022; Liu et al., 2023; Ma et al., 2018; Mou et al., 2017; Peng et al., 2016; Peng et al., 2022; Qi et al., 2015; Qin et al., 2022; Shen and Gong, 2024; Shi et al., 2022; Shi et al., 2012; Wang et al., 2021; Wei and Ji, 2012; Wu et al., 2018; Wu et al., 2017; Xie et al., 2018; Yang and G, 2019; Yi et al., 2018; Zhang and Zhang, 2014; Zhang L. et al., 2022; Zhang P. P. et al., 2024; Zhang X. et al., 2024; Zhang X. W. et al., 2020; Zhao et al., 2022). Omitted detailed search strings for all databases, limiting reproducibility;. None of the 37 SRs listed excluded studies with justifications, representing a 100% non-reporting rate for this critical transparency item (Item 7). Funding/conflicts of interest: 18/37 reported funding sources (Cai et al., 2019; Ding et al., 2021; Dulmini et al., 2024; Gao and Xue, 2020; Gong et al., 2014; Han et al., 2024; He and Jiang, 2010; Li et al., 2011; Liu et al., 2022; Peng et al., 2022; Qin et al., 2022; Shi et al., 2012; Wang et al., 2021; Wu et al., 2017; Yang and G, 2019; Zhang et al., 2014; Zhang Y. F. et al., 2022; Zhao et al., 2022). Risk of bias assessment (Item 9): 26/37 (70.3%) SRs did not explicitly assess risk of bias for included primary studies, and 11/37 (29.7%) provided incomplete assessments (e.g., missing blinding or allocation concealment evaluations). Statistical combination (Item 11): 28/37 (75.7%) SRs/Mas with meta-analyses did not justify statistical methods for pooling heterogeneous results (e.g., fixed vs. random-effects models). Bias impact on conclusions (Item 13): 19/37 (51.4%) failed to discuss how risk of bias in primary studies influenced their conclusions, weakening interpretability. Publication bias assessment (Item 15): 32/37 (86.5%) omitted funnel plots or Egger’s tests; only 5/37 (13.5%) evaluated small-study effects. All studies use appropriate methods for statistical combination of results. Inter-rater agreement on AMSTAR-2 ratings was relatively strong to strong (κ > 0.60).

Figure 2
Chart displaying study quality assessments across multiple items and studies from various authors. Items are categorized as Yes (blue), Partial Yes (yellow), or No (red). Overall quality ranges from moderate to very low.

Figure 2. The evaluation results of methodological quality based on AMSTAR-2. Quality ratings: Red = No report; Yellow = Partial report; Green = Report.

3.4 Reporting quality

PRISMA 2020 and PRISMA-CHM results are presented in Figures 3, 4, respectively. PRISMA 2020 scores ranged from 12.5 to 25/27 (median = 19.5). 12 SRs/MAs (Cai et al., 2019; Ding et al., 2021; Liu et al., 2022; Mou et al., 2017; Peng et al., 2016; Peng et al., 2022; Wu et al., 2017; Yang and G, 2019; Zhang L. et al., 2022; Zhang X. et al., 2024; Zhang Y. F. et al., 2022; Zhao et al., 2022) were found to have relatively complete reporting, 21 studies (Dong et al., 2023; Dulmini et al., 2024; Gao and Xue, 2020; Gong et al., 2014; Huang et al., 2017; Kim et al., 2024; Liu et al., 2023; Ma et al., 2018; Qi et al., 2015; Shen and Gong, 2024; Shi et al., 2022; Shi et al., 2012; Wang et al., 2021; Wei and Ji, 2012; Wu et al., 2018; Yi et al., 2018; Zhang and Zhang, 2014; Zhang et al., 2014; Zhang P. P. et al., 2024; Zhang X. W. et al., 2020; Zhou and Gao, 2018) had certain reporting flaws, and 5 SRs/MAs (Han et al., 2024; He and Jiang, 2010; L. Li et al., 2011; Qin et al., 2022; Xie et al., 2018) had serious reporting deficits. 32/37 (86.5%) did not report registration numbers. 5/37 (13.5%) failed to publish complete search strings for databases. 11/37 (29.7%) omitted risk of bias tables or figures for primary studies. 19/37 (51.4%) did not declare funding sources or conflicts of interest. PRISMA-CHM scores for the 37 studies ranged from 10 to 24. Five SRs/MAs (Ding et al., 2021; Mou et al., 2017; Wu et al., 2017; Zhang et al., 2024; Zhao et al., 2022) achieved relatively complete reporting, 16 studies (Cai et al., 2019; Gao and Xue, 2020; Gong et al., 2014; Huang et al., 2017; Liu et al., 2022; Ma et al., 2018; Peng et al., 2016; Peng et al., 2022; Qi et al., 2015; Shen and Gong, 2024; Shi et al., 2022; Shi et al., 2012; Wang et al., 2021; Zhang and Zhang, 2014; Zhang X. W. et al., 2020; Zhang Y. F. et al., 2022) had certain flaws, and 16 SRs/MAs (Dong et al., 2023; Dulmini et al., 2024; Han et al., 2024; He and Jiang, 2010; Kim et al., 2024; Li et al., 2011; Liu et al., 2023; Qin et al., 2022; Wei and Ji, 2012; Wu et al., 2018; Xie et al., 2018; Yang and G, 2019; Yi et al., 2018; Zhang and Zhang, 2014; Zhang P. P. et al., 2024; Zhou and Gao, 2018) were characterized by critical information failure. 29/37 (78.4%) did not specify botanical drug formula compositions (e.g., Xiaoyao Powder metabolite doses) or quality control methods (e.g., HPLC fingerprinting). 31/37 (83.8%) omitted TCM syndrome criteria (e.g., “liver-stagnation and spleen-deficiency”) for patient stratification. 34/37 (91.9%) did not describe TCM placebo composition or blinding success for botanical drug interventions. The inter-rater agreement on PRISMA 2020 and PRISMA-CHM items was relatively strong to strong (κ > 0.60).

Figure 3
Bar graph depicting responses labeled T1 to T27 with categories Yes, Partial Yes, and No. The bars show varying lengths for each category. Most responses show higher counts for Yes, fewer for Partial Yes, and varying No responses. Color-coded: green for Yes, yellow for Partial Yes, and red for No.

Figure 3. The evaluation results of reporting quality based on PRISMA 2020. Reporting completeness: Red = Seriously deficient (<16/27 items); Yellow = Flawed (16–21/27 items); Green = Relatively complete (>21/27 items).

Figure 4
Bar chart depicting responses to survey questions T1 through T27 with categories

Figure 4. The evaluation results of reporting quality based on PRISMA-CHM. Reporting completeness: Red = Seriously deficient (<16/27 items); Yellow = Flawed (16–21/27 items); Green = Relatively complete (>21/27 items).

3.5 Evidence quality

GRADE results are summarized in Supplementary Table S2. Due to the low quality of evidence for most outcome measures, only descriptive, not pooled, analysis was performed. GRADE downgraded 92.5% of outcomes to low or very low quality due to bias, heterogeneity, imprecision, and publication bias. This limits conclusions to descriptive analyses; “consistent favorable trends” should be interpreted cautiously (low quality≠ineffectiveness, but methodological flaws weaken evidence). Future high-quality RCTs are critical: rigorous randomization, double-blinding, adequate sample sizes, transparent reporting, and standardized TCM interventions to strengthen evidence.

3.6 Clinical total effective rate

Clinical total effective rate results are presented in Figure 5. Total clinical effective rate were reported in 27 SRs/Mas, a consistent trend emerged: TCM interventions (alone or combined with Biomed) showed potential benefits across multiple outcomes, though evidence quality was predominantly low. The Outcome Category results of the clinical effectiveness are summarized in Table 2.

Figure 5
Scatter plot depicting p-value categories versus clinical total effective rate. The y-axis shows p-value categories with a distinction at p equals 0.05. Data points vary by size and color, representing sample size and quality. Yellow indicates very low quality, orange is low, and green is medium. Larger circles indicate larger sample sizes. Most data points cluster around p less than 0.05.

Figure 5. Clinical total effective rate. Note: 38 Clinical total effective rate results from 27 SRs/MAs.

Table 2
www.frontiersin.org

Table 2. Outcome category.

3.7 Imaging examination-based parameters

Imaging examination-based parameters results (Figure 6; Table 2) showed a consistent trend of improved radiological outcomes with TCM: specifically, B-ultrasound demonstrated steatosis resolution in 8/9 SRs/MAs with TCM monotherapy, while CT evaluations revealed improved liver/spleen ratio in 2 SRs/MAs (TCM alone) and 1 SR/MA (TCM+Biomed vs. placebo); additionally, TCM (with or without Biomed) increased the radiological steatosis disappearance rate, and TCM+Biomed favored liver stiffness reduction and steatosis resolution (1 SR each), reflecting an overall trend where TCM monotherapy or TCM+Biomed consistently improved imaging markers of NAFLD including steatosis, liver density, and structural resolution.

Figure 6
Scatter plot showing p-value categories on the y-axis and four diagnostic criteria on the x-axis with bubble sizes indicating sample size. Data points are color-coded for quality: very low (orange), low (yellow), medium (green), and high (dark green). Most points clustered at a p-value less than 0.05.

Figure 6. Imaging examination–based parameters. Note: 11 Effective improvement rate of B-ultrasound results from 10 SRs/MAs; 11 Disappearance of radiological steatosis results from one SR/MAs.

3.8 Anthropometric parameters related to metabolism

Anthropometric parameters related to metabolism results are presented in Figure 7. The results revealed a modest but consistent trend of BMI reduction with TCM interventions: 1 SR using TCM monotherapy and 2 SRs combining TCM with Biomed demonstrated small yet significant decreases in BMI compared to controls. The Outcome Categorys results of the anthropometric parameters related to metabolism are summarized in Table 2.

Figure 7
Scatter plot showing P-value category versus BMI. Dots represent studies with varying sample sizes indicated by circle size, ranging from zero to three thousand. Dot color indicates quality: yellow for very low, light green for low, orange for medium, and dark green for high. Most dots are within P-value greater than 0.05, clustered in the lower P-value zone.

Figure 7. Anthropometric parameters related to metabolism. Note: 10 BMI results from nine SRs/MAs. Abbreviations: BMI, Body Mass Index.

3.9 Laboratory test parameters related to liver function

Laboratory test parameters related to liver function results (Figure 8; Table 2) revealed a robust and consistent trend of improved hepatocellular injury markers with TCM interventions: ALT levels were reduced by −8.2 U/L (95% CI: −10.1 to −6.3) across 33 SRs, though GRADE downgraded evidence to ‘low’ due to imprecision (small sample sizes) and publication bias. Specifically, 20 SRs/MAs showed TCM monotherapy lowered ALT, and 13 SRs/MAs highlighted enhanced ALT reductions with TCM+Biomed versus Biomed alone, with comparable trends observed for AST (15 SRs/MAs), GGT (8 SRs/MAs), and TBil (2 SRs/MAs).

Figure 8
Bubble chart showing the relationship between p-value categories and quality across variables ALT, AST, GGT, and TBIL. Bubbles vary in size, representing sample sizes from zero to three thousand. Colors indicate quality levels: yellow for very low, orange for low, light green for medium, and green for high quality. Most bubbles are clustered under p < 0.05 for ALT and AST, with dispersed patterns in other variables.

Figure 8. Laboratory test parameters related to liver function. Note: 48 ALT results from 33 SRs/MAs; 34 AST results from 30 SRs/MA. Abbreviations: ALT, Alanine aminotransferase; AST, Aspartate aminotransferase; GGT, Gamma-glutamyl transpeptidase; TBiL, Total Bilirubin.

3.10 Laboratory test parameters related to glucose and lipid metabolism

Laboratory test parameters related to glucose and lipid metabolism results (Figure 9; Table 2) revealed a consistent trend of metabolic improvement with TCM interventions: TCM combined with Biomed significantly enhanced glycemic control (evident in 2 SRs/MAs for HbA1c, 4 for FBG/FPG, and 3 for 2hPG) and improved insulin resistance (4 SRs/MAs for HOMA-IR), while TCM monotherapy or TCM+Biomed consistently reduced lipid levels-TCM alone lowered TG (15 SRs/MAs), TC (16 SRs/MAs), HDL-C (5 SRs/MAs), and LDL-C (5 SRs/MAs), with TCM+Biomed further enhancing reductions in TG (11 SRs/MAs), TC (11 SRs/MAs), and LDL-C (5 SRs/MAs) versus Biomed alone. Additionally, one SR/MA noted TCM+Biomed reduced adiponectin levels, while another reported TCM (with/without Biomed) promoted metabolic normalization, collectively indicating that TCM exerts a comprehensive regulatory effect on glucose and lipid metabolism in NAFLD, with TCM+Biomed showing synergistic benefits in metabolic parameter improvement.

Figure 9
Bubble chart showing p-value categories for variables like HbA1c, FPG, and cholesterol-related markers. Bubbles vary in size, representing sample sizes from zero to three thousand, and are colored from very low quality (orange) to high quality (dark green). The chart divides data by p-value categories: greater than 0.05 and less than 0.05.

Figure 9. Laboratory test parameters related to glucose and lipid metabolism. Note: 11 Normalization of blood lipid results from one SR/MAs. Abbreviations: HbA1c, glycosylated hemoglobin type A1c; FPG, fasting plasma glucose; 2hPG, 2-h postprandial blood glucose; FINS, fasting serum insulin; HOMA-IR, homeostatic model assessment for insulin resistance; TG, Triglyceride; HDL-C, High-density lipoprotein cholesterol; LDL-C, Low-density lipoprotein cholesterol; TC, Total cholesterol; Adpn, Adiponectin.

3.11 Adverse events

The adverse events of the SRs/MAs are summarized in Table 3. Treatment-associated adverse events were reported in 9 SRs/MAs (Huang et al., 2017; Liu et al., 2022; Ma et al., 2018; Peng et al., 2022; Qin et al., 2022; Wang et al., 2021; Wei and Ji, 2012; Zhang L. et al., 2022; Xie et al., 2018). These events occurred across three comparison groups: TCM plus Biomed vs. Biomed (total patients: n = 3,716; treatment group: 1,856 patients; control group: 1,860 patients), TCM vs. Biomed (n = 3,238; treatment group: 1730 patients; control group: 1,508 patients), and TCM vs. placebo/Biomed (n = 608; treatment group: 308 patients; control group: 300 patients). The primary adverse events included gastrointestinal reactions (65.3% of total events), such as diarrhea (28 cases, 0.42% overall incidence), abdominal distension (21 cases, 0.32%), nausea (15 cases, 0.23%), and gastric discomfort (3 cases, 0.05%); and dermatological and neurological symptoms (18.4% of total events), including rash (16 cases, 0.24%), dizziness (2 cases, 0.03%), and dry mouth (1 case, 0.02%). Notably, transient liver enzyme elevation (ALT increase) was reported in 5 cases (0.08%) in the Biomed control groups (Ma et al., 2018; Xie et al., 2018). Overall, the total adverse event incidence was low (0.23%–1.75%), with 89.8% of events resolving spontaneously without special intervention. No severe adverse events (e.g., anaphylaxis, liver failure) were reported across all studies.

Table 3
www.frontiersin.org

Table 3. Occurrence of adverse reactions.

4 Discussion

TCM, as a complementary and alternative therapy, has been used widely in the clinical treatment of NAFLD based on experience gained in China (Dai X. et al., 2021). In this study, we performed an overview of SRs/MAs examining the use of TCM for the treatment of NAFLD published through 2024. The AMSTAR-2 assessment showed that one SR/MA (Shi et al., 2012) was of high quality, one SR/MA (Ma et al., 2018) was of moderate quality, and 35 SRs/MAs were of low or critically low quality. Critical items that were not reported in most SRs/MAs included study protocols, numbers of excluded studies and reasons for exclusion, funding sources, and conflicts of interest. Assessment using the PRISMA 2020 and PRISMA-CHM statements identified similar reporting deficits. The GRADE assessment revealed various degrees of bias risk for all included outcome indicators. The risk of bias (284/307) in the included RCTs emerged as the primary determinant of evidence quality downgrading. This methodological limitation was attributable primarily to the insufficient documentation of random sequence generation, allocation concealment, and blinding methods. Other factors included inconsistency (243/307), imprecision (174/307), and publication bias (186/307). The reasons for these problems included small total sample sizes, small numbers of events, insufficient reporting of funding sources, certain degrees of overlap in confidence intervals, and I2 values >40%. High heterogeneity (I2 = 68%–85%) among included SRs/MAs, stemming from variable TCM formulations (e.g., Xiaoyao Powder vs. Yinchen Wuling San), dosages (3–15 g/day), treatment durations (2–24 weeks), and patient populations (with/without T2DM), complicates result interpretation. This heterogeneity weakens the reliability of pooled effect sizes, may mask subgroup-specific efficacy (e.g., differential responses in mild vs. severe NAFLD), and necessitates cautious interpretation of aggregated findings, highlighting the need for standardized TCM interventions and stratified analyses in future research. Results for the primary outcome indicators of the included SRs/MAs show consistent trends of favorable changes in liver enzymes, metabolic parameters, and imaging markers associated with TCM interventions (alone or combined with Biomed). However, these trends cannot be interpreted as definitive “effectiveness” due to the overall low/critically low methodological quality of included studies, high heterogeneity, and risk of bias. Given the methodological shortcomings of included SRs/MAs (e.g., incomplete protocol registration, lack of standardized TCM formulations, small sample sizes, and inadequate blinding), the observed trends of TCM-associated improvements in NAFLD-related markers should be interpreted strictly as preliminary observations rather than evidence of therapeutic efficacy. Specifically: - The reduction in ALT and TG levels, while consistent across studies, may be influenced by publication bias and unreported confounding factors (e.g., concurrent lifestyle interventions not stratified in analyses). - High heterogeneity in TCM formulas (e.g., Xiaoyao Powder vs. Danning Tablet), dosages (3–15 g/day), and treatment durations (2–24 weeks) prevents pooling of reliable effect sizes, making it impossible to confirm which TCM interventions (if any) truly contribute to the observed trends. The low incidence of adverse events, while reassuring, is limited by incomplete reporting in most SRs/MAs (only 9/37 reported safety outcomes), so the safety profile of TCM for NAFLD also remains insufficiently verified. The risk of bias analysis identified several critical shortcomings in the reporting of random sequence generation, allocation concealment, and blinding methods across the included studies. These deficiencies significantly undermine the reliability of the findings and limit the extent to which these studies can be used to robustly assess the efficacy of Traditional Chinese Medicine (TCM) interventions for non-alcoholic fatty liver disease (NAFLD). While the consistent direction of effects (e.g., reduction in liver enzymes and metabolic parameters) suggests potential therapeutic benefits, the high risk of bias means that these trends should be interpreted with caution and cannot be taken as definitive evidence of efficacy.

Based on the study findings, we put forward the following recommendations, which are tailored to enhance the quality and comprehensiveness of future SRs/MAs and RCTs. First, researchers should register their protocols after identifying their research topics to enhance transparency, minimize selective reporting bias, and improve the rigor and credibility of reporting (Bradley et al., 2020). Second, more comprehensive retrieval strategies should be developed and employed to ensure the reliability of the results (Li et al., 2025). Third, funding sources and the interests of relevant institutions should be declared (Yu et al., 2023). Fourth, in studies involving TCM interventions, information such as the composition and dosage of botanical drugs and the duration and frequency of treatment should be precisely reported (Su et al., 2022).

4.1 Limitations

Several limitations of this overview should be acknowledged. Firstly, methodological biases risk the distortion of outcome directionality, and insufficient sample sizes reduce the statistical power and precision of estimates, thereby weakening the reliability of conclusions (Furuya-Kanamori et al., 2021). Low-quality evidence in this overview reflects methodological limitations of primary studies (e.g., small sample sizes, inadequate randomization, and reporting biases) rather than evidence of TCM ineffectiveness. The consistent direction of effects (e.g., 33/33 SRs/MAs reporting ALT reduction) suggests biological plausibility, but definitive conclusions require higher-quality RCTs with rigorous design (e.g., proper allocation concealment, blinded outcome assessment) and standardized TCM interventions. Secondly, for the linguistic capabilities of the research team, our systematic literature search was confined to Chinese and English databases. Although publications in these languages dominate the current scientific landscape in this field, this restriction may have led to the omission of high-quality studies published in other languages in specific regions, potentially introducing a language bias. Future research could attempt to incorporate multilingual searches to provide a more comprehensive overview. Thirdly, substantial heterogeneity was observed among the included SRs/MAs, primarily driven by variability in TCM interventions. Differences in botanical drug formulas (e.g., Xiaoyao Powder vs. Yinchen Wuling San), dosages (3–15 g/day), and treatment durations (2–24 weeks) contributed to high statistical heterogeneity (I2 = 68–85%) across key outcomes such as ALT reduction and B-ultrasound improvement. This heterogeneity complicates the interpretation of pooled effect sizes and may mask differential efficacy across subgroups, necessitating cautious interpretation of aggregated results. Journals should mandate PROSPERO registration and PRISMA-CHM adherence for TCM SRs/MAs. Primary RCTs must standardize botanical drug formulas (e.g., fixed dosages, HPLC fingerprinting) and report long-term outcomes (e.g., fibrosis progression via FibroScan). A key limitation of included SRs/MAs is incomplete reporting of TCM preparation details, as identified by the ConPhYMP assessment. Most studies lacked clear descriptions of botanical drugs raw material sources (78%), processing techniques (65%), and quality control methods (83%), which hinders the reproducibility of results and clinical translation. Future TCM research should strictly follow ConPhYMP and PRISMA-CHM guidelines to standardize the reporting of botanical drug preparations.

4.2 Conclusion

Across included SRs/MAs, TCM interventions are associated with consistent trends of favorable changes in liver enzymes, metabolic parameters, and imaging markers in NAFLD. However, due to the overall low methodological quality, high heterogeneity, and risk of bias in the underlying research, these trends do not support the conclusion that “TCM contributes to NAFLD improvement.” Instead, they highlight the need for high-quality, standardized RCTs to verify whether TCM has a causal role in NAFLD management.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding authors.

Author contributions

JL: Conceptualization, Visualization, Writing – review and editing, Formal Analysis, Investigation, Methodology, Writing – original draft. RJ: Data curation, Formal Analysis, Funding acquisition, Investigation, Methodology, Resources, Software, Visualization, Writing – review and editing. WS: Data curation, Formal Analysis, Funding acquisition, Writing – original draft. WC: Data curation, Investigation, Software, Visualization, Writing – original draft. XH: Formal Analysis, Software, Visualization, Writing – original draft. SX: Data curation, Investigation, Resources, Software, Supervision, Writing – original draft. LX: Investigation, Resources, Writing – original draft. WY: Formal Analysis, Investigation, Validation, Writing – original draft. KL: Data curation, Investigation, Validation, Writing – original draft. HY: Conceptualization, Data curation, Funding acquisition, Investigation, Resources, Supervision, Validation, Writing – review and editing. JZ: Conceptualization, Funding acquisition, Project administration, Resources, Supervision, Visualization, Writing – review and editing.

Funding

The authors declare that financial support was received for the research and/or publication of this article. This project was supported by the grant of National Natural Science Foundation of China (82405192); the Noncommunicable Chronic Diseases-National Science and Technology Major Project (No.2023ZD0508702); Beijing High-level Public Health Technical Personnel Construction Project (Discipline backbone-03-40); The project of “Friendship Seed Plan” Talent Project of Beijing Friendship Hospital, Capital Medical University (YYZZ202333, YYZZ202441).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphar.2025.1675793/full#supplementary-material

References

Balshem, H., Helfand, M., Schunemann, H. J., Oxman, A. D., Kunz, R., Brozek, J., et al. (2011). GRADE guidelines: 3. Rating the quality of evidence. J. Clin. Epidemiol. 64, 401–406. doi:10.1016/j.jclinepi.2010.07.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Bradley, S. H., DeVito, N. J., Lloyd, K. E., Richards, G. C., Rombey, T., Wayant, C., et al. (2020). Reducing bias and improving transparency in medical research: a critical overview of the problems, progress and suggested next steps. J. R. Soc. Med. 113, 433–443. doi:10.1177/0141076820956799

PubMed Abstract | CrossRef Full Text | Google Scholar

Cai, Y. F., Liang, Q. E., Chen, W. H., Chen, M. H., Chen, R. X., Zhang, Y., et al. (2019). Evaluation of HuoXueHuaYu therapy for nonalcoholic fatty liver disease: a systematic review and meta-analysis of randomized controlled trial. BMC Complement. Altern. Med. 19, 178. doi:10.1186/s12906-019-2596-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, M., Xie, Y., Gong, S., Wang, Y., Yu, H., Zhou, T., et al. (2021). Traditional Chinese medicine in the treatment of nonalcoholic steatohepatitis. Pharmacol. Res. 172, 105849. doi:10.1016/j.phrs.2021.105849

PubMed Abstract | CrossRef Full Text | Google Scholar

Cusi, K., Isaacs, S., Barb, D., Basu, R., Caprio, S., Garvey, W. T., et al. (2022). American association of clinical endocrinology clinical practice guideline for the diagnosis and management of nonalcoholic fatty liver disease in primary care and endocrinology clinical settings: co-sponsored by the American association for the study of liver diseases (AASLD). Endocr. Pract. 28, 528–562. doi:10.1016/j.eprac.2022.03.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Dai, L., Zhou, W. J., Zhong, L. L. D., Tang, X. D., and Ji, G. (2021). Chinese medicine formulas for nonalcoholic fatty liver disease: overview of systematic reviews. World J. Clin. Cases 9, 102–117. doi:10.12998/wjcc.v9.i1.102

PubMed Abstract | CrossRef Full Text | Google Scholar

Dai, X., Feng, J., Chen, Y., Huang, S., Shi, X., Liu, X., et al. (2021). Traditional Chinese medicine in nonalcoholic fatty liver disease: molecular insights and therapeutic perspectives. Chin. Med. 16, 68. doi:10.1186/s13020-021-00469-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Ding, C. M., Wang, Z. Y., Bai, H. H., Hou, L. X., Gou, X. J., and Xu, M. L. (2021). Systematic reviewof clinical efficacy of danning tablet in the treatment of non-alcoholic fatty liver disease. Eval. Analysis Drug-Use Ia Hospitals China 21, 459–463. doi:10.14009/j.issn.1672-2124.2021.04.018

CrossRef Full Text | Google Scholar

Dong, Q. L., Li, K., Gan, M., Wang, S., and Li, G. Z. (2023). Meta-analysis of yinchen wuling powder in the treatment of non-alcoholic fatty liver disease. Tradit. Chin. Med. 12, 67–75. doi:10.12677/tcm.2023.121012

CrossRef Full Text | Google Scholar

Dulmini, W. R., Piumika, S., Madunil, A. N., Dileepa, E., and Jennifer, P. (2024). Herbal treatments for non-alcoholic fatty liver disease: a systematic review and meta-analysis of randomized controlled trials. Adv. Integr. Med. 12, 100410. doi:10.1016/j.aimed.2024.08.016

CrossRef Full Text | Google Scholar

Eslam, M., Sarin, S. K., Wong, V. W., Fan, J. G., Kawaguchi, T., Ahn, S. H., et al. (2020). The Asian Pacific association for the study of the liver clinical practice guidelines for the diagnosis and management of metabolic associated fatty liver disease. Hepatol. Int. 14, 889–919. doi:10.1007/s12072-020-10094-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Fan, J. G., Xu, X. Y., Yang, R. X., Nan, Y. M., Wei, L., Jia, J. D., et al. (2024). Guideline for the prevention and treatment of metabolic dysfunction-associated fatty liver disease (version 2024). J. Clin. Transl. Hepatol. 12 (11), 955–974. doi:10.14218/JCTH.2024.00311

PubMed Abstract | CrossRef Full Text | Google Scholar

Fernando, D. B., and Pedro, H. C. F. U. (2021). Liver transplantation and bariatric surgery: a new surgical reality: a systematic review of the best time for bariatric surgery. Updat. Surg. 73, 1615–1622. doi:10.1007/s13304-021-01106-3

CrossRef Full Text | Google Scholar

Furuya-Kanamori, L., Xu, C., Hasan, S. S., and Doi, S. A. (2021). Quality versus risk-of-bias assessment in clinical research. J. Clin. Epidemiol. 129, 172–175. doi:10.1016/j.jclinepi.2020.09.044

PubMed Abstract | CrossRef Full Text | Google Scholar

Gao, G. Y., and Xue, J. D. (2020). A meta-analysis of the treatment of non-alcoholic fatty liver by soothing the liver and strengthening the spleen. J. Hainan Med. Univ. 26, 1307–1314+1322. doi:10.13210/j.cnki.jhmu.20200429.004

CrossRef Full Text | Google Scholar

Gofton, C., Upendran, Y., Zheng, M. H., and George, J. (2023). MAFLD: how is it different from NAFLD? Clin. Mol. Hepatol. 29, S17–S31. doi:10.3350/cmh.2022.0367

PubMed Abstract | CrossRef Full Text | Google Scholar

Gong, X. W., Yang, Q. H., and Xu, Y. J. (2014). The effectiveness and safety of soothing liver and activating spleen method treating nonalcoholic fatty liver disease. Chin. J. Gerontology 34, 3817–3820. doi:10.3969/j.issn.1005-9202.2014.14.001

CrossRef Full Text | Google Scholar

Han, S. K., Baik, S. K., and Kim, M. Y. (2023). Non-alcoholic fatty liver disease: definition and subtypes. Clin. Mol. Hepatol. 29, S5–S16. doi:10.3350/cmh.2022.0424

PubMed Abstract | CrossRef Full Text | Google Scholar

Han, J., X, G. Y., Chen, D., Wang, W. Q., Sun, Y. L., and G, S. J. (2024). Meta analysis of integrated traditional Chinese and Western medicine in the treatment of nonalcoholic fatty liver disease. Chin. Med. Mod. Distance Educ. Of China 22. doi:10.3969/j.issn.1672-2779.2024.20.033

CrossRef Full Text | Google Scholar

He, M., and Jiang, J. (2010). Meta-analysis of effect of TCM on main biochemical indexes of non-alcoholic fatty liver disease. China J. Traditional Chin. Med. Pharm. 25, 1214–1220.

Google Scholar

Hsu, C. L., and Loomba, R. (2024). From NAFLD to MASLD: implications of the new nomenclature for preclinical and clinical research. Nat. Metab. 6, 600–602. doi:10.1038/s42255-024-00985-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, Q. M., Zhang, Z. B., and Li, X. (2017). Meta-analysis on cassiae semen-containing Chinese herbal compound preparations in treatment of nonalcoholic fatty liver. Eval. Analysis Drug-Use Hosp. China 17, 225–231. doi:10.14009/j.issn.1672-2124.2017.02.028

CrossRef Full Text | Google Scholar

Jadad, A. R., Moore, R. A., Carroll, D., Jenkinson, C., Reynolds, D. J., Gavaghan, D. J., et al. (1996). Assessing the quality of reports of randomized clinical trials: is blinding necessary? Control Clin. Trials 17, 1–12. doi:10.1016/0197-2456(95)00134-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Kim, M. H., Ahn, S., Hur, N., Oh, S. Y., and Son, C. G. (2024). The additive effect of herbal medicines on lifestyle modification in the treatment of non-alcoholic fatty liver disease: a systematic review and meta-analysis. Front. Pharmacol. 15, 1362391. doi:10.3389/fphar.2024.1362391

PubMed Abstract | CrossRef Full Text | Google Scholar

Kumar, S., Wong, R. C., Newberry, M., Yeung, J. M., and Sharaiha, R. Z. (2021). Multidisciplinary clinic models: a paradigm of care for management of NAFLD. Hepatology 74, 3472–3478. doi:10.1002/hep.32081

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, L., Su, D. M., Han, H. X., and Li, J. X. (2011). Traditional Chinese medicine for non-alcoholic steatohepatitis: a systematic review. Chin. J. Evidence-Based Med. 11, 195–203. doi:10.3969/j.issn.1672-2531.2011.02.014

CrossRef Full Text | Google Scholar

Li, Y., Huang, Z., Luan, Z., Xu, S., Zhang, Y., Wu, L., et al. (2025). Efficient evidence selection for systematic reviews in traditional Chinese medicine. BMC Med. Res. Methodol. 25, 10. doi:10.1186/s12874-024-02430-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, N., Yang, J., Ma, W., Li, C., An, L., Zhang, X., et al. (2022). Xiaoyao powder in the treatment of non-alcoholic fatty liver disease: a systematic review and meta-analysis. J. Ethnopharmacol. 288, 114999. doi:10.1016/j.jep.2022.114999

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, Y. J., Yi, F., Lin, Y. N., and Yang, J. L. (2023). Efficacy of traditional Chinese medicine in treating non-alcoholic fatty liver disease. China Mod. Dr. 61, 77–84. doi:10.3969/j.issn.1673-9701.2023.01.018

CrossRef Full Text | Google Scholar

Ma, X., Wen, J. X., He, X., Wei, S. Z., Li, H. T., and Zhao, Y. L. (2018). Yinchen-dahuang drug compatibility in treatment of non-alcoholic fatty liver disease: meta-Analysis. Eval. Analysis Drug-Use Hosp. China 18, 1170–1175+1178. doi:10.14009/j.issn.1672-2124.2018.09.004

CrossRef Full Text | Google Scholar

Mou, K., Wu, Y. F., Xu, N. N., Xue, H. Y., and Zhong, S. (2017). Dangfei liganning capsule for nonalcoholic fatty liver disease: a systematic review. World Latest Med. Inf. 17, 12–15. doi:10.19613/j.cnki.1671-3141.2017.76.005

CrossRef Full Text | Google Scholar

Page, M. J., McKenzie, J. E., Bossuyt, P. M., Boutron, I., Hoffmann, T. C., Mulrow, C. D., et al. (2021). The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ 372, n71. doi:10.1136/bmj.n71

PubMed Abstract | CrossRef Full Text | Google Scholar

Peng, H., He, Y., Zheng, G., Zhang, W., Yao, Z., and Xie, W. (2016). Meta-analysis of traditional herbal medicine in the treatment of nonalcoholic fatty liver disease. Cell Mol. Biol. (Noisy-le-grand) 62, 88–95. doi:10.14715/cmb/2016.62.4.16

PubMed Abstract | CrossRef Full Text | Google Scholar

Peng, S. H., Liu, L., Zi, Y. X., Zhang, X. Y., Xie, C. G., Ye, S., et al. (2022). Chinese herbal medicine for type 2 diabetes mellitus with nonalcoholic fatty liver disease: a systematic review and meta-analysis. Front. Pharmacol. 13, 863839. doi:10.3389/fphar.2022.863839

PubMed Abstract | CrossRef Full Text | Google Scholar

Qi, J., Zang, Y. F., and Xia, Q. Q. (2015). Erchen decoction in treatment of Non- alcoholic fatty liver. A Syst. Rev. Liaoning J. Traditional Chin. Med. 42, 2276–2280. doi:10.13192/j.issn.1000-1719.2015.12.002

CrossRef Full Text | Google Scholar

Qin, H. Y., Yang, L. Q., and Sun, K. W. (2022). Meta-analysis of curative effect of lingguizhugan decoction in the treatment of non-alcoholic fatty liver disease. Med. Innovation China 19, 176–180. doi:10.3969/j.issn.1674-4985.2022.02.044

CrossRef Full Text | Google Scholar

Rinella, M. E., Neuschwander-Tetri, B. A., Siddiqui, M. S., Abdelmalek, M. F., Caldwell, S., Barb, D., et al. (2023). AASLD practice guidance on the clinical assessment and management of nonalcoholic fatty liver disease. Hepatology 77, 1797–1835. doi:10.1097/HEP.0000000000000323

PubMed Abstract | CrossRef Full Text | Google Scholar

Shea, B. J., Reeves, B. C., Wells, G., Thuku, M., Hamel, C., Moran, J., et al. (2017). AMSTAR 2: a critical appraisal tool for systematic reviews that include randomised or non-randomised studies of healthcare interventions, or both. BMJ 358, j4008. doi:10.1136/bmj.j4008

PubMed Abstract | CrossRef Full Text | Google Scholar

Shen, Y. H., and Gong, X. Q. (2024). A meta-analysis of the efficacy of integrative medicine on non-alcoholic fatty liver disease. Clin. J. Chin. Med. 16, 97–104. doi:10.3969/j.issn.1674-7860.2024.02.016

CrossRef Full Text | Google Scholar

Shi, K. Q., Fan, Y. C., Liu, W. Y., Li, L. F., Chen, Y. P., and Zheng, M. H. (2012). Traditional Chinese medicines benefit to nonalcoholic fatty liver disease: a systematic review and meta-analysis. Mol. Biol. Rep. 39, 9715–9722. doi:10.1007/s11033-012-1836-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Shi, H., Deng, G. H., Li, Y. J., Yang, M. H., Ye, H. X., and Gao, L. (2022). Meta-analysis of dispelling phlegm and removing stasis in the treatment of non-alcoholic fatty liver disease. Shandong J. Traditional Chin. Med. 41, 281–289. doi:10.16295/j.cnki.0257-358x.2022.03.008

CrossRef Full Text | Google Scholar

Smart, N. A., King, N., McFarlane, J. R., Graham, P. L., and Dieberg, G. (2018). Effect of exercise training on liver function in adults who are overweight or exhibit fatty liver disease: a systematic review and meta-analysis. Br. J. Sports Med. 52, 834–843. doi:10.1136/bjsports-2016-096197

PubMed Abstract | CrossRef Full Text | Google Scholar

Sterne, J. A. C., Savovic, J., Page, M. J., Elbers, R. G., Blencowe, N. S., Boutron, I., et al. (2019). RoB 2: a revised tool for assessing risk of bias in randomised trials. BMJ 366, l4898. doi:10.1136/bmj.l4898

PubMed Abstract | CrossRef Full Text | Google Scholar

Su, W., Du, Y., Lian, F., Wu, H., Zhang, X., Yang, W., et al. (2022). Standards for collection, preservation, and transportation of fecal samples in TCM clinical trials. Front. Cell Infect. Microbiol. 12, 783682. doi:10.3389/fcimb.2022.783682

PubMed Abstract | CrossRef Full Text | Google Scholar

Tsompanaki, E., Thanapirom, K., Papatheodoridi, M., Parikh, P., Chotai de Lima, Y., and Tsochatzis, E. A. (2023). Systematic review and meta-analysis: the role of diet in the development of nonalcoholic fatty liver disease. Clin. Gastroenterol. Hepatol. 21, 1462–1474 e1424. doi:10.1016/j.cgh.2021.11.026

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, T., Song, J., Hu, J., Feng, S., Zhang, H., Wang, H., et al. (2021). Efficacy and safety of Huazhi Rougan granule in the treatment of non-alcoholic fatty liver: a systematic review and meta-analysis. Ann. Palliat. Med. 10, 12969–12984. doi:10.21037/apm-20-1613

PubMed Abstract | CrossRef Full Text | Google Scholar

Wei, H. F., and Ji, G. (2012). Systematic review of treating nonalcoholic fatty liver with taking TCM orally:a clinical randomized controlled trial. China J. Traditional Chin. Med. Pharm. 27, 1309–1314.

Google Scholar

Wu, X. S., Liu, S. H., and Yang, S. Y. (2017). A meta-analysis of Jianpi Huashi therapy for non-alcoholic fatty liver disease. Asia-Pacific Tradit. Med. 13, 46–52. doi:10.11954/ytctyy.201722017

CrossRef Full Text | Google Scholar

Wu, N., Gao, X., Han, L., Ye, Z. H., Ni, Y., and Li, H. M. (2018). Systematic review and Meta analysis of traditional Chinese medicine treatment for hyperlipidemic fatty liver. Chin. J. Integr. Traditional West. Med. Liver Dis. 28, 55–60. doi:10.3969/j.issn.1005–0264.2018.01.021

CrossRef Full Text | Google Scholar

Xie, M., Gong, T., Zhao, Y., and Zuo, X. H. (2018). Meta analysis of the therapeutic effect of Chinese herbs on type 2 diabetes with nonalcoholic fatty liver. Asia-Pacific Tradit. Med. 14, 111–114. doi:10.11954/ytctyy.201808037

CrossRef Full Text | Google Scholar

Yang, H. C., and G, S. J. (2019). Systemic evaluation of the efficacy and safety of invigorating spleen and resolving phlegm method in the treatment of nonalcoholic fatty liver disease. Mod. J. Integr. Traditional Chin. West. Med. 28, 267–273. doi:10.3969/j.issn.1008-8849.2019.03.010

CrossRef Full Text | Google Scholar

Yang, J. M., Sun, Y., Wang, M., Zhang, X. L., Zhang, S. J., Gao, Y. S., et al. (2019). Regulatory effect of a Chinese herbal medicine formula on non-alcoholic fatty liver disease. World J. Gastroenterol. 25, 5105–5119. doi:10.3748/wjg.v25.i34.5105

PubMed Abstract | CrossRef Full Text | Google Scholar

Yi, F., Ma, C. Z., Zeng, W. P., George, P., Wang, J., and Zeng, B. F. (2018). Systematic review of randomized controlled trials of Yin-ChenHao soup treatment of NAFLD. J. Xinjiang Med. Univ. 41, 139–143. doi:10.3969/j.issn.1009-5551.2018.02.003

CrossRef Full Text | Google Scholar

Younossi, Z. M., Golabi, P., Paik, J. M., Henry, A., Van Dongen, C., and Henry, L. (2023). The global epidemiology of nonalcoholic fatty liver disease (NAFLD) and nonalcoholic steatohepatitis (NASH): a systematic review. Hepatology 77, 1335–1347. doi:10.1097/HEP.0000000000000004

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, X., Wu, S., Zhang, J., Hu, Y., Luo, M., Zhao, H., et al. (2023). Developing TCM clinical practice guidelines: a comparison between traditional Chinese medicine and western medicine. Integr. Med. Res. 12, 100952. doi:10.1016/j.imr.2023.100952

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, J., and Zhang, H. H. (2014). Systematic review of treating nonalcoholic fatty liver with taking TCM and screening of active components. China Health Ind., 43–45. doi:10.16659/j.cnki.1672-5654.2014.16.025

CrossRef Full Text | Google Scholar

Zhang, L. D., Wei, W., Sun, X. H., and Yao, K. W. (2014). Yinchenhao decoction adjustment for treatment of nonalcoholic fatty liver disease: a systematic review and meta-analysis of randomized controlled trials. World Chin. J. Dig. 22, 2327. doi:10.11569/wcjd.v22.i16.2327

CrossRef Full Text | Google Scholar

Zhang, X., Tan, R., Lam, W. C., Yao, L., Wang, X., Cheng, C. W., et al. (2020). PRISMA (preferred reporting items for systematic reviews and meta-analyses) extension for Chinese herbal medicines 2020 (PRISMA-CHM 2020). Am. J. Chin. Med. 48, 1279–1313. doi:10.1142/S0192415X20500639

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, X. W., Zhang, W. G., Korean, T., Li, J. X., and Meng, J. (2020). Meta-analysis of the clinical effect of invigorating the spleen and soothing the liver in the treatment of non-alcoholic steatohepatitis. Chin. J. Integr. Traditional West. Med. Dig. 28, 283–291. doi:10.3969/j.issn.1671-038X.2020.04.10

CrossRef Full Text | Google Scholar

Zhang, L., Liu, S., Gu, Y., Li, S., Liu, M., and Zhao, W. (2022). Comparative efficacy of Chinese patent medicines for non-alcoholic fatty liver disease: a network meta-analysis. Front. Pharmacol. 13, 1077180. doi:10.3389/fphar.2022.1077180

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Y. F., Liu, T., Zhang, L. Y., Zhong, P. P., Zheng, Y., and Hai, B. H. (2022). Wendan decoction in the treatment of nonalcoholic fatty liver disease: a systematic review and meta-analysis. Front. Pharmacol. 13, 1039611. doi:10.3389/fphar.2022.1039611

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, P. P., Lu, J. F., Zhang, M., Kong, J. L., Peng, L., Wang, L., et al. (2024). Integrated traditional Chinese and Western medicine for type 2 diabetes mellitus with nonalcoholic fatty liver disease: meta-analysis. Adv. Clin. Med. 14, 3339–3353. doi:10.12677/acm.2024.142471

CrossRef Full Text | Google Scholar

Zhang, X., Jiang, Z. H., Jin, X. L., and Zhou, Q. J. (2024). Efficacy of traditional Chinese medicine combined with Silibinin on nonalcoholic fatty liver disease: a meta-analysis and systematic review. Med. Baltim. 103, e37052. doi:10.1097/MD.0000000000037052

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhao, D. M., Bai, F. Y., Sun, T., Xu, Q. L., and Liu, T. (2022). Meta-analysis of invigorating spleen prescriptions in the treatment of nonalcoholic fatty liver disease. Guangming J. Chin. Med. 37, 2697–2701. doi:10.3969/j.issn.1003-8914.2022.15.01

CrossRef Full Text | Google Scholar

Zhou, C. Y., and Gao, T. S. (2018). Clinical efficacy of traditional Chinese medicine on type 2 diabetic patients with nonalcoholic fatty liver disease: a meta-analysis. Guid. J. Traditional Chin. Med. Pharm. 24, 113–118. doi:10.13862/j.cnki.cn43-1446/r.2018.09.035

CrossRef Full Text | Google Scholar

Keywords: non-alcoholic fatty liver disease, traditional Chinese medicine, systematic review, meta-analysis, overview

Citation: Li J, Jiao R, Su W, Chen W, Hu X, Xu S, Xu L, Yao W, Liu K, You H and Zhao J (2025) Traditional Chinese medicine for non-alcoholic fatty liver disease: an overview of systematic reviews with evidence mapping and metabolic outcome assessment. Front. Pharmacol. 16:1675793. doi: 10.3389/fphar.2025.1675793

Received: 29 July 2025; Accepted: 21 November 2025;
Published: 10 December 2025.

Edited by:

Hongbo Li, Shaanxi University of Science and Technology, China

Reviewed by:

YuFeng Zhang, Nanjing University of Chinese Medicine, China
Jinghan Xu, Shenzhen Hospital (Futian) of Guangzhou University of Chinese Medicine, China

Copyright © 2025 Li, Jiao, Su, Chen, Hu, Xu, Xu, Yao, Liu, You and Zhao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Hong You, eW91aG9uZ2xpdmVyQGNjbXUuZWR1LmNu; Jingjie Zhao, emhhb2pqQGNjbXUuZWR1LmNu

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.