Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Bioinform., 26 August 2025

Sec. Drug Discovery in Bioinformatics

Volume 5 - 2025 | https://doi.org/10.3389/fbinf.2025.1613985

Identifying novel therapeutic targets for non-alcoholic fatty liver disease using bioinformatics approaches: from drug repositioning to traditional Chinese medicine

  • 1Shanxi Bethune Hospital, Shanxi Academy of Medical Sciences, Third Hospital of Shanxi Medical University, Tongji Shanxi Hospital, Taiyuan, China
  • 2Heilongjiang University of Chinese Medicine, Harbin, Heilongjiang, China
  • 3Department of Pharmacy, Changzhi People’s Hospital, Changzhi, China

Background: Non-alcoholic fatty liver disease (NAFLD) is a prevalent condition with limited effective treatments, necessitating novel therapeutic strategies. Bioinformatics offers a promising approach to identify new targets by analyzing gene expression and drug interactions.

Objective: This study aims to identify novel therapeutic targets for NAFLD through bioinformatics, focusing on drug repositioning and traditional Chinese medicine (TCM) components.

Methods: Three NAFLD-related gene expression datasets (GSE260666, GSE126848, GSE135251) were analyzed to identify differentially expressed genes. Protein-protein interaction networks were constructed using STRING and visualized with Cytoscape. Pathway enrichment analysis was performed, and drug-gene interactions were explored using the DGIdb database. TCM components were screened via the HERB database, with molecular docking conducted to assess binding affinities.

Results: Key hub genes (CXCL2, CDKN1A, TNFRSF12A, HGFAC) were identified, with significant enrichment in cell proliferation and PI3K-Akt signaling pathways. Cyclosporine emerged as a potential repurposed drug, while TCM components (curcumin, resveratrol, berberine) showed strong binding affinities to NAFLD targets.

Conclusion: Cyclosporine and TCM compounds are promising candidates for NAFLD treatment, warranting further experimental validation to confirm their therapeutic potential.

1 Introduction

Nonalcoholic fatty liver disease (NAFLD) is a multifactorial and complex disorder recognized as a significant etiological factor for liver cancer (McVey et al., 2022), although its pathogenesis remains incompletely understood (Wang et al., 2017). NAFLD is prevalent globally, with an estimated prevalence of approximately 30% in the general population and a significantly higher prevalence in males (40% vs. 26%) (Zhang et al., 2021; Riazi et al., 2022; Younossi et al., 2023). Most patients with NAFLD remain asymptomatic; however, individuals progressing to metabolic-associated steatohepatitis may experience symptoms such as fatigue, malaise, and vague discomfort in the right upper quadrant (Younossi et al., 2011; Le et al., 2023). The long-term prognosis of NAFLD is primarily influenced by the presence of atherosclerotic cardiovascular disease and extrahepatic malignancies. In cases of advanced fibrosis, liver-related events, including hepatic decompensation, hepatocellular carcinoma, liver transplantation, and liver-related mortality, increase significantly (Sun et al., 2023; Toader et al., 2025). Current therapeutic strategies for NAFLD primarily encompass lifestyle modifications and pharmacological interventions, including weight reduction, antihypertensive agents, and hypoglycemic medications (Tsamos et al., 2023). However, due to limited efficacy and generally suboptimal clinical outcomes, the development of novel therapeutic agents remains a critical area of research.

Drug repositioning entails identifying new indications for approved drugs (Wang et al., 2025). This approach is distinguished by its low risk and high efficiency, effectively addressing the challenges of high costs and low success rates in new drug development while expediting the market availability of therapeutic agents, particularly for rare diseases (Hurle et al., 2013). Drug repositioning has evolved from traditional random screening for new indications to a more precise, computer-assisted research phase. Nevertheless, computational drug-repositioning efforts are hindered by algorithmic bias arising from incomplete or noisy omics data, heterogeneous disease definitions, and the frequent absence of prospective experimental validation, all of which can limit the translational success of in silico predictions (Loscalzo, 2024).This process involves integrating data from small-molecule ligand and protein receptor databases, along with experimental validation, to evaluate the feasibility of repurposing existing drugs for new indications.

The Drug-Gene Interaction Database (DGIdb) is a specialized and efficient resource that integrates drug-gene interaction relationships (Huang et al., 2024). DGIdb can be accessed through a programmatic interface or its web-based user interface for interactive operations. The HERB database (Fang et al., 2021) analyzes the gene expression profiles of traditional Chinese medicines (TCMs) and their components, correlating these profiles with the world’s largest pharmacogenomics database, the Connectivity Map, to establish systematic links between TCM components and modern drugs. In this study, bioinformatics methods were employed to investigate differentially expressed genes associated with NAFLD, thereby identifying novel therapeutic agents for NAFLD and providing new directions for subsequent drug development.

2 Materials and methods

2.1 Acquisition of gene expression microarray data and screening for differentially expressed genes

Relevant microarray datasets associated with NAFLD were retrieved from the GEO microarray database (Clough et al., 2024) (GEO, https://www.ncbi.nlm.nih.gov/geo/) using the keywords “non-alcoholic fatty liver disease” and “NAFLD”. After comparison and screening, three datasets (GSE260666, GSE126848, and GSE135251) were ultimately selected for the study. Only human-liver RNA-seq studies providing raw counts, ≥10 samples and full NAFLD histology without drug-treatment confounders met the eligibility criteria. To minimise cross-study batch effects, differential expression was calculated inside each dataset with GEO2R; subsequent analyses used the intersection of significant genes across the three datasets, a strategy that bypasses direct data merging and reduces technical bias. Expression data from NAFLD liver tissues and normal liver tissues were extracted from the samples for comparative analysis. Differential expression was determined using a combined criterion of |log2FC| ≥ 1 and an adjusted P value <0.05; genes meeting these thresholds were classified as significantly up- or downregulated accordingly. Finally, the intersection of differentially expressed genes from the three datasets was identified, and the shared genes were designated as the key differentially expressed genes (DEGs) for this study.

2.2 Construction and visualization of protein-protein interaction networks

The differentially expressed genes were entered into the STRING database (Franceschini et al., 2013) (https://string-db.org/) to construct a protein-protein interaction (PPI) network, with an interaction threshold set at 0.400, which corresponds to STRING’s medium-confidence score and is widely used to achieve a balance between capturing biologically meaningful interactions and minimizing false positives in large-scale analyses, to identify proteins with significant interactions. Subsequently, the PPI network was visualized using Cytoscape software (version 3.9.1) (Shannon et al., 2003), and highly interconnected protein interaction modules were identified using the MCODE plugin to create corresponding visual network diagrams. The specific parameters were set as follows: degree = 2, node score cutoff = 0.2, k-core = 2, and maximum depth = 100.

2.3 GO enrichment analysis and KEGG pathway analysis of differentially expressed genes

The Omicshare online analysis tool (http://www.omicshare.com/tools/index.php/) was utilized to perform Gene Ontology (GO) secondary classification enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis on differentially expressed genes. Enrichment significance was assessed with the hypergeometric test, and the resulting P values were corrected for multiple comparisons using the Benjamini–Hochberg false-discovery rate method; pathways with an adjusted P < 0.05 were considered significant. The species selected for this analysis was Homo sapiens, and the results included biological processes, molecular functions, and cellular components from the GO enrichment analysis, as well as related biological pathway enrichment outcomes.

2.4 Analysis of drug-gene interactions

The identified differentially expressed genes associated with NAFLD were entered into the DGIdb database (https://dgidb.org) (Cannon et al., 2024) to identify potential drugs that may target these core genes. Drugs predicted to exhibit known interaction types with the core genes of NAFLD are considered potential candidates for treating this condition. Subsequently, the interactions between the potential drugs and their corresponding target genes were visualized using Cytoscape software.

2.5 Screening and prediction of potential traditional Chinese medicine components for the treatment of NAFLD

The differentially expressed genes were entered into the HERB database (Fang et al., 2021) (http://herb.ac.cn/) to filter out traditional Chinese medicine components that have a statistically significant mapping relationship (P < 0.05) with these genes. A statistical analysis was conducted on these traditional Chinese medicine components to examine the distribution of their meridian affinities and efficacy categories.

2.6 Molecular docking

The three-dimensional (3D) structure of the target compound was downloaded from the PubChem database (https://pubchem.ncbi.nlm.nih.gov/). Target protein structures were then selected from the Protein Data Bank based on the following criteria: a resolution below 2.5 Å and the presence of co-crystallized small-molecule ligands. Subsequently, the selected compounds and target proteins were submitted to the receptor-ligand docking platform DockThor (https://dockthor.lncc.br/v2/) for molecular docking analysis. The top-ranked docking results were visualized using PyMOL 2.4.0 software. The binding stability between the molecules was assessed by analyzing the docking affinity scores.

3 Results

3.1 Selection and screening of gene expression data

The gene expression data selected for this study includes: ① A study published in 2004 by Xiangya Second Hospital of Central South University, which analyzed the differential gene expression in liver tissues of patients with NAFLD compared to a control group. The dataset is identified as GSE260666 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE260666), comprising a total of 16 samples, including 10 NAFLD samples and 6 normal liver tissue samples; ② A transcriptomic analysis across the spectrum of non-alcoholic fatty liver disease published in 2020 by Newcastle University, with the dataset identified as GSE135251 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE135251), comprising a total of 216 samples, including 206 samples at various stages of fibrosis and 10 normal liver tissue samples; ③ A study published in 2019 by Gubler in Denmark that compared the hepatic transcriptomic characteristics of patients with varying degrees of non-alcoholic fatty liver disease to those of healthy individuals with normal weight. The dataset is identified as GSE126848 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE126848), comprising a total of 57 samples, including 14 normal healthy liver tissue samples, 12 normal healthy obese liver tissue samples, 15 samples of simple steatosis, and 16 samples of non-alcoholic steatohepatitis with or without fibrosis.

3.2 Acquisition and selection method of differentially expressed genes

Using P < 0.05 and |log2FC| ≥ 1 as the criteria for selection, a total of 167 genes were identified in the dataset GSE260666, including 100 upregulated genes and 67 downregulated genes. In the dataset GSE135251, a total of 2,166 genes were identified, including 871 upregulated genes and 1,295 downregulated genes. In the dataset GSE126848, a total of 1,241 genes were identified, including 526 upregulated genes and 715 downregulated genes. By constructing a Venn diagram to analyze the intersection of upregulated and downregulated genes from the three datasets, a total of 34 differentially expressed genes were identified, including 16 upregulated and 18 downregulated genes (Figure 1).

Figure 1
Two Venn diagrams display upregulated and downregulated genes across three datasets: GSE260666 (blue), GSE135251 (orange), and GSE126848 (green). The left diagram, for upregulated genes, shows 85 unique to GSE260666, 828 to GSE135251, and 485 to GSE126848, with overlapping genes in various sections. The right diagram, for downregulated genes, shows 604 unique to GSE260666, 1183 to GSE135251, and 45 to GSE126848, with overlaps highlighted.

Figure 1. Venn diagram of differentially expressed genes.

3.3 Modular analysis of the protein-protein interaction network

A total of 34 differentially expressed genes were entered into the STRING database to construct the PPI network (Figure 2A), resulting in 22 interactions. The network was then visualized and analyzed for modularity using Cytoscape software (version 3.9.1) along with the MCODE plugin, identifying the most significant module, which is highlighted in red (Figure 2B). Further analysis revealed that CXCL2, CDKN1A, TNFRSF12A, and HGFAC are the key regulatory hub genes within this module of differentially expressed genes.

Figure 2
Two diagrams labeled A and B show interaction networks. Diagram A displays a complex network of nodes representing proteins with various colors and labels, interconnected by lines indicating interactions. Diagram B presents a simpler network with oval nodes labeled with protein names, connected by lines; nodes are color-coded in shades of red, yellow, and orange to signify different groups or clusters.

Figure 2. Protein-protein interaction network (A). Module analysis and screening of core differentially expressed genes (B).

3.4 Enrichment analysis

This study employed a significance threshold of P < 0.05 for selection, resulting in a total of 231 entries from the GO enrichment analysis. Among these, 161 entries pertain to biological processes, including the negative regulation of DNA biosynthesis, RNA splicing, and modulation of fibroblast proliferation. Forty-four entries relate to molecular functions, encompassing interleukin-33 binding, interleukin-1 receptor activity, and cyclin-dependent protein kinase activating kinase activity. Additionally, 26 entries correspond to cellular components, which include integrin α8-β1 complexes, growth factor complexes, and the extracellular environment. The top 20 results for biological processes, molecular functions, and cellular components are illustrated in bubble charts (Figures 3A–C).

Figure 3
Four-panel scientific visualization of Gene Ontology (GO) enrichment analysis. Panel A displays a dot plot of the top 20 enriched GO terms related to biological processes; dots vary in size representing gene numbers and color indicating q-values. Panel B shows a similar dot plot for molecular function enrichment. Panel C focuses on cellular components with enriched GO terms, with dot variations for the number of genes and q-values. Panel D is a circular chart depicting multiple pathways categorized by metabolism, organismal systems, human diseases, environmental information processing, and cellular processes, with varying gene numbers and selection frequencies.

Figure 3. Bubble diagram of GO biological process enrichment analysis (A). Bubble diagram of functional enrichment analysis of GO molecules (B). Bubble chart of GO Cellular Component enrichment analysis (C). The KEGG pathway enrichment circle plot places a scale for gene counts on its periphery; the first ring presents the enriched categories and pathway IDs, with red indicating human diseases, blue denoting biological systems, and green representing environmental information processing; in the second ring, bar length corresponds to the number of background genes and colour depth maps the P-value, darker shades signifying smaller values; the third ring displays the total number of foreground genes; the fourth ring charts the enrichment metric Rich Factor, where each minor gridline equals 0.1 and larger values denote stronger enrichment (D).

The KEGG enrichment analysis identified a total of 81 pathways, of which 17 exhibited a significance level of P < 0.05, as detailed in Table 1. The results indicate that these pathways are enriched in several tumor-related processes, including cytokine interactions, the PI3K-Akt signaling pathway, and butyrate metabolism, with the melanoma pathway showing the strongest correlation. The results of the KEGG enrichment analysis are presented in a circular diagram (Figure 3D).

Table 1
www.frontiersin.org

Table 1. KEGG pathway enrichment analysis results.

3.5 Analysis of drug-gene interactions

To explore potential therapeutic strategies, this study utilized the DGIdb database to predict drugs that target the aforementioned core genes (Figure 4). The selection criteria included drugs approved by the U.S. Food and Drug Administration, resulting in a list of chemical compounds (Table 2). A total of 22 drugs or compounds targeting the core genes CDKN1A and CXCL2 were identified. However, no drugs were predicted to potentially regulate the HGFAC gene. The U.S. Food and Drug Administration has not yet approved any drugs or compounds targeting TNFRSF12A. To validate its causal role in NAFLD pathogenesis, an initial step could involve pooled CRISPR-Cas9 knockout screens in human hepatocyte-like cells subjected to lipotoxic stress. Subsequent validation would include arrayed siRNA knock-down assays in hepatic organoids derived from NAFLD patients. Key convergent phenotypic readouts such as neutral lipid accumulation, NF-κB signaling activity, and pyroptotic cytokine release could then be assessed to confirm mechanistic involvement and identify potential assay formats suitable for future small-molecule discovery. Additionally, cyclosporine shows potential activity against CDKN1A and CXCL2, making it a particularly promising novel candidate compound for NAFLD treatment.

Figure 4
Network diagram showing connections between two central proteins, CDKN1A and CXCL2, and various drugs including Acetaminophen, Melphalan, and Cyclopsorine. Lines indicate relationships between the proteins and the drugs surrounding them.

Figure 4. Predicted potential drug interactions with core genes.

Table 2
www.frontiersin.org

Table 2. Predicted drug–gene interactions and their DGIdb interaction scores.

3.6 Results of herbal component screening

After inputting the core genes CXCL2 and CDKN1A, the search results from the HERB database identified five components: curcumin, resveratrol, ursolic acid, berberine, and tetrandrine. Screening through the TCMSP database (https://www.tcmsp-e.com) revealed that these components can be derived from various TCMs, including Alpinia officinarum (Gaoliangjiang), Curcuma longa (Jianghuang), Polygonum cuspidatum (Huzhang), Smilax glabra (Tufuling), Coptis chinensis (Huanglian), Stephania tetrandra (Fangji), and Cornus officinalis (Shanzhuyu).

3.7 Molecular docking

This study employs molecular docking techniques to investigate the binding modes and characteristics of active components from traditional Chinese medicine with the core genes CXCL2 and CDKN1A. The three-dimensional structures of the proteins corresponding to the core genes were obtained from the Protein Data Bank database. As indicated in Table 3, the binding energies of all active components from traditional Chinese medicine with the target proteins are below −5 kcal/mol, suggesting a strong binding affinity between these components and the core targets (Wong et al., 2022). The visualization results of the molecular docking (Figures 5A–E) further reveal that these small molecules from traditional Chinese medicine can interact with key amino acid residues of the target proteins through hydrogen bonds, resulting in stable complex conformations. The findings of this study not only validate the potential mechanisms of action of these active components from traditional Chinese medicine at the molecular level concerning targets related to NAFLD but also provide a theoretical basis for their application in NAFLD treatment. Guided by the principles of traditional Chinese medicine, the aforementioned active components can be incorporated into modified prescriptions to support the precise treatment of NAFLD, thereby enhancing clinical efficacy.

Table 3
www.frontiersin.org

Table 3. Binding energy of Ingredients to genes.

Figure 5
Five-panel illustration depicting protein-ligand interactions: A. Shows a protein structure with a ligand binding pocket highlighted, involving lysine-181 and glutamine-131.B. Features a close-up of a ligand interacting with leucine-118.C. Illustrates a ligand interacting with phenylalanine-98 and tryptophan-47.D. Displays a ligand binding to glutamine-312.E. Highlights interactions with asparagine-96, arginine-156, and histidine-152. Each panel includes a zoomed-in view with distance measurements.

Figure 5. Molecular docking analysis of representative active compounds with core NAFLD-related targets. Curcumin was docked with CXCL2, forming hydrogen bonds with lysine at position 181 (2.7 Å), glutamine at position 131 (2.5 Å), and glutamic acid at position 131 (2.4 Å) (A). Resveratrol interacted with CDKN1A, establishing a hydrogen bond with leucine at position 118 (2.7 Å) (B). Ursolic acid bound to CDKN1A, forming hydrogen bonds with tryptophan at position 47 (2.5 Å) and phenylalanine at position 98 (3.1 Å) (C). Berberine was docked with CDKN1A, generating a hydrogen bond with glutamic acid at position 112 (2.4 Å) (D). Tetrandrine interacted with CDKN1A, forming hydrogen bonds with asparagine at position 36 (2.4 Å), histidine at position 152 (2.9 Å), and aspartic acid at position 196 (2.8 Å) (E). These interactions demonstrate the strong binding affinities and structural stability between natural compounds and NAFLD-related targets.

4 Discussion

4.1 Impact of core differentially expressed genes on the liver

This study utilized gene expression data from the GEO database to identify differentially expressed genes associated with NAFLD. The analysis revealed a total of 16 upregulated and 18 downregulated genes, with CXCL2, CDKN1A, TNFRSF12A, and HGFAC exhibiting the highest composite scores.

CXCL2, also known as C-X-C motif chemokine ligand 2, is a chemokine that plays a crucial role in hepatic inflammatory responses. Studies indicate that the expression level of CXCL2 is significantly elevated in the liver tissues of NAFLD patients, primarily synthesized by intrahepatic macrophages (Han et al., 2022). Ke-Qi Han et al. (Han et al., 2015) found that inhibiting the expression of CXCL1 can slow the growth of liver tumors in nude mice and suppress the expression of CXCL2, CXCL3, and interleukin-1β. Cyclin-dependent kinase inhibitor 1A (CDKN1A), a member of the Cip/Kip family, is positively correlated with the inhibition of genes related to cell cycle progression and the induction of senescence-associated genes (d'Adda di Fagagna, 2008; Kreis et al., 2019). Lamas-Paz et al. (2025) discovered through data analysis from various patient cohorts and mouse models that the expression level of CDKN1A is significantly associated with non-alcoholic steatohepatitis (NASH), liver fibrosis, and more severe liver diseases such as cirrhosis and hepatocellular carcinoma, with its overexpression exacerbating lipid metabolic disorders and inflammatory responses. Mice with a knockout of CDKN1A exhibit enhanced tolerance to metabolic liver injury, characterized by attenuated liver damage, reduced cell death, inhibited fibrotic progression, and improved lipid metabolism.

The member 12A of the tumor necrosis factor receptor family (TNFRSF12A) plays a critical role in cholestatic liver disease. Experiments have shown that hepatocyte apoptosis is closely associated with the expression of TNFRSF12A, and inhibiting TNFRSF12A expression can alleviate liver damage (Liao et al., 2023). Hepatocyte growth factor activator (HGFAC) is secreted by the liver, circulating in plasma and activated in injured tissues and tumors (Fukushima et al., 2018). The expression level of HGFAC is negatively correlated with the methylation of its promoter region, suggesting that HGFAC expression may be regulated by DNA methylation. Furthermore, the reduction in HGFAC expression is significantly associated with a shortened survival period in liver cancer patients (Yin et al., 2019).

4.2 Research progress on candidate compounds for NAFLD treatment

This study suggests that Cyclosporine may represent a highly promising novel candidate compound for the treatment of NAFLD. Cyclosporine, as a calcineurin inhibitor, primarily functions to suppress inflammatory responses and alleviate symptoms related to irritation (Zhong et al., 2020). Clinically, this drug is widely utilized to prevent graft-versus-host disease (GVHD) post-transplantation and to treat various autoimmune disorders (Xie et al., 2024). In recent years, studies have indicated that the potential therapeutic applications of Cyclosporine in liver diseases have garnered extensive attention. Studies have reported (Halestrap and Davidson, 1990) that Cyclosporine can inhibit significant Ca2+-induced mitochondrial swelling in the liver and heart, a mechanism likely associated with its regulatory effects on intracellular calcium homeostasis. Calcium ions play a pivotal role in cellular signal transduction and metabolic regulation, and their abnormal elevation is closely associated with the pathogenesis of various liver diseases. Kobayashi et al. (2022) reported that the immunosuppressive agent Cyclosporine A (CsA) confers notable neuroprotection in models of ischemic and traumatic brain injury. This observation supports CsA’s established role in immune modulation and implies its potential hepatoprotective capacity, possibly through the regulation of apoptotic and autophagic pathways. Additionally, there is evidence to suggest that CsA could inhibit the progression of NAFLD by reducing oxidative stress within hepatocytes. Despite its documented hepatotoxicity, CsA emerged as the most promising repositioning candidate because it simultaneously targets the two hub genes CDKN1A and CXCL2 and exhibited one of the lowest docking free energies in our screen, indicating a high likelihood of direct, multi-pathway activity against NAFLD. The drug-vacant status of HGFAC, an extracellular serine protease that zymogen-activates hepatocyte-growth factor, marks this node as an attractive de novo target (Sargsyan et al., 2023). Its solvent-exposed activation loop and trypsin-fold catalytic cleft provide a tractable pocket for either covalent inhibitors or antibody blockade, opening a route for first-in-class therapy. Moreover, decades of pharmacokinetic data and recent advances in liver-targeted nano-formulations provide realistic avenues to administer CsA at micro-doses or with tissue-specific delivery, thereby minimizing systemic exposure and liver injury while preserving therapeutic efficacy. Nonetheless, the known hepatotoxic potential of CsA necessitates careful dose management in future studies. Recent metabolomic studies indicate a clear dose-dependent hepatotoxic effect of CsA: oral micro-doses (<5 mg kg-1 day-1) administered for 4 weeks resulted in mild, reversible biochemical changes in rats, whereas higher doses (≥10 mg kg−1 day−1) significantly elevated circulating bile acids and induced hepatic lipid accumulation (Yen et al., 2024). Notably, hepatic lipid accumulation exacerbates NAFLD pathology, thus conflicting with CsA’s therapeutic objectives (Losurdo et al., 2018). Consequently, future NAFLD studies employing CsA should restrict dosing to sub-immunosuppressive micro-doses, accompanied by rigorous therapeutic drug monitoring and routine assessments of bile acids, ALT, and AST, thereby maximizing metabolic benefits while minimizing hepatotoxic risk.

4.3 Simultaneous treatment of liver and spleen, clearing heat and resolving dampness

The clinical manifestations of NAFLD primarily include nausea, abdominal distension, loss of appetite, and fatigue. These symptoms can be categorized in traditional Chinese medicine as “intercostal pain,” “phlegm syndrome,” and “accumulation.” The pathophysiological mechanisms of this disease are primarily characterized by liver dysfunction and impaired spleen transportation. This condition often exhibits a mixed pattern of deficiency and excess, accompanied by qi and blood deficiency, with the primary lesions located in the liver, while also involving the spleen and kidneys, internal damp-heat accumulation, and obstruction of the liver meridians. Therefore, the therapeutic principles should focus on promoting liver function, strengthening the spleen, and clearing heat while transforming dampness. At the molecular level, the chemokine CXCL2 drives neutrophil infiltration and local inflammatory “heat,” mirroring the TCM concept of damp-heat accumulation in the liver; CDKN1A-induced cell-cycle arrest leads to hepatocellular stagnation, corresponding to liver dysfunction; TNFRSF12A mediates bile-acid-triggered pyroptosis and cholestatic injury, echoing “dampness obstructing bile flow;” whereas HGFAC activates hepatocyte growth factor to facilitate hepatic regeneration, functionally akin to reinforcing the spleen to restore transformation and transportation.Collectively, these gene–symptom correspondences forge a mechanistic bridge between modern molecular pathology and traditional concepts, highlighting why formulas that clear damp-heat, soothe the liver, and fortify the spleen are rational strategies for correcting NAFLD-related gene dysregulation. The two core genes and five components identified in this study align with the therapeutic principles for this disease. Notably, source tracing analysis and molecular docking visualization indicate that Polygonum cuspidatum exhibits the highest correlation with this disease. Polygonum cuspidatum topped the herb–gene network and its signature stilbenes (resveratrol, polydatin) showed the strongest docking to NAFLD hub genes CDKN1A and CXCL2, while its classical function of “clearing damp-heat and invigorating blood” mirrors the disease’s inflammatory-metabolic block. Resveratrol suppresses SREBP-1c mRNA and upregulates PPAR-α, thereby diminishing lipid accumulation and hepatocellular pathology in fatty liver disease (Mo et al., 2018). Furthermore, Resveratrol effectively protects against high-fat-induced liver injury by inhibiting endoplasmic reticulum stress and promoting autophagy (Chen et al., 2020), thus offering new avenues for the treatment of non-alcoholic fatty liver disease. The component–gene associations obtained from the HERB database in this study are merely statistical inferences based on similarities between transcriptomic signatures; such inference cannot establish a definitive regulatory relationship between the five active TCM compounds and the hub genes. Future confirmation will require ligand-binding assays, CRISPR-based perturbations, and validation at the transcriptomic or proteomic level.

5 Conclusion

In summary, this study employed bioinformatics to investigate the differentially expressed genes and associated pathways in NAFLD. The findings suggest that Cyclosporine may serve as a novel compound for the treatment of NAFLD. Furthermore, TCMs such as Alpinia officinarum, Curcuma longa, Polygonum cuspidatum, Smilax glabra, Coptis chinensis, Stephania tetrandra, and Cornus officinalis may represent potential therapeutic agents. Nevertheless, experimental validation is still required, and we anticipate that ongoing advances in multi-omics technologies will accelerate translational progress in this field.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Author contributions

JZ: Writing – original draft. TM: Writing – original draft. WG: Writing – original draft. XL: Writing – review and editing. JX: Writing – review and editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by the Shanxi Provincial Department of Science and Technology Basic Research Program Youth Projects (Grant Nos. 202203021212109 and 202303021222376), the Basic Research Program General Project (Grant No. 202403021221261), and the Shanxi Provincial Administration of Traditional Chinese Medicine Research Project (Grant No. 2022ZYYC061).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Cannon, M., Stevenson, J., Stahl, K., Basu, R., Coffman, A., Kiwala, S., et al. (2024). DGIdb 5.0: rebuilding the drug-gene interaction database for precision medicine and drug discovery platforms. Nucleic Acids Res. 52 (D1), D1227–d1235. doi:10.1093/nar/gkad1040

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, Y., Zhang, H., Chen, Y., Zhang, Y., Shen, M., Jia, P., et al. (2020). Resveratrol alleviates endoplasmic reticulum stress-associated hepatic steatosis and injury in mice challenged with tunicamycin. Mol. Nutr. Food Res. 64 (14), e2000105. doi:10.1002/mnfr.202000105

PubMed Abstract | CrossRef Full Text | Google Scholar

Clough, E., Barrett, T., Wilhite, S. E., Ledoux, P., Evangelista, C., Kim, I. F., et al. (2024). NCBI GEO: archive for gene expression and epigenomics data sets: 23-Year update. Nucleic Acids Res. 52 (D1), D138–d144. doi:10.1093/nar/gkad965

PubMed Abstract | CrossRef Full Text | Google Scholar

d'Adda di Fagagna, F. (2008). Living on a break: cellular senescence as a DNA-Damage response. Nat. Rev. Cancer 8 (7), 512–522. doi:10.1038/nrc2440

PubMed Abstract | CrossRef Full Text | Google Scholar

Fang, S., Dong, L., Liu, L., Guo, J., Zhao, L., Zhang, J., et al. (2021). HERB: a high-throughput experiment- and reference-guided database of traditional Chinese medicine. Nucleic Acids Res. 49 (D1), D1197–d1206. doi:10.1093/nar/gkaa1063

PubMed Abstract | CrossRef Full Text | Google Scholar

Franceschini, A., Szklarczyk, D., Frankild, S., Kuhn, M., Simonovic, M., Roth, A., et al. (2013). STRING v9.1: protein-protein interaction networks, with increased coverage and integration. Nucleic Acids Res. 41 (Database issue), D808–D815. doi:10.1093/nar/gks1094

PubMed Abstract | CrossRef Full Text | Google Scholar

Fukushima, T., Uchiyama, S., Tanaka, H., and Kataoka, H. (2018). Hepatocyte growth factor activator: a proteinase linking tissue injury with repair. Int. J. Mol. Sci. 19 (11), 3435. doi:10.3390/ijms19113435

PubMed Abstract | CrossRef Full Text | Google Scholar

Halestrap, A. P., and Davidson, A. M. (1990). Inhibition of Ca2(+)-induced large-amplitude swelling of liver and heart mitochondria by cyclosporin is probably caused by the inhibitor binding to mitochondrial-matrix peptidyl-prolyl cis-trans isomerase and preventing it interacting with the adenine nucleotide translocase. Biochem. J. 268 (1), 153–160. doi:10.1042/bj2680153

PubMed Abstract | CrossRef Full Text | Google Scholar

Han, K. Q., He, X. Q., Ma, M. Y., Guo, X. D., Zhang, X. M., Chen, J., et al. (2015). Inflammatory microenvironment and expression of chemokines in hepatocellular carcinoma. World J. Gastroenterol. 21 (16), 4864–4874. doi:10.3748/wjg.v21.i16.4864

PubMed Abstract | CrossRef Full Text | Google Scholar

Han, Y. H., Choi, H., Kim, H. J., and Lee, M. O. (2022). Chemotactic cytokines secreted from Kupffer cells contribute to the sex-dependent susceptibility to non-alcoholic fatty liver diseases in mice. Life Sci. 306, 120846. doi:10.1016/j.lfs.2022.120846

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, Y., Dong, D., Zhang, W., Wang, R., Lin, Y. C., Zuo, H., et al. (2024). DrugRepoBank: a comprehensive database and discovery platform for accelerating drug repositioning. Database (Oxford) 2024, baae051. doi:10.1093/database/baae051

PubMed Abstract | CrossRef Full Text | Google Scholar

Hurle, M. R., Yang, L., Xie, Q., Rajpal, D. K., Sanseau, P., and Agarwal, P. (2013). Computational drug repositioning: from data to therapeutics. Clin. Pharmacol. Ther. 93 (4), 335–341. doi:10.1038/clpt.2013.1

PubMed Abstract | CrossRef Full Text | Google Scholar

Kobayashi, T., Uchino, H., Elmér, E., Ogihara, Y., Fujita, H., Sekine, S., et al. (2022). Disease outcome and brain metabolomics of Cyclophilin-D knockout mice in sepsis. Int. J. Mol. Sci. 23 (2), 961. doi:10.3390/ijms23020961

PubMed Abstract | CrossRef Full Text | Google Scholar

Kreis, N. N., Louwen, F., and Yuan, J. (2019). The multifaceted p21 (Cip1/Waf1/CDKN1A) in cell differentiation, migration and cancer therapy. Cancers (Basel) 11 (9), 1220. doi:10.3390/cancers11091220

PubMed Abstract | CrossRef Full Text | Google Scholar

Lamas-Paz, A., Hionides-Gutiérrez, A., Guo, F., Jorquera, G., Morán-Blanco, L., Benedé-Ubieto, R., et al. (2025). Loss of Cdkn1a protects against MASLD alone or with alcohol intake by preserving lipid homeostasis. JHEP Rep. 7 (1), 101230. doi:10.1016/j.jhepr.2024.101230

PubMed Abstract | CrossRef Full Text | Google Scholar

Le, M. H., Le, D. M., Baez, T. C., Wu, Y., Ito, T., Lee, E. Y., et al. (2023). Global incidence of non-alcoholic fatty liver disease: a systematic review and meta-analysis of 63 studies and 1,201,807 persons. J. Hepatol. 79 (2), 287–295. doi:10.1016/j.jhep.2023.03.040

PubMed Abstract | CrossRef Full Text | Google Scholar

Liao, M., Liao, J., Qu, J., Shi, P., Cheng, Y., Pan, Q., et al. (2023). Hepatic TNFRSF12A promotes bile acid-induced hepatocyte pyroptosis through NFκB/Caspase-1/GSDMD signaling in cholestasis. Cell Death Discov. 9 (1), 26. doi:10.1038/s41420-023-01326-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Loscalzo, J. (2024). Multi-Omics and single-cell omics: new tools in drug target discovery. Arterioscler. Thromb. Vasc. Biol. 44 (4), 759–762. doi:10.1161/atvbaha.124.320686

PubMed Abstract | CrossRef Full Text | Google Scholar

Losurdo, G., Castellaneta, A., Rendina, M., Carparelli, S., Leandro, G., and Di Leo, A. (2018). Systematic review with meta-analysis: de novo non-alcoholic fatty liver disease in liver-transplanted patients. Aliment. Pharmacol. Ther. 47 (6), 704–714. doi:10.1111/apt.14521

PubMed Abstract | CrossRef Full Text | Google Scholar

McVey, J. C., Green, B. L., Ruf, B., McCallen, J. D., Wabitsch, S., Subramanyam, V., et al. (2022). NAFLD indirectly impairs antigen-specific CD8(+) T cell immunity against liver cancer in mice. iScience 25 (2), 103847. doi:10.1016/j.isci.2022.103847

PubMed Abstract | CrossRef Full Text | Google Scholar

Mo, J. F., Wu, J. Y., Zheng, L., Yu, Y. W., Zhang, T. X., Guo, L., et al. (2018). Therapeutic efficacy of polydatin for nonalcoholic fatty liver disease via regulating inflammatory response in obese mice. RSC Adv. 8 (54), 31194–31200. doi:10.1039/c8ra05915b

PubMed Abstract | CrossRef Full Text | Google Scholar

Riazi, K., Azhari, H., Charette, J. H., Underwood, F. E., King, J. A., Afshar, E. E., et al. (2022). The prevalence and incidence of NAFLD worldwide: a systematic review and meta-analysis. Lancet Gastroenterol. Hepatol. 7 (9), 851–861. doi:10.1016/s2468-1253(22)00165-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Sargsyan, A., Doridot, L., Hannou, S. A., Tong, W., Srinivasan, H., Ivison, R., et al. (2023). HGFAC is a ChREBP-regulated hepatokine that enhances glucose and lipid homeostasis. JCI Insight 8 (1), e153740. doi:10.1172/jci.insight.153740

PubMed Abstract | CrossRef Full Text | Google Scholar

Shannon, P., Markiel, A., Ozier, O., Baliga, N. S., Wang, J. T., Ramage, D., et al. (2003). Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13 (11), 2498–2504. doi:10.1101/gr.1239303

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, D. Q., Targher, G., Byrne, C. D., Wheeler, D. C., Wong, V. W., Fan, J. G., et al. (2023). An international Delphi consensus statement on metabolic dysfunction-associated fatty liver disease and risk of chronic kidney disease. Hepatobiliary Surg. Nutr. 12 (3), 386–403. doi:10.21037/hbsn-22-421

PubMed Abstract | CrossRef Full Text | Google Scholar

Toader, M., Gheorghe, L., Chirica, C., Ghicu, I. A., Chirica, S. I., Mazga, A. I., et al. (2025). Cardiovascular profile and cardiovascular imaging after bariatric surgery: a narrative review. Med. Kaunas. 61 (1), 73. doi:10.3390/medicina61010073

PubMed Abstract | CrossRef Full Text | Google Scholar

Tsamos, G., Vasdeki, D., Koufakis, T., Michou, V., Makedou, K., and Tzimagiorgis, G. (2023). Therapeutic potentials of reducing liver fat in non-alcoholic fatty liver disease: close Association with type 2 diabetes. Metabolites 13 (4), 517. doi:10.3390/metabo13040517

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Y., Du, J., Niu, X., Fu, N., Wang, R., Zhang, Y., et al. (2017). MiR-130a-3p attenuates activation and induces apoptosis of hepatic stellate cells in nonalcoholic fibrosing steatohepatitis by directly targeting TGFBR1 and TGFBR2. Cell Death Dis. 8 (5), e2792. doi:10.1038/cddis.2017.10

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, J., Meng, T., Si, N., Li, H., Yan, Y., and Li, X. (2025). Identifying novel therapeutic opportunities for dilated cardiomyopathy: a bioinformatics approach to drug repositioning and herbal medicine prediction. Curr. Pharm. Biotechnol. 26. doi:10.2174/0113892010335576241202061139

PubMed Abstract | CrossRef Full Text | Google Scholar

Wong, F., Krishnan, A., Zheng, E. J., Stärk, H., Manson, A. L., Earl, A. M., et al. (2022). Benchmarking AlphaFold-enabled molecular docking predictions for antibiotic discovery. Mol. Syst. Biol. 18 (9), e11081. doi:10.15252/msb.202211081

PubMed Abstract | CrossRef Full Text | Google Scholar

Xie, L., Xu, J., Xu, H., Zhang, B., Lin, W., and Yang, T. (2024). Multiple autoimmune disorders refractory to glucocorticoids after allogeneic hematopoietic stem cell transplantation: a case report and review of the literature. Front. Immunol. 15, 1366101. doi:10.3389/fimmu.2024.1366101

PubMed Abstract | CrossRef Full Text | Google Scholar

Yen, N. T. H., Tien, N. T. N., Anh, N. T. V., Le, Q. V., Eunsu, C., Kim, H. S., et al. (2024). Cyclosporine A-induced systemic metabolic perturbations in rats: a comprehensive metabolome analysis. Toxicol. Lett. 395, 50–59. doi:10.1016/j.toxlet.2024.03.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Yin, L., Mu, Y., Lin, Y., and Xia, Q. (2019). HGFAC expression decreased in liver cancer and its low expression correlated with DNA hypermethylation and poor prognosis. J. Cell Biochem. 120 (6), 9692–9699. doi:10.1002/jcb.28247

PubMed Abstract | CrossRef Full Text | Google Scholar

Younossi, Z. M., Stepanova, M., Afendy, M., Fang, Y., Younossi, Y., Mir, H., et al. (2011). Changes in the prevalence of the most common causes of chronic liver diseases in the United States from 1988 to 2008. Clin. Gastroenterol. Hepatol. 9 (6), 524–530.e1. e521; quiz e560. doi:10.1016/j.cgh.2011.03.020

PubMed Abstract | CrossRef Full Text | Google Scholar

Younossi, Z. M., Golabi, P., Paik, J. M., Henry, A., Van Dongen, C., and Henry, L. (2023). The global epidemiology of nonalcoholic fatty liver disease (NAFLD) and nonalcoholic steatohepatitis (NASH): a systematic review. Hepatology 77 (4), 1335–1347. doi:10.1097/hep.0000000000000004

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, X., Heredia, N. I., Balakrishnan, M., and Thrift, A. P. (2021). Prevalence and factors associated with NAFLD detected by vibration controlled transient elastography among US adults: results from NHANES 2017-2018. PLoS One 16 (6), e0252164. doi:10.1371/journal.pone.0252164

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhong, E. F., Chang, A., Stucky, A., Chen, X., Mundluru, T., Khalifeh, M., et al. (2020). Genomic analysis of oral Lichen Planus and related oral microbiome pathogens. Pathogens 9 (11), 952. doi:10.3390/pathogens9110952

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: non-alcoholic fatty liver disease, bioinformatics, drug repositioning, traditional Chinese medicine, cyclosporine

Citation: Zhang J, Meng T, Gao W, Li X and Xu J (2025) Identifying novel therapeutic targets for non-alcoholic fatty liver disease using bioinformatics approaches: from drug repositioning to traditional Chinese medicine. Front. Bioinform. 5:1613985. doi: 10.3389/fbinf.2025.1613985

Received: 18 April 2025; Accepted: 07 August 2025;
Published: 26 August 2025.

Edited by:

Dharmendra Kumar Yadav, Gachon University, Republic of Korea

Reviewed by:

Anchala Kumari, Jawaharlal Nehru University, India
Bhavika Mam, Independent Researcher, Palo Alto, MA, United States

Copyright © 2025 Zhang, Meng, Gao, Li and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Juan Xu, eHVqdWFueHBAMTYzLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.