Multi-omics integration analysis based on plasma circulating proteins reveals potential therapeutic targets for ulcerative colitis

Zhou, Jihai; Zhao, Wenwen; Lin, Yiping; Yang, Bo; Sun, Dongjie; Liu, Zhu

doi:10.3389/fmolb.2025.1686282

ORIGINAL RESEARCH article

Front. Mol. Biosci., 20 November 2025

Sec. Molecular Diagnostics and Therapeutics

Volume 12 - 2025 | https://doi.org/10.3389/fmolb.2025.1686282

Multi-omics integration analysis based on plasma circulating proteins reveals potential therapeutic targets for ulcerative colitis

Jihai Zhou¹^†

Wenwen Zhao¹^†

Yiping Lin²^†

Bo Yang³

Dongjie Sun⁴*^‡

Zhu Liu¹*^‡

¹Department of Gastroenterology, Shandong Provincial Maternal and Child Health Care Hospital Affiliated to Qingdao University, Jinan, China
²Department of Health Medicine, 900th Hospital of PLA Joint Logistic Support Force, Fuzong Clinical Medical College of Fujian Medical University, Fuzhou, China
³Department of Gastroenterology and Hepatology, Guizhou Aerospace Hospital, Zunyi, China
⁴Department of Digestive Diseases, 900th Hospital of PLA Joint Logistic Support Force, Fuzong Clinical Medical College of Fujian Medical University, Fuzhou, China

Background: Ulcerative colitis (UC) is a complex inflammatory bowel disease with unclear etiology and challenging molecular mechanisms. This study aims to identify potential diagnostic and therapeutic biomarkers for UC through multi-omics integrative analysis, providing new insights into its precise diagnosis and treatment.

Methods: Data samples from the Gene Expression Omnibus database and protein quantitative trait loci data from genome-wide association studies were integrated to identify overlapping genes. Three machine learning (ML) algorithms were employed to screen core hub genes from these overlapping genes, followed by the construction and external validation of a diagnostic model. Single-cell sequencing data were used to explore the expression profiles of core hub genes across different cell types. Additionally, immune infiltration, functional enrichment, and regulatory networks were analyzed. Finally, the expression trends of the core hub genes were validated in a dextran sulfate sodium (DSS)-induced UC mouse model using RT-qPCR.

Results: Mendelian randomization (MR) analysis identified 168 plasma proteins causally associated with UC. Differential expression analysis revealed 1,011 DEGs, and the intersection of DEGs and MR results yielded 12 overlapping genes. Four core hub genes, including EIF5A2, IDO1, CDH5, and MYL5, were identified using three ML algorithms. The nomogram model constructed with these four genes demonstrated strong predictive performance, which was further confirmed in an external validation dataset. GSEA analysis revealed that these genes are involved in various biological processes, including immune response, signal transduction, metabolism, and cellular stress. CIBERSORT immune infiltration analysis showed significant differences in immune cell infiltration between UC and normal tissues. Furthermore, a comprehensive mRNA-miRNA-lncRNA regulatory network was constructed, identifying key molecular interactions potentially driving UC pathogenesis. Single-cell RNA sequencing analysis revealed that CDH5 is primarily expressed in endothelial cells, EIF5A2 is enriched in stem cells/T cells, IDO1 is expressed in monocytes, and MYL5 is found in epithelial and endothelial cells. Finally, RT-qPCR validation in the DSS-induced UC mouse model confirmed that the expression changes of core hub genes were consistent with bioinformatics predictions.

Conclusion: This study systematically identified core diagnostic genes and their regulatory networks for UC through multi-omics integration.

1 Introduction

Ulcerative Colitis (UC) is a chronic inflammatory bowel disease characterized by recurrent inflammation of the colonic and rectal mucosa, leading to symptoms such as abdominal pain, diarrhea, and rectal bleeding (Ungaro et al., 2017). Globally, the incidence and prevalence of UC have been on the rise in many newly industrialized countries, with reported prevalence rates in Western countries estimated to be as high as 505 cases per 100,000 population (Ng et al., 2017). The pathogenesis of UC involves a complex interplay of genetic susceptibility, environmental factors, immune dysregulation, and gut microbiota dysbiosis, all of which collectively contribute to epithelial barrier dysfunction and chronic inflammation (Santana et al., 2022). Despite advances in understanding these mechanisms, the identification of precise therapeutic targets remains challenging due to the disease’s inherent heterogeneity and the limitations of current therapies (e.g., immunosuppressants and biologics), which often fail to achieve long-term remission in a substantial proportion of patients (Cholapranee et al., 2017; Dai et al., 2023).

Despite rapid advancements in multi-omics technologies, existing studies primarily focus on single molecules or single pathways, failing to systematically integrate interactions at the multi-omics level (Zhu et al., 2025). The integration of plasma quantitative trait loci (pQTL) analysis with genome-wide association studies (GWAS) has become a powerful tool for elucidating causal relationships between genetic variants and protein expression levels, with the potential to identify novel biomarkers and drug targets (Suhre et al., 2017). Recent genomic and proteomic studies have highlighted the role of circulating proteins in UC pathogenesis, providing insights into systemic inflammatory processes (Di Narzo et al., 2017; Chen et al., 2025). While GWAS has identified genetic susceptibility loci associated with UC pathogenesis, there remains a lack of systematic evidence regarding how these genetic predispositions translate into altered protein functions that directly lead to inflammation and tissue damage. Mendelian randomization (MR) approaches have further strengthened these findings by mitigating confounding factors and reverse causality, particularly when establishing associations between specific proteins and diseases (Verbanck et al., 2018). However, a truly comprehensive multi-omics investigation that systematically integrates genetic, transcriptomic, and single-cell RNA sequencing data remains crucial for elucidating the complex mechanisms underlying UC onset and progression and for pinpointing therapeutic targets.

Significant advancements in microarray technology and bioinformatics have profoundly propelled the development of the biomedical field. Leveraging their robust classification capabilities, the integration of machine learning (ML) with microarray analysis has emerged as a powerful tool for identifying novel diagnostic biomarkers (Bhandari et al., 2022). By learning intricate data patterns, ML techniques such as Random Forest (RF) and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) have been successfully employed in UC research to select and validate potential serum biomarkers, demonstrating enhanced efficiency and accuracy (Ge et al., 2025; Zheng et al., 2025). However, existing studies often focus on single-omics data, lacking the comprehensive integration of diverse omics datasets, thereby restricting the scope and overall accuracy of the identified biomarkers (Zheng et al., 2025). Consequently, developing multi-omics integration approaches to enhance biomarker diagnostic and prognostic capabilities represents a critical direction in current biomedical research.

To address this gap, our study systematically integrates publicly available datasets, including Gene Expression Omnibus (GEO) microarray data, pQTL data from large cohorts, GWAS summary statistics for UC, and single-cell RNA sequencing data. Furthermore, a multifaceted methodological approach is employed, encompassing MR, various ML algorithms, nomogram model construction, immune infiltration analysis, and experimental validation. The overarching aim is to identify key genes with causal roles in the pathogenesis of UC, offering potential strategies for its precise treatment.

2 Materials and methods

2.1 Data sources

The GEO database is a widely used functional genomics repository for storing and retrieving high-throughput gene expression data, chips, and microarray information. Microarray data of three datasets were acquired from the GEO database (http://www.ncbi.nlm.nih.gov/geo/), including GSE87466, GSE92415, and GSE75214. GSE87466 (87 UC and 21 normal samples) and GSE92415 (162 UC and 21 normal samples) were used as the training sets. These two datasets were combined and batch-corrected using the “sva” package in R (Leek et al., 2012). GSE75214 (97 UC and 11 normal samples) was used as a validation set. Additionally, we obtained the GSE214695 dataset from the GEO database for single-cell sequencing data analysis, including six UC samples.

The detailed description of the datasets used in the MR analysis is provided in Supplementary Table S1. Specifically, a large-scale pQTL study involving 35,559 Icelandic individuals provided genetic association data for 4,907 circulating proteins (Ferkingstad et al., 2021). Proteomic analysis was conducted using a modified, multiplexed aptamer-based binding assay (SOMAscan version 4). Protein level variations were normalized for age- and sex-specific effects, followed by rank-inverse normal transformation to standardize the residuals. These normalized values were then used as phenotypes in GWAS conducted using BOLT-LMM linear mixed models. Additionally, summary-level GWAS data for UC were obtained from the IEU Open GWAS Project (https://gwas.mrcieu.ac.uk/datasets/) (Wu et al., 2024), with UC diagnosis based on the International Classification of Diseases (ICD) code (ICD-10: K51). However, the IEU Open GWAS database only provides summary-level statistical data and does not offer detailed information on individual samples. Since all the genetic association summary data used in this analysis are publicly available, no additional ethical approval is required for this study.

2.2 Exposure data

The genomic positions of pQTLs are generally located near the corresponding genes of specific proteins. Specifically, pQTLs that are close to homologous genes are termed “cis-pQTLs,” based on the assumption that these pQTL influence protein levels through the corresponding gene (Liu et al., 2025). To minimize bias from horizontal pleiotropy, we used only cis-pQTLs as instrumental variables (Peng et al., 2025). Based on the study by Ferkingstad et al. (Ferkingstad et al., 2021), we obtained genetic summary statistics related to plasma proteins (https://www.decode.com/summarydata/). These pQTL data were required to meet the following criteria: 1. show genome-wide significant associations (P < 5 × 10⁻⁸); 2. display independent associations (linkage disequilibrium r² < 0.001); 3. be cis-pQTLs; and .4 have an F-test value greater than 10. Based on these criteria, we identified 241,653 SNPs associated with 4,288 proteins.

2.3 Outcome data

We obtained the UC cohort from the IEU Open GWAS Project, which originated from the Neale lab study and is based on UK Biobank data (OpenGWAS ID: ukb-a-553) (Bycroft et al., 2018; Papier et al., 2024). The sample size consists of 1,579 UC cases and 335,620 controls, with 10,894,596 single nucleotide polymorphisms (SNPs), and all participants are of European descent. The UK Biobank, established in 2006 in the United Kingdom, is internationally renowned for its extensive and openly accessible repository of biological and clinical information, making it a valuable resource for large-scale genetic and epidemiological research (Sudlow et al., 2015).

2.4 MR analysis

In this study, we conducted a proteome-wide MR analysis using the “TwoSampleMR” R package. The harmonize data function was employed to align effect alleles and effect sizes across datasets. To perform MR analysis, five different methods were utilized: MR Egger (Burgess and Thompson, 2017), weighted median (Bowden et al., 2016), inverse variance weighted (IVW) (Bowden et al., 2017), simple mode (Ai and Yang, 2023), and weighted mode (Hartwig et al., 2017). Among these, the IVW approach was primarily used to evaluate the causal relationship between exposure factors and outcomes. A p-value <0.05 from the IVW method was considered indicative of a significant causal association. The odds ratio (OR) was calculated to interpret the results, where OR > 1 suggested the exposure as a risk factor, while OR < 1 indicated a protective factor. To ensure the robustness of the findings, sensitivity analyses were performed, including tests for heterogeneity, horizontal pleiotropy, and leave-one-out analysis. The heterogeneity test assessed the consistency among different instrumental variables, with a p-value >0.05 indicating no substantial heterogeneity. The horizontal pleiotropy test examined whether instrumental variables influenced outcomes through pathways other than the exposure factor; a p-value >0.05 suggested the absence of horizontal pleiotropy, thus minimizing the impact of confounding factors. Finally, the leave-one-out analysis involved sequentially excluding individual instrumental variables to evaluate their influence on the overall results. Stability of the findings under this analysis confirmed the reliability and robustness of the conclusions.

2.5 Functional enrichment analysis

To elucidate the biological functions and signaling pathways associated with these differential genes, we utilized the R package “clusterProfiler” (Yu et al., 2012) to perform Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses. GO analysis, drawing from the GO database, categorizes gene functions into three principal domains: biological processes (BP), cellular components (CC), and molecular functions (MF). KEGG pathway analysis was employed to annotate the pathways of identified or differentially expressed proteins, thereby exploring the key metabolic and signal transduction pathways in which these proteins or genes are involved.

2.6 Analysis of differentially expressed genes (DEGs)

To identify DEGs associated with UC, the “limma” R package (Ritchie et al., 2015) was employed, using |log2 fold change (FC)| > 0.9 and adj p-value <0.05 as the thresholds (Supplementary Table S2). Visualization of DEGs was performed using the “ggplot2” (Maag, 2018) and “pheatmap” R packages, which generated volcano plots and heatmaps, respectively. The “VennDiagram” package in R was used to plot the intersection of transcriptome differential genes and MR differential genes, resulting in shared significant shared DEGs (S-DEGs).

2.7 Identification of hub S-DEGs via ML algorithms

Three ML algorithms, including Least Absolute Shrinkage and Selection Operator (LASSO), SVM-RFE, and RF, were employed to screen hub S-DEGs. LASSO analysis was implemented with the R package “glmnet” (Duan et al., 2016) with 10 cross-validations to screen the optimal tuning parameter (λ). The SVM-RFE algorithm was utilized to select the point with the smallest cross-validation error to determine the variable through packages “e1071” and “caret” (Lin et al., 2017). The package “randomForest” was employed to develop a random forest model, and the top 5 S-DEGs with MeanDecreaseGini scores were chosen (Ishwaran and Kogalur, 2010). Finally, the genes obtained by intersecting the three ML algorithms with each other were regarded as the signature genes.

2.8 Construct a diagnostic nomogram and ROC analysis

The nomogram model was constructed using the “rms” package of R. Individual “Points” indicates the score of each candidate gene, and “Total Points” indicates the summation of all the scores of genes above (Balachandran et al., 2015). Furthermore, a calibration curve was applied to assess the predictive power of the nomogram model (Yang et al., 2020). ROC curves were visualized to present the area under the curve (AUC) for evaluating their diagnostic value. In general, an AUC of 0.5 indicates no discrimination; values between 0.7 and 0.8 are considered acceptable, 0.8–0.9 excellent, and above 0.9 outstanding.

2.9 Diagnostic value of hub candidate genes

The value of four hub candidate genes was evaluated via ROC curves. The external dataset GSE75214 was used to validate the ability of the prediction model to distinguish between UC and normal samples.

2.10 Gene set enrichment analysis (GSEA) of the key genes

The single-gene GSEA was performed using the R “GSEA” package to further investigate the potential regulatory pathways of key genes in UC. A P-value of < 0.05 was set as the threshold for significant enrichment.

2.11 Immune cell infiltration and correlation analysis

The CIBERSORT algorithm was used to assess the relative abundance of 22 lymphocyte subtypes in each normal and UC sample (Newman et al., 2019). Subsequently, the relationship between hub candidate genes in UC and immune cells was evaluated by Spearman correlation analysis.

2.12 GeneMANIA network and mRNA-miRNA-lncRNA network construction

For the functional association analysis, we examined key genes by creating protein-protein interaction (PPI) networks using the GeneMANIA platform (http://genemania.org, accessed on 15 June 2025). This online tool helps identify gene clusters that are functionally related by integrating various types of interactions, including protein–protein and protein–DNA interactions, co-expression patterns, correlations within metabolic pathways, and characteristics of spatial colocalization (Warde-Farley et al., 2010). Based on the elucidation of the functional interaction networks of key genes, we further explored their regulatory relationships at the post-transcriptional level. Initially, online tools for predicting RNA interactions, including miRanda, miRDB, and TargetScan, were employed to explore mRNA-miRNA interactions. Only interactions consistently identified by all tools were considered valid. Next, the spongescan database was used to identify miRNA-long non-coding RNA (lncRNA) interactions and validate if they were consistently recognized across databases. Based on these data, an integrated mRNA-miRNA-lncRNA regulatory network was constructed and visualized using Cytoscape 3.10.3.

2.13 Single-cell RNA-seq data analysis and subcellular localization

Quality control was performed before analyzing single-cell data. We preserved high-quality cells that had fewer than 20% mitochondrial genes and expressed more than 200 genes. Given the potential existence of diploid cells, cells with genes < 200 or > 5,800 were filtered out. Thereafter, the integration workflow was conducted by the Seurat pipeline (Butler et al., 2018). The remaining cells were further scaled and normalized using a linear regression model with the “Log-normalization” method, and the top 2,500 variable genes were detected by the “FindVariablFeatures” function. Subsequently, the dimensionality of the single-cell data was diminished through principal component analysis. To address batch effects across samples, the Harmony package (v1.2.0) was utilized, selecting the top 15 principal components for dimensionality reduction and clustering. Following this, UMAP was employed to project the results onto a two-dimensional plot, facilitating cell type identification. The cell types to which different clusters belonged were identified using the R package “singleR”. After, single-cell data analysis was performed to study the distribution of key genes across different cell types in UC. Finally, the subcellular localization of the biomarker proteins was also predicted based on the GENECARDS database (https://www.genecards.org) to provide a basis for understanding the changes of biomarkers in the disease process.

2.14 Animal experiments

The UC animal model was induced in 6–8-week-old male C57BL/6 mice under specific pathogen-free conditions following established protocols (Eichele and Kharbanda, 2017). After a 7-day acclimatization period with ad libitum access to food and water, 12 mice were randomly allocated to control or dextran sulfate sodium (DSS) groups (n = 6 each). The animal protocols were approved by the Institutional Animal Care and Use Committee of Yi Shengyuan Gene Technology (Tianjin) Co., Ltd. (protocol number YSY-DWLL-2024713). The DSS group received 3% DSS in drinking water for 7 days, while controls drank plain water. During the study period, mice were weighed daily and assessed for fecal occult blood and stool consistency. The Disease Activity Index (DAI) was calculated as the sum of these three scores to quantify the severity of intestinal inflammation, according to the predefined scoring criteria (Supplementary Table S3; Kim et al., 2012). On day 8, euthanasia was performed on the mice via cervical dislocation, and colon tissues were collected for RT-PCR analysis.

2.15 Tissue RNA Extraction and RT-qPCR

Total RNA was extracted and purified from colon tissues using TRIzol reagent (ThermoFisher Scientific, No. 15596018CN). The concentration and purity of RNA were assessed using a NanoDrop spectrophotometer (ThermoFisher Scientific). Complementary DNA was synthesized via reverse transcription using 5x Evo M-MLV RT Reaction Mix and gDNA Clean Reaction Mix (Accurate Biology, No. AG11728). The RNA input amount for each sample was 1 µg. Primers were sourced from PrimerBank (https://pga.mgh.harvard.edu/primerbank/). Quantitative real-time PCR (RT-qPCR) was performed on a Bio-Rad T100 thermal cycler, with SYBR Green Master Mix (Aibotec) used for amplification. β-actin was used as the housekeeping gene for normalization (Supplementary Table S4). Data analysis was conducted using the 2,^(-ΔΔCt) method, and data visualization was performed using GraphPad Prism software.

2.16 Statistical analyses

Unless otherwise specified, all statistical analyses were performed using R software (version 4.2.3). A two-tailed P-value <0.05 was considered statistically significant. The study’s workflow is illustrated in Figure 1.

Figure 1

Flowchart detailing a multi-step research process for genetic analysis. It includes data sources like GSE87466 and GSE92415, SNP extraction, and GWAS data. Steps involve data standardization, differential expression analysis, MR analysis, and identification of genes and proteins. Key steps include MR analysis, IVW, and GO and KEGG enrichment analysis. The process results in identifying common genes and hub genes, utilizing machine learning, and various validation steps. Final analyses include nomogram, external validation, and immune cell infiltration, with results shown in expression across cell types.

Figure 1. The study flowchart.

3 Results

3.1 MR analysis

We performed MR analysis to evaluate the causal association between plasma circulating proteins and UC. This analysis identified 168 plasma proteins with significant causal associations with UC, of which 67 were recognized as protective factors and 101 as risk factors (Supplementary Table S5). Figure 2A, presented as a forest plot, illustrates the causal effect estimates and their 95% confidence intervals for 20 of these associated plasma proteins, intuitively demonstrating their potential direction and strength of influence on UC risk. The upregulated and downregulated status of these 20 plasma proteins has been provided in Supplementary Table S6.

Figure 2

A panel of five plots. A: A table listing exposures, methods, p-values, and odds ratios for various factors with confidence intervals shown graphically. B: A scatter plot illustrating SNP effects on IDO1 with regression lines for different MR tests. C: A forest plot showing MR effect sizes for

Figure 2. Statistical analysis of exposure effects conducted through Mendelian randomization approaches. (A) MR analysis results for different exposure variables, including gene loci, analysis methods, P-values, and odds ratios (randomly displayed 20 genes). (B) Shows the effect estimates of SNPs on outcome variables under various MR test methods. (C) Presents MR results of the associations between multiple SNP loci and ulcerative colitis. (D) Compares the effect estimates of different MR methods. (E) The result of the MR leave-one-out sensitivity analysis for the association between genes and outcome variables.

To further elucidate the details and robustness of the MR analysis, we selected IDO1 as an example to comprehensively present its MR study results. The scatter plot in Figure 2B visually revealed a positive causal association between elevated IDO1 levels and increased UC risk, with consistent trends observed across fitted lines from various MR methods. Figure 2C, in the form of a forest plot, displays the estimated effect of each SNP of IDO1 on the UC outcome, indicating that the majority of SNPs showed consistent effect directions with the overall result, thereby supporting the validity of the instrumental variables. Furthermore, the funnel plot in Figure 2D exhibited a largely symmetrical shape, suggesting that the MR analysis in this study was not significantly affected by horizontal pleiotropy or publication bias. The leave-one-out sensitivity analysis (Figure 2E) further confirmed that the causal effect estimate for IDO1 on UC was highly robust and not decisively influenced by any single SNP. Detailed results of heterogeneity tests and horizontal pleiotropy tests for plasma proteins are presented in Supplementary Tables S7, S8, respectively. These comprehensive analyses robustly confirmed that all 168 plasma proteins identified by the MR approach have a significant causal relationship with UC.

3.2 Functional enrichment analysis

To investigate the biological functions and signaling pathways of the 168 differential genes, we conducted GO and KEGG enrichment analyses. We identified 803 significantly enriched GO terms, comprising 723 BP, 26 CC, and 54 MF categories. Furthermore, 26 KEGG signaling pathways were also significantly enriched. KEGG pathway enrichment analysis revealed several significantly enriched pathways, primarily associated with immune and inflammatory responses, cell signaling, and disease development (Figure 3A). These pathways included cytokine-cytokine receptor interaction, fluid shear stress and atherosclerosis, Th17 cell differentiation, and chemical carcinogenesis-receptor activation. Notably, several of these enriched pathways, such as cytokine-cytokine receptor interaction and Th17 cell differentiation, are closely associated with the immune dysregulation and inflammatory cascades characteristic of UC. For BP, significantly enriched terms included regulation of cell-cell adhesion and regulation of T cell proliferation, indicating the involvement of these genes in intercellular communication and the regulation of immune cells. In terms of CC, we observed significant enrichment in terms such as the external side of the plasma membrane and the collagen-containing extracellular matrix, suggesting the contribution of these genes to extracellular structural integrity and cell surface interactions. MF analysis revealed enrichment in terms like cytokine receptor binding and immune receptor activity, emphasizing their roles in mediating immune signaling and receptor-ligand interactions (Figure 3B). These findings suggest that the identified genes play crucial roles in modulating immune responses and inflammatory processes.

Figure 3

Panel A shows a KEGG pathway enrichment analysis bar chart with pathways ranked by -log10(p-value), from cytokine-cytokine receptor interaction to EGFR tyrosine kinase inhibitor resistance. Panel B displays three bar charts of the top 10 enriched Gene Ontology (GO) terms by p-value, categorized into biological process (BP), cellular component (CC), and molecular function (MF), using a color scale from yellow to dark red indicating increasing -log10(p-value).

Figure 3. Functional enrichment analysis. (A) Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis results. (B) Gene Ontology (GO) enrichment analysis results.

3.3 Screening DEGs and selection of the hub genes

Differential expression analysis identified 636 upregulated and 375 downregulated genes (Figure 4A). To pinpoint genes common to both the transcriptome dataset and MR analysis, we intersected their respective DEGs sets, which yielded 12 S-DEGs (Figure 4B; Supplementary Table S9). ML algorithms are powerful tools for biomarker discovery and enable the identification of key genes associated with UC. To identify candidate hub S-DEGs potentially involved in UC progression, we employed three ML algorithms: LASSO regression, SVM-RFE, and RF. LASSO regression analysis identified seven candidate genes: CDH5, EIF5A2, IDO1, MMP7, MYL5, PCK1, and TNFAIP8 (Figure 5A). Eight candidate signature genes (EIF5A2, PCK1, CDH5, FJX1, IDO1, GPX7, EGFL6, and MYL5) were obtained using the SVM-RFE algorithm (Figure 5B). The RF algorithm screened the top five genes by relative importance: EIF5A2, STC1, CDH5, MYL5, and IDO1 (Figures 5C,D). Finally, a Venn diagram was used to identify the common genes among the results of these three algorithms. This intersection revealed four core signature genes: EIF5A2, CDH5, IDO1, and MYL5 (Figure 5E).

Figure 4

Volcano plot and Venn diagram comparison. The volcano plot on the left displays gene expression levels with downregulated genes in blue, upregulated in red, and non-significant in gray, highlighting the top 25 labeled genes. The Venn diagram and accompanying charts on the right show the overlap of 12 elements between 999 differentially expressed genes (DEGs) and 156 metabolic rate genes (MR), with an additional bar graph indicating list sizes and shared elements.

Figure 4. Screening of differentially expressed genes. (A) Volcano plot depicting gene expression changes. (B) Venn diagram illustrating the overlap between differentially expressed genes (DEG) and genes related to Mendelian randomization (MR).

Figure 5

A series of visualizations depict statistical analyses and data modeling results. Panel A shows line and scatter plots illustrating coefficient paths and binomial deviance across log lambda values. Panel B features plots of generalization error and cross-validation accuracy against the number of features. Panel C presents a line chart tracking Random Forest error rates over trees. Panel D displays gene importance with a bubble chart, highlighting genes like EIF5A2. Panel E is a Venn diagram comparing Lasso, SVM-RFE, and RF methods, indicating common and unique gene features such as CDH5 and MYL5.

Figure 5. Hub gene selection. (A) Lasso regression plot and 10-fold cross-validation plot. (B) The accuracy curve and error rate curve for 5-fold cross-validation based on the SVM-RFE algorithm. (C) RF error rate versus the number of classification trees. (D) The top 5 relatively significant genes based on the RF algorithm. (E) Three algorithmic Venn diagram screening hub genes.

3.4 Construction and evaluation of a diagnostic nomogram

To assess the predictive capability of the identified hub genes, we constructed a diagnostic nomogram (Figure 6A). The nomogram’s predictive efficacy was then evaluated using calibration curves, which demonstrated close agreement between predicted and actual probabilities, indicating good model calibration (Figure 6B). Furthermore, ROC curve analysis revealed a high AUC of 0.985 for the nomogram, signifying its excellent diagnostic performance (Figure 6C). For external validation, individual ROC curves for these four hub genes were generated using the GSE75214 dataset (Figure 6D). All genes exhibited strong discriminative abilities, with AUC values ranging from 0.750 to 0.981. Subsequently, boxplot analyses of the GSE75214 dataset compared the expression levels of these genes between the control and UC groups (Figures 6E–H). Consistent with findings from training sets, CDH5, EIF5A2, and IDO1 were significantly upregulated, while MYL5 was significantly downregulated in the UC group compared to the control group, all demonstrating strong statistical significance (p < 0.005 for all, as indicated in figures). These comprehensive results collectively underscore the potential of EIF5A2, CDH5, IDO1, and MYL5 as promising diagnostic biomarkers for UC.

Figure 6

(A) A prediction nomogram with gene scores and total score. (B) Calibration plot comparing predicted and observed values. (C) ROC curve with AUC of 0.985. (D) Combined ROC curves for EIF5A2, CDH5, IDO1, and MYL5 genes with respective AUC values. (E-H) Box plots showing expression levels of CDH5, EIF5A2, IDO1, and MYL5 in control versus UC groups, with p-values indicating statistical significance.

Figure 6. Construction and validation of a diagnostic nomogram model. (A) A diagnostic nomogram was developed based on the four signature genes. (B) Calibration curves to assess the predictive power of the nomogram. (C) ROC curve of the nomogram model. (D) ROC curve of the four hub genes in the external dataset (GSE75214). (E–H) The expression levels of the four hub genes in the GSE75214 dataset.

3.5 GSEA analysis

Subsequently, GSEA of the hub genes was performed, and the top three upregulated and downregulated pathways were visualized using the “clusterProfiler” package. KEGG pathway analysis revealed distinct enrichment patterns: CDH5 was significantly associated with pathways including cytokine-cytokine receptor interaction, hematopoietic cell lineage, leishmania infection, RNA polymerase, steroid biosynthesis, and vibrio cholerae infection (Figure 7A). EIF5A2 demonstrated enrichment in cytokine-cytokine receptor interaction, fructose and mannose metabolism, hematopoietic cell lineage, leishmania infection, non-homologous end joining, and O-glycan biosynthesis (Figure 7B). IDO1 was prominently linked to pathways such as cytokine-cytokine receptor interaction, hematopoietic cell lineage, leishmania infection, pentose phosphate pathway, RNA polymerase, and vasopressin-regulated water reabsorption (Figure 7C). MYL5 showed enrichment in pathways related to amino sugar and nucleotide sugar metabolism, butanoate metabolism, cysteine and methionine metabolism, endometrial cancer, metabolism of xenobiotics by cytochrome P450, and retinol metabolism (Figure 7D). Collectively, these findings suggest that all four biomarkers are involved in diverse biological pathways related to immune response, signal transduction, metabolism, and cellular stress, thereby highlighting their potential roles in the pathogenesis of UC.

Figure 7

Four GSEA enrichment plots labeled A, B, C, and D, each displaying pathways and running enrichment scores across ranked datasets. Plot A shows CDH5 with multiple KEGG pathways. Plot B features EIF5A2 with distinct pathways. Plot C displays IDO1, and plot D shows MYL5 with KEGG pathway interactions. Each graph includes X and Y axes with ranks and scores, colored pathways, and ranked list metrics.

Figure 7. GSEA analysis of four hub genes characteristic of UC. (A) GSEA of CDH5 in UC. (B) GSEA of EIF5A2 in UC. (C) GSEA of IDO1 in UC. (D) GSEA of MYL5 in UC.

3.6 Immune infiltration analysis

The CIBERSORT algorithm calculated the infiltration proportion of 22 immune cells for each sample. The result showed that the proportions of naive B cells, naive CD4 T cells, CD4 memory T cells (activated), follicular helper T cells, gamma delta T cells, M1 macrophages, activated dendritic cells, activated mast cells, and neutrophils were upregulated in the UC samples compared with the control samples. The proportions of CD8 T cells, resting CD4 memory T cells, regulatory T cells (Tregs), activated NK cells, monocytes, M0 macrophages, M2 macrophages, resting dendritic cells, and resting mast cells were downregulated in the UC group compared with the control group (Figure 8A). This finding suggests a significant difference in the immune microenvironment between the control and UC groups. We further analyzed the relationship between four hub genes and immune cell infiltration (Figure 8B). CDH5 was positively correlated with neutrophils and M0 macrophages, while negatively correlated with resting mast cells and M2 macrophages. EIF5A2 was positively correlated with neutrophils and CD4 memory T cells (activated), while negatively correlated with resting mast cells and M2 macrophages. IDO1 was positively correlated with neutrophils and M1 macrophages and CD4 memory T cells (activated), while negatively correlated with regulatory T cells (Tregs) and M2 macrophages. MYL5 was positively correlated with M2 macrophages and regulatory T cells (Tregs), while negatively correlated with neutrophils and CD4 memory T cells (activated). These findings suggest that these hub genes may play significant roles in modulating immune cell infiltration and immune responses.

Figure 8

Panel A shows a box plot comparing immune cell fractions in control (n=41) and UC (n=249) groups across various cell types, with significant differences marked. Panel B is a correlation matrix of gene-immune infiltrating cells, with colored lines indicating correlation strength and significance. Positive correlations are in pink, negative in blue, and non-significant in grey.

Figure 8. Analysis of immune cell infiltration. (A) Differentially expressed immune cells between control and UC samples. (B) Correlation analysis between immune cells and the 4 hub genes.

3.7 Functional prediction of key genes and construction of mRNA-miRNA-lncRNA regulatory network

To further understand the role of key genes in disease pathogenesis, these genes were submitted to the GeneMANIA database for functional and pathway prediction (Figure 9A). The results indicated that key genes were enriched in epithelial cell migration, translation regulator activity, tryptophan metabolic process, cellular biogenic amine metabolic process, regulation of muscle system process, aromatic amino acid family metabolic process, and extrinsic component of plasma membrane. Figures 9B,C illustrate mRNA-miRNA-lncRNA regulatory networks centered on CDH5 and EIF5A2, respectively. In Figure 9B, CDH5 is shown to be regulated by multiple microRNAs, including hsa-miR-125b-5p, hsa-miR-101-3p, hsa-miR-1226-3p, hsa-miR-1202, and hsa-miR-125a-5p. These microRNAs also target and regulate a shared group of lncRNAs, such as HTR5A-AS1, LINC01070, and HP09025. Similarly, Figure 9C reveals that EIF5A2 is a target of several microRNAs, including hsa-miR-1225-3p, hsa-miR-1227-3p, and hsa-miR-1226-3p, which also regulate a common set of lncRNAs, such as LINC01002, LINC01043, and LINC01106.

Figure 9

Panel A shows a network of protein interactions highlighted using different colors for physical interactions, co-expression, and other functions. Panels B and C depict miRNA-mRNA interaction networks related to CDH5 and EIF5A, with nodes representing genes in green and miRNAs in orange, connected by black lines.

Figure 9. Functional prediction of key genes and construction of mRNA-miRNA-lncRNA regulatory network. (A) Co-expression network of hub genes predicted by GeneMANIA. (B,C) Construction of an mRNA-miRNA-lncRNA regulatory network using data from the miRanda, miRDB, and TargetScan databases.

3.8 Quality control of single-cell RNA-seq data and cell annotation

To elucidate the expression landscape of potential driver genes within the UC-associated single-cell transcriptomic data, we first conducted rigorous quality control on the single-cell dataset utilizing the Seurat package. The violin plots (Supplementary Figures S1A,B) illustrated the distributions of nFeatureRNA, nCountRNA, and percent mitochondrial before and after filtering. The selection process for the top 2,500 highly variable genes was visually represented through a scatter plot (Supplementary Figure S1C). Further analysis via scatter plots (Supplementary Figure S1D) characterized inter-variable relationships, revealing a significant negative correlation between percent mitochondrial and nCountRNA (r = −0.53). Conversely, a strong positive correlation was identified between nFeatureRNA and nCountRNA (r = 0.48). Additionally, a bubble plot (Supplementary Figure S1E) displayed the distribution of the top 25 genes contributing most significantly to the first three principal components, while a heatmap (Supplementary Figure S1F) visualized the expression patterns of these top-ranked genes across cells. The clustering analysis of the dataset identified 18 distinct clusters, which were subsequently annotated using the “SingleR” package. This process resolved the clusters into seven major cell types: B cells, endothelial cells, epithelial cells, monocytes, natural killer cells, T cells, and tissue stem cells (Figures 10A,B). The cell proportion figure illustrates the proportional distribution of seven cell types in UC samples, predominantly featuring B cells and T cells (Figure 10C). The UMAP plot illustrates elevated CDH5 expression in endothelial cell clusters, with subcellular localization analysis indicating primary distribution in the nucleus and plasma membrane (Figure 10D). EIF5A2 expression is prominently enriched in tissue stem cell and T cell clusters on the UMAP, with subcellular localization analysis indicating primary distribution in the cytosol and nucleus (Figure 10E). IDO1 expression shows enrichment in monocyte clusters via UMAP visualization, with subcellular localization analysis indicating primary distribution in the cytosol, nucleus, and mitochondria (Figure 10F). The UMAP plot reveals upregulated MYL5 expression in B cell, endothelial cell, and epithelial cell clusters, with subcellular localization analysis indicating primary distribution in the cytosol and cytoskeleton (Figure 10G).

Figure 10

Composite image showing multiple panels: (A) UMAP plot displaying cell clusters. (B) Another UMAP plot labeled by cell types including B-cell, Myeloid, and T-cell. (C) Bar graph of cell type proportions across UCs. (D-G) Panels featuring UMAP plots, bar graphs of gene expression or subcellular localization (CDH5, EIF5A2, IDO1, MYL5), and corresponding cellular diagrams illustrating subcellular compartments such as the nucleus, endoplasmic reticulum, and mitochondria. Each gene is associated with specific subcellular localization visualized with bar and diagram representations.

Figure 10. Cell annotation, single-cell expression analysis, and subcellular localization of hub genes in UC samples. (A,B) Cell clustering and automated annotation of clustered cells into seven cell types. (C) Proportional distribution of seven cell types in UC samples. (D) Expression distribution of CDH5 protein in UC and CDH5 subcellular localization. (E) Expression distribution of EIF5A2 protein in UC and EIF5A2 subcellular localization. (F) Expression distribution of IDO1 protein in UC and IDO1 subcellular localization. (G) Expression distribution of MYL5 protein in UC and MYL5 subcellular localization.

3.9 Validation of hub gene expression in the experimental colitis

To experimentally validate our bioinformatics findings, a DSS-induced UC mouse model was established. Compared to controls, DSS-treated mice exhibited significant body weight loss (Figure 11A) and progressively elevated DAI scores (Figure 11B). Clinical manifestations, including depression, dark fur, and persistent diarrhea, emerged from day 4, with perianal bleeding and soft stools evident by day 7. Additionally, the colon length was significantly reduced in the DSS group, confirming the UC model’s validity (Figures 11C,D) (Yan et al., 2009; Cao et al., 2025). In addition, we observed a significant increase in the expression of COX-2 and NLRP3, both of which are known markers of strong inflammatory response. However, there was no significant change in the expression of PCNA, a known marker of cell proliferation (Figures 11E–G). Furthermore, qRT-PCR validation confirmed the significant upregulation of EI5A2, IDO1, and CDH5 mRNA in the model group, consistent with our bioinformatics analysis (Figures 11H–J).

Figure 11

Graphical summary of an experiment comparing control and DSS-treated groups. Panel A shows weight change over time, with control group increasing and DSS group decreasing significantly. Panel B depicts DAI scores rising significantly in the DSS group. Panel C displays a photo of colons, visibly shorter in the DSS group. Panel D's bar graph shows significantly reduced colon length in the DSS group. Panels E to J present relative mRNA expressions (COX-2, NLRP3, PCNA, IDO1, EIF5A2, CDH5), with significant increases in the DSS group for most genes, except PCNA, which is not significant.

Figure 11. The expression of feature genes in the mouse model of UC. (A,B) Body weight change and DAI score in mice. (C) Photographs of Control colon tissue and DSS-induced colon tissue. (D) Changes of colon length between control and DSS-induced colon tissue. (E) Relative mRNA expression of COX-2 between control and DSS-induced colon tissue. (F) Relative mRNA expression of NLRP3 between control and DSS-induced colon tissue. (G) Relative mRNA expression of PCNA (Proliferating Cell Nuclear Antigen) between control and DSS-induced colon tissue. (H) Relative mRNA expression of IDO1 between control and DSS-induced colon tissue. (I) Relative mRNA expression of EIF5A2 between control and DSS-induced colon tissue. (J) Relative mRNA expression of CDH5 between control and DSS-induced colon tissue. ****P < 0.0001, ***P < 0.001, **P < 0.01, *P < 0.05.

4 Discussion

UC poses significant clinical challenges due to an incomplete understanding of its genetic basis and the lack of precise therapeutic targets, often resulting in suboptimal treatment outcomes (Shimomori et al., 2025). Moreover, its heterogeneous manifestations frequently lead to diagnostic errors and delays, highlighting the urgent need for reliable early biomarkers (Sakurai and Saruta, 2023). Consequently, integrating multi-omics data to uncover causal genetic factors is essential for advancing UC research. Although prior GWAS studies have identified susceptibility loci, they typically fail to bridge genetic variants with functional protein changes and disease mechanisms at the transcriptomic and cellular levels. In this study, we implemented a comprehensive multi-omics strategy to identify and validate potential genetic drivers of UC. MR analysis of 4,288 plasma proteins from GWAS summary statistics pinpointed 168 genes with causal associations to UC, primarily enriched in immune and inflammatory pathways. Subsequent transcriptomic profiling and ML algorithms further refined this to four hub genes. Their pivotal roles were substantiated through predictive model construction, external dataset validation, and in vivo confirmation in a DSS-induced UC mouse model. These findings offer promising avenues for developing novel biomarkers and targeted therapies in UC.

This study identified four core genes that provide new insights into the pathological mechanisms of UC and reveal their potential roles in disease progression. EIF5A2 (eukaryotic translation elongation factor 5A2) is an important member of the eukaryotic translation elongation factor family, primarily involved in the regulation of protein translation elongation and polypeptide chain synthesis (Xiong et al., 2025). In recent years, EIF5A2 has been shown to play a critical role in various cancers, including breast cancer, liver cancer, pancreatic cancer, and gastric cancer (Bai et al., 2018; Guan et al., 2019; Wu et al., 2020; Jiang et al., 2022). However, its role in UC has not been adequately studied. For the first time, this study identified a significant upregulation of EIF5A2 in UC patients and further demonstrated its causal relationship with UC through MR analysis. This finding suggests that EIF5A2 may play an important role in the pathogenic mechanisms of UC. Additionally, a study has identified the small-molecule inhibitor SRI-011381 as a promising candidate drug for targeting EIF5A2 (Yang et al., 2023). This compound has demonstrated significant efficacy in preclinical models, highlighting the importance of developing specific inhibitors that effectively target EIF5A2 in cancer therapy. However, its role in UC remains insufficiently documented. The primary function of EIF5A2 lies in regulating protein translation elongation and polypeptide chain synthesis. Its unique translational regulatory function depends on hypusination, a highly conserved and specific post-translational modification (Suzuki et al., 2025). EIF5A2 is involved in cell proliferation, apoptosis, and inflammatory processes, as well as the regulation of transcription and RNA metabolism (Turpaev, 2018). Therefore, the upregulation of EIF5A2 may be closely associated with the activation of inflammatory signaling pathways. In this study, GSEA demonstrated that the high expression of EIF5A2 is significantly enriched in key pathways such as cytokine-cytokine receptor interaction and glucose metabolism, indicating that EIF5A2 may promote inflammation progression by regulating the activation and migration of immune cells. Particularly in the chronic inflammatory environment of UC, EIF5A2 likely exacerbates local intestinal immune imbalance by modulating T-cell function. This perspective aligns with findings in the tumor microenvironment, where EIF5A2 has been shown to promote cell migration and enhance survival signals (Wang et al., 2018). Moreover, single-cell RNA sequencing data revealed that EIF5A2 is highly expressed primarily in intestinal epithelial stem cells and T-cell populations, further suggesting its dual role in maintaining epithelial regeneration and immune homeostasis in UC. Notably, EIF5A2 has also been predicted to be a target of multiple microRNAs (e.g., hsa-miR-1225-3p and hsa-miR-1227-3p), which simultaneously regulate a group of lncRNAs, such as LINC01002 and LINC01106. This regulatory network suggests that EIF5A2 expression may be under multilayered epigenetic control, allowing precise modulation of its function in the pathological processes of UC. These findings provide a foundation for further exploration of the molecular mechanisms of EIF5A2 in UC and offer potential avenues for developing microRNA- or lncRNA-based targeted therapies. Based on these findings, our future research plans to utilize the validated small-molecule inhibitor SRI-011381 to ameliorate UC symptoms through EIF5A2 inhibition, thereby providing a novel therapeutic target for UC treatment.

In addition to EIF5A2, this study identified three other core genes, which are involved in distinct but interconnected biological processes, including immune regulation, epithelial integrity, and cellular metabolism. IDO1 plays a critical role in tryptophan metabolism and immune regulation. Its significant upregulation in UC suggests that it may regulate immune tolerance through the tryptophan-kynurenine pathway (Upadhyay et al., 2025). Furthermore, a previous study found that IDO1 promotes UC progression by inducing M1 macrophage polarization via the GRP78-XBP1 pathway, which is related to endoplasmic reticulum stress (Gao et al., 2025). In this study, we also observed that IDO1 was positively correlated with M1 macrophages and negatively correlated with M2 macrophages, suggesting that IDO1 may contribute to UC by activating M1 macrophages. Another study demonstrated that oxymatrine alleviates UC by inhibiting IDO1, thereby reducing inflammation and ferroptosis (Gao et al., 2024). Therefore, inhibiting IDO1 activity may help restore immune balance and alleviate chronic inflammation, making it a promising therapeutic target for UC intervention. Although the application of IDO1 inhibitors in UC is still in its early stages, current evidence suggests that targeting IDO1 may help restore immune balance, alleviate UC symptoms, and reduce inflammatory responses (Proietti et al., 2023; Wang et al., 2023). IDO1 inhibitors could emerge as a promising therapeutic strategy for UC, particularly when combined with other immunomodulatory therapies, potentially yielding more significant clinical benefits.

CDH5, also known as vascular endothelial cadherin, is a transmembrane protein predominantly expressed in endothelial cells and plays a critical role in maintaining vascular barrier function (Cavallaro et al., 2006). Endothelial dysfunction is a hallmark of UC progression, leading to immune cell infiltration and impaired tissue repair (Roifman et al., 2009). In this study, we also found for the first time that CDH5 is significantly upregulated in UC patients. Immune infiltration analysis indicated that CDH5 is primarily expressed in regions of neutrophil infiltration. Single-cell RNA sequencing further confirmed that CDH5 is highly expressed in endothelial cell clusters, highlighting its role in vascular remodeling during inflammation. These findings suggest that CDH5 may contribute to the inflammatory microenvironment of UC by mediating immune cell migration and endothelial activation. Additionally, the role of CDH5 in leukocyte adhesion and endothelial permeability indicates that it may be involved in the disruption of the mucosal barrier (Müller et al., 2022). Targeting CDH5 to enhance endothelial integrity and reduce immune cell infiltration could represent a novel therapeutic strategy to alleviate inflammation and promote mucosal healing.

MYL5, which encodes one of the light chains of myosin, is an essential protein for maintaining cytoskeletal dynamics and cellular contractility (Ramirez et al., 2021). Myosin is critical for maintaining epithelial structure and function, particularly in the intestinal barrier, which is frequently impaired in UC patients (Lechuga et al., 2022). A previous study reported that MYL5 expression is reduced in multiple cancers, including breast cancer, colorectal cancer, esophageal cancer, gastric cancer, head and neck cancer, and leukemia, compared to corresponding normal tissues (Lv, 2023). Similarly, our study found that MYL5 is significantly downregulated in UC patients, suggesting its potential involvement in barrier dysfunction and impaired epithelial repair. GSEA analysis showed that MYL5 is enriched in pathways related to amino sugar and nucleotide sugar metabolism, cysteine and methionine metabolism, and retinol metabolism, indicating its role in stress response and metabolic regulation. These pathways are essential for maintaining epithelial homeostasis, and reduced MYL5 expression may lead to cytoskeletal instability, increased intestinal permeability, and exacerbated inflammation. Single-cell RNA sequencing revealed that MYL5 is primarily expressed in intestinal epithelial and endothelial cells, further confirming its importance in maintaining the structural integrity of the intestinal barrier. Restoring MYL5 expression or compensating for its functional loss may provide a potential strategy to reduce intestinal permeability and prevent UC-associated inflammatory exacerbation.

The notable strength of this study lies in the comprehensive application of multi-omics approaches, which integrated MR analysis, gene expression profiling, ML algorithms, and in vivo validation to systematically identify and validate causal genetic factors for UC. This multi-layered research strategy not only effectively reduced the false-positive results commonly observed in single-omics studies but also significantly enhanced the reliability and reproducibility of the findings. Furthermore, the use of single-cell RNA sequencing allowed us to localize gene expression to specific cell types, providing critical insights into the cellular context of the core genes. Despite the multiple strengths of this study, certain limitations must be acknowledged. First, the MR analysis in this study relied on genetic instruments derived from GWAS data, which are predominantly based on European populations. This reliance may limit the generalizability of the findings to other ethnic groups and may not fully capture the genetic and environmental heterogeneity across populations. Future studies should focus on validating the roles of these core genes in more diverse populations to improve the universality of the findings. Additionally, the transcriptomic data used in this study were obtained from publicly available databases, lacking large-scale clinical sample validation. The inherent sample heterogeneity and batch effects may impact the robustness of the results. Future studies should incorporate multi-center samples and perform batch effect correction to further enhance data reliability. Second, although we validated the functions of three upregulated genes in a DSS-induced UC mouse model, the MYL5 gene does not exist in mice, making it impossible to validate in this model. Furthermore, this model primarily mimics acute intestinal inflammation and may not fully recapitulate the chronic and relapsing nature of human UC. Additionally, inflammation induced by DSS progresses more rapidly and severely than inflammation observed in human UC. Moreover, there are notable differences between mice and humans in terms of immune system architecture and intestinal microenvironment. Therefore, future studies should incorporate patient-derived organoid models or chronic UC animal models to further validate the roles of these genes in UC progression. Additionally, since this study heavily relies on computational predictions and bioinformatics analysis, there may be certain interpretative limitations. Therefore, future studies should incorporate more comprehensive experimental data, such as gene knockout, overexpression experiments, and protein-level investigations, to further validate the functions and mechanisms of these genes. Furthermore, the lack of clinical specimens to validate the differential expression of the core genes limits the cross-species generalizability of the results. Finally, the exact molecular mechanisms through which these core genes interact within specific signaling pathways and molecular networks in UC pathogenesis remain unclear. Future research should focus on functional studies, including gene knockout, overexpression experiments, and protein-level investigations, to further elucidate the roles of these genes and support their potential as therapeutic targets.

5 Conclusion

In summary, this study identified four core genes—EIF5A2, IDO1, CDH5, and MYL5—providing new insights into the genetic and molecular mechanisms underlying UC. These genes play critical roles in immune regulation, barrier integrity, and cellular metabolism. Through a multi-omics approach, we not only established the causal associations of these genes with UC but also highlighted their potential as biomarkers and therapeutic targets. Future research should focus on elucidating their specific mechanisms, validating their clinical utility, and exploring their therapeutic potential to advance the precision medicine management of UC.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Ethics statement

The animal study was approved by the Institutional Animal Care and Use Committee of Yi Shengyuan Gene Technology (Tianjin) Co., Ltd., (protocol number YSY-DWLL-2024713). The study was conducted in accordance with the local legislation and institutional requirements.

Author contributions

JZ: Conceptualization, Data curation, Writing – original draft, Writing – review and editing, Formal Analysis, Funding acquisition, Investigation. WZ: Investigation, Methodology, Resources, Writing – original draft, Writing – review and editing, Project administration. YL: Data curation, Methodology, Writing – original draft, Formal Analysis, Investigation, Project administration, Writing – review and editing. BY: Supervision, Writing – original draft, Data curation, Methodology, Software. DS: Writing – review and editing, Resources, Supervision, Validation, Visualization. ZL: Funding acquisition, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review and editing.

Funding

The authors declare that financial support was received for the research and/or publication of this article. This study was supported by the Youth Science and Technology Innovation Project of Shandong Province for Maternal and Child Health (SFYZXJJ-2024034) and Shandong Provincial Maternal and Child Health Hospital Institutional-Level Project (2021SFF012).

Acknowledgments

Acknowledgements

We are grateful to GEO databases for providing the datasets essential for this research.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmolb.2025.1686282/full#supplementary-material

References

Ai, Q., and Yang, B. (2023). Are inflammatory bowel diseases associated with an increased risk of COVID-19 susceptibility and severity? A two-sample Mendelian randomization study. Front. Genet. 14, 1095050. doi:10.3389/fgene.2023.1095050

PubMed Abstract | CrossRef Full Text | Google Scholar

Bai, H.-Y., Liao, Y.-J., Cai, M.-Y., Ma, N.-F., Zhang, Q., Chen, J.-W., et al. (2018). Eukaryotic initiation factor 5A2 contributes to the maintenance of CD133(+) hepatocellular carcinoma cells via the c-Myc/microRNA-29b axis. Stem Cells 36, 180–191. doi:10.1002/stem.2734

PubMed Abstract | CrossRef Full Text | Google Scholar

Balachandran, V. P., Gonen, M., Smith, J. J., and DeMatteo, R. P. (2015). Nomograms in oncology: more than meets the eye. Lancet Oncol. 16, e173–e180. doi:10.1016/S1470-2045(14)71116-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Bhandari, N., Walambe, R., Kotecha, K., and Khare, S. P. (2022). A comprehensive survey on computational learning methods for analysis of gene expression data. Front. Mol. Biosci. 9, 907150. doi:10.3389/fmolb.2022.907150

PubMed Abstract | CrossRef Full Text | Google Scholar

Bowden, J., Davey Smith, G., Haycock, P. C., and Burgess, S. (2016). Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator. Genet. Epidemiol. 40, 304–314. doi:10.1002/gepi.21965

PubMed Abstract | CrossRef Full Text | Google Scholar

Bowden, J., Del Greco, M. F., Minelli, C., Davey Smith, G., Sheehan, N., and Thompson, J. (2017). A framework for the investigation of pleiotropy in two-sample summary data Mendelian randomization. Stat. Med. 36, 1783–1802. doi:10.1002/sim.7221

PubMed Abstract | CrossRef Full Text | Google Scholar

Burgess, S., and Thompson, S. G. (2017). Interpreting findings from Mendelian randomization using the MR-Egger method. Eur. J. Epidemiol. 32, 377–389. doi:10.1007/s10654-017-0255-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Butler, A., Hoffman, P., Smibert, P., Papalexi, E., and Satija, R. (2018). Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat. Biotechnol. 36, 411–420. doi:10.1038/nbt.4096

PubMed Abstract | CrossRef Full Text | Google Scholar

Bycroft, C., Freeman, C., Petkova, D., Band, G., Elliott, L. T., Sharp, K., et al. (2018). The UK biobank resource with deep phenotyping and genomic data. Nature 562, 203–209. doi:10.1038/s41586-018-0579-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Cao, T., Wang, Y., Zhang, J., Song, W., Dai, W., Ji, G., et al. (2025). Leonurine alleviates DSS-induced colitis in mice by regulating pancreatic secretion pathway and gut microbiota. J. Immunol. Res. 2025, 6626309. doi:10.1155/jimr/6626309

PubMed Abstract | CrossRef Full Text | Google Scholar

Cavallaro, U., Liebner, S., and Dejana, E. (2006). Endothelial cadherins and tumor angiogenesis. Exp. Cell Res. 312, 659–667. doi:10.1016/j.yexcr.2005.09.019

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, X., Wu, H., Wang, L., Guo, Y., and Miao, X. (2025). Decoding ulcerative colitis: Plasma protein-mediated antibody immune responses influence disease risk. Anal. Biochem. 706, 115946. doi:10.1016/j.ab.2025.115946

PubMed Abstract | CrossRef Full Text | Google Scholar

Cholapranee, A., Hazlewood, G. S., Kaplan, G. G., Peyrin-Biroulet, L., and Ananthakrishnan, A. N. (2017). Systematic review with meta-analysis: comparative efficacy of biologics for induction and maintenance of mucosal healing in Crohn’s disease and ulcerative colitis controlled trials. Aliment. Pharmacol. Ther. 45, 1291–1302. doi:10.1111/apt.14030

PubMed Abstract | CrossRef Full Text | Google Scholar

Dai, C., Huang, Y.-H., and Jiang, M. (2023). Combination therapy in inflammatory bowel disease: current evidence and perspectives. Int. Immunopharmacol. 114, 109545. doi:10.1016/j.intimp.2022.109545

PubMed Abstract | CrossRef Full Text | Google Scholar

Di Narzo, A. F., Telesco, S. E., Brodmerkel, C., Argmann, C., Peters, L. A., Li, K., et al. (2017). High-throughput characterization of blood serum proteomics of IBD patients with respect to aging and genetic factors. PLoS Genet. 13, e1006565. doi:10.1371/journal.pgen.1006565

PubMed Abstract | CrossRef Full Text | Google Scholar

Duan, J., Soussen, C., Brie, D., Idier, J., Wan, M., and Wang, Y.-P. (2016). Generalized LASSO with under-determined regularization matrices. Signal Process. 127, 239–246. doi:10.1016/j.sigpro.2016.03.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Eichele, D. D., and Kharbanda, K. K. (2017). Dextran sodium sulfate colitis murine model: an indispensable tool for advancing our understanding of inflammatory bowel diseases pathogenesis. World J. Gastroenterol. 23, 6016–6029. doi:10.3748/wjg.v23.i33.6016

PubMed Abstract | CrossRef Full Text | Google Scholar

Ferkingstad, E., Sulem, P., Atlason, B. A., Sveinbjornsson, G., Magnusson, M. I., Styrmisdottir, E. L., et al. (2021). Large-scale integration of the plasma proteome with genetics and disease. Nat. Genet. 53, 1712–1721. doi:10.1038/s41588-021-00978-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Gao, B. B., Wang, L., Li, L. Z., Fei, Z. Q., Wang, Y. Y., Zou, X. M., et al. (2024). Beneficial effects of oxymatrine from Sophora flavescens on alleviating Ulcerative colitis by improving inflammation and ferroptosis. J. Ethnopharmacol. 332, 118385. doi:10.1016/j.jep.2024.118385

PubMed Abstract | CrossRef Full Text | Google Scholar

Gao, Z., Shao, S., Xu, Z., Nie, J., Li, C., and Du, C. (2025). IDO1 induced macrophage M1 polarization via ER stress-associated GRP78-XBP1 pathway to promote ulcerative colitis progression. Front. Med. (Lausanne) 12, 1524952. doi:10.3389/fmed.2025.1524952

PubMed Abstract | CrossRef Full Text | Google Scholar

Ge, C., Lu, Y., Shen, Z., Lu, Y., Liu, X., Zhang, M., et al. (2025). Machine learning and metabolomics identify biomarkers associated with the disease extent of ulcerative colitis. J. Crohns Colitis 19, jjaf020. doi:10.1093/ecco-jcc/jjaf020

PubMed Abstract | CrossRef Full Text | Google Scholar

Guan, X., Gu, S., Yuan, M., Zheng, X., and Wu, J. (2019). MicroRNA-33a-5p overexpression sensitizes triple-negative breast cancer to doxorubicin by inhibiting eIF5A2 and epithelial-mesenchymal transition. Oncol. Lett. 18, 5986–5994. doi:10.3892/ol.2019.10984

PubMed Abstract | CrossRef Full Text | Google Scholar

Hartwig, F. P., Davey Smith, G., and Bowden, J. (2017). Robust inference in summary data Mendelian randomization via the zero modal pleiotropy assumption. Int. J. Epidemiol. 46, 1985–1998. doi:10.1093/ije/dyx102

PubMed Abstract | CrossRef Full Text | Google Scholar

Ishwaran, H., and Kogalur, U. B. (2010). Consistency of random survival forests. Stat. Probab. Lett. 80, 1056–1064. doi:10.1016/j.spl.2010.02.020

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiang, L., Zhang, Y., Su, P., Ma, Z., Ye, X., Kang, W., et al. (2022). Long non-coding RNA HNF1A-AS1 induces 5-FU resistance of gastric cancer through miR-30b-5p/EIF5A2 pathway. Transl. Oncol. 18, 101351. doi:10.1016/j.tranon.2022.101351

PubMed Abstract | CrossRef Full Text | Google Scholar

Kim, J. J., Shajib, M. S., Manocha, M. M., and Khan, W. I. (2012). Investigating intestinal inflammation in DSS-Induced model of IBD. J. Vis. Exp., 3678. doi:10.3791/3678

PubMed Abstract | CrossRef Full Text | Google Scholar

Lechuga, S., Cartagena-Rivera, A. X., Khan, A., Crawford, B. I., Narayanan, V., Conway, D. E., et al. (2022). A myosin chaperone, UNC-45A, is a novel regulator of intestinal epithelial barrier integrity and repair. FASEB J. 36, e22290. doi:10.1096/fj.202200154R

PubMed Abstract | CrossRef Full Text | Google Scholar

Leek, J. T., Johnson, W. E., Parker, H. S., Jaffe, A. E., and Storey, J. D. (2012). The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics 28, 882–883. doi:10.1093/bioinformatics/bts034

PubMed Abstract | CrossRef Full Text | Google Scholar

Lin, X., Li, C., Zhang, Y., Su, B., Fan, M., and Wei, H. (2017). Selecting feature subsets based on SVM-RFE and the overlapping ratio with applications in bioinformatics. Molecules 23, 52. doi:10.3390/molecules23010052

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, Y., Li, C., Yao, L., Tian, M., Tan, Y., Shi, L., et al. (2025). A multi-omics exploration revealing SLIT2 as a prime therapeutic target for peripheral facial paralysis: integrating single-cell transcriptomics and plasma proteome data. Cell Mol. Neurobiol. 45, 87. doi:10.1007/s10571-025-01607-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Lv, M. (2023). MYL5 as a novel prognostic marker is associated with immune infiltrating in breast cancer: a preliminary study. Breast J. 2023, 9508632. doi:10.1155/2023/9508632

PubMed Abstract | CrossRef Full Text | Google Scholar

Maag, J. L. V. (2018). Gganatogram: an R package for modular visualisation of anatograms and tissues based on ggplot2. F1000Res 7, 1576. doi:10.12688/f1000research.16409.2

PubMed Abstract | CrossRef Full Text | Google Scholar

Müller, M. B., Hübner, M., Li, L., Tomasi, S., Ließke, V., Effinger, D., et al. (2022). Cell-crossing functional network driven by microRNA-125a regulates endothelial permeability and monocyte trafficking in acute inflammation. Front. Immunol. 13, 826047. doi:10.3389/fimmu.2022.826047

PubMed Abstract | CrossRef Full Text | Google Scholar

Newman, A. M., Steen, C. B., Liu, C. L., Gentles, A. J., Chaudhuri, A. A., Scherer, F., et al. (2019). Determining cell type abundance and expression from bulk tissues with digital cytometry. Nat. Biotechnol. 37, 773–782. doi:10.1038/s41587-019-0114-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Ng, S. C., Shi, H. Y., Hamidi, N., Underwood, F. E., Tang, W., Benchimol, E. I., et al. (2017). Worldwide incidence and prevalence of inflammatory bowel disease in the 21st century: a systematic review of population-based studies. Lancet 390, 2769–2778. doi:10.1016/S0140-6736(17)32448-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Papier, K., Atkins, J. R., Tong, T. Y. N., Gaitskell, K., Desai, T., Ogamba, C. F., et al. (2024). Identifying proteomic risk factors for cancer using prospective and exome analyses of 1463 circulating proteins and risk of 19 cancers in the UK biobank. Nat. Commun. 15, 4010. doi:10.1038/s41467-024-48017-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Peng, L., Peng, H., Wu, N., Wang, X., Han, L., and Ding, X. (2025). Potential drug targets for intracranial aneurysms identified through Mendelian randomization analysis. Neurosurg. Rev. 48, 683. doi:10.1007/s10143-025-03870-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Proietti, E., Pauwels, R. W. M., de Vries, A. C., Orecchini, E., Volpi, C., Orabona, C., et al. (2023). Modulation of indoleamine 2,3-Dioxygenase 1 during inflammatory bowel disease activity in humans and mice. Int. J. Tryptophan Res. 16, 11786469231153109. doi:10.1177/11786469231153109

PubMed Abstract | CrossRef Full Text | Google Scholar

Ramirez, I., Gholkar, A. A., Velasquez, E. F., Guo, X., Tofig, B., Damoiseaux, R., et al. (2021). The myosin regulatory light chain Myl5 localizes to mitotic spindle poles and is required for proper cell division. Cytoskelet. Hob. 78, 23–35. doi:10.1002/cm.21654

PubMed Abstract | CrossRef Full Text | Google Scholar

Ritchie, M. E., Phipson, B., Wu, D., Hu, Y., Law, C. W., Shi, W., et al. (2015). Limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47. doi:10.1093/nar/gkv007

PubMed Abstract | CrossRef Full Text | Google Scholar

Roifman, I., Sun, Y. C., Fedwick, J. P., Panaccione, R., Buret, A. G., Liu, H., et al. (2009). Evidence of endothelial dysfunction in patients with inflammatory bowel disease. Clin. Gastroenterol. Hepatol. 7, 175–182. doi:10.1016/j.cgh.2008.10.021

PubMed Abstract | CrossRef Full Text | Google Scholar

Sakurai, T., and Saruta, M. (2023). Positioning and usefulness of biomarkers in inflammatory bowel disease. Digestion 104, 30–41. doi:10.1159/000527846

PubMed Abstract | CrossRef Full Text | Google Scholar

Santana, P. T., Rosas, S. L. B., Ribeiro, B. E., Marinho, Y., and de Souza, H. S. P. (2022). Dysbiosis in inflammatory bowel disease: pathogenic role and potential therapeutic targets. Int. J. Mol. Sci. 23, 3464. doi:10.3390/ijms23073464

PubMed Abstract | CrossRef Full Text | Google Scholar

Shimomori, Y., Yokoyama, Y., Kurumi, H., Akita, K., Kazama, T., Hayashi, Y., et al. (2025). Unraveling the complexity of ulcerative colitis: insights into cytokine dysregulation and targeted therapies. EXCLI J. 24, 638–658. doi:10.17179/excli2025-8374

PubMed Abstract | CrossRef Full Text | Google Scholar

Sudlow, C., Gallacher, J., Allen, N., Beral, V., Burton, P., Danesh, J., et al. (2015). UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12, e1001779. doi:10.1371/journal.pmed.1001779

PubMed Abstract | CrossRef Full Text | Google Scholar

Suhre, K., Arnold, M., Bhagwat, A. M., Cotton, R. J., Engelke, R., Raffler, J., et al. (2017). Connecting genetic risk to disease end points through the human blood plasma proteome. Nat. Commun. 8, 14357. doi:10.1038/ncomms14357

PubMed Abstract | CrossRef Full Text | Google Scholar

Suzuki, M., Suzuki, T., Nakano, Y., Matsumoto, K., Manaka, H., Komeno, M., et al. (2025). Polyamines stimulate the protein synthesis of the translation initiation factor eIF5A2, participating in mRNA decoding, distinct from eIF5A1. J. Biol. Chem. 301, 110453. doi:10.1016/j.jbc.2025.110453

PubMed Abstract | CrossRef Full Text | Google Scholar

Turpaev, K. T. (2018). Translation factor eIF5A, modification with hypusine and role in regulation of gene expression. eIF5A as a target for pharmacological interventions. Biochem. (Mosc) 83, 863–873. doi:10.1134/S0006297918080011

PubMed Abstract | CrossRef Full Text | Google Scholar

Ungaro, R., Mehandru, S., Allen, P. B., Peyrin-Biroulet, L., and Colombel, J.-F. (2017). Ulcerative colitis. Lancet 389, 1756–1770. doi:10.1016/S0140-6736(16)32126-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Upadhyay, K. G., Desai, D. C., Ashavaid, T. F., and Dherai, A. J. (2025). Evaluating the role of kynurenine/tryptophan ratio as an indicator of disease activity in Indian patients with inflammatory bowel disease. A case-control study. Scand. J. Gastroenterol. 60, 454–462. doi:10.1080/00365521.2025.2491784

PubMed Abstract | CrossRef Full Text | Google Scholar

Verbanck, M., Chen, C.-Y., Neale, B., and Do, R. (2018). Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat. Genet. 50, 693–698. doi:10.1038/s41588-018-0099-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, X., Jin, Y., Zhang, H., Huang, X., Zhang, Y., and Zhu, J. (2018). MicroRNA-599 inhibits metastasis and epithelial-mesenchymal transition via targeting EIF5A2 in gastric cancer. Biomed. Pharmacother. 97, 473–480. doi:10.1016/j.biopha.2017.10.069

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, T., Song, Y., Ai, Z., Liu, Y., Li, H., Xu, W., et al. (2023). Pulsatilla chinensis saponins ameliorated murine depression by inhibiting intestinal inflammation mediated IDO1 overexpression and rebalancing tryptophan metabolism. Phytomedicine 116, 154852. doi:10.1016/j.phymed.2023.154852

PubMed Abstract | CrossRef Full Text | Google Scholar

Warde-Farley, D., Donaldson, S. L., Comes, O., Zuberi, K., Badrawi, R., Chao, P., et al. (2010). The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function. Nucleic Acids Res. 38, W214–W220. doi:10.1093/nar/gkq537

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, G.-Q., Xu, Y.-M., and Lau, A. T. Y. (2020). Recent insights into eukaryotic translation initiation factors 5A1 and 5A2 and their roles in human health and disease. Cancer Cell Int. 20, 142. doi:10.1186/s12935-020-01226-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, H., Wang, L., and Qiu, C. (2024). Causal relationship, shared genes between rheumatoid arthritis and pulp and periapical disease: evidence from GWAS and transcriptome data. Front. Immunol. 15, 1440753. doi:10.3389/fimmu.2024.1440753

PubMed Abstract | CrossRef Full Text | Google Scholar

Xiong, X., Du, Y., Liu, P., Li, X., Lai, X., Miao, H., et al. (2025). Unveiling EIF5A2: a multifaceted player in cellular regulation, tumorigenesis and drug resistance. Eur. J. Pharmacol. 997, 177596. doi:10.1016/j.ejphar.2025.177596

PubMed Abstract | CrossRef Full Text | Google Scholar

Yan, Y., Kolachala, V., Dalmasso, G., Nguyen, H., Laroui, H., Sitaraman, S. V., et al. (2009). Temporal and spatial analysis of clinical and molecular parameters in dextran sodium sulfate induced colitis. PLoS One 4, e6073. doi:10.1371/journal.pone.0006073

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, B., Sun, K., Hui, Y., Zhu, L., Wang, S., Ma, S., et al. (2020). Large population-based study using the SEER database: is endoscopic resection appropriate for early gastric cancer patients in the United States? Scand. J. Gastroenterol. 55, 834–842. doi:10.1080/00365521.2020.1786158

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, J., Jiang, X., Chen, Y., and Teng, L. (2023). EIF5A2 promotes doxorubicin resistance in bladder cancer cells through the TGF-β signaling pathway. Discov. Med. 35, 1167–1176. doi:10.24976/Discov.Med.202335179.113

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, G., Wang, L.-G., Han, Y., and He, Q.-Y. (2012). clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS 16, 284–287. doi:10.1089/omi.2011.0118

PubMed Abstract | CrossRef Full Text | Google Scholar

Zheng, Q., Wang, L., Zhang, Y., Peng, J., Hou, J., Wang, H., et al. (2025). Immune microenvironment characterization and machine learning-guided identification of diagnostic biomarkers for ulcerative colitis. J. Inflamm. Res. 18, 8977–8992. doi:10.2147/JIR.S526325

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhu, N., Ge, X., Zhang, L., Chen, X., Xiang, W., and Mei, Q. (2025). HMGB1 exacerbates intestinal barrier damage by inducing ferroptosis through the TLR4/NF-κB/GPX4 pathway in ulcerative colitis. Mediat. Inflamm. 2025, 2395557. doi:10.1155/mi/2395557

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: ulcerative colitis, multi-omics integration, diagnostic biomarkers, immune infiltration analysis, regulatory network

Citation: Zhou J, Zhao W, Lin Y, Yang B, Sun D and Liu Z (2025) Multi-omics integration analysis based on plasma circulating proteins reveals potential therapeutic targets for ulcerative colitis. Front. Mol. Biosci. 12:1686282. doi: 10.3389/fmolb.2025.1686282

Received: 15 August 2025; Accepted: 03 November 2025;
Published: 20 November 2025.

Edited by:

Padhmanand Sudhakar, Kumaraguru College of Technology, India

Reviewed by:

Sanjit Roy, Glycomontra Inc., United States
Tzu-Hurng Cheng, China Medical University, Taiwan

Copyright © 2025 Zhou, Zhao, Lin, Yang, Sun and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Dongjie Sun, MjgyMjIwMjc4QHFxLmNvbQ==; Zhu Liu, MTg3Njk3ODk1MDBAMTYzLmNvbQ==

†These authors have contributed equally to this work and share first authorship

‡These authors have contributed equally to this work and share last authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.