Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Immunol., 04 February 2026

Sec. Autoimmune and Autoinflammatory Disorders : Autoimmune Disorders

Volume 17 - 2026 | https://doi.org/10.3389/fimmu.2026.1761476

This article is part of the Research TopicDynamics of Stromal-Immune Interactions: Advances in -Omics Technologies and Implications for Health and DiseaseView all 5 articles

Integrated single-cell and transcriptomic profiling identifies machine-learning–based pyroptosis biomarkers in IBD

Yao Lu&#x;Yao Lu1†Yahui Lin&#x;Yahui Lin2†Yuansen LiYuansen Li2Huan SaiHuan Sai1Cheng Chen*Cheng Chen2*Jinjiao Li*Jinjiao Li3*
  • 1Department of Gastroenterology, The People’s Hospital of Wenshan Prefecture, Wenshan, Yunnan, China
  • 2Department of Gastrointestinal and Hernia Surgery, The First Affiliated Hospital of Kunming Medical University, Kunming, Yunnan, China
  • 3Department of Breast, Thyroid and Burn Surgery, The People’s Hospital of Wenshan Prefecture, Wenshan, Yunnan, China

Background: Pyroptosis, an inflammatory form of programmed cell death, contributes to intestinal inflammation in inflammatory bowel disease (IBD), but the key cell types and regulatory genes remain unclear.

Methods: We analyzed single-cell RNA-seq data from intestinal mucosa to assess pyroptosis-related gene expression signatures using AUCell, AddModuleScore, and GSVA. High- and low-pyroptosis cells were compared to identify core genes. Findings were validated in bulk peripheral blood transcriptomic data. Machine learning (LASSO, GBM, SVM, Boruta, Random Forest) identified optimal diagnostic genes, and functional enrichment explored biological roles.

Results: Pyroptosis was most active in macrophages, epithelial cells, and neutrophils. Six cell subsets consistently exhibited high pyroptosis. Nineteen core pyroptosis-related genes were identified, 16 of which were upregulated in both mucosal and blood samples. Functional analysis linked these genes to apoptosis and reactive oxygen species pathways. Machine learning highlighted six key diagnostic genes—BASP1, LITAF, NAMPT, PHACTR1, PLAUR, PPIF—with BASP1 showing the strongest performance (AUC = 0.935).

Conclusions: Pyroptosis is highly active in specific immune and epithelial cells in IBD. Six identified genes show potential as non-invasive diagnostic biomarkers, offering insights into disease mechanisms and therapeutic targets.

Introduction

Inflammatory bowel disease (IBD) has emerged as a chronic disorder with a steadily increasing global incidence, placing a substantial and ongoing burden on healthcare systems and patient quality of life (1). IBD, comprising ulcerative colitis (UC) and Crohn’s disease (CD), is characterized by chronic, relapsing intestinal inflammation and epithelial damage. Its pathogenesis is complex (2), involving genetic susceptibility, environmental triggers, gut microbiota dysbiosis, and immune dysregulation. Pyroptosis, a pro-inflammatory form of programmed cell death mediated by Gasdermin family proteins, plays a key role in regulating mucosal innate immune responses and intestinal pathogen defense. By activating multiple inflammatory pathways through damage-associated signaling, pyroptosis may contribute to chronic intestinal inflammation (3). Nevertheless, the specific cellular mechanisms and genetic factors that govern pyroptosis in IBD remain largely unexplored (4). Investigating pyroptosis within intestinal immune and epithelial cells could provide novel insights into IBD pathogenesis and offer potential avenues for improved diagnosis and therapeutic intervention (46).

Recent advances in single-cell RNA sequencing (scRNA-seq) have enabled high-resolution dissection of the intestinal mucosa, allowing the identification of heterogeneous cell populations and their transcriptional states (7). This technology provides opportunities to uncover cell-type-specific alterations and molecular programs underlying IBD pathogenesis. Integrating scRNA-seq with bulk transcriptomic data further facilitates the validation of key molecular features across local (mucosal) and systemic (peripheral blood) compartments, offering robust biomarkers with potential clinical relevance (8). In addition, machine learning–based feature selection provides a powerful strategy to identify optimal diagnostic genes from complex multi-omics datasets (9).

In this study, we systematically profiled pyroptosis-related activity across intestinal mucosal cell populations in IBD using single-cell RNA sequencing, and explored potential pyroptosis-associated regulatory genes. These candidates were further examined using bulk transcriptomic datasets. In addition, multiple machine learning approaches were applied to refine and prioritize a minimal set of putative diagnostic feature genes. Collectively, these analyses may provide additional perspectives on the molecular regulation of IBD and offer potential clues for biomarker development in disease diagnosis and therapeutic decision-making.

Methods

Data acquisition and processing

The single-cell RNA sequencing (scRNA-seq) data used in this study were obtained from the GSE214695 dataset, which includes samples from 6 patients with UC, 6 patients with CD, and 6 healthy controls (HC) (10). Based on previous studies (3, 4), we selected 10 pyroptosis-related genes associated with the canonical inflammasome pathway (NLRP1, NLRP3, NLRC4, AIM2, Pyrin, PYCARD, CASP1, IL1B, IL18, and GSDMD). After quality control, cells with fewer than 200 or more than 2,500 detected genes or with a mitochondrial gene proportion exceeding 5% were excluded, resulting in high-quality cells for downstream analysis. The top 2,000 highly variable genes were identified using the “vst” method to provide a stable basis for subsequent dimensionality reduction and clustering (11). To mitigate potential technical variation across samples, batch effects were corrected using the Harmony algorithm implemented in Seurat, using principal components derived from PCA with default parameters (theta = 2, lambda = 1) (12). Cells were clustered using Seurat’s graph-based clustering approach with a resolution parameter set to 0.5. Based on the batch-corrected data, unsupervised clustering was performed using the Seurat package with a resolution of 0.5, and UMAP was applied for dimensionality reduction and visualization, revealing diverse intestinal mucosal cell subpopulations (13). Cell annotation was conducted by integrating the original nanostring_reference information with known cell marker genes, ultimately identifying 54 distinct cell subpopulations based on the nanostring_reference for downstream analysis, encompassing various cell types (10). For bulk RNA-seq data, external validation was performed using peripheral blood bulk transcriptome data (GSE3365) (14).

Gene-set scoring and correlation analysis in scRNA-seq

pyroptosis-related gene expression signatures was quantified using AUCell (15), AddModuleScore (16), and GSVA (17) based on a curated pyroptosis gene set. AUCell calculated the AUC of ranked gene expression per cell; AddModuleScore generated normalized module scores; GSVA estimated pathway-level enrichment for each cell or cluster. Cells were divided into high- and low-pyroptosis groups according to score distribution. Differential expression was analyzed using Seurat’s FindMarkers (Wilcoxon test). Gene–pyroptosis associations were assessed using Spearman correlation between gene expression and pyroptosis scores across cells or clusters. Subsequently, Gene Ontology (GO) enrichment analysis was performed using the clusterProfiler R package to explore the potential biological mechanisms associated with these differentially expressed genes.

Machine learning-based identification of optimal diagnostic genes

To evaluate the diagnostic potential of candidate genes identified from single-cell analysis, we applied an integrative machine learning–based feature selection framework to an independent peripheral blood transcriptomic dataset. Gene expression normalization and feature selection were conducted on the complete peripheral blood dataset in the context of gene prioritization.

To identify the most robust diagnostic markers, we integrated five machine learning algorithms, including LASSO (18), GBM (19), SVM-RFE (20), Boruta (21), and random forest (22). To ensure reproducibility, all machine learning analyses were performed using a fixed random seed (set.seed(123)). LASSO regression was implemented using the glmnet package, in which an L1 penalty was applied to shrink coefficients and eliminate redundant variables, with 10-fold external validation used to determine the optimal regularization parameter (λ). Gradient boosting machine (GBM) models were constructed using default parameters (n.trees = 100, interaction.depth = 1, shrinkage = 0.1), and random forest models were built with default settings (including ntree = 500), to rank feature importance based on ensemble learning and thereby improve the stability of feature selection. SVM-RFE was used to iteratively remove the least informative genes and retain the most discriminative features, and was implemented using a linear kernel with default settings. In parallel, the Boruta algorithm was employed to identify all relevant variables by comparing their importance with that of randomly permuted shadow features, and was applied using default parameters (maxRuns = 100, pValue = 0.01), thereby ensuring comprehensive feature extraction. Finally, the intersecting genes identified by all five algorithms were defined as the optimal diagnostic gene set for further analysis.

Rather than constructing a standalone predictive classifier, these machine learning methods were applied primarily for robust feature selection and ranking. Genes consistently identified by all five algorithms were defined as the final diagnostic gene set. ROC analyses were performed using the same peripheral blood transcriptomic dataset employed for feature selection, without a separate held-out testing set, with the primary objective of gene prioritization.

Statistical analysis

All statistical analyses were conducted in R (version 4.4.0). Group differences were assessed using Student’s t-test or Wilcoxon rank-sum test as appropriate, and multiple-group comparisons used one-way ANOVA or Kruskal–Wallis tests. Correlations between pyroptosis scores and gene expression were evaluated using Spearman’s correlation. Differentially expressed genes were identified with Benjamini–Hochberg FDR–adjusted P values < 0.05. A two-sided P < 0.05 was considered statistically significant.

Result

Single-cell RNA-seq analysis of IBD

Prior to downstream analyses, we performed stringent quality control and selected highly variable genes, resulting in a high-quality dataset (Figures 1A, B). Batch effects across samples were then corrected, leading to a more stable distribution (Figure 1C). Following the standard Seurat workflow, we conducted clustering and identified 18 distinct cell subpopulations (Figure 1D). Given that the GSE214695 dataset provides high-quality cell-type annotations that are highly consistent with our clustering results, all subsequent analyses were based on the original annotations (Figure 1E). Comparing cell compositions across different pathological states revealed substantial differences among healthy controls, ulcerative colitis, and Crohn’s disease (Figure 1F). Notably, several immune cell subsets were markedly increased in inflammatory conditions, suggesting that these cells may play key roles in the pathogenesis of IBD.

Figure 1
Panel A shows violin plots displaying the distribution of percent mitochondrial reads, nFeature_RNA, and nCount_RNA across different sample groups. Panel B is a scatter plot illustrating marker gene expression. Panels C and D depict UMAP plots of cell clustering by sample group and identified clusters, respectively. Panel E is a UMAP plot showing cell type annotations by nanospring markers. Panel F displays a stacked bar chart representing the proportion of various cell types for each group, with a legend indicating cell type colors.

Figure 1. Cellular subpopulation landscape and characteristic analysis. (A) Quality control of single-cell RNA-sequencing data. (B) Identification of highly variable genes. (C) Batch effect correction. (D) UMAP dimensionality reduction and clustering based on the Seurat algorithm, illustrating the distribution and classification of intestinal mucosal cells. (E) Cell annotation revealed 54 distinct cellular subpopulations, including immune and epithelial cell types. (F) Proportional distribution of each cellular subpopulation across healthy controls, ulcerative colitis, and Crohn’s disease groups.

Pyroptosis in single-cell RNA-seq data

We evaluated the expression of pyroptosis-related gene sets at the single-cell level using the AUCell and AddModuleScore algorithms. pyroptosis-related gene expression signatures exhibited substantial heterogeneity across different cell types; specifically, these genes were more highly enriched in macrophages, intestinal epithelial cells, and neutrophils (Figure 2A). Both the Crohn’s disease and ulcerative colitis groups showed higher overall pyroptosis scores than healthy controls (HC) across multiple immune and epithelial cell populations (Figures 3B, C). Using AUCell, AddModuleScore, and GSVA, we assessed pyroptosis-related gene expression signatures across the three groups within each cell type (Supplementary Material 1). Across all three scoring methods, six cell populations consistently exhibited higher pyroptosis-related gene expression in both the UC and CD groups: colonocytes, inflammatory monocytes, N1, N2, N3, and M1. By stratifying cells into high- and low-pyroptosis groups according to average expression levels, we further demonstrated that these genes were predominantly expressed within the neutrophil clusters (Figure 2E).

Figure 2
Composite image showcasing various data visualizations related to Pyroptosis scores. Panel A displays violin plots of AddModuleScore and AUCell Scores across different cell types. Panels B and C present scatter plots comparing these scores by group, with significant differences noted (p-value < 2.2e-16). Panel D features a bar chart illustrating intersection sizes. Panel E shows a UMAP plot contrasting high and low pyroptosis scores. Panels F and G depict volcano plots highlighting gene expressions with significant changes. Panel H displays a correlation plot of genes and pyroptosis scores. Panel I provides a Venn diagram of overlapping gene sets.

Figure 2. Pyroptosis scoring and differential analysis in single-cell data. (A) Pyroptosis-related gene set scores calculated across different cell types using the AUCell and AddModuleScore algorithms. (B, C) Differences in pyroptosis-related gene expression signatures scores between healthy controls (HC) and inflammatory bowel disease (IBD) groups (B: AddModuleScore; C: AUCell). (D) Cell types showing consistently elevated pyroptosis scores across AUCell, GSEA, and AddModuleScore. (E) Cells stratified into high- and low-pyroptosis groups based on pyroptosis scores. (F, G) Percentage changes and log-fold changes of differentially expressed genes between high- and low-pyroptosis groups (F: AddModuleScore; G: AUCell). (H) Correlation analysis between pyroptosis scores and the overall expression of pyroptosis-related genes. (I) Venn diagram identifying key genes closely associated with pyroptosis.

Figure 3
Panel A shows expression levels of various genes comparing control and IBD groups with significant differences marked. Panel B illustrates pathways involved in apoptotic signaling, and chemical carcinogenesis with relative significance. Panel C is a heatmap of gene expression levels across samples, clustering control and IBD groups. Panel D depicts a network visualization of gene interactions with combined scores indicating interaction strength.

Figure 3. Cross-analysis of key genes based on bulk RNA-seq data. (A) Differential expression of the 16 overlapping genes identified from scRNA-seq and bulk transcriptomic datasets. (B) GO and KEGG enrichment analysis results. (C) Heatmap showing the expression patterns of these 16 genes in control and IBD samples. (D) Gene interaction network.

To identify genes associated with pyroptosis-related gene expression signatures, we categorized cells into high-pyroptosis and low-pyroptosis groups based on the AUCell and AddModuleScore pyroptosis scores, respectively. Differential expression analysis between the two groups was performed using Seurat’s FindMarkers function (Wilcoxon rank-sum test). Using the AUCell-derived scores, we identified 118 upregulated genes, whereas the AddModuleScore-based scoring yielded 14 upregulated and 140 downregulated genes (FDR < 0.05; FindMarkers criteria: |log2FC| > 1, min.pct > 0.1). To further determine genes most closely associated with pyroptosis, we conducted correlation analysis and identified 96 genes significantly correlated with pyroptosis-related gene expression signatures (r > 0.3, FDR < 0.05) (Figure 2H). To avoid over-filtering and potentially missing important candidates, all DEGs identified by either scoring method (FDR < 0.05, |log2FC| > 0) were retained for constructing the intersections shown in Figure 2I. The intersection between the correlated genes and the differentially expressed genes resulted in 19 pyroptosis-associated genes (Figure 2I).

Cross-analysis of overlapping genes based on bulk transcriptomic data

Given that IBD is characterized by both localized intestinal mucosal inflammation and systemic immune activation, we performed external validation by evaluating candidate genes identified from intestinal mucosal scRNA-seq analysis in an independent peripheral blood bulk transcriptomic dataset (GSE3365). Although the two datasets originate from different biological sources, the consistency of key gene expression patterns at both local (mucosa) and systemic (peripheral blood) levels would further support their stable and broad biological relevance in IBD. To verify the reliability of the 19 selected genes, we further evaluated their expression profiles in the bulk transcriptome dataset. Cross-comparison between scRNA-seq–identified genes and bulk data revealed 16 overlapping genes. All 16 genes were significantly upregulated in the IBD group (Figure 3A). GO and KEGG enrichment analyses indicated that these genes were primarily involved in the regulation of intrinsic and overall apoptotic signaling pathways and were closely associated with reactive oxygen species–mediated chemical carcinogenesis (Figure 3B). A heatmap further supported these findings, showing a consistent high-expression pattern of the 16 genes in the IBD samples (Figure 3C). Among these upregulated genes, SOD2, PPIF, HIF1A, NAMPT, ICAM1, and PLEK exhibited higher DC scores and formed a tightly connected interaction network with the other genes (Figure 3D).

Identification of optimal genes using machine learning

We applied five machine learning algorithms to screen for signature genes, including the LASSO algorithm (Figure 4A), GBM algorithm (Figure 4B), SVM-RFE algorithm (Figure 4C), Boruta algorithm (Figure 4D), and random forest model (Figure 4E). Intersection analysis of the marker genes identified by all five algorithms yielded six characteristic genes: BASP1, LITAF, NAMPT, PHACTR1, PLAUR, and PPIF (Figure 4F). We further evaluated the diagnostic performance of these six genes in peripheral blood. ROC curve analysis (Figure 4G) showed that all six genes exhibited good diagnostic value, with BASP1 displaying the strongest diagnostic ability (AUC = 0.935), followed by PLAUR (AUC = 0.840), NAMPT (AUC = 0.825), PPIF (AUC = 0.742), LITAF (AUC = 0.733), and PHACTR1 (AUC = 0.666).

Figure 4
Composite image featuring:  A) LASSO regression analysis with coefficient path, cross-validation plot, and feature importance bar chart. B) Bar chart showing GBM feature importance for various genes. C) SVM-RFE performance curve and feature importance bar chart. D) Boxplot depicting Boruta feature importance across attributes. E) Random forest plot displaying mean decrease Gini for gene groups. F) Upset plot with bars illustrating interactions and set sizes among gene groups. G) ROC curves for candidate genes with AUC values annotated.

Figure 4. Identification and diagnostic evaluation of optimal feature genes by machine learning. (A–E) Feature gene selection using five machine learning algorithms: LASSO (A), GBM (B), SVM (C), Boruta (D), and Random Forest (E). Each algorithm identifies candidate genes with predictive relevance. (F) Intersection analysis of the genes selected by all five algorithms. (G) Receiver operating characteristic (ROC) curve analysis evaluating the diagnostic performance of the six genes in peripheral blood.

Discussion

Pyroptosis, as a highly inflammatory form of programmed cell death, plays a pivotal role in the onset and progression of IBD. Numerous studies have shown that pyroptosis not only disrupts intestinal epithelial barrier integrity but also regulates the activation states of multiple immune cell populations, thereby amplifying intestinal inflammatory cascades (4, 23). Thus, pyroptosis represents a critical entry point for understanding the immunopathogenesis of IBD and offers a promising therapeutic target for precision intervention (24).

By integrating AUCell, AddModuleScore, and GSVA scoring methods, we identified six immune and epithelial subpopulations (colonocytes, inflammatory monocytes, N1, N2, N3, and M1) that consistently exhibited elevated pyroptosis-related gene expression signatures. Differential expression and correlation analyses further revealed a set of 19 core pyroptosis-associated genes, which were strongly supported by bulk transcriptomic data from peripheral blood. Sixteen genes overlapped between mucosal and systemic datasets, underscoring their potential as stable biomarkers and highlighting the systemic consequences of local intestinal inflammation. Functional enrichment analyses indicated that these genes are primarily involved in apoptotic regulation and reactive oxygen species–mediated processes, suggesting mechanistic links among pyroptosis, oxidative stress, and epithelial injury.

Notably, Machine learning identified six diagnostic genes (BASP1, LITAF, NAMPT, PHACTR1, PLAUR, and PPIF) with strong predictive performance in peripheral blood. BASP1 showed the highest diagnostic accuracy and may represent a key gene associated with pyroptosis-related inflammation. By integrating intestinal mucosal single-cell data with peripheral blood bulk transcriptomes, our multi-cohort analysis consistently supported BASP1 as an IBD-associated molecule linked to pyroptosis-related gene expression signatures, providing the first evidence for this potential connection. Previous studies have linked BASP1 to IBD through genetic susceptibility (25) and immune infiltration (26) analyses, including the Immunochip meta-analysis and WGCNA-based UC study, yet its role in pyroptosis remains unexplored.

The remaining five genes differ in their biological relevance to pyroptosis, representing distinct modules involving metabolism, mitochondrial stress, and inflammatory transcriptional regulation. PLAUR, a classical mediator of inflammation and tissue remodeling, is upregulated in IBD mucosa and in cytokine-induced epithelial barrier disruption (27), serving as an important marker of epithelial hypoxia responses and mucosal repair in IBD (28). Although PLAUR may interact with IL-1β production and inflammasome activation (29), its influence on pyroptosis appears to be indirect—primarily enhancing inflammatory signaling rather than functioning as a canonical GSDM-dependent pyroptosis gene. NAMPT, a dual mediator of inflammation and metabolism, is markedly elevated in IBD, reflecting both inflammatory burden and tissue hypoxia (30), and correlates strongly with responses to anti-TNF therapy (31); moreover, neutralizing extracellular NAMPT ameliorates disease severity in experimental colitis (32). Its prominent performance in our analysis suggests that metabolic–inflammatory coupling may drive pyroptosis-related activity, particularly in neutrophils. In contrast, PPIF, a key regulator of the mitochondrial permeability transition pore (33), is more closely associated with mitochondrial stress and necrotic cell death (34) rather than GSDMD-mediated pyroptotic execution. LITAF, representing the inflammatory transcriptional regulatory axis, is a critical regulator of TNF-α with established pro-inflammatory roles in IBD (35). Although direct evidence linking LITAF to pyroptosis execution is lacking, its amplification of TNF-α pathways may sensitize upstream inflammasome signaling. Finally, PHACTR1 has relatively weak evidence in the context of IBD, and no studies have yet linked it to either IBD pathogenesis or pyroptosis.

Our single-cell analysis revealed an increase in pyroptosis-related gene signatures within intestinal epithelial cells in IBD. This elevation, however, is most likely reflective of inflammation-induced transcriptional activation and epithelial restitution demands rather than bona fide execution of canonical GSDMD-dependent pyroptosis, as supported by recent mechanistic studies (5). Notably, although neutrophil pyroptosis has been implicated in other inflammatory disorders (36, 37), evidence in the context of IBD has been lacking. Our data provide the first indication that neutrophils exhibit markedly elevated pyroptosis-associated signatures in IBD, suggesting a previously underestimated contribution of neutrophil-driven inflammatory cell death pathways to mucosal inflammation. In line with prior literature, macrophages remain the major immune cell population with established pyroptotic activation in IBD (24). Together, these findings delineate distinct cell-type-specific patterns of pyroptosis-related responses and highlight neutrophils as a potentially overlooked contributor to intestinal inflammation.

Several methodological and interpretative limitations should be acknowledged. First, all analyses were performed using publicly available scRNA-seq and bulk transcriptomic datasets, which may introduce unavoidable batch effects and sample heterogeneity (disease activity, duration, treatment history, intestinal location) despite rigorous quality control. The merging of UC and CD patients into a single “IBD group” for analysis risks obscuring important biological differences between subtypes. Second, comparisons between UC and CD were based on a relatively small number of biological samples (n = 6 per group) and should therefore be interpreted as exploratory. Although batch effects were mitigated using Harmony, subtype-specific differences may still be influenced by sampling variability or unmeasured confounding factors. Importantly, the primary conclusions of this study do not rely on direct UC–CD contrasts. Third, pyroptosis activation was inferred from transcriptional signatures rather than direct functional assays, and therefore may reflect inflammatory gene programs rather than bona fide gasdermin-mediated cell death. Fourth, although multiple machine learning algorithms were integrated to enhance robustness, the diagnostic assessment of candidate genes may still be influenced by dataset-specific bias. The reported AUC values were derived from internal evaluation within a single peripheral blood cohort and should therefore be interpreted as indicative of diagnostic potential rather than definitive performance, warranting further validation in independent cohorts. Finally, the lack of in vitro and in vivo experiments limits mechanistic interpretation, and future studies are needed to experimentally confirm the regulatory roles of the identified genes in IBD-related pyroptosis.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Ethics statement

Ethical approval was not required for the study involving humans in accordance with the local legislation and institutional requirements. Written informed consent to participate in this study was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and the institutional requirements.

Author contributions

YL: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. YHL: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. YSL: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. HS: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. CC: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. JL: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. Youth Project of Yunnan Basic Research Program (202301AU07169) Scientific Research Fund of Education Department of Yunnan Province, 2023Y0811. Yunnan Wenshan Zhuang and Miao Autonomous Prefecture People’s Hospital Key Project (No. ZDXM2024001); Wenshan Prefecture 2025 Scientific Research Development Special Fund Project (No. WS-KYFZXM25002).

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2026.1761476/full#supplementary-material

References

1. Kaplan GG. The global burden of IBD: from 2015 to 2025. Nat Rev Gastroenterol Hepatol. (2015) 12:720–7. doi: 10.1038/nrgastro.2015.150

PubMed Abstract | Crossref Full Text | Google Scholar

2. Ciorba MA, Konnikova L, Hirota SA, Lucchetta EM, Turner JR, Slavin A, et al. Challenges in IBD Research 2024: Preclinical Human IBD Mechanisms. Inflammation Bowel Dis. (2024) 30:S5–S18. doi: 10.1093/ibd/izae081

PubMed Abstract | Crossref Full Text | Google Scholar

3. Rao Z, Zhu Y, Yang P, Chen Z, Xia Y, Qiao C, et al. Pyroptosis in inflammatory diseases and cancer. Theranostics. (2022) 12:4310–29. doi: 10.7150/thno.71086

PubMed Abstract | Crossref Full Text | Google Scholar

4. Zhang S, Liang Y, Yao J, Li D, and Wang L. Role of Pyroptosis in Inflammatory Bowel Disease (IBD): From Gasdermins to DAMPs. Front Pharmacol. (2022) 13:833588. doi: 10.3389/fphar.2022.833588

PubMed Abstract | Crossref Full Text | Google Scholar

5. Rana N, Privitera G, Kondolf HC, Bulek K, Lechuga S, De Salvo C, et al. GSDMB is increased in IBD and regulates epithelial restitution/repair independent of pyroptosis. Cell. (2022) 185:283–298.e17. doi: 10.1016/j.cell.2021.12.024

PubMed Abstract | Crossref Full Text | Google Scholar

6. Wang J, Tan H, Ye Z, Weng S, Shi Y, Xu J, et al. Gut microbial Nordihydroguaiaretic acid suppresses macrophage pyroptosis to regulate epithelial homeostasis and inflammation. Gut Microbes. (2025) 17:2518338. doi: 10.1080/19490976.2025.2518338

PubMed Abstract | Crossref Full Text | Google Scholar

7. Jovic D, Liang X, Zeng H, Lin L, Xu F, and Luo Y. Single-cell RNA sequencing technologies and applications: A brief overview. Clin Transl Med. (2022) 12:e694. doi: 10.1002/ctm2.694

PubMed Abstract | Crossref Full Text | Google Scholar

8. Zhang M, Xie Z, Zhang S, and Zhou G. Integrated Single-Cell Transcriptome Analysis Reveals Novel Insights into the Role of Opioid Signaling in the Pathophysiology of Inflammatory Bowel Disease. Biomedicines. (2025) 13:1398. doi: 10.3390/biomedicines13061398

PubMed Abstract | Crossref Full Text | Google Scholar

9. Chen G, Qi H, Jiang L, Sun S, Zhang J, Yu J, et al. Integrating single-cell RNA-Seq and machine learning to dissect tryptophan metabolism in ulcerative colitis. J Transl Med. (2024) 22:1121. doi: 10.1186/s12967-024-05934-w

PubMed Abstract | Crossref Full Text | Google Scholar

10. Garrido-Trigo A, Corraliza AM, Veny M, Dotti I, Melón-Ardanaz E, Rill A, et al. Macrophage and neutrophil heterogeneity at single-cell spatial resolution in human inflammatory bowel disease. Nat Commun. (2023) 14:4506. doi: 10.1038/s41467-023-40156-6

PubMed Abstract | Crossref Full Text | Google Scholar

11. Hrovatin K, Moinfar AA, Zappia L, Parikh S, Lapuerta AT, Lengerich B, et al. Integrating single-cell RNA-seq datasets with substantial batch effects. BMC Genomics. (2025) 26:974. doi: 10.1186/s12864-025-12126-3

PubMed Abstract | Crossref Full Text | Google Scholar

12. Korsunsky I, Millard N, Fan J, Slowikowski K, Zhang F, Wei K, et al. Fast, sensitive and accurate integration of single-cell data with Harmony. Nat Methods. (2019) 16:1289–96. doi: 10.1038/s41592-019-0619-0

PubMed Abstract | Crossref Full Text | Google Scholar

13. Yang L, Chen H, Yang Y, Deng Y, Chen Q, Luo B, et al. Single-cell and microarray chip analysis revealed the underlying pathogenesis of ulcerative colitis and validated model genes in diagnosis and drug response. Hum Cell. (2023) 36:132–45. doi: 10.1007/s13577-022-00801-6

PubMed Abstract | Crossref Full Text | Google Scholar

14. Burczynski ME, Peterson RL, Twine NC, Zuberek KA, Brodeur BJ, Casciotti L, et al. Molecular classification of Crohn’s disease and ulcerative colitis patients using transcriptional profiles in peripheral blood mononuclear cells. J Mol Diagn. (2006) 8:51–61. doi: 10.2353/jmoldx.2006.050079

PubMed Abstract | Crossref Full Text | Google Scholar

15. Liu Y, Li H, Zeng T, Wang Y, Zhang H, Wan Y, et al. Integrated bulk and single-cell transcriptomes reveal pyroptotic signature in prognosis and therapeutic options of hepatocellular carcinoma by combining deep learning. Brief Bioinform. (2023) 25:bbad487. doi: 10.1093/bib/bbad487

PubMed Abstract | Crossref Full Text | Google Scholar

16. Mei Y, Li M, Wen J, Kong X, and Li J. Single-cell characteristics and malignancy regulation of alpha-fetoprotein-producing gastric cancer. Cancer Med. (2023) 12:12018–33. doi: 10.1002/cam4.5883

PubMed Abstract | Crossref Full Text | Google Scholar

17. Jin Y, Wang Z, He D, Zhu Y, Chen X, and Cao K. Identification of novel subtypes based on ssGSEA in immune-related prognostic signature for tongue squamous cell carcinoma. Cancer Med. (2021) 10:8693–707. doi: 10.1002/cam4.4341

PubMed Abstract | Crossref Full Text | Google Scholar

18. Lin M, Xu F, Sun J, Song J, Shen Y, Lu S, et al. Integrative multi-omics analysis unravels the host response landscape and reveals a serum protein panel for early prognosis prediction for ARDS. Crit Care. (2024) 28:213. doi: 10.1186/s13054-024-05000-3

PubMed Abstract | Crossref Full Text | Google Scholar

19. Zhang K, Ye B, Wu L, Ni S, Li Y, Wang Q, et al. Machine learning−based prediction of survival prognosis in esophageal squamous cell carcinoma. Sci Rep. (2023) 13:13532. doi: 10.1038/s41598-023-40780-8

PubMed Abstract | Crossref Full Text | Google Scholar

20. van der Ploeg T and Steyerberg EW. Feature selection and validated predictive performance in the domain of Legionella pneumophila: a comparative study. BMC Res Notes. (2016) 9:147. doi: 10.1186/s13104-016-1945-2

PubMed Abstract | Crossref Full Text | Google Scholar

21. Yan F, Chen X, Quan X, Wang L, Wei X, and Zhu J. Association between the stress hyperglycemia ratio and 28-day all-cause mortality in critically ill patients with sepsis: a retrospective cohort study and predictive model establishment based on machine learning. Cardiovasc Diabetol. (2024) 23:163. doi: 10.1186/s12933-024-02265-4

PubMed Abstract | Crossref Full Text | Google Scholar

22. Jin Y, Lan A, Dai Y, Jiang L, and Liu S. Development and testing of a random forest-based machine learning model for predicting events among breast cancer patients with a poor response to neoadjuvant chemotherapy. Eur J Med Res. (2023) 28:394. doi: 10.1186/s40001-023-01361-7

PubMed Abstract | Crossref Full Text | Google Scholar

23. Zhou P, Zhang S, Wang M, and Zhou J. The Induction Mechanism of Ferroptosis, Necroptosis, and Pyroptosis in Inflammatory Bowel Disease, Colorectal Cancer, and Intestinal Injury. Biomolecules. (2023) 13:820. doi: 10.3390/biom13050820

PubMed Abstract | Crossref Full Text | Google Scholar

24. Chen K, Shang S, Yu S, Cui L, Li S, and He N. Identification and exploration of pharmacological pyroptosis-related biomarkers of ulcerative colitis. Front Immunol. (2022) 13:998470. doi: 10.3389/fimmu.2022.998470

PubMed Abstract | Crossref Full Text | Google Scholar

25. Hong M, Ye BD, Yang S-K, Jung S, Lee H-S, Kim BM, et al. Immunochip Meta-Analysis of Inflammatory Bowel Disease Identifies Three Novel Loci and Four Novel Associations in Previously Reported Loci. J Crohns Colitis. (2018) 12:730–41. doi: 10.1093/ecco-jcc/jjy002

PubMed Abstract | Crossref Full Text | Google Scholar

26. Qi Y, Wang Y, Zhang S, Pan X, Li W, Cheng M, et al. Identification of Key Biomarkers and Immune Microenvironment Features in Ulcerative Colitis: An Integrated Analysis Using WGCNA and Multiple Machine Learning Algorithms. J Inflammation Res. (2025) 18:13879–96. doi: 10.2147/JIR.S545790

PubMed Abstract | Crossref Full Text | Google Scholar

27. Cheng Y, Hall TR, Xu X, Yung I, Souza D, Zheng J, et al. Targeting uPA-uPAR interaction to improve intestinal epithelial barrier integrity in inflammatory bowel disease. EBioMedicine. (2022) 75:103758. doi: 10.1016/j.ebiom.2021.103758

PubMed Abstract | Crossref Full Text | Google Scholar

28. Knyazev E, Maltseva D, Raygorodskaya M, and Shkurnikov M. HIF-Dependent NFATC1 Activation Upregulates ITGA5 and PLAUR in Intestinal Epithelium in Inflammatory Bowel Disease. Front Genet. (2021) 12:791640. doi: 10.3389/fgene.2021.791640

PubMed Abstract | Crossref Full Text | Google Scholar

29. Kanno Y. The uPA/uPAR System Orchestrates the Inflammatory Response, Vascular Homeostasis, and Immune System in Fibrosis Progression. Int J Mol Sci. (2023) 24:1796. doi: 10.3390/ijms24021796

PubMed Abstract | Crossref Full Text | Google Scholar

30. Neubauer K, Bednarz-Misa I, Walecka-Zacharska E, Wierzbicki J, Agrawal A, Gamian A, et al. Oversecretion and Overexpression of Nicotinamide Phosphoribosyltransferase/Pre-B Colony-Enhancing Factor/Visfatin in Inflammatory Bowel Disease Reflects the Disease Activity, Severity of Inflammatory Response and Hypoxia. Int J Mol Sci. (2019) 20:166. doi: 10.3390/ijms20010166

PubMed Abstract | Crossref Full Text | Google Scholar

31. Colombo G, Caviglia GP, Ravera A, Tribocco E, Frara S, Rosso C, et al. NAMPT and NAPRT serum levels predict response to anti-TNF therapy in inflammatory bowel disease. Front Med (Lausanne). (2023) 10:1116862. doi: 10.3389/fmed.2023.1116862

PubMed Abstract | Crossref Full Text | Google Scholar

32. Colombo G, Clemente N, Zito A, Bracci C, Colombo FS, Sangaletti S, et al. Neutralization of extracellular NAMPT (nicotinamide phosphoribosyltransferase) ameliorates experimental murine colitis. J Mol Med (Berl). (2020) 98:595–612. doi: 10.1007/s00109-020-01892-0

PubMed Abstract | Crossref Full Text | Google Scholar

33. Baines CP, Kaiser RA, Purcell NH, Blair NS, Osinska H, Hambleton MA, et al. Loss of cyclophilin D reveals a critical role for mitochondrial permeability transition in cell death. Nature. (2005) 434:658–62. doi: 10.1038/nature03434

PubMed Abstract | Crossref Full Text | Google Scholar

34. Elrod JW, Wong R, Mishra S, Vagnozzi RJ, Sakthievel B, Goonasekera SA, et al. Cyclophilin D controls mitochondrial pore-dependent Ca(2+) exchange, metabolic flexibility, and propensity for heart failure in mice. J Clin Invest. (2010) 120:3680–7. doi: 10.1172/JCI43171

PubMed Abstract | Crossref Full Text | Google Scholar

35. Stucchi A, Reed K, O’Brien M, Cerda S, Andrews C, Gower A, et al. A new transcription factor that regulates TNF-alpha gene expression, LITAF, is increased in intestinal tissues from patients with CD and UC. Inflammation Bowel Dis. (2006) 12:581–7. doi: 10.1097/01.MIB.0000225338.14356.d5

PubMed Abstract | Crossref Full Text | Google Scholar

36. Liu L and Sun B. Neutrophil pyroptosis: new perspectives on sepsis. Cell Mol Life Sci. (2019) 76:2031–42. doi: 10.1007/s00018-019-03060-1

PubMed Abstract | Crossref Full Text | Google Scholar

37. Dejas L, Santoni K, Meunier E, and Lamkanfi M. Regulated cell death in neutrophils: From apoptosis to NETosis and pyroptosis. Semin Immunol. (2023) 70:101849. doi: 10.1016/j.smim.2023.101849w

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: inflammatory bowel disease, machine learning, pyroptosis, single-cell RNA sequencing, transcriptomic integration

Citation: Lu Y, Lin Y, Li Y, Sai H, Chen C and Li J (2026) Integrated single-cell and transcriptomic profiling identifies machine-learning–based pyroptosis biomarkers in IBD. Front. Immunol. 17:1761476. doi: 10.3389/fimmu.2026.1761476

Received: 05 December 2025; Accepted: 19 January 2026; Revised: 29 December 2025;
Published: 04 February 2026.

Edited by:

Barbara Molon, University of Padua, Italy

Reviewed by:

Chen-Yang Su, McGill University and Génome Québec Innovation Centre, Canada
Yitian Yang, Henan Provincial People’s Hospital, China

Copyright © 2026 Lu, Lin, Li, Sai, Chen and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Cheng Chen, Y2NSYWNoZWwxMjExQDE2My5jb20=; Jinjiao Li, NTI4OTczMjU1QHFxLmNvbQ==

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.