ORIGINAL RESEARCH article

Front. Immunol., 26 February 2026

Sec. Cancer Immunity and Immunotherapy

Volume 17 - 2026 | https://doi.org/10.3389/fimmu.2026.1775776

Shared immune dysregulation in systemic lupus erythematosus and colorectal cancer: a multi-omics guided discovery of DNASE1L3-centric efferocytosis deficiency

  • 1. Department of Radiation Oncology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China

  • 2. Department of Gynecology, The First People’s Hospital of Yunnan Province, The Affiliated Hospital of Kunming University of Science and Technology, Kunming, Yunnan, China

  • 3. Department of General Surgery, The Seventh Affiliated Hospital of Sun Yat-sen University, Shenzhen, Guangdong, China

  • 4. Center for Translational Medicine, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China

  • 5. Department of Biochemistry, SUSTech Homeostatic Medicine Institute, School of Medicine, Southern University of Science and Technology, Shenzhen, Guangdong, China

  • 6. Department of Nephrology, Center of Kidney and Urology, The Seventh Affiliated Hospital of Sun Yat-sen University, Shenzhen, Guangdong, China

  • 7. Department of Endocrinology and Metabolism, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China

Article metrics

View details

140

Views

52

Downloads

Abstract

Background:

Although immune dysregulation is implicated in both autoimmune diseases and cancer, comparative pathogenesis and immune response mechanisms between systemic lupus erythematosus (SLE) and colorectal cancer (CRC) remain elusive. This study identifies common molecular biomarkers and pathogenic pathways shared between SLE and CRC via multi-omics analysis.

Methods:

Integrated datasets including SLE (GSE61635, GSE50772, GSE142016) and CRC cohorts (GSE39582, GSE17536, E-MTAB-8107, COAD, READ), were analyzed. Differential expression analysis identified shared genes, and machine learning screened four hub prognostic genes. A risk model based on these genes was constructed to evaluate SLE screening and CRC prognosis. Multi-omics approaches explored mutational profiles, immune infiltration, drug sensitivity, and signaling pathways. Reverse transcription-quantitative polymerase chain reaction (RT-qPCR), staining were performed to confirmed the expression levels of the hub genes. The impact of DNASE1L3 knockdown in the human monocyte cell line THP-1 on phagocytosis of apoptotic intestinal epithelial cell line NCM460 was evaluated using RT-qPCR and flow cytometry.

Results:

We identified 58 shared differentially expressed genes (DEGs) between SLE and CRC, enriched in apoptosis, oxidative phosphorylation, and immune infiltration. Machine learning highlighted four hub genes (DNASE1L3, PTPN14, SELENBP1, ECRG4), forming a robust predictive model for SLE occurrence and CRC prognosis. Shared immune infiltration patterns and small-molecule drug candidates were observed. Single-cell and spatial transcriptomic analyses revealed DNASE1L3 predominantly in myeloid cells. Cellular experiments revealed that reduced DNASE1L3 levels significantly compromised macrophage efferocytosis of apoptotic cells, accompanied by a decrease in M2 macrophage proportion, an increase in M1 macrophage proportion, diminished LOX enzyme activity, and elevated levels of interleukin-1 β and tumor necrosis factor-α.

Conclusion:

SLE and CRC exhibit overlapping DEGs, immune profiles, and signaling pathways. The model based on the shared genes—DNASE1L3, PTPN14, SELENBP1, and ECRG4—offers novel insights for precise intervention in both diseases.

Introduction

Systemic lupus erythematosus (SLE) is a chronic, systemic, autoimmune, inflammatory, connective tissue disease (1). The global incidence of SLE varies from about 1.5 to 11 per 100,000 person-years, with a global prevalence of 13 to 7,713.5 per 100,000 people (2). The pathogenesis mechanism of SLE is complex and is closely related to ultraviolet light exposure, high estrogen, genetics, and drugs (3). The clinical manifestations of SLE are highly heterogeneous and can involve almost every organ, including the skin, joints, lungs, kidneys, nervous system and gastrointestinal tract, etc. (4). In addition to renal failure, cardiovascular disease and infections, tumors are also an important cause of death in SLE. In recent years, growing evidence has revealed a strong association between SLE and cancer. Several large cohort studies have observed a significantly increased risk of malignancy in patients with SLE (5, 6).

In 2022, there were 20 million new tumor cases and over 9.7 million deaths worldwide. Colorectal cancer (CRC) ranked third in cancer incidence and second in cancer-related mortality, with 1,926,118 new cases (9.6% of all cases) and 903,859 deaths (9.3% of all cancer deaths) (7). Genetics, poor dietary and lifestyle, and inflammatory bowel disease were high risk factors for the development of CRC. Several studies revealed SLE was an independent risk factor for CRC (8). Relevant evidence suggests that anti-tumor therapy (e.g. capecitabine and fluorouracil) may induce SLE (9, 10).

Several potential mechanisms may explain the association between SLE and CRC. First, imbalance of lactic acid metabolism, persistent and severe chronic inflammation are common risk factors for both SLE and CRC (11, 12). SLE is typically characterized by immune dysfunction, with the release of pro-inflammatory cytokines (e.g., IL-6, TNF-α, IL-17, etc.), which further provides a pro-tumor inflammatory response environment for the development of CRC. For example, IL-6 activates the JAK/STAT3, RAS/MAPK, and PI3K/AKT signaling pathways and exerts anti-apoptotic and tumorigenic effects (13). The presence of a large number of autoantigens in SLE patients activates Toll-like receptors, which promote inflammation and transform pre-cancerous cells into cancerous cells through the MyD88/NF-κB signaling pathway (14). In addition, therapeutic drugs for SLE patients, such as glucocorticoids, immunosuppressive drugs (e.g., methotrexate, cyclophosphamide), and biologics (e.g., TNF-α tumor inhibitors), increase the risk of tumors (15, 16). Furthermore, high levels of autoantigens and autoantibodies, immune dysregulation, genetic susceptibility, and environmental factors may also be pathogenic mechanisms shared by SLE and CRC.

Numerous studies have shown that the link between autoimmune diseases and cancer is dynamic and bidirectional. Disruption of immune tolerance leads to SLE and dysregulation of immune surveillance leads to tumorigenesis. Immunity, although it appears to function differently at the onset of disease, the underlying mechanisms are shared (17). Patients with SLE combined with tumors have a high disease burden and a poor prognosis. There is also a correlation between the outcome of treatment, long-term survival, and the development of autoimmune diseases in tumor patients. Bruton’s tyrosine kinase inhibitors, IL-6 inhibitors, and IL-38 have been shown to be effective in both SLE and CRC (13, 18, 19). Therefore, the common pathogenic mechanisms and therapeutic targets of SLE and CRC deserve in-depth study.

In this study, we first identified common differentially expressed genes (DEGs) from the GSE39582 and GSE61635 datasets. Subsequently, hub genes were screened through univariate Cox regression, least absolute shrinkage and selection operator (LASSO) regression, and support vector machine-recursive feature elimination (SVM-RFE), establishing sample-specific risk scoring. We then explored shared molecular pathways, immune microenvironment features, and therapeutic compounds between both diseases. A hub gene-based nomogram model demonstrated robust predictive performance for diagnosing SLE and CRC, as well as predicting CRC survival outcomes. The CRC risk score was validated using GSE17536, COAD, and READ datasets, followed by comparisons of biological functions, genomic alterations, and stemness characteristics between high-risk and low-risk CRC populations. Additionally, single-cell datasets (EMTAB-8107 and GSE142016) and spatial transcriptomics were leveraged to investigate hub gene expression patterns and spatial distribution. Finally, hub gene expression and functional relevance was validated in vitro. This research elucidates shared biomarkers and pathogenic mechanisms between SLE and CRC, facilitating early disease identification and intervention while guiding personalized therapeutic strategies.

Methods

Data collection

GSE61635 and GSE50772 dataset of peripheral blood mononuclear cells (PBMC) from SLE patients, were downloaded from the Gene Expression Omnibus (GEO, https://www.ncbi.nlm.nih.gov/geo/) database. The GSE61635 dataset, comprising 99 SLE samples and 30 control samples, was used as the training cohort. The GSE50772 dataset, including 61 SLE samples and 20 control samples, was used as the validation cohort. Two datasets of CRC patients, including GSE39582 and GSE17536, were retrieved from GEO database as well (20, 21). The dataset of colon adenocarcinoma (COAD) and rectum adenocarcinoma (READ) was obtained from The Cancer Genome Atlas (TCGA) database via UCSC Xena website (https://xena.ucsc.edu/) (22). The GSE39582 dataset, as a training set, consisted of 566 tumor samples and 19 control samples. GSE17536, as well as the COAD and READ datasets, were used as external validation cohorts. GSE17536 comprises 177 tumor samples. The COAD dataset includes 448 tumor and 39 control samples, while the READ dataset consists of 158 tumor and 9 control samples. The expression data from GEO database was generated using the GPL570 platform. Single-cell RNA sequencing data from 7 CRC patients in the E-MTAB-8107 dataset were downloaded from the TISCH2 database (http://tisch.compbio.cn) (23). Single-cell data from 3 SLE patients in the GSE142016 dataset were obtained from the GEO database (24).

DEGs screening and functional analysis

DEGs in GSE39582 and GSE61635 datasets were screened out and visualized in the form of volcano plots using the “limma” and “ggplot2” packages in R software. |Log2 (fold change)| >1 and adjusted P value ≤ 0.05, with multiple testing correction performed using the Benjamini–Hochberg method were considered to indicate statistical significance. Subsequently, the common DEGs in both CRC and SLE datasets was shown in a Venn diagram. The expression patterns of common DEGs were visualized in the form of heatmaps with “pheatmap” package. Moreover, in order to analyze the biological functions and pathways involved in common genes, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were performed using The Database for Annotation, Visualization, and Integrated Discovery (DAVID, https://davidbioinformatics.nih.gov/), with terms/pathways with adjusted P value (Benjamini–Hochberg) < 0.05 were considered significantly enriched Additionally, a protein-protein interaction (PPI) network was constructed using STRING (https://string-db.org/) to evaluate potential functional associations among these shared genes.

Connectivity map analysis

Relationships between common DEGs and small molecule therapeutics were explored via the cMAP database (https://clue.io) (25). The 31 upregulated DEGs were incorporated into the cMAP platform, resulting in identification of the top 10 compounds exhibiting maximal enrichment scores. Chemical structure of small molecular compounds was illustrated via ChemSpider database (http://www.chemspider.com) and visualized with Chemdraw software (version 22.0.0.22).

Machine learning selection of characteristic biomarkers

Signature genes for CRC and SLE were identified using three machine learning methods: univariate Cox regression, LASSO, and SVM-RFE algorithm. First, prognostic genes were screened from common DEGs using univariate Cox regression implemented via the “survival” package. Subsequently, the LASSO regression model was applied using the “glmnet” package, where the hyperparameter lambda (λ) minimized regression coefficients toward zero to exclude weakly correlated features, with optimal parameters determined through 5-fold cross-validation (26). Concurrently, SVM-RFE was employed using the “mlbench” and “caret” packages to iteratively eliminate the least important feature vectors (27). Finally, characteristic genes identified by LASSO and SVM-RFE for CRC and SLE were intersected, yielding shared signature genes for downstream analysis.

Network analysis of characteristic genes

NetworkAnalyst (http://www.networkanalyst.ca) is a comprehensive web-based tool designed to identify proteins of interest, construct networks, and perform network analysis and visualization (28). The characteristic genes for CRC and SLE were uploaded to the NetworkAnalyst website, mapped to the integrative interaction database to construct correlation networks. The interactions of significant genes with transcription factors (TFs) and microRNAs (miRNAs) were analyzed subsequently. Transcription factor targets were derived from the JASPAR TF binding site profile database. Comprehensive experimentally validated miRNA-gene interaction data were collected from TarBase database. Finally, coregulatory networks among genes, TFs, and miRNA were constructed by Cytoscape software (version 3.10.2).

Immune infiltration evaluation

The 22 types of immune cell composition in patients and controls were estimated with the cell-type identification by estimating relative subsets of RNA transcripts (CIBERSORT) algorithm (29). The proportion and abundance of immune cells in each sample were shown as the stacked histogram with the “ggplot2” package. Subsequently, the differences between the proportions of immune cells in the both groups were shown in a box plot. Next, the association of the infiltrated immune cells was evaluated with the “corrplot”package and presented as a heatmap with the”pheatmap”package. Finally, the correlation between the four characteristic genes and the infiltrating immune cells was analyzed using the Spearman correlation coefficient and visualized as a heatmap.

Construction of nomogram model in SLE

The nomogram diagnostic model was constructed for four characteristic genes using the “rms”package. The risk score for each participant of SLE and controls was calculated based on the nomogram model. Receiver operating characteristic (ROC) curves via “pROC” package, violin plots via “ggbiplot” package, calibration curves via “calibration curve” package, and decision curve analysis (DCA) via “rmda” package evaluated the ability of the diagnostic model to discriminate between SLE and controls (30). GSE50772 was used as an external validation cohort to evaluate the robustness of the model.

Construction of prognostic model in CRC

Based on the normalized expression levels (Expi) and corresponding regression coefficients (Coei) of prognostic genes, the risk score of each patient was calculated according to the following formula:

First, the diagnostic performance of the four-gene-based risk score for CRC was evaluated using ROC analysis. Subsequently, its prognostic capability for predicting overall survival (OS) of CRC was assessed. CRC patients were divided into high-risk and low-risk groups based on the median cut-off value of the risk score. Subsequently, Kaplan-Meier analyses were performed using the “survival”package to compare the survival of patients in the both groups. Distribution in the proportion, survival time, and survival status of high-risk and low-risk groups in CRC patients were explored with the “ggrisk” package.

Furthermore, univariate and multivariate Cox regression analyses were also conducted to evaluate the impact of risk score and other clinical characteristics (including age, chemotherapy, clinical stage, T/N/M classification) on the prognosis of colorectal cancer patients. ROC, calibration, and DCA curves were evaluated the ability of the prognostic model to predict OS of CRC. The COAD and READ datasets served as external validation cohorts to assess the predictive capability of the nomogram.

Mutational profiles for CRC

Somatic mutations in the 15 most frequently mutated genes were analyzed using the “maftools” package to compare mutational profiles between high-risk and low-risk cohorts. Furthermore, copy number variation (CNV), specifically gene amplifications and deletions, were assessed in COAD and READ cohorts via “tidyverse” package.

Stemness analysis for CRC

Stemness gene sets were retrieved from StemChecker website (http://stemchecker.sysbiolab.eu/), which was based on the most comprehensive and updated collection of published stemness signatures defined by expression profiles, RNAi screens, literature curation, computationally derived, Nanog, and transcription factor target gene sets. Then, quantitative analysis of stemness enrichment score of each participant was elucidated with the single-sample gene set enrichment analysis (ssGSEA) method. Adjusted P value < 0.05 was used as the significance threshold.

Immune response analysis for CRC

The differential expression of immune checkpoint and human leukocyte antigen (HLA) were assessed across high-risk and low-risk populations. The immune score, stromal score, ESTMATE score, tumor purity of each sample was calculated with the “ESTMATE” package (31). In addition, tumor immune dysfunction and exclusion (TIDE) score of each sample was evaluated to predict the possibility of immune escape and immune response occurring in the tumor via the website (http://tide.dfci.harvard.edu) (32). Immunophenotypic score (IPS) data were obtained from The Cancer Immunome Atlas (TICA) database (https://tcia.at/home) (33). The responsiveness to anti-PD-1 or anti-CTLA-4 therapy was evaluated by comparing IPS between high-risk and low-risk groups via the “ggpubr” package.

Functional enrichment analysis in CRC

The DEGs between the high-risk and low-risk groups in the GSE39582 dataset were defined and presented as the volcano plot. Potential biological functions were examined through GO enrichment analysis via Metascape database (https://metascape.org) (34). Enriched pathways were further integrated using the gene set enrichment analysis (GSEA) was performed with the “clusterProfiler” package, applying thresholds of |normalized enrichment score (NES)| > 1, NOM P value < 0.05, and q value < 0.25. Additionally, gene set variation analysis (GSVA) was conducted to identify pathways enriched in the two risk cohorts via the “GSVA” package.

Single cell RNA sequencing analysis

Single-cell RNA sequencing data from 7 CRC patients in the E-MTAB-8107 dataset were processed using the “Seurat” package. Cells meeting quality control criteria (featuring more than 400 and fewer than 6000 expressed genes, with a mitochondrial gene percentage below 20%) were retained for subsequent analysis. Data normalization was performed using the “NormalizeData” function. Highly variable genes across cells were identified using the “FindVariableFeatures” function. Subsequently, data scaling was conducted via the “ScaleData” function, followed by principal component analysis (PCA). Cell clustering analysis was performed using the “FindNeighbors” and “FindClusters” functions, and the resulting clusters were visualized by applying uniform manifold approximation and projection (UMAP). Cell-cell communication was analyzed using the “CellChat” package.

We aimed to investigate the expression patterns of the four core genes across eight annotated cell types: epithelial, plasma, T, myeloid, mast, natural killer (NK), fibroblast, and endothelial cells. Furthermore, these cell types were categorized into three functional groups—epithelial, stromal (fibroblasts and endothelial cells), and immune cells (plasma, T, myeloid, mast, and NK cells)—to systematically compare and analyze the expression levels of the four hub genes within these broader functional compartments. Based on the positive or negative expression status of the deoxyribonuclease 1 like 3 (DNASE1L3) gene, all cells were stratified into a positive group and a negative group. The proportion of each cell subpopulation within these two groups was calculated to elucidate the expression pattern of DNASE1L3. Differential expression analysis was further performed between these two groups, and the results were visualized using a volcano plot. KEGG analysis was conducted on the DEGs to identify significantly enriched signaling pathways. Pseudotime analysis was analyzed using the “monocle” package to investigate developmental trajectories of cells (35). A similar analytical workflow was applied to single-cell RNA sequencing data from 3 SLE patients in the GSE142016 dataset.

Spatial transcriptome analysis of CRC

Spatial transcriptome analysis of CRC was performed via the Sparkle website (https://grswsci.top/) (36). Briefly, inverse convolutional analysis was employed to accurately assess cellular composition on different spatial localizations. Strict quality control measures were implemented on the collected single-cell transcriptome data based on the number of expressed genes, the count of unique molecular identifiers, and the percentage of mitochondrial RNA in each cell. Subsequently, a signature score matrix was constructed by calculating the average expression of the top 25 specifically expressed genes in each locus for each cell type. Finally, an enrichment scoring matrix was generated using the “get enrichment matrix” and “enrichment analysis” functions in the “Cottrazm” package. The enrichment scores for each cell type were visualized using the “SpatialFeaturePlot” function in the “Seurat” package. The malignant group was defined if the score of malignant cells in the microregion was 1, it is defined as the non-malignant group otherwise. Spearman correlation between gene expression and cell content was visualized by the “linkET” package.

Immunohistochemical staining

A tissue microarray containing tumor samples and adjacent tissues from 76 CRC patients was obtained from Aifang company (AF-CocSur2201, China). The pathological sections were first subjected to deparaffinization and rehydration. Antigen retrieval was then performed using EDTA buffer. Endogenous peroxidase activity was subsequently blocked with hydrogen peroxide, followed by blocking with 3% bovine serum albumin. Tissue sections were incubated overnight at 4 degrees with the primary antibody against DNASE1L3 (GTX114363, GeneTex, USA). A secondary antibody (GB23303, Servicebio, China) was applied and incubated at room temperature for 1 hour. Diaminobenzidine was used as the chromogen to visualize immunoreactivity, and hematoxylin was employed for nuclear counterstaining. Stained specimens were imaged and analyzed using a microscope (Model KF-FL-020, KFBIO, China). The percentage of positively stained cells was quantified using Visiopharm software.

Extraction of human peripheral blood

SLE patients, CRC patients, and healthy volunteers were recruited from the First Affiliated Hospital of Zhengzhou University. Peripheral blood was collected and centrifuged to obtain plasma. Peripheral blood mononuclear cells (PBMC) were extracted by density gradient centrifugation method using Lymphoprep reagent (1858, Serumwerk Bernburg AG, Germany).

Cell culture

The human monocyte cell line THP-1, immortalized colon epithelial cell line NCM460, and the embryonic kidney cell line HEK-293T were purchased from HyCyte Company (China). THP-1, NCM460 and PBMC were cultured in RPMI 1640 medium (11875093, Gibco, USA) containing 10% fetal bovine serum (FBS, A5256701, Gibco, USA) and 1% penicillin/streptomycin (P/S, 15140122, Gibco, USA). HEK-293T cells were cultured in DMEM medium (11965092, Gibco, USA) containing FBS and P/S. All cells were cultured in 37 degrees incubator containing 5% CO2.

Plasmid transfection

The DNASE1L3-targeting plasmids (sh-DNASE1L3), control plasmid (sh-NC), and helper plasmids (pSPAX2 and pMD2G) were synthesized by Tsingke company (China). These plasmids were transfected into HEK-293T cells using polyethylenimine (PEI; 40816ES01, Yeasen, China). After 48–72 hours of incubation, lentivirus-containing supernatants were harvested. THP-1 cells were then incubated with the lentiviral supernatant supplemented with 2 μg/mL polybrene (C0351, Beyotime, China) for 48 hours. Stable transductants were selected using 3 μg/mL puromycin (P8230, Solarbio, China). Successful knockdown of DNASE1L3 was confirmed by reverse transcription-quantitative polymerase chain reaction (RT-qPCR) and Western blotting analysis.

RT-qPCR

Total RNA of cells was extracted by Trizol reagent (15596026CN, Thermofisher Scientific, USA). Then, RNA was reverse transcribed to cDNA according to the instructions of a Reverse Tran scription Kit (R433, Vazyme, China). A total of 15 pairs of cDNA microarrays of CRC tumor and matched paraneoplastic tissues were obtained from Shanghai Outdo Biotech Company (MecDNA-HColA030CS03, China). Next, qPCR was performed according to the instructions of a SYBR Green PCR Kit (Q712, Vazyme, China). Finally, the mRNA expression was obtained by the 2-ΔΔCt method and standardized β-actin as the internal control. Primer sequences were listed in Supplementary Table S1.

Enzyme-linked immunosorbent assay (ELISA)

Plasma DNASE1L3 levels in patients with SLE (n = 19) or CRC (n = 19), and healthy controls (n = 19) were quantified using an ELISA kit (CSB-EL007052HU, CUSABIO, China) according to the manufacturer’s protocol. Optical density at 450 nm was immediately measured using a Synergy H1 microplate reader (BioTek Instruments, USA).

Western blotting

Cells were lysed on ice for 30 minutes in RIPA buffer (CW2333S, CWBIO, China) supplemented with 1% protease (CW2200S, CWBIO, China) and phosphatase inhibitor cocktail (CW2383S, CWBIO, China). Protein concentration was determined using a BCA assay kit (23227, Thermo Fisher Scientific, USA). Proteins were separated by SDS-PAGE and subsequently transferred onto PVDF membranes (IPVH00010, Millipore, USA). After blocking with 5% non-fat milk for 1 hour, membranes were incubated overnight at 4°C with primary antibodies against DNASE1L3 or β-tubulin (80762-1-RR, Proteintech, China). Next, membranes were probed with HRP-conjugated secondary antibodies (GAR1007, MultiSciences, China) for 1 hour at room temperature. Protein signals were detected using chemiluminescent reagent (GK10008, GLPBIO, USA). Relative expression levels of DNASE1L3 normalized to β-tubulin were quantified using ImageJ software.

Efferocytosis assay

The NCM460 cells were labeled with 1 μM CMFDA (HY-126561, MedChemExpress, USA) for 30 minutes and then washed with PBS before being resuspended in complete culture medium. Apoptosis was induced in NCM460 cells by stimulation with 10 μM staurosporine (HY-15141, MedChemExpress, USA) for 12 hours, followed by washing with PBS. Transfected THP-1 cells were differentiated into macrophages by incubation with 100 ng/mL phorbol myristate acetate (PMA; HY-18739, MedChemExpress, USA) for 48 hours, as previously reported (37, 38). THP-1-derived macrophages were stained with 2 μM DiI (HY-126561, MedChemExpress, USA) for 20 minutes and washed with PBS. Labeled NCM460 cells and macrophages were co-cultured at a ratio of 5:1 for 4 hours. Non-phagocytosed and unbound apoptotic cells were then removed by washing with PBS. Total RNA was extracted for RT-qPCR analysis. Fluorescence signals were acquired on a FACS Canto II flow cytometer (BD Biosciences, USA). Data analysis was performed using FlowJo software (10.8.1, USA). Efferocytosis was quantified as the proportion of macrophages that had phagocytosed NCM460 cells, represented by the percentage of FITC-positive cells.

Statistical analysis

All expression data from public databases was processed in R software (version 4.2.0). Unpaired t-tests were employed to compare two groups of variables exhibiting normal distributions. For non-normally distributed metrics, comparisons were made using the Mann-Whitney U test. Comparisons among multiple groups were performed using the Kruskal-Wallis test. Survival differences were assessed utilizing the log-rank test and visualized via Kaplan-Meier survival curves. The correlation was analyzed using the Spearman method.

Results

Identification of 58 common genes associated with CRC and SLE

The strategy of data processing was shown in Figure 1. Differential analysis showed 1967 DEGs between CRC and control tissues in the GSE39582 cohort, including 1087 up-regulated genes and 880 down-regulated genes. In the GSE61635 cohort, 721 DEGs, including 474 up-regulated genes and 247 down-regulated genes, were identified in PBMCs from SLE and controls. Volcano plots were applied to demonstrate the expression pattern of DEGs in GSE39582 and GSE61635 datasets (Figures 2A, B). To further explore the shared key genes of CRC and SLE, Venn diagrams were used to showed the 58 common DEGs of CRC and SLE (Figure 2C). The protein-to-protein interaction (PPI) network illustrated the interactions among proteins associated with the common DEGs (Figure 2D). Heatmaps were carried out to identify the expression pattern of common DEGs in GSE39582 and GSE61635 cohorts (Figures 2E, F). Biological process of GO analysis revealed that the common DEGs were enriched in regeneration, regulation of extrinsic apoptotic signaling pathway in absence of ligand, and cell aging (Figure 2G). Cellular component of GO analysis showed that the DEGs were enriched in tertiary granule lumen, RNA polymerase II transcription regulator complex, and anchored component of membrane (Figure 2H). Molecular function of GO analysis displayed that the DEGs were enriched in ATPase-coupled intramembrane lipid transporter activity, peroxidase activity, and oxidoreductase activity, acting on peroxide as acceptor (Figure 2I). KEGG analysis demonstrated that the DEGs were enriched in glutathione metabolism, chemokine signaling pathway, and thyroid hormone synthesis (Figure 2J).

Figure 1

Figure 2

Screening of ten candidate small-molecule compound for treatment

The 31 up-regulated DEGs were uploaded to the cMap database to identify potential therapeutic drugs. After significance screening, the top 10 compounds—including treprostinil, IKK-2-inhibitor, PRL-3-inhibitor-I, idebenone, alrestatin, GW-843682X, BRD-K15107389, AT-9283, pyrimethamine, and MAZ-51, were identified as potential therapeutic agents for CRC and SLE (Figure 2K). These compounds primarily function as phosphatase inhibitors, VEGFR inhibitors, calcium channel modulators, etc. (Figure 2L). The chemical structures of these 10 compounds are shown in Supplementary Figure S1.

Identification of four hub genes in CRC and SLE

The univariate Cox, SVM-RFE, and LASSO were applied to explore the hub genes in CRC and SLE. First, the univariate Cox regression was carried out to identify 15 potential candidate genes from 58 common DEGs. The LASSO regression was used to identify 9 potential genes from CRC and 10 genes from SLE. The SVM-RFE algorithm was constructed to identified 14 potential genes from CRC and 13 potential genes from SLE (Figures 3A–F). The overlapping genes among all subsets were defined as hub genes, i.e., genes associated with SLE diagnosis as well as the diagnosis and prognosis of CRC. A Venn diagram identified the four hub genes—DNASE1L3, protein tyrosine phosphatase non-receptor type 14 (PTPN14), selenium-binding protein 1 (SELENBP1), and esophageal cancer related gene 4 (ECRG4)—which were subsequently included in further analyses (Figure 3G). Network analysis via the NetworkAnalyst database revealed that the four signature genes are regulated by transcription factors such as SRF, YY1, and IRF2, as well as miRNAs including hsa-mir-335-5p, hsa-mir-27a-3p, and hsa-mir-124-3p (Figures 3H, I).

Figure 3

Partially overlapping immune infiltration patterns of CRC and SLE

The proportion of immune cells in each sample of CRC cohort was calculated by the “CIBERSORT”algorithm (Figure 4A). There were significant differences in the proportions of the 11 types of immune cells between the control and CRC groups. Compared to the control group, the CRC group had a higher proportion of M0 macrophages, M1 macrophages, activated mast cells, neutrophils, activated memory CD4 T cells, while a lower proportion of memory B cells, resting dendritic cells, resting mast cells, monocytes, plasma cells, and resting memory CD4 T cells (Figure 4B). In addition, correlation analysis of immune cells showed a significant positive correlation between M1 macrophages and follicular helper T cells (r = 0.449, P < 0.001), and a significant negative correlation between activated NK cells and resting NK cells (r = -0.566, P < 0.001) (Figure 4C). Moreover, the correlation revealed that the four genes were closely associated with immune cell infiltration. Plasma cells were positively correlated with SELENBP1, and DNASE1L3. Neutrophils were negatively correlated with SELENBP1 (Figure 4D).

Figure 4

Similarly, the proportion of immune cells in each sample of SLE cohort was displayed in Figure 4E. There were significant differences in the proportions of the 7 types of immune cells between the control and SLE groups. Compared to the control group, the SLE group had a higher proportion of M0 macrophages, neutrophils, yet a lower proportion of naive B cells, resting mast cells, resting memory CD4 T cells, and CD8 T cells (Figure 4F). There was a significant positive correlation between neutrophils and naive CD4 T cells (r = 0.481, P < 0.001), and a significant negative correlation between neutrophils and CD8 T cells (r = -0.695, P < 0.001) (Figure 4G). PTPN14 expression showed the highest positive correlation with neutrophil infiltration and highest negative correlation with eosinophil infiltration. SELENBP1 expression showed the highest negative correlation with neutrophils. DNASE1L3 expression demonstrated the strongest positive correlation with resting memory CD4T cells and the strongest negative correlation with plasma cells. ECRG4 expression shows the highest positive correlation with CD8 T cells, while it shows the highest negative correlation with neutrophils (Figure 4H).

Diagnostic efficacy of the nomogram model for SLE

In order to diagnose and predict SLE, four signature genes were used to construct a diagnostic nomogram model of SLE with the “rms” package (Figure 5A). ROC curves were applied to calculate the area under curve (AUC)values of the nomogram model to determine their sensitivity and specificity for diagnosing SLE. The results showed that the nomogram model had an AUC value of 0.993 for the diagnosis of SLE (Figure 5B). Based on the nomogram model, risk scores were calculated for each sample, and the violin plot demonstrated significantly higher risk scores for SLE compared to the control group (Figure 5C). Additionally, DCA calculated the clinical “net benefit” of the nomogram model at different threshold probabilities compared to a strategy of treating all patients or no patients (Figure 5D). Furthermore, the calibration plot revealed the predicted probabilities of the nomogram model were almost identical to the actual probabilities (Figure 5E). Similarly, the model showed good performance in discriminating SLE patients from controls in the GSE50772 dataset (Figures 5F–I).

Figure 5

Four-gene-based risk score for the diagnosis and prognostic prediction of CRC

The risk score for each sample was calculated based on the expression levels of the four genes. The risk score was defined as follows: Risk score = (0.2120) × PTPN14 + (-0.0354) × SELENBP1 + (-0.0788) × DNASE1L3 + (0.1093) × ECRG4. The histogram displayed the coefficients of the four genes in the risk score model (Figure 6A). The AUC values for the risk score in diagnosing CRC were 0.894 in GSE39582 (Figure 6B), 0.691 in COAD (Supplementary Figure S2A), and 0.668 in READ (Supplementary Figure S2B). The forest plot indicated that the four gene significantly influenced the survival prognosis of CRC (Figure 6C). CRC patients were stratified into high and low risk groups based on the median value of risk score. The risk of death was increased in the high-risk group compared to the low-risk group (Figure 6D). Notably, survival analysis uncovered that OS was significantly lower in the high-risk group compared with the low-risk group in the GSE39582 and GSE17536 cohorts. Patients in the high-risk group demonstrated a trend toward reduced OS compared to those in the in the low-risk group in the COAD and READ cohorts (Figure 6E).

Figure 6

Risk score, age, chemotherapy, clinical stage, T classification, N classification, and M classification were included in univariate Cox regression, and the results revealed that risk score, age, clinical stage, T classification, N classification, and M classification significantly affected prognosis of patients (Figure 7A). Multivariate Cox regression analysis verified that risk score, age, and M classification were significant prognostic factors for patients (Figure 7B). The above results showed that risk score, age, and M classification were independent prognostic factors for CRC patients. The associations between the four hub genes and key clinicopathological features of CRC were further systematically analyzed. The results indicated that the expression levels of PTPN14 and ECRG4 were higher in patients with T3–4 classification (Supplementary Figure S3A). Additionally, PTPN14 expression was elevated in patients with N1–2 classification (Supplementary Figure S3B). We also observed that DNASE1L3 expression was lower in M0 patients, whereas PTPN14 expression was higher in M1 patients (Supplementary Figure S3C). Furthermore, PTPN14 levels were increased in patients with stage III–IV (Supplementary Figure S3D), and ECRG4 expression was higher in patients who received chemotherapy (Supplementary Figure S3E).

Figure 7

Based on the significant factors, a nomogram model was constructed to predict the prognosis of CRC (Figure 7C). The nomogram demonstrated AUC values of 0.778, 0.752, and 0.728 for predicting 1-, 3-, and 5-year survival of CRC, respectively (Figure 7D). Patients in the high-risk group demonstrated a significantly poorer prognosis compared to those in the low-risk group (Figure 7E). DCA showed that the clinical “net benefit” of the predictive model at different threshold probabilities was higher than that of the strategy of treating all patients or not treating patients (Figure 7F). Moreover, the calibration curve revealed minimal deviation between the actual and predicted probabilities (Figure 7G).

Similar results were obtained in the validation cohort. The nomogram model demonstrated time-dependent predictive accuracy for COAD, with AUC values of 0.752, 0.728, and 0.679 at 1, 3, and 5 years, respectively. Furthermore, survival, DCA, and calibration curves collectively validated its robust predictive performance (Supplementary Figures S4A–D). Due to the lack of sufficient data on the 5-year survival of patients in the READ cohort, we analyzed the accuracy of the nomogram model in predicting 1-, 2-, and 3-year survival for READ. The AUC values were 0.842, 0.802, and 0.806, respectively (Supplementary Figure S4E). Furthermore, survival, DCA, and calibration curves further validated the model’s robust predictive capability (Supplementary Figures S4F–H). These results suggested that the nomogram model had good diagnostic and prognostic prediction efficacy for CRC.

Mutational profiles between high-risk and low-risk subgroups in CRC

Somatic mutation analysis revealed that APC, TP53, and TTN were the three most frequently mutated genes in both high-risk and low-risk groups across the COAD and READ cohorts (Figure 8A). In the COAD cohort, copy number amplifications were predominantly observed in PCDH17, TWIST1, and LY6E genes, whereas in the READ cohort, amplifications were more commonly detected in PCDH17, INHBA, and TWIST1. Copy number deletions were consistently prevalent in ZDHHC2, EPHX2, and IFI6 genes in both cohorts (Figure 8B).

Figure 8

Distinct biological mechanisms drive tumor progression between high-risk and low-risk groups in CRC

A total of 204 DEGs were identified between the low- and high-risk group in the GSE39582 cohort. The distribution of these DEGs was shown in the volcano plot (Figure 8C). To gain a better understanding of potential molecular processes and functions, GO, KEGG, GSVA, and GSEA enrichment analysis was then conducted based on the DEGs. GO analysis illustrated that pathogenic genes were closely enriched in extracellular matrix, glycosaminoglycan binding, and collagen fibril organization (Figure 8D). KEGG analysis revealed that pathogenic genes were mostly enriched in protein digestion and absorption, ECM-receptor interaction, and cytoskeleton in muscle cells (Figure 8E). GSVA analysis indicated that DEGs were mostly clustered in immunoglobulin complex circulating, IgA immunoglobulin complex, and pH elevation (Figure 8F). GSEA analysis revealed that the antigen processing and presentation, complement and coagulation cascades, leukocyte transendothelial migration, ECM-receptor interaction, and phagosome were enriched in the high-risk group, while drug metabolism cytochrome P450, fatty acid degradation, folate biosynthesis, glycolysis/gluconeogenesis, and oxidative phosphorylation were enriched in the low-risk group (Figure 8G).

Different immune response patterns between high- and low-risk groups in CRC

Cancer stem cells are a very small number of tumor cells in tumor tissue with biological properties including unlimited proliferation, self-renewal and multidirectional differentiation. There was a significant difference in stemness enrichment scores between the high- and low-risk groups by 26 stemness gene sets, suggesting that the pattern of tumor genesis, progression, infiltration and recurrence after treatment may be different between the two groups (Figure 9A).

Figure 9

Functional enrichment analysis of signature genes showed a close association with immune cell infiltration in CRC. Therefore, we explored the correlation between risk score and immune cell characteristics. Analyses of HLA typing showed significant differences in the 9 HLAs between the high- and low-risk groups. Eight HLA were upregulated in the high-risk group, whereas only one HLA was upregulated in the low-risk group (Figure 9B). The expression levels of CD40, CD86, CD200, CD274, CD276, HAVCR2, LAIR1, NRP1, PDCD1LG2, TNFRSF4, TNFRSF8, TNFRSF9, TNFSF4, TNFSF9, and TNFSF14 were significantly higher in the high-risk group, while BTNL2, CD40LG, CD160, CD244, CTLA4, HHLA2, LGALS9, PDCD1, TMIGD2, and TNFRSF25 were lower (Figure 9C). Differences in immune checkpoint gene expression between high- and low-risk groups may result in differential susceptibility to immune checkpoint inhibitors. Furthermore, analysis using the estimation of stromal and immune cells in malignant tumor tissues using expression data (ESTIMATE) algorithm revealed that the high-risk group exhibited significantly higher immune, stromal, and ESTIMATE scores, along with lower tumor purity (Figure 9D). In addition, TIDE score in high-risk patients was significantly higher than that of low-risk patients, suggesting a worse efficacy of immunotherapy (Figure 9E). Finally, we compared the IPS between the two risk groups in COAD and READ cohorts. The low-risk group exhibited a significantly elevated IPS, indicating a more promising potential for immune response. In contrast, the high-risk group showed a markedly reduced IPS, suggesting a potential resistance to immune checkpoint inhibitors (Figures 9F, G). In conclusion, there were significant differences in the microenvironment between high and low risk patients. These heterogeneities may contribute to more precise tumor immunotherapy.

Single-cell analysis revealed expression and functional profiles of four genes in CRC and SLE

Analysis of single-cell RNA sequencing data from 7 CRC patients in the E-MTAB-8107 dataset identified 23 clusters, which were further classified into 8 distinct cell types, including epithelial cells, plasma cells, T cells, myeloid cells, mast cells, NK cells, fibroblast cells, and endothelial cells (Figures 10A, B). The top three marker genes for each cell type are shown in Figure 10C. Cell communication analysis revealed that NK cells were the primary signal receivers, while fibroblast cells were the major signal senders (Figures 10D, E, Supplementary Figure S5). UMAP visualization illustrated the distribution and expression levels of the four feature genes across the eight cell types (Figure 10F). These findings indicate that DNASE1L3 was predominantly expressed in myeloid cells, PTPN14 in endothelial cells, SELENBP1 in epithelial and fibroblast cells, and ECRG4 mainly in fibroblastic cells (Figure 10G). The eight cell types were categorized into three broad classes: immune cells, epithelial cells, and stromal cells. We observed that DNASE1L3 was primarily distributed in immune cells, PTPN14 in stromal cells, SELENBP1 in both epithelial and stromal cells, and ECRG4 mainly in stromal cells (Figure 10H). Pseudotime analysis was subsequently performed (Figure 10I, J), revealing distinct developmental patterns for the four genes (Figure 10K). The expression of PTPN14, SELENBP1, and ECRG4 gradually increased along the pseudotime trajectory, whereas DNASE1L3 expression initially increased and then decreased (Figure 10L). Furthermore, all cells were stratified into two subgroups based on DNASE1L3 expression status (high vs. low) to evaluate the contribution of different cell types to gene expression. Myeloid cells were identified as the primary contributors to DNASE1L3-high expression (Figure 10M). A total of 360 DEGs were identified between DNASE1L3- high and - low cells, as shown in Figure 10N. KEGG enrichment analysis indicated that these DEGs were significantly enriched in pathways such as phagosome, Epstein–Barr virus infection, and staphylococcus aureus infection (Figure 10O).

Figure 10

Analysis of single-cell RNA sequencing data from 3 SLE patients in the GSE142016 dataset identified 11 clusters, which were annotated into 5 immune cell types, including CD4 T cells, CD8 T cells, NK cells, B cells, and myeloid cells (Figures 11A, B). The top three marker genes for each cell type are presented in Figure 11C. Cell communication analysis demonstrated that CD8 T cells were both the primary senders and receivers of signals (Figures 11D, E, Supplementary Figure S6). UMAP plots displayed the distribution and expression levels of the four feature genes across the five cell types (Figure 11F). The results showed that DNASE1L3 was mainly expressed in B cells, PTPN14 in NK cells, SELENBP1 in CD4 T cells and myeloid cells, and ECRG4 primarily in CD4 T cells (Figure 11G). Pseudotime analysis was conducted (Figures 11H, I); however, due to low expression levels of ECRG4 and SELENBP1, only the trajectories of PTPN14 and DNASE1L3 were explored (Figure 11J). PTPN14 expression gradually decreased along pseudotime, whereas DNASE1L3 expression progressively increased (Figure 11K). Further stratification of all cells into DNASE1L3-high and -low subgroups revealed that B cells were the predominant cell type in the high-expression group, while myeloid cells were most abundant in the low-expression group (Figure 11L). A total of 111 DEGs were identified between the two groups (Figure 11M). KEGG pathway analysis indicated significant enrichment in salmonella infection, shigellosis, diabetic cardiomyopathy, and chemical carcinogenesis-reactive oxygen species (Figure 11N).

Figure 11

Spatial transcriptomics analysis revealed the expression and localization of four signature genes in CRC

Integrated spatial transcriptome sequencing analysis was performed to explore the spatial location and gene expression of four prognostic genes in CRC tissue. As shown in Figure 12A, Hematoxylin and eosin (H&E) staining of pathological sections was performed. Ten distinct cell types were identified, and their spatial localization was presented in Figure 12B. The demarcation between malignant and non-malignant regions was displayed in Figure 12C. The spatial distribution of the four signature genes was shown in Figure 12D. Further investigation into their expression revealed that DNASE1L3, PTPN14, and SELENBP1 were primarily localized to the malignant regions, whereas the expression of ECRG4 showed no significant difference between malignant and non-malignant areas (Figure 12E). Correlation analysis demonstrated that all four signature genes were closely associated with a variety of cell types (Figure 12F).

Figure 12

Validation of the expression of prognostic genes for CRC

The mRNA expression levels of DNASE1L3, SELENBP1 and ECRG4 were significantly lower in the tumor group compared to the control group in the GSE39582 dataset, whereas the mRNA expression levels of PTPN14 were significantly higher in the tumor group (Figure 13A). Consistent with the aforementioned results, similar expression trends for DNASE1L3, ECRG4, and SELENBP1 were observed in the COAD and READ datasets, except that PTPN14 showed no significant difference in the COAD dataset but was markedly downregulated in the READ dataset (Supplementary Figure S7). Patients with low levels of DNASE1L3 exhibited a significantly poorer prognosis than those with high levels, whereas the opposite trend was observed for PTPN14. In contrast, the expression of SELENBP1 and ECRG4 showed no significant association with patient survival (Figure 13B). The qPCR results of the 15 pairs of tumor and pericarcinomatous tissues confirmed the above results (Figure 13C).

Figure 13

These findings suggest that DNASE1L3 and PTPN14 may represent potential key targets for improving prognosis in CRC. Given that immune dysregulation is a critical pathological feature shared by CRC and SLE, our study, along with existing evidence, indicates that DNASE1L3—primarily expressed in macrophages—plays a central role in mediating the clearance of apoptotic cells, maintaining self-antigen homeostasis, and preventing immune dysregulation. Additionally, IHC staining of tumor tissues and pericarcinomatous tissues from 76 CRC patients further confirmed the lower expression level of DNASE1L3 protein in tumor tissues (Figure 13D).

Validation of the expression of diagnostic genes for SLE

The expression levels of DNASE1L3, PTPN14, SELENBP1, and ECRG4 in the GSE61635 dataset were consistent with those in the GSE39582 dataset (Figure 14A). Similarly, RT-qPCR analysis of PBMCs from 25 SLE patients and 23 healthy controls confirmed the mRNA expression levels of these four genes (Figure 14B). Subsequently, a diagnostic nomogram was developed to predict the probability of SLE development in both the control and SLE group (Figure 14C). The ROC curve demonstrated that the nomogram achieved an AUC of 0.920 for diagnosing SLE (Figure 14D). Nomogram assessment revealed a significant discrimination in SLE risk between the control and SLE groups (Figure 14E). Both the calibration and DCA curves indicated that nomogram-based decisions could provide clinical utility for SLE prediction (Figures 14F, G).

Figure 14

Low levels of DNASE1L3 impair macrophage phagocytic function.

Significantly reduced DNASE1L3 levels were observed in both the SLE and CRC groups compared to the control group via ELISA method (Figure 15A). To simulate the pathological microenvironment of low DNASE1L3 expression observed in both diseases, we knocked down DNASE1L3 expression in the human monocytic cell line THP-1 to further investigate the regulatory mechanisms of this gene in disease pathogenesis. RT-qPCR and Western blotting were used to verify the knockdown efficiency of DNASE1L3 in the constructed THP-1 cell line (Figures 15B, C). RT-qPCR results demonstrated that low levels of DNASE1L3 promoted the differentiation of THP-1 cells toward the M1 phenotype while suppressing their differentiation into M2 macrophages (Figure 15D). Concurrently, there was a significant increase in IL-1β and TNF-α secreted by M1 macrophages, along with a marked reduction in LOX protein expression by M2 macrophages (Figure 15E). Flow cytometric analysis confirmed that reduced DNASE1L3 impaired macrophage efferocytosis (Figure 15F).

Figure 15

Discussion

Through its roles of defense, immune surveillance, and maintenance of homeostasis, the immune system maintains the stability of the internal environment. Immune dysregulation can lead to various diseases, such as cancer, autoimmune diseases, and infections (39). Autoimmune diseases are characterized by a loss of self-tolerance and disruption of immune homeostasis (40). Conversely, insufficient immune responses can contribute to tumor development, which is often accompanied by inflammatory reactions. SLE and CRC are common immune-related diseases whose pathogenic mechanisms have not been fully elucidated. Although they appear to share potential common immune mechanisms, their immunological characteristics remain to be systematically summarized. Therefore, we constructed a shared differential gene expression profile of SLE and CRC to explore the commonalities between autoimmune diseases and cancer.

Through integrated multi-omics analysis, this study reveals that SLE and CRC share two core pathological features, characterized by impaired immune clearance and disrupted metabolic homeostasis. Key evidence indicates that DNASE1L3, PTPN14, SELENBP1, and ECRG4 exhibit consistent dysregulation in both diseases, and a model constructed based on these genes demonstrates robust performance in both SLE and CRC cohorts. This discovery provides a novel perspective for understanding the shared mechanisms between autoimmune diseases and cancer.

We initially identified 58 common DEGs between SLE and CRC. These genes were significantly enriched in biological processes such as apoptosis, cell cycle regulation, immune cell activation, and metabolic reprogramming. Drug sensitivity prediction analysis suggested that targeting histone deacetylases and the PI3K/AKT signaling pathway may help modulate the aberrant gene expression patterns in both diseases. Notably, immune infiltration analysis revealed that SLE and CRC shared decreased proportions of resting memory CD4+ T cells and resting mast cells, along with increased infiltration of neutrophils and M0 macrophages. Furthermore, the expression levels of the four genes were closely associated with infiltrating levels of various immune cells, including T cells, macrophages, neutrophils, and mast cells. This shared immune microenvironment composition provides a cellular basis for the co-morbidity development between these two diseases.

By integrating single-cell RNA sequencing and spatial transcriptomics, we delineated the cell type-specific distribution of the four genes. Spatial transcriptomic analysis revealed that PTPN14 and SELENBP1 were highly expressed in malignant regions, whereas ECRG4 was predominantly localized to non-malignant areas. Single-cell resolution further demonstrated that PTPN14 was mainly distributed in epithelial cells in CRC and in NK cells in SLE. SELENBP1 was primarily expressed in epithelial cells and fibroblasts in CRC, and in myeloid cells and CD4+ T cells in SLE. ECRG4 was largely detected in fibroblasts in CRC and in CD4+ T cells in SLE. PTPN14, a non-receptor type protein tyrosine phosphatase, participates in the regulation of cell migration, differentiation, and adhesion by mediating dephosphorylation of proteins such as YAP (41). Furthermore, studies have shown that PTPN14 promotes tumor progression via the PI3K/AKT/mTOR pathway (42). SELENBP1 modulates protein transport and degradation and has been documented to be dysregulated in in several autoimmune diseases. Moreover, it exerts tumor-suppressive effects by inhibiting the activity of hypoxia-inducible factor-1 alpha (43). ECRG4, also known as C2orf40, is primarily localized to the cell membrane. In CRC cells, ECRG4 is highly methylated, leading to transcriptional silencing. Its expression promotes apoptosis and inhibits proliferation (44). Additionally, ECRG4 is expressed in polymorphonuclear cells and helps maintain cellular homeostasis. At sites of inflammation or tumor growth, it is cleaved by proteases such as thrombin, releasing bioactive peptides with pro-inflammatory activity that recruit neutrophils to the site of injury and contribute to tissue repair (45).

This study revealed a consistent reduction in both intracellular and extracellular DNASE1L3 protein levels in patients with SLE and CRC, implying that this downregulation could be involved in the immune dysregulation common to both conditions. DNASE1L3 is an extracellular nuclease primarily highly expressed in in macrophages and dendritic cells, involved in clearance of extracellular and apoptotic DNA, suggesting its potential relevance in autoimmune diseases and cancer. In SLE, impaired clearance of apoptotic cells leads to the accumulation of phospholipids and nucleic acids, which act as autoantigens that activate B cells and promote the production of autoantibodies such as anti-phosphatidylserine and anti-dsDNA (46). Concurrently, the phagocytic clearance of immune complexes by phagocytes is diminished, resulting in the deposition of circulating immune complexes in target organs and subsequent tissue damage (47). In CRC, genomic instability and immune evasion are key pathogenic features. Dysfunctional DNASE1L3 may contribute to the accumulation of DNA fragments, chronic inflammation, and disruption of immune tolerance, suggesting a potential shared pathological mechanism in both diseases (48). Our experimental data further demonstrated that DNASE1L3 deficiency impairs the ability of phagocytes to clear apoptotic cells, thereby jointly promoting the pathogenesis of SLE and CRC.

Macrophages reside in nearly all tissues and play diverse roles in processes such as microbial defense, tissue repair, inflammation, and metabolic disorders. They exhibit remarkable functional plasticity and serve as a critical bridge between innate and adaptive immunity (49). Dysfunctional macrophages represent a common pathological feature of both SLE and CRC. Numerous studies have confirmed that in SLE, macrophages exhibit upregulation of autophagy- and apoptosis-related genes, while deficiency in peroxisome proliferator-activated receptor gamma (PPARγ) prevents their acquisition of an anti-inflammatory phenotype. This leads to impaired phagocytic capacity and exacerbates target organ damage (50). Building on these findings, our study further revealed that low DNASE1L3 expression disrupts the M1/M2 polarization balance in macrophages, impairs their ability to clear apoptotic cells, and may contribute to the accumulation of autoantigens, thereby triggering inflammatory responses and immune dysregulation.

We identified DNASE1L3 as a key protective factor involved in both SLE and CRC. Due to immune dysregulation, SLE can progress to involve neuropsychiatric symptoms, nephritis, enteritis, and dermatitis. Previous studies have confirmed shared microbial features and functional alterations between SLE and inflammatory bowel disease (51). Functional deficiency of DNASE1L3 impairs systemic and intestinal tolerance to extracellular DNA, allowing apoptotic fragments to amplify immune responses and promote inflammation. Multiple studies have reported DNASE1L3 as a susceptibility gene for SLE, with deficient patients exhibiting elevated plasma DNA levels (52). Elevated anti-DNASE1L3 antibodies are closely associated with SLE, and DNASE1L3-deficient mice develop lupus-like phenotypes (53). Several studies have reported downregulation of DNASE1L3 in CRC cells, promoting proliferation and migration. In an azoxymethane/dextran sulfate sodium (AOM/DSS) model of inflammation-induced colorectal cancer, DNASE1L3-knockout mice exhibited more severe colonic epithelial damage, increased hemorrhage, greater weight loss, diarrhea, and larger tumor volumes compared to wild-type mice (54). These findings further support that DNASE1L3 deficiency promotes tumor initiation and exacerbates progression (54). This study constructed a four-gene-based model for the early diagnosis of SLE and CRC, and also providing prognostic evidence for CRC. Furthermore, we proposed a cross-disease model centered on DNASE1L3 dysregulation to explain the shared mechanisms between autoimmune diseases and cancer.

This study has several limitations. Due to batch differences among various public datasets and cohort-specific confounding factors, the observed gene modules and enriched pathways may be significantly affected. Our integrative analyses cannot robustly stratify patients by disease stage, specific clinical phenotypes, molecular subtypes, or treatment exposure. Moreover, the clinical samples used for ELISA and RT-qPCR were limited. The current work does not determine whether the observed patterns are broadly generalizable or driven by distinct subgroups. We used PMA-differentiated THP-1 cells, a human monocytic leukemia line, as an in vitro macrophage model to examine the impact of DNASE1L3 deficiency on efferocytosis and polarization. While THP-1 cells are widely employed for mechanistic studies, their malignant origin is an inherent limitation. Our current results should therefore be viewed as proof-of-concept. Future experiments using RAW264.7 cells and primary macrophages derived from human PBMCs are underway to validate the effects of DNASE1L3 on macrophage polarization and efferocytic capacity in more physiologically relevant systems. While the in vitro THP-1 model aligns with the inflammatory profiles of SLE and CRC, direct in vivo validation (e.g., using specific knockout mouse models for both diseases) remains a necessary future step to confirm this axis operates physiologically to drive disease progression. In the future, we will conduct large-sample prospective studies to evaluate the clinical utility of the four-gene model. At the same time, we will carry out a series of basic research projects to systematically investigate the role of DNASE1L3-related pathways in SLE and CRC.

Conclusion

Through an integrated multi−omics approach, this study identified a four−gene model based on DNASE1L3, PTPN14, SELENBP1, and ECRG4, which may have potential utility for early identification and risk stratification in both SLE and CRC. In addition, DNASE1L3−related pathways may represent an interesting focus for future mechanistic and translational research in both diseases.

Statements

Data availability statement

The data presented in the study are deposited in the Zenodo repository, accession number 10.5281/zenodo.18753102.

Ethics statement

The studies involving humans were approved by the following ethics committees: The cDNA microarray for the qPCR experiment obtained approval from the Ethics Committee of Shanghai Outdo Biotechnology Company Limited (ZJBB-FM-071). The pathological chips used for immunohistochemical analysis obtained approval from the Ethical Review Committee of AIFANG Biotech Company (HN20250401). The relevant experiments on peripheral blood samples received approval from the Ethics Committee of The First Affiliated Hospital of Zhengzhou University (2024-KY-1908-002). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

HG: Funding acquisition, Writing – original draft, Methodology. CT: Investigation, Methodology, Writing – review & editing. MZ: Conceptualization, Visualization, Writing – review & editing, Resources. LZ: Resources, Data curation, Methodology, Writing – review & editing. WW: Writing – review & editing, Validation, Software. MH: Conceptualization, Supervision, Project administration, Writing – review & editing. DC: Formal analysis, Writing – review & editing, Software, Funding acquisition.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This work was supported by the grants from Postdoctoral Fellowship Program of the China Postdoctoral Science Foundation (GZC20232407), Postdoctoral Funding of the First Affiliated Hospital of Zhengzhou University (72395), Key Scientific Research Projects in Higher Education Institutions of Henan Provincial (26A320037), and the Science and Technology Research Program of Henan Province (242102311096).

Acknowledgments

We thank patients and their families for their support of the study. We acknowledge TCGA, GEO, DAVID, and Metascape databases platforms and data contributors.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2026.1775776/full#supplementary-material

References

  • 1

    KiriakidouMChingCL. Systemic lupus erythematosus. Ann Intern Med. (2020) 172:Itc81itc96. doi: 10.7326/AITC202006020

  • 2

    BarberMRWDrenkardCFalasinnuTHoiAMakAKowNYet al. Global epidemiology of systemic lupus erythematosus. Nat Rev Rheumatol. (2021) 17:515–32. doi: 10.1038/s41584-021-00668-1

  • 3

    TsokosGC. The immunology of systemic lupus erythematosus. Nat Immunol. (2024) 25. doi: 10.1038/s41590-024-01898-7

  • 4

    HoiAIgelTMokCCArnaudL. Systemic lupus erythematosus. Lancet. (2024) 403:2326–38. doi: 10.1016/S0140-6736(24)01044-4

  • 5

    BernatskySRamsey-GoldmanRLabrecqueJJosephLBoivinJFPetriMet al. Cancer risk in systemic lupus: an updated international multi-centre cohort study. J Autoimmun. (2013) 42:130–5. doi: 10.1016/j.jaut.2012.12.009

  • 6

    PetterssonTPukkalaETeppoLFrimanC. Increased risk of cancer in patients with systemic lupus erythematosus. Ann Rheum Dis. (1992) 51:437–9. doi: 10.1136/ard.51.4.437

  • 7

    BrayFLaversanneMSungHFerlayJSiegelRLSoerjomataramIet al. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. (2024) 74:229–63. doi: 10.3322/caac.21834

  • 8

    ZhangMWangYWangYBaiYGuD. Association between systemic lupus erythematosus and cancer morbidity and mortality: findings from cohort studies. Front Oncol. (2022) 12:860794. doi: 10.3389/fonc.2022.860794

  • 9

    AlmagroBMSteylsMCNavarroNLDomínguezEGAcostaEHPérezMAet al. Occurrence of subacute cutaneous lupus erythematosus after treatment with systemic fluorouracil. J Clin Oncol. (2011) 29:e613–5. doi: 10.1200/JCO.2011.35.0686

  • 10

    CorrealePFioravantiABertoldiIMontagnaniFMiraccoCFranciniG. Occurrence of autoimmunity in a long-term survivor with metastatic colon carcinoma treated with a new chemo-immunotherapy regimen. J Chemother. (2008) 20:278–81. doi: 10.1179/joc.2008.20.2.278

  • 11

    ZhangCYuRLiSYuanMHuTLiuJet al. KRAS mutation increases histone H3 lysine 9 lactylation (H3K9la) to promote colorectal cancer progression by facilitating cholesterol transporter GRAMD1A expression. Cell Death Differ. (2025) 32:2225–38. doi: 10.1038/s41418-025-01533-4

  • 12

    LiuJSuYZhangCDongHYuRYangXet al. NCOA3 impairs the efficacy of anti-PD-L1 therapy via HSP90α/EZH2/CXCL9 axis in colon cancer. Int Immunopharmacol. (2025) 155:114579. doi: 10.1016/j.intimp.2025.114579

  • 13

    YaoXHuangJZhongHShenNFaggioniRFungMet al. Targeting interleukin-6 in inflammatory autoimmune diseases and cancers. Pharmacol Ther. (2014) 141:125–39. doi: 10.1016/j.pharmthera.2013.09.004

  • 14

    WangKHuangHZhanQDingHLiY. Toll-like receptors in health and disease. MedComm (2020). (2024) 5:e549. doi: 10.1002/mco2.549

  • 15

    SongMEngelsEAClarkeMAKreimerARShielsMS. Autoimmune disease and the risk of anal cancer in the US population aged 66 years and over. J Natl Cancer Inst. (2024) 116:309–15. doi: 10.1093/jnci/djad187

  • 16

    ZhangYShiGZhangHXiongQChengFWangHet al. Dexamethasone enhances the lung metastasis of breast cancer via a PI3K-SGK1-CTGF pathway. Oncogene. (2021) 40:5367–78. doi: 10.1038/s41388-021-01944-w

  • 17

    SuiYLiSFuXQZhaoZJXingS. Bioinformatics analyses of combined databases identify shared differentially expressed genes in cancer and autoimmune disease. J Transl Med. (2023) 21:109. doi: 10.1186/s12967-023-03943-9

  • 18

    RozkiewiczDHermanowiczJMKwiatkowskaIKrupaAPawlakD. Bruton’s tyrosine kinase inhibitors (BTKIs): review of preclinical studies and evaluation of clinical trials. Molecules. (2023) 28:2400. doi: 10.3390/molecules28052400

  • 19

    WangQMaLAnCWiseSGBaoS. The role of IL-38 in intestinal diseases - its potential as a therapeutic target. Front Immunol. (2022) 13:1051787. doi: 10.3389/fimmu.2022.1051787

  • 20

    MarisaLde ReynièsADuvalASelvesJGaubMPVescovoLet al. Gene expression classification of colon cancer into molecular subtypes: characterization, validation, and prognostic value. PloS Med. (2013) 10:e1001453. doi: 10.1371/journal.pmed.1001453

  • 21

    SmithJJDeaneNGWuFMerchantNBZhangBJiangAet al. Experimentally derived metastasis gene expression profile predicts recurrence and death in patients with colon cancer. Gastroenterology. (2010) 138:958–68. doi: 10.1053/j.gastro.2009.11.005

  • 22

    GoldmanMJCraftBHastieMRepečkaKMcDadeFKamathAet al. Visualizing and interpreting cancer genomics data via the Xena platform. Nat Biotechnol. (2020) 38:675–8. doi: 10.1038/s41587-020-0546-8

  • 23

    HanYWangYDongXSunDLiuZYueJet al. TISCH2: expanded datasets and new tools for single-cell transcriptome analyses of the tumor microenvironment. Nucleic Acids Res. (2023) 51:D1425–d31. doi: 10.1093/nar/gkac959

  • 24

    MistryPNakaboSO’NeilLGoelRRJiangKCarmona-RiveraCet al. Transcriptomic, epigenetic, and functional analyses implicate neutrophil diversity in the pathogenesis of systemic lupus erythematosus. Proc Natl Acad Sci U S A. (2019) 116:25222–8. doi: 10.1073/pnas.1908576116

  • 25

    SubramanianANarayanRCorselloSMPeckDDNatoliTELuXet al. A next generation connectivity map: L1000 platform and the first 1,000,000 profiles. Cell. (2017) 171:143752.e17. doi: 10.1016/j.cell.2017.10.049

  • 26

    FriedmanJHastieTTibshiraniR. Regularization paths for generalized linear models via coordinate descent. J Stat Software. (2010) 33:122. doi: 10.18637/jss.v033.i01

  • 27

    LinXYangFZhouLYinPKongHXingWet al. A support vector machine-recursive feature elimination feature selection method based on artificial contrast variables and mutual information. J Chromatogr B Analyt Technol BioMed Life Sci. (2012) 910:149–55. doi: 10.1016/j.jchromb.2012.05.020

  • 28

    XiaJBennerMJHancockRE. NetworkAnalyst–integrative approaches for protein-protein interaction network analysis and visual exploration. Nucleic Acids Res. (2014) 42:W167–74. doi: 10.1093/nar/gku443

  • 29

    NewmanAMLiuCLGreenMRGentlesAJFengWXuYet al. Robust enumeration of cell subsets from tissue expression profiles. Nat Methods. (2015) 12:453–7. doi: 10.1038/nmeth.3337

  • 30

    KerrKFBrownMDZhuKJanesH. Assessing the clinical impact of risk prediction models with decision curves: guidance for correct interpretation and appropriate use. J Clin Oncol. (2016) 34:2534–40. doi: 10.1200/JCO.2015.65.5654

  • 31

    YoshiharaKShahmoradgoliMMartínezEVegesnaRKimHTorres-GarciaWet al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat Commun. (2013) 4:2612. doi: 10.1038/ncomms3612

  • 32

    JiangPGuSPanDFuJSahuAHuXet al. Signatures of T cell dysfunction and exclusion predict cancer immunotherapy response. Nat Med. (2018) 24:1550–8. doi: 10.1038/s41591-018-0136-1

  • 33

    WangJBQiuQZZhengQLZhaoYJXuYZhangTet al. Tumor immunophenotyping-derived signature identifies prognosis and neoadjuvant immunotherapeutic responsiveness in gastric cancer. Adv Sci (Weinh). (2023) 10:e2207417. doi: 10.1002/advs.202207417

  • 34

    ZhouYZhouBPacheLChangMKhodabakhshiAHTanaseichukOet al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun. (2019) 10:1523. doi: 10.1038/s41467-019-09234-6

  • 35

    QiuXMaoQTangYWangLChawlaRPlinerHAet al. Reversed graph embedding resolves complex single-cell trajectories. Nat Methods. (2017) 14:979–82. doi: 10.1038/nmeth.4402

  • 36

    XunZDingXZhangYZhangBLaiSZouDet al. Reconstruction of the tumor spatial microenvironment along the Malignant-boundary-nonmalignant axis. Nat Commun. (2023) 14:933. doi: 10.1038/s41467-023-36560-7

  • 37

    YuRZhangCYuanMYeSHuTChenSet al. Exercise-induced metabolite N-lactoyl-phenylalanine ameliorates colitis by inhibiting M1 macrophage polarization via the suppression of the NF-κB signaling pathway. Cell Mol Gastroenterol Hepatol. (2025) 19:101558. doi: 10.1016/j.jcmgh.2025.101558

  • 38

    YuanMChenSLinZYuRChaoKYeSet al. ACSS2-mediated histone H4 lysine 12 crotonylation (H4K12cr) alleviates colitis via enhancing transcription of CLDN7. Adv Sci (Weinh). (2025) 12:e00461. doi: 10.1002/advs.202500461

  • 39

    PulendranBDavisMM. The science and medicine of human immunology. Science. (2020) 369:eaay4014. doi: 10.1126/science.aay4014

  • 40

    LouHLingGSCaoX. Autoantibodies in systemic lupus erythematosus: From immunopathology to therapeutic target. J Autoimmun. (2022) 132:102861. doi: 10.1016/j.jaut.2022.102861

  • 41

    MaQHeXWangXZhaoGZhangYSuCet al. PTPN14 aggravates neointimal hyperplasia via boosting PDGFRβ signaling in smooth muscle cells. Nat Commun. (2024) 15:7398. doi: 10.1038/s41467-024-51881-x

  • 42

    LiHGuanBLiuSLiuHSongLZhangGet al. PTPN14 promotes gastric cancer progression by PI3KA/AKT/mTOR pathway. Cell Death Dis. (2023) 14:188. doi: 10.1038/s41419-023-05712-4

  • 43

    ChandraASenapatiSRoySChatterjeeGChatterjeeR. Epigenome-wide DNA methylation regulates cardinal pathological features of psoriasis. Clin Epigenetics. (2018) 10:108. doi: 10.1186/s13148-018-0541-9

  • 44

    GötzeSFeldhausVTraskaTWolterMReifenbergerGTannapfelAet al. ECRG4 is a candidate tumor suppressor gene frequently hypermethylated in colorectal carcinoma and glioma. BMC Cancer. (2009) 9:447. doi: 10.1186/1471-2407-9-447

  • 45

    DorschnerRALeeJCohenOCostantiniTBairdAEliceiriBP. ECRG4 regulates neutrophil recruitment and CD44 expression during the inflammatory response to injury. Sci Adv. (2020) 6:eaay0518. doi: 10.1126/sciadv.aay0518

  • 46

    ChenXGaoDWangMWangLHuHWenCet al. Neutrophil extracellular traps in systemic lupus erythematosus: pathogenic mechanisms, crosstalk with oxidative stress, and antioxidant therapeutic potential. Antioxidants (Basel). (2025) 15:25. doi: 10.3390/antiox15010025

  • 47

    GuanHHuangLLiuYZhuEChenLLiWet al. Anti-PS igG immune complexes impair macrophage phagocytosis in SLE via LOX-dependent oxidative stress. J Inflammation Res. (2025) 18:11521–38. doi: 10.2147/JIR.S527859

  • 48

    ChenDJinZChuHWuYBianYYuanTet al. DNASE1L3-expressing dendritic cells promote CD8(+) T cell function and anti-PD-(L)1 therapy efficacy by degrading neutrophil extracellular traps. Cancer Cell. (2025) 43:175875.e8. doi: 10.1016/j.ccell.2025.07.014

  • 49

    LocatiMCurtaleGMantovaniA. Diversity, mechanisms, and significance of macrophage plasticity. Annu Rev Pathol. (2020) 15:123–47. doi: 10.1146/annurev-pathmechdis-012418-012718

  • 50

    YooEJOhKHPiaoHKangHJJeongGWParkHet al. Macrophage transcription factor TonEBP promotes systemic lupus erythematosus and kidney injury via damage-induced signaling pathways. Kidney Int. (2023) 104:163–80. doi: 10.1016/j.kint.2023.03.030

  • 51

    ZhouHBalintDShiQVartanianTKriegelMABritoI. Lupus and inflammatory bowel disease share a common set of microbiome features distinct from other autoimmune disorders. Ann Rheum Dis. (2025) 84:93105. doi: 10.1136/ard-2024-225829

  • 52

    SisirakVSallyBD’AgatiVMartinez-OrtizWÖzçakarZBDavidJet al. Digestion of chromatin in apoptotic cell microparticles prevents autoimmunity. Cell. (2016) 166:88101. doi: 10.1016/j.cell.2016.05.034

  • 53

    Ortíz-FernándezLMartínJAlarcón-RiquelmeME. A summary on the genetics of systemic lupus erythematosus, rheumatoid arthritis, systemic sclerosis, and sjögren’s syndrome. Clin Rev Allergy Immunol. (2023) 64:392411. doi: 10.1007/s12016-022-08951-z

  • 54

    LiWNakanoHFanWLiYSilPNakanoKet al. DNASE1L3 enhances antitumor immunity and suppresses tumor progression in colon cancer. JCI Insight. (2023) 8:e168161. doi: 10.1172/jci.insight.168161

Summary

Keywords

colorectal cancer, DNASE1L3, efferocytosis, macrophages, systemic lupus erythematosus

Citation

Guan H, Tian C, Zhong M, Zhang L, Wang W, Huang M and Chen D (2026) Shared immune dysregulation in systemic lupus erythematosus and colorectal cancer: a multi-omics guided discovery of DNASE1L3-centric efferocytosis deficiency. Front. Immunol. 17:1775776. doi: 10.3389/fimmu.2026.1775776

Received

26 December 2025

Revised

05 February 2026

Accepted

09 February 2026

Published

26 February 2026

Volume

17 - 2026

Edited by

Xiaosheng Tan, Rutgers, The State University of New Jersey, United States

Reviewed by

Heli Li, Huazhong University of Science and Technology, China

Chi Zhang, University of Texas MD Anderson Cancer Center, United States

Updates

Copyright

*Correspondence: Duo Chen, ; Mingcheng Huang, ; Wenjing Wang,

†These authors have contributed equally to this work

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics