ORIGINAL RESEARCH article

Front. Med., 01 December 2022

Sec. Rheumatology

Volume 9 - 2022 | https://doi.org/10.3389/fmed.2022.995103

Potential biomarkers for active renal involvement in systemic lupus erythematosus patients

  • 1. Department of Rheumatology, Hainan General Hospital, Hainan Affiliated Hospital of Hainan Medical University, Haikou, China

  • 2. Department of Respiratory, Hainan General Hospital, Hainan Affiliated Hospital of Hainan Medical University, Haikou, China

Abstract

Objective:

This study aimed to identify the key genes related to active renal involvement in patients with systemic lupus erythematosus (SLE).

Methods:

Microarray datasets were downloaded from the Gene Expression Omnibus (GEO) database. Differentially expressed genes (DEGs) between SLE patients with active renal involvement and those who did not have active renal involvement were identified by R software. Hub genes were identified using protein–protein interaction networks. The relationships between the expression levels of identified hub genes and SLEDAI were subjected to linear correlation analysis. The diagnostic accuracy of the hub genes was evaluated with the area under the curve of the receiver operating characteristic curve (ROC-AUC). Transcription factors (TFs) were predicted. The expression levels of different hub genes and histopathological patterns were also examined.

Results:

A total of 182 DEGs were identified. Enrichment analysis indicated that DEGs were primarily enriched in neutrophil degranulation, neutrophil activation involved in immune response and neutrophil activation. The expression levels of 12 identified hub genes were verified. Ten of the 12 hub genes were positively associated with SLEDAI. The combination model of DEFA4, CTSG, RETN, CEACAM8, TOP2A, LTF, MPO, ELANE, BIRC5, and LCN2 had a certain diagnostic accuracy in detecting renal involvement with high disease activity in SLE patients. The expressions of five predicted TFs were validated by GSE65391 dataset.

Conclusion:

This work explored the pathogenesis of renal involvement in SLE. These results may guide future experimental research and clinical transformation.

Introduction

Systemic lupus erythematosus (SLE) is an autoimmune disease with clinically heterogeneity; it predominantly affects young women (1). Renal involvement can be seen in up to 70% of patients with SLE and is the most critical predictor of the morbidity and mortality of SLE. Manifestations of renal involvement can vary from macroscopic proteinuria and hematuria to nephrotic syndrome, cast excretion, and end-stage renal disease (2). Considering that the severe complications may be caused by renal involvement, and the treatment options for renal involvement are limited, novel biomarkers that can monitor and predict the progression of renal involvement need to be identified (3).

Bioinformatics is a branch of computer science that is widely used to explore promising biomarkers to improve disease diagnosis and treatment at the genome level (46). Numerous bioinformatic studies have demonstrated different abnormal expression levels of genes associated with the development of lupus nephrits (LN). In 2021, Zhimin Chen et al. downloaded kidney biopsy sequencing data to identify LN hub genes and differentially expressed genes (DEGs). They discovered six valuable biomarkers (HLA-DMA, HLA-DPA1, HLA-DPB1, HLA-DRA, IL10RA, and IRF8) that are strongly correlated with LN diagnosis and prognosis (7). In addition, a group of researchers used single-cell RNA sequencing to investigate the immune cell landscape in the kidneys of patients with LN. They found evidence that the local activation of B cells was correlated with an age-associated B-cell signature; a clear interferon response was observed in most cells. Two chemokine receptors, namely, CXCR4 and CX3CR1, were broadly expressed, thereby implying their potentially central role in cell trafficking (8). Furthermore, Zhaocheng Dong and his colleagues investigated the differences in molecular mechanisms and key biomarkers between membranous nephropathy and LN. They screened out six hub genes (IFI6, MX1, XAF1, HERC6, IFI44L, and IFI44) between the biopsy samples of these two nephritises (9). Meanwhile, Andrea Fava et al. analyzed the patterns of 1000 urine protein biomarkers in 30 patients with active LN. They identified an interferon-γ response gradient in LN (10). Studies focusing on renal involvement in patients with SLE mainly used renal biopsy or urine. However, analysis concerning whole blood samples was limited. As we all know, blood sample is easy to obtain and the DEGs in blood from indicated groups could offer information concerning disease pathogenesis. Moreover, identified DEGs can stratify patients with different organ involvement. Therefore, biomarkers in blood are of great value in identifying high risk patients with renal involvement. Through the combination of microarray and bioinformatics analyses, exploring potential key genes and pathway networks that are closely related to renal involvement is possible.

The two datasets including in our study was GSE49454 and GSE65391. The previous studies concerning these two datasets mainly focusing on detecting possible pathogenesis of SLE. The original article about GSE49454 revealed that complex interferon (IFN) signatures in SLE, which are not restricted to the previous IFNα signature, but which also involve IFNβ and IFNγ (11). In addition, GSE65391 also discovered a prevalent IFN signature and identified a plasma blast signature as the most robust biomarker of disease activity (12). However, both studies did not analyze the key genes related to active renal involvement, which is the most often and most severe complication, in patients with SLE. In this study, we used bioinformatics approaches to screen for biomarkers for active renal involvement in patients with SLE. In addition, the transcriptional factors (TFs) were predicted by database search and a TF-message RNA network was constructed. These results may guide future experimental research and clinical transformation.

Materials and methods

Data collection

“Systemic lupus erythematosus” was used as the keyword to search for expression profiling of SLE in the Gene Expression Omnibus (GEO) database, which is a public repository database (13). Studies that met the following criteria were included, as follows: (1) whole genome expression data of SLE, (2) datasets containing more than five samples, and (3) datasets containing renal involvement information about the samples. Finally, one dataset GSE49454 (GPL10558), which included 64 active renal involvement samples and 93 without active renal involvement samples, was selected as the test set (11). One dataset GSE65391 (GPL10558), which included 69 active renal involvement samples and 68 without active renal involvement samples, was selected as the validation set (12). Active renal involvement was defined by the presence of at least one component of the renal SLEDAI, including urinary casts, hematuria, proteinuria, and pyuria. Samples with hematuria attributable to menstruation were excluded. In GSE49454 dataset, “renal: Y” was used to indicate active renal involvement. In GSE65391, “renal: 1” was used to indicate active renal involvement. Their basic details are listed in Table 1 and the basic information of our test set, GSE49454 is shown in Supplementary Table 1. A total of 86 patients in GSE65391 underwent renal biopsy. Meanwhile, 47 patients did not have renal biopsy at the time of the visit, which recorded as “no-LN.” The histopathological patterns, including membranous, proliferative, and non-proliferative, of 86 patients in GSE65391 were recorded. The histopathological patterns of four patients in GSE65391 were not available in the dataset. The detailed clinical information of GSE65391 is listed in Table 2 and Supplementary Figure 1. The overall flowchart of this study is shown in Figure 1.

TABLE 1

GEO accessionPlatformSamples
Source tissueSLE patients
AttributeDiagnostic criteria
SLEHCActive renal involvementWithout active renal involvement
GSE49454GPL1055815720Whole blood6493Test set1997 ACR criteria for SLE (42)
GSE65391GPL1055813753Whole blood6968Validation setNot mentioned

Information for selected microarray datasets.

Active renal involvement: defined by the presence of at least one component of the renal SLEDAI.

TABLE 2

Non-LNMembranousProliferativeNon-proliferative
Age (mean ± SD)14.26 ± 2.6715.14 ± 1.6713.59 ± 2.8613.19 ± 3.68
Sex (female/male)42/57/254/1012/1
Number of patients with active renal involvement118433
SLEDAI (mean ± SD)5.77 ± 4.217.44 ± 1.5112.14 ± 8.517.54 ± 8.41
Number of patients been biopsied at first visit04397
Days since kidney biopsy (mean ± SD)-450 ± 710.47528.54 ± 704.711054.86 ± 1462.02

Detailed clinical information of GSE65391.

LN: lupus nephritis; Active renal involvement: defined by the presence of at least one component of the renal SLEDAI.

FIGURE 1

Identification of differentially expressed genes

The raw expression data of GSE49454 were analyzed. The DEGs between patients with active renal involvement and those without active renal involvement were obtained through the online web-based tool GEO2R. An adjusted P value < 0.05 was considered statistically significant. The graphs of heatmap, Uniform Manifold Approximation, and Projection (UMAP) and Principal Component Analysis (PCA) were analyzed and visualized by RStudio1. The package used for UMAP was Umap (version 0.2.7.0), and the package used for PCA was Stats (version 3.6.0).

Functional enrichment analysis

Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG)2 enrichment analyses for the identified DEGs were performed by R packages (clusterProfile, ggplot2, and GOplot) (14). The ClusterProfile package was used to analyze the DEGs. The Ggplot2 and GOplot packages were used to visualize the results.

Construction of protein–protein interaction network and identification of hub genes

The DEGs were analyzed by using the online tool STRING3 to construct the PPI network. The cut-off standard was set as a combined score >0.4 (15). Then, the results were visualized by CytoScape software. Molecular Complex Detection (MCODE) V1.5.1, which is a plug-in of CytoScape, was used to identify significant modules (MCODE score ≥4) (16). GO and KEGG analyses were also used for the identified modules. Moreover, the hub genes were selected using CytoHubba, which is another plug-in of Cytoscape, according to the number of associations with other genes in the PPI network (17). Seven common algorithms [Maximum Neighborhood Component (MNC), Density of Maximum Neighborhood Component (DMNC), Maximal Clique Centrality (MCC), Degree, Closeness, Radiality, and Stress] were used in evaluating and selecting hub genes.

Prediction of transcription factors

Transcriptional Regulatory Relationships Unraveled by Sentence Based Text Mining (TRRUST), a database for the prediction of transcriptional regulatory networks, was used in predicting TFs that regulate hub genes, and an adjusted P value of <0.05 was considered significant (18).

Statistical analysis

Statistical analysis was performed with Rstudio software and IBM SPSS Statistics 22 (SPSS, Inc., Chicago, IL, USA). Continuous variables were presented as the mean ± standard deviation (SD). The expression levels of the identified hub genes were validated by GSE65391 using Mann–Whitney U test, as the samples do not satisfy the normality test. The area under the curve of the receiver operating characteristic curve (ROC-AUC) was used to compare the diagnostic performance of different hub genes. Linear correlation analysis was performed by the software GraphPad Prism 7 to determine the relationship between SLE disease activity index (SLEDAI) and the expression levels of the identified hub genes. Pearson correlation coefficient was used to calculate the correlation coefficients.

Results

Identification of common differentially expressed genes

By analyzing the differences between patients with active renal involvement and those without active renal involvement with two-group comparison, 182 DEGs from GSE49454 were identified. DEGs with adj. P value <0.05 were first screened out and the expression of top20 genes with highest and lowest expression was visualized in heatmap, which is shown in Figure 2A. The top 20 genes with highest and lowest expression in patients with renal involvement and without renal involvement were clustered on the heat map respectively. The logFC value and adjusted P value of the identified182 DEGs in GSE65391 were listed in Supplementary Table 2. The PCA and UMAP are shown in Figures 2B,C. Group1 stands for the patients without active renal involvement and group2 stands for patients with active renal involvement. PCA demonstrated that variations were represented by active renal involvement and without active renal involvement in GSE49454 for 4.4% and 14.6% respectively. In addition, Figure 2C presents the UMAP of GSE49454. However, there is not good discrimination in either the PCA or UMAP analysis, indicating that the difference between samples can be explained by PCA map and UMAP is limited.

FIGURE 2

Biological functions analyses, protein–protein interaction network construction, and molecular complex detection cluster module identification

Gene ontology and KEGG analyses were used for analyzing the 182 common DEGs (Figures 3A,B) (1921). Based on GO enrichment, the biological process acted primarily on neutrophil degranulation, neutrophil activation involved in immune response, and neutrophil activation. These proteins were primarily located in specific granule, secretory granule lumen, and primary lysosome. For molecular functions, the proteins played roles in serine-type peptidase activity, serine hydrolase activity, and lipopolysaccharide binding. According to KEGG pathway analysis, these proteins were primarily involved in transcriptional misregulation in cancer and Staphylococcus aureus infection (Table 3). The PPI network for the 182 DEGs was constructed after the common DEGs were imported to STRING (Figure 3C).

FIGURE 3

TABLE 3

OntologyIDDescriptionGeneRatioBgRatiop.adjust
BPGO:0043312Neutrophil degranulation30/123485/186706.45e-18
BPGO:0002283Neutrophil activation involved in immune response30/123488/186706.45e-18
BPGO:0042119Neutrophil activation30/123498/186706.45e-18
CCGO:0042581Specific granule19/129160/197171.95e-16
CCGO:0034774Secretory granule lumen21/129321/197172.37e-13
CCGO:0005766Primary lysosome16/129155/197172.37e-13
MFGO:0008236Serine-type peptidase activity8/122182/176970.007
MFGO:0017171Serine hydrolase activity8/122186/176970.007
MFGO:0001530Lipopolysaccharide binding4/12235/176970.010
KEGGhsa05202Transcriptional misregulation in cancer7/60192/80760.052
KEGGhsa05150Staphylococcus aureus infection5/6096/80760.052

GO and KEGG analysis of DEGs.

BP, biological process; CC, cellular component; MF, molecular function; KEGG, kyoto encyclopedia of genes and genomes.

Significant modules of the PPI network were identified by MCODE. An MCODE score of 4 was set as a threshold. Two modules with MCODE scores of ≥4 are illustrated in Figure 4. One cluster (MCODE score = 13.625) had 17 nodes and 109 edges (Figure 4A). GO analysis showed that the proteins in the cluster were related to keratinization, keratinocyte differentiation, and epidermal cell differentiation (Figures 4B,C). KEGG pathway analysis showed that these proteins were primarily involved in neuroactive ligand-receptor interaction, retinol metabolism, and S. aureus infection (Figures 4B,C). The other module (MCODE score = 8.5) had 9 nodes and 34 edges (Figure 4D). Since the logFC of the DEGs in cluster 2 were not substantial, the enrichment result may have bias.

FIGURE 4

Selection and analysis of hub genes

PPI is a useful way for presenting many types of biological data. We can measure nodes by their network features to infer their importance in the network, and it can help us identify central elements of biological networks. CytoHubba provides different topological analysis methods including Degree, MNC, DMNC, MCC, Closeness, Radiality, and Stress based on shortest paths (17). A hub gene is defined as a gene that plays a critical role in biological processes and is often influenced by the regulation of other genes in related pathways. Therefore, hub genes are often an important action target and a hot area of research. The top 30 hub genes were calculated using the abovementioned seven algorithms of the plug-in CytoHubba (Figure 5A). The red ones represented high scores and yellow ones represented low scores. After the determination of the intersection of the UpSet diagram, 14 common hub genes were discovered, namely, defensin alpha 4 (DEFA4), cathepsin G (CTSG), resistin (RETN), CEA cell adhesion molecule 8 (CEACAM8), proteinase 3 (PRTN3), DNA topoisomerase II alpha (TOP2A), lactotransferrin (LTF), protein regulator of cytokinesis 1 (PRC1), myeloperoxidase (MPO), elastase, neutrophil expressed (ELANE), matrix metallopeptidase 8 (MMP8), baculoviral IAP repeat containing 5 (BIRC5), hyaluronan mediated motility receptor (HMMR), and lipocalin 2 (LCN2,also known as NGAL; Figure 5B). Table 4 shows the GO and KEGG analysis of the 14 common hub genes. According to GO enrichment, the biological process acted mainly on neutrophil degranulation, neutrophil activation involved in immune response, and neutrophil activation, and these proteins were mainly located in secretory granule lumen, cytoplasmic vesicle lumen, and vesicle lumen. As to molecular functions, these proteins mainly took part in serine-type endopeptidase activity, serine-type peptidase activity, and serine hydrolase activity. Meanwhile, KEGG pathway analysis presented that these proteins were mainly involved in transcriptional misregulation in cancer, platinum drug resistance, and SLE.

FIGURE 5

TABLE 4

OntologyIDDescriptionp.adjustGene ID
BPGO:0043312Neutrophil degranulation1.96e-11CEACAM8/CTSG/DEFA4/ELANE/LCN2/LTF/MMP8/MPO/PRTN3/RETN
BPGO:0002283Neutrophil activation involved in immune response1.96e-11CEACAM8/CTSG/DEFA4/ELANE/LCN2/LTF/MMP8/MPO/PRTN3/RETN
BPGO:0042119Neutrophil activation1.96e-11CEACAM8/CTSG/DEFA4/ELANE/LCN2/LTF/MMP8/MPO/PRTN3/RETN
CCGO:0034774Secretory granule lumen3.80e-12CTSG/DEFA4/ELANE/LCN2/LTF/MMP8/MPO/PRTN3/RETN
CCGO:0060205Cytoplasmic vesicle lumen3.80e-12CTSG/DEFA4/ELANE/LCN2/LTF/MMP8/MPO/PRTN3/RETN
CCGO:0031983Vesicle lumen3.80e-12CTSG/DEFA4/ELANE/LCN2/LTF/MMP8/MPO/PRTN3/RETN
MFGO:0004252Serine-type endopeptidase activity4.21e-06CTSG/ELANE/LTF/MMP8/PRTN3
MFGO:0008236Serine-type peptidase activity4.21e-06CTSG/ELANE/LTF/MMP8/PRTN3
MFGO:0017171Serine hydrolase activity4.21e-06CTSG/ELANE/LTF/MMP8/PRTN3
KEGGhsa05202Transcriptional misregulation in cancer0.013DEFA4/ELANE/MPO
KEGGhsa01524Platinum drug resistance0.021BIRC5/TOP2A
KEGGhsa05322Systemic lupus erythematosus0.047CTSG/ELANE

GO and KEGG analysis of 14 common hub genes.

BP, biological process; CC, cellular component; MF, molecular function; KEGG, kyoto encyclopedia of genes and genomes.

Validation of hub genes expression in GSE65391

The GSE65391 dataset was used to verify the expression of the identified hub genes. The expression levels of DEFA4, CTSG, RETN, CEACAM8, PRTN3, TOP2A, LTF, MPO, ELANE, MMP8, BIRC5, and LCN2 (also known as NGAL) were significantly increased in the active renal involvement samples compared with those without active renal involvement samples (P < 0.05, Figure 6A).

FIGURE 6

Receiver operating characteristic curves of 12 verified hub genes in renal involvement samples

The series matrix file of GSE65391 that offers the different expression levels of the identified hub genes was imported into the RStudio. The software calculated the sensitivity, specificity, cut-off value, and AUC of the 12 verified hub genes (Table 5). LCN2 (also known as NGAL) has a certain diagnostic accuracy with the AUC over 0.7 (Figures 6B,C). The combination model of the 12 hub genes has a certain diagnostic accuracy in detecting active renal involvement patients among SLE patients (Figure 6D).

TABLE 5

RankGene symbolSensitivity (%)Specificity (%)AUC
(95% CI)
Cut-off value
1DEFA458750.686 (0.598-0.775)8.063
2CTSG62.370.60.675 (0.585-0.766)6.103
3RETN65.270.60.697 (0.609-0.786)6.219
4CEACAM882.751.50.684 (0.595-0.773)5.418
5PRTN347.883.80.643 (0.563-0.722)3.623
6TOP2A43.585.30.634 (0.543-0.726)5.263
7LTF62.370.60.682 (0.592-0.772)6.846
8MPO7164.70.686 (0.597-0.775)4.547
9ELANE63.8750.692 (0.602-0.782)7.053
10MMP849.373.50.626 (0.544-0.709)3.666
11BIRC553.667.60.597 (0.508-0.685)3.476
12LCN265.272.10.738 (0.654-0.821)9.888
Model91.352.90.775 (0.697-0.853)−0.65

The sensitivity, specificity, and AUC of the 12 verified hub genes in detecting renal involvement in SLE.

AUC, area under the curve; CI, confidence interval. Combination model: −10.9192 + 0.2482 * DEFA4 + −0.159 * CTSG + 0.2673 * RETN + −0.2245 * CEACAM8 + −0.5405 * PRTN3 + −0.3169 * LTF + 0.4004 * MPO + −0.1196 * ELANE + 1.1 * LCN2 + −0.1195 * MMP8 + 0.3778 * BIRC5 + 0.2318 * TOP2A.

Correlation between SLE disease activity index and different hub genes in GSE65391

Since the active renal involvement was defined by the presence of at least one component of the renal SLEDAI, linear correlation analysis was performed to clarify the relationship between SLEDAI and the expression of different hub genes. The results are shown in Figure 7. In the analysis process, 11 of the 14 hub genes, namely, DEFA4, CTSG, RETN, CEACAM8, TOP2A, LTF, MPO, ELANE, BIRC5, HMMR, and LCN2 (also known as NGAL), were statistically positively associated with SLEDAI (P < 0.05, Figure 7). Since the expression of HMMR was not validated by GSE65391, 10 genes which were validated and positively related with SLEDAI were included in the following analyses.

FIGURE 7

Receiver operating characteristic curves of the 10 identified hub genes in detecting samples with active renal involvement and high disease activity (SLEDAI > 15)

As 10 of the 14 hub genes were statistically positively associated with SLEDAI and active renal involvement stands for the presence of at least one component of the renal SLEDAI, we further examined the diagnostic ability in identifying samples with active renal involvement and high disease activity (SLEDAI > 15). All 10 hub genes had a certain diagnostic accuracy with AUC values of over 0.7 (Figures 8A,B). The combination model of the 10 hub genes had a certain diagnostic accuracy (AUC = 0.846) in detecting patients with renal involvement and with high disease activity (SLEDAI > 15, Figure 8C). The sensitivity, specificity, cut-off value, and AUC of the 10 hub genes are listed in Table 6. The combination model was 14.6627 + −0.3795 * DEFA4 + 0.2401 * CTSG + −0.0942 * RETN + −0.0114 * CEACAM8 + −0.2822 * TOP2A + 0.5422 * LTF + 0.1112 * MPO + −0.1143 * ELANE + −0.8064 * BIRC5 + −1.0725 * LCN2.

FIGURE 8

TABLE 6

RankGene symbolSensitivity (%)Specificity (%)AUC
(95% CI)
Cut-off value
1DEFA460.379.30.740 (0.638-0.841)7.17
2CTSG57.486.20.721 (0.614-0.828)5.348
3RETN66.282.80.733 (0.627-0.839)5.997
4CEACAM858.889.70.736 (0.627-0.844)6.07
5TOP2A85.362.10.718 (0.605-0.830)5.286
6LTF7565.50.720 (0.611-0.828)7.244
7MPO64.779.30.725 (0.613-0.836)4.547
8ELANE60.389.70.756 (0.658-0.854)6.378
9BIRC567.672.40.709 (0.597-0.821)3.49
10LCN276.575.90.775 (0.669-0.881)10.074
Model80.979.30.846 (0.762-0.930)0.699

The sensitivity, specificity, and AUC of the identified hub genes in detecting renal involvement patients with SLEDAI > 15.

SLEDAI, systemic lupus erythematosus disease activity index; AUC, area under the curve; CI, confidence interval. Combination model: 14.6627 + −0.3795 * DEFA4 + 0.2401 * CTSG + −0.0942 * RETN + −0.0114 * CEACAM8 + −0.2822 * TOP2A + 0.5422 * LTF + 0.1112 * MPO + −0.1143 * ELANE + −0.8064 * BIRC5 + −1.0725 * LCN2.

Prediction and verification of transcriptional factors

Nine TFs that may regulate the expression of the hub genes were identified on the basis of the TRRUST database (Table 7). CCAAT/enhancer binding protein (C/EBP), epsilon (CEBPE), Sp1 transcription factor (SP1), lymphoid enhancer-binding factor 1 (LEF1), v-myb myeloblastosis viral oncogene homolog (avian) (MYB), runt-related transcription factor 1 (RUNX1), spleen focus forming virus (SFFV) proviral integration oncogene spi1 (SPI1), E2F transcription factor 1 (E2F1), v-rel reticuloendotheliosis viral oncogene homolog A (avian) (RELA), and nuclear factor of kappa light polypeptide gene enhancer in B-cells 1 (NFKB1) were predicted to have the capability to regulate six hub genes (LTF, CTSG, MPO, BIRC5, RETN, and ELANE) by acting as TFs. During further verification, the expression levels of five TFs, including CEBPE, SP1, LEF1, MYB, and SPI1, significantly changed between patients with renal involvement and those without renal involvement (P < 0.05, Figure 9A). The constructed network of TFs regulating message RNA is shown in Figure 9B.

TABLE 7

Key TFDescriptionP-valueList of overlapped genes
CEBPECCAAT/Enhancer binding protein (C/EBP), epsilon5.30E-06LTF, CTSG
SP1Sp1 transcription factor7.20E-05MPO, BIRC5, RETN, LTF
LEF1Lymphoid enhancer-binding factor 10.000102BIRC5, ELANE
MYBV-myb myeloblastosis viral oncogene homolog (avian)0.000167ELANE, CTSG
RUNX1Runt-related transcription factor 10.000195MPO, ELANE
SPI1Spleen focus forming virus (SFFV) proviral integration oncogene spi10.00047ELANE, CTSG
E2F1E2F transcription factor 10.00217BIRC5, TOP2A
RELAV-rel reticuloendotheliosis viral oncogene homolog A (avian)0.0105BIRC5, LCN2
NFKB1Nuclear factor of kappa light polypeptide gene enhancer in B-cells 10.0106LCN2, BIRC5

Key transcriptional factors (TFs) of hub genes.

FIGURE 9

Discussion

The main purpose of our study is to identify the key genes related to active renal involvement in patients with SLE. A total of 182 DEGs were detected between patients with active renal involvement and those without active renal involvement. This study is a re-analysis of previous existed GEO datasets. The previous two study mainly focused on detecting possible pathogenesis of SLE (11, 12). However, both studies did not analyze the key genes related to active renal involvement, which is the most often and most severe complication, in patients with SLE. Therefore, we performed this study on the base of the two datasets. Of the DEGs detected, 14 were hub genes and 12 were verified by using the GSE65391 dataset. GO enrichment analysis revealed that the DEGs were significantly enriched in neutrophil degranulation, neutrophil activation that is involved in immune response, and neutrophil activation. Moreover, 10 hub genes, namely, DEFA4, CTSG, RETN, CEACAM8, TOP2A, LTF, MPO, ELANE, BIRC5, and LCN2 (also known as NGAL), were statistically positive related to SLEDAI and were able to detect patients with active renal involvement who had high disease activity (SLEDAI > 15). Moreover, a TF-message RNA network was constructed on the basis of database searching and verification by another dataset.

Neutrophils are key effector cells of innate immunity that are rapidly recruited to defend the host against invading pathogens. Neutrophils may kill pathogens by degranulation and through the release of neutrophil extracellular traps. After cell activation by different stimuli, granule contents are released into the phagosome or in the extracellular space through degranulation (22). Neutrophil-derived reactive oxygen species and granule proteases are implicated in the damage to and destruction of host tissues in the vascular tissue of SLE patients (23). In addition, accumulating evidence showed that dysregulated neutrophil activation contributes to SLE pathogenesis. According to our results, neutrophil degranulation and activation were upregulated in active renal involvement patients with SLE. Therefore, stabilizing the function of neutrophil may be a novel therapeutic strategy.

Furthermore, eight hub genes that may play roles in neutrophil degranulation and activation were detected, namely, CEACAM8, CTSG, DEFA4, ELANE, LCN2 (also known as NGAL), LTF, MPO, and RETN. The expressions of these eight hub genes increased in patients with active renal involvement; thus, the inhibition of these genes is a potential treatment option. CEACAM8, one of the cell adhesion molecules, is stored in specific neutrophils granules and is an activation marker of rapid neutrophils degranulation because of its increased expression in stimulated neutrophils (24). A previous study described a novel mechanism by which a natural danger-associated molecular pattern, with inflammatory properties in SLE, induces soluble CEACAM8 secretion (25). Defensins are a family of antimicrobial peptides of innate immunity with immunomodulatory properties. DEFA4, one of the members of defensins, is found in the granules of neutrophils and exhibits neutrophil α-defensin function (26). LTF, found in the secondary granules of neutrophils, is an important component of the non-specific immune system (27). The elevation of LTF in patients with renal involvement may result from the abnormal function of neutrophil degranulation and activation. LCN2 (also known as NGAL), a member of the lipocalin family, has a hydrophobic pocket that binds lipophilic molecules and is stored in human neutrophil granules. The upregulation of LCN2 was recently reported to correlate with proteinuria and renal flares in patients with SLE (28). Moreover, Weiwei Chen et al. proved that LCN2 is involved in LN development and acts as a driver of extraordinary expansion of Th1 cells (29). Therefore, targeting these four hub genes may have great potential in controlling active renal involvement in patients with SLE. ELANE and CTGS function as proteases during neutrophil degranulation and activation. When ELANE is activated, this protease hydrolyzes proteins within specialized neutrophil lysosomes called azurophil granules, as well as proteins of the extracellular matrix (30). CTGS may participate in the killing and digestion of engulfed pathogens and in connective tissue remodeling at inflammation sites (31). These two hub genes both play essential roles in neutrophil degranulation and activation and would be promising treatment targets. In addition, our study identified two hub genes which work as autoantigens in anti-neutrophil cytoplasmic antibody (ANCA)-associated vasculitis abnormally elevated in active renal involvement patients, including MPO and PRTN3. MPO stimulation of NETosis, a program for formation of neutrophil extracellular traps (NETs), which consist of modified chromatin decorated with bactericidal proteins from granules and cytoplasm, is one intriguing hypothesis for MPO directed pathogenicity (32, 33). Persistence of NET burden is associated with LN as well as elevated dsDNA antibodies and antiNET antibodies (34). PRTN3 encodes proteinase-3, which is another important autoantigens in ANCA-associated vasculitis. It enables to enzyme binding activity and involved it neutrophil extravasation process (35). Recently, a complement regulator C4BP was proved to limit the development of LN via inhibition of PRTN3 to significant downregulate neutrophils activity, indicating the possible link between ANCA-associated vasculitis and LN (36).

Our study also predicted the TFs of identified hub genes. Nine TFs were predicted to regulate eight hub genes. The expression of five TFs were validated by GSE65391. SPI1 is an Ets family transcription factor that is essential for lymphoid and myeloid development. A previous study demonstrated that the SNP in the 3-UTR of SPI1 is associated with elevated SPI1 mRNA level and with susceptibility to SLE (37). Meanwhile, SPI1 may participate in the pathogenesis of SLE (38). Our study detected that SPI1 was significantly upregulated in patients with renal involvement and SLE, thereby indicating its role in LN pathogenesis. CEBPE is essential for terminal differentiation and functional maturation of committed granulocyte progenitor cells. Aberrancies of immune cells in SLE can be traced back to the hematopoietic stem and progenitor cells associated with the abnormal function of CEBPE (39). SP1 is involved in many cellular processes and post-translational modifications as an activator or a repressor. An increasing amount of evidence demonstrates that SP1 plays an important regulatory role in the expression of several genes relevant to fibrosis (40). SP1 overexpression in the glomeruli of proliferative nephritis may be a result of the inflammatory process (41). SP1 was shown to be substantially elevated in patients with renal involvement. Current treatments are effective only in 30% of LN patients, thereby emphasizing the need for novel therapeutic strategies. Targeting these TFs to regulate the hub genes is promising in the future.

In conclusion, our study aimed to identify and verify hub genes and TFs that may serve as promising treatment targets for patients with active renal involvement in SLE. Ten genes were identified and verified as hub genes. The hub genes had a certain diagnostic accuracy in detecting patients with active renal involvement and high disease activity. GO and KEGG pathway enrichment analyses revealed that these genes were significantly enriched in neutrophil degranulation, neutrophil activation involved in immune response, and neutrophil activation. Moreover, five TFs were predicted to participate in the regulation of hub genes. The expressions of the five TFs were verified by another dataset. This study may guide future experimental research and clinical transformation.

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary material.

Author contributions

LX designed the study. LX and WX did data collection and wrote the manuscript. SL revised the manuscript. All authors read and approved the final manuscript.

Funding

This research was supported by Hainan Provincial Natural Science Foundation of China (820QN386). This project was supported by Hainan Province Clinical Medical Center.

Acknowledgments

We thank Professor Pascual, the corresponding author of GSE65391, for giving us warm help and guidance.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2022.995103/full#supplementary-material

References

  • 1.

    KiriakidouMChingCL. Systemic lupus erythematosus.Ann Intern Med. (2020) 172:Itc8196. 10.7326/aitc202006020

  • 2.

    FurieRRovinBHHoussiauFMalvarATengYKOContrerasGet alTwo-year, randomized, controlled trial of belimumab in lupus nephritis.N Engl J Med. (2020) 383:111728. 10.1056/NEJMoa2001180

  • 3.

    DiasRHasparykUGLopesMPde BarrosJSimões E SilvaAC. Novel biomarkers for lupus nephritis in the “OMICS” Era.Curr Med Chem. (2021) 28:601144. 10.2174/0929867328666210212102438

  • 4.

    KongJLiLZhiminLYanJJiDChenYet alPotential protein biomarkers for systemic lupus erythematosus determined by bioinformatics analysis.Comput Biol Chem. (2019) 83:107135. 10.1016/j.compbiolchem.2019.107135

  • 5.

    ChengQChenXWuHDuY. Three hematologic/immune system-specific expressed genes are considered as the potential biomarkers for the diagnosis of early rheumatoid arthritis through bioinformatics analysis.J Transl Med. (2021) 19:18. 10.1186/s12967-020-02689-y

  • 6.

    ZhaoXZhangLWangJZhangMSongZNiBet alIdentification of key biomarkers and immune infiltration in systemic lupus erythematosus by integrated bioinformatics analysis.J Transl Med. (2021) 19:35. 10.1186/s12967-020-02698-x

  • 7.

    ChenZLanRYeKChenHChenCXuY. Prioritization of diagnostic and prognostic biomarkers for lupus nephritis based on integrated bioinformatics analyses.Front Bioeng Biotechnol. (2021) 9:717234. 10.3389/fbioe.2021.717234

  • 8.

    AraziARaoDABerthierCCDavidsonALiuYHooverPJet alThe immune cell landscape in kidneys of patients with lupus nephritis.Nat Immunol. (2019) 20:90214. 10.1038/s41590-019-0398-x

  • 9.

    DongZDaiHLiuWJiangHFengZLiuFet alExploring the differences in molecular mechanisms and key biomarkers between membranous nephropathy and lupus nephritis using integrated bioinformatics analysis.Front Genet. (2021) 12:770902. 10.3389/fgene.2021.770902

  • 10.

    FavaABuyonJMohanCZhangTBelmontHMIzmirlyPet alIntegrated urine proteomics and renal single-cell genomics identify an IFN-γ response gradient in lupus nephritis.JCI Insight. (2020) 5:e138345. 10.1172/jci.insight.138345

  • 11.

    ChicheLJourde-ChicheNWhalenEPresnellSGersukVDangKet alModular transcriptional repertoire analyses of adults with systemic lupus erythematosus reveal distinct type I and type II interferon signatures.Arthritis Rheumatol. (2014) 66:158395. 10.1002/art.38628

  • 12.

    BanchereauRHongSCantarelBBaldwinNBaischJEdensMet alPersonalized immunomonitoring uncovers molecular networks that stratify lupus patients.Cell. (2016) 165:55165. 10.1016/j.cell.2016.03.008

  • 13.

    CloughEBarrettT. The gene expression omnibus database.Methods Mol Biol. (2016) 1418:93110. 10.1007/978-1-4939-3578-9_5

  • 14.

    YuGWangLGHanYHeQY. ClusterProfiler: an R package for comparing biological themes among gene clusters.Omics. (2012) 16:2847. 10.1089/omi.2011.0118

  • 15.

    SzklarczykDGableALLyonDJungeAWyderSHuerta-CepasJet alSTRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets.Nucleic Acids Res. (2019) 47:D60713. 10.1093/nar/gky1131

  • 16.

    BandettiniWPKellmanPManciniCBookerOJVasuSLeungSWet alMultiContrast delayed enhancement (MCODE) improves detection of subendocardial myocardial infarction by late gadolinium enhancement cardiovascular magnetic resonance: a clinical validation study.J Cardiovasc Magn Reson. (2012) 14:83. 10.1186/1532-429x-14-83

  • 17.

    ChinCHChenSHWuHHHoCWKoMTLinCY. CytoHubba: identifying hub objects and sub-networks from complex interactome.BMC Syst Biol. (2014) 8(Suppl. 4):S11. 10.1186/1752-0509-8-s4-s11

  • 18.

    HanHChoJWLeeSYunAKimHBaeDet alTRRUST v2: an expanded reference database of human and mouse transcriptional regulatory interactions.Nucleic Acids Res. (2018) 46:D3806. 10.1093/nar/gkx1013

  • 19.

    KanehisaMGotoS. KEGG: kyoto encyclopedia of genes and genomes.Nucleic Acids Res. (2000) 28:2730. 10.1093/nar/28.1.27

  • 20.

    KanehisaM. Toward understanding the origin and evolution of cellular organisms.Protein Sci. (2019) 28:194751. 10.1002/pro.3715

  • 21.

    KanehisaMFurumichiMSatoYIshiguro-WatanabeMTanabeM. KEGG: integrating viruses and cellular organisms.Nucleic Acids Res. (2021) 49:D54551. 10.1093/nar/gkaa970

  • 22.

    LodgeKMCowburnASLiWCondliffeAM. The impact of hypoxia on neutrophil degranulation and consequences for the host.Int J Mol Sci. (2020) 21:1183. 10.3390/ijms21041183

  • 23.

    Fresneda AlarconMMcLarenZWrightHL. Neutrophils in the pathogenesis of rheumatoid arthritis and systemic lupus erythematosus: same foe different M.O.Front Immunol. (2021) 12:649693. 10.3389/fimmu.2021.649693

  • 24.

    KurokiMYamanakaTMatsuoYOikawaSNakazatoHMatsuokaY. Immunochemical analysis of carcinoembryonic antigen (CEA)-related antigens differentially localized in intracellular granules of human neutrophils.Immunol Invest. (1995) 24:82943. 10.3109/08820139509060710

  • 25.

    RibonMMussardJSemeranoLSingerBBDeckerP. Extracellular chromatin triggers release of soluble CEACAM8 upon activation of neutrophils.Front Immunol. (2019) 10:1346. 10.3389/fimmu.2019.01346

  • 26.

    HuHDiBTolbertWDGohainNYuanWGaoPet alSystematic mutational analysis of human neutrophil α-defensin HNP4.Biochim Biophys Acta Biomembr. (2019) 1861:83544. 10.1016/j.bbamem.2019.01.007

  • 27.

    LuJGuevaraMAFrancisJDSpicerSKMooreREChambersSAet alAnalysis of susceptibility to the antimicrobial and anti-biofilm activity of human milk lactoferrin in clinical strains of Streptococcus agalactiae with diverse capsular and sequence types.Front Cell Infect Microbiol. (2021) 11:740872. 10.3389/fcimb.2021.740872

  • 28.

    YangCCHsiehSCLiKJWuCHLuMCTsaiCYet alUrinary neutrophil gelatinase-associated lipocalin is a potential biomarker for renal damage in patients with systemic lupus erythematosus.J Biomed Biotechnol. (2012) 2012:759313. 10.1155/2012/759313

  • 29.

    ChenWLiWZhangZTangXWuSYaoGet alLipocalin-2 exacerbates lupus nephritis by promoting Th1 cell differentiation.J Am Soc Nephrol. (2020) 31:226377. 10.1681/asn.2019090937

  • 30.

    ReadlerJMBurkeMRSharmaPExcoffonKKolawoleAO. Adenovirus co-opts neutrophilic inflammation to enhance transduction of epithelial cells.Viruses. (2021) 14:13. 10.3390/v14010013

  • 31.

    LiangYPengY. Gene body methylation facilitates the transcription of CTSG via antisense lncRNA AL136018.1 in dermatomyositic myoideum.Cell Biol Int. (2021) 45:45662. 10.1002/cbin.11508

  • 32.

    HakkimAFürnrohrBGAmannKLaubeBAbedUABrinkmannVet alImpairment of neutrophil extracellular trap degradation is associated with lupus nephritis.Proc Natl Acad Sci U.S.A. (2010) 107:98138. 10.1073/pnas.0909927107

  • 33.

    VorobjevaNVChernyakBV. NETosis: molecular mechanisms, role in physiology and pathology.Biochemistry. (2020) 85:117890. 10.1134/s0006297920100065

  • 34.

    OlsonSWLeeJJPoirierMLittleDJPrinceLKBakerTPet alAnti-myeloperoxidase antibodies associate with future proliferative lupus nephritis.Autoimmune Dis. (2017) 2017:1872846. 10.1155/2017/1872846

  • 35.

    BuendíaEMarlonMParraOSánchezMSánchezASánchezJet alHuman Proteinase 3, an important autoantigen of c-ANCA associated vasculitis, shares cross-reactive epitopes with serine protease allergens from mites: an in silico analysis.F1000Res. (2021) 10:47. 10.12688/f1000research.28225.2

  • 36.

    LuqueASerranoIRipollEMaltaCGomàMBlomAMet alNoncanonical immunomodulatory activity of complement regulator C4BP(β-) limits the development of lupus nephritis.Kidney Int. (2020) 97:55166. 10.1016/j.kint.2019.10.016

  • 37.

    HikamiKKawasakiAItoIKogaMItoSHayashiTet alAssociation of a functional polymorphism in the 3’-untranslated region of SPI1 with systemic lupus erythematosus.Arthritis Rheum. (2011) 63:75563. 10.1002/art.30188

  • 38.

    XiangNFangXSunXGZhouYBMaYZhuCet alExpression profile of PU.1 in CD4(+)T cells from patients with systemic lupus erythematosus.Clin Exp Med. (2021) 21:62132. 10.1007/s10238-021-00717-9

  • 39.

    GrigoriouMBanosAFiliaAPavlidisPGiannouliSKaraliVet alTranscriptome reprogramming and myeloid skewing in haematopoietic stem and progenitor cells in systemic lupus erythematosus.Ann Rheum Dis. (2020) 79:24253. 10.1136/annrheumdis-2019-215782

  • 40.

    KassimatisTINomikosAGiannopoulouILymperopoulosAMoutzourisDAVarakisIet alTranscription factor Sp1 expression is upregulated in human glomerulonephritis: correlation with pSmad2/3 and p300 expression and renal injury.Ren Fail. (2010) 32:24353. 10.3109/08860220903411164

  • 41.

    SoléCMolinéTVidalMOrdi-RosJCortés-HernándezJ. An exosomal urinary miRNA signature for early diagnosis of renal fibrosis in lupus nephritis.Cells. (2019) 8:773. 10.3390/cells8080773

  • 42.

    HochbergMC. Updating the American college of rheumatology revised criteria for the classification of systemic lupus erythematosus.Arthritis Rheum. (1997) 40:1725. 10.1002/art.1780400928

Summary

Keywords

systemic lupus erythematosus, lupus nephritis, biomarker, SLEDAI, transcription factor

Citation

Xiao L, Xiao W and Lin S (2022) Potential biomarkers for active renal involvement in systemic lupus erythematosus patients. Front. Med. 9:995103. doi: 10.3389/fmed.2022.995103

Received

15 July 2022

Accepted

14 November 2022

Published

01 December 2022

Volume

9 - 2022

Edited by

Raouf Hajji, University of Sousse, Tunisia

Reviewed by

Alvaro Gomez, Karolinska Institutet (KI), Sweden; Scott E. Wenderfer, British Columbia Children’s Hospital, Canada

Updates

Copyright

*Correspondence: Lu Xiao, , orcid.org/0000-0001-6791-3726

This article was submitted to Rheumatology, a section of the journal Frontiers in Medicine

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics